Return Guardrail token usage #62
Conversation
Pull request overview
This PR implements token usage tracking for LLM-based guardrails, addressing issue #41. The implementation provides per-guardrail token statistics and aggregated totals across all guardrail calls, working seamlessly with all client surfaces (OpenAI clients, Agents SDK, streaming and non-streaming).
Key Changes:
- Introduced a TokenUsage dataclass and helper functions (extract_token_usage, token_usage_to_dict, aggregate_token_usage_from_infos) to capture and aggregate token consumption data (a rough sketch of the dataclass follows this list)
- Updated all LLM-based guardrails (Jailbreak, Custom Prompt Check, Prompt Injection Detection, Hallucination Detection) to return token usage alongside their analysis results
- Added a unified total_guardrail_token_usage() helper function that works across all guardrail surfaces for easy token tracking
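The PR's source is not shown in this conversation, so the following is only a rough sketch of what the TokenUsage dataclass and its dict serializer might look like, assuming the three standard OpenAI usage fields; the defaults and exact signatures are guesses, not the PR's code.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class TokenUsage:
    """Per-call token counts; a field is None when the provider does not report it."""

    prompt_tokens: int | None = None
    completion_tokens: int | None = None
    total_tokens: int | None = None


def token_usage_to_dict(usage: TokenUsage) -> dict[str, int | None]:
    """Serialize a TokenUsage so it can be embedded in a guardrail's info dict."""
    return {
        "prompt_tokens": usage.prompt_tokens,
        "completion_tokens": usage.completion_tokens,
        "total_tokens": usage.total_tokens,
    }
```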
Reviewed changes
Copilot reviewed 27 out of 27 changed files in this pull request and generated no comments.
Summary per file:
| File | Description |
|---|---|
| src/guardrails/types.py | Adds TokenUsage dataclass, extraction utilities, and total_guardrail_token_usage() unified interface |
| src/guardrails/_base_client.py | Adds total_token_usage property to GuardrailResults for token aggregation |
| src/guardrails/checks/text/llm_base.py | Updates run_llm() to return token usage tuple; modifies create_llm_check_fn() to include token usage in result info |
| src/guardrails/checks/text/jailbreak.py | Updates jailbreak guardrail to capture and include token usage in results |
| src/guardrails/checks/text/prompt_injection_detection.py | Updates prompt injection detection to capture and include token usage in results |
| src/guardrails/checks/text/hallucination_detection.py | Updates hallucination detection to capture and include token usage in results |
| src/guardrails/agents.py | Updates agent guardrail wrappers to propagate token usage in output_info even for successful checks |
| src/guardrails/__init__.py | Exports the total_guardrail_token_usage helper function for the public API |
| tests/unit/test_types.py | Comprehensive tests for TokenUsage, extraction, aggregation, and unified helper |
| tests/unit/test_base_client.py | Tests for token aggregation in GuardrailResults |
| tests/unit/checks/test_llm_base.py | Updates tests to verify token usage is returned from LLM calls |
| tests/unit/checks/test_jailbreak.py | Updates tests to mock token usage in return values |
| tests/unit/checks/test_prompt_injection_detection.py | Updates tests to mock token usage in return values |
| tests/unit/test_agents.py | Adds test verifying successful agent guardrails return info with token usage |
| docs/quickstart.md | Documents token usage tracking with examples for all client surfaces |
| docs/agents_sdk_integration.md | Documents token usage tracking for Agents SDK with per-stage examples |
| examples/basic/hello_world.py | Demonstrates token usage tracking in basic example |
| examples/basic/multi_bundle.py | Demonstrates token usage tracking in streaming example |
| examples/basic/local_model.py | Demonstrates token usage tracking with local models |
| src/guardrails/utils/anonymizer.py | Code formatting cleanup (unrelated to token usage) |
| src/guardrails/checks/text/pii.py | Code formatting cleanup (unrelated to token usage) |
| src/guardrails/checks/text/urls.py | Code formatting cleanup (unrelated to token usage) |
| src/guardrails/client.py | Code formatting cleanup (unrelated to token usage) |
| src/guardrails/evals/core/async_engine.py | Code formatting cleanup (unrelated to token usage) |
| tests/unit/evals/test_guardrail_evals.py | Code formatting cleanup (unrelated to token usage) |
| tests/unit/evals/test_async_engine.py | Code formatting cleanup (unrelated to token usage) |
| tests/unit/checks/test_anonymizer_baseline.py | Code formatting cleanup (unrelated to token usage) |
…o dev/steven/token_count
@codex review
💡 Codex Review
Here are some automated review suggestions for this pull request.
return {
    "prompt_tokens": total_prompt if has_any_data else None,
    "completion_tokens": total_completion if has_any_data else None,
    "total_tokens": total if has_any_data else None,
Avoid reporting missing token fields as zero
aggregate_token_usage_from_infos uses a single has_any_data flag while totals are initialised to zero, so if any guardrail contributes only one of the token fields (e.g., a provider exposes total_tokens but not prompt_tokens/completion_tokens, or a run only records prompt tokens), the function returns 0 for the missing fields instead of None. That misstates usage and under-reports costs whenever some components are unavailable, even though no data exists for those fields. Consider tracking availability per field or leaving fields as None unless a value was actually aggregated.
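As a rough illustration of the suggested fix (not the PR's actual code), a per-field variant could keep each total as None until a value is actually seen; the info-dict shape with a "usage" sub-dict is an assumption here.

```python
def aggregate_token_usage_from_infos(infos: list[dict]) -> dict[str, int | None]:
    """Sum each token field independently; a field stays None unless at least
    one info dict actually reported a value for it."""
    totals: dict[str, int | None] = {
        "prompt_tokens": None,
        "completion_tokens": None,
        "total_tokens": None,
    }
    for info in infos:
        usage = info.get("usage") or {}  # assumed location of per-check usage data
        for field in totals:
            value = usage.get(field)
            if value is None:
                continue
            totals[field] = value if totals[field] is None else totals[field] + value
    return totals
```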
I am okay with this implementation. OpenAI clients and popular third-party providers were tested, and all return the same three token fields. A hypothetical edge-case client might return different fields, but in that case we would simply report that there is no token data.
gabor-openai left a comment:
LGTM TY
Implements an update for issue #41 - thank you @thuanng-a11y for reporting!
Token usage is returned in each guardrail's info_dict and can be accessed with the total_guardrail_token_usage helper: total_guardrail_token_usage(response). This works for all clients (GuardrailAgent, GuardrailAsyncOpenAI, ...) and with both streaming and non-streaming responses.
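A minimal usage sketch, assuming GuardrailAsyncOpenAI is importable from the package root alongside the helper and accepts a config file path; the model name and call shape below are placeholders rather than code from this PR (see examples/basic/hello_world.py in the table above for the real example).

```python
import asyncio

from guardrails import GuardrailAsyncOpenAI, total_guardrail_token_usage


async def main() -> None:
    # Constructor arguments and the config path are placeholders; the
    # chat.completions call mirrors the standard OpenAI client surface.
    client = GuardrailAsyncOpenAI(config="guardrails_config.json")
    response = await client.chat.completions.create(
        model="gpt-4o-mini",
        messages=[{"role": "user", "content": "Hello!"}],
    )
    # Aggregated guardrail token usage across all checks that ran on this call.
    print(total_guardrail_token_usage(response))


asyncio.run(main())
```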