
Keep response spans active during LLM hooks#2779

Draft
smf-h wants to merge 4 commits into openai:main from smf-h:codex/span-metadata-1844

Conversation


@smf-h smf-h commented Mar 26, 2026

Summary

  • Keep the active response span current while RunHooks.on_llm_start/on_llm_end and AgentHooks.on_llm_start/on_llm_end execute.
  • Export hook-attached response span metadata and cover both streamed and non-streamed paths with regression tests.
  • Preserve compatibility with custom Model implementations that still use the older method signatures, and finish streamed response spans before tool/handoff processing so streamed traces match the non-streamed hierarchy.

Test plan

  • uv run pytest tests/test_llm_hook_tracing.py -q
  • uv run pytest tests/test_agent_tracing.py tests/test_tracing_errors.py tests/test_tracing_errors_streamed.py -q
  • uv run pytest tests/test_agent_prompt.py tests/test_agent_runner_streamed.py tests/test_cancel_streaming.py tests/test_hitl_session_scenario.py tests/test_streaming_tool_call_arguments.py tests/test_run_state.py -q
  • uv run ruff check
  • uv run pyright --project pyrightconfig.json
  • uv run mypy . --exclude site still reports the pre-existing examples/tools/image_generator.py:24 unused "type: ignore", which I also reproduced in a clean baseline worktree.

Issue number

Closes #1844

Checks

  • I've added new tests (if relevant)
  • I've added/updated the relevant documentation
  • I've run make lint and make format
  • I've made sure tests pass


@chatgpt-codex-connector chatgpt-codex-connector bot left a comment


💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 7927f0f533


    )
    stream_failed_retry_attempts: list[int] = [0]
    retry_stream = stream_response_with_retry(
        get_stream=lambda: model.stream_response(**stream_request_kwargs),


P1: Preserve positional invocation for streamed model calls

run_single_turn_streamed now calls model.stream_response via model.stream_response(**stream_request_kwargs). In the previous implementation, the required model arguments were passed positionally, so legacy custom Model implementations with positional-only parameters (or different parameter names but the same positional order) still worked; with this change they now raise TypeError at runtime and streamed runs fail before producing output. Please keep the required parameters positional (or add a fallback path) to maintain compatibility with existing custom streamed models.
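The regression described above reproduces in a few lines. `LegacyModel` and its parameter names are illustrative, not the SDK's actual Model signature: a method with positional-only parameters accepts positional calls but raises TypeError when the same arguments arrive via `**kwargs` expansion.

```python
# Minimal repro of the compatibility concern: positional-only parameters
# (the "/" marker) reject keyword invocation.
class LegacyModel:
    def stream_response(self, system_instructions, input, /):
        return f"{system_instructions}:{input}"

model = LegacyModel()

# Positional call (the runner's previous behavior): works.
assert model.stream_response("sys", "hi") == "sys:hi"

# Keyword call, as produced by model.stream_response(**kwargs): fails.
try:
    model.stream_response(system_instructions="sys", input="hi")
except TypeError as e:
    error = str(e)

assert "positional-only" in error
```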


smf-h (Author) replied

Restored positional-argument compatibility for both streamed and non-streamed custom models. Required parameters are positional again, optional values continue through kwargs, and response_span is still passed only when the model supports it.
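One way the "response_span is still passed only when the model supports it" behavior can be implemented is a signature probe. This is a hedged sketch under assumed names, not the PR's actual code: `call_stream` and both model classes are hypothetical, and `inspect.signature` is one of several ways to detect the keyword.

```python
import inspect

# Hypothetical old-style model: predates the response_span keyword.
class OldModel:
    def stream_response(self, prompt):
        return f"old:{prompt}"

# Hypothetical new-style model: accepts response_span as keyword-only.
class NewModel:
    def stream_response(self, prompt, *, response_span=None):
        return f"new:{prompt}:{response_span}"

def call_stream(model, prompt, response_span):
    # Pass response_span only if the model's signature declares it;
    # required arguments stay positional for legacy compatibility.
    params = inspect.signature(model.stream_response).parameters
    if "response_span" in params:
        return model.stream_response(prompt, response_span=response_span)
    return model.stream_response(prompt)

assert call_stream(OldModel(), "hi", "span1") == "old:hi"
assert call_stream(NewModel(), "hi", "span1") == "new:hi:span1"
```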

seratch (Member) commented Mar 26, 2026

I think this PR still has two merge blockers.

  1. It breaks backward compatibility for some custom streamed Model implementations. run_single_turn_streamed() now ends up calling model.stream_response(**stream_request_kwargs). I reproduced a runtime regression with a custom model whose first required parameters are positional-only: the same model works on the base commit (7dc7fa2b) but fails on this PR with "TypeError: ... got some positional-only arguments passed as keyword arguments". The new regression test only covers legacy models that omit response_span but still accept the existing keyword names, so it does not catch this case.

  2. Exporting ResponseSpanData.metadata appears to break live tracing export. On this PR head, a simple traced Runner.run(...) / Runner.run_streamed(...) against FakeModel completes, but the tracing client logs "400 Unknown parameter: 'data[1].span_data.metadata'". I could reproduce that on the PR head but not on the base commit. ResponseSpanData.export() now includes metadata, and the OpenAI tracing sanitizer does not remove it before sending the payload, so this looks like a real runtime incompatibility for the default tracing path.

Because both issues reproduce at runtime, I don't think this is ready to merge yet. Also, the CI checks are failing.

seratch marked this pull request as draft on March 26, 2026 at 04:25
smf-h (Author) commented Mar 26, 2026

Pushed follow-up fixes for both blockers on the latest head:

  1. Restored positional invocation compatibility for both streamed and non-streamed custom models. Required parameters are positional again, while optional values still flow through kwargs and response_span remains conditional.
  2. Sanitized response span metadata before sending payloads to the OpenAI tracing ingest API, and added a regression test covering that path.
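The second fix can be illustrated with a small payload filter. This is an assumption-laden sketch, not the SDK's exporter: the `UNSUPPORTED_FIELDS` table, `sanitize_span`, and the payload shape are invented here to show the idea of stripping fields the ingest API rejects (per the 400 above, "metadata" on response span data) before sending.

```python
# Hypothetical map of span type -> fields the ingest API does not accept.
UNSUPPORTED_FIELDS = {"response": {"metadata"}}

def sanitize_span(span: dict) -> dict:
    """Return a copy of the span payload with unsupported fields removed."""
    span_data = dict(span.get("span_data", {}))
    for field in UNSUPPORTED_FIELDS.get(span_data.get("type"), set()):
        span_data.pop(field, None)
    return {**span, "span_data": span_data}

raw = {
    "id": "span_1",
    "span_data": {"type": "response", "response_id": "r1", "metadata": {"k": "v"}},
}
clean = sanitize_span(raw)

assert "metadata" not in clean["span_data"]      # stripped before sending
assert clean["span_data"]["response_id"] == "r1" # other fields preserved
assert "metadata" in raw["span_data"]            # original left untouched
```

Locally exported traces (and third-party processors) can still see the metadata; only the payload bound for the incompatible endpoint is filtered.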

Local verification on this branch:

  • UV_FROZEN=1 uv sync --all-extras --all-packages --group dev
  • uv run pytest tests/test_trace_processor.py tests/test_llm_hook_tracing.py tests/test_tracing.py tests/test_agent_tracing.py tests/test_tracing_errors.py tests/test_tracing_errors_streamed.py tests/mcp/test_mcp_tracing.py tests/test_openai_responses.py -q
  • uv run pytest tests/test_agent_prompt.py tests/test_agent_runner_streamed.py tests/test_cancel_streaming.py tests/test_hitl_session_scenario.py tests/test_streaming_tool_call_arguments.py tests/test_run_state.py -q
  • uv run ruff check
  • uv run pyright --project pyrightconfig.json
  • uv run mypy . --exclude site still reports the pre-existing examples/tools/image_generator.py:24 unused-ignore baseline.

The new Tests workflow run for this head is currently waiting for maintainer approval before the matrix starts: https://github.com/openai/openai-agents-python/actions/runs/23578395345 .



Development

Successfully merging this pull request may close these issues.

Customizing traces and spans: Ability to add metadata to spans

2 participants