[rollout-trace] Add rollout trace capture and reducer by cassirer-openai · Pull Request #17982 · openai/codex

cassirer-openai · 2026-04-15T19:27:02Z

Summary

Adds opt-in rollout tracing for Codex sessions.

A trace records raw runtime evidence into a local bundle, then reduces that bundle into a semantic state.json graph. The goal is to make complex failures inspectable across model requests, compaction, code-mode exec, nested tool calls, terminal operations, and multi-agent v2 child threads.

The best review entry point is the new rollout trace README. It has the system diagrams and explains the main invariant: hot-path code observes first, while the offline reducer interprets later.

At a high level:

codex-core emits best-effort raw observations through a thin recorder.
codex-rollout-trace owns the bundle schema, writer, reduced model, and reducer.
Raw payloads preserve exact evidence; reduced objects describe what the model saw, what Codex did, and how information moved.
Multi-agent support is v2-only. Spawned child threads share the root trace bundle, so one state.json contains the parent/child graph and spawn/task/result/close edges.

This also adds:

codex debug trace-reduce <trace-bundle>

to replay a raw bundle and write state.json.

Review Guide

The commits are ordered by layer:

eff9328 Add rollout trace crate
Schema, README, writer, raw payload model, reduced model, reducer, and reducer tests.
856ef51 Add core rollout trace recorder
Core recorder facade plus inference and compaction capture.
2abf9dc Trace tool and code-mode runtime boundaries
Tool dispatch capture, code cells, terminal/write_stdin/exec relationships, and code-mode provenance.
c657343 Trace rollout sessions and multi-agent v2 edges
Session trace creation/inheritance and multi-agent v2 graph edges, including close-agent shutdown edges.
55d7e43 Add debug trace reduction command
CLI entrypoint for reducing a bundle to state.json.

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: d364b9651e

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-04-16T00:23:12Z

+        let Some(source_item_id) = self.latest_assistant_message_item_for_turn(
+            &observed.child_thread_id,
+            &observed.child_codex_turn_id,
+        ) else {
+            bail!(
+                "agent result edge {} could not find a child result message",
+                observed.edge_id
+            );


Avoid hard-failing agent-result edges without child message

queue_agent_result_interaction_edge bails when no assistant message exists for the child turn. Child threads can terminate (e.g., aborted/failed) before producing any assistant message, yet still emit AgentResultObserved. In that valid case, replay_bundle errors and no state.json is produced. Fall back to a non-message source anchor instead of bailing.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-04-16T00:23:13Z

+            let previous_items = self
+                .rollout
+                .inference_calls
+                .values()
+                .find(|inference| {
+                    inference.upstream_request_id.as_deref() == Some(previous_response_id)
+                })


Restrict previous_response_id lookup to the same thread

Incremental request reconstruction matches previous_response_id across all inference_calls without checking thread_id. In multi-thread traces, identical upstream IDs (common with fixtures or non-global provider IDs) can pull the wrong thread’s prefix, corrupting conversation linkage or causing mismatch failures. Filter candidates by the current thread.

Useful? React with 👍 / 👎.

cassirer-openai force-pushed the codex/rollout-trace-submission branch 2 times, most recently from 0274455 to 9ccb870 Compare April 15, 2026 23:02

cassirer-openai added 5 commits April 15, 2026 16:07

Add rollout trace crate

eff9328

Add core rollout trace recorder

856ef51

Trace tool and code-mode runtime boundaries

2abf9dc

Trace rollout sessions and multi-agent v2 edges

c657343

Add debug trace reduction command

55d7e43

cassirer-openai force-pushed the codex/rollout-trace-submission branch from 9ccb870 to 55d7e43 Compare April 15, 2026 23:10

cassirer-openai requested a review from jif-oai April 15, 2026 23:35

codex: fix CI failure on PR #17982

d364b96

cassirer-openai marked this pull request as ready for review April 16, 2026 00:16

cassirer-openai assigned jif-oai Apr 16, 2026

chatgpt-codex-connector bot reviewed Apr 16, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[rollout-trace] Add rollout trace capture and reducer#17982

[rollout-trace] Add rollout trace capture and reducer#17982
cassirer-openai wants to merge 6 commits intomainfrom
codex/rollout-trace-submission

cassirer-openai commented Apr 15, 2026 •

edited

Loading

Uh oh!

chatgpt-codex-connector bot left a comment

Uh oh!

chatgpt-codex-connector bot Apr 16, 2026

Uh oh!

chatgpt-codex-connector bot Apr 16, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

cassirer-openai commented Apr 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Review Guide

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector bot Apr 16, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector bot Apr 16, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

cassirer-openai commented Apr 15, 2026 •

edited

Loading