Centralize truncation in conversation history #5652

aibrahim-oai · 2025-10-24T21:47:58Z

move the truncation logic to conversation history to use on any tool output. This will help us in avoiding edge cases while truncating the tool calls and mcp calls.

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

codex-rs/core/src/tools/mod.rs

aibrahim-oai · 2025-10-24T23:07:48Z

@codex review this

chatgpt-codex-connector

💡 Codex Review

codex/codex-rs/core/src/client_common.rs

Lines 71 to 84 in 30faba1

    
           pub(crate) fn get_formatted_input(&self) -> Vec<ResponseItem> { 
        
               let mut input = self.input.clone(); 
        
               // when using the *Freeform* apply_patch tool specifically, tool outputs 
        
               // should be structured text, not json. Do NOT reserialize when using 
        
               // the Function tool - note that this differs from the check above for 
        
               // instructions. We declare the result as a named variable for clarity. 
        
               let is_freeform_apply_patch_tool_present = self.tools.iter().any(|tool| match tool { 
        
                   ToolSpec::Freeform(f) => f.name == "apply_patch", 
        
                   _ => false, 
        
               }); 
        
               if is_freeform_apply_patch_tool_present { 
        
                   reserialize_shell_outputs(&mut input); 
        
               }

Shell tool outputs still serialized as JSON unless freeform apply_patch is enabled

The new truncation flow expects function_call_output content sent back to the model to be plain text with Exit code/Wall time headers, and the added integration tests assert that these outputs are not valid JSON. However, Prompt::get_formatted_input only converts the JSON payload produced by format_exec_output_for_model into a structured plain‑text string when a freeform apply_patch tool is present (client_common.rs lines 71‑84). For normal sessions that use only the standard shell tool, the reserialization step never runs, so the next prompt still contains the original JSON blob. This leaves the function_call_output valid JSON (and without the expected line prefixes), causing the new tests to fail and the model to see a different format than intended. The reserialization should happen whenever shell outputs are present, not only when the freeform apply_patch tool is configured.

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

pakrym-oai · 2025-10-24T23:31:33Z

codex-rs/core/tests/suite/truncation.rs

+    let output = mock2
+        .single_request()
+        .function_call_output_text(call_id)
+        .expect("function_call_output present for rmcp call");


and anyhow::Result

pakrym-oai · 2025-10-24T23:32:47Z

codex-rs/core/tests/suite/truncation.rs

+// Verifies that an MCP tool call result exceeding the model formatting limits
+// is truncated before being sent back to the model.
+#[tokio::test(flavor = "multi_thread", worker_threads = 1)]
+async fn mcp_tool_call_output_exceeds_limit_truncated_for_model() -> Result<()> {


should we have all truncation tests in one place or colocated with tool-specific tests ( next to other mcp tests)?

it's distributed now like the shell tool ones. I feel we should at least have some of them in one place to test the truncation functionality. wdut?

pakrym-oai · 2025-10-24T23:33:32Z

codex-rs/core/tests/suite/truncation.rs

+5
+6
+.*
+\[\.{3} omitted \d+ of 400 lines \.{3}\]


can we fully hardcode?

pakrym-oai · 2025-10-24T23:34:00Z

codex-rs/core/tests/suite/truncation.rs

+        .expect("function error output present");
+
+    // Expect plaintext with byte-truncation marker and no omitted-lines marker
+    assert!(


can we use regex match approach ?

wait should we hardcode or use regex?

pakrym-oai · 2025-10-24T23:46:52Z