fix(codex): auth-aware default model — fixes codex empty builds (#82 Gap 3) by AbirAbbas · Pull Request #85 · Agent-Field/SWE-AF

AbirAbbas · 2026-06-29T18:22:38Z

Summary

Root-causes and fixes the codex empty-builds observation in #82 (Gap 3). It is not a structured-output problem — it's a model/auth mismatch, confirmed live.

SWE-AF defaulted every codex role to gpt-5.3-codex. The -codex models are only available with OpenAI API-key auth — under ChatGPT-account auth they return:

HTTP 400 invalid_request_error:
"The 'gpt-5.3-codex' model is not supported when using Codex with a ChatGPT account."

…in ~3 seconds. The coder turned that fast error into files_changed: [] → "Coder agent failed" → the foundation issue failed → cascade → empty build. That's the reporter's exact symptom (~9–11s, codex-only; claude_code fine, because it never touched codex).

Fix

1. Auth-aware codex default model (schemas.py, fast/schemas.py)
Resolve the codex base model by auth mode instead of a constant:

- API-key auth (SWE_CODEX_AUTH_MODE=api_key, or auto with OPENAI_API_KEY set) → gpt-5.3-codex (unchanged).
- ChatGPT-account auth (chatgpt, or auto with no key) → gpt-5.5 (a ChatGPT-compatible model).
- Explicit overrides (models.default / per-role / SWE_DEFAULT_MODEL) still win.
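
The resolution rules above can be sketched roughly as follows. This is an illustration only, not the code in schemas.py: the function name `resolve_codex_default_model`, its signature, the single `explicit_model` argument standing in for models.default and per-role settings, the precedence among the override sources, and the treatment of an unset or unrecognized SWE_CODEX_AUTH_MODE as `auto` are all assumptions.

```python
import os
from typing import Mapping, Optional

API_KEY_MODEL = "gpt-5.3-codex"  # default under OpenAI API-key auth
CHATGPT_MODEL = "gpt-5.5"        # ChatGPT-compatible default


def resolve_codex_default_model(
    env: Optional[Mapping[str, str]] = None,
    explicit_model: Optional[str] = None,
) -> str:
    """Hypothetical helper: pick the codex base model from the auth mode."""
    env = os.environ if env is None else env

    # Explicit overrides always win (their relative precedence is assumed here).
    override = explicit_model or env.get("SWE_DEFAULT_MODEL")
    if override:
        return override

    # Assumption: an unset or unrecognized mode behaves like "auto".
    mode = env.get("SWE_CODEX_AUTH_MODE", "auto").strip().lower()
    if mode == "api_key":
        return API_KEY_MODEL
    if mode == "chatgpt":
        return CHATGPT_MODEL

    # auto: an API key implies API-key auth, otherwise ChatGPT-account auth.
    return API_KEY_MODEL if env.get("OPENAI_API_KEY") else CHATGPT_MODEL
```

Passing `env` explicitly keeps the sketch easy to test without mutating the process environment.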

2. Codex model/auth errors are fatal + surfaced (fatal_error.py, execution_agents.py)
A model/auth 400 is non-retryable, so:

- check_fatal_harness_error now matches "not supported when using Codex with a ChatGPT account" and "requires a newer version of Codex" → raises FatalHarnessError with the real message, short-circuiting the retry cap.
- run_coder includes the underlying error_message in the CoderResult summary when no result parses, so an empty result always carries why (the reporter saw a bare "Coder agent failed" with no reason).
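
A minimal sketch of the fatal-pattern check, assuming a simple substring match. The two message fragments are the ones quoted above; the `FatalHarnessError` stand-in class, the function signature, the `FATAL_CODEX_PATTERNS` name, and the case-insensitive comparison are assumptions and may differ from fatal_error.py.

```python
from typing import Optional


class FatalHarnessError(RuntimeError):
    """Stand-in for the project's exception; the real class may differ."""


# Message fragments quoted in this PR; stored lowercase for comparison.
FATAL_CODEX_PATTERNS = (
    "not supported when using codex with a chatgpt account",
    "requires a newer version of codex",
)


def check_fatal_harness_error(error_message: Optional[str]) -> None:
    """Raise FatalHarnessError when the message matches a non-retryable pattern."""
    if not error_message:
        return
    lowered = error_message.lower()
    for pattern in FATAL_CODEX_PATTERNS:
        if pattern in lowered:
            # Re-raise with the original text so the real reason reaches the build output.
            raise FatalHarnessError(error_message)
```

Raising before the retry loop runs again is what avoids spending the retry cap on a request that would fail identically.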

Validation Contract

- ChatGPT-auth codex default = gpt-5.5; API-key-auth codex default = gpt-5.3-codex; explicit overrides win; claude_code/open_code unaffected.
- A codex "model not supported on ChatGPT account" / "needs newer CLI" error is fatal (no silent retry) and its message reaches the build output.

Test Plan — verified live (codex CLI 0.142.4, ChatGPT auth)

- gpt-5.3-codex → 400 "not supported … ChatGPT account" in 3s; resolved gpt-5.5 wrote a real hello.py in 8s.
- resolve_runtime_models / fast_resolve_models return gpt-5.5 under chatgpt env, gpt-5.3-codex under api_key env (new unit tests).
- New fatal-pattern tests for both codex error strings.
- Full make check on py3.12: 1010 passed, 1 skipped.

Scope

SWE-AF-only; independent of #84 (Gaps 1 & 2) and of the agentfield SDK — no cross-repo dependency. Tip for ChatGPT-plan users: gpt-5.5 runs at high reasoning effort (slower per call); set models.coder or SWE_DEFAULT_MODEL to override if you want lower latency.

Refs #82 (Gap 3).

🤖 Generated with Claude Code

On ChatGPT-account auth the `-codex` models return HTTP 400 "not supported when using Codex with a ChatGPT account" in ~3s. SWE-AF defaulted every codex role to `gpt-5.3-codex` regardless of auth mode, so a ChatGPT-auth codex build hit that 400, the coder got an error result, returned files_changed:[], and the foundation issue failed and cascaded into an empty build (#82 Gap 3). The same plan on claude_code worked, which is why it looked codex-specific.

Resolve the codex base model by auth mode instead: keep `gpt-5.3-codex` for API-key auth (where the -codex models are available), and use a ChatGPT-compatible model (`gpt-5.5`) when codex authenticates via a ChatGPT account (SWE_CODEX_AUTH_MODE=chatgpt, or auto with no OPENAI_API_KEY). Explicit model overrides (models.default / per-role / SWE_DEFAULT_MODEL) still win.

Verified live: gpt-5.3-codex 400s in 3s on ChatGPT auth; the resolved gpt-5.5 writes real code in ~8s.

Refs #82 (Gap 3).

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

…coder

A codex model/auth 400 (e.g. a `-codex` model under ChatGPT-account auth, or a model needing a newer Codex CLI) is non-retryable: retrying with the same model and auth fails identically. Previously it was neither matched as fatal nor surfaced. The coder fell through to a bare `files_changed:[]` / "Coder agent failed" with no reason, burning the retry cap and cascading into a silent empty build.

- fatal_error.py: match "not supported when using Codex with a ChatGPT account" and "requires a newer version of Codex" so check_fatal_harness_error raises FatalHarnessError with the real message and short-circuits retries.
- run_coder: when the harness returns no parseable result, include the underlying error_message in the CoderResult summary so an empty result always carries *why*.

Refs #82 (Gap 3).

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
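
The run_coder change described in this commit might look roughly like the sketch below. Everything here is hypothetical: the `CoderResult` fields, the helper name `build_empty_coder_result`, and the exact summary wording are assumptions used only to show how an empty result can carry the underlying error.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class CoderResult:
    """Illustrative stand-in; the real CoderResult schema may have other fields."""
    summary: str
    files_changed: List[str] = field(default_factory=list)


def build_empty_coder_result(error_message: Optional[str]) -> CoderResult:
    """Build the fallback result used when the harness output does not parse."""
    base = "Coder agent failed"
    reason = (error_message or "").strip()
    # Append the underlying error so the empty result explains itself.
    summary = f"{base}: {reason}" if reason else base
    return CoderResult(summary=summary)
```

With this shape, a caller inspecting a result whose `files_changed` list is empty can read the reason from `summary` instead of seeing only the bare failure string.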

AbirAbbas and others added 2 commits June 29, 2026 14:14

AbirAbbas merged commit 8e79bc1 into main Jun 29, 2026
2 checks passed

AbirAbbas deleted the fix/issue-82-codex-model-auth branch June 29, 2026 19:34
