Switches `CodeVerifier` to use `aiohttp` instead of `requests`, enabling native async, and avoiding using `asyncio.to_thread`. #1227

finbarrtimbers · 2025-11-24T21:31:45Z

Changes needed to support this:

- Instance-level session cache instead of a class-level session cache.
- Uses `backoff` for retries instead of the native retry support in `requests`.

I am currently working on moving the calls to reward_fn in #1225, which requires calling the reward function asynchronously from LLMRayActor. If we use asyncio.to_thread for that, it starves the thread pool.
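The thread-pool concern can be illustrated with a small sketch. It is not taken from this PR: the pool size of 2, the call count, and the `run_calls` helper are assumptions chosen to make the contention visible, and `time.sleep` stands in for a blocking HTTP request.

```python
import asyncio
import time
from concurrent.futures import ThreadPoolExecutor

NUM_CALLS = 4        # hypothetical number of concurrent reward calls
CALL_SECONDS = 0.1   # hypothetical duration of one blocking request


async def run_calls(use_threads: bool) -> float:
    """Return the wall-clock seconds needed to finish NUM_CALLS simulated requests."""
    loop = asyncio.get_running_loop()
    # Deliberately tiny pool (an assumption for the demo) so contention shows up.
    loop.set_default_executor(ThreadPoolExecutor(max_workers=2))
    start = time.perf_counter()
    if use_threads:
        # Each blocking call occupies a worker thread for its whole duration,
        # so calls beyond the pool size have to queue.
        await asyncio.gather(
            *(asyncio.to_thread(time.sleep, CALL_SECONDS) for _ in range(NUM_CALLS))
        )
    else:
        # Native coroutines wait on the event loop and need no worker threads.
        await asyncio.gather(
            *(asyncio.sleep(CALL_SECONDS) for _ in range(NUM_CALLS))
        )
    return time.perf_counter() - start


threaded = asyncio.run(run_calls(use_threads=True))
native = asyncio.run(run_calls(use_threads=False))
print(f"to_thread: {threaded:.2f}s, native async: {native:.2f}s")
```

With four calls and two workers, the `to_thread` variant needs at least two rounds of sleeping, while the native variant overlaps all four waits.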

Runs:

Single GPU GRPO: Beaker
Multi-node GRPO: Beaker
Tool use GRPO: Beaker

Note

Replaces requests with aiohttp in CodeVerifier to enable native async calls with backoff retries and event-loop–scoped session caching, adds session cleanup, introduces aiohttp dependency, and documents a Beaker logs command.

Verifier (open_instruct/ground_truth_utils.py):
- Switch HTTP client from requests to aiohttp; implement native-async _verify_code and async_call.
- Add exponential backoff (backoff) for post retries; compute dynamic timeouts.
- Introduce event-loop–scoped session cache via weakref.WeakKeyDictionary; add cleanup_all_sessions.
- Update sync __call__ to use a fresh aiohttp.ClientSession via asyncio.run.
- Minor type hints and code extraction (extract_python_code unchanged).
Dependencies:
- Add aiohttp>=3.9.0 to pyproject.toml (lockfile updated accordingly).
Docs:
- Add Beaker logs tip in AGENTS.md (beaker experiment logs $EXPERIMENT_ID).
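A minimal sketch of what an event-loop-scoped session cache built on `weakref.WeakKeyDictionary` might look like. It is not the code from this PR: `FakeSession`, `SessionCache`, and `cleanup_current` are hypothetical names, and `FakeSession` stands in for `aiohttp.ClientSession` so the example runs with the standard library alone.

```python
import asyncio
import weakref


class FakeSession:
    """Hypothetical stand-in for an HTTP client session such as aiohttp.ClientSession."""

    def __init__(self) -> None:
        self.closed = False

    async def close(self) -> None:
        self.closed = True


class SessionCache:
    """Keeps one session per running event loop."""

    def __init__(self) -> None:
        # Keys are event loops, held weakly: an entry disappears once its
        # loop has been garbage collected.
        self._sessions = weakref.WeakKeyDictionary()

    def get(self) -> FakeSession:
        loop = asyncio.get_running_loop()
        session = self._sessions.get(loop)
        if session is None or session.closed:
            session = FakeSession()
            self._sessions[loop] = session
        return session

    async def cleanup_current(self) -> None:
        """Close and forget the session that belongs to the running loop."""
        session = self._sessions.pop(asyncio.get_running_loop(), None)
        if session is not None:
            await session.close()
```

Keying on the loop means a session created under one `asyncio.run` call is never reused from a different loop, which is the situation a class-level cache can run into.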

Written by Cursor Bugbot for commit 25391d7. This will update automatically on new commits.

open_instruct/ground_truth_utils.py

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

open_instruct/ground_truth_utils.py

cursor · 2025-11-25T22:22:23Z

open_instruct/ground_truth_utils.py

+            max_tries=3,
+            max_time=60,
+            giveup=lambda e: isinstance(e, aiohttp.ClientResponseError) and e.status < 500,
+        )


Bug: Timeout retries waste execution server resources

The backoff decorator retries on asyncio.TimeoutError, which includes timeouts from the session.post() call. When code execution legitimately exceeds the timeout duration, retrying won't help and wastes execution server resources by running the same slow code multiple times. The previous requests implementation only retried specific 5xx status codes and didn't retry on timeouts.
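The retry policy this comment argues for can be sketched without any HTTP library. The following is an illustration under stated assumptions, not the PR's code: `HTTPStatusError`, `should_retry`, and `call_with_retries` are hypothetical names, and the hand-rolled loop stands in for the `backoff` decorator.

```python
import asyncio


class HTTPStatusError(Exception):
    """Hypothetical stand-in for a client error that carries an HTTP status code."""

    def __init__(self, status: int) -> None:
        super().__init__(f"HTTP {status}")
        self.status = status


def should_retry(exc: BaseException) -> bool:
    """Retry only server-side (5xx) failures; never retry timeouts or 4xx errors."""
    if isinstance(exc, asyncio.TimeoutError):
        # The submitted code was simply too slow; running it again will not help.
        return False
    return isinstance(exc, HTTPStatusError) and exc.status >= 500


async def call_with_retries(send, max_tries: int = 3, base_delay: float = 0.01):
    """Await send() up to max_tries times, backing off exponentially between tries."""
    for attempt in range(1, max_tries + 1):
        try:
            return await send()
        except Exception as exc:
            if attempt == max_tries or not should_retry(exc):
                raise
            await asyncio.sleep(base_delay * 2 ** (attempt - 1))
```

Under this policy a timed-out execution is reported once and never resubmitted, while a transient 5xx response still gets up to three attempts.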

cursor · 2025-11-25T22:22:23Z

open_instruct/ground_truth_utils.py

+            backoff.expo,
+            (aiohttp.ClientError, asyncio.TimeoutError),
+            max_tries=3,
+            max_time=60,


Bug: Backoff max_time too short for long requests

The backoff max_time=60 is too short for requests that can timeout after up to 300 seconds. When a request times out after 300 seconds, the elapsed time exceeds max_time, preventing any retries despite max_tries=3. This makes the retry mechanism ineffective for slow code executions that legitimately timeout.
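The arithmetic behind this comment can be checked with a simplified model. It assumes, as the comment describes, that retrying stops once the elapsed time reaches `max_time` or the number of tries reaches `max_tries`; it also assumes every attempt fails after the same duration and ignores the wait between attempts. `attempts_made` is a hypothetical helper, not part of `backoff`.

```python
def attempts_made(attempt_seconds: float, max_tries: int, max_time: float) -> int:
    """Count attempts when every attempt fails after attempt_seconds.

    Simplified model: waits between attempts are ignored, and retrying stops
    as soon as the tries or the elapsed-time budget is used up.
    """
    elapsed = 0.0
    tries = 0
    while True:
        tries += 1
        elapsed += attempt_seconds
        if tries == max_tries or elapsed >= max_time:
            return tries


# A request that times out after 300 s exhausts a 60 s budget on its first try.
print(attempts_made(attempt_seconds=300, max_tries=3, max_time=60))   # 1
# A budget that covers three full timeouts lets all three tries run.
print(attempts_made(attempt_seconds=300, max_tries=3, max_time=900))  # 3
```

In this model, `max_tries=3` only takes effect when `max_time` is larger than the time the earlier attempts can consume.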

finbarrtimbers · 2025-12-01T19:17:38Z

Turns out this isn't actually the bottleneck, so I'm closing this for now.

finbarrtimbers added 4 commits November 24, 2025 14:05

First commit. Replaces requests with aiohttp.

01bf37a

updated code

1d6079b

updated code

459dbee

Uses own session for each CodeVerifier instance

721a2c9

Updated code.

49edc3a

finbarrtimbers requested a review from saurabh111233212 November 25, 2025 17:25

finbarrtimbers marked this pull request as ready for review November 25, 2025 17:25

finbarrtimbers enabled auto-merge November 25, 2025 17:25

cursor bot reviewed Nov 25, 2025

saurabh111233212 approved these changes Nov 25, 2025

chatgpt-codex-connector bot reviewed Nov 25, 2025

finbarrtimbers disabled auto-merge November 25, 2025 17:30

updated code.

809e143

cursor bot reviewed Nov 25, 2025

Added experiment instructions

81bdc6c

cursor bot reviewed Nov 25, 2025

finbarrtimbers added 2 commits November 25, 2025 15:01

weak cache

a35701c

Merge branch 'main' into finbarr/async-code

25391d7

cursor bot reviewed Nov 25, 2025

finbarrtimbers closed this Dec 1, 2025
