Skip to content

Pull requests: allenai/olmo-eval

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Handle distributed port collisions
#220 opened Jun 18, 2026 by undfined Collaborator Loading…
Add support for olmo-core inference
#218 opened Jun 16, 2026 by undfined Collaborator Loading…
Roryd/science lit
#217 opened Jun 16, 2026 by donovanr Contributor Draft
13 tasks
multimodal evaluation
#209 opened Jun 13, 2026 by jason718 Loading…
6 of 9 tasks
Safety Suite Update
#208 opened Jun 12, 2026 by mgmorgan23 Contributor Loading…
7 of 13 tasks
Add general:posttrain:dev suite
#193 opened May 26, 2026 by finbarrtimbers Contributor Loading…
2 tasks done
Propagate suite-level CLI overrides to constituent task specs
#180 opened May 15, 2026 by finbarrtimbers Contributor Loading…
1 task
Report cumulative vLLM generation progress across batches
#179 opened May 15, 2026 by finbarrtimbers Contributor Loading…
1 task
Finbarr/prebuild sandbox
#158 opened Apr 27, 2026 by finbarrtimbers Contributor Draft
13 tasks
how to use add a serialized task walkthru
#121 opened Apr 8, 2026 by IanMagnusson Contributor Loading…
3 tasks done
DNM: Tweaks to support MSWEA + swe-bench
#118 opened Apr 3, 2026 by undfined Collaborator Draft
WIP: add swebench as external eval
#109 opened Apr 1, 2026 by aetting Draft
1 of 13 tasks
[WIP] Hard reasoning
#102 opened Mar 25, 2026 by rlebras Contributor Loading…
13 tasks
adds simple smoke tests
#76 opened Feb 27, 2026 by warmbowski Contributor Loading…
3 of 13 tasks
ProTip! Follow long discussions with comments:>50.