
fix: limit minerva sympy memory consumption #222

Open
mitja-kleider wants to merge 1 commit into Aleph-Alpha-Research:main from mitja-kleider:minerva-memory-limit

Conversation

@mitja-kleider

No description provided.

# Virtual-address-space budget for a single sympy simplify() call.
# Pathological expressions can cause sympy to allocate tens of GiBs;
# this cap turns that into a caught MemoryError instead of an OOM kill.
_SYMPY_MEMORY_BUDGET_BYTES = 2 * 1024**3  # 2 GiB
Author

This budget is quite arbitrary and not configurable; I'll leave it to the eval team to decide whether it is sufficient.


Interesting. Why is this better than an OOM Kill? Because then we are "more sure" that the OOM was caused by the benchmark container and not something else?

Author

Yes, an OOM kill would take down the whole eval, while this just causes the function to return False. I'm not entirely sure whether this is the desired behavior or whether we instead want to somehow mark the sample as "cannot be processed in eval".


Got it. This starts a subprocess for each and every sample to check, and only the subprocess is terminated. I wonder if this is a bit expensive? This benchmark already runs pretty long. Do you have an idea of the impact on the runtime?

Author

Yes, it seems to be too expensive. @prabhuteja12 has tested it. As all intermediate checkpoint evals have been successful now, I am wondering whether this is worth it at all, or whether we should just keep it as is and restart flaky future evals.
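A middle ground for the per-sample cost raised above would be a persistent worker pool: each worker installs the rlimit once at startup, so the fork/exec price is paid per worker rather than per sample. A sketch under that assumption (`check_all` and `_check_sample` are hypothetical names, and `_check_sample` is a stand-in for the real sympy check, not the PR's code):

```python
import multiprocessing as mp
import resource

_BUDGET_BYTES = 2 * 1024**3  # reusing the PR's 2 GiB figure


def _install_budget():
    # Runs once per worker at pool startup, so the rlimit cost is paid
    # per worker rather than per sample.
    resource.setrlimit(resource.RLIMIT_AS, (_BUDGET_BYTES, _BUDGET_BYTES))


def _check_sample(sample):
    # Hypothetical stand-in for the per-sample sympy equivalence check.
    try:
        return sample * 2
    except MemoryError:
        return None  # treat a blown budget as a failed check


def check_all(samples):
    ctx = mp.get_context("fork")  # POSIX-only; keeps the sketch simple
    with ctx.Pool(processes=2, initializer=_install_budget) as pool:
        return pool.map(_check_sample, samples)
```

The trade-off: a worker that hits `MemoryError` keeps its (possibly fragmented) address space for later samples, whereas the process-per-sample approach in this PR gives every sample a clean slate.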
