Add kcxain/autoresearch-slurm to notable forks (Slurm / HPC clusters)#422
Open
kcxain wants to merge 1 commit intokarpathy:masterfrom
Open
Add kcxain/autoresearch-slurm to notable forks (Slurm / HPC clusters)#422kcxain wants to merge 1 commit intokarpathy:masterfrom
kcxain wants to merge 1 commit intokarpathy:masterfrom
Conversation
Agolid
pushed a commit
to Agolid/autoresearch
that referenced
this pull request
Mar 27, 2026
- Slurm / HPC cluster support for autoresearch - Enables running on shared compute clusters Fixes karpathy#422
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Hi! This adds my Slurm port to the notable forks list.
What it does: lets the agent run on HPC clusters managed by Slurm. Instead of executing
uv run train.pydirectly, the agent submitssbatchjobs from the login node, polls withsqueue, and reads results from the job log — so compute nodes don't need internet access. The rest of the rules are unchanged: one mutabletrain.py, one metric (val_bpb), fixed 5-minute budget, keep-or-revert via git.