harshraj172 commented on Feb 13, 2026
- Adds the config to pretrain an open-sci architecture model with the MixtureVitae dataset on jupiter

```
# Conflicts:
#   README.md
#   config/autoexp.yaml
#   config/container/jupiter.yaml
#   config/slurm/jupiter.yaml
#   config/sweep/minimal.yaml
#   oellm_autoexp/backends/megatron_backend.py
#   scripts/run_autoexp_container.py
```
Ideally, make this use a common `$OELLM_DATASETS_TOKENIZED_DIR`-based path, so that this config works across clusters. (You can keep the absolute path as a comment so people know where to copy it from on another cluster.)
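A sketch of what that could look like in the config (the dataset filename and key are hypothetical placeholders; only the `OELLM_DATASETS_TOKENIZED_DIR` variable comes from this thread):

```yaml
# Portable form: resolve the dataset root from the environment.
# Keep the absolute path as a comment so people know where to copy
# the data from on another cluster, e.g.:
#   /p/scratch/.../datasets_tokenized/mixturevitae  (jupiter)
data_path: ${oc.env:OELLM_DATASETS_TOKENIZED_DIR}/mixturevitae
```

This way the same sweep config can run on any cluster that exports the variable, instead of hard-coding one cluster's filesystem layout.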
As a side note, we should unify the container names/paths, so that we don't have to change the config again and again.
But that's more of a general problem.
```yaml
env:
  CUDA_DEVICE_MAX_CONNECTIONS: "1"
  PYTORCH_CUDA_ALLOC_CONF: "expandable_segments:True"
  NCCL_SOCKET_IFNAME: ib0
```
These options might actually be useful in general on jupiter, right?
So we could add them to slurm/jupiter.yaml?
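A sketch of the suggested move (assuming `config/slurm/jupiter.yaml` accepts an `env:` mapping like the sweep config does; the surrounding keys are placeholders):

```yaml
# config/slurm/jupiter.yaml (sketch): cluster-wide env defaults live here
# instead of being repeated in every sweep config.
env:
  CUDA_DEVICE_MAX_CONNECTIONS: "1"
  PYTORCH_CUDA_ALLOC_CONF: "expandable_segments:True"
  NCCL_SOCKET_IFNAME: ib0
```

Individual sweep configs would then only override these values when an experiment genuinely needs something different.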
```yaml
gres: "gpu:4"
gpu_bind: "none"
time: "12:00:00"
partition: booster
```
I typically "externalize" these to env variables: `${oc.env:SLURM_PARTITION,booster}` / `${oc.env:SLURM_ACCOUNT,jureap59}`.
This way, one can update the project more easily.
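In context, the externalized fields might look like this (a sketch; the `account` value `jureap59` is taken from the example above and may differ per project):

```yaml
# Cluster-specific values resolved via OmegaConf's env resolver,
# falling back to the current jupiter defaults when the variable is unset.
partition: ${oc.env:SLURM_PARTITION,booster}
account: ${oc.env:SLURM_ACCOUNT,jureap59}
gres: "gpu:4"
gpu_bind: "none"
time: "12:00:00"
```

Exporting `SLURM_PARTITION`/`SLURM_ACCOUNT` in a cluster profile then updates every config at once, without editing the YAML files.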
If you update submodules/Megatron-LM, you should re-run the dataclass/config generation scripts (scripts/generate_megatron_config.py, scripts/generate_megatron_dataclass.py).
Thanks for removing those!