-
Notifications
You must be signed in to change notification settings - Fork 3
Temperature test and results #43
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
| parser = argparse.ArgumentParser(description="Count accuracy for JSONL files with gold and answer fields") | ||
| parser.add_argument("jsonl_path", type=str, help="Path to the JSONL file") | ||
|
|
||
| args = parser.parse_args() |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Do we allow CLI use for information/description files?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It is not really an experiment, but a post-processing tool. Should be fine in my view
| @@ -0,0 +1,9 @@ | |||
| import pandas as pd | |||
|
|
|||
| df = pd.read_json("data/out/distillation/mmlu_synth_gptoss_a_t0_8.jsonl", lines=True) | |||
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Switching to read_ndjson will save us a couple of minutes :)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Wouldn't it be better to stick to one file per experiment?
Files for all experiments will also be attached.
Alternative: describe key changes in the experiments in plain text (either in .md or as comments in the code itself).
No description provided.