Open
Description
Core file: `dataset_info.jsonl`
TODO Items
- So far, almost all ERRORs result from the use of Google Drive links, which work only intermittently. We can move the affected files to S3 gradually. Since most of them are from summarization tasks, maybe @yixinL7 and @xcfcode could help out with this part.
- `languages` for several datasets should be added.
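To find the loaders that still depend on Google Drive, a small helper along these lines could be run over their download URLs. This is only a sketch: the function name and the host list are assumptions for illustration, not part of the existing scripts.

```python
from urllib.parse import urlparse

# Assumption: hosts that indicate a Google Drive download link.
# Extend this set if the loaders use other Drive-style hosts.
DRIVE_HOSTS = {"drive.google.com", "docs.google.com"}


def is_google_drive_url(url: str) -> bool:
    """Return True if the URL's host is one of the assumed Drive hosts."""
    host = urlparse(url).hostname or ""
    return host in DRIVE_HOSTS
```

Any URL flagged by this check would be a candidate for moving to S3.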
Some other follow-up things that should be done after task refactoring:
- IMPORTANT: update `get_dataset_info.py` and `dataset_info.json`, and make sure the result can be applied to the explainaboard_web db: Post-refactoring (update get_dataset_info & dataset scripts) #203
- update docs for the newly introduced task schema.
- make sure all datasets include
  - languages
  - other important metadata
- add task schema for the following (also think about modality-dependent schema)
  - glue-stsb
  - superglue
  - polyprompt
- reformat the organization of some datasets
  - adv_mtl
- add unit tests that check the validity of newly introduced dataset loader scripts.
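As a starting point for the metadata items above, a check like the following could be wrapped in a unit test. It is a minimal sketch that assumes each line of the info file is a JSON object with top-level metadata fields such as `languages`; the real layout may differ, and `find_missing_metadata` and `REQUIRED_FIELDS` are hypothetical names.

```python
import json
from typing import Iterable, List, Tuple

# Assumption: metadata fields every dataset record should carry.
REQUIRED_FIELDS = ("languages",)


def find_missing_metadata(
    lines: Iterable[str], required: Tuple[str, ...] = REQUIRED_FIELDS
) -> List[Tuple[int, List[str]]]:
    """Return (line_number, missing_fields) for each incomplete JSONL record."""
    problems = []
    for line_no, line in enumerate(lines, start=1):
        line = line.strip()
        if not line:
            continue  # skip blank lines
        record = json.loads(line)
        # A field counts as missing if it is absent or empty.
        missing = [field for field in required if not record.get(field)]
        if missing:
            problems.append((line_no, missing))
    return problems
```

A test could open the info file, pass the file object to `find_missing_metadata`, and assert that the returned list is empty.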