Hyperparameter details for fine tuning bert-large in the oos-train setting

Hi,

I am using huggingface to fine-tune bert large on the CLINC dataset. I follow the hyperparameters mentioned in `hyperparams.csv` but there's ~3 point difference in inscope accuracy for the `oos-train` setting (93.49 v/s 96.9 for Full version of the dataset; similarly for the OOS-Plus setting). I am wondering if this is due to some HF defaults, for e.g., HF defaults to 1.0 for gradient clipping, I am not sure what did you use. Would it be possible to clarify a bit more about your fine-tuning process? It'd be very helpful.

Thanks,
Gaurav.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Hyperparameter details for fine tuning bert-large in the oos-train setting #8

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Hyperparameter details for fine tuning bert-large in the oos-train setting #8

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions