-
Notifications
You must be signed in to change notification settings - Fork 25
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. Weβll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Multi-Dataset Validation (LM-Loss/Perplexity) #178
Conversation
@jlamypoirier What do you think of such approach? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I have implemented it so that the config will look something like this: data:
datasets:
Training: # training dataset, hardcoded name
type: memmap
path: some_path1
Test: # test dataset, hardcoded name
type: memmap
path: some_path2
validation_dataset_name1: # validation dataset, any name
type: memmap
path: some_path3
validation_dataset_name2: # validation dataset, any name
type: memmap
path: some_path4
training:
training_iters: 2
test_iters: 2
validation:
validation_dataset_name1:
interval: 2
iterations: 1
validation_dataset_name2:
interval: 2
iterations: 1 @jlamypoirier, have I got it right? |
Maybe to add |
Yes
I don't think it's worth it, but feel free to fix the capitalization if you find a way to do it without breaking backward comparibility |
β¦heir usage config to EvaluationConfig
I have changed |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Almost ready to merge, some minor issues and suggestions
Guys can we get to a conclusion here please? Iβd like us to merge this by tomorrow end of day, at the latest. Thanks |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@bigximik Adjusted the code, ready to merge if it works for you.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
thank you both, looks good to me!
I have updated the instruction fine-tuning documentation that came from the merge but created a separate issue for moving the check of used dataset definitions to |
β¨ Description
Closes #65
π Type of change
Select all that apply:
π Changes
List the key changes introduced in this PR:
training.validation
config field is a dictionary of validation dataset names and their application parameters.β Checklist
Make sure the following tasks are completed before submitting the PR:
General
Testing