
Multi-Dataset Validation (LM-Loss/Perplexity) #178

Merged: jlamypoirier merged 22 commits into main from denis/multi_validation on Mar 28, 2025

Conversation

@bigximik (Contributor) commented Mar 7, 2025

✨ Description

Closes #65

πŸ” Type of change

Select all that apply:

  • πŸ› Bug fix (non-breaking change that addresses a specific issue)
  • πŸš€ New feature (non-breaking change that adds functionality)
  • ⚠️ Breaking change (a change that could affect existing functionality)
  • πŸ“ˆ Performance improvement/optimization (improves speed, memory usage, or efficiency)
  • πŸ› οΈ Code refactor (non-functional changes that improve code readability, structure, etc.)
  • πŸ“¦ Dependency bump (updates dependencies, including Dockerfile or package changes)
  • πŸ“ Documentation change (updates documentation, including new content or typo fixes)
  • πŸ”§ Infrastructure/Build change (affects build process, CI/CD, or dependencies)

πŸ“ Changes

List the key changes introduced in this PR:

  1. Added support for multiple validation datasets, with loss and the other existing metrics tracked per dataset. This introduces a breaking change: the training.validation config field is now a dictionary mapping validation dataset names to their application parameters (see the sketch below).
  2. Updated the existing tests to accommodate the new config format.
  3. Updated and extended the documentation to describe the new config format.
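For illustration, a minimal before/after sketch of the breaking change (the flat pre-PR validation layout is an assumption; the interval and iterations values are placeholders taken from the example further down):

training:
  validation:  # before (assumed): a single validation run, configured directly
    interval: 2
    iterations: 1

training:
  validation:  # after: a dictionary of validation dataset names and their parameters
    validation_dataset_name1:
      interval: 2
      iterations: 1
    validation_dataset_name2:
      interval: 2
      iterations: 1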

βœ… Checklist

Make sure the following tasks are completed before submitting the PR:

General

  • πŸ“œ I have read and followed the contributing guidelines.
  • 🏷️ I am using a clear and descriptive PR title that summarizes the key change or feature introduced.
  • πŸŽ‰ The functionality is complete, and I have tested the changes.
  • πŸ“ I have updated the documentation if needed.
  • ⚠️ The change does not introduce any new issues (e.g., runtime warnings, type checker errors, linting problems, unhandled edge cases).
  • 🧩 I have commented my code, especially in hard-to-understand areas.

Testing

  • πŸ§ͺ I have added or updated tests to cover my changes.
  • βœ”οΈ New and existing tests pass locally with my changes.
  • 🚦 I have tested these changes on GPUs and verified training stability.
  • πŸ‹οΈ I have tested the changes on realistic training workloads, if applicable.

@bigximik (Contributor, Author) commented Mar 7, 2025

@jlamypoirier What do you think of this approach?

@jlamypoirier (Collaborator) left a comment

@bigximik That's really close to the approach I had in mind. I would suggest moving some of the changes from the data side to the trainer, though; it would simplify things, and we'll want it anyway in the next step of #65.

@bigximik (Contributor, Author) commented Mar 19, 2025

I have implemented it so that the config will look something like this:

data:
  datasets:
    Training: # training dataset, hardcoded name
      type: memmap
      path: some_path1
    Test:  # test dataset, hardcoded name
      type: memmap
      path: some_path2
    validation_dataset_name1:  # validation dataset, any name
      type: memmap
      path: some_path3
    validation_dataset_name2:  # validation dataset, any name
      type: memmap
      path: some_path4

training:
  training_iters: 2
  test_iters: 2
  validation:
    validation_dataset_name1:
      interval: 2
      iterations: 1
    validation_dataset_name2:
      interval: 2
      iterations: 1

@jlamypoirier, have I got it right?

@bigximik (Contributor, Author) commented:

Maybe add training_dataset_name and test_dataset_name to TrainingConfig, with default values Training and Test, so people can override them if they like?

@jlamypoirier (Collaborator) commented:

> I have implemented it so that the config will look something like this: [...]
> @jlamypoirier, have I got it right?

Yes.

> Maybe add training_dataset_name and test_dataset_name to TrainingConfig, with default values Training and Test, so people can override them if they like?

I don't think it's worth it, but feel free to fix the capitalization if you find a way to do it without breaking backward compatibility.

@bigximik (Contributor, Author) commented Mar 24, 2025

In wandb, metrics for the different validation datasets look like this:

[Screenshot: wandb charts showing per-dataset validation metrics]

@bigximik bigximik marked this pull request as ready for review March 24, 2025 12:38
@bigximik bigximik requested a review from jlamypoirier March 24, 2025 12:38
@bigximik (Contributor, Author) commented:

I have changed validation to evaluations and ValidationConfig to EvaluationConfig, but kept the phase as Validation. So, in the documentation, these are still referred to as validation datasets: they must be defined in data.datasets, while their usage is specified in training.evaluations (see the sketch below).
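Under the new naming, the earlier example would look something like this (a sketch derived from the rename described above; the dataset names remain placeholders):

data:
  datasets:
    Training:  # training dataset, hardcoded name
      type: memmap
      path: some_path1
    validation_dataset_name1:  # validation dataset, defined in data.datasets
      type: memmap
      path: some_path3

training:
  training_iters: 2
  evaluations:  # formerly training.validation
    validation_dataset_name1:
      interval: 2
      iterations: 1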

@jlamypoirier (Collaborator) left a comment

Almost ready to merge; some minor issues and suggestions.

@tscholak (Collaborator) commented:

Guys, can we get to a conclusion here, please? I'd like us to merge this by tomorrow end of day at the latest. Thanks!

@jlamypoirier (Collaborator) left a comment

@bigximik I've adjusted the code; ready to merge if it works for you.

@tscholak (Collaborator) left a comment

Thank you both, looks good to me!

@bigximik (Contributor, Author) commented Mar 28, 2025

I have updated the instruction fine-tuning documentation that came in from the merge, and created a separate issue (#213) for moving the check of the used dataset definitions into _validate in TrainerConfig.

@jlamypoirier jlamypoirier merged commit 21182c2 into main Mar 28, 2025
4 checks passed
@jlamypoirier jlamypoirier deleted the denis/multi_validation branch March 28, 2025 21:03
Successfully merging this pull request may close these issues: Multi-Dataset Validation with LM-Loss (#65)