Skip to content

spellml/gpt2-imdb

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 

Repository files navigation

gpt2-imdb

pytorch scripts training GPT2 (trained from scratch) on the IMDB movie reviews dataset. The model and data are via hugginface transformers.

Even with the standard settings GPT2 is such a large model that it's hard to fit many items per batch onto a single video card! That makes this training script useful for benchmarking a large-scale attention NLP model training job.

To run on Spell:

spell run --machine-type v100 \
  --github-url https://github.com/spellml/gpt2-imdb.git \
  --pip transformers --pip nlp --pip tokenizers \
  --tensorboard-dir /spell/tensorboards/model_1/ \
  "python models/model_1.py"
spell run --machine-type v100x4 \
  --github-url https://github.com/spellml/gpt2-imdb.git \
  --pip transformers --pip nlp --pip tokenizers \
  --tensorboard-dir /spell/tensorboards/model_2/ \
  "python models/model_2.py"
spell run --machine-type v100x8 \
  --github-url https://github.com/spellml/gpt2-imdb.git \
  --pip transformers --pip nlp --pip tokenizers \
  --tensorboard-dir /spell/tensorboards/model_2/ \
  "python models/model_2.py"
spell run --machine-type v100x4 \
  --github-url https://github.com/spellml/gpt2-imdb.git \
  --pip transformers --pip nlp --pip tokenizers \
  --tensorboard-dir /spell/tensorboards/model_3/ \
  "python models/model_3.py"
spell run --machine-type v100x8 \
  --github-url https://github.com/spellml/gpt2-imdb.git \
  --pip transformers --pip nlp --pip tokenizers \
  --tensorboard-dir /spell/tensorboards/model_3/ \
  "python models/model_3.py"

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published