PyTorch scripts for training GPT-2 (from scratch) on the IMDB movie reviews dataset. The model and data are loaded via Hugging Face `transformers`.
Even with its default configuration, GPT-2 is large enough that only a few samples per batch fit on a single GPU. That makes these scripts useful for benchmarking a large-scale attention-based NLP training job.
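The training scripts themselves aren't reproduced here, but for orientation, below is a minimal sketch of what a from-scratch GPT-2 language-modeling run on IMDB looks like with `transformers` and the `nlp` library. The hyperparameters (sequence length, batch size, learning rate) and loop structure are illustrative assumptions, not the repo's actual code:

```python
# A minimal sketch of from-scratch GPT-2 training on IMDB. Hyperparameters
# and loop structure are assumptions, not the repo's actual code.
import torch
from torch.utils.data import DataLoader
from transformers import GPT2Config, GPT2LMHeadModel, GPT2TokenizerFast
import nlp  # Hugging Face's dataset library (since renamed `datasets`)

tokenizer = GPT2TokenizerFast.from_pretrained("gpt2")
tokenizer.pad_token = tokenizer.eos_token  # GPT-2 has no pad token by default

# Randomly initialized GPT-2: trained from scratch, not fine-tuned.
model = GPT2LMHeadModel(GPT2Config()).cuda()

dataset = nlp.load_dataset("imdb", split="train")

def encode(batch):
    return tokenizer(batch["text"], truncation=True, max_length=512,
                     padding="max_length")

dataset = dataset.map(encode, batched=True)
dataset.set_format("torch", columns=["input_ids", "attention_mask"])

# The per-GPU batch size stays small because the model is so large.
loader = DataLoader(dataset, batch_size=4, shuffle=True)
optimizer = torch.optim.AdamW(model.parameters(), lr=5e-5)

model.train()
for batch in loader:
    input_ids = batch["input_ids"].cuda()
    mask = batch["attention_mask"].cuda()
    # Causal language modeling: labels are the inputs, shifted internally.
    # (A real script would also mask out pad tokens in the labels.)
    loss = model(input_ids, attention_mask=mask, labels=input_ids)[0]
    loss.backward()
    optimizer.step()
    optimizer.zero_grad()
```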
To run on Spell:
```bash
spell run --machine-type v100 \
  --github-url https://github.com/spellml/gpt2-imdb.git \
  --pip transformers --pip nlp --pip tokenizers \
  --tensorboard-dir /spell/tensorboards/model_1/ \
  "python models/model_1.py"

spell run --machine-type v100x4 \
  --github-url https://github.com/spellml/gpt2-imdb.git \
  --pip transformers --pip nlp --pip tokenizers \
  --tensorboard-dir /spell/tensorboards/model_2/ \
  "python models/model_2.py"

spell run --machine-type v100x8 \
  --github-url https://github.com/spellml/gpt2-imdb.git \
  --pip transformers --pip nlp --pip tokenizers \
  --tensorboard-dir /spell/tensorboards/model_2/ \
  "python models/model_2.py"

spell run --machine-type v100x4 \
  --github-url https://github.com/spellml/gpt2-imdb.git \
  --pip transformers --pip nlp --pip tokenizers \
  --tensorboard-dir /spell/tensorboards/model_3/ \
  "python models/model_3.py"

spell run --machine-type v100x8 \
  --github-url https://github.com/spellml/gpt2-imdb.git \
  --pip transformers --pip nlp --pip tokenizers \
  --tensorboard-dir /spell/tensorboards/model_3/ \
  "python models/model_3.py"
```