TinyGPT is a minimal C++11 implementation of GPT-2 inference, built from scratch and mainly inspired by the picoGPT project.
For more details, check out the accompanying blog post: Write a GPT from scratch (TinyGPT)
- Fast BPE tokenizer, inspired by tiktoken.
- CPU and CUDA inference.
- KV cache enabled (see the sketch after this list).
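
The KV cache is what keeps autoregressive decoding cheap: the attention keys and values of already-processed tokens are stored and reused, so each new token only requires computing projections for a single position. Below is a minimal sketch of the idea; the type and method names are hypothetical and are not TinyGPT's actual classes:

```cpp
#include <cstddef>
#include <utility>
#include <vector>

// Hypothetical per-layer KV cache (illustrative names, not TinyGPT's API).
// During decoding, only the new token's key/value vectors are computed
// and appended; attention then reads the whole cached history.
struct LayerKVCache {
  std::vector<std::vector<float>> keys;    // keys[t]   = key vector of token t
  std::vector<std::vector<float>> values;  // values[t] = value vector of token t

  void append(std::vector<float> k, std::vector<float> v) {
    keys.push_back(std::move(k));
    values.push_back(std::move(v));
  }

  std::size_t seqLen() const { return keys.size(); }
};
```

Without such a cache, step `t` would recompute key/value projections for all `t` earlier tokens, making each generated token progressively more expensive.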
`tinygpt::tokenizer` is faster than both HuggingFace Tokenizers and OpenAI tiktoken. Encoding speed was measured with the `~/benches/tokenizer.py` script on a machine with an Intel(R) Xeon(R) Platinum 8255C CPU @ 2.50GHz.
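
At the core of a tiktoken-style BPE encoder is a greedy merge loop: a piece of text is split into single bytes, then the adjacent pair with the lowest merge rank is merged repeatedly until no mergeable pair remains. The sketch below shows only this algorithm; the names are illustrative, and a fast encoder (presumably including TinyGPT's) would replace the naive `std::map` scan with better data structures:

```cpp
#include <climits>
#include <cstddef>
#include <map>
#include <string>
#include <utility>
#include <vector>

// Greedy BPE merge loop (illustrative only). `ranks` maps an adjacent
// pair of pieces to its merge priority; a lower rank merges earlier.
std::vector<std::string> bpeMerge(
    const std::string& piece,
    const std::map<std::pair<std::string, std::string>, int>& ranks) {
  std::vector<std::string> parts;
  for (char c : piece) parts.emplace_back(1, c);  // start from single bytes

  while (parts.size() > 1) {
    int bestRank = INT_MAX;
    std::size_t bestIdx = 0;
    // Find the adjacent pair with the lowest merge rank.
    for (std::size_t i = 0; i + 1 < parts.size(); ++i) {
      auto it = ranks.find(std::make_pair(parts[i], parts[i + 1]));
      if (it != ranks.end() && it->second < bestRank) {
        bestRank = it->second;
        bestIdx = i;
      }
    }
    if (bestRank == INT_MAX) break;  // no pair is mergeable
    parts[bestIdx] += parts[bestIdx + 1];
    parts.erase(parts.begin() + bestIdx + 1);
  }
  return parts;  // each part then maps to a token id via the vocabulary
}
```

The quadratic rescan in this sketch is exactly what production tokenizers optimize away, for example with cached pair ranks and flat hash maps.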
```bash
git clone --recurse-submodules https://github.com/keith2018/TinyGPT.git
```
```bash
python3 tools/download_gpt2_model.py
```

If successful, you'll see the file `model_file.data` in the directory `assets/gpt2`.
```bash
mkdir build
cmake -B ./build -DCMAKE_BUILD_TYPE=Release
cmake --build ./build --config Release
```
This will generate the executable and copy the assets to the directory `app/bin`, where you can run the demo:
```bash
cd app/bin
./TinyGPT_demo
```
```
[DEBUG] TIMER TinyGPT::Model::loadModelGPT2: cost: 800 ms
[DEBUG] TIMER TinyGPT::Encoder::getEncoder: cost: 191 ms
INPUT:Alan Turing theorized that computers would one day become
GPT:the most powerful machines on the planet.
INPUT:exit
```
- Tensor: [TinyTorch](https://github.com/keith2018/TinyTorch)
- JsonParser: [RapidJSON](https://github.com/Tencent/rapidjson)
- Regex
- HashMap: [ankerl::unordered_dense](https://github.com/martinus/unordered_dense)
- ConcurrentQueue: [moodycamel::ConcurrentQueue](https://github.com/cameron314/concurrentqueue)
This code is licensed under the MIT License (see LICENSE).