Activity

Add instruction finetuned model

mattiacurri pushed 1 commit to main • 5ea4ff2…da92e06 on Sep 26, 2024

Import fix

mattiacurri pushed 1 commit to main • 1cc1349…5ea4ff2 on Sep 26, 2024

Added chainlit app + lil refactor classification_finetuning

mattiacurri pushed 1 commit to main • 894cdeb…1cc1349 on Sep 25, 2024

Added optional saving in better training loop

mattiacurri pushed 1 commit to main • ad6f3ff…894cdeb on Sep 24, 2024

Fix training loop import in instruction_finetuning

mattiacurri pushed 1 commit to main • 451bcd9…ad6f3ff on Sep 23, 2024

Finetuning refactor + plot refactor

mattiacurri pushed 2 commits to main • 6acaff0…451bcd9 on Sep 23, 2024

Deleted gpt2.py

mattiacurri pushed 3 commits to main • 057f66e…6acaff0 on Sep 22, 2024

Refactor SimpleDataset class

mattiacurri pushed 1 commit to main • 3667084…057f66e on Sep 20, 2024

Refactor SmolGPT class

mattiacurri pushed 1 commit to main • bb0ed04…3667084 on Sep 20, 2024

Adjusted SelfAttention imports

mattiacurri pushed 1 commit to main • ca97a54…bb0ed04 on Sep 20, 2024

Refactor of SelfAttention classes

mattiacurri pushed 1 commit to main • 790eb59…ca97a54 on Sep 20, 2024

Moved SimpleTokenizer to a specific file

mattiacurri pushed 1 commit to main • 7c14aa3…790eb59 on Sep 19, 2024

LoRA Fine-Tuning

mattiacurri pushed 1 commit to main • 8c70d26…7c14aa3 on Sep 17, 2024

Finished evaluation

mattiacurri pushed 1 commit to main • c0aaa3a…8c70d26 on Sep 15, 2024

Setup handling of ollama and loaded models

mattiacurri pushed 1 commit to main • 9823dae…c0aaa3a on Sep 15, 2024

Saved response of model to test set

mattiacurri pushed 1 commit to main • a6f6c11…9823dae on Sep 14, 2024

Fine-tuning of gpt2-small done

mattiacurri pushed 4 commits to main • 6da1a8c…a6f6c11 on Sep 14, 2024

Finished Model Classification Fine-Tuning

mattiacurri pushed 3 commits to main • 3274af6…6da1a8c on Sep 13, 2024

Finished Dataset Preparation for Classification Fine-Tuning

mattiacurri pushed 1 commit to main • e643f82…3274af6 on Sep 13, 2024

Deleted gpt_download.py

mattiacurri pushed 2 commits to main • 93c9294…e643f82 on Sep 13, 2024

Updated .gitignore

mattiacurri pushed 1 commit to main • 54ffea4…93c9294 on Sep 11, 2024

Added GPT2 weights download

mattiacurri pushed 1 commit to main • 72061a4…54ffea4 on Sep 11, 2024

Better training loop + saving model

mattiacurri pushed 1 commit to main • 246fc6c…72061a4 on Sep 10, 2024

Add loss plot and 'smart' text generation

mattiacurri pushed 3 commits to main • adb3415…246fc6c on Sep 10, 2024

Loss function and train-val split implemented

mattiacurri pushed 2 commits to main • e816ecb…adb3415 on Sep 10, 2024

Implemented basic generation of text

mattiacurri pushed 6 commits to main • 1d2296d…e816ecb on Sep 9, 2024

Implemented MultiHeadAttention class w/ batched matrix multiplication

mattiacurri pushed 3 commits to main • eddd366…1d2296d on Sep 7, 2024

Implemented SelfAttentionV1 and SelfAttentionV2

mattiacurri pushed 3 commits to main • d23343f…eddd366 on Sep 6, 2024

Update README.md

mattiacurri pushed 1 commit to main • 03e0f94…d23343f on Sep 5, 2024

Add README

mattiacurri pushed 1 commit to main • 1a376f7…03e0f94 on Sep 3, 2024