Activity

Add instruction finetuned model

mattiacurripushed 1 commit to main • 5ea4ff2…da92e06 •

on Sep 26, 2024

Import fix

mattiacurripushed 1 commit to main • 1cc1349…5ea4ff2 •

on Sep 26, 2024

Added chainlit app + lil refactor classification_finetuning

mattiacurripushed 1 commit to main • 894cdeb…1cc1349 •

on Sep 25, 2024

Added optional saving in better training loop

mattiacurripushed 1 commit to main • ad6f3ff…894cdeb •

on Sep 24, 2024

Fix training loop import in instruction_finetuning

mattiacurripushed 1 commit to main • 451bcd9…ad6f3ff •

on Sep 23, 2024

Finetuning refactor + plot refactor

mattiacurripushed 2 commits to main • 6acaff0…451bcd9 •

on Sep 23, 2024

Deleted gpt2.py

mattiacurripushed 3 commits to main • 057f66e…6acaff0 •

on Sep 22, 2024

Refactor SimpleDataset class

mattiacurripushed 1 commit to main • 3667084…057f66e •

on Sep 20, 2024

Refactor SmolGPT class

mattiacurripushed 1 commit to main • bb0ed04…3667084 •

on Sep 20, 2024

Adjusted SelfAttention imports

mattiacurripushed 1 commit to main • ca97a54…bb0ed04 •

on Sep 20, 2024

Refactor of SelfAttention classes

mattiacurripushed 1 commit to main • 790eb59…ca97a54 •

on Sep 20, 2024

Moved SimpleTokenizer to a specific file

mattiacurripushed 1 commit to main • 7c14aa3…790eb59 •

on Sep 19, 2024

LoRa Fine-Tuning

mattiacurripushed 1 commit to main • 8c70d26…7c14aa3 •

on Sep 17, 2024

Finished evaluation

mattiacurripushed 1 commit to main • c0aaa3a…8c70d26 •

on Sep 15, 2024

Setup handling of ollama and loaded models

mattiacurripushed 1 commit to main • 9823dae…c0aaa3a •

on Sep 15, 2024

Saved response of model to test set

mattiacurripushed 1 commit to main • a6f6c11…9823dae •

on Sep 14, 2024

FIne-tuning of gpt2-small done

mattiacurripushed 4 commits to main • 6da1a8c…a6f6c11 •

on Sep 14, 2024

Finished Model Classification Fine-Tuning

mattiacurripushed 3 commits to main • 3274af6…6da1a8c •

on Sep 13, 2024

Finished Dataset Preparation for Classification Fine-Tuning

mattiacurripushed 1 commit to main • e643f82…3274af6 •

on Sep 13, 2024

deleted gpt_download.py

mattiacurripushed 2 commits to main • 93c9294…e643f82 •

on Sep 13, 2024

Updated .gitignore

mattiacurripushed 1 commit to main • 54ffea4…93c9294 •

on Sep 11, 2024

Added GPT2 weights download

mattiacurripushed 1 commit to main • 72061a4…54ffea4 •

on Sep 11, 2024

Better training loop + saving model

mattiacurripushed 1 commit to main • 246fc6c…72061a4 •

on Sep 10, 2024

Add loss plot and 'smart' text generation

mattiacurripushed 3 commits to main • adb3415…246fc6c •

on Sep 10, 2024

Loss function and train-val split implemented

mattiacurripushed 2 commits to main • e816ecb…adb3415 •

on Sep 10, 2024

Implemented basic generation of text

mattiacurripushed 6 commits to main • 1d2296d…e816ecb •

on Sep 9, 2024

Implemented MultiHeadAttention class w/ batched matrix multiplication

mattiacurripushed 3 commits to main • eddd366…1d2296d •

on Sep 7, 2024

Implemented SelfAttentionV1 and SelfAttentionV2

mattiacurripushed 3 commits to main • d23343f…eddd366 •

on Sep 6, 2024

Update README.md

mattiacurripushed 1 commit to main • 03e0f94…d23343f •

on Sep 5, 2024

Add README

mattiacurripushed 1 commit to main • 1a376f7…03e0f94 •

on Sep 3, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add instruction finetuned model

Import fix

Added chainlit app + lil refactor classification_finetuning

Added optional saving in better training loop

Fix training loop import in instruction_finetuning

Finetuning refactor + plot refactor

Deleted gpt2.py

Refactor SimpleDataset class

Refactor SmolGPT class

Adjusted SelfAttention imports

Refactor of SelfAttention classes

Moved SimpleTokenizer to a specific file

LoRa Fine-Tuning

Finished evaluation

Setup handling of ollama and loaded models

Saved response of model to test set

FIne-tuning of gpt2-small done

Finished Model Classification Fine-Tuning

Finished Dataset Preparation for Classification Fine-Tuning

deleted gpt_download.py

Updated .gitignore

Added GPT2 weights download

Better training loop + saving model

Add loss plot and 'smart' text generation

Loss function and train-val split implemented

Implemented basic generation of text

Implemented MultiHeadAttention class w/ batched matrix multiplication

Implemented SelfAttentionV1 and SelfAttentionV2

Update README.md

Add README