To run the Efficient LLM Context Distillation models, first create and activate the conda environment:
```bash
conda env create -f relu_ranger.yaml
conda activate relu_ranger
```
If you are using Visual Studio Code, press Ctrl+Shift+P and run "Python: Select Interpreter" to set relu_ranger as your default environment.
Run jupyter notebook, open the relevant file (opt-125m.ipynb or teacher_student.ipynb), and run all cells to get the model outputs.
The project implements context distillation: a student model is trained to match, via a KL-divergence loss, the output distribution of a teacher model that conditions on an additional context the student does not see. LoRA (Low-Rank Adaptation) is incorporated so that only low-rank adapter weights are updated during training. The following OPT checkpoints can be used (a minimal training sketch follows the list):
- facebook/opt-125m
- facebook/opt-350m
- facebook/opt-1.3b
- facebook/opt-2.7b
- facebook/opt-6.7b
- facebook/opt-13b
- facebook/opt-30b
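Below is a minimal sketch of what one such training step could look like, assuming a PyTorch / Hugging Face transformers setup with the peft library. The checkpoint pairing, the context_distillation_loss helper, the LoRA hyperparameters, and the temperature value are illustrative assumptions, not the project's actual code:

```python
# Minimal sketch of context distillation with LoRA (illustrative only).
# Assumes the transformers and peft packages are installed.
import torch
import torch.nn.functional as F
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM, AutoTokenizer

teacher_name = "facebook/opt-1.3b"  # any larger checkpoint from the list above
student_name = "facebook/opt-125m"  # smaller student to be trained

tokenizer = AutoTokenizer.from_pretrained(student_name)  # OPT sizes share a tokenizer
teacher = AutoModelForCausalLM.from_pretrained(teacher_name).eval()
student = AutoModelForCausalLM.from_pretrained(student_name)

# Wrap the student in LoRA adapters so only low-rank matrices receive gradients.
lora_cfg = LoraConfig(r=8, lora_alpha=16, target_modules=["q_proj", "v_proj"],
                      task_type="CAUSAL_LM")
student = get_peft_model(student, lora_cfg)

def context_distillation_loss(context: str, prompt: str,
                              temperature: float = 2.0) -> torch.Tensor:
    """KL divergence between the teacher's next-token distributions (conditioned
    on context + prompt) and the student's (conditioned on the prompt alone)."""
    # Assumes the context ends at a clean token boundary (e.g. a newline), so
    # the prompt tokenizes identically inside and outside the concatenation.
    teacher_ids = tokenizer(context + prompt, return_tensors="pt").input_ids
    student_ids = tokenizer(prompt, return_tensors="pt").input_ids
    with torch.no_grad():
        # Keep only the teacher logits aligned with the student's positions,
        # i.e. the last student_ids.size(1) positions of the longer input.
        t_logits = teacher(teacher_ids).logits[:, -student_ids.size(1):, :]
    s_logits = student(student_ids).logits
    # Soften both distributions with a temperature, then compare.
    t_probs = F.softmax(t_logits / temperature, dim=-1)
    s_log_probs = F.log_softmax(s_logits / temperature, dim=-1)
    return F.kl_div(s_log_probs, t_probs, reduction="batchmean") * temperature ** 2

optimizer = torch.optim.AdamW(student.parameters(), lr=1e-4)
loss = context_distillation_loss(
    context="Answer every question in one short sentence.\n",
    prompt="What is the capital of France?")
loss.backward()   # gradients flow only into the LoRA adapter weights
optimizer.step()
```

The temperature softening and the temperature-squared rescaling follow the standard distillation recipe; because only the LoRA adapters receive gradients, the student's base weights stay frozen.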
To run on Google Colab, upload the run_models.ipynb notebook and its dependencies (context_utils.py, training_utils.py, and data_utils.py).
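If the dependency files are not already in the Colab session, one possible way to get them there (an assumption, not the project's required workflow) is to upload them from a notebook cell:

```python
# Hypothetical convenience cell: upload the notebook's dependencies into the
# Colab runtime (alternatively, drag them into the file browser on the left).
from google.colab import files
files.upload()  # select context_utils.py, training_utils.py, data_utils.py
```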