Skip to content

Commit

Permalink
feat: add DeepSeek-R1-Distill-Qwen-7B
Browse files Browse the repository at this point in the history
  • Loading branch information
katopz committed Jan 21, 2025
1 parent c788bc8 commit b1e3fb7
Showing 1 changed file with 12 additions and 5 deletions.
17 changes: 12 additions & 5 deletions src/ml/frameworks/apple-mlx.md
Original file line number Diff line number Diff line change
Expand Up @@ -154,13 +154,20 @@ open https://huggingface.co/settings/tokens
pip install --upgrade huggingface_hub
huggingface-cli login

# MODEL=SeaLLMs/SeaLLM-7B-v2
# MODEL_NAME=SeaLLM-7B-v2
MODEL=scb10x/typhoon-v1.5-8b-instruct
MODEL_NAME=typhoon-v1.5-8b-instruct
# Pick model
MODEL=deepseek-ai/DeepSeek-R1-Distill-Qwen-7B

# Define the name
MODEL_NAME=DeepSeek-R1-Distill-Qwen-7B

# Clear old stuff
rm -rf mlx_model

# Convert and upload
python3 -m mlx_lm.convert --hf-path ${MODEL} -q --upload-repo mlx-community/${MODEL_NAME}-4bit-mlx
python3 -m mlx_lm.generate --model mlx-community/${MODEL_NAME}-4bit-mlx --prompt "สอนเขียน helloworld ด้วย Rust หน่อย" --eos-token "<|eot_id|>"

# Try uploaded model
python -m mlx_lm.generate --model ${MODEL} --max-tokens 512 --prompt "Translate to English: おはようご ざいます"
```

## mlx-server (Not production grade)
Expand Down

0 comments on commit b1e3fb7

Please sign in to comment.