We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
What Jeremy used to review the initial Claude Code PR:
lemonade-server.exe serve --ctx-size 128000 --port 11434 --llamacpp rocm --llamacpp-args="--no-mmap -b 16384 -ub 16384 -fa on" --log-level debug ollama launch claude --model Qwen3-Coder-30B-A3B-Instruct-GGUF
Some presets for Qwen3.5-35B-A3B with parameters recommended by Unsloth
Qwen3.5-35B-A3B-NoThinking.json
Qwen3.5-35B-A3B-ThinkingCoder.json
Qwen3.5-122B-A10B-NoThinking.json
Qwen3.5-122B-A10B-ThinkingCoder.json
Thinking preset for Qwen3.5-9B. Can be easily adapted to the other small version of Qwen3.5
Qwen3.5-9B-Thinking.json
Note ctx_size = 0 sets the context size to the model's preferred context size, 256K in this case.
ctx_size = 0