feat: swap Tier 3 from GPT-OSS-20B to Qwen3.5-27B by Lightheartdevs · Pull Request #573 · Light-Heart-Labs/DreamServer

Lightheartdevs · 2026-03-23T03:50:23Z

Summary

Swapping to Qwen3.5-27B (16.7GB Q4_K_M) — same family as Tier 1-2, proven llama.cpp compatibility, fits 20-39GB VRAM.

Test plan

Perplexica search queries return results (structured output works)
Open WebUI chat works
OpenClaw agent responds
bash tests/test-tier-map.sh passes

🤖 Generated with Claude Code

GPT-OSS-20B uses special tokens (<|start|>, <|channel|>, <|constrain|>) for structured output that are incompatible with llama.cpp's JSON grammar mode. This causes Perplexica (which uses generateObject) to fail with HTTP 500 on every query. Pure chat inference worked fine but structured output / tool calling was broken. Qwen3.5-27B (16.7GB Q4_K_M) is the same model family as Tier 1-2 (Qwen 3.5), proven compatible with llama.cpp structured output, and fits in 20-39GB VRAM tier. Updated across all platforms, tests, agent templates, docs, and disk estimation. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Qwen3.5-27B-Q4_K_M.gguf not qwen3.5-27b-Q4_K_M.gguf — sed lowercased it during the bulk replace. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Follow-up to #573 — docs still referenced old Qwen3 8B/4B/14B models. Updated to match current tier map: - T1/T2/ARC: Qwen3.5 9B - T3: Qwen3.5 27B - ARC_LITE: Qwen3.5 4B Files: root README, FAQ, INTEL-ARC-GUIDE, MACOS-QUICKSTART, SUPPORT-MATRIX Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Follow-up to #573 — docs still referenced old Qwen3 8B/4B/14B models. Updated to match current tier map: - T1/T2/ARC: Qwen3.5 9B - T3: Qwen3.5 27B - ARC_LITE: Qwen3.5 4B Files: root README, FAQ, INTEL-ARC-GUIDE, MACOS-QUICKSTART, SUPPORT-MATRIX Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Lightheartdevs and others added 2 commits March 22, 2026 23:50

fix: correct GGUF_FILE case in Tier 3 test assertion

8794180

Qwen3.5-27B-Q4_K_M.gguf not qwen3.5-27b-Q4_K_M.gguf — sed lowercased it during the bulk replace. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Lightheartdevs merged commit 3077e66 into main Mar 23, 2026
14 of 20 checks passed

Lightheartdevs mentioned this pull request Mar 23, 2026

docs: update all tier tables to Qwen 3.5 models #574

Merged

Lightheartdevs mentioned this pull request Mar 23, 2026

feat: NVIDIA Multi-GPU Detection, Topology-Aware Assignment & Parallelism #501

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: swap Tier 3 from GPT-OSS-20B to Qwen3.5-27B#573

feat: swap Tier 3 from GPT-OSS-20B to Qwen3.5-27B#573
Lightheartdevs merged 2 commits intomainfrom
feat/tier3-qwen35-27b

Lightheartdevs commented Mar 23, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

Lightheartdevs commented Mar 23, 2026

Summary

Test plan

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant