Skip to content

Commit 84644ab

Browse files
localai-botmudler
andauthored
chore(model gallery): 🤖 add 1 new models via gallery agent (#6908)
chore(model gallery): 🤖 add new models via gallery agent Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <[email protected]>
1 parent b8f40dd commit 84644ab

File tree

1 file changed

+60
-0
lines changed

1 file changed

+60
-0
lines changed

gallery/index.yaml

Lines changed: 60 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -22950,3 +22950,63 @@
2295022950
- filename: apollo-astralis-4b.i1-Q4_K_M.gguf
2295122951
sha256: 94e1d371420b03710fc7de030c1c06e75a356d9388210a134ee2adb4792a2626
2295222952
uri: huggingface://mradermacher/apollo-astralis-4b-i1-GGUF/apollo-astralis-4b.i1-Q4_K_M.gguf
22953+
- !!merge <<: *qwen3
22954+
name: "qwen3-vlto-32b-instruct-i1"
22955+
urls:
22956+
- https://huggingface.co/mradermacher/Qwen3-VLTO-32B-Instruct-i1-GGUF
22957+
description: |
22958+
**Model Name:** Qwen3-VL-32B-Instruct (Text-Only Variant: Qwen3-VLTO-32B-Instruct)
22959+
**Base Model:** Qwen/Qwen3-VL-32B-Instruct
22960+
**Repository:** [mradermacher/Qwen3-VLTO-32B-Instruct-i1-GGUF](https://huggingface.co/mradermacher/Qwen3-VLTO-32B-Instruct-i1-GGUF)
22961+
**Type:** Large Language Model (LLM) – Text-Only (Vision-Language model stripped of vision components)
22962+
**Architecture:** Qwen3-VL, adapted for pure text generation
22963+
**Size:** 32 billion parameters
22964+
**License:** Apache 2.0
22965+
**Framework:** Hugging Face Transformers
22966+
22967+
---
22968+
22969+
### 🔍 **Description**
22970+
22971+
This is a **text-only variant** of the powerful **Qwen3-VL-32B-Instruct** multimodal model, stripped of its vision components to function as a high-performance pure language model. The model retains the full text understanding and generation capabilities of its parent — including strong reasoning, long-context handling (up to 32K+ tokens), and advanced multimodal training-derived coherence — while being optimized for text-only tasks.
22972+
22973+
It was created by loading the weights from the full Qwen3-VL-32B-Instruct model into a text-only Qwen3 architecture, preserving all linguistic and reasoning strengths without the need for image input.
22974+
22975+
Perfect for applications requiring deep reasoning, long-form content generation, code synthesis, and dialogue — with all the benefits of the Qwen3 series, now in a lightweight, text-focused form.
22976+
22977+
---
22978+
22979+
### 📌 Key Features
22980+
22981+
- ✅ **High-Performance Text Generation** – Built on top of the state-of-the-art Qwen3-VL architecture
22982+
- ✅ **Extended Context Length** – Supports up to 32,768 tokens (ideal for long documents and complex tasks)
22983+
- ✅ **Strong Reasoning & Planning** – Excels at logic, math, coding, and multi-step reasoning
22984+
- ✅ **Optimized for GGUF Format** – Available in multiple quantized versions (IQ3_M, Q2_K, etc.) for efficient inference on consumer hardware
22985+
- ✅ **Free to Use & Modify** – Apache 2.0 license
22986+
22987+
---
22988+
22989+
### 📦 Use Case Suggestions
22990+
22991+
- Long-form writing, summarization, and editing
22992+
- Code generation and debugging
22993+
- AI agents and task automation
22994+
- High-quality chat and dialogue systems
22995+
- Research and experimentation with large-scale LLMs on local devices
22996+
22997+
---
22998+
22999+
### 📚 References
23000+
23001+
- Original Model: [Qwen/Qwen3-VL-32B-Instruct](https://huggingface.co/Qwen/Qwen3-VL-32B-Instruct)
23002+
- Technical Report: [Qwen3 Technical Report (arXiv)](https://arxiv.org/abs/2505.09388)
23003+
- Quantization by: [mradermacher](https://huggingface.co/mradermacher)
23004+
23005+
> ✅ **Note**: The model shown here is **not the original vision-language model** — it's a **text-only conversion** of the Qwen3-VL-32B-Instruct model, ideal for pure language tasks.
23006+
overrides:
23007+
parameters:
23008+
model: Qwen3-VLTO-32B-Instruct.i1-Q4_K_S.gguf
23009+
files:
23010+
- filename: Qwen3-VLTO-32B-Instruct.i1-Q4_K_S.gguf
23011+
sha256: 789d55249614cd1acee1a23278133cd56ca898472259fa2261f77d65ed7f8367
23012+
uri: huggingface://mradermacher/Qwen3-VLTO-32B-Instruct-i1-GGUF/Qwen3-VLTO-32B-Instruct.i1-Q4_K_S.gguf

0 commit comments

Comments
 (0)