|
22950 | 22950 | - filename: apollo-astralis-4b.i1-Q4_K_M.gguf |
22951 | 22951 | sha256: 94e1d371420b03710fc7de030c1c06e75a356d9388210a134ee2adb4792a2626 |
22952 | 22952 | uri: huggingface://mradermacher/apollo-astralis-4b-i1-GGUF/apollo-astralis-4b.i1-Q4_K_M.gguf |
| 22953 | +- !!merge <<: *qwen3 |
| 22954 | + name: "qwen3-vlto-32b-instruct-i1" |
| 22955 | + urls: |
| 22956 | + - https://huggingface.co/mradermacher/Qwen3-VLTO-32B-Instruct-i1-GGUF |
| 22957 | + description: | |
| 22958 | + **Model Name:** Qwen3-VL-32B-Instruct (Text-Only Variant: Qwen3-VLTO-32B-Instruct) |
| 22959 | + **Base Model:** Qwen/Qwen3-VL-32B-Instruct |
| 22960 | + **Repository:** [mradermacher/Qwen3-VLTO-32B-Instruct-i1-GGUF](https://huggingface.co/mradermacher/Qwen3-VLTO-32B-Instruct-i1-GGUF) |
| 22961 | + **Type:** Large Language Model (LLM) – Text-Only (Vision-Language model stripped of vision components) |
| 22962 | + **Architecture:** Qwen3-VL, adapted for pure text generation |
| 22963 | + **Size:** 32 billion parameters |
| 22964 | + **License:** Apache 2.0 |
| 22965 | + **Framework:** Hugging Face Transformers |
| 22966 | + |
| 22967 | + --- |
| 22968 | + |
| 22969 | + ### 🔍 **Description** |
| 22970 | + |
| 22971 | + This is a **text-only variant** of the powerful **Qwen3-VL-32B-Instruct** multimodal model, stripped of its vision components to function as a high-performance pure language model. The model retains the full text understanding and generation capabilities of its parent — including strong reasoning, long-context handling (up to 32K+ tokens), and advanced multimodal training-derived coherence — while being optimized for text-only tasks. |
| 22972 | + |
| 22973 | + It was created by loading the weights from the full Qwen3-VL-32B-Instruct model into a text-only Qwen3 architecture, preserving all linguistic and reasoning strengths without the need for image input. |
| 22974 | + |
| 22975 | + Perfect for applications requiring deep reasoning, long-form content generation, code synthesis, and dialogue — with all the benefits of the Qwen3 series, now in a lightweight, text-focused form. |
| 22976 | + |
| 22977 | + --- |
| 22978 | + |
| 22979 | + ### 📌 Key Features |
| 22980 | + |
| 22981 | + - ✅ **High-Performance Text Generation** – Built on top of the state-of-the-art Qwen3-VL architecture |
| 22982 | + - ✅ **Extended Context Length** – Supports up to 32,768 tokens (ideal for long documents and complex tasks) |
| 22983 | + - ✅ **Strong Reasoning & Planning** – Excels at logic, math, coding, and multi-step reasoning |
| 22984 | + - ✅ **Optimized for GGUF Format** – Available in multiple quantized versions (IQ3_M, Q2_K, etc.) for efficient inference on consumer hardware |
| 22985 | + - ✅ **Free to Use & Modify** – Apache 2.0 license |
| 22986 | + |
| 22987 | + --- |
| 22988 | + |
| 22989 | + ### 📦 Use Case Suggestions |
| 22990 | + |
| 22991 | + - Long-form writing, summarization, and editing |
| 22992 | + - Code generation and debugging |
| 22993 | + - AI agents and task automation |
| 22994 | + - High-quality chat and dialogue systems |
| 22995 | + - Research and experimentation with large-scale LLMs on local devices |
| 22996 | + |
| 22997 | + --- |
| 22998 | + |
| 22999 | + ### 📚 References |
| 23000 | + |
| 23001 | + - Original Model: [Qwen/Qwen3-VL-32B-Instruct](https://huggingface.co/Qwen/Qwen3-VL-32B-Instruct) |
| 23002 | + - Technical Report: [Qwen3 Technical Report (arXiv)](https://arxiv.org/abs/2505.09388) |
| 23003 | + - Quantization by: [mradermacher](https://huggingface.co/mradermacher) |
| 23004 | + |
| 23005 | + > ✅ **Note**: The model shown here is **not the original vision-language model** — it's a **text-only conversion** of the Qwen3-VL-32B-Instruct model, ideal for pure language tasks. |
| 23006 | + overrides: |
| 23007 | + parameters: |
| 23008 | + model: Qwen3-VLTO-32B-Instruct.i1-Q4_K_S.gguf |
| 23009 | + files: |
| 23010 | + - filename: Qwen3-VLTO-32B-Instruct.i1-Q4_K_S.gguf |
| 23011 | + sha256: 789d55249614cd1acee1a23278133cd56ca898472259fa2261f77d65ed7f8367 |
| 23012 | + uri: huggingface://mradermacher/Qwen3-VLTO-32B-Instruct-i1-GGUF/Qwen3-VLTO-32B-Instruct.i1-Q4_K_S.gguf |
0 commit comments