Hi,
This project is great, but I can't understand why it works with Ollama models and not with GGUF, the current widespread quantization standard. Ollama doesn't accept GGUF files directly; it requires reformatting models into its own format, so it's quite niche on its own and supports the smallest number of models (mostly the few popular ones), while virtually every other published model is quantized as GGUF. (It's also easy to produce GGUF from the Hugging Face format with llama.cpp itself, or with AUTOGGUF.)

Ollama's library also offers a very limited variety of quality levels; you won't find many models there in the best, near-lossless Q8 quality.

My model isn't in the Ollama model list, and I really don't want to reformat a 30-billion-parameter GGUF into Ollama's format: in my tests that's a huge waste of time and requires a huge amount of storage space (not counting RAM).
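For reference, a minimal sketch of that conversion, assuming a recent llama.cpp checkout (script and binary names have varied across llama.cpp versions, and the model paths here are placeholders):

```sh
# Convert a Hugging Face checkpoint to GGUF (fp16 intermediate),
# then quantize to Q8_0 with llama.cpp's quantizer.
# Paths are placeholders; adjust to your model.
python convert_hf_to_gguf.py /path/to/hf-model --outfile model-f16.gguf --outtype f16
./llama-quantize model-f16.gguf model-q8_0.gguf Q8_0
```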
Thanks.