Mistral 3 large #17699
Hello, Mistral open-sourced Mistral 3 Large (among other models). Its size is quite a challenge: I guess that even at Q4 it will need at least 4-8 H100s (on the other hand, 1-3 B300s should do?), but it's currently a very unique model. Any chance of llama.cpp support? I mean, Nvidia has already foreseen a row for llama.cpp in the compatibility table: https://developer.nvidia.com/blog/nvidia-accelerated-mistral-3-open-models-deliver-efficiency-accuracy-at-any-scale/ 😉 Thanks a lot. Cheers
Replies: 1 comment
It seems like it's essentially the DeepSeekV3 architecture, with different tensor names and vision support. I don't think it would necessarily be hard to add. vLLM implementation: vllm-project/vllm#29757
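If the reply above is right that the main difference from DeepSeekV3 is the tensor naming, then much of a llama.cpp conversion would amount to renaming checkpoint tensors to the names the existing converter already understands. A minimal sketch of that idea in Python (the specific name patterns below are hypothetical placeholders for illustration, not the actual Mistral 3 checkpoint or GGUF names):

```python
import re

# Hypothetical source -> target patterns, checked in order. The real mapping
# would come from inspecting the Mistral 3 checkpoint and llama.cpp's
# existing DeepSeekV3 conversion code.
TENSOR_NAME_MAP = [
    (re.compile(r"^vision\."), None),  # e.g. route vision tensors separately
    (re.compile(r"^model\.layers\.(\d+)\.mlp\.experts\.(\d+)\."),
     r"blk.\1.ffn_expert.\2."),
    (re.compile(r"^model\.layers\.(\d+)\."), r"blk.\1."),
    (re.compile(r"^model\.embed_tokens\."), "token_embd."),
]

def remap_tensor_name(name: str):
    """Return the renamed tensor name, or None if it should be skipped."""
    for pattern, replacement in TENSOR_NAME_MAP:
        if pattern.match(name):
            if replacement is None:
                return None
            return pattern.sub(replacement, name)
    return name  # pass unmapped names through unchanged
```

For example, `remap_tensor_name("model.layers.3.mlp.experts.7.w1.weight")` would yield `"blk.3.ffn_expert.7.w1.weight"` under these made-up patterns. The remaining work (MLA attention, MoE routing, vision tower) would presumably reuse the DeepSeekV3 code paths already in llama.cpp.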