Description
Problem
Tencent released this new model:
https://huggingface.co/tencent/Hunyuan-A13B-Instruct
Architecture: HunYuanMoEV1ForCausalLM
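For anyone who wants to confirm which architecture a downloaded checkpoint declares, here is a minimal sketch. It assumes the model folder contains a Hugging Face style `config.json` with an `architectures` list; the helper names `read_architectures` and `is_hunyuan_moe` are hypothetical and not part of any existing library.

```python
import json
from pathlib import Path


def read_architectures(config_path):
    """Return the `architectures` list from a Hugging Face style config.json.

    Returns an empty list if the key is missing.
    """
    config = json.loads(Path(config_path).read_text(encoding="utf-8"))
    return config.get("architectures", [])


def is_hunyuan_moe(config_path):
    """Check whether the config declares the architecture named in this issue."""
    return "HunYuanMoEV1ForCausalLM" in read_architectures(config_path)
```

Calling `is_hunyuan_moe("path/to/model/config.json")` on a local copy of the model should show whether a loader would need the new architecture; the path here is only a placeholder.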
Solution
Add support for the new MoE architecture (HunYuanMoEV1ForCausalLM).
Alternatives
No response
Explanation
It matches larger models on benchmarks. Its size is reasonable for running locally, and if it fits entirely in VRAM, the MoE architecture should make it fairly fast.
It also supports a 256K context.
Examples
No response
Additional context
No response
Acknowledgements
- I have looked for similar requests before submitting this one.
- I understand that the developers have lives and my issue will be answered when possible.
- I understand the developers of this program are human, and I will make my requests politely.
Originalimoc