[REQUEST] Support for Hunyuan-A13B-Instruct

### Problem

Tencent released this new model:
https://huggingface.co/tencent/Hunyuan-A13B-Instruct

Architecture: HunYuanMoEV1ForCausalLM

### Solution

Add support for the new MoE architecture

### Alternatives

_No response_

### Explanation

It matches bigger models on benchmarks. It has a decent size to run locally, plus if it can be fit fully in VRAM the MoE architecture should make it pretty fast. 
It has 256K context too.

### Examples

_No response_

### Additional context

_No response_

### Acknowledgements

- [x] I have looked for similar requests before submitting this one.
- [x] I understand that the developers have lives and my issue will be answered when possible.
- [x] I understand the developers of this program are human, and I will make my requests politely.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[REQUEST] Support for Hunyuan-A13B-Instruct #798

Problem

Solution

Alternatives

Explanation

Examples

Additional context

Acknowledgements

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

[REQUEST] Support for Hunyuan-A13B-Instruct #798

Description

Problem

Solution

Alternatives

Explanation

Examples

Additional context

Acknowledgements

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions