Baseline model testing #14
GGML format: https://huggingface.co/aisuko/gpt-2-1.5B-ggml/tree/main. This is a demo model that aims to show CPU-accelerated GPT-2-xl inference with ggml. Update: it loads the model into memory as fp32, which is not what we want, so we are giving up on using Hugging Face Transformers and its ecosystem for inference.

Llama.cpp: the original llama.cpp (gguf) already supports the GPT and Phi series; here are GPT-2 117M and 1.5B. For the quantization methods in llama.cpp, see the model repos. For the reason why we use an Instruct model, see Model Training Techniques and Hugging Face transformers with gguf. A minimal inference sketch follows after the list below.

Related issues:
- RLHF notebooks
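As a concrete illustration of the llama.cpp route, here is a minimal sketch of CPU inference over a GGUF model via the llama-cpp-python bindings. The GGUF file name is a hypothetical local quantized GPT-2 export; any GGUF file from a supported architecture loads the same way.

```python
# Minimal CPU inference sketch using llama-cpp-python (Python bindings
# for llama.cpp). The GGUF file name below is hypothetical.
from llama_cpp import Llama

llm = Llama(
    model_path="gpt2-1.5b-q4_k_m.gguf",  # hypothetical quantized GPT-2 export
    n_ctx=1024,    # context window size
    n_threads=8,   # number of CPU threads to use
)

out = llm("GPT-2 is a language model that", max_tokens=32)
print(out["choices"][0]["text"])
```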
We want to run the model on consumer-grade hardware, which means we will support different acceleration methods on CPU and GPU:
- For CPU, ggml is the best choice.
- For GPU, we support distributed inference using Hugging Face libraries (see the sketch after this list).
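A minimal sketch of what GPU distribution via the Hugging Face libraries could look like, assuming transformers plus accelerate: `device_map="auto"` lets accelerate shard the weights across whatever GPUs (and CPU RAM) are available. Here `gpt2-xl` is a stand-in for whichever baseline model we end up choosing.

```python
# Sketch: sharded GPU loading with transformers + accelerate.
# "gpt2-xl" is a stand-in for the chosen baseline model.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2-xl")
model = AutoModelForCausalLM.from_pretrained(
    "gpt2-xl",
    device_map="auto",          # accelerate places layers on available devices
    torch_dtype=torch.float16,  # half precision to fit consumer GPUs
)

inputs = tokenizer("GPT-2 is", return_tensors="pt").to(model.device)
print(tokenizer.decode(model.generate(**inputs, max_new_tokens=32)[0]))
```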
We also want the model to be as small as possible, so we should choose a baseline model. And since Hugging Face transformers supports gguf (the file format of ggml), we have a number of choices below:
The reason we choose a baseline model larger than 1B is to make sure the model's outputs are reasonable and useful. I will upload some notebooks or model test results to support this later.
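Here is a minimal sketch of the transformers gguf support mentioned above. Note that, consistent with the earlier comment in this thread, transformers dequantizes the GGUF weights to full precision on load. The repo id and file name are hypothetical placeholders.

```python
# Sketch: loading a GGUF checkpoint through transformers.
# The repo id and gguf file name are hypothetical placeholders.
from transformers import AutoModelForCausalLM, AutoTokenizer

repo_id = "some-user/some-model-gguf"  # hypothetical Hub repo
gguf_file = "model-q4_k_m.gguf"        # hypothetical quantized file

tokenizer = AutoTokenizer.from_pretrained(repo_id, gguf_file=gguf_file)
# Weights are dequantized to full precision (fp32) when loaded this way.
model = AutoModelForCausalLM.from_pretrained(repo_id, gguf_file=gguf_file)
```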
@Micost