Exporting Keras Llama Checkpoint to HF #2062

salrowili · 2025-01-29T08:30:58Z

Hi,

As Llama 3 is a popular model, it would be great to have a script that exports a Llama Keras checkpoint to HF. Such a script already exists for Gemma: https://github.com/keras-team/keras-hub/blob/8ca2076d651532f7270ffa2beba44d377b100bbf/tools/gemma/export_gemma_to_hf.py

Something similar for Llama would be great!
Thanks

Gopi-Uppari · 2025-01-30T06:47:41Z

Hi,

Noted it.

We can use the keras_hub library to easily integrate Keras models with Hugging Face. Here is how to export a Keras Llama model to Hugging Face:

Load a pre-trained Llama model from the Hugging Face Hub using keras_hub.
Save the fine-tuned model locally.
Upload the model to the Hugging Face Hub for easy sharing and future use.

For more details, please check the reference provided.

Thank you.

salrowili · 2025-01-30T07:12:08Z

Hi @Gopi-Uppari ,
Thank you for taking care of this.

Yes, I have used Keras to load my custom model using keras_hub.models.Llama3CausalLM.from_preset("./local_folder") and it works pretty well. However, the main reason I am asking for this feature is that the built-in generate method in keras_hub gives me few options for things like terminators, top_k, sampling, and temperature. I had to hard-code them.

Gopi-Uppari · 2025-02-06T09:10:54Z

Hi @salrowili,

Okay, could you please confirm whether the above comment resolves this issue for you? Please feel free to close the issue if it is resolved.

Thank you.

salrowili · 2025-02-06T09:42:46Z

Hi @Gopi-Uppari ,
The issue has not been resolved, because we still need a script that converts a Keras checkpoint to HF format for Llama. The solution you suggested, and what keras_hub.models.Llama3CausalLM.from_preset does, is the opposite: it loads an HF model into a Keras-compatible checkpoint.

sachinprasadhs · 2025-02-07T19:39:09Z

@salrowili, if you are asking about the weight files for Llama in HF, here is the link: https://huggingface.co/keras/llama3_8b_en_int8/tree/main

salrowili · 2025-02-07T19:50:46Z

Hi @sachinprasadhs ,
Thank you for trying to help. To illustrate the issue I am facing, take this example.
I have used keras_hub to fine-tune Llama 3 8B on my custom SFT data. Now I want to convert the fine-tuned LLM to HF so I can use it in production with the HF library. So what I am looking for is a script that can convert the fine-tuned Keras checkpoint to HF format. A similar script exists for Gemma 2 [ tools/gemma/export_gemma_to_hf.py ] but not for Llama.

sachinprasadhs · 2025-02-12T23:05:33Z

Understood, thanks for clarifying. As of now we don't have a conversion script from Keras to Hugging Face for Llama. Instead, we have a script that converts a Hugging Face checkpoint to a Keras checkpoint here: https://github.com/keras-team/keras-hub/blob/master/tools/checkpoint_conversion/convert_llama_checkpoints.py
You can refer to the above script and create one for Keras to Hugging Face.
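
As a starting point for such a script, here is a minimal sketch of the reverse weight mapping for one attention block, written with plain NumPy. It is not the keras_hub implementation. The kernel layouts are assumptions: the Keras query/key/value kernels are taken to have shape (hidden_dim, num_heads, head_dim) and the output kernel (num_heads, head_dim, hidden_dim), while the HF weights are 2-D (out_features, in_features). The function names are hypothetical. Please check the layouts against the conversion script linked above before relying on this.

```python
import numpy as np


def keras_qkv_kernel_to_hf(kernel):
    """Map an assumed Keras (hidden_dim, num_heads, head_dim) kernel
    to an HF-style (num_heads * head_dim, hidden_dim) weight."""
    hidden_dim = kernel.shape[0]
    # Merge the head axes, then transpose to (out_features, in_features).
    return np.ascontiguousarray(kernel.reshape(hidden_dim, -1).T)


def keras_out_kernel_to_hf(kernel):
    """Map an assumed Keras (num_heads, head_dim, hidden_dim) kernel
    to an HF-style (hidden_dim, num_heads * head_dim) weight."""
    hidden_dim = kernel.shape[-1]
    return np.ascontiguousarray(kernel.reshape(-1, hidden_dim).T)


def hf_attention_weights(layer_index, query, key, value, out):
    """Build the HF state-dict entries for one attention block.

    The arguments are NumPy arrays already read from the Keras model;
    how they are read is left out of this sketch.
    """
    prefix = f"model.layers.{layer_index}.self_attn"
    return {
        f"{prefix}.q_proj.weight": keras_qkv_kernel_to_hf(query),
        f"{prefix}.k_proj.weight": keras_qkv_kernel_to_hf(key),
        f"{prefix}.v_proj.weight": keras_qkv_kernel_to_hf(value),
        f"{prefix}.o_proj.weight": keras_out_kernel_to_hf(out),
    }
```

A full export would also need to map the embeddings, MLP kernels, and norm scales, and to write a matching HF config and tokenizer; those parts are not covered here.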

github-actions bot added the Gemma (Gemma model specific issues) label Jan 29, 2025

sachinprasadhs self-assigned this Feb 12, 2025

sachinprasadhs added the stat:awaiting response from contributor label Feb 12, 2025
