Does the codebase support 8-bit training, similar to the peft library?
I was trying to fine-tune llama2-7b on 24 GB 4090 cards. Below is the error I got:

File "/home/nlp/JORA/examples/train.py", line 14, in
main()
File "/home/nlp/JORA/examples/train.py", line 10, in main
train_lora(config, dataset, 'checkpoints')
File "/home/nlp/JORA/jora/common.py", line 246, in train_lora
lora_params, opt_state, total_loss, loss, key = train_step_lora(lora_params, loraConfig, params, opt_state, total_loss, data_batch, key)
jaxlib.xla_extension.XlaRuntimeError: RESOURCE_EXHAUSTED: Out of memory while trying to allocate 16320586680 bytes.
BufferAssignment OOM Debugging.
BufferAssignment stats:
parameter allocation: 3.58GiB
constant allocation: 1.95MiB
maybe_live_out allocation: 264.00MiB
preallocated temp allocation: 15.20GiB
total allocation: 19.04GiB
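
For reference, the 16,320,586,680-byte allocation that fails is the 15.20 GiB preallocated temp buffer from the stats above:

```python
# 16,320,586,680 bytes converted to GiB matches the "preallocated temp allocation" line
print(16320586680 / 1024**3)  # ~15.2
```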
For now it uses bfloat16, the main reason being that there is no bitsandbytes equivalent for JAX yet. However, there is also some potential for adding 8-bit support through TransformerEngine.
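
In the meantime, here is a minimal sketch of what can help on a 24 GB card, assuming the OOM comes from XLA's default preallocation plus full-precision temporaries. None of this is JORA-specific API: the environment variables are standard JAX/XLA settings, and `to_bfloat16` is a hypothetical helper mirroring the bfloat16 cast the library reportedly already does internally.

```python
import os

# Standard JAX/XLA memory knobs; they must be set before importing jax.
# Disabling preallocation stops XLA from grabbing ~75% of the GPU up front,
# and the fraction caps how much of the 24 GB pool it may use.
os.environ["XLA_PYTHON_CLIENT_PREALLOCATE"] = "false"
os.environ["XLA_PYTHON_CLIENT_MEM_FRACTION"] = "0.85"

import jax
import jax.numpy as jnp

def to_bfloat16(params):
    """Cast every floating-point leaf of a parameter pytree to bfloat16,
    halving memory relative to float32 while leaving integer leaves intact."""
    def cast(x):
        return x.astype(jnp.bfloat16) if jnp.issubdtype(x.dtype, jnp.floating) else x
    return jax.tree_util.tree_map(cast, params)
```

Beyond that, reducing the per-device batch size (and sequence length, if configurable) is usually the other lever; true 8-bit weights would need something like TransformerEngine, as noted above.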