Skip to content

Commit 74e6e7b

Browse files
authored
Update DPO Max Seq Len (#2176)
1 parent de8b57c commit 74e6e7b

File tree

4 files changed

+4
-4
lines changed

4 files changed

+4
-4
lines changed

recipes/configs/llama2/7B_lora_dpo.yaml

+1-1
Original file line numberDiff line numberDiff line change
@@ -32,7 +32,7 @@ model:
3232
tokenizer:
3333
_component_: torchtune.models.llama2.llama2_tokenizer
3434
path: /tmp/Llama-2-7b-hf/tokenizer.model
35-
max_seq_len: 1024
35+
max_seq_len: 1024 # higher increases memory
3636

3737
checkpointer:
3838
_component_: torchtune.training.FullModelHFCheckpointer

recipes/configs/llama2/7B_lora_dpo_single_device.yaml

+1-1
Original file line numberDiff line numberDiff line change
@@ -31,7 +31,7 @@ model:
3131
tokenizer:
3232
_component_: torchtune.models.llama2.llama2_tokenizer
3333
path: /tmp/Llama-2-7b-hf/tokenizer.model
34-
max_seq_len: 1024
34+
max_seq_len: 1024 # higher increases memory
3535

3636
checkpointer:
3737
_component_: torchtune.training.FullModelHFCheckpointer

recipes/configs/llama3_1/8B_lora_dpo.yaml

+1-1
Original file line numberDiff line numberDiff line change
@@ -32,7 +32,7 @@ model:
3232
tokenizer:
3333
_component_: torchtune.models.llama3.llama3_tokenizer
3434
path: /tmp/Meta-Llama-3.1-8B-Instruct/original/tokenizer.model
35-
max_seq_len: null
35+
max_seq_len: 1024 # higher increases memory
3636

3737
checkpointer:
3838
_component_: torchtune.training.FullModelHFCheckpointer

recipes/configs/llama3_1/8B_lora_dpo_single_device.yaml

+1-1
Original file line numberDiff line numberDiff line change
@@ -31,7 +31,7 @@ model:
3131
tokenizer:
3232
_component_: torchtune.models.llama3.llama3_tokenizer
3333
path: /tmp/Meta-Llama-3.1-8B-Instruct/original/tokenizer.model
34-
max_seq_len: null
34+
max_seq_len: 1024 # higher increases memory
3535

3636
checkpointer:
3737
_component_: torchtune.training.FullModelHFCheckpointer

0 commit comments

Comments
 (0)