Issues: allenai/OLMo
Issues list
How to train the tinymodel(Like 300M or 150M)
type/question
An issue that's a question
#759
opened Dec 3, 2024 by
yongding-tao
Question about the OLMo2 Stage 2 training procedures: was the optimizer state from Stage 1 used during the training of Stage 2?
type/question
An issue that's a question
#758
opened Nov 29, 2024 by
Taoer1996
About eos_token_id in config file (20M, 1B)
type/question
An issue that's a question
#757
opened Nov 29, 2024 by
lllabmaster
OLMo-2 held-out validation data
type/question
An issue that's a question
#755
opened Nov 27, 2024 by
chawins
Difference between 0724 and 0424 7B models
type/documentation
An issue or pull request related to documentation
#746
opened Nov 13, 2024 by
jiahai-feng
Fail to load tokenizer for checkpoints
type/bug
An issue about a bug
#741
opened Oct 24, 2024 by
tresiwald
Error Encountered During Multi-Node Pretraining with Torchrun
type/bug
An issue about a bug
#737
opened Oct 21, 2024 by
Zehui127
8-bit allgather support
type/question
An issue that's a question
#722
opened Sep 19, 2024 by
yaroslavvb
Which mmlu validation setting is recommend?
type/question
An issue that's a question
#714
opened Aug 27, 2024 by
mathfinder
[Quick question]: How do I turn off FSDP?
type/question
An issue that's a question
#703
opened Aug 15, 2024 by
candygocandy
RuntimeError: Triton Error [CUDA]: invalid device context
type/bug
An issue about a bug
#700
opened Aug 13, 2024 by
andymvp2018
slurm script for: configs/official/OLMo-7B.yaml
type/question
An issue that's a question
#699
opened Aug 13, 2024 by
andymvp2018
Gflops computation is faulty for FSDP due to bug in OLMo.num_params()
#695
opened Aug 7, 2024 by
AkshitaB
why CrossEntropyLoss is zero,i
type/question
An issue that's a question
#692
opened Aug 6, 2024 by
aizhweiwei
Olmo 0724 -hf checkpoints don't load the proper config when instantiating with OLMoForCausalLM
type/bug
An issue about a bug
#689
opened Aug 5, 2024 by
sarahwie
Model ladder has no documentation
type/documentation
An issue or pull request related to documentation
#683
opened Jul 31, 2024 by
IanMagnusson
mlp_ratio not adjusted in config if mlp_hidden_size is set
type/bug
An issue about a bug
#673
opened Jul 21, 2024 by
Muennighoff
Does global_train_batch_size support gradient accumulation?
type/question
An issue that's a question
#672
opened Jul 21, 2024 by
jinzhuoran
Is there explicitly instruction-following data in the version of Dolma used to train v1?
type/question
An issue that's a question
#658
opened Jul 15, 2024 by
john-hewitt
Can long text be splitted into short texts?
type/question
An issue that's a question
#655
opened Jul 12, 2024 by
CoinCheung
Cannot convert internal OLMo checkpoint to HF
type/bug
An issue about a bug
#654
opened Jul 11, 2024 by
viking-sudo-rm
start_index not getting reset in data loader when moving to new epoch
type/bug
An issue about a bug
#650
opened Jul 10, 2024 by
leon-g-xu