allenai / OLMo Public

Issues: allenai/OLMo

52 Open 151 Closed

Issues list

How to train the tinymodel(Like 300M or 150M) type/question

An issue that's a question

#759 opened Dec 3, 2024 by yongding-tao

Question about the OLMo2 Stage 2 training procedures: was the optimizer state from Stage 1 used during the training of Stage 2? type/question

#758 opened Nov 29, 2024 by Taoer1996

About eos_token_id in config file (20M, 1B) type/question

#757 opened Nov 29, 2024 by lllabmaster

OLMo-2 held-out validation data type/question

#755 opened Nov 27, 2024 by chawins

Difference between 0724 and 0424 7B models type/documentation

An issue or pull request related to documentation

#746 opened Nov 13, 2024 by jiahai-feng

TypeError - running example code type/bug

An issue about a bug

#743 opened Nov 3, 2024 by KPK101

Fail to load tokenizer for checkpoints type/bug

#741 opened Oct 24, 2024 by tresiwald

Error Encountered During Multi-Node Pretraining with Torchrun type/bug

#737 opened Oct 21, 2024 by Zehui127

Missing OLMo checkpoints

#726 opened Oct 3, 2024 by mirandrom

8-bit allgather support type/question

#722 opened Sep 19, 2024 by yaroslavvb

Expected Data Format type/question

#715 opened Aug 27, 2024 by aflah02

Which mmlu validation setting is recommend? type/question

#714 opened Aug 27, 2024 by mathfinder

[Quick question]: How do I turn off FSDP? type/question

#703 opened Aug 15, 2024 by candygocandy

RuntimeError: Triton Error [CUDA]: invalid device context type/bug

#700 opened Aug 13, 2024 by andymvp2018

slurm script for: configs/official/OLMo-7B.yaml type/question

#699 opened Aug 13, 2024 by andymvp2018

Gflops computation is faulty for FSDP due to bug in OLMo.num_params()

#695 opened Aug 7, 2024 by AkshitaB

why CrossEntropyLoss is zero,i type/question

#692 opened Aug 6, 2024 by aizhweiwei

Olmo 0724 -hf checkpoints don't load the proper config when instantiating with OLMoForCausalLM type/bug

#689 opened Aug 5, 2024 by sarahwie

Model ladder has no documentation type/documentation

#683 opened Jul 31, 2024 by IanMagnusson

mlp_ratio not adjusted in config if mlp_hidden_size is set type/bug

#673 opened Jul 21, 2024 by Muennighoff

Does global_train_batch_size support gradient accumulation? type/question

#672 opened Jul 21, 2024 by jinzhuoran

Is there explicitly instruction-following data in the version of Dolma used to train v1? type/question

#658 opened Jul 15, 2024 by john-hewitt

Can long text be splitted into short texts? type/question

#655 opened Jul 12, 2024 by CoinCheung

Cannot convert internal OLMo checkpoint to HF type/bug

#654 opened Jul 11, 2024 by viking-sudo-rm

start_index not getting reset in data loader when moving to new epoch type/bug

#650 opened Jul 10, 2024 by leon-g-xu
