1. README and Code Structure Mismatch
- The README refers to `training/`, but the actual directory is `ds_training/`.
- `generate_all_train_datasets.sh` is mentioned in the README but does not exist. Instead, `generate_all_train_datasets_v1.sh` and `generate_all_train_datasets_v2.sh` are present.
2. Missing Dependency (sentencepiece)
- `LlamaTokenizer` requires `sentencepiece`, but it is not listed in `requirements.txt`.
- Running the code without it causes an `ImportError`.
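Until `sentencepiece` is added to `requirements.txt`, a small pre-flight check can surface the problem before the tokenizer is loaded. The following is only a sketch: the helper name `missing_modules` is hypothetical and not part of this repository, and it relies only on the standard-library `importlib.util.find_spec`.

```python
import importlib.util


def missing_modules(names):
    """Return the top-level module names in `names` that cannot be found.

    Hypothetical helper for illustration; it does not import the modules,
    it only asks the import system whether they can be located.
    """
    return [name for name in names if importlib.util.find_spec(name) is None]


# Check the dependency reported in this issue and print a hint if it is absent.
missing = missing_modules(["sentencepiece"])
if missing:
    print("Missing packages: " + ", ".join(missing))
    print("Try: pip install " + " ".join(missing))
```

The longer-term fix is still to list `sentencepiece` in `requirements.txt`, so that a normal install pulls it in.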
3. Default /output Directory Causes Permission denied
- Scripts attempt to write to `/output`, which requires root access.
- Users without root permissions encounter `Permission denied` errors.
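One possible way to avoid the hard-coded `/output` is to resolve the output directory at startup and fall back to a user-writable location. This is a minimal sketch under stated assumptions: the function `resolve_output_dir` and the `OUTPUT_DIR` environment variable are hypothetical names chosen for illustration, not something the repository currently defines.

```python
import os
from pathlib import Path


def resolve_output_dir(preferred="/output", env_var="OUTPUT_DIR"):
    """Return the first candidate directory that can be created and written to.

    Candidates, in order: the directory named by `env_var` (hypothetical
    variable, assumed for this sketch), the `preferred` path, and finally
    `./output` under the current working directory.
    """
    candidates = []
    if os.environ.get(env_var):
        candidates.append(Path(os.environ[env_var]))
    candidates.append(Path(preferred))
    candidates.append(Path.cwd() / "output")

    for path in candidates:
        try:
            path.mkdir(parents=True, exist_ok=True)
        except OSError:
            # Includes PermissionError, e.g. creating /output without root.
            continue
        if os.access(path, os.W_OK):
            return path
    raise PermissionError("No writable output directory found")
```

With something like this in place, a user without root access could run `OUTPUT_DIR=$HOME/runs python <script>` or simply rely on the `./output` fallback, while existing setups that do have a writable `/output` would keep working unchanged.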
These issues prevent users from running the code without modifications.