Pull requests: pytorch/torchtitan
Pull requests list
Adding a simple optimizer registry.
CLA Signed
This label is managed by the Meta Open Source bot.
#876
opened Feb 21, 2025 by
balancap
Create logging directory in wandb logger
CLA Signed
#874
opened Feb 21, 2025 by
K-H-Ismail
Refactor Checkpointer
CLA Signed
#871
opened Feb 20, 2025 by
fegin
Show GC execution time
CLA Signed
#870
opened Feb 20, 2025 by
fegin
Configure arbitrary frozen modules via config
CLA Signed
#869
opened Feb 20, 2025 by
lkhphuc
add structure to torchtitan files
CLA Signed
#867
opened Feb 20, 2025 by
tianyu-l
Do full GC for checkpointing related GC calls
CLA Signed
[Reland] Add Dynamic Model Import and ModelSpec Definition (#837)
ci-no-td
CLA Signed
#854
opened Feb 18, 2025 by
fduwjj
[Not for landing] piggy back on titan for scale init test
CLA Signed
[NOT READY TO LAND] Integrate TorchFT
CLA Signed
#834
opened Feb 11, 2025 by
fegin
Add force_recompute_fp8_weight_in_bwd when FSDP
CLA Signed
#832
opened Feb 11, 2025 by
c0g
profile with modules and stack
CLA Signed
#829
opened Feb 10, 2025 by
carmocca
[not for land] add repro script for https://github.com/pytorch/torchtitan/pull/808
CLA Signed
#815
opened Feb 1, 2025 by
danielvegamyhre
Draft
add configuration for float8 with rowwise scaling, via recipe lookup
CLA Signed
#808
opened Jan 27, 2025 by
vkuzo
[cp] Add cudnn attention support to Context Parallel
CLA Signed
Make CheckpointManager friendlier to custom StorageWriter/StorageReader
CLA Signed
#789
opened Jan 12, 2025 by
dimdi-y
Register backward hook for the whole optim_dict to enable working at multi schedule pp
CLA Signed
[Not for land] Integrate float8nocompile, an experimental feature for high performance
CLA Signed
#778
opened Jan 7, 2025 by
danielvegamyhre
[PoC] Typed JobConfig
CLA Signed
#767
opened Jan 1, 2025 by
jaysonfrancis
[MoE][PoC] Expert Parallel: tp and tp2ep
CLA Signed