This repository was archived by the owner on Aug 7, 2024. It is now read-only.
Summary:
Added an option in test_fsdp.py to compile the FSDP model. With compilation enabled, the numerics check still passes.
Note, however, that compilation currently works only through a workaround and requires fullgraph to be False. The underlying issue still needs to be fixed.
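As a rough illustration of what such a compile option might look like, the sketch below gates `torch.compile` behind a flag and keeps `fullgraph=False`. The helper name `maybe_compile`, the plain `nn.Linear` stand-in for the FSDP-wrapped model, and the `backend="eager"` choice (used only so the sketch runs on CPU without a compiler toolchain) are assumptions for illustration, not the code in test_fsdp.py.

```python
import torch
import torch.nn as nn


def maybe_compile(module: nn.Module, compile_fsdp: bool, fullgraph: bool = False) -> nn.Module:
    """Hypothetical helper: optionally wrap a module with torch.compile."""
    if not compile_fsdp:
        # Compilation disabled: return the module untouched.
        return module
    # fullgraph=False lets Dynamo fall back to eager on graph breaks
    # instead of raising an error.
    # backend="eager" is an assumption made so this sketch runs anywhere.
    return torch.compile(module, fullgraph=fullgraph, backend="eager")


torch.manual_seed(0)
model = nn.Linear(8, 4)  # stand-in for the real FSDP-wrapped model
x = torch.randn(2, 8)

reference = model(x)
compiled = maybe_compile(model, compile_fsdp=True, fullgraph=False)
result = compiled(x)

# Simple numerics check: compiled and uncompiled outputs should agree.
print(torch.allclose(reference, result))
```

Keeping the flag in one helper makes it easy to switch fullgraph to True once the underlying issue is fixed.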
Running "./test/test_fsdp.sh" tests three settings:
1. Fp8 = False
2. Fp8 = True, Compile = False
3. Fp8 = True, Compile = True (with fullgraph = False)
For example:
```
$ ./test/test_fsdp.sh
launching IS_FP8 False, compile_fsdp False, fullgraph False
-------------------------------------------Mode: generate-------------------------------------------
Success: ✅
------------------------------------------Mode: single_gpu------------------------------------------
Success: ✅
---------------------------------------------Mode: fsdp---------------------------------------------
NCCL version 2.19.3+cuda12.1
-------------------------------------------Mode: analyze--------------------------------------------
output testing single_gpu vs FSDP success
state dict testing single_gpu vs FSDP success
Success: ✅
✅ All Tests Passed ✅
launching IS_FP8 True, compile_fsdp False, fullgraph False
-------------------------------------------Mode: generate-------------------------------------------
Success: ✅
------------------------------------------Mode: single_gpu------------------------------------------
Success: ✅
---------------------------------------------Mode: fsdp---------------------------------------------
NCCL version 2.19.3+cuda12.1
-------------------------------------------Mode: analyze--------------------------------------------
output testing single_gpu vs FSDP success
state dict testing single_gpu vs FSDP success
Success: ✅
✅ All Tests Passed ✅
launching IS_FP8 True, compile_fsdp True, fullgraph False
-------------------------------------------Mode: generate-------------------------------------------
Success: ✅
------------------------------------------Mode: single_gpu------------------------------------------
Success: ✅
---------------------------------------------Mode: fsdp---------------------------------------------
NCCL version 2.19.3+cuda12.1
[rank0]:[2023-12-15 14:49:02,616] [0/0] torch._dynamo.variables.torch: [WARNING] Profiler function <class 'torch.autograd.profiler.record_function'> will be ignored
[rank0]:[2023-12-15 14:49:02,618] [0/0] torch._dynamo.variables.torch: [WARNING] Profiler function <class 'torch.autograd.profiler.record_function'> will be ignored
[rank1]:[2023-12-15 14:49:02,706] [0/0] torch._dynamo.variables.torch: [WARNING] Profiler function <class 'torch.autograd.profiler.record_function'> will be ignored
[rank1]:[2023-12-15 14:49:02,708] [0/0] torch._dynamo.variables.torch: [WARNING] Profiler function <class 'torch.autograd.profiler.record_function'> will be ignored
-------------------------------------------Mode: analyze--------------------------------------------
output testing single_gpu vs FSDP success
state dict testing single_gpu vs FSDP success
Success: ✅
✅ All Tests Passed ✅
```
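The three settings exercised in the log above can be summarized as a small table of flags. The following is a minimal sketch, not the actual test script: the `Setting` type and `launch_banner` helper are hypothetical names, and only the banner format is taken from the log output.

```python
from typing import NamedTuple


class Setting(NamedTuple):
    """One test configuration (hypothetical type for illustration)."""
    is_fp8: bool
    compile_fsdp: bool
    fullgraph: bool


# The three configurations listed in the summary; fullgraph stays False for all.
SETTINGS = [
    Setting(is_fp8=False, compile_fsdp=False, fullgraph=False),
    Setting(is_fp8=True, compile_fsdp=False, fullgraph=False),
    Setting(is_fp8=True, compile_fsdp=True, fullgraph=False),
]


def launch_banner(s: Setting) -> str:
    # Mirrors the "launching ..." lines printed in the log above.
    return (
        f"launching IS_FP8 {s.is_fp8}, "
        f"compile_fsdp {s.compile_fsdp}, fullgraph {s.fullgraph}"
    )


for setting in SETTINGS:
    print(launch_banner(setting))
```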
Pull Request resolved: #160
Reviewed By: vkuzo
Differential Revision: D52224302
Pulled By: y-sq
fbshipit-source-id: 4c29479771f4cd100b8c5a9549d321eb13b49739