Megatron-FSDP: Fix insufficient double buffers during gradient reduce#4054
Merged
cspades merged 3 commits intoNVIDIA:mainfrom Apr 2, 2026
Merged
Megatron-FSDP: Fix insufficient double buffers during gradient reduce#4054cspades merged 3 commits intoNVIDIA:mainfrom
cspades merged 3 commits intoNVIDIA:mainfrom