-
Notifications
You must be signed in to change notification settings - Fork 3.8k
Pull requests: NVIDIA/Megatron-LM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
add permute fusion into hybrid ep
complexity: low
#4089
opened Apr 1, 2026 by
Autumn1998
Loading…
5 tasks
Refactor BackendSpecProvider to use Protocols to define the types it returns
community-request
#4087
opened Apr 1, 2026 by
nschank
Loading…
2 of 5 tasks
Support CP (no sequence packing)
community-request
#4086
opened Mar 31, 2026 by
ankisinha-nvidia
•
Draft
5 tasks
Set tensor-parallel attributes irrespective of perform_initialization
complexity: low
Final Review
PR is in the "final review" stage
Fix THD RoPE offsets for local packed shards
community-request
Final Review
PR is in the "final review" stage
Add permute/unpermute fusion with dispatch/combine in Hybrid-EP
#4073
opened Mar 31, 2026 by
Autumn1998
Loading…
5 tasks
Preserve type of decorated methods/classes
community-request
Final Review
PR is in the "final review" stage
#4062
opened Mar 30, 2026 by
nschank
Loading…
2 of 5 tasks
Enable NullTokenizer for pretraining to reduce I/O access
complexity: low
Final Review
PR is in the "final review" stage
Run functional tests
Megatron-FSDP: Fix insufficient double buffers during gradient reduce
complexity: low
Final Review
PR is in the "final review" stage
module: megatron-fsdp
#4054
opened Mar 30, 2026 by
shjwudp
Loading…
5 tasks
Add Dual Chunk Attention (DCA) for long-context training
community-request
#4048
opened Mar 29, 2026 by
Ternura143
•
Draft
5 of 10 tasks
fix: wait for async P2P send before deallocating output tensor
#4047
opened Mar 28, 2026 by
ZhiyuLi-Nvidia
•
Draft
5 tasks
Fix HybridDeviceOptimizer KeyError after mixed-precision param replacement
community-request
Final Review
PR is in the "final review" stage
#4046
opened Mar 28, 2026 by
ma-ben
Loading…
Docs: improve docstrings and comments in example training loop
community-request
#4041
opened Mar 27, 2026 by
DhineshPonnarasan
•
Draft
Previous Next
ProTip!
Updated in the last three days: updated:>2026-03-29.