-
Notifications
You must be signed in to change notification settings - Fork 2.3k
Pull requests: NVIDIA/TensorRT-LLM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
feat: Add PRESHARDED LoadFormat for zero-disk P2P RDMA weight loading
#12898
opened Apr 9, 2026 by
KavinKrishnan
Loading…
[None][feat] Enable mamba/linear attention cache reuse in scheduler
#12896
opened Apr 9, 2026 by
VALLIS-NERIA
Collaborator
•
Draft
3 tasks
[None][perf] Use sliding-64 batch sizes for padding-enabled CUDA graphs
#12895
opened Apr 9, 2026 by
yijingl-nvidia
Collaborator
Loading…
1 task
Revert "[None][infra] Bump black and tornado (#12876)"
#12894
opened Apr 9, 2026 by
niukuo
Collaborator
Loading…
1 task
Fix missing disagg_request_id fallback in context responses
Community want to contribute
PRs initiated from Community
#12893
opened Apr 9, 2026 by
pich4ya
Loading…
[None][fix] Batch addSequence with pre-claim to fix host offloading M…
#12892
opened Apr 9, 2026 by
liji-nv
Collaborator
Loading…
1 task done
[None][infra] update docker images
#12891
opened Apr 9, 2026 by
niukuo
Collaborator
Loading…
1 task done
[None][fix] Update moe hidden_size in communicator for nemotron-h
#12890
opened Apr 9, 2026 by
Wanli-Jiang
Collaborator
Loading…
1 task done
[https://nvbugs/6050489][fix] fix agg pp4 hang issue
#12888
opened Apr 9, 2026 by
bo-nv
Collaborator
Loading…
1 task
[None][feat] Add benchmark for all allreduce backend
#12887
opened Apr 9, 2026 by
yilin-void
Collaborator
Loading…
1 task done
[https://nvbugs/6055474][test] Fix RTX-6000 with wrong moe backend
#12886
opened Apr 9, 2026 by
yufeiwu-nv
Collaborator
Loading…
1 task done
[None][test] Enable test for kv_cache_manager_v2 for A10
#12885
opened Apr 9, 2026 by
lowsfer
Member
Loading…
1 task done
[TRTLLM-11585][feat] Add CUTEDSL moe backend for nemotron-h
#12884
opened Apr 9, 2026 by
Wanli-Jiang
Collaborator
•
Draft
1 task done
[TRTLLM-11878][feat] Gen-only sync KV transfer for dis-agg
#12882
opened Apr 9, 2026 by
Shixiaowei02
Collaborator
Loading…
1 task done
[TRTLLM-11861][infra] Support wildcard in bot stage-list/extra-stage commands
#12881
opened Apr 9, 2026 by
mzweilz
Collaborator
Loading…
5 tasks
[None][fix] Batch addSequence with pre-claim to fix host offloading M…
#12878
opened Apr 9, 2026 by
liji-nv
Collaborator
Loading…
1 task done
[None][feat]: support different head_dim for different level with kvc…
#12871
opened Apr 9, 2026 by
WeiHaocheng
Collaborator
•
Draft
1 task
[None][fix] Fix contrained decoding for GLM5
#12869
opened Apr 9, 2026 by
cascade812
Collaborator
Loading…
1 task done
[None][fix] Guard CUDA event elapsed_time in perf_metrics_manager to prevent executor crash
Community want to contribute
PRs initiated from Community
#12868
opened Apr 9, 2026 by
yifjiang
Contributor
Loading…
2 tasks done
[None][infra] Cherry pick fix for 1.2.0rc6.post4
#12867
opened Apr 9, 2026 by
yuanjingx87
Collaborator
Loading…
1 task done
Previous Next
ProTip!
What’s not been updated in a month: updated:<2026-03-09.