Skip to content

Pull requests: NVIDIA/TensorRT-LLM

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[None][fix] Update CUTLASS C++ to 4.4.2
#12897 opened Apr 9, 2026 by depaulmillz Loading…
[None][perf] Use sliding-64 batch sizes for padding-enabled CUDA graphs
#12895 opened Apr 9, 2026 by yijingl-nvidia Collaborator Loading…
1 task
Revert "[None][infra] Bump black and tornado (#12876)"
#12894 opened Apr 9, 2026 by niukuo Collaborator Loading…
1 task
Fix missing disagg_request_id fallback in context responses Community want to contribute PRs initiated from Community
#12893 opened Apr 9, 2026 by pich4ya Loading…
[None][fix] Batch addSequence with pre-claim to fix host offloading M…
#12892 opened Apr 9, 2026 by liji-nv Collaborator Loading…
1 task done
[None][infra] update docker images
#12891 opened Apr 9, 2026 by niukuo Collaborator Loading…
1 task done
[None][fix] Update moe hidden_size in communicator for nemotron-h
#12890 opened Apr 9, 2026 by Wanli-Jiang Collaborator Loading…
1 task done
[https://nvbugs/6050489][fix] fix agg pp4 hang issue
#12888 opened Apr 9, 2026 by bo-nv Collaborator Loading…
1 task
[None][feat] Add benchmark for all allreduce backend
#12887 opened Apr 9, 2026 by yilin-void Collaborator Loading…
1 task done
[https://nvbugs/6055474][test] Fix RTX-6000 with wrong moe backend
#12886 opened Apr 9, 2026 by yufeiwu-nv Collaborator Loading…
1 task done
[None][test] Enable test for kv_cache_manager_v2 for A10
#12885 opened Apr 9, 2026 by lowsfer Member Loading…
1 task done
[TRTLLM-11585][feat] Add CUTEDSL moe backend for nemotron-h
#12884 opened Apr 9, 2026 by Wanli-Jiang Collaborator Draft
1 task done
[None][refactor] Modularize resource_manager.py into a package
#12883 opened Apr 9, 2026 by eopXD Collaborator Draft
5 tasks done
[TRTLLM-11878][feat] Gen-only sync KV transfer for dis-agg
#12882 opened Apr 9, 2026 by Shixiaowei02 Collaborator Loading…
1 task done
[TRTLLM-11861][infra] Support wildcard in bot stage-list/extra-stage commands
#12881 opened Apr 9, 2026 by mzweilz Collaborator Loading…
5 tasks
debug
#12880 opened Apr 9, 2026 by yuanjingx87 Collaborator Draft
1 task
[None][fix] Batch addSequence with pre-claim to fix host offloading M…
#12878 opened Apr 9, 2026 by liji-nv Collaborator Loading…
1 task done
[None][feat] EXAONE-4.5 Support
#12873 opened Apr 9, 2026 by yechank-nvidia Collaborator Loading…
[None][fix] Fix contrained decoding for GLM5
#12869 opened Apr 9, 2026 by cascade812 Collaborator Loading…
1 task done
[None][fix] Guard CUDA event elapsed_time in perf_metrics_manager to prevent executor crash Community want to contribute PRs initiated from Community
#12868 opened Apr 9, 2026 by yifjiang Contributor Loading…
2 tasks done
[None][infra] Cherry pick fix for 1.2.0rc6.post4
#12867 opened Apr 9, 2026 by yuanjingx87 Collaborator Loading…
1 task done
ProTip! What’s not been updated in a month: updated:<2026-03-09.