Pull requests: vllm-project/tpu-inference
feat: add video_grid_thw plumbing for multimodal video inputs
#2404
opened Apr 25, 2026 by
HyperBlaze456
Fix XLA Compilation warning
ready
ONLY add when PR is ready to merge/full CI is needed
#2402
opened Apr 25, 2026 by
kyuyeunk
Collaborator
[BugFix] use deepcopy's model_config when calling vllm_get_model to prevent config mutation
#2399
opened Apr 25, 2026 by
yaochengji
Collaborator
[Qwen3.5] Enable jittable vision tower for Qwen3.5
ready
#2396
opened Apr 24, 2026 by
lk-chen
Collaborator
Optimize GDN conv1d
ready
#2394
opened Apr 24, 2026 by
helloworld1
Collaborator
Extend attn_dp_expert to emulate attn_dp.
#2392
opened Apr 24, 2026 by
NicoGrande
Collaborator
feat: custom traces, flow events, kv cache metadata
#2391
opened Apr 24, 2026 by
rushabh-46
[TPU KV Offloading] [Feat] KV cache offloading to host memory
#2390
opened Apr 24, 2026 by
juncgu-google
Collaborator
[CI] Implement interactive wizard for CI model and feature onboarding
ready
#2385
opened Apr 24, 2026 by
boe20211
Collaborator
Fix forward n-d buffer with jitted unpack
ready
#2362
opened Apr 22, 2026 by
pv97
Collaborator
update libs - fix sc kernel
ready
#2356
opened Apr 22, 2026 by
clee1994
Collaborator
[Kernel][Batched RPA] Increase prefill batch size
ready
[DeepSeek] Adding torchax e2e MMLU test
ready
#2350
opened Apr 21, 2026 by
gpolovets1
Collaborator
create and opensource kernel tuning infra
ready
#2346
opened Apr 21, 2026 by
patrickji2014
Collaborator
Append MoE expert IDs when enable_return_routed_experts is enabled
ready
#2343
opened Apr 21, 2026 by
pv97
Collaborator
Add env variable for overriding rpa block sizes
ready
#2338
opened Apr 20, 2026 by
wenxindongwork
Collaborator
[Disagg/qwen3.5] disagg support for qwen3.5 (4/n bench script)
ready
#2337
opened Apr 20, 2026 by
wyzhang
Collaborator
GLM-5.1-FP8 MLA multi-host + multi-host weight loader + FP8 MoE direct path
#2324
opened Apr 20, 2026 by
yiqiliu2
3 tasks done
Add support for Int4-CompressedTensors MoE
ready
#2306
opened Apr 17, 2026 by
dmmolitor
Contributor