Skip to content

Pull requests: vllm-project/tpu-inference

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Support overriding logic for hybrid kv cache padding ready ONLY add when PR is ready to merge/full CI is needed
#1285 opened Dec 11, 2025 by kyuyeunk Loading…
[Bugfix][Refactor] Fix compressed tensor moe init ready ONLY add when PR is ready to merge/full CI is needed
#1283 opened Dec 11, 2025 by kyuyeunk Loading…
[Kernel][Misc] Remove jax.named_scope ready ONLY add when PR is ready to merge/full CI is needed
#1278 opened Dec 10, 2025 by kyuyeunk Loading…
[do not review][do not submit] ready ONLY add when PR is ready to merge/full CI is needed
#1277 opened Dec 10, 2025 by QiliangCui Loading…
Move the If nightly==1 check out of command.
#1276 opened Dec 10, 2025 by QiliangCui Loading…
add new kernel and quantization support matrices
#1275 opened Dec 10, 2025 by boe20211 Loading…
Add default 'auto' MODEL_IMPL_TYPE that resolves based on architecture ready ONLY add when PR is ready to merge/full CI is needed
#1255 opened Dec 5, 2025 by xingliu14 Loading…
docs: update support matrices and improve visuals
#1250 opened Dec 5, 2025 by RobMulla Loading…
Avoid installing CUDA related stuff
#1246 opened Dec 4, 2025 by wdhongtw Loading…
update run_in_docker script for running on local env ready ONLY add when PR is ready to merge/full CI is needed
#1243 opened Dec 4, 2025 by ernie-chang Loading…
Add workflow to build vLLM-TPU wheel using PyPI tpu-inference ready ONLY add when PR is ready to merge/full CI is needed
#1241 opened Dec 4, 2025 by ylangtsou Draft
[CI] Fix awq dtype ready ONLY add when PR is ready to merge/full CI is needed
#1220 opened Dec 2, 2025 by kyuyeunk Loading…
[Oncall] update the SchedulerConfig interface
#1219 opened Dec 2, 2025 by bzgoogle Loading…
Add a SP e2e test.
#1209 opened Dec 2, 2025 by vanbasten23 Loading…
Save size in scalar scratch for bo and bq ready ONLY add when PR is ready to merge/full CI is needed
#1201 opened Dec 1, 2025 by rupengliu-meta Loading…
[Qwix/Flax] Upgrade to Flax 0.12.0 + Qwix 0.1.4
#1170 opened Nov 25, 2025 by jrplatin Loading…
[do not merge] test status check POC ready ONLY add when PR is ready to merge/full CI is needed
#1168 opened Nov 25, 2025 by khluu Loading…
[Feat][TPU Offload] KV cache offload to local cpu buffer ready ONLY add when PR is ready to merge/full CI is needed
#1163 opened Nov 24, 2025 by juncgu-google Loading…
DP support for GPT OSS
#1096 opened Nov 13, 2025 by wenxindongwork Draft
ProTip! no:milestone will show everything without a milestone.