Pull requests: vllm-project/vllm

Pull requests list

[Bugfix][CI] Fix tests/distributed/test_torchrun_example_moe.py
Labels: bug, ready
#40349 opened Apr 20, 2026 by NickLucche Collaborator
[Bugfix][Reasoning] Strip grouped think markers from streaming deltas
Labels: bug
#40348 opened Apr 20, 2026 by wuyingjun-lucky Contributor
[Bugfix][Gemma4] Fix vision fp16 overflow causing <pad> output
Labels: bug
#40347 opened Apr 20, 2026 by wenqiangire-commits
[Docs] [Misc] add sig list table in community governance process
Labels: documentation
#40342 opened Apr 20, 2026 by pacoxu Contributor
1 of 4 tasks
[Bugfix] Normalize malformed dict prompts that carry token IDs in prompt
Labels: bug, verified
#40339 opened Apr 20, 2026 by Alchuang22-dev
[LoRA] MoE LoRA Refactor
Labels: gpt-oss, intel-gpu, nvidia, ready, rocm
#40338 opened Apr 20, 2026 by jeejeelee Collaborator
4 tasks
[Perf] Integrate flash-maxsim Triton kernels for late-interaction scoring
Labels: v1, verified
#40337 opened Apr 20, 2026 by roipony
5 of 6 tasks
Qwen 3 VL: Track and use buffer correctly
Labels: qwen
#40336 opened Apr 20, 2026 by wdhongtw Contributor
3 of 4 tasks
[MM][Misc] Support image+video mixed inputs (per prompt) for VLM examples
Labels: documentation
#40335 opened Apr 20, 2026 by shen-shanshan Contributor
3 of 4 tasks
[Model] fix(dflash): dtype mismatch in combine_hidden_states
Labels: qwen
#40334 opened Apr 20, 2026 by ciphernaut
3 of 4 tasks
[ROCm] Allow Triton MXFP4 MoE support checks on gfx11xx
Labels: gpt-oss, rocm
#40333 opened Apr 20, 2026 by wangrui6
3 of 4 tasks
[Startup] Import hygiene for api_server hot path
Labels: frontend
#40328 opened Apr 20, 2026 by simon-mo Collaborator
5 tasks
[Doc] Sync CLI guide with actual help modes and launch subcommand
Labels: documentation
#40326 opened Apr 20, 2026 by wangrui6
4 tasks
[vLLM IR] Update the pre commit to enforce imports of vllm
#40325 opened Apr 20, 2026 by R3hankhan123 Contributor
4 tasks
Fix Gemma 4 + BitsAndBytes startup failure reported in #38884
#40321 opened Apr 20, 2026 by SouthWest7 Contributor
5 tasks
[Docs] [QeRL] Layerwise Reloading Documentation
Labels: documentation
#40317 opened Apr 20, 2026 by kylesayrs Contributor
[Model] Use AutoWeightsLoader for GPT2
#40312 opened Apr 20, 2026 by cben484