Pull requests: vllm-project/vllm-ascend
Pull requests list
[Test][Misc] Refactor Eagle Proposer unit tests and fix formatting
module:tests
#8720
opened Apr 25, 2026 by
SidaoY
Contributor
[CI]Fix the error caused by layer_sharding in dsv32
#8719
opened Apr 25, 2026 by
Nagisa125
Contributor
[BugFix] MTP recurrent batch size after lmhead TP logits truncation
#8718
opened Apr 25, 2026 by
ichaoren
Contributor
[CI]Fix the error caused by layer_sharding in dsv32
module:tests
nightly-test
#8717
opened Apr 25, 2026 by
Nagisa125
Contributor
[Feature] support w8a8 quantization for Xlite
#8715
opened Apr 25, 2026 by
wwwumr
Contributor
[BugFix][EPLB] Validation logic optimization for EPLB and MTP support redundant experts
module:core
#8710
opened Apr 25, 2026 by
shenchuxiaofugui
Collaborator
[Ops][Feature] Support Bailing quantization
module:quantization
#8709
opened Apr 25, 2026 by
alex101-ops
Contributor
[Feature][Doc] Add AI QoS module, tuning tool, and user guide
documentation
Improvements or additions to documentation
module:tests
module:tools
#8706
opened Apr 25, 2026 by
Maybe2191
[Doc][Misc] Add requirements for writing command outputs in deployment template
documentation
#8704
opened Apr 25, 2026 by
herizhen
Contributor
[BugFix][Platform] Guard GPU-specific parallel config params on Ascend NPU
module:core
#8703
opened Apr 25, 2026 by
underfituu
Contributor
2 tasks done
[Refactor] Replace BailingMoELinearAttention monkey-patching with PluggableLayer
module:core
module:ops
#8702
opened Apr 25, 2026 by
ghphotoframe
Contributor
[Doc][v0.18.0] Fix documentation formatting and improve code examples
#8701
opened Apr 25, 2026 by
MrZ20
Contributor
[WIP][Feature]Using torch.float8_e8m0fnu instead of torch_npu.float8_e8m0fnu
module:quantization
#8700
opened Apr 25, 2026 by
lijiahang226
Contributor
[Doc][Misc] Refactor and simplify KV Pool documentation and scripts
documentation
#8696
opened Apr 25, 2026 by
internel-error
[BugFix] non-stream recompute text join
#8695
opened Apr 25, 2026 by
wangxiaoteng888
Contributor
[Doc] Translated Doc files 2026-04-25
documentation
#8689
opened Apr 25, 2026 by
vllm-ascend-ci
Collaborator
[Doc][v0.13.0] remove duplicate --net=host flag in DeepSeek-V4 tutorial
#8688
opened Apr 25, 2026 by
LGiki
[CI][Cherry-pick] Relax TTFT benefits threshold from 0.4 to 0.5 to account for DP load imbalance
#8684
opened Apr 24, 2026 by
underfituu
Contributor
[CI][Main] Relax TTFT benefits threshold from 0.4 to 0.5 to account for DP load imbalance
module:tests
#8683
opened Apr 24, 2026 by
underfituu
Contributor
[CI] add nightly MiniMax-M2.5-w8a8-QuaRot
ci/build
module:tests
nightly-test
#8681
opened Apr 24, 2026 by
weixinAc
[CI] Add nightly case:GLM-5_1-W8A8
ci/build
module:tests
nightly-test
#8680
opened Apr 24, 2026 by
guxin108
Contributor
[BugFix] Fix DSV3.1 W4A8 TTFT degradation
ready
ready for review
ready-for-test
start test by label for PR
#8675
opened Apr 24, 2026 by
wangbj127
Contributor