support vLLM >=0.11.0 (V1 engine) for better performance by Jzz1943 · Pull Request #1640 · FunAudioLLM/CosyVoice

Jzz1943 · 2025-11-10T08:33:46Z

Support running CosyVoice2 inference with vLLM 0.11.0 (V1 engine only) for better performance.

Under the same conditions, first-chunk latency with vLLM 0.11.0 (V1 engine) is roughly 15 ms or more lower than with vLLM 0.9.0 (V0 engine). First-chunk latency is also more stable, with much smaller fluctuations than on the V0 engine.
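For anyone wanting to reproduce this kind of comparison, here is a minimal sketch of how first-chunk latency and its fluctuation could be measured. It is not the benchmark used in this PR. `first_chunk_latency_ms`, `summarize`, and `fake_stream` are hypothetical names, and `fake_stream` is only a stand-in for a real streaming synthesis call.

```python
import statistics
import time
from typing import Callable, Iterable, List


def first_chunk_latency_ms(make_stream: Callable[[], Iterable[bytes]]) -> float:
    """Return the time in milliseconds until the stream yields its first chunk."""
    start = time.perf_counter()
    for _chunk in make_stream():
        # Stop timing as soon as the first chunk arrives.
        return (time.perf_counter() - start) * 1000.0
    raise RuntimeError("stream produced no chunks")


def summarize(latencies_ms: List[float]) -> dict:
    """Mean shows the typical latency; stdev shows run-to-run fluctuation."""
    return {
        "mean_ms": statistics.mean(latencies_ms),
        "stdev_ms": statistics.stdev(latencies_ms) if len(latencies_ms) > 1 else 0.0,
        "min_ms": min(latencies_ms),
        "max_ms": max(latencies_ms),
    }


def fake_stream():
    # Hypothetical stand-in for a streaming inference call (assumption):
    # waits 10 ms, then yields two dummy audio chunks.
    time.sleep(0.01)
    yield b"\x00" * 320
    yield b"\x00" * 320


if __name__ == "__main__":
    runs = [first_chunk_latency_ms(fake_stream) for _ in range(5)]
    print(summarize(runs))
```

Running the same script against each engine with identical inputs and comparing `mean_ms` and `stdev_ms` would give the two figures discussed above: the latency reduction and the stability.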

Upstream improvements from FunAudioLLM/CosyVoice:

- PR FunAudioLLM#1640: Support vLLM 0.11.0+ (V1 engine) for better performance
  - First-chunk latency reduced by ~15ms
  - More stable latency with smaller fluctuations
  - Backward compatible with vLLM 0.9.0
- PR FunAudioLLM#1129: Add limited support for MPS devices (Apple Silicon)
  - Enables partial compatibility with M1/M2/M3/M4 Macs
  - Auto-enables JIT on MPS for better performance
  - ONNX models fall back to CPU (ONNX Runtime limitation)

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
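As an illustration of the version split described above (vLLM >= 0.11.0 on the V1 engine, vLLM 0.9.0 still supported), here is a rough sketch of a version gate. It is not the code from this PR. `parse_version`, `engine_for`, and the `"V1"`/`"V0"` labels are hypothetical, and mapping every version below 0.11.0 to V0 is an assumption that follows the PR's own pairing of versions with engines.

```python
import re
from typing import Tuple

# From the PR title: vLLM >= 0.11.0 is run with the V1 engine.
V1_MIN_VERSION = (0, 11, 0)


def parse_version(version: str) -> Tuple[int, int, int]:
    """Parse the leading 'major.minor[.patch]' of a version string, ignoring suffixes."""
    match = re.match(r"(\d+)\.(\d+)(?:\.(\d+))?", version)
    if match is None:
        raise ValueError(f"unrecognized version string: {version!r}")
    major, minor, patch = match.groups()
    return int(major), int(minor), int(patch or 0)


def engine_for(version: str) -> str:
    """Return 'V1' for versions >= 0.11.0 and 'V0' otherwise (hypothetical labels)."""
    return "V1" if parse_version(version) >= V1_MIN_VERSION else "V0"


if __name__ == "__main__":
    for v in ("0.9.0", "0.11.0", "0.11.1.post1"):
        print(v, engine_for(v))
```

In a real environment the installed version string could be obtained with `importlib.metadata.version("vllm")` and passed to `engine_for`.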

aluminumbox · 2025-12-31T02:36:14Z

thanks for the update, will merge it soon


support vLLM >=0.11.0 (V1 engine only)

6816fc6

Jzz1943 changed the title ~~support vLLM >=0.11.0 (V1 engine only)~~ support vLLM >=0.11.0 (V1 engine) for better performance Nov 13, 2025

ayutaz mentioned this pull request Dec 10, 2025

feat: add vLLM V1 engine + MPS (Apple Silicon) support ayutaz/CosyVoice#4

Merged

4 tasks

Merge branch 'main' into main

f88a14e

aluminumbox merged commit 8811772 into FunAudioLLM:main Dec 31, 2025


aluminumbox merged 2 commits into FunAudioLLM:main from Jzz1943:main


2 participants
