Conversation
@Jzz1943 commented Nov 10, 2025

Support running CosyVoice2 inference with vLLM 0.11.0 (V1 engine only) for better performance.
[Figure: first-chunk latency comparison, vLLM 0.9.0 (V0 engine) vs. vLLM 0.11.0 (V1 engine)]
Under the same conditions, first-chunk latency with vLLM 0.11.0 (V1 engine) is reduced by roughly 15 ms or more compared with vLLM 0.9.0 (V0 engine). The first-chunk latency is also more stable, with much smaller fluctuations than the V0 engine.
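For reference, here is a minimal usage sketch. It assumes the vLLM backend is enabled through a `load_vllm` constructor flag (as in the repo's README); the model directory and prompt files are placeholder paths, and the timing loop is only an illustration of how the first-chunk latency above could be measured.

```python
import time
import torchaudio
from cosyvoice.cli.cosyvoice import CosyVoice2
from cosyvoice.utils.file_utils import load_wav

# vLLM >= 0.11.0 ships only the V1 engine, so no engine flag is needed;
# `load_vllm=True` is the assumed switch that routes LLM decoding through vLLM.
cosyvoice = CosyVoice2(
    'pretrained_models/CosyVoice2-0.5B',  # placeholder local model directory
    load_jit=False,
    load_trt=False,
    load_vllm=True,
)

prompt_speech_16k = load_wav('./asset/zero_shot_prompt.wav', 16000)

# Streaming synthesis: the PR's metric is the time to the first chunk here.
t0 = time.perf_counter()
for i, out in enumerate(cosyvoice.inference_zero_shot(
        'Hello, this is a first-chunk latency test.',
        'Transcript of the prompt audio.',
        prompt_speech_16k,
        stream=True)):
    if i == 0:
        print(f'first-chunk latency: {(time.perf_counter() - t0) * 1000:.1f} ms')
    torchaudio.save(f'chunk_{i}.wav', out['tts_speech'], cosyvoice.sample_rate)
```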

@Jzz1943 changed the title from "support vLLM >=0.11.0 (V1 engine only)" to "support vLLM >=0.11.0 (V1 engine) for better performance" on Nov 13, 2025
ayutaz pushed a commit to ayutaz/CosyVoice that referenced this pull request Dec 10, 2025
Upstream improvements from FunAudioLLM/CosyVoice:

- PR FunAudioLLM#1640: Support vLLM 0.11.0+ (V1 engine) for better performance
  - First-chunk latency reduced by ~15ms
  - More stable latency with smaller fluctuations
  - Backward compatible with vLLM 0.9.0

- PR FunAudioLLM#1129: Add limited support for MPS devices (Apple Silicon)
  - Enables partial compatibility with M1/M2/M3/M4 Macs
  - Auto-enables JIT on MPS for better performance
  - ONNX models fall back to CPU (ONNX Runtime limitation); see the device-selection sketch after this commit message

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Opus 4.5 <[email protected]>
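A minimal sketch of the device policy the MPS bullet points above describe. The helper names are illustrative, not the referenced PR's actual code; only the PyTorch and ONNX Runtime calls are real APIs.

```python
import torch
import onnxruntime as ort

def pick_device() -> str:
    """Prefer CUDA, then Apple-Silicon MPS, then CPU."""
    if torch.cuda.is_available():
        return "cuda"
    if torch.backends.mps.is_available():
        return "mps"  # per the commit message, JIT is auto-enabled on MPS
    return "cpu"

def onnx_session(model_path: str) -> ort.InferenceSession:
    # ONNX Runtime has no MPS execution provider, so on Apple Silicon the
    # ONNX models always run on CPU, as the commit notes.
    return ort.InferenceSession(model_path, providers=["CPUExecutionProvider"])
```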