CosyVoice Desktop Application

A PyQt6-based cross-platform desktop interface for CosyVoice3 intelligent voice synthesis, featuring a unique pixel-art aesthetic.

System Requirements

OS: macOS 11+, Windows 10+, or Ubuntu 20.04+
Python: 3.10
RAM: 8GB (16GB Recommended)
GPU: Optional (Supports NVIDIA CUDA & Apple Silicon MPS)

Architecture

The application follows a clean layered structure to ensure stability and performance:

Presentation: PyQt6 UI with retro pixel-art styling.
Controller: Event handling and UI logic.
Service: Business logic encapsulation.
Worker: Asynchronous task processing via QThread.
Core: CosyVoice3, PyTorch, and audio processing engine.

Usage Guide

Audio Cloning

Reference Audio: Select a clear .wav file via the BROWSE button.
Model Selection: Choose your preferred CosyVoice model from the dropdown.
Synthesis: Enter your text (supports multiple languages) and adjust the Pitch (-12 to +12).
Generate: Click GENERATE and wait for the progress bar to complete.

Model Management

Access the MODEL DOWNLOAD page to manage your synthesis engines:

View real-time download speed and progress.
Download individual models or use DOWNLOAD ALL.
Refresh to update current model statuses.

Supported Models

Model Name	Size	Description
CosyVoice3-0.5B-2512	~1.2 GB	Latest flagship model (Recommended)
CosyVoice2-0.5B	~980 MB	Balanced performance
CosyVoice-300M	~600 MB	Lightweight and fast
CosyVoice-TTSFRD	~550 MB	Optimized for fast response

Built with PyQt6 + CosyVoice3

Name		Name	Last commit message	Last commit date
Latest commit History 47 Commits
backend		backend
data		data
docs		docs
resources		resources
scripts		scripts
static/voices		static/voices
ui		ui
.gitignore		.gitignore
.gitmodules		.gitmodules
CosyVoice_app.pro		CosyVoice_app.pro
LICENSE		LICENSE
PRIVACY_POLICY.md		PRIVACY_POLICY.md
README.md		README.md
THIRD_PARTY_LICENSES.md		THIRD_PARTY_LICENSES.md
main.py		main.py
requirements-cuda-windows.txt		requirements-cuda-windows.txt
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CosyVoice Desktop Application

System Requirements

Architecture

Usage Guide

Audio Cloning

Model Management

Supported Models

About

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

CosyVoice Desktop Application

System Requirements

Architecture

Usage Guide

Audio Cloning

Model Management

Supported Models

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Contributors

Uh oh!

Languages