@@ -29,14 +29,29 @@ EchoForge wraps this technology in a user-friendly web interface and API, making
 
 ### Direct CSM Implementation
 
-EchoForge includes a direct CSM implementation that bypasses adapter layers and directly uses the CSM model. This approach offers several advantages:
+EchoForge now uses the Direct CSM implementation by default for faster voice generation, especially on CUDA-enabled devices.
 
-- **Improved Performance**: Direct access to the model reduces overhead
-- **Better Audio Quality**: Fewer transformations lead to clearer voice output
-- **Simplified Architecture**: Reduces complexity in the voice generation pipeline
-- **Fallback Mechanisms**: Automatic fallback to CPU if CUDA fails
+#### Features
+- Up to 25x faster generation on GPU compared to CPU
+- Maintains the same high-quality voice output
+- Automatically falls back to CPU if CUDA is unavailable
 
-For more details, see the [Direct CSM documentation](docs/DIRECT_CSM.md).
+#### Usage
+To start the server with Direct CSM enabled:
+```
+python run.py --direct-csm
+```
+
+When using the API, specify `device=cuda` for faster generation:
+```
+curl -X POST http://localhost:8779/api/generate \
+-H "Content-Type: application/json" \
+-d '{"text": "Your text here", "voice": "male", "temperature": 0.7, "device": "cuda"}'
+```
+
+#### Performance
+- CUDA generation: ~3-6 seconds
+- CPU generation: ~150 seconds
 
 ## Installation
 
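The curl request added in the Usage section of this diff can also be issued from Python. The following is a minimal sketch, not part of the commit: the host, port, path, and JSON fields are copied from the curl example, while the helper names `build_generate_request` and `generate`, and the 300-second timeout (chosen to leave room for the roughly 150-second CPU case), are assumptions made for illustration. The diff does not describe the response format, so the body is returned as raw bytes.

```python
import json
import time
import urllib.request

# Assumption: host, port, and path are taken from the curl example in the diff.
BASE_URL = "http://localhost:8779"


def build_generate_request(text, voice="male", temperature=0.7,
                           device="cuda", base_url=BASE_URL):
    """Build the POST request; the JSON fields mirror the curl example."""
    payload = {
        "text": text,
        "voice": voice,
        "temperature": temperature,
        "device": device,
    }
    return urllib.request.Request(
        url=f"{base_url}/api/generate",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(text, timeout=300.0, **kwargs):
    """Send the request and return (raw response bytes, elapsed seconds).

    The response format is not described in the diff, so the body is
    returned undecoded.
    """
    request = build_generate_request(text, **kwargs)
    start = time.perf_counter()
    with urllib.request.urlopen(request, timeout=timeout) as response:
        body = response.read()
    return body, time.perf_counter() - start


# Example (requires a running EchoForge server):
# body, seconds = generate("Your text here", device="cuda")
# print(f"received {len(body)} bytes in {seconds:.1f} s")
```

Timing the call makes it easy to compare a local setup against the CUDA and CPU figures listed under Performance; passing `device="cpu"` exercises the slower path explicitly.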