A comprehensive ComfyUI integration for Microsoft's VibeVoice text-to-speech model, enabling high-quality single and multi-speaker voice synthesis directly within your ComfyUI workflows.
-
Updated
Feb 18, 2026 - Python
A comprehensive ComfyUI integration for Microsoft's VibeVoice text-to-speech model, enabling high-quality single and multi-speaker voice synthesis directly within your ComfyUI workflows.
High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices, emotion tags, and modern web UI. Optimized for RTX GPUs.
Rapida is an open-source, end-to-end voice AI orchestration platform for building real-time conversational voice agents with audio streaming, STT, TTS, VAD, multi-channel integration, agent state management, and observability.
ComfyUI custom node for the VibeVoice TTS. Expressive, long-form, multi-speaker conversational audio
ComfyUI node for highly expressive speech and realistic zero-shot voice cloning
🎙️ Speak with AI - Run locally using Ollama, OpenAI, Anthropic or xAI - Speech uses SparkTTS, OpenAI, ElevenLabs, Kokoro, Typecast or xAI
List of open-source TTS, voice cloning, and music generation models
Eleven Labs text to speech package for NodeJS. You can use the official package at: https://www.npmjs.com/package/elevenlabs
🦆💰 A bot that uses Uberduck (and FakeYou) AI to make bit donations have an AI voice.
Beautiful voice app: record or upload to train a voice, generate speech from text or files, save & download voices.
Voice AI agent for reactivating cold leads through personalized calls, assessing their interest with AI agents, and syncing insights directly to your CRMs.
Voice-powered AI assistant platform — connect any LLM, any TTS, with a live web canvas, music generation, and agent orchestration using openclaw. Install: npx openvoiceui setup
Free voice cloning for creators using Coqui XTTS-v2 on Google Colab. Clone your voice with just a few minutes of audio. Complete guide to build your own notebook.
Преобразование голоса на основе VITS. Ориентировано на простоту, качество и производительность.
ComfyUI node for modular, human‑like Kani TTS. Generate natural, high‑quality speech from text
Archive of the official Microsoft VibeVoice repository (7B & 1.5B). Backup of the deleted source code for the open-source TTS models, including the removed 7B version. Try the VibeVoice online service
A framework for AI WhatsApp calls using Whisper, Coqui TTS, GPT-3.5 Turbo, Virtual Audio Cable, and the WhatsApp Desktop App.
Local, portable GUI for Qwen3-TTS. Optimized for NVIDIA RTX 50 Series (CUDA 12.8). One-click install.
Hyper-fast, local, high-quality TTS based on Kokoro-82M. PySide6 GUI included.
Add a description, image, and links to the ai-voice topic page so that developers can more easily learn about it.
To associate your repository with the ai-voice topic, visit your repo's landing page and select "manage topics."