π Book Reader is an AI-powered text-to-speech (TTS) application that converts text into high-quality speech using various models. It supports multiple languages and formats, allowing users to generate and listen to audio files easily.
- ποΈ Generate speech from text using AI models
- π Supports multiple languages
- π Outputs in WAV formats
- β‘ Asynchronous background processing
- π΅ List and play generated audio files
- Frontend: Alpine.js, Bootstrap
- Backend: Flask (Python)
- Database: SQLite / MySQL (for task management)
- TTS Models: ONNX-based models
1οΈβ£ Clone the repository:
git clone https://github.com/your-username/book-reader.git
cd book-reader2οΈβ£ Set up a virtual environment (optional but recommended):
python -m venv venv
source venv/bin/activate # On macOS/Linux
venv\Scripts\activate # On Windows3οΈβ£ Install dependencies:
pip install -r requirements.txt4οΈβ£ Run the application:
flask runThe app will be available at http://127.0.0.1:5000/ π
π Usage Enter your text in the input field. Select a language and a speech model. Click Generate Speech and wait for processing. Listen or download the generated audio from the listed files.
π API Endpoints GET /languages β Fetch available languages GET /models/ β Get models for a language POST /generate β Generate speech (returns task ID) GET /check_status/<task_id> β Check generation status GET /audio_files β List generated audio files
π€ Contributing Pull requests are welcome! Feel free to open an issue for discussions.