A command-line tool that automatically transcribes Chinese podcasts and translates them to English.
It's not perfect, but it works ;)
- Audio Processing: Converts various audio formats to optimized WAV
- Speech Recognition: Uses Vosk for accurate Chinese speech recognition
- Translation: Automatically translates Chinese text to English
- Progress Tracking: Shows real-time progress during long operations
- Timestamps: Preserves timing information in transcripts
# Clone the repository
git clone https://github.com/yourusername/chinese-podcast-transcriber.git
cd chinese-podcast-transcriber
# Install dependencies
pip install vosk transformers tqdm
# Download Vosk Chinese model
wget https://alphacephei.com/vosk/models/vosk-model-cn-0.22.zip
unzip vosk-model-cn-0.22.zip
mv vosk-model-cn-0.22 model-cnMake sure FFmpeg is installed on your system.
Basic usage:
python translate-podcasts.py your-podcast.mp3Advanced options:
python translate-podcasts.py your-podcast.mp3 \
--model_path path/to/model \
--transcript_file chinese_output.txt \
--translated_file english_output.txt \
--batch_size 15 \
--verbose- Python 3.7+
- FFmpeg
- Vosk
- Transformers
- tqdm
- GNU Affero General Public License v3.0