AI Closed Captions

AI-driven English closed captions generator, optimized for RTX 5080 hardware. This project provides tools for transcribing audio from video files into English subtitles, with a focus on anime content where matching English dubs are often unavailable.

Overview

This repository contains experimental scripts for generating closed captions using AI transcription models. The primary working implementation is transcribe_faster_whisper.sh, a wrapper script that orchestrates the underlying Python components for transcription.

An earlier attempt using WhisperX was aborted due to compatibility and performance issues.

Features

The main pipeline (transcribe_faster_whisper.sh) includes the following capabilities:

API Context Integration: Leverages external APIs to gather contextual information about the video content, improving transcription accuracy.
Sign/Song Merging: Automatically detects and merges signs, songs, and other non-dialogue elements into the subtitle track.
Existing Subtitle Detection and Merging: Identifies pre-existing closed caption tracks in the video file. If found, AI generation of new tracks is skipped, and existing tracks are merged with any detected signs/songs tracks.

Usage

Note: This project is highly experimental and requires significant manual intervention. It is not a fully automated solution.

Ensure you have the necessary dependencies installed (see requirements_fw.txt for Python packages).
Prepare your video file and gather context:
- Know the expected list of existing subtitle tracks.
- Manually input the proper name of the video, TV show, or movie to fetch relevant context.
Run the primary script:
```
./transcribe_faster_whisper.sh [video_file] [options]
```
Customize prompts and context as needed for optimal results.

Caveats and Limitations

Experimental Status: This is an ongoing, incomplete project. Results vary greatly depending on the video file.
Manual Knowledge Required: Success depends on your familiarity with the target video file, including its subtitle tracks and content details.
Context Prompting: Requires manual input of accurate video names and contextual information to improve transcription quality.
Anime Focus: Primarily tested and used for anime content, which is notorious for lacking synchronized English subtitles for dubs.
Hardware Specific: Optimized for RTX 5080; performance may vary on other systems.

Use at your own risk, and expect to iterate on results manually.

Requirements

Python 3.x
Dependencies listed in requirements_fw.txt
RTX 5080 GPU (recommended)
FFmpeg or similar for video processing

Contributing

This is a personal project, but feel free to fork and experiment. Pull requests for improvements are welcome, though the project is not actively maintained.

License

MIT License - Intended for personal, non-commercial use. See LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
contexts		contexts
docs		docs
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements_fw.txt		requirements_fw.txt
show_gemini_models.py		show_gemini_models.py
test-anime-api.py		test-anime-api.py
transcribe_client.py		transcribe_client.py
transcribe_daemon.py		transcribe_daemon.py
transcribe_faster_whisper.py		transcribe_faster_whisper.py
transcribe_faster_whisper.sh		transcribe_faster_whisper.sh
transcribe_whisperx.py		transcribe_whisperx.py
transcribe_whisperx.sh		transcribe_whisperx.sh
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AI Closed Captions

Overview

Features

Usage

Caveats and Limitations

Requirements

Contributing

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

AI Closed Captions

Overview

Features

Usage

Caveats and Limitations

Requirements

Contributing

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages