🎙️ Transcriptocast AI

title	emoji	colorFrom	colorTo	sdk	sdk_version	app_file	pinned
Transcriptocast AI Demo	🎙️	blue	purple	docker	3.10.0	app.py	false

🎙️ Transcriptocast AI

A powerful AI-powered application that provides audio transcription, text summarization, and multi-language translation capabilities. Built with FastAPI and deployed on Hugging Face Spaces.

🌟 Key Features

Audio Transcription: Convert audio to text using OpenAI's Whisper model
Text Summarization: Generate concise summaries using Facebook's BART model
Multi-language Translation: Translate between multiple languages using mBART

🚀 Quick Start

Local Development

Clone the repository:

git clone https://github.com/yourusername/transcriptocast.git
cd transcriptocast

Install dependencies:

pip install -r requirements.txt

Run the application:

uvicorn app:app --host 0.0.0.0 --port 7860

Docker Deployment

Build and run using Docker:

docker build -t transcriptocast .
docker run -p 7860:7860 transcriptocast

📚 API Documentation

Endpoints

Transcribe Audio (POST /transcribe)
- Converts audio files to text
- Accepts: Audio file (MP3, WAV, etc.)
- Returns: Transcribed text
Summarize Text (POST /summarize)
- Generates concise summaries
- Accepts: Text input
- Returns: Summary
Translate Text (POST /translate)
- Translates text between languages
- Accepts: Text and language codes
- Returns: Translated text

🛠️ Technical Stack

Backend Framework: FastAPI
AI Models:
- Whisper (OpenAI) for transcription
- BART (Facebook) for summarization
- mBART (Facebook) for translation
Deployment: Hugging Face Spaces
Container: Docker

🔧 Configuration

The application uses the following environment variables:

TRANSFORMERS_CACHE: Cache directory for models
HF_HOME: Hugging Face home directory

📦 Model Information

Whisper Model

Type: Speech-to-Text
Version: Base
Use Case: Audio transcription
Size: ~1GB

BART Model

Type: Text Summarization
Model: facebook/bart-large-cnn
Use Case: Text summarization
Features: Abstractive summarization

mBART Model

Type: Machine Translation
Model: facebook/mbart-large-50-many-to-many-mmt
Use Case: Multi-language translation
Languages: 50+ languages

🌐 Live Demo

Try the live demo at: Hugging Face Space

📝 License

This project is licensed under the MIT License - see the LICENSE file for details.

🤝 Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

📫 Contact

For any questions or suggestions, please open an issue in the repository.

Made with ❤️ by Prashant Ambati

Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
.gitattributes		.gitattributes
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
app.py		app.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🎙️ Transcriptocast AI

🌟 Key Features

🚀 Quick Start

Local Development

Docker Deployment

📚 API Documentation

Endpoints

🛠️ Technical Stack

🔧 Configuration

📦 Model Information

Whisper Model

BART Model

mBART Model

🌐 Live Demo

📝 License

🤝 Contributing

📫 Contact

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

Prashant-ambati/transcriptocast

Folders and files

Latest commit

History

Repository files navigation

🎙️ Transcriptocast AI

🌟 Key Features

🚀 Quick Start

Local Development

Docker Deployment

📚 API Documentation

Endpoints

🛠️ Technical Stack

🔧 Configuration

📦 Model Information

Whisper Model

BART Model

mBART Model

🌐 Live Demo

📝 License

🤝 Contributing

📫 Contact

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages