🎓 Voice Tutor Agent

A real-time voice-based AI tutor built with Pipecat, Sarvam AI (Indian-language STT/TTS), and Google Gemini 2.5 Pro (reasoning LLM).

Based on the Sarvam AI Tutor Agent cookbook.

Pipeline

Student Audio → Sarvam STT (Saaras v3) → Gemini 2.5 Pro → Sarvam TTS (Bulbul v3) → Audio Output

Features

🗣️ Multilingual speech recognition — auto-detects Indian languages
🧠 Gemini 2.5 Pro reasoning — strong problem-solving for math, science, and more
🔊 Natural Indian-English voice — Sarvam Bulbul v3 with clear articulation
📚 Multi-subject tutor — Maths, Science, Languages, Social Studies
🎯 Adaptive teaching — adjusts explanations to student level
🎤 Browser UI — beautiful mic mute/unmute interface with live transcript

Quick Start

1. Prerequisites

Python 3.9+
API keys from:
- Sarvam AI — STT & TTS
- Google AI Studio — Gemini LLM

2. Install Dependencies

pip install -r requirements.txt

3. Set Up Environment

cp .env.example .env
# Edit .env and add your real API keys

4. Run the Agent

python3 server.py

This starts the FastAPI server which:

Serves the web UI at http://localhost:7860
Handles WebRTC signaling at /api/offer
Spawns the tutor bot for each new connection

5. Use the Tutor

Open http://localhost:7860 in your browser
Click "Connect to Tutor"
Click the mic button to unmute
Start speaking — the tutor will respond!

Project Structure

tutor_agent/
├── tutor_agent.py      # Main agent — Pipecat pipeline (Sarvam + Gemini)
├── static/
│   └── index.html      # Browser UI with mic button
├── requirements.txt    # Python dependencies
├── .env.example        # API key template
└── README.md

Customization

Change Language

Edit tutor_agent.py:

# Hindi tutor
stt = SarvamSTTService(..., language="hi-IN")
tts = SarvamTTSService(..., target_language_code="hi-IN", speaker="simran")

Available Languages

en-IN hi-IN bn-IN ta-IN te-IN gu-IN kn-IN ml-IN mr-IN pa-IN od-IN unknown (auto-detect)

Available Voices

Female: Ritu, Priya, Neha, Pooja, Simran, Kavya, Ishita (default), Shreya, Roopa, and more
Male: Shubh, Aditya, Rahul, Rohan, Amit, Dev, and more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🎓 Voice Tutor Agent

Pipeline

Features

Quick Start

1. Prerequisites

2. Install Dependencies

3. Set Up Environment

4. Run the Agent

5. Use the Tutor

Project Structure

Customization

Change Language

Available Languages

Available Voices

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
static		static
.env.example		.env.example
.gitignore		.gitignore
README.md		README.md
architecture.md		architecture.md
prompts.py		prompts.py
requirements.txt		requirements.txt
server.py		server.py
tutor_agent.py		tutor_agent.py

Folders and files

Latest commit

History

Repository files navigation

🎓 Voice Tutor Agent

Pipeline

Features

Quick Start

1. Prerequisites

2. Install Dependencies

3. Set Up Environment

4. Run the Agent

5. Use the Tutor

Project Structure

Customization

Change Language

Available Languages

Available Voices

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages