Interview Ace 🎙️

AI-powered real-time interview assistant with dual-agent voice recognition and smart Q&A generation.

What it does: Interview Ace sits as a transparent overlay on top of your video call (Zoom, Teams, Google Meet). It listens to the interviewer's questions in real time, searches your resume and prep materials, and suggests answers within seconds.

Features

🎤 Real-time voice capture from system audio + microphone
🗣️ Speaker diarization — automatically separates interviewer from candidate
📝 Live transcription powered by Whisper
🧠 RAG-based answer generation using your resume, job description, and notes
💡 Multi-LLM support (OpenAI, Anthropic, Google, DeepSeek)
🖥️ Electron overlay — transparent, always-on-top, floats over any video call
⚡ WebSocket-based — low-latency real-time streaming

Quick Start

Prerequisites

Python 3.11+
Node.js 18+
An LLM API key (OpenAI, Anthropic, etc.)

1. Clone & Configure

git clone https://github.com/SangJieGe/Interview-Ace.git
cd Interview-Ace
cp .env.example .env
# Edit .env with your API keys

2. Backend

cd backend
python3 -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt
cd ..  # return to the repository root so the backend.main module path resolves
uvicorn backend.main:app --host 0.0.0.0 --port 8000 --reload

3. Frontend

cd frontend
npm install
npm run dev

4. Electron Overlay (optional)

cd frontend
npm run electron:dev

5. Download ML Models

bash scripts/download_models.sh

Docker (Alternative)

docker-compose up --build

This starts backend, frontend, and ChromaDB vector store.

How It Works

┌──────────────┐     ┌──────────────────────────────────┐
│  Video Call   │     │         Interview Ace             │
│  (Zoom/Teams) │────▶│                                  │
│               │     │  Voice Agent ──→ Knowledge Agent  │
│  Interviewer  │     │  (listen +      (search +        │
│  asks question│     │   transcribe)    generate answer) │
│               │     │       │                │          │
└──────────────┘     │       ▼                ▼          │
                      │  📝 Transcript   💡 Answer        │
                      │  shown live      shown live       │
                      └──────────────────────────────────┘

Voice Agent (Agent 2) captures audio, detects speech, identifies who is speaking, and transcribes the speech using Whisper.
Knowledge Agent (Agent 1) takes the transcribed question, searches your documents via RAG, and generates a contextual answer using an LLM.
Both the transcript and the answer appear on the overlay in real time.
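
The flow above can be sketched as two functions chained together. This is only an illustration under stated assumptions, not the project's code: `Utterance`, `voice_agent`, `retrieve`, and `knowledge_agent` are hypothetical names, the transcription step is replaced by pre-transcribed text, retrieval uses plain word overlap instead of ChromaDB embeddings, and the final step builds a prompt rather than calling an LLM.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class Utterance:
    speaker: str  # "interviewer" or "candidate" (labels assumed for this sketch)
    text: str


def voice_agent(segments: List[Tuple[str, str]]) -> List[Utterance]:
    """Stand-in for Agent 2: takes (speaker, text) pairs that are already
    transcribed and drops empty segments. The real agent would run VAD,
    diarization, and Whisper here."""
    return [Utterance(s, t.strip()) for s, t in segments if t.strip()]


def retrieve(question: str, documents: List[str], top_k: int = 1) -> List[str]:
    """Toy retrieval: rank documents by the number of words shared with the
    question. A vector store would rank by embedding similarity instead."""
    q_words = set(question.lower().split())
    ranked = sorted(
        documents,
        key=lambda d: len(q_words & set(d.lower().split())),
        reverse=True,
    )
    return ranked[:top_k]


def knowledge_agent(utterance: Utterance, documents: List[str]) -> Optional[str]:
    """Stand-in for Agent 1: only interviewer speech triggers retrieval.
    Returns the prompt that would be sent to the LLM, or None."""
    if utterance.speaker != "interviewer":
        return None
    context = retrieve(utterance.text, documents)
    return f"Question: {utterance.text}\nContext: {' | '.join(context)}"
```

In this sketch, candidate speech is transcribed but never sent to the Knowledge Agent, which mirrors the role of speaker identification in the description above.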

Configuration

All configuration is via environment variables. See .env.example for the full list.

| Variable | Description | Default |
| --- | --- | --- |
| `LLM_PROVIDER` | LLM provider (`openai`, `anthropic`, `google`, `deepseek`) | `openai` |
| `LLM_API_KEY` | API key for the LLM provider | (none) |
| `LLM_MODEL` | Model to use | `gpt-4o` |
| `WHISPER_MODEL` | Whisper model size (`tiny`, `base`, `small`, `medium`, `large-v3`) | `base` |
| `AUDIO_DEVICE_INDEX` | Audio input device index | `0` |
| `VECTOR_DB` | Vector database (`chromadb`, `pinecone`) | `chromadb` |
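
As a rough illustration of how these variables and their defaults might be read in Python, the sketch below loads them into a small settings object. The `Settings` class and `load_settings` function are hypothetical names introduced here and may not match the project's actual config module; only the variable names, allowed values, and defaults come from the table.

```python
import os
from dataclasses import dataclass
from typing import Mapping, Optional

# Allowed values as listed in the configuration table.
LLM_PROVIDERS = {"openai", "anthropic", "google", "deepseek"}
WHISPER_MODELS = {"tiny", "base", "small", "medium", "large-v3"}
VECTOR_DBS = {"chromadb", "pinecone"}


@dataclass(frozen=True)
class Settings:
    llm_provider: str
    llm_api_key: Optional[str]
    llm_model: str
    whisper_model: str
    audio_device_index: int
    vector_db: str


def load_settings(env: Optional[Mapping[str, str]] = None) -> Settings:
    """Read settings from the environment, falling back to the documented defaults."""
    env = os.environ if env is None else env
    settings = Settings(
        llm_provider=env.get("LLM_PROVIDER", "openai"),
        llm_api_key=env.get("LLM_API_KEY"),  # no default; None when unset
        llm_model=env.get("LLM_MODEL", "gpt-4o"),
        whisper_model=env.get("WHISPER_MODEL", "base"),
        audio_device_index=int(env.get("AUDIO_DEVICE_INDEX", "0")),
        vector_db=env.get("VECTOR_DB", "chromadb"),
    )
    # Fail early on values outside the documented choices.
    if settings.llm_provider not in LLM_PROVIDERS:
        raise ValueError(f"Unsupported LLM_PROVIDER: {settings.llm_provider}")
    if settings.whisper_model not in WHISPER_MODELS:
        raise ValueError(f"Unsupported WHISPER_MODEL: {settings.whisper_model}")
    if settings.vector_db not in VECTOR_DBS:
        raise ValueError(f"Unsupported VECTOR_DB: {settings.vector_db}")
    return settings
```

Passing a plain dictionary instead of `os.environ` makes the loader easy to exercise in tests without touching the real environment.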

Tech Stack

| Layer | Technology |
| --- | --- |
| Backend | Python, FastAPI, WebSocket |
| Speech-to-Text | OpenAI Whisper |
| Voice Activity Detection | Silero VAD |
| Speaker Diarization | Embedding-based comparison |
| RAG | ChromaDB + sentence-transformers |
| LLM | OpenAI / Anthropic / Google / DeepSeek |
| Frontend | React, TypeScript, Tailwind CSS |
| Desktop | Electron (transparent overlay) |
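
To make "embedding-based comparison" concrete, here is a minimal sketch of one common approach: compare the embedding of an audio segment against a stored reference embedding per speaker using cosine similarity. This is an assumption about the general technique, not a description of the project's implementation; `cosine_similarity`, `identify_speaker`, the profile labels, and the `0.7` threshold are all hypothetical, and real speaker embeddings would have hundreds of dimensions rather than two.

```python
import math
from typing import Dict, Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two vectors; 0.0 if either has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def identify_speaker(
    embedding: Sequence[float],
    profiles: Dict[str, Sequence[float]],
    threshold: float = 0.7,  # assumed cut-off; would need tuning on real audio
) -> str:
    """Return the label of the most similar reference profile,
    or "unknown" if no profile reaches the threshold."""
    best_label, best_score = "unknown", threshold
    for label, reference in profiles.items():
        score = cosine_similarity(embedding, reference)
        if score >= best_score:
            best_label, best_score = label, score
    return best_label
```

A usage example with toy two-dimensional vectors:

```python
profiles = {"candidate": [1.0, 0.0], "interviewer": [0.0, 1.0]}
label = identify_speaker([0.1, 0.9], profiles)  # closest to the interviewer profile
```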

Project Structure

Interview-Ace/
├── backend/              # Python FastAPI server
│   ├── api/              # REST + WebSocket routes
│   ├── agents/           # Agent 1 (Knowledge) + Agent 2 (Voice)
│   ├── core/             # Config, audio utilities
│   ├── models/           # Pydantic schemas
│   └── rag/              # RAG retrieval engine
├── frontend/             # React + Electron app
│   ├── electron/         # Electron main process
│   └── src/              # React components + hooks
├── scripts/              # Setup & utility scripts
├── docs/                 # Architecture & design docs
└── docker-compose.yml    # Docker deployment

See docs/architecture.md for the detailed system design.

Roadmap

Contributing

See CONTRIBUTING.md for development setup and guidelines.

License

MIT License — see LICENSE for details.
