GitHub - pn-dev-in/AI-Voice-Assistant-System: An applied AI assistant demonstrating voice interaction, LLM reasoning, intent classification, and safe tool execution in a modular, local-first architecture.

AI Voice Assistant System

Local-First Voice AI with Memory, Safety Filtering & Task Execution

What is this?

A local-first AI voice assistant that supports memory, task execution, safety filtering, and daily planning using speech input and LLM reasoning.

The system uses Whisper for speech recognition, Groq-hosted LLMs for reasoning, and a structured orchestration pipeline for controlled execution.

Features

Push-to-talk voice interaction (Ctrl + Alt + Space)
Speech-to-text using Whisper
LLM reasoning via Groq (Llama 3.1)
Long-term memory (remember & recall facts)
Task execution (note saving & reading)
Safety intent filtering
Daily briefing planning mode
Local-first architecture

🏗️ System Architecture

Mic Input
→ Speech-to-Text (Whisper)
→ Mode Detection
→ Memory / Task / Query Routing
→ Safety Filtering
→ LLM Reasoning
→ Text-to-Speech Output

This layered design ensures clear separation of concerns, predictable behavior, and strong safety boundaries.

Project Status

Current:

Core voice pipeline implemented
Memory + task tools operational
Safety layer integrated
Daily briefing mode implemented

Planned:

UI layer
Cloud deployment (Azure App Service)
Modular tool expansion

🔐 Privacy & Safety Principles

This assistant is designed to be trustworthy by default.

No always-on microphone
No silent memory storage
No OS-level or shell command execution
No user data stored remotely
All personal data remains local to the machine
LLM access is used strictly for reasoning and language generation

🎯 Intended Role Fit

This project is aligned with:

AI/ML Engineer (Applied Systems)
AI Engineer – Voice / Conversational AI
Software Engineer (AI-integrated systems)

🚀 Getting Started

1. Clone the repository

git clone https://github.com/<your-username>/<repo-name>.git
cd <repo-name>
2. Create a virtual environment
python -m venv venv
venv\Scripts\activate
3. Install dependencies
pip install -r requirements.txt
4. Configure environment variables
Create a .env file in the project root:

GROQ_API_KEY=your_api_key_here
Note: API keys and personal data are never committed to the repository.

5. Run the assistant
python src/main.py
Press Ctrl + Alt + Space to speak

Press ESC to exit

🧠 Why This Project Exists
This assistant was built to reduce friction in daily thinking and productivity by providing a private, voice-first interface to intelligence, instead of repeatedly opening web-based chat applications.

The goal is not autonomy.
The goal is useful presence with human control.

📌 Notes
This project is intended primarily for personal use

Users must supply their own API keys

Personal memory and notes are stored locally and never shared

Future improvements are guided by real usage, not feature bloat

🔮 Future Work (Optional)

Performance tuning

Config-driven behavior

Optional open-source framework version (without personal data)


📁 Project Structure

voice_ai_agent/
├── src/
│   ├── main.py            # Application entry point
│   │
│   ├── audio/             # Speech-to-text (Whisper)
│   ├── brain/             # LLM reasoning logic
│   ├── safety/            # Intent classification & guardrails
│   ├── tools/             # Sandboxed task execution
│   ├── memory/            # Explicit, user-controlled memory
│   ├── modes/             # Interaction modes (normal, briefing)
│   └── voice/             # Text-to-speech
│
├── user_data/             # Local-only personal data (gitignored)
├── config.yaml            # Runtime configuration
├── requirements.txt
└── README.md

⚠️ Disclaimer

This project is provided for personal and educational use.
No guarantees are made regarding fitness for production or commercial deployment.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AI Voice Assistant System

What is this?

Features

🏗️ System Architecture

Project Status

🔐 Privacy & Safety Principles

🎯 Intended Role Fit

🚀 Getting Started

1. Clone the repository

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
src		src
.gitignore		.gitignore
README.md		README.md
config.yaml		config.yaml
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

AI Voice Assistant System

What is this?

Features

🏗️ System Architecture

Project Status

🔐 Privacy & Safety Principles

🎯 Intended Role Fit

🚀 Getting Started

1. Clone the repository

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages