Skip to content

pn-dev-in/AI-Voice-Assistant-System

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

18 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

AI Voice Assistant System

Local-First Voice AI with Memory, Safety Filtering & Task Execution

What is this?

A local-first AI voice assistant that supports memory, task execution, safety filtering, and daily planning using speech input and LLM reasoning.

The system uses Whisper for speech recognition, Groq-hosted LLMs for reasoning, and a structured orchestration pipeline for controlled execution.

Features

  • Push-to-talk voice interaction (Ctrl + Alt + Space)
  • Speech-to-text using Whisper
  • LLM reasoning via Groq (Llama 3.1)
  • Long-term memory (remember & recall facts)
  • Task execution (note saving & reading)
  • Safety intent filtering
  • Daily briefing planning mode
  • Local-first architecture

🏗️ System Architecture

Mic Input
→ Speech-to-Text (Whisper)
→ Mode Detection
→ Memory / Task / Query Routing
→ Safety Filtering
→ LLM Reasoning
→ Text-to-Speech Output


This layered design ensures clear separation of concerns, predictable behavior, and strong safety boundaries.


Project Status

Current:

  • Core voice pipeline implemented
  • Memory + task tools operational
  • Safety layer integrated
  • Daily briefing mode implemented

Planned:

  • UI layer
  • Cloud deployment (Azure App Service)
  • Modular tool expansion

🔐 Privacy & Safety Principles

This assistant is designed to be trustworthy by default.

  • No always-on microphone
  • No silent memory storage
  • No OS-level or shell command execution
  • No user data stored remotely
  • All personal data remains local to the machine
  • LLM access is used strictly for reasoning and language generation

🎯 Intended Role Fit

This project is aligned with:

  • AI/ML Engineer (Applied Systems)
  • AI Engineer – Voice / Conversational AI
  • Software Engineer (AI-integrated systems)

🚀 Getting Started

1. Clone the repository

git clone https://github.com/<your-username>/<repo-name>.git
cd <repo-name>
2. Create a virtual environment
python -m venv venv
venv\Scripts\activate
3. Install dependencies
pip install -r requirements.txt
4. Configure environment variables
Create a .env file in the project root:

GROQ_API_KEY=your_api_key_here
Note: API keys and personal data are never committed to the repository.

5. Run the assistant
python src/main.py
Press Ctrl + Alt + Space to speak

Press ESC to exit

🧠 Why This Project Exists
This assistant was built to reduce friction in daily thinking and productivity by providing a private, voice-first interface to intelligence, instead of repeatedly opening web-based chat applications.

The goal is not autonomy.
The goal is useful presence with human control.

📌 Notes
This project is intended primarily for personal use

Users must supply their own API keys

Personal memory and notes are stored locally and never shared

Future improvements are guided by real usage, not feature bloat

🔮 Future Work (Optional)

Performance tuning

Config-driven behavior

Optional open-source framework version (without personal data)


📁 Project Structure

voice_ai_agent/
├── src/
│   ├── main.py            # Application entry point
│   │
│   ├── audio/             # Speech-to-text (Whisper)
│   ├── brain/             # LLM reasoning logic
│   ├── safety/            # Intent classification & guardrails
│   ├── tools/             # Sandboxed task execution
│   ├── memory/            # Explicit, user-controlled memory
│   ├── modes/             # Interaction modes (normal, briefing)
│   └── voice/             # Text-to-speech
│
├── user_data/             # Local-only personal data (gitignored)
├── config.yaml            # Runtime configuration
├── requirements.txt
└── README.md

⚠️ Disclaimer

This project is provided for personal and educational use.
No guarantees are made regarding fitness for production or commercial deployment.

About

An applied AI assistant demonstrating voice interaction, LLM reasoning, intent classification, and safe tool execution in a modular, local-first architecture.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages