🎤 Alexa-like Voice Assistant

A sophisticated conversational AI voice assistant built with AWS Strands, Nova Premiere, and multi-agent guardrails. Experience natural voice conversations with enterprise-grade safety validation.

✨ Features

🎤 Voice & Text Input: Seamless speech recognition and text input
🔊 Text-to-Speech: Natural voice responses with speech synthesis
🛡️ Multi-Layer Safety: Advanced guardrails for content validation
💬 Conversational AI: Context-aware responses using Nova Premiere
🌐 Modern Web Interface: Responsive, accessible chat interface
⚡ Real-time Processing: Fast multi-agent orchestration
📊 Health Monitoring: System status and performance metrics
🔄 Session Management: Persistent conversation context

🏗️ Architecture

Multi-Agent System

ChatAgent: Generates natural conversational responses using Nova Premiere model
SafetyAgent: Validates responses using Strands guardrails (toxicity, relevance, grounding)
Orchestrator: Coordinates multi-agent workflow and session management

Technology Stack

Backend: FastAPI with async processing
Frontend: Vanilla JavaScript with Web Speech APIs
AI Model: AWS Nova Premiere via Strands SDK
Safety: Built-in Strands guardrails system
Deployment: Docker containerization ready

🚀 Quick Start

Prerequisites

Python 3.8+
AWS account with Bedrock access
Modern web browser (Chrome, Firefox, Safari, Edge)

Option 1: Automated Setup (Recommended)

Windows:

scripts\start.bat

Linux/macOS:

chmod +x scripts/start.sh
./scripts/start.sh

Option 2: Manual Setup

Clone and setup environment:

git clone <repository>
cd alexa-voice-assistant
python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate
pip install -r requirements.txt

Configure AWS credentials:

cp .env.example .env

Edit .env file:

AWS_REGION=us-east-1
AWS_ACCESS_KEY_ID=your_access_key_here
AWS_SECRET_ACCESS_KEY=your_secret_key_here
DEBUG=True
LOG_LEVEL=INFO

Test the system:

python test_system.py

Start the application:

python main.py

Open your browser: Navigate to http://localhost:8000

🐳 Docker Deployment

Using Docker Compose (Recommended)

# Set environment variables
export AWS_ACCESS_KEY_ID=your_key
export AWS_SECRET_ACCESS_KEY=your_secret

# Start the application
docker-compose up -d

Using Docker directly

# Build the image
docker build -t alexa-voice-assistant .

# Run the container
docker run -d \
  -p 8000:8000 \
  -e AWS_ACCESS_KEY_ID=your_key \
  -e AWS_SECRET_ACCESS_KEY=your_secret \
  -e AWS_REGION=us-east-1 \
  alexa-voice-assistant

💬 Usage Guide

Voice Interaction

Click the microphone button 🎤 to start voice recognition
Speak clearly into your microphone
Wait for processing - the system will convert speech to text
Listen to the response - the assistant will speak back to you

Text Interaction

Type your message in the text input field
Press Enter or click the Send button
Read and listen to the assistant's response

Interface Controls

🎤 Speak Button: Activate voice recognition
Send Button: Submit text message
Clear Button: Reset conversation history
System Status: View health and performance metrics

Browser Permissions

Microphone Access: Required for voice input
Audio Playback: Required for voice responses
JavaScript: Required for full functionality

🛡️ Safety Features

Multi-Layer Validation

Toxicity Detection: Blocks harmful or offensive content
Relevance Filtering: Ensures responses are on-topic
Grounding Checks: Prevents hallucinations and false information
Content Safety: Validates appropriateness of responses

Fallback Responses

When safety violations are detected, the system provides appropriate fallback messages:

"I'm sorry, I can't provide that type of information. Let's try a different topic."
"I'm not sure about that. Could you ask me something else?"
"I don't have reliable information about that right now."

📊 Monitoring & Health

System Health Endpoint

curl http://localhost:8000/api/health

Health Indicators

Green: System operating normally
Yellow: Minor issues detected
Red: System errors or failures

Metrics Available

Active session count
Safety incident statistics
Response time performance
Agent status monitoring

🔧 Configuration

Environment Variables

Variable	Description	Default
`AWS_REGION`	AWS region for Bedrock	`us-east-1`
`AWS_ACCESS_KEY_ID`	AWS access key	Required
`AWS_SECRET_ACCESS_KEY`	AWS secret key	Required
`DEBUG`	Enable debug mode	`False`
`LOG_LEVEL`	Logging level	`INFO`
`SESSION_TIMEOUT`	Session timeout (seconds)	`3600`

Model Configuration

Model: amazon.nova-premiere-v1:0
Temperature: 0.7 (balanced creativity)
Max Tokens: 2048 (sufficient for conversations)
Top-p: 0.9 (focused responses)

🧪 Testing

Run System Tests

python test_system.py

Test Coverage

✅ Agent initialization and configuration
✅ Multi-agent orchestration workflow
✅ Safety validation and guardrails
✅ Session management and cleanup
✅ Error handling and recovery
✅ API endpoint functionality

🔍 Troubleshooting

Common Issues

"Speech recognition not supported"

Use a modern browser (Chrome, Firefox, Safari, Edge)
Ensure HTTPS or localhost for microphone access

"AWS credentials not found"

Check your .env file configuration
Verify AWS_ACCESS_KEY_ID and AWS_SECRET_ACCESS_KEY are set
Ensure your AWS account has Bedrock access

"Model unavailable"

Verify your AWS region supports Nova Premiere
Check your AWS account has Bedrock model access
Try switching to us-east-1 region

"Microphone permission denied"

Allow microphone access in browser settings
Refresh the page after granting permissions
Check browser security settings

Debug Mode

Enable debug mode for detailed logging:

export DEBUG=True
export LOG_LEVEL=DEBUG
python main.py

Logs Location

Development: Console output
Docker: /app/logs/ directory
Production: Configure external log aggregation

🤝 Contributing

Development Setup

Fork the repository
Create a feature branch
Make your changes
Run tests: python test_system.py
Submit a pull request

Code Style

Follow PEP 8 guidelines
Use type hints where appropriate
Add docstrings for functions and classes
Include error handling and logging

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

🆘 Support

For issues and questions:

Check the troubleshooting section above
Review the system health endpoint
Check application logs for error details
Ensure all prerequisites are met

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
agents		agents
models		models
scripts		scripts
static		static
templates		templates
utils		utils
.env.example		.env.example
.gitignore		.gitignore
DEPLOYMENT.md		DEPLOYMENT.md
Dockerfile		Dockerfile
PROJECT_STRUCTURE.md		PROJECT_STRUCTURE.md
README.md		README.md
app.py		app.py
docker-compose.yml		docker-compose.yml
main.py		main.py
requirements.txt		requirements.txt
test_system.py		test_system.py

Folders and files

Latest commit

History

Repository files navigation

🎤 Alexa-like Voice Assistant

✨ Features

🏗️ Architecture

Multi-Agent System

Technology Stack

🚀 Quick Start

Prerequisites

Option 1: Automated Setup (Recommended)

Option 2: Manual Setup

🐳 Docker Deployment

Using Docker Compose (Recommended)

Using Docker directly

💬 Usage Guide

Voice Interaction

Text Interaction

Interface Controls

Browser Permissions

🛡️ Safety Features

Multi-Layer Validation

Fallback Responses

📊 Monitoring & Health

System Health Endpoint

Health Indicators

Metrics Available

🔧 Configuration

Environment Variables

Model Configuration

🧪 Testing

Run System Tests

Test Coverage

🔍 Troubleshooting

Common Issues

Debug Mode

Logs Location

🤝 Contributing

Development Setup

Code Style

📄 License

🆘 Support

🔮 Future Enhancements

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages