RAG Bot with GPT-OSS Integration

A Retrieval-Augmented Generation (RAG) bot that integrates with GPT-OSS via Hugging Face and provides access via Slack and WhatsApp. The runtime now starts cleanly on low-resource machines by using a lightweight hashing embedder unless sentence-transformers is installed.

Features

🤖 RAG System: Upload documents and query them using natural language
💬 Slack Integration: Access the bot directly from Slack
📱 WhatsApp Integration: Query via WhatsApp using Twilio
🔍 Vector Search: Powered by ChromaDB for efficient document retrieval
🧠 AI Responses: Uses GPT-OSS via Hugging Face for intelligent answers
🍓 Low-Resource Startup: Runs without pulling GPU-only torch dependencies by default
🔐 Safer Defaults: Optional API key protection, Twilio signature validation, and SSRF protections for scrape/media URLs

Quick Start

1. Setup

chmod +x setup.sh
./setup.sh

2. Configure

Edit .env file with your credentials:

cp .env.example .env
nano .env

Recommended configuration:

API_KEY: Protects REST endpoints such as /upload, /query, /stats, and /research/*
HUGGINGFACE_API_TOKEN: Enables LLM-generated answers. Without it, the bot falls back to context excerpts.

Optional (for integrations):

Slack: SLACK_BOT_TOKEN, SLACK_SIGNING_SECRET
WhatsApp: TWILIO_ACCOUNT_SID, TWILIO_AUTH_TOKEN, TWILIO_PHONE_NUMBER
Public webhook deployments: PUBLIC_BASE_URL

3. Start

./start.sh

By default the app binds to 127.0.0.1. Set HOST=0.0.0.0 only when you intend to expose it behind a firewall or reverse proxy.

API Endpoints

GET / - API information
GET /health - Health check
POST /upload - Upload documents (PDF or text)
POST /query - Query the RAG system
GET /stats - Get knowledge base statistics
POST /slack/events - Slack webhook
POST /whatsapp/webhook - WhatsApp webhook

Usage Examples

Upload a Document

curl -X POST "http://localhost:8000/upload" \
  -H "X-API-Key: your_api_key" \
  -H "accept: application/json" \
  -H "Content-Type: multipart/form-data" \
  -F "file=@document.pdf"

Query the System

curl -X POST "http://localhost:8000/query" \
  -H "X-API-Key: your_api_key" \
  -H "Content-Type: application/json" \
  -d '{"question": "What is machine learning?"}'

Slack Setup

Create a Slack app at https://api.slack.com/apps
Add bot token scopes: app_mentions:read, chat:write, files:read
Enable events: app_mention, file_shared
Set event request URL: https://your-domain.com/slack/events
Install app to workspace

WhatsApp Setup

Create Twilio account at https://www.twilio.com
Set up WhatsApp sandbox or get approved number
Configure webhook URL: https://your-domain.com/whatsapp/webhook
Keep VALIDATE_TWILIO_SIGNATURES=true in production

Raspberry Pi Deployment

System Requirements

Raspberry Pi 4 (4GB+ RAM recommended)
Python 3.10+
16GB+ SD card

Performance Tips

Use SSD instead of SD card for better I/O
Increase swap space if needed
Monitor temperature and use cooling

Auto-start Service

Create systemd service:

sudo nano /etc/systemd/system/ragbot.service

[Unit]
Description=RAG Bot Service
After=network.target

[Service]
Type=simple
User=pi
WorkingDirectory=/home/pi/ragbot
ExecStart=/home/pi/ragbot/.venv/bin/python main.py
Restart=always
RestartSec=10

[Install]
WantedBy=multi-user.target

Enable and start:

sudo systemctl enable ragbot.service
sudo systemctl start ragbot.service

Architecture

┌─────────────────┐    ┌─────────────────┐    ┌─────────────────┐
│   Slack/WhatsApp│    │   FastAPI App   │    │   FlexaAI API   │
│                 │───▶│                 │───▶│                 │
│   User Input    │    │   RAG System    │    │   GPT-OSS-120B  │
└─────────────────┘    └─────────────────┘    └─────────────────┘
                              │
                              ▼
                       ┌─────────────────┐
                       │   ChromaDB      │
                       │   Vector Store  │
                       └─────────────────┘

Troubleshooting

Common Issues

Large dependency installs: The default install no longer requires sentence-transformers. If you want higher-quality embeddings, install it manually after the core app is working.
Slow Responses: Check network connection to Hugging Face
ChromaDB Errors: Ensure write permissions to chroma_db and user_data directories
Import Errors: Activate the virtual environment before running
Webhook validation failures: Set PUBLIC_BASE_URL so Twilio signature checks see the public URL Twilio calls

Logs

Check application logs:

tail -f logs/ragbot.log

Health Check

curl http://localhost:8000/health

Contributing

Fork the repository
Create feature branch
Make changes
Test on Raspberry Pi
Submit pull request

License

MIT License - see LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
tests		tests
.env.example		.env.example
.env.example.new		.env.example.new
.gitignore		.gitignore
COMPLETE_INTEGRATION_SUMMARY.md		COMPLETE_INTEGRATION_SUMMARY.md
DEPLOYMENT.md		DEPLOYMENT.md
EC2_ASTERISK_SETUP.md		EC2_ASTERISK_SETUP.md
INTEGRATION_GUIDE.md		INTEGRATION_GUIDE.md
INTEGRATION_SUMMARY.md		INTEGRATION_SUMMARY.md
PDF_GUIDE.md		PDF_GUIDE.md
QUICK_START_EC2.md		QUICK_START_EC2.md
QUO_INTEGRATION.md		QUO_INTEGRATION.md
README.md		README.md
VOICE_SETUP.md		VOICE_SETUP.md
config.py		config.py
embeddings.py		embeddings.py
fix_service.sh		fix_service.sh
infobip_client.py		infobip_client.py
install_ec2_asterisk.sh		install_ec2_asterisk.sh
install_service.sh		install_service.sh
main.py		main.py
quo_client.py		quo_client.py
rag_system.py		rag_system.py
ragbot.service		ragbot.service
requirements.txt		requirements.txt
requirements.txt.new		requirements.txt.new
security_best_practices_report.md		security_best_practices_report.md
security_utils.py		security_utils.py
setup.sh		setup.sh
setup_cloudflare.sh		setup_cloudflare.sh
slack_bot.py		slack_bot.py
start.sh		start.sh
telegram_bot.py		telegram_bot.py
telegram_bot_hf.py		telegram_bot_hf.py
test_api.py		test_api.py
test_pdf_upload.py		test_pdf_upload.py
test_whatsapp.py		test_whatsapp.py
user_manager.py		user_manager.py
user_rag_system.py		user_rag_system.py
voice_agent.py		voice_agent.py
voice_agent_v2.py		voice_agent_v2.py
web_research.py		web_research.py
whatsapp_bot.py		whatsapp_bot.py
whatsapp_bot_infobip.py		whatsapp_bot_infobip.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RAG Bot with GPT-OSS Integration

Features

Quick Start

1. Setup

2. Configure

3. Start

API Endpoints

Usage Examples

Upload a Document

Query the System

Slack Setup

WhatsApp Setup

Raspberry Pi Deployment

System Requirements

Performance Tips

Auto-start Service

Architecture

Troubleshooting

Common Issues

Logs

Health Check

Contributing

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

RAG Bot with GPT-OSS Integration

Features

Quick Start

1. Setup

2. Configure

3. Start

API Endpoints

Usage Examples

Upload a Document

Query the System

Slack Setup

WhatsApp Setup

Raspberry Pi Deployment

System Requirements

Performance Tips

Auto-start Service

Architecture

Troubleshooting

Common Issues

Logs

Health Check

Contributing

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages