RAGio - AI-Powered Document Analysis Platform

A comprehensive full-stack application that combines document processing, AI-powered chat, and intelligent document analysis using Retrieval-Augmented Generation (RAG) technology.

Features • Installation • Documentation • Deployment • Contributing

Screenshots

Landing Page

Document Management

PDF Document Chat

CSV Document Chat

Overview

RAGio is an enterprise-grade document analysis platform that leverages advanced AI technologies to help users interact with their documents intelligently. The platform uses Retrieval-Augmented Generation (RAG) to provide accurate, context-aware responses based on document content.

Key Capabilities

Multi-format Document Processing: PDF, CSV, TXT
AI-Powered Chat: Context-aware conversations with your documents
Semantic Search: Advanced vector search using Qdrant
Multi-Document Analysis: Ask questions across multiple documents simultaneously
Secure Authentication: JWT-based user authentication and authorization
Real-time Monitoring: Comprehensive observability with Prometheus, Loki, and Grafana

Features

AI-Powered Document Analysis

RAG Technology: Multi-stage retrieval-augmented generation pipeline
- Query expansion using LLM
- Hybrid retrieval (dense + sparse/BM25)
- BGE re-ranking for quality
- Cross-encoder verification
- Context assembly with deduplication
Multi-Document Chat: Simultaneous analysis across multiple documents
Intelligent Search: Semantic vector search with Qdrant
Cross-Document Analysis: Compare and synthesize information from multiple sources
Source Attribution: Clear indication of information sources with page references

Document Processing

Multi-Format Support: PDF, CSV, TXT files
Document Viewer: Built-in viewers for various file formats
File Management: Secure storage with MinIO object storage
Background Processing: Async document processing with Celery

Advanced Chat System

Context-Aware Conversations: AI remembers conversation context
Multi-Document Chat: Ask questions spanning multiple documents
Real-time Responses: Fast, intelligent answers with source citations
Chat History: Persistent conversation history with caching
Token Management: Intelligent handling of large documents
Suggested Questions: AI-generated question suggestions

Security & Authentication

JWT Authentication: Secure user authentication with access and refresh tokens
User Management: Registration, login, and session management
File Ownership: Secure document access control
CORS Protection: Configurable cross-origin request security
Rate Limiting: API rate limiting to prevent abuse
Input Validation: Comprehensive data validation

Performance & Scalability

Database Optimization: PostgreSQL with connection pooling
Vector Search: High-performance semantic search with Qdrant
Background Processing: Async document processing with Celery workers
Health Monitoring: Comprehensive system monitoring
Production Ready: Multi-worker uvicorn configuration

Observability

Prometheus Metrics: Application and system metrics
Loki Logging: Centralized log aggregation
Grafana Dashboards: Pre-configured dashboards for monitoring
Health Checks: Detailed health endpoints
Performance Tracking: Request tracing and performance monitoring

Technology Stack

Frontend

React 18.3.1 - Modern UI framework
React Router 6 - Client-side routing
Axios - HTTP client for API communication
Zustand - State management
React Hook Form - Form management
Tailwind CSS - Utility-first CSS framework
Flowbite React - UI component library

Backend

FastAPI 0.104.1 - High-performance Python web framework
SQLAlchemy 2.0 - Database ORM
PostgreSQL 15 - Primary database
Qdrant - Vector database for semantic search
MinIO - Object storage for documents
Redis 7 - Caching and Celery broker
Celery 5.3 - Distributed task queue
Uvicorn - ASGI server

AI/ML

LangChain - AI/ML framework
OpenRouter API - LLM provider (Mistral, Llama, etc.)
Sentence Transformers - Text embeddings (all-MiniLM-L6-v2)
Cross-Encoder - Re-ranking models (ms-marco-MiniLM-L-6-v2)
Hugging Face Transformers - Model loading and inference

DevOps & Monitoring

Docker - Containerization
Docker Compose - Multi-service orchestration
Prometheus - Metrics collection
Loki - Log aggregation
Grafana - Visualization and dashboards
Flower - Celery monitoring

Architecture

Key Components

Frontend: React SPA with modern UI components
Backend API: FastAPI REST API with async support
RAG Pipeline: Multi-stage retrieval and generation pipeline
Vector Database: Qdrant for semantic search
Object Storage: MinIO for document storage
Task Queue: Celery for background document processing
Monitoring: Prometheus, Loki, and Grafana stack

Prerequisites

Before you begin, ensure you have the following installed:

Python 3.12+
Node.js 18+ and npm
Docker and Docker Compose
Git

System Requirements

RAM: Minimum 8GB (16GB recommended)
Storage: 10GB+ free space
CPU: Multi-core processor recommended

Installation

1. Clone the Repository

git clone https://github.com/your-username/ragio.git
cd ragio

2. Backend Setup

cd backend

# Create virtual environment
python3 -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

# Install dependencies
pip install -r requirements.txt

# Copy environment template
cp .env.example .env
# Edit .env with your configuration

3. Frontend Setup

cd frontend

# Install dependencies
npm install

# Copy environment template
cp .env.example .env
# Edit .env and set REACT_APP_API_URL

4. Start Services

cd backend

# Start all Docker services (PostgreSQL, Redis, MinIO, Qdrant, etc.)
docker-compose up -d

# Start the backend server
python start.py

# Or start manually
uvicorn app.main:app --reload --host 0.0.0.0 --port 8080

5. Start Frontend

cd frontend
npm start

The application will be available at:

Frontend: http://localhost:3000
Backend API: http://localhost:8080
API Docs: http://localhost:8080/docs
Grafana: http://localhost:3001
MinIO Console: http://localhost:9001

Configuration

Backend Environment Variables

Create backend/.env from backend/.env.example:

# Application
ENVIRONMENT=development
HOST=0.0.0.0
PORT=8080

# Database
DATABASE_URL=postgresql://postgres:admin@localhost:5432/hcp

# Qdrant
QDRANT_HOST=localhost
QDRANT_PORT=6333

# MinIO
MINIO_ENDPOINT=localhost:9000
MINIO_ACCESS_KEY=minioadmin
MINIO_SECRET_KEY=minioadmin
MINIO_BUCKET_NAME=documents

# Redis
REDIS_HOST=localhost
REDIS_PORT=6379

# Security (IMPORTANT: Generate strong keys!)
SECRET_KEY=your-secret-key-here
REFRESH_SECRET_KEY=your-refresh-secret-key-here

# CORS
ALLOWED_ORIGINS=http://localhost:3000
ALLOWED_HOSTS=localhost,127.0.0.1

# AI/ML
OPENROUTER_API_KEY=your-openrouter-api-key
OPENROUTER_MODEL=mistralai/mistral-7b-instruct:free

Frontend Environment Variables

Create frontend/.env:

REACT_APP_API_URL=http://localhost:8080

Generate Secret Keys

python3 -c "import secrets; print(secrets.token_urlsafe(32))"

API Documentation

Authentication Endpoints

POST /api/auth/register    # User registration
POST /api/auth/login        # User login
POST /api/auth/refresh     # Refresh access token
POST /api/auth/logout      # User logout

Document Management

POST   /api/document/upload           # Upload and process documents
GET    /api/document/files            # List user's documents
GET    /api/document/{id}              # Get document details
DELETE /api/document/{id}              # Delete document
GET    /api/document/{id}/download    # Download document

Chat Endpoints

POST /api/chat/{file_id}                    # Chat with single document
GET  /api/chat/messages/{file_id}           # Get chat history

POST /api/chat/general                      # General chat (all documents)
GET  /api/chat/general/messages             # General chat history

Health & Monitoring

GET /api/health/           # Basic health check
GET /api/health/detailed   # Detailed health with metrics
GET /api/health/metrics    # System and application metrics
GET /metrics               # Prometheus metrics endpoint

Interactive API Documentation

Swagger UI: http://localhost:8080/docs
ReDoc: http://localhost:8080/redoc

Quick Start

# Backend
cd backend
cp .env.example .env
# Edit .env with production values
docker-compose -f docker-compose.prod.yml up -d --build

# Frontend
cd frontend
cp .env.example .env
# Set REACT_APP_API_URL to production backend URL
npm run build
# Deploy build/ folder to your web server

Docker Compose Production

docker-compose -f docker-compose.prod.yml up -d

Environment Variables for Production

Set ENVIRONMENT=production
Generate strong SECRET_KEY and REFRESH_SECRET_KEY
Update ALLOWED_ORIGINS with production domains
Configure SSL/HTTPS
Set up proper database credentials

Project Structure

ragio/
├── backend/
│   ├── app/
│   │   ├── db/              # Database models and connection
│   │   ├── middleware/      # Custom middleware (error, rate limit, etc.)
│   │   ├── routes/          # API endpoints
│   │   ├── services/        # Business logic (RAG, chat, etc.)
│   │   ├── tasks/           # Celery background tasks
│   │   └── utils/           # Utilities (logger, auth, etc.)
│   ├── grafana/             # Grafana dashboards and provisioning
│   ├── loki/                # Loki configuration
│   ├── prometheus/          # Prometheus configuration
│   ├── promtail/            # Promtail configuration
│   ├── docker-compose.yml   # Development docker-compose
│   ├── docker-compose.prod.yml  # Production docker-compose
│   ├── Dockerfile           # Multi-stage Dockerfile
│   ├── requirements.txt     # Python dependencies
│   ├── start.py             # Startup script
│   └── .env.example         # Environment template
│
├── frontend/
│   ├── public/              # Static assets
│   ├── src/
│   │   ├── components/      # React components
│   │   ├── hooks/           # Custom React hooks
│   │   ├── stores/          # Zustand state management
│   │   └── utils/           # Utilities
│   ├── package.json         # Node dependencies
│   └── .env.example         # Environment template
│
└── README.md                # This file

Monitoring & Observability

Grafana Dashboards

Access Grafana at http://localhost:3001 (default credentials: admin/admin)

Pre-configured dashboards:

API Performance: Request rates, latency, error rates
Backend Latency: Response time metrics
Celery Workers: Task queue monitoring
Error Rate: Error tracking and alerts
RAG Pipeline: RAG-specific metrics
Logs Overview: Centralized log viewing

Prometheus Metrics

Application Metrics: Custom business metrics
System Metrics: CPU, memory, disk usage
Database Metrics: Connection pool stats
Celery Metrics: Task execution metrics

Health Checks

# Basic health
curl http://localhost:8080/api/health/

# Detailed health with metrics
curl http://localhost:8080/api/health/detailed

# Prometheus metrics
curl http://localhost:8080/metrics

Testing

Integration Tests

# Test API endpoints
curl http://localhost:8080/api/health/

# Test authentication
curl -X POST http://localhost:8080/api/auth/register \
  -H "Content-Type: application/json" \
  -d '{"email":"test@example.com","password":"test123"}'

Contributing

Contributions are welcome! Please follow these steps:

Fork the repository
Create a feature branch: git checkout -b feature/amazing-feature
Make your changes
Commit changes: git commit -m 'Add amazing feature'
Push to branch: git push origin feature/amazing-feature
Open a Pull Request

Code Guidelines

Backend: Follow PEP 8 style guidelines
Frontend: Use ESLint and Prettier
Tests: Add tests for new features
Documentation: Update docs for API changes
Commits: Use conventional commit messages

License

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgments

FastAPI - Excellent Python web framework
React - Powerful frontend library
LangChain - AI/ML framework
Qdrant - Vector search database
MinIO - Object storage
OpenRouter - LLM API provider
Hugging Face - Model hub and transformers

Support

API Docs: http://localhost:8080/docs (when running)
Issues: GitHub Issues
Discussions: GitHub Discussions

Roadmap

Upcoming Features

Performance Improvements

Built with FastAPI, React, PostgreSQL, Redis, Qdrant, MinIO, and AI/ML technologies

Star this repo if you find it helpful!

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
Images		Images
backend		backend
frontend		frontend
.gitignore		.gitignore
README.md		README.md

Folders and files

Latest commit

History

Repository files navigation

RAGio - AI-Powered Document Analysis Platform

Screenshots

Landing Page

Document Management

PDF Document Chat

CSV Document Chat

Table of Contents

Overview

Key Capabilities

Features

AI-Powered Document Analysis

Document Processing

Advanced Chat System

Security & Authentication

Performance & Scalability

Observability

Technology Stack

Frontend

Backend

AI/ML

DevOps & Monitoring

Architecture

Key Components

Prerequisites

System Requirements

Installation

1. Clone the Repository

2. Backend Setup

3. Frontend Setup

4. Start Services

5. Start Frontend

Configuration

Backend Environment Variables

Frontend Environment Variables

Generate Secret Keys

API Documentation

Authentication Endpoints

Document Management

Chat Endpoints

Health & Monitoring

Interactive API Documentation

Quick Start

Docker Compose Production

Environment Variables for Production

Project Structure

Monitoring & Observability

Grafana Dashboards

Prometheus Metrics

Health Checks

Testing

Integration Tests

Contributing

Code Guidelines

License

Acknowledgments

Support

Roadmap

Upcoming Features

Performance Improvements

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages