RAGScope Pro: Insight & Observability Platform

RAGScope Pro is an enterprise-grade visualization and experimentation workbench for Retrieval-Augmented Generation (RAG) systems.

Designed for AI Engineers and Data Scientists, this platform moves beyond simple "chat with PDF" tutorials. (HarryPotter) It provides a rigorous environment to benchmark, visualize, and debug complex retrieval strategies—including Hybrid Search, Reranking, and HyDE—using real-time execution logs and cost analysis.

graph TD
    subgraph Frontend_Layer [UI and Observability]
        UI[Streamlit Interface]
        Monitor[Cost and Latency Monitoring]
        Trace[Execution Tracing]
        ABTest[A/B Testing Dashboard]
    end

    subgraph Orchestration_Layer [LangChain Logic]
        Router[Query Router]
        Strategies[RAG Strategies: Hybrid / HyDE / Multi-Query]
        Reranker[Cross-Encoder Reranker]
    end

    subgraph Data_Layer [Knowledge Base]
        VectorDB[ChromaDB - Vector Search]
        BM25[BM25 - Keyword Search]
        Docs[Raw Documents]
    end

    subgraph Inference_Layer [LLM Provider]
        Groq[Groq API]
        Llama[Llama 3 70B]
    end

    %% Flow of Information
    UI --> Router
    Router --> Strategies
    Strategies --> VectorDB
    Strategies --> BM25
    
    VectorDB --> Reranker
    BM25 --> Reranker
    
    Reranker --> Groq
    Groq --> Llama
    Llama --> UI
    
    %% Monitoring Feedback
    Groq -.-> Monitor
    Strategies -.-> Trace
    Llama -.-> ABTest

    %% Styling
    style Frontend_Layer fill:#f0f0f0,stroke:#333
    style Orchestration_Layer fill:#e1f5fe,stroke:#01579b
    style Data_Layer fill:#f1f8e9,stroke:#33691e
    style Inference_Layer fill:#fff3e0,stroke:#e65100

Key Features

Advanced RAG Strategies

Implements 8 production-ready patterns to handle complex queries:

Hybrid Search: Weighted ensemble of BM25 (Keyword) and Vector Search (Semantic).
Reranking: Second-pass relevance scoring using Cross-Encoder logic.
HyDE (Hypothetical Document Embeddings): Generates hallucinated answers to bridge the semantic gap.
Multi-Query & Sub-Query: Query expansion and decomposition for complex reasoning.
Parent-Document Retrieval: Returns full context from small, precise index chunks.

Observability & Analytics

A/B Testing Dashboard: Compare two different RAG pipelines side-by-side (e.g., Vector Only vs. Hybrid + Rerank).
Execution Tracing: Real-time logging of every step (Query Rewriting -> Retrieval -> Scoring -> Generation).
Cost & Latency Monitoring: Live calculation of token usage and processing time per query.

Interactive Learning Module

CS101-Style Visuals: Built-in educational module with Graphviz flowcharts explaining "How It Works" for each technique.

Tech Stack

Component	Technology	Description
Frontend	Streamlit	Reactive web interface with custom CSS styling.
LLM Inference	Groq API	Ultra-low latency inference for Llama 3 (70B).
Orchestration	LangChain	Chain management and prompt engineering.
Vector DB	ChromaDB	Local, persistent vector storage for embeddings.
Embeddings	HuggingFace	`all-MiniLM-L6-v2` for efficient semantic encoding.
Keyword Search	BM25	Sparse retrieval for exact match capabilities.
Visualization	Graphviz	Automated flowchart generation for system architecture.

Project Structure

ragscope-pro/
├── data/                   # Raw knowledge base (.txt files)
├── processed_data/         # Persisted Vector Database (ChromaDB)
├── src/
│   ├── modules/            # Core Business Logic
│   │   ├── config.py       # Global settings & presets
│   │   ├── database.py     # Vector DB & File I/O operations
│   │   ├── llm.py          # LLM Provider initialization
│   │   ├── rag_pipeline.py # RAG Algorithms & Logic
│   │   ├── languages.py    # Localization (EN/TH)
│   │   ├── ui.py           # UI Components & CSS
│   │   └── visuals.py      # Graphviz Flowchart Rendering
│   ├── app.py              # Main Application Entry Point
│   └── ingest.py           # Data Processing Script
├── requirements.txt        # Dependency list
└── README.md               # Documentation

Installation & Setup

1. Prerequisites

Ensure you have Python 3.9+ and Graphviz installed on your system (required for flowcharts).

# MacOS
brew install graphviz

# Windows
winget install graphviz

2. Clone Repository

git clone https://github.com/sitta07/RAGScope.git
cd ragscope-pro

3. Virtual Environment

python -m venv venv
source venv/bin/activate  # Windows: venv\Scripts\activate
pip install --upgrade pip

4. Install Dependencies

pip install -r requirements.txt

5. Ingest Data (Build the Brain)

Place your .txt files in the data/ folder (default includes Harry Potter lore), then run:

python src/ingest.py

This will generate the processed_data/ directory containing the Vector Index.

Usage

Run the application:

streamlit run src/app.py

Configuration (Bring Your Own Key)

This application uses Groq API for high-speed inference.

Get a free API Key at console.groq.com.

Enter the key in the Welcome Screen when the app launches.

(Optional) For deployment, set groq_api_key in Streamlit Secrets.

👨‍💻 Author

Sitta Boonkaew
AI Engineer Intern @ AI SmartTech

📄 License

This project is a personal project .

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RAGScope Pro: Insight & Observability Platform

Key Features

Advanced RAG Strategies

Observability & Analytics

Interactive Learning Module

Tech Stack

Project Structure

Installation & Setup

1. Prerequisites

2. Clone Repository

3. Virtual Environment

4. Install Dependencies

5. Ingest Data (Build the Brain)

Usage

Configuration (Bring Your Own Key)

👨‍💻 Author

📄 License

About

Uh oh!

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
data		data
processed_data		processed_data
src		src
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt

sitta07/RAGScope

Folders and files

Latest commit

History

Repository files navigation

RAGScope Pro: Insight & Observability Platform

Key Features

Advanced RAG Strategies

Observability & Analytics

Interactive Learning Module

Tech Stack

Project Structure

Installation & Setup

1. Prerequisites

2. Clone Repository

3. Virtual Environment

4. Install Dependencies

5. Ingest Data (Build the Brain)

Usage

Configuration (Bring Your Own Key)

👨‍💻 Author

📄 License

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages