An educational application designed to demonstrate the implementation of a Chat interface with Large Language Models (LLMs) and, in future steps, Retrieval-Augmented Generation (RAG).
This project uses Flask for the backend, OpenRouter for LLM access (supporting models such as GPT-4, Claude 3, and Llama 3), and vanilla JavaScript for a clean, streaming chat interface.
- Python 3.11+
- uv (for package management)
- An OpenRouter API Key
1. Clone the repository

   ```bash
   git clone https://github.com/yourusername/chat-rag-explorer.git
   cd chat-rag-explorer
   uv sync
   uv run pytest
   ```

2. Set up the environment variables

   ```bash
   cp .env.example .env
   ```

   Edit `.env` and add your API key: `OPENROUTER_API_KEY=sk-or-v1-your-key-here`

   See Logging Configuration for optional logging settings.

3. Run the application

   ```bash
   uv run main.py
   ```

   Port in use? The app auto-finds an available port (8000-8004).

4. Explore: Open your browser to http://127.0.0.1:8000.
- Real-time Streaming: Server-Sent Events (SSE) to stream LLM responses token-by-token
- Model Selection: Dynamic model picker with all available OpenRouter models, grouped by provider
- Conversation History: Multi-turn conversation support with context retention
- Metrics Sidebar: Real-time session metrics including token usage
- Markdown Support: Secure rendering using Marked.js and DOMPurify (works offline)
- Clean UI: Responsive interface built with vanilla HTML/CSS/JS
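The streaming feature above can be sketched as a minimal Flask SSE endpoint. This is an illustrative sketch, not the project's actual `routes.py`: `fake_llm_tokens` stands in for the real OpenRouter streaming client, and the route path is an assumption.

```python
from flask import Flask, Response

app = Flask(__name__)

def fake_llm_tokens():
    # Stand-in for the OpenRouter streaming client: yields tokens one at a time.
    yield from ["Hello", ",", " world"]

@app.route("/api/chat")
def chat():
    def event_stream():
        # One SSE event per token; the browser appends each token as it arrives.
        for token in fake_llm_tokens():
            yield f"data: {token}\n\n"
        yield "data: [DONE]\n\n"  # sentinel so the client knows the stream ended
    return Response(event_stream(), mimetype="text/event-stream")
```

The `text/event-stream` mimetype is what lets the browser's `EventSource` (or a `fetch` reader) consume the response incrementally instead of waiting for the full body.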
To ingest your own documents for RAG retrieval, see `utils/README.md` for the CLI tools:
- `split.py` - Split large markdown files into chapters by heading pattern
- `ingest.py` - Two-phase workflow: preview chunks → inspect → ingest to ChromaDB
The ingest tool writes human-readable chunk previews to `data/chunks/` so you can tune chunking parameters before committing to the vector database.
Sample Data Included: A pre-built ChromaDB with ~2,000 chunks from the D&D SRD 5.2 ships at `data/chroma_db_sample/`. To use it, configure your RAG settings to point to this path.
The sections below provide deeper insight into the application's architecture, testing, logging system, and development roadmap.
```
chat-rag-explorer/
├── chat_rag_explorer/           # Main package
│   ├── static/                  # CSS, JS, and local libraries
│   ├── templates/               # HTML templates
│   ├── __init__.py              # App factory
│   ├── logging.py               # Centralized logging configuration
│   ├── routes.py                # Web endpoints
│   ├── services.py              # LLM integration logic
│   ├── rag_config_service.py    # ChromaDB connection management
│   ├── prompt_service.py        # System prompt CRUD operations
│   └── chat_history_service.py  # Conversation logging to JSONL
├── utils/                       # CLI utilities for content preparation
│   ├── README.md                # Utility documentation
│   ├── split.py                 # Split large markdown files into chapters
│   └── ingest.py                # Ingest markdown into ChromaDB
├── data/
│   ├── corpus/                  # Source markdown documents
│   ├── chunks/                  # Chunk previews for inspection (gitignored)
│   ├── chroma_db/               # Your ChromaDB vector store (gitignored)
│   └── chroma_db_sample/        # Pre-built sample DB (D&D SRD 5.2)
├── prompts/                     # System prompt templates (markdown)
├── logs/                        # Application logs (gitignored)
├── tests/                       # Test suite
├── config.py                    # Configuration settings (environment variable mapping)
├── main.py                      # Application entry point
├── pyproject.toml               # Dependencies and project metadata (uv)
├── .env.example                 # Template for environment variables (.env)
└── .env                         # Secrets and local overrides (gitignored)
```
- Modular Architecture: Flask Blueprints and Application Factory pattern
- Centralized Logging: Request ID correlation and configurable log levels
- Modern Python Tooling: Uses `uv` for fast dependency management
The application features a comprehensive logging system for debugging and monitoring.
Set these environment variables in your `.env` file:

| Variable | Default | Description |
|---|---|---|
| `LOG_LEVEL_APP` | `DEBUG` | Log level for application code |
| `LOG_LEVEL_DEPS` | `INFO` | Log level for dependencies (Flask, httpx, etc.) |
| `LOG_TO_STDOUT` | `true` | Output logs to console |
| `LOG_TO_FILE` | `true` | Write logs to file |
| `LOG_FILE_PATH` | `logs/app.log` | Path to log file |
| `CHAT_HISTORY_ENABLED` | `false` | Enable chat interaction logging |
| `CHAT_HISTORY_PATH` | `logs/chat-history.jsonl` | Path to chat history file |
Startup Banner: On application start, the app logs a configuration summary with a masked API key:
```
============================================================
Chat RAG Explorer - Starting up
============================================================
Configuration:
  - OpenRouter Base URL: https://openrouter.ai/api/v1
  - OpenRouter API Key: sk-or-v1...6a0d
  - Default Model: openai/gpt-3.5-turbo
============================================================
```
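The masked key in the banner can be produced with a helper like this; `mask_key` is illustrative, not the project's actual function.

```python
def mask_key(key: str, prefix: int = 8, suffix: int = 4) -> str:
    """Render a secret as 'sk-or-v1...6a0d' for banners and logs.
    Keys too short to mask safely are fully starred out."""
    if len(key) <= prefix + suffix:
        return "*" * len(key)
    return f"{key[:prefix]}...{key[-suffix:]}"
```

Masking at the formatting layer, rather than trusting every log call site to remember, is the cheap way to keep secrets out of log files.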
Request Correlation: All API requests include a unique request ID for tracing:
```
[a1b2c3d4] POST /api/chat - Model: openai/gpt-4, Messages: 3, Content length: 150 chars
[a1b2c3d4] Starting chat stream - Model: openai/gpt-4
[a1b2c3d4] Token usage - Prompt: 45, Completion: 120, Total: 165
[a1b2c3d4] POST /api/chat - Stream completed (1.523s, 42 chunks)
```
Performance Metrics: Timing information for requests, including time-to-first-chunk (TTFC) for streams.
The browser console includes structured logs with session tracking:
```
[2025-12-26T15:30:00.000Z] [sess_abc123] INFO: Chat request initiated {model: "openai/gpt-4", messageLength: 50}
[2025-12-26T15:30:01.500Z] [sess_abc123] DEBUG: Time to first chunk {ttfc_ms: "823.45"}
[2025-12-26T15:30:02.000Z] [sess_abc123] INFO: Chat response completed {chunks: 42, totalTime_ms: "1523.00"}
```
Open your browser's DevTools (F12) → Console to view the frontend logs.
The project uses pytest with randomized test ordering to catch hidden state dependencies.
```bash
uv run pytest                 # Run all tests (randomized order)
uv run pytest -v              # Verbose output
uv run pytest -x              # Stop on first failure
uv run pytest --cov           # Run with coverage report
uv run pytest -k "test_name"  # Run a specific test by name
```

Use nox to run tests across Python 3.11, 3.12, and 3.13:

```bash
nox                # Run on all Python versions
nox -s tests-3.12  # Run on a specific version
nox -- -x          # Pass args to pytest
```

- Unit tests live in `tests/unit/` and must not make network calls
- External dependencies (ChromaDB, OpenRouter) are mocked
- Use the `tmp_path` fixture for any file operations
- Tests run in random order to catch hidden state dependencies
- Basic Chat Interface
- LLM Streaming
- Conversation History (Multi-turn)
- Metrics Sidebar (Token usage & Model info)
- Settings Page with Model Selection
- System Prompt Management
- ChromaDB Integration (local/server/cloud modes)
- Content Ingestion CLI (`utils/ingest.py`, `utils/split.py`)
- RAG Query Integration in Chat
- Further settings and metrics for Chat UI
- Chat History Persistence (Server-side)
This project is open source and available under the MIT License.