-
Notifications
You must be signed in to change notification settings - Fork 2
User Guide
KHemanthRaju edited this page Nov 18, 2025
·
1 revision
This guide will walk you through using the RAG Process Visualizer to understand the RAG pipeline.
The first step is to input your document:
- Enter Text Manually: Type or paste your document text into the text area
- Use Sample Documents: Click on any of the sample documents to load them instantly
-
Configure Chunk Size:
- Use the slider or number input to set chunk size (50-500 characters)
- Smaller chunks = more granular, larger chunks = more context
- Default is 200 characters
- Process Document: Click "Process Document" to begin
After processing, you'll see:
- Chunk Count: Number of chunks created
- Chunk Size: The configured chunk size
- Chunk Preview: Visual cards showing each chunk with its content
The chunks are created by splitting the document at sentence boundaries while respecting the chunk size limit.
This is a manual step - you control when embeddings are generated:
- Review the chunks created
- Click "Generate Embeddings" button
- Watch as chunks are converted to vector embeddings
- View embedding statistics:
- Total vectors created
- Vector dimensions (384)
- Sample vector preview
Before querying, you must store embeddings:
- Click "Store in Vector Database" button
- Wait for the storage process to complete
- You'll see a success message confirming storage
- The query stage will now be enabled
Once embeddings are stored:
- Enter your query in the search box
- Or click a sample query to load it
- Click "Search & Generate"
- The system will:
- Embed your query
- Search for similar chunks
- Retrieve top matches
You'll see:
- Query Display: Your original query
- Relevant Chunks: Top 3 most relevant chunks
- Similarity Scores: Percentage match for each chunk
- Ranking: Chunks ordered by relevance
The final stage shows:
- Generated Response: AI-generated answer based on retrieved context
- Context Used: The chunks that informed the response
- Source Tracking: See which chunks contributed to the answer
At the top of the page, you'll see a Pipeline Progress Tracker that shows:
- ✅ Green: Completed stages
- 🔵 Blue (Pulsing): Currently active stage
- ⚪ Gray: Pending stages
This gives you a real-time view of where you are in the pipeline.
- Small chunks (50-150): Better for precise, specific queries
- Medium chunks (150-300): Good balance for most use cases
- Large chunks (300-500): Better for queries requiring more context
- Be specific and clear
- Use keywords from your documents
- Ask complete questions rather than fragments
You can process multiple documents:
- Process first document
- Generate embeddings
- Store in Vector DB
- Process additional documents
- Generate and store their embeddings
- All documents will be searchable together
Click the "Reset Pipeline" button to:
- Clear all documents
- Remove all chunks and embeddings
- Start fresh with a new pipeline
- 🔵 Blue: Document input stage
- 🟡 Yellow: Chunking stage
- 🟣 Purple: Embedding generation
- 🟢 Teal: Vector DB storage
- 🟢 Green: Query processing
- 🟠 Orange: Retrieval stage
- 🟣 Indigo: Generation stage
- Pulsing borders: Indicate active processing
- Fade-in effects: Show new content appearing
- Progress bars: Display similarity scores
- Flow connectors: Show pipeline progression
- Explore the Architecture to understand how it works
- Check the API Reference for technical details
- Review Features for advanced capabilities