MEVE Framework

Simple 5-phase pipeline for efficient context retrieval in RAG systems.

Simple Flow

Query → kNN Search → Verify → [Fallback?] → Prioritize → Budget → Result

Block Diagram

block-beta
    columns 5
    
    Query["Query Input"]
    
    space
    VectorDB[("Vector Store")]
    BM25DB[("BM25 Index")]
    space
    
    Phase1["Phase 1<br/>kNN Search<br/>k_init=5"]
    Phase2["Phase 2<br/>Verification<br/>τ ≥ 0.5"]
    Decision{"Decision<br/>|chunks| < n_min?"}
    Phase3["Phase 3<br/>BM25 Fallback"]
    Combine["Combine<br/>Contexts"]
    
    Phase4["Phase 4<br/>Prioritization<br/>Remove Duplicates"]
    Phase5["Phase 5<br/>Token Budget<br/>t_max=100"]
    Output["Final Context<br/>for LLM"]
    
    Query --> Phase1
    VectorDB --> Phase1
    Phase1 --> Phase2
    Phase2 --> Decision
    Decision -->|"No"| Combine
    Decision -->|"Yes"| Phase3
    BM25DB --> Phase3
    Phase3 --> Combine
    Combine --> Phase4
    Phase4 --> Phase5
    Phase5 --> Output
    
    style Query fill:#e1f5fe
    style Phase1 fill:#f3e5f5
    style Phase2 fill:#f3e5f5
    style Decision fill:#fff3e0
    style Phase3 fill:#f3e5f5
    style Combine fill:#e8f5e8
    style Phase4 fill:#f3e5f5
    style Phase5 fill:#f3e5f5
    style Output fill:#e1f5fe
    style VectorDB fill:#f5f5f5
    style BM25DB fill:#f5f5f5

How It Works

Phase 1 - kNN Search: Find similar chunks using vector search
Phase 2 - Verify: Check if chunks are actually relevant
Phase 3 - Fallback: If not enough good chunks, try BM25 search
Phase 4 - Prioritize: Remove duplicates, rank by importance
Phase 5 - Budget: Pack best chunks within token limit

Key Settings

k_init = 5          # How many chunks to find initially
tau_relevance = 0.5 # Minimum relevance score to keep
n_min = 3           # Minimum chunks needed (triggers fallback if less)
t_max = 100         # Maximum tokens allowed

Quick Example

# Setup
engine = MeVeEngine(config, vector_store, bm25_index)

# Ask question
result = engine.run("What is the Eiffel Tower?")

# Get optimized context for LLM
print(result)

What Makes It Smart

Quality First: Only keeps relevant chunks
Backup Plan: Falls back to BM25 if vector search fails
No Duplicates: Removes redundant information
Budget Aware: Fits within token limits
Fast: Efficient pipeline design

Usage

from meve_engine import MeVeEngine, setup_simulation_data
from meve_data import MeVeConfig

# Setup
vector_store, bm25_index = setup_simulation_data()
config = MeVeConfig(k_init=5, tau_relevance=0.5, n_min=3, theta_redundancy=0.85, t_max=100)

# Initialize engine
engine = MeVeEngine(config, vector_store, bm25_index)

# Run query
result = engine.run("Your query here")

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
docs		docs
services		services
.env		.env
.gitignore		.gitignore
.python-version		.python-version
CLAUDE.md		CLAUDE.md
Makefile		Makefile
README.md		README.md
TODO.md		TODO.md
meve_data.py		meve_data.py
meve_engine.py		meve_engine.py
phase1_knn.py		phase1_knn.py
phase2_verification.py		phase2_verification.py
phase3_fallback.py		phase3_fallback.py
phase4_prioritization.py		phase4_prioritization.py
phase5_budgeting.py		phase5_budgeting.py
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MEVE Framework

Simple Flow

Block Diagram

How It Works

Key Settings

Quick Example

What Makes It Smart

Usage

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

MEVE Framework

Simple Flow

Block Diagram

How It Works

Key Settings

Quick Example

What Makes It Smart

Usage

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages