HybridMind is a local-native hybrid vector–graph store for agent memory. It provides a clean, self-contained implementation that combines FAISS exact inner-product search, an Okapi BM25 index with NLTK stemming, a NetworkX directed graph, and SQLite into a single .mind file format. Repository: github.com/a3ro-dev/hybridmind.
Pure vector retrieval ignores explicit relational structure; graph-only retrieval lacks semantic filtering and scales poorly when edges are sparse or noisy. Agent memory systems need both: semantic alignment to the query and re-ranking or traversal grounded in declared relationships, without mandatory remote services.
HybridMind is an engineering system that correctly applies known hybrid retrieval techniques without external cloud dependencies.
Late Fusion Scoring. Hybrid retrieval ranks candidates by a weighted linear score fusion—a well-known late fusion technique in information retrieval—combining vector similarity and graph proximity:
Score(q,n) = α·V(q,n) + β·G(A,n), α + β = 1
| Symbol | Meaning |
|---|---|
| q, n | Query and candidate node |
| V(q,n) | Base vector score: cosine similarity between query and node embeddings, plus a BM25 exact-match overlap boost for lexical precision. |
| G(A,n) | Graph score: max over anchors a in A of 1/(1 + d(a,n)), where d(a,n) is the shortest path length in the directed graph, with edges traversable in either direction. |
| A | Anchor set; if omitted, defaults to the top-3 vector hits |
Default weights α = 0.6, β = 0.4 (semantic primacy). Full definition, anchors, and weight rationale: docs/ALGORITHM.md.
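The late-fusion score above can be sketched in a few lines. This is an illustrative reimplementation of the formula, not the project's actual code; the function names and the anchor-distance dictionary shape are assumptions.

```python
# Sketch of Score(q,n) = alpha*V(q,n) + beta*G(A,n) from the formula above.
# Function names and data shapes are illustrative assumptions.

def graph_score(anchor_dists):
    """G(A,n): max over anchors a of 1/(1 + d(a,n)); unreachable anchors (None) score 0."""
    scores = [1.0 / (1.0 + d) for d in anchor_dists.values() if d is not None]
    return max(scores, default=0.0)

def hybrid_score(v, g, alpha=0.6, beta=0.4):
    """Weighted linear fusion with the default semantic-primacy weights (alpha + beta = 1)."""
    assert abs(alpha + beta - 1.0) < 1e-9
    return alpha * v + beta * g

# Example: candidate with vector score 0.8, nearest anchor two hops away.
score = hybrid_score(0.8, graph_score({"anchor-1": 2}))
```

A candidate with no path to any anchor falls back to a pure (down-weighted) vector score, which is why sparse graphs degrade gracefully toward vector-only ranking.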
Ingest-Time Neighborhood Averaging. Stored vectors are L2-normalized after blending the text embedding with the mean of the top-5 vector neighbors: 0.7·e_raw + 0.3·e_neighbors (docs/ARCHITECTURE.md, Embedding Engine). This is a practical, non-training variant of GraphSAGE-style aggregation used to provide a graph-aware embedding space. Formulation and caveats: docs/ALGORITHM.md §3.
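A minimal sketch of the ingest-time blend, using the 0.7/0.3 weights stated above. The function name is an assumption; how the top-5 neighbors are fetched (a FAISS lookup in the real system) is out of scope here.

```python
import numpy as np

def blend_embedding(e_raw, neighbor_embeddings, w_raw=0.7, w_nbr=0.3):
    """Blend a text embedding with the mean of its vector neighbors, then L2-normalize.

    Illustrative sketch of the ingest-time averaging step; not the project's code.
    """
    if neighbor_embeddings:
        e_nbr = np.mean(neighbor_embeddings, axis=0)
        e = w_raw * np.asarray(e_raw) + w_nbr * e_nbr
    else:
        # First nodes in an empty store have no neighbors: keep the raw embedding.
        e = np.asarray(e_raw)
    return e / np.linalg.norm(e)
```

The L2-normalization matters because the store uses FAISS inner-product search: on unit vectors, inner product equals cosine similarity.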
Layered stack: FastAPI / Pydantic → embedding engine, vector and graph query engines, hybrid ranker → SQLite (WAL), FAISS IndexFlatIP, NetworkX DiGraph → atomic .mind persistence (manifest, DB, vectors, graph). ASCII diagram and data-flow for hybrid search: docs/ARCHITECTURE.md.
Use the project virtual environment for all Python commands.
```shell
python3 -m venv .venv
# Windows PowerShell: .\.venv\Scripts\Activate.ps1
# Unix: source .venv/bin/activate
pip install -r requirements.txt
.\.venv\Scripts\python.exe -m uvicorn main:app --host 127.0.0.1 --port 8000
```

Python SDK (sdk/memory.py):
```python
from sdk.memory import HybridMemory

memory = HybridMemory(base_url="http://127.0.0.1:8000")
nid = memory.store("Transformer models use self-attention.")
memory.relate(nid, "other-node-uuid", "derived_from")
results = memory.recall("attention mechanisms", top_k=5, mode="hybrid")
```

Tests and benchmarks:
```shell
python3 -m pytest tests/ -v
./scripts/run_all_benchmarks.sh
```

Further integration notes: docs/AGENT_INTEGRATION.md.
| Area | Methods (HTTP) |
|---|---|
| Nodes | POST/GET/PUT/DELETE /nodes, GET /nodes/{id} |
| Edges | POST/GET/PUT/DELETE /edges, GET /edges/node/{node_id}, GET /edges/types |
| Search | POST /search/vector, GET /search/graph, POST /search/hybrid, POST /search/compare, GET /search/path/{source}/{target}, GET /search/stats |
| Bulk | POST /bulk/nodes, POST /bulk/edges, POST /bulk/import, POST /bulk/unstructured, DELETE /bulk/clear |
| Comparison | POST /comparison/effectiveness |
| Ops | GET /health, GET /ready, GET /live, POST /snapshot, POST /cache/clear, POST /admin/compact, POST /admin/clear |
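The REST API can also be called directly without the SDK. The endpoint path below comes from the table above; the JSON field names ("query", "top_k", "vector_weight", "graph_weight") are assumptions for illustration and may differ from the actual request schema.

```python
import json
import urllib.request

def build_hybrid_payload(query, top_k=5, vector_weight=0.6, graph_weight=0.4):
    """Request body for POST /search/hybrid (field names are assumed, not verified)."""
    return {"query": query, "top_k": top_k,
            "vector_weight": vector_weight, "graph_weight": graph_weight}

def hybrid_search(base_url, **kwargs):
    """POST a hybrid search to a running HybridMind server and return parsed JSON."""
    req = urllib.request.Request(
        f"{base_url}/search/hybrid",
        data=json.dumps(build_hybrid_payload(**kwargs)).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(req, timeout=10) as resp:
        return json.loads(resp.read())

# With the server from the quickstart running:
# results = hybrid_search("http://127.0.0.1:8000", query="attention mechanisms")
```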
SDK: HybridMemory.store, relate, recall (mode: hybrid | vector), trace (vector anchor then GET /search/graph), forget, compact, stats.
The system is empirically evaluated on targeted benchmarks demonstrating clear regime-of-validity boundaries:
- Semantic Paraphrase & Exact Lexical Lookup: Vector alone (with BM25 exact match boost) achieves 100% precision@3 without graph assistance.
- Edge-Dependent Multi-Hop Retrieval: Graph-heavy hybrid (vector=0.1, graph=0.9) successfully surfaces multi-hop answers, recovering 100% recall where vector-only yields 0%.
- Ingest-Time Neighborhood Averaging: Conditioning embeddings on neighbors improves test retrieval of related cross-domain concepts from 66% (without averaging) to 100% (with averaging).
- Ablation Studies: Isolated runs (BM25 only, vector only, hybrid) confirm that the linear combination Score = α·V + β·G correctly blends the semantic and structural signals, without inflating claims via unsupported deep graph traversals.
Run benchmarks with: ./scripts/run_all_benchmarks.sh
- Graph Sparsity Failure: The graph component is functionally useless if explicit cross-domain edges do not exist. Hybrid search defaults to vector-only if no anchors are found.
- Domain Separation from Embeddings: all-MiniLM-L6-v2 struggles to differentiate certain document types (e.g. Stack Exchange Q&A vs. Wikipedia paragraphs), which can lead to vector-search contamination that graph edges alone cannot fix.
- BM25 Exact-Overlap Limits: BM25 excels at keyword matching but cannot recognize semantic relevance in the absence of exact keyword overlap.
- Ingest Scalability: Single-threaded Transformer inference in Python bounds ingestion to roughly 5 requests per second, making this explicitly a local-agent tool, not an enterprise search backend.
```bibtex
@software{hybridmind2025,
  title  = {HybridMind: Local-Native Hybrid Vector--Graph Memory},
  author = {a3ro-dev},
  year   = {2025},
  url    = {https://github.com/a3ro-dev/hybridmind}
}
```