research(nightly): muvera-fde — MUVERA Fixed Dimensional Encodings (NeurIPS 2024)#436
Draft
research(nightly): muvera-fde — MUVERA Fixed Dimensional Encodings (NeurIPS 2024)#436
Conversation
Implements arXiv:2405.19504 (NeurIPS 2024, Google Research) as a new standalone Rust crate `ruvector-muvera`. Key results (x86_64, cargo --release, 4 CPUs): - 329× QPS over brute-force MaxSim (FDE-small, 5K docs, 32 tokens, d=128) - 16× memory reduction (256 f32s vs 4,096 f32s per doc) - 301× search speedup on 1K-doc Criterion bench (61.8ms → 205µs/query) - 12/12 unit + doc tests passing, cargo bench green Deliverables: - crates/ruvector-muvera/ — FdeEncoder, MuveraIndex<B>, VectorBackend trait - docs/adr/ADR-193-muvera-fde.md — architecture decision record - docs/research/nightly/2026-05-08-muvera-fde/README.md — research doc with SOTA survey, algorithm walkthrough, real benchmark tables https://claude.ai/code/session_01393yTCKC5VvRYFxnZ38KH6
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
Implements MUVERA Fixed Dimensional Encodings (arXiv:2405.19504, NeurIPS 2024, Google Research) as a new standalone Rust crate
ruvector-muvera.MUVERA compresses ColBERT-style multi-token embedding sets into fixed-dimension single vectors via SimHash space partitioning + Rademacher random projection, enabling standard HNSW/IVF indexing for multi-vector workloads with a formal ε-approximation guarantee on Chamfer/MaxSim similarity.
Gist: https://gist.github.com/ruvnet/d5783fe3ec893249b1f7590f00a2c087
Key Results (x86_64, cargo --release, 4 CPUs)
Criterion micro-benchmarks (1K docs, d=128, 32 tokens/doc)
Demo (5K docs, clustered embeddings, 50 clusters σ=0.25)
Encode latency per document
Deliverables
crates/ruvector-muvera/— FdeEncoder, MuveraIndex<B: VectorBackend>, VectorBackend trait, FlatBackend, 12/12 tests passingdocs/adr/ADR-193-muvera-fde.md— ADR with context, decision, alternativesdocs/research/nightly/2026-05-08-muvera-fde/README.md— full research doc (SOTA survey, algorithm walkthrough, benchmark tables, roadmap)Architecture
VectorBackend trait makes HNSW/IVF backends pluggable — see ADR-193 §Alternatives.
Tests
See
docs/research/nightly/2026-05-08-muvera-fde/README.mdanddocs/adr/ADR-193-muvera-fde.mdfor full analysis.https://claude.ai/code/session_01393yTCKC5VvRYFxnZ38KH6