🥤 RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with DuckDB or PostgreSQL
-
Updated
Apr 16, 2026 - Python
🥤 RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with DuckDB or PostgreSQL
PyLate efficient inference engine
High-performance late-interaction retrieval engine for on-prem AI. ColBERT/ColPali multi-vector search with Rust fused MaxSim, Triton GPU kernels, ROQ quantization, LEMUR routing, WAL-backed CRUD, and a FastAPI server — single machine, CPU or GPU.
ColFastVLM: Towards low-latency indexing in visual document retrieval
🌐 Build and share your personal website with ease using mjsushanth.github.io, a simple and effective static site generator.
Repo for portfolio, containing working redirects to all projects.
Add a description, image, and links to the late-interaction topic page so that developers can more easily learn about it.
To associate your repository with the late-interaction topic, visit your repo's landing page and select "manage topics."