min_rag

A minimal example of RAG, based on DSPy.

set up

To set up the environment:

git clone https://github.com/DerwenAI/min_rag.git
cd min_rag

python3 -m venv venv
source venv/bin/activate

python3 -m pip install -U pip wheel
python3 -m pip install -r requirements.txt

If you want to use ChatGPT instead of a locally hosted LLM:

set the OPENAI_API_KEY environment variable to your OpenAI API key
set the run_local = False flag in "demo.py"

Otherwise this uses ollama to download and orchestrate a local LLM.

The gpt-oss:20b model is set by default, and to have it running locally:

ollama pull gpt-oss:20b

Or change the "rag.lm_name" configuration setting to a different model which you have downloaded and run locally.

running

To load the vector database from markdown files, then run a question/answer chat bot based on RAG:

python3 demo.py

Then ask questions.

Change the markdown files in data/talks to add new content, or point to a different directory.

"For those we hold close, and for those we never meet."

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

min_rag

set up

running

FilesExpand file tree

README.md

Latest commit

History

README.md

File metadata and controls

min_rag

set up

running