🚀 Multi Tool Agent

A FastAPI-based multi-tool agent system that supports document ingestion, vector search, intelligent query handling, and retrieval evaluation using OpenAI and ChromaDB.

This project demonstrates how to build a modular agent architecture with multiple embedding backends, tool definitions, and automated evaluation pipelines.

📂 Project Breakdown

⚙️ Settings Management

Environment variables are managed using pydantic-settings.
Configure your OpenAI key, model names, and other values in the .env file.
Settings are automatically loaded at runtime.

📥 Ingestion Layer

Two ingestion APIs allow you to store documents in ChromaDB using different embedding strategies:

Sentence Transformer Embeddings
- Uses ChromaDB’s default sentence transformer for embeddings.
- Stores the embeddings in a persistent client on disk.
OpenAI Embeddings
- Uses OpenAI’s embedding models for ingestion.
- Stored in a separate persistent client for comparison.

Ingestion Process:

Document Loader: Uses pymupdf for TXT/document parsing.
Chunking Strategy: Fixed size chunks of 100 tokens with 50 token overlap.
VectorStore: Persistent client of chromaDB ensures embeddings are saved locally.

🤖 Message Agent Layer

Provides an API to handle user queries.
Accepts structured input via a Pydantic model.
Loads bot.json containing:
- System instructions / LLM prompt.
- Tool definitions.

Agent Workflow:

Initializes the agent object using Agent class with the system prompt, tool definitions, and chat history.
The run() method calls the OpenAI responses API with:
- query
LLM decides which tool to invoke in run method.
The selected tool is dynamically executed and returns results.
The agent run method recursively re-runs with updated chat history until a final output is generated.

📊 Evaluation Layer

Provides an API to evaluate retrieval quality from both embedding approaches (Sentence Transformer vs OpenAI Embeddings).
Measures:
- Best Model (via LLM-as-a-judge).
- Latency (time required to retrieve chunks).
LLM outputs which embedding method is more suitable.

🛠️ Tech Stack

FastAPI – Async Web framework
ChromaDB – Vector database
OpenAI – LLM & embeddings
pydantic-settings – Settings management
pymupdf – Document loading
uv – Dependency management
Docker – Containerization

📑 APIs Overview

After starting the server, visit Swagger UI (http://0.0.0.0:8000/docs) to explore the 4 APIs:

POST rag/chroma/ingest – Ingest the data with default sentence transformer.
POST rag/chroma/ingest/openai – Ingest the data with OpenAI embeddings.
POST rag/message-agent – Query via Message Agent.
POST rag/evaluate – Outputs Best Model for retrieval and latency.

🚀 Getting Started

1. Clone the repo

git clone https://github.com/sunilvepanjeri/multi-agent.git

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.idea		.idea
app		app
.env		.env
.python-version		.python-version
Dockerfile		Dockerfile
README.md		README.md
main.py		main.py
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🚀 Multi Tool Agent

📂 Project Breakdown

⚙️ Settings Management

📥 Ingestion Layer

🤖 Message Agent Layer

📊 Evaluation Layer

🛠️ Tech Stack

📑 APIs Overview

🚀 Getting Started

1. Clone the repo

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🚀 Multi Tool Agent

📂 Project Breakdown

⚙️ Settings Management

📥 Ingestion Layer

🤖 Message Agent Layer

📊 Evaluation Layer

🛠️ Tech Stack

📑 APIs Overview

🚀 Getting Started

1. Clone the repo

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages