Chatbot-LLM

Vulcan Lab – Entrance Exam Project

Demo

🎥 Video Demo (Don't worry, I deleted the API key :3):
Click here

A modular, memory-augmented chatbot system built with LLMs, structured outputs, and a vector database for long-term conversational context.

Overview

This project implements a stateful LLM chatbot with:

Structured outputs (JSON Schema–enforced)
Short-term & long-term memory
Context augmentation via vector search
Scalable multi-user / multi-chat architecture

The system periodically summarizes conversations and stores them in a vector database (Milvus), enabling retrieval and augmentation for future queries.

Architecture Highlights

LLM Provider: Groq API
Vector Database: Milvus
Memory Types:
- Short-term: current context window
- Long-term: summarized session memory (vectorized)
Core Components:
- Query understanding (ambiguity detection, query rewriting)
- Context augmentation
- Structured session summarization
- Grounded answer generation

Workflow

Getting Started

1. Prerequisites

Docker & Docker Compose
Python 3.10+
Groq API key

2. Start Milvus (Vector Database)

docker compose up -d

3. Configure Environment Variables

Create a .env file in the project root:

GROQ_API_KEY=your_groq_api_key_here

Create a free Groq API key at:
https://console.groq.com/keys

4. Run the Chatbot

chmod u+x ./run.sh
./run.sh

Or run directly:

python -m src.main --config ./configs/app.yaml

Structured Output Examples

Query Understanding Pipeline

{
  "original_query": "...",
  "is_ambiguous": true,
  "rewritten_query": "...",
  "needed_context_from_memory": [
    "user_profile.prefs",
    "open_questions"
  ],
  "clarifying_questions": [],
  "final_augmented_context": {}
}

Session Memorization

{
  "session_summary": {
    "user_profile": {
      "prefs": [],
      "constraints": []
    },
    "key_facts": [],
    "decisions": [],
    "open_questions": [],
    "todos": []
  },
  "message_range_summarized": {
    "from": 0,
    "to": 42
  }
}

Key Design Assumptions

The chatbot enforces structured LLM outputs via JSON Schema
The architecture supports horizontal scalability through user_id and chat_id
Long conversations are supported via vectorized session memory

Limitations

Ambiguous query classification relies solely on the LLM (no dedicated classifier yet)
Context input size can grow large and needs further optimization
Context augmentation is currently prompt-based

Configuration Documentation

Example configs/app.yaml:

# User info
chat_id: "001"
user_id: "user_123"

# App config
model_name: "openai/gpt-oss-120b"
chat_history_path: "chatbot_logs/"
reload: true
chatbot_temperature: 0.2
max_completion_tokens: 500
max_context_length: 1000

# Database config
uri: "http://localhost:19530"
token: ""
db_name: "chatbot_db"

session_collection_name: "session_memory"
chat_logs_collection_name: "chat_logs"
context_window_collection_name: "context_window"

embedding_dimension: 384
index_type: "IVF_FLAT"
metric_type: "COSINE"
nlist: 128
nprobe: 10
topk: 5

References

Groq API Keys: https://console.groq.com/keys
Milvus Quickstart: https://milvus.io/docs/quickstart.md
RAG Context Refinement Agent: https://devpost.com/software/rag-context-refinement-agent
LangChain Groq Integration: https://github.com/langchain-ai/langchain/tree/master/libs/partners/groq

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
chatbot_logs		chatbot_logs
configs		configs
content		content
src		src
volumes/minio/.minio.sys		volumes/minio/.minio.sys
.gitignore		.gitignore
README.md		README.md
docker-compose.yml		docker-compose.yml
docker.yaml		docker.yaml
remaining_task.txt		remaining_task.txt
requirements.txt		requirements.txt
run.sh		run.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Chatbot-LLM

Demo

Overview

Architecture Highlights

Workflow

Getting Started

1. Prerequisites

2. Start Milvus (Vector Database)

3. Configure Environment Variables

4. Run the Chatbot

Structured Output Examples

Query Understanding Pipeline

Session Memorization

Key Design Assumptions

Limitations

Configuration Documentation

References

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Chatbot-LLM

Demo

Overview

Architecture Highlights

Workflow

Getting Started

1. Prerequisites

2. Start Milvus (Vector Database)

3. Configure Environment Variables

4. Run the Chatbot

Structured Output Examples

Query Understanding Pipeline

Session Memorization

Key Design Assumptions

Limitations

Configuration Documentation

References

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages