
Multi-Agent Chat System

Overview

This project implements a modular multi-agent chat system for research, analysis, and memory retrieval. It features:

  • Coordinator: Receives user queries, routes tasks, and synthesizes results.
  • ResearchAgent: Retrieves knowledge from a mock knowledge base.
  • AnalysisAgent: Summarizes, compares, and extracts insights from research results.
  • MemoryAgent: Stores and retrieves structured knowledge, conversation, and agent state.
  • MemoryLayer: Persists all records with metadata and supports vector-based retrieval.

See DEVELOPMENT_PHASES.md for a step-by-step breakdown of the development process.

Architecture & Sequence Flow

Sequence:

  1. User submits a query.
  2. Coordinator logs and routes the query:
    • Simple: ResearchAgent → MemoryAgent (recall)
    • Complex: ResearchAgent → AnalysisAgent → MemoryAgent
    • Memory: MemoryAgent directly
  3. Agents process and store results with provenance and timestamps.
  4. Coordinator returns the answer and saves transcripts/results in outputs/.
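The routing rules above can be sketched as a small dispatcher. This is a minimal illustration only; the function name, keyword lists, and return labels are assumptions, not the repository's actual API:

```python
def route(query: str) -> str:
    """Toy version of the Coordinator's routing rules (illustrative keywords)."""
    q = query.lower()
    if any(k in q for k in ("earlier", "recall", "remember")):
        return "memory"   # MemoryAgent directly
    if any(k in q for k in ("compare", "analyze", "summarize")):
        return "complex"  # ResearchAgent -> AnalysisAgent -> MemoryAgent
    return "simple"       # ResearchAgent -> MemoryAgent (recall)

print(route("What did we discuss earlier?"))  # memory
print(route("Compare CNNs and RNNs"))         # complex
print(route("What is a neural network?"))     # simple
```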

Flow Diagram:

flowchart LR
    User --> Coordinator
    Coordinator --> ResearchAgent
    ResearchAgent --> AnalysisAgent
    AnalysisAgent --> MemoryAgent
    Coordinator --> MemoryAgent
    MemoryAgent --> Coordinator

Agent Responsibilities

Coordinator:

Receives the user query.

Decides which agent should handle the task (in Prototype 1, this is always the ResearchAgent).

Logs and outputs the result.

ResearchAgent:

Looks up the query in a small mock knowledge base (data/knowledge_base.json).

Returns a list of results with a confidence score.

MemoryLayer:

Stores all results with timestamp, agent name, topic, and confidence.

Provides minimal retrieval (will be expanded in future prototypes).
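A record stored by the MemoryLayer might look like the following. The field names follow the description above; the exact schema and helper function are assumptions for illustration:

```python
from datetime import datetime, timezone

def make_record(agent: str, topic: str, confidence: float, payload: dict) -> dict:
    # Each stored result carries agent name, topic, confidence, and a timestamp.
    return {
        "agent": agent,
        "topic": topic,
        "confidence": confidence,
        "payload": payload,
        "timestamp": datetime.now(timezone.utc).isoformat(),
    }

record = make_record("ResearchAgent", "neural networks", 0.9,
                     {"results": ["CNN", "RNN", "Transformer"]})
```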

Confidence Score

Confidence values are simulated and rule-based:

0.9 for exact keyword matches.

0.3 for fuzzy or partial matches.

Example:

Query: "what are the main types of neural networks?" → The topic "neural networks" appears in the query, but the trailing question mark prevents an exact keyword match. → Confidence returned: 0.3.

Limitation: Confidence does not reflect semantic similarity yet. This will be improved in Prototype 2.
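A minimal version of the rule described above. The normalization step and matching logic are a sketch, assuming topic keys come from data/knowledge_base.json; this is not the repository's exact implementation:

```python
import string

def confidence(query: str, topic: str) -> float:
    """Return 0.9 for an exact keyword match, 0.3 for a fuzzy/partial match."""
    normalized = query.lower().strip(string.punctuation)
    if normalized == topic:
        return 0.9  # exact match
    if topic in normalized:
        return 0.3  # partial/fuzzy match
    return 0.0      # no match

# Punctuation in the query demotes the match from exact to fuzzy:
confidence("what are the main types of neural networks?", "neural networks")
```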

How to Run the Service & Tests

Local Setup:

conda create -n multiagent-chat python=3.10 -y
conda activate multiagent-chat
pip install -r requirements.txt

Run interactive chat:

python run.py

Run scenario tests:

python run_scenarios.py

Docker:

docker build -t multiagent-chat .
docker run -it multiagent-chat
docker run -it multiagent-chat python run_scenarios.py

Docker Compose:

docker-compose build
docker-compose up
  • This will run the service interactively (python run.py).
  • The outputs/ folder is mounted for easy access to results on your host machine.
  • To stop the service:
docker-compose down

Repository Structure

multiagent-chat/
    ├── agents/
    │   ├── research_agent.py
    │   ├── analysis_agent.py
    │   ├── memory_agent.py
    ├── data/
    │   └── knowledge_base.json
    ├── memory/
    │   └── memory_layer.py
    ├── coordinator.py
    ├── run.py
    ├── run_scenarios.py
    ├── requirements.txt
    ├── README.md
    └── outputs/   (sample outputs will be stored here)

Current Status (Prototype 1)

✔ Coordinator routes query to ResearchAgent.

✔ ResearchAgent retrieves from mock knowledge base.

✔ MemoryLayer stores result.

✔ Scenario runner works for a simple query.

Limitations

Confidence scores are rule-based, not semantic.

Only ResearchAgent is fully functional.

AnalysisAgent and MemoryAgent are stubs.

No real vector similarity search yet.

Error handling is minimal.

Memory Design & Retrieval Approach

  • MemoryLayer provides three structured stores:
    • Conversation records: logs all user/system messages with provenance and timestamps.
    • Knowledge records: stores synthesized findings, analysis, and research results.
    • Agent state records: tracks agent actions, tasks, and outcomes.
  • Retrieval uses a TF-IDF-like vector search for relevance.
  • All records include provenance, agent, and timestamp for traceability.
  • MemoryAgent exposes explicit retrieval methods for each store.
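The TF-IDF-like retrieval can be sketched in plain Python. This is an illustration of the general approach, assuming whitespace tokenization and cosine similarity; it is not the repository's implementation:

```python
import math
from collections import Counter

def tfidf_rank(query: str, docs: list[str]) -> list[tuple[float, str]]:
    """Rank documents by a simple TF-IDF cosine score against the query."""
    tokenized = [d.lower().split() for d in docs]
    n = len(docs)
    # Document frequency and smoothed inverse document frequency.
    df = Counter(t for doc in tokenized for t in set(doc))
    idf = {t: math.log(n / df[t]) + 1.0 for t in df}

    def vec(tokens: list[str]) -> dict:
        tf = Counter(tokens)
        return {t: tf[t] * idf.get(t, 0.0) for t in tf}

    def cosine(a: dict, b: dict) -> float:
        dot = sum(a[t] * b.get(t, 0.0) for t in a)
        na = math.sqrt(sum(v * v for v in a.values()))
        nb = math.sqrt(sum(v * v for v in b.values()))
        return dot / (na * nb) if na and nb else 0.0

    qv = vec(query.lower().split())
    return sorted(((cosine(qv, vec(toks)), doc)
                   for toks, doc in zip(tokenized, docs)), reverse=True)

ranked = tfidf_rank("neural networks",
                    ["neural networks overview", "docker deployment notes"])
```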

Example Outputs

Sample from outputs/simple_query.txt:

Prompt: What are the main types of neural networks?
Result:
{'result': "Summary for 'What are the main types of neural networks?': Moderate relevance in 1 topic(s): neural networks | Weak signals from 3 topic(s), may be less reliable."}

Sample from outputs/memory_test.txt:

Prompt: What did we discuss about neural networks earlier?
Result:
{'result': "(from memory) Previous discussion on 'What did we discuss about neural networks earlier?': [{'topic': 'neural networks', 'agent': 'Research+Analysis', 'provenance': 'Coordinator-details'}, ...]"}

All outputs are saved in the outputs/ folder for inspection.

(Optional) LLM Configuration & Fallback Behavior

  • The current system uses rule-based agents and vector search.
  • For future LLM integration:
    • Add configuration in the Coordinator to select LLM models.
    • Implement fallback logic if no relevant results are found.
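One way the fallback could look. This is a hypothetical sketch: the function name, threshold, and the place where an LLM call would go are all placeholders, not part of the current code:

```python
def answer(query: str, memory_results: list[dict], threshold: float = 0.5) -> str:
    """Fall back to a default response when no retrieved result is relevant enough."""
    relevant = [r for r in memory_results if r.get("confidence", 0.0) >= threshold]
    if relevant:
        return f"Found {len(relevant)} relevant result(s) for '{query}'."
    # Fallback path: a configured LLM could be invoked here instead.
    return f"No relevant results for '{query}'; falling back to default response."
```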

Authors

This project was developed by Adil Sheraz.

For questions or collaboration, please contact via GitHub Issues or repository email.

