A modular framework for orchestrating structured debates between multiple large language models (LLMs) with specialized judge evaluation. This project implements an adversarial training approach to enhance LLM argumentative reasoning.
- Multi-Agent Architecture: Orchestrates debates between opposing LLM agents
- Structured Debate Protocol: Implements formal opening, rebuttal, and closing rounds
- Adversarial Critique System: Agents analyze and critique opposing arguments
- Evidence Self-Check Mechanism: Ensures factual accuracy and reduces source fabrication
- Multi-Dimensional Judge Framework: Seven specialized judges evaluate different aspects of argument quality
- Local Execution: Compatible with locally hosted Ollama models, so debates run entirely on your own hardware
- Python 3.8+
- Ollama for local model hosting
- YAML for configuration files
- Required Python packages (see Environment Setup)
1. Clone this repository:

   ```shell
   git clone https://github.com/[username]/multi-agent-llm-debate.git
   cd multi-agent-llm-debate
   ```

2. Create and activate the conda environment:

   ```shell
   conda env create -f debate-env.yml
   conda activate debate-env
   ```

3. Install Ollama following the instructions at ollama.ai

4. Download the required models via Ollama. The download commands live in the first notebook cell, which can be edited to pull models selectively or all at once.
.
```
.
├── .ipynb_checkpoints/             # Jupyter notebook checkpoints
├── prompts/                        # YAML configuration files for debate prompts
│   ├── debate_prompts.yml          # Core debate prompts
│   └── judge_prompts.yml           # Judge evaluation prompts
├── results/                        # Debate outputs and judge evaluations
│   ├── agent_records/              # Saved debate transcripts
│   ├── judge_records/              # Evaluation results
│   └── perfect_debate_transcripts/ # Curated debate examples for the judgement pipeline
├── debate-env.yml                  # Conda environment configuration
├── MultiLLM Debate.ipynb           # Main notebook for running debates
└── OLLAMA EDA, Test Scripts.ipynb  # Ollama exploration and testing scripts
```
`PromptManager` class loads and formats debate prompts from YAML files
- Modular design allows testing different prompt strategies
- Phase-specific guidance for opening, rebuttal, and closing rounds
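A minimal sketch of how such a loader could work, assuming a simple one-template-per-phase YAML layout (the actual class internals and YAML keys in this repo may differ):

```python
# Illustrative PromptManager-style loader; YAML keys are assumptions.
import yaml  # PyYAML

class PromptManager:
    def __init__(self, path):
        # Load all prompt templates from a YAML file
        with open(path) as f:
            self.prompts = yaml.safe_load(f)

    def format_prompt(self, phase, **kwargs):
        # Look up the phase-specific template (e.g. "opening",
        # "rebuttal", "closing") and fill in debate details
        return self.prompts[phase].format(**kwargs)
```

Swapping in a different YAML file is all it takes to test an alternative prompt strategy.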
`MultiAgentDebate` class orchestrates structured interactions
- Implements preparation, critique, and rebuttal phases
- Manages context and maintains debate state
- Generates enhanced arguments based on adversarial feedback
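The orchestration loop can be sketched roughly as follows; the agent interface, method names, and round order here are illustrative assumptions, not the repo's exact API:

```python
# Illustrative debate orchestration sketch (not the repo's exact API).
class MultiAgentDebate:
    def __init__(self, agent_pro, agent_con):
        # Each agent is a callable: (phase, transcript) -> argument text
        self.agents = {"pro": agent_pro, "con": agent_con}
        self.transcript = []  # shared debate state

    def run_round(self, phase):
        # Each side sees the transcript so far, so rebuttals can
        # respond to the opponent's prior arguments and critiques
        for side, agent in self.agents.items():
            argument = agent(phase, list(self.transcript))
            self.transcript.append({"phase": phase, "side": side, "text": argument})

    def run(self):
        # Formal rounds: opening, rebuttal, closing
        for phase in ("opening", "rebuttal", "closing"):
            self.run_round(phase)
        return self.transcript
```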
`JudgeEvaluator` class assesses debate quality across multiple dimensions
- Specialized judges for logical, factual, rhetorical, and ethical aspects
- Meta-judge synthesizes evaluations into composite assessment
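The meta-judge synthesis step could look something like the weighted aggregation below; the dimension names beyond the four listed above, and the equal default weights, are assumptions for illustration:

```python
# Hypothetical meta-judge aggregation; dimension names and weights
# are illustrative, not the repo's actual judge configuration.
DIMENSIONS = ["logical", "factual", "rhetorical", "ethical",
              "evidence", "clarity", "responsiveness"]

def meta_judge(scores, weights=None):
    """Combine per-dimension judge scores into one composite score."""
    weights = weights or {d: 1.0 for d in DIMENSIONS}
    total_weight = sum(weights[d] for d in scores)
    return sum(scores[d] * weights[d] for d in scores) / total_weight
```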
Edit the YAML files in the prompts/ directory to customize:
- Debate instructions and structure
- Critique guidelines
- Evidence check parameters
- Judge evaluation criteria
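As a hypothetical example of what such a file might contain (the keys below are illustrative; check `prompts/debate_prompts.yml` for the actual schema):

```yaml
# Illustrative structure only; the repo's real keys may differ.
opening:
  instruction: "Present your strongest case for the {stance} position on {topic}."
rebuttal:
  instruction: "Address the critique points below and strengthen your argument."
critique:
  guidelines: "Identify logical gaps, unsupported claims, and weak evidence."
evidence_check:
  flag_unverified_sources: true
```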
Update the `OllamaDebateManager.models` dictionary to include new models:

```python
self.models = {
    "custom_model": "model_name:tag",
    # Add more models here
}
```

Debate results and judge evaluations are saved to:
- `results/agent_records/` - full debate transcripts
- `results/judge_records/` - judge evaluations and scores
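A sketch of how a transcript might be persisted to `results/agent_records/`; the file-naming scheme and record schema here are assumptions, not necessarily what the notebook produces:

```python
# Illustrative transcript persistence; naming and schema are assumed.
import json
import os
import time

def save_transcript(transcript, out_dir="results/agent_records"):
    """Write a debate transcript as a timestamped JSON file."""
    os.makedirs(out_dir, exist_ok=True)
    path = os.path.join(out_dir, f"debate_{int(time.time())}.json")
    with open(path, "w") as f:
        json.dump(transcript, f, indent=2)
    return path
```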
If you use this framework in your research, please cite:
```bibtex
@misc{markapudi2025socraiticcircle,
  title={SocrAItic Circle: Enhancing LLM Reasoning Through Multi-Agent Debate Frameworks},
  author={Markapudi, Joel},
  year={2025},
  institution={Northeastern University}
}
```
TBD
TBD