Role-Adaptive Multi-Robot Communication Framework

This repository presents the Role-Adaptive Communication Framework for multi-robot systems powered by Large Language Models (LLMs). The framework enables heterogeneous robots to communicate adaptively according to their assigned roles, personalities, and interaction contexts. It integrates Retrieval-Augmented Generation (RAG) for contextual memory, adaptive persona configuration via system prompts, and a centralised orchestration architecture for efficient coordination and scalability.

Developed as part of the University of Auckland Part IV Research Project (2025), this system combines emotion-aware communication, dynamic LLM configuration, and robot-to-robot (RRI) task delegation across multiple platforms.

1. Project Overview

The Role-Adaptive Multi-Robot Framework represents an evolution from a one-to-one adaptive learning system to a fully integrated multi-agent communication system.

It allows each robot to operate under a unique persona and functional configuration, while maintaining shared context through a centralised server.
The system supports both Human-Robot Interaction (HRI) and Robot-Robot Interaction (RRI) for cooperative operations across heterogeneous platforms.

Key Capabilities

Centralised Orchestration Server for multi-robot management
Role-Adaptive Communication powered by modular LLM integration
Persona Assignment through detailed JSON-based configuration files
Dynamic Task Delegation between heterogeneous robot agents
WebSocket Communication for low-latency, bidirectional streaming
RAG-based Memory System for personalised and context-aware interactions
Scalable Client Architecture supporting simultaneous robot connections

2. System Architecture

The system consists of a Centralised Server and multiple Robot Clients, communicating via asynchronous WebSockets.
Each robot registers to the server using a JSON configuration file that defines its role, LLM model, and modules (e.g., emotion recognition, speech, face recognition).

Core Components

Layer	Description
Central Server	Hosts orchestration logic, LLM APIs, RAG memory, and module routing.
Robot Client	Connects to the server with a JSON-based configuration defining its functional modules and personality.
RAG Database	Stores conversation history and key user information for context retrieval.
LLM Integration	Supports ChatGPT, Gemini, DeepSeek, and Llama for flexible persona configurations.
RRI Module	Handles robot-to-robot task delegation and message routing.

Communication Flow

Each robot loads its configuration (robot_config.json) and connects to the server.
The server assigns a session and activates modules (LLM, emotion, STT, etc.).
Robots communicate through real-time WebSocket channels.
Robots may exchange context and tasks via RRI through the central server.

3. Hardware Platforms

Platform	Description
CHATBOX	Compact companion robot with 12 DOF and emotion-aware interaction capabilities.
Silbot	Mobile humanoid robot used for navigation and guiding roles, with expressive arm gestures.
IrobiQ	Healthcare-focused service robot for elderly care and chronic disease management.
Pepper	Semi-humanoid social robot with 20 DOF for customer service and education tasks.
EveR-4 H22	High-fidelity humanoid robotic head capable of advanced emotional expression.

4. Software Architecture

4.1. Server Modules

LLM Orchestrator – routes queries to appropriate LLMs (GPT, Gemini, DeepSeek, Llama)
RAG Engine – retrieves relevant information from stored conversations
Emotion Recognition – processes facial input for emotional context
Speech Modules – includes Whisper STT engines
RRI Handler – manages robot-to-robot task exchanges and message routing

4.2. Client Configuration

Each robot client is defined by a JSON configuration file specifying:

Role and personality prompt
Functional modules
LLM model selection
Hardware settings (e.g., Arduino, camera, ports)

Example:

{
  "robot_name": "CHATBOX",
  "client_id": "robot_01",
  "server_url": "ws://<server-ip>:5000",
  "modules": ["gpt", "emotion", "speech"],
  "robot_role": "You are ChatBox, a compassionate companion robot for children. You respond with empathy and encouragement.",
  " llm_config ": {
    "provider": " openrouter " ,
    "model": " openrouter - model - name - here !" ,
    "temperature": 0.7 ,
    "max_tokens": 1000
  },
  "hardware_configuration": {
    "arduino_port": "/dev/ttyUSB0",
    "arduino_baud": 115200,
    "arduino_timeout": 2.0
  }
}

5. Role-Adaptive Communication

Each robot client exhibits distinct communication characteristics derived from its role definition and personality prompt within the configuration file.
This ensures that the robot’s dialogue, tone, and behavioural output align naturally with its assigned function in the environment.

Communication Personalities

Role	Communication Style	Description
Receptionist Robot	Polite, informative, and concise	Welcomes users, provides guidance, and delegates navigation tasks.
Guide Robot	Friendly, directive, and confident	Physically escorts users, provides directional support.
Companion Robot	Warm, empathetic, and nurturing	Engages in supportive dialogue with emotional awareness.

The system integrates both linguistic adaptation and contextual consistency, using LLM configuration parameters such as temperature, top-p, and role-specific system prompts.

6. Robot-to-Robot Communication

The Robot-to-Robot Interaction (RRI) protocol enables contextual and cooperative task management between heterogeneous robots.
Messages are exchanged via the central WebSocket server, ensuring synchronised task flow and shared dialogue context.

Key Features

Task Delegation: Transfers a user’s request to the most suitable robot.
Context Retention: Maintains dialogue state and emotional context across robots.
Verbal Coordination: Expresses transitions through human-understandable speech.

Example Dialogue

Receptionist Robot: “The nearest restroom is at the end of the hallway. Silbot can guide you there.” Receptionist Robot: “Silbot, please escort the user to the restroom.” Silbot: “Certainly! Please follow me.”

This interaction flow demonstrates seamless task delegation between role-specific robots while preserving conversational context.

7. System Flow Diagram

8. Project Structure

multi-client-communication-system/
├── Modules/
│ ├── gpt_client.py
│ ├── rag_module.py 
│ ├── emotion_processor.py
│ └── speech_processor.py 
│
├── ServerController/
│ ├── client_manager.py
│ ├── database.py 
│ ├── supabase_client.py 
│ ├── request_router.py 
│ ├── server_controller.py 
│ ├── server.py 
│ └── websocket_manager.py
│
├── models/
│ └── emotion_recognition.pth
│
├── client/
│ ├── InputModules/
| │ ├── camera_input.py 
| │ ├── realsense_input.py 
| │ ├── text_input.py
| │ └── voice_input.py 
| |
│ ├── OutputModules/
| │ ├── arduino_output.py 
| │ ├── console_output.py
| │ ├── edge_tts_output.py
| │ └── tts_output.py
| |
│ ├── dockerfile 
│ ├── .env 
│ ├── client_config.json 
│ ├── client.py 
│ ├── robot.py 
│ ├── run_docker.sh
│ └── requirements.txt
│
└── README.md # Project documentation

9. Academic Significance

The Role-Adaptive Communication Framework contributes to the field of Human–Robot Interaction (HRI) and Multi-Agent Coordination by demonstrating:

Scalable and modular design for multi-robot collaboration
Adaptive LLM-driven dialogue aligned with robot roles and personalities
Persistent contextual awareness through RAG-based long-term memory
Seamless robot-to-robot communication for cooperative task execution

This research advances multi-agent conversational systems by introducing adaptive, personality-based communication across heterogeneous service robots.

Achievements

🏆 Best Context Design Award – IEEE RO-MAN 2025 (Amsterdam, Netherlands)
🏆 Innovative Solution Award – ICSR 2025 (Naples, Italy)

10. Setup Instructions

1. Server Setup

Clone the repository:

git clone git@github.com:JaySong/multi-client-communication-system.git
cd multi-client-communication-system

Run the server_controller:

conda create --name multi-robot-server python=3.10
conda activate multi-robot-server
cd v5.0.0
pip install -r requirement.txt
cd ServerController
python server_controller.py

Build and run the Docker container:

docker build -t multi-robot-server .
./run_docker.sh
python3 robot.py

Once the container is running, the central orchestration server will automatically initialise:

WebSocket listener on port 5000
RAG memory module (FAISS + MongoDB backend)
LLM manager (OpenAI, Gemini, DeepSeek, Llama)
RRI communication handler for inter-robot coordination

The server console will display connection logs for each connected client, confirming successful session registration.

2. Robot-to-Robot Demonstration

The Robot-to-Robot Interaction (RRI) protocol allows different robots to coordinate naturally through the central server. This enables smooth task delegation and consistent conversation flow between robots of different roles.

Interact with CHATBOX (the receptionist robot):

Ask: “Where is the nearest restroom?”

CHATBOX will respond with:

“The restroom is located at the end of the hallway. Silbot can guide you there.”

CHATBOX delegates the task via RRI:

“Silbot, please escort the user to the restroom.”

SILBOT acknowledges and executes the action:

“Certainly! Please follow me.”

This example demonstrates adaptive, role-consistent collaboration between robots within a shared communication environment.

11. Version History

A high-level overview of the project's evolution, from a single-robot prototype to a multi-agent framework.

v1.0.0: Initial prototype. Processing (emotion recognition, ChatGPT) ran locally on a laptop physically attached to an Arduino robot.
v2.0.0: Portability update. Migrated all processing from the laptop to an onboard Jetson Nano using Docker.
v2.1.0: Hybrid processing. Offloaded emotion recognition to a Google Colab server to free up Jetson resources; ChatGPT remained local on the Jetson.
v2.2.0: Server-centric. Moved all heavy processing (Emotion + ChatGPT) to the server, turning the Jetson into a thin client.
v2.3.0: Real-time communication. Replaced HTTP polling with WebSockets, dramatically reducing latency and token/API usage.
v2.3.2: Enhanced monitoring. Upgraded the web monitoring UI to include chat history, face detection boxes, and emotion confidence levels.
v3.0.0: Local network deployment. Major rewrite to remove the Colab/ngrok dependency. The system now runs entirely on a local network with a modular server architecture.
v3.1.0: Core AI features. Integrated Speech-to-Text (Faster-Whisper) and RAG (FAISS + MongoDB) for conversational memory.
v3.2.0: Hands-free operation. Implemented a keyword-based "wake word" system and forced USB mic selection, removing the need for keyboard input.
v4.0.0: Multi-client architecture. A complete server rewrite to support multiple concurrent clients with isolated resources. The client was also redesigned with a modular plugin system (Input/Output).
v4.0.1: Performance & RAG update. Optimized monitor streaming (300% faster) and re-integrated RAG using a robust Supabase (PostgreSQL) backend for persistent user memory.
v4.1.0: Hardware integration. Activated the client plugin system by adding new hardware modules: Intel RealSense (with Depth), standard webcams, and Arduino/ESP32 for physical emotion expression.
v4.1.1: Face recognition. Added a FaceRecoProcessor module using dlib for real-time face detection, encoding, and identification.
v5.0.0 (Current): Role-adaptive framework. Final rewrite focused on the research contribution: enabling Role-Adaptive Communication and Robot-to-Robot Interaction (RRI). Introduced a flexible LLM backend (supporting OpenRouter, Gemini, etc.) and a centralized orchestration server for multi-agent coordination.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Role-Adaptive Multi-Robot Communication Framework

1. Project Overview

Key Capabilities

2. System Architecture

Core Components

Communication Flow

3. Hardware Platforms

4. Software Architecture

4.1. Server Modules

4.2. Client Configuration

5. Role-Adaptive Communication

Communication Personalities

6. Robot-to-Robot Communication

Key Features

Example Dialogue

7. System Flow Diagram

8. Project Structure

9. Academic Significance

Achievements

10. Setup Instructions

1. Server Setup

2. Robot-to-Robot Demonstration

11. Version History

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 102 Commits
ChatBox @ c7312f8		ChatBox @ c7312f8
client		client
dlib		dlib
docker		docker
dockerV1		dockerV1
emotion_understanding		emotion_understanding
rag_indexes		rag_indexes
silbot		silbot
v1.0.0		v1.0.0
v2.0.0		v2.0.0
v2.1.0		v2.1.0
v2.2.0		v2.2.0
v2.3.0		v2.3.0
v2.3.2		v2.3.2
v3.0.0		v3.0.0
v3.1.0		v3.1.0
v3.2.0		v3.2.0
v4.0.0		v4.0.0
v4.0.1		v4.0.1
v4.1.0		v4.1.0
v4.1.1		v4.1.1
v5.0.0		v5.0.0
v6.0.0		v6.0.0
.gitignore		.gitignore
.gitmodules		.gitmodules
LICENSE		LICENSE
README.md		README.md
faiss_index.bin		faiss_index.bin
requirements.txt		requirements.txt
wordcloud_latest.png		wordcloud_latest.png

Folders and files

Latest commit

History

Repository files navigation

Role-Adaptive Multi-Robot Communication Framework

1. Project Overview

Key Capabilities

2. System Architecture

Core Components

Communication Flow

3. Hardware Platforms

4. Software Architecture

4.1. Server Modules

4.2. Client Configuration

5. Role-Adaptive Communication

Communication Personalities

6. Robot-to-Robot Communication

Key Features

Example Dialogue

7. System Flow Diagram

8. Project Structure

9. Academic Significance

Achievements

10. Setup Instructions

1. Server Setup

2. Robot-to-Robot Demonstration

11. Version History

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages