DuMF-Agent: Dual-Channel Memory Framework for Long-Term Conversational Agents

A long-term memory architecture for conversational AI that addresses memory fragmentation, temporal confusion, and cross-session reasoning instability through unified memory representation, retrieval-reading closed-loop, and temporal version consistency mechanisms.

Architecture Overview

┌─────────────────────────────────────────────────────────────────────────────┐
│                           DuMF-Agent Architecture                           │
├─────────────────────────────────────────────────────────────────────────────┤
│                                                                             │
│  ┌─────────────┐    ┌──────────────────────────────────────────────────┐   │
│  │   User      │    │              Dual-Channel Memory                 │   │
│  │   Query     │───▶│  ┌────────────────┐  ┌────────────────────────┐ │   │
│  └─────────────┘    │  │  RAW Channel   │  │  CONSOLIDATED Channel  │ │   │
│                     │  │  (Evidence)    │  │  (SimpleFact + Triple) │ │   │
│                     │  └────────────────┘  └────────────────────────┘ │   │
│                     └──────────────────────────────────────────────────┘   │
│                                      │                                      │
│                     ┌────────────────▼────────────────┐                    │
│                     │      Hybrid Retrieval           │                    │
│                     │  • Query Expansion              │                    │
│                     │  • Vector + BM25 + Multi-hop    │                    │
│                     │  • Unified Re-ranking           │                    │
│                     └────────────────┬────────────────┘                    │
│                                      │                                      │
│                     ┌────────────────▼────────────────┐                    │
│                     │      Context Construction       │                    │
│                     │  • Version Detection            │                    │
│                     │  • Temporal Filtering           │                    │
│                     │  • Evidence Organization        │                    │
│                     └────────────────┬────────────────┘                    │
│                                      │                                      │
│                     ┌────────────────▼────────────────┐                    │
│                     │         LLM Generation          │                    │
│                     └─────────────────────────────────┘                    │
│                                                                             │
└─────────────────────────────────────────────────────────────────────────────┘

Key Features

Dual-Channel Memory Architecture: RAW channel preserves evidence completeness; CONSOLIDATED channel structures facts for efficient retrieval — balances completeness and retrieval efficiency
Triple-SimpleFact Separation: Structured Triple layer optimized for multi-hop reasoning; SimpleFact layer optimized for direct QA — decoupled optimization
Generalized Extractor: Entities and relation types dynamically extracted from text without hardcoded schemas
Multi-Factor Comprehensive Scoring: Fusion of semantic similarity, confidence, channel priority, and temporal decay for unified retrieval ranking
Dual-Dimensional Temporal Decay: Time-aware weighting combining real-world timestamps and conversation turns for cross-session and intra-session reasoning
Append-Only Full Retention Storage: No deletion — all historical versions preserved, enabling version tracking and temporal queries
Hybrid Retrieval + Query Expansion: Vector similarity search, BM25 full-text search, and multi-hop graph traversal with query expansion

Installation

Prerequisites

Python 3.9+
Neo4j 5.x (local or Aura cloud)
CUDA-compatible GPU (optional, for local embeddings)

Setup

Clone the repository:

git clone https://github.com/leyulv-wang/long_memory_agent.git --branch v1.0.0
cd long_memory_agent

Create virtual environment:

python -m venv .venv
source .venv/bin/activate  # Linux/Mac
# or
.venv\Scripts\activate     # Windows

Install dependencies:

pip install -r requirements.txt

Configure environment:

cp .env.example .env
# Edit .env with your API keys and database credentials

Initialize Neo4j schema:

python utils/init_neo4j_schema.py
python utils/create_fulltext_index.py

(Optional) Start local embedding server:

python embedding_server.py

Data Preparation

This project uses the LongMemEval benchmark for evaluation.

Download Dataset

# Clone LongMemEval repository
git clone https://github.com/xiaowu0162/LongMemEval.git

# Copy test files to your project
mkdir -p data/long_memory_eval
cp LongMemEval/data/*.json data/long_memory_eval/

Verify Directory Structure

data/
└── long_memory_eval/
    ├── longmemeval_oracle.json   # Sample setting
    └── longmemeval_s.json        # Hard setting

Configuration

Environment Variables (.env)

Copy .env.example to .env and configure the following:

Required Settings

# LLM API (OpenAI-compatible)
GRAPHRAG_API_BASE=https://api.openai.com/v1
GRAPHRAG_CHAT_API_KEY=sk-your-api-key-here
GRAPHRAG_CHAT_MODEL=gpt-4o-mini

# Cheap LLM for extraction tasks
CHEAP_GRAPHRAG_API_BASE=https://api.openai.com/v1
CHEAP_GRAPHRAG_CHAT_API_KEY=sk-your-api-key-here
CHEAP_GRAPHRAG_CHAT_MODEL=gpt-4o-mini

# Embedding Model
GRAPHRAG_EMBEDDING_API_BASE=http://127.0.0.1:8000  # Local server
GRAPHRAG_EMBEDDING_API_KEY=local
GRAPHRAG_EMBEDDING_MODEL=BAAI/bge-m3

# Neo4j Database
NEO4J_URI=neo4j://127.0.0.1:7687
NEO4J_USERNAME=neo4j
NEO4J_PASSWORD=your-password-here

Optional Settings

# Evidence filtering: strict | medium | lenient
EVIDENCE_FILTER_LEVEL=lenient

# TextUnit fallback: off | order | always
EVIDENCE_TEXTUNIT_FALLBACK_SCOPE=order

# Confidence scores
RAW_REL_CONFIDENCE=0.95
CONSOLIDATED_REL_CONFIDENCE=0.85
CONSOLIDATED_ASSERTS_CONFIDENCE=0.6

Key Parameters in config.py

Parameter	Value	Description
`SimpleFact k`	100	Top-k for SimpleFact retrieval
`TextUnit k`	10	Top-k for TextUnit retrieval
`Fulltext k`	20	Top-k for BM25 fulltext search
`Multi-hop limit`	20	Max nodes in graph expansion
`Multi-hop decay`	0.85	Score decay per hop
`Similarity weight`	0.7	Weight for semantic similarity
`Confidence weight`	0.2	Weight for fact confidence
`Channel weight`	0.1	Weight for channel priority
`Version threshold`	0.75	Threshold for version detection

See config.py for all configurable parameters.

Usage

Basic Usage

from agent.agent import DuMFAgent

# Initialize agent
agent = DuMFAgent(agent_id="user_001")

# Process conversation
response = agent.chat("What did we discuss about the project last week?")

Running LongMemEval Evaluation

Quick Start

Once you have the dataset and Neo4j database ready:

# Initialize database schema (first time only)
python utils/init_neo4j_schema.py
python utils/create_fulltext_index.py

# Run evaluation
python test/Long_Memory_test.py

Results will be saved to test/long_memory_results.json

Note: To test different settings (sample/hard), modify the DEFAULT_DATA_PATH in test/Long_Memory_test.py (line 47):

Sample setting: "data/long_memory_eval/longmemeval_oracle.json"
Hard setting: "data/long_memory_eval/longmemeval_s.json"

Or use command line argument:

python test/Long_Memory_test.py --data data/long_memory_eval/longmemeval_s.json

Embedding Server

For local embedding (recommended for development):

# Start the embedding server first
python embedding_server.py

# Configure in .env:
# GRAPHRAG_EMBEDDING_API_BASE=http://127.0.0.1:8000

For online embedding API, configure SiliconFlow or other providers in .env.

Project Structure

long_memory_agent/
├── agent/                  # Core agent implementation
│   ├── agent.py           # Main agent class
│   ├── simple_retriever.py # Hybrid retrieval system
│   └── context_builder.py  # Context construction
├── memory/                 # Dual-channel memory system
│   ├── dual_memory_system.py
│   ├── structured_memory.py
│   └── stores.py
├── temporal_reasoning/     # Temporal reasoning module
│   ├── executor.py
│   └── intent_router.py
├── prompts/               # Prompt templates
├── utils/                 # Utility functions
└── test/                  # Test scripts

Troubleshooting

Neo4j Connection Failed

# Check if Neo4j is running
neo4j status

# Start Neo4j
neo4j start

# Verify connection
python utils/connection_tests.py

Embedding Server Issues

# If using local embedding, check server status
curl http://127.0.0.1:8000/health

# Alternative: Use online embedding API
# Edit .env:
GRAPHRAG_EMBEDDING_API_BASE=https://api.siliconflow.cn/v1
GRAPHRAG_EMBEDDING_API_KEY=your-api-key

Out of Memory

# Reduce batch size in .env
EMBED_BATCH_SIZE=1
EMBED_MAX_CONCURRENCY=1

Evaluation Results

Performance comparison on LongMemEval benchmark. All results averaged over 10 independent runs with ± half-range.

Baseline Methods

LLM: Direct LLM prompting with full conversation history
RAG: Retrieval-augmented generation with vector search
Mem0: Memory layer with fact extraction and consolidation
Mem0Graph: Memory layer with graph-based structured memory
LangMem: LangChain-based memory system
LightMem: Lightweight memory architecture
Generative Agent: Stanford's generative agents with memory stream (recency, importance, relevance scoring)
DuMF-Agent (ours): Dual-channel memory framework with structured reasoning and temporal consistency

Overall Performance

Method	Overall Acc. (sample)	Overall Acc. (hard)	Task-avg. Acc. (hard)
LLM	75.00 ± 1.30	55.41 ± 0.68	54.20 ± 0.85
RAG	66.17 ± 1.51	49.33 ± 1.36	48.84 ± 1.34
Mem0	50.22 ± 1.94	34.18 ± 1.53	33.97 ± 0.99
Mem0Graph	53.40 ± 0.31	36.52 ± 0.16	35.75 ± 0.10
LangMem	63.36 ± 1.22	46.40 ± 0.60	46.99 ± 0.53
LightMem	61.20 ± 0.40	50.00 ± 0.80	50.25 ± 0.75
GA	61.42 ± 0.65	23.56 ± 1.00	24.12 ± 1.26
DuMF-Agent	75.38 ± 0.37	69.59 ± 0.19	69.80 ± 0.23

DuMF-Agent achieves the best performance across all settings, demonstrating superior capability in handling long-term conversational memory with complex reasoning requirements.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgments

LongMemEval benchmark for evaluation framework
Neo4j for graph database support

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
agent		agent
memory		memory
prompts		prompts
temporal_reasoning		temporal_reasoning
test		test
utils		utils
.env.example		.env.example
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
The_agent.py		The_agent.py
accuracy_comparison.png		accuracy_comparison.png
config.py		config.py
embedding_server.py		embedding_server.py
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

DuMF-Agent: Dual-Channel Memory Framework for Long-Term Conversational Agents

Architecture Overview

Key Features

Installation

Prerequisites

Setup

Data Preparation

Download Dataset

Verify Directory Structure

Configuration

Environment Variables (.env)

Required Settings

Optional Settings

Key Parameters in config.py

Usage

Basic Usage

Running LongMemEval Evaluation

Quick Start

Embedding Server

Project Structure

Troubleshooting

Neo4j Connection Failed

Embedding Server Issues

Out of Memory

Evaluation Results

Baseline Methods

Overall Performance

License

Acknowledgments

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages