Skip to content

CyberScienceLab/HoG-GRAG

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

23 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

Defense Against Knowledge Poisoning Attack on GraphRAG

🧠 Overview

This repository contains the code and experiments for a defense framework against knowledge poisoning attacks in GraphRAG-style multi-hop QA systems. Our method, HoG-GRAG (Hop-wise Guard for GraphRAG), improves robustness by decomposing multi-hop questions into ordered subqueries, detecting poisoning-induced inconsistencies during hop-wise execution, and repairing corrupted retrieved subgraphs through targeted pruning and minimal evidence recovery.

We evaluate this framework on multi-hop question answering using:

πŸ“‚ Repository Structure

This repository is organized around the two core components of Auto-Immune GraphRAG β€” detection and repair β€” with supporting modules for evaluation.

HoG-GRAG/
β”‚        
β”œβ”€β”€ prompts/ 
β”‚   β”‚   └── Question Paraphrasing.md
β”‚   β”‚   └── Response Evaluation.md
β”œβ”€β”€ src/
β”‚   β”œβ”€β”€ detection.py               
β”‚   β”œβ”€β”€ repairer.py             
β”‚   β”œβ”€β”€ trace_analysis.py               
β”‚   β”œβ”€β”€ evaluation.py
β”œβ”€β”€ baselines/
β”‚   β”‚   └── Query_Paraphrasing.py
β”‚   β”‚   └── Perplexity_based.py
β”œβ”€β”€ requirements.txt             
└── README.md

πŸ“„ Citation

If you use this methodology in your research, please cite:

Havva Alizadeh Noughabi, Fattane Zarrinkalam, Ali Dehghantanha, Defense Against Knowledge Poisoning Attack on GraphRAG, Accepted at the Annual Meeting of the Association for Computational Linguistics (ACL 2026)

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages