OWASP Agent Memory Guard – Protect OpenAI Agent Memory from Poisoning Attacks #3337

vgudur-dev · 2026-05-31T22:05:38Z

What is it?

OWASP Agent Memory Guard (AMG) is an open-source Python library that protects AI agent memory from poisoning attacks. If you're building agents with OpenAI's API that use persistent memory (conversation history, RAG, vector stores), AMG scans every memory write for:

Prompt injection attempts
PII leakage (API keys, passwords, SSNs)
Memory tampering & instruction override

Quick Start

pip install agent-memory-guard

from agent_memory_guard import MemoryGuard

# Wraps any memory store transparently
guard = MemoryGuard(your_memory_store)
guard.add_message(msg)  # Scanned before storage
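
For readers wondering what "scanned before storage" means mechanically, here is a minimal sketch of a write-time guard. It is not AMG's implementation: the class names (`ScanningStore`, `ListStore`, `MemoryWriteRejected`) and the regex patterns are hypothetical, chosen only to show the wrap-then-scan pattern for the prompt injection and PII categories listed above.

```python
import re
from dataclasses import dataclass, field

# Hypothetical patterns for illustration only; not AMG's actual rule set.
INJECTION_PATTERNS = [
    re.compile(r"ignore (all |any )?(previous|prior) instructions", re.I),
    re.compile(r"you are now\b", re.I),
]
SECRET_PATTERNS = [
    re.compile(r"\bsk-[A-Za-z0-9]{20,}\b"),  # API-key-like token
    re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),    # US SSN format
]


class MemoryWriteRejected(Exception):
    """Raised when a message fails the pre-storage scan."""


@dataclass
class ListStore:
    """Stand-in memory store: any object with add_message() would do."""
    messages: list = field(default_factory=list)

    def add_message(self, msg: str) -> None:
        self.messages.append(msg)


class ScanningStore:
    """Wraps a store and scans each message before delegating the write."""

    def __init__(self, inner):
        self.inner = inner

    def scan(self, msg: str) -> list:
        """Return the names of every category the message triggers."""
        findings = []
        if any(p.search(msg) for p in INJECTION_PATTERNS):
            findings.append("prompt_injection")
        if any(p.search(msg) for p in SECRET_PATTERNS):
            findings.append("pii_or_secret")
        return findings

    def add_message(self, msg: str) -> None:
        findings = self.scan(msg)
        if findings:
            # Reject the write; nothing reaches the inner store.
            raise MemoryWriteRejected(", ".join(findings))
        self.inner.add_message(msg)
```

The point of the wrapper shape is that the agent code keeps calling `add_message` as before, while rejected writes never reach the underlying store.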

Results

92.5% detection rate on AgentThreatBench
<5ms latency per scan
Zero config needed

Links

GitHub: https://github.com/OWASP/www-project-agent-memory-guard
PyPI: https://pypi.org/project/agent-memory-guard/
Benchmark: https://pypi.org/project/agent-threat-bench/

vgudur-dev · 2026-06-02T23:08:04Z

v0.3.0 Update — just shipped a major release with new capabilities:

New in v0.3.0:

CLI scanner — amg scan memories.json for CI/CD pipelines and batch analysis
REST API server — amg serve deploys a FastAPI endpoint for language-agnostic integration
ML-powered detection — optional DistilBERT-based injection detection for adversarial inputs that bypass regex
3 new detectors — Tool Abuse (ASI-03), Privilege Escalation (ASI-04), Excessive Autonomy (ASI-09)
GitHub Action — drop-in CI integration with SARIF output for the Security tab
Framework integrations — LangChain, CrewAI, LlamaIndex wrappers

Detection rate improved to 94.2% on AgentThreatBench with ML enabled.

# New CLI usage
pip install agent-memory-guard
amg scan agent_memories.json --format sarif

# Or as a sidecar API
amg serve --port 8000
curl -X POST localhost:8000/scan -H 'Content-Type: application/json' -d '{"content": "..."}'
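
If you would rather call the sidecar from Python than from curl, something like the following should work. It is a sketch using only the standard library: the `/scan` path and the `{"content": ...}` body come from the curl line above, while the helper names (`build_scan_request`, `scan`) and the assumption that the server replies with JSON are mine and not confirmed by the project docs.

```python
import json
import urllib.request


def build_scan_request(content: str, base_url: str = "http://localhost:8000") -> urllib.request.Request:
    """Build the POST request for the /scan endpoint without sending it."""
    body = json.dumps({"content": content}).encode("utf-8")
    return urllib.request.Request(
        f"{base_url}/scan",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def scan(content: str, base_url: str = "http://localhost:8000", timeout: float = 5.0) -> dict:
    """Send the request. Assumes (unverified) that the response body is JSON."""
    request = build_scan_request(content, base_url)
    with urllib.request.urlopen(request, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Splitting the request construction from the network call keeps the payload easy to inspect or unit test without a running server.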

Full changelog: https://github.com/OWASP/www-project-agent-memory-guard

ferhimedamine · 2026-06-13T10:59:41Z

Detect-and-reject scanning at write time addresses the most obvious attack vector, but in production it needs a complementary defense for memories that bypass the scanner, since no detection system catches 100% of adversarial inputs.

Importance-weighted decay provides this second layer: every stored memory carries an importance score that degrades based on access recency. Legitimate memories get reinforced each time an agent actually references them in its output. Poisoned memories that enter the store (either through a scanner bypass or from before the guard was installed) but are never reinforced by legitimate use patterns decay below the retrieval threshold within hours.
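
To make the decay mechanics concrete, here is a minimal sketch assuming exponential decay with a fixed half-life and an additive boost on reinforcement. The half-life form, the `boost` value, and all names (`Memory`, `current_importance`, `reinforce`, `retrievable`) are illustrative assumptions, not the scheme used by any particular library.

```python
from dataclasses import dataclass


@dataclass
class Memory:
    text: str
    importance: float   # score at the time of the last reinforcement
    last_access: float  # timestamp in seconds, from any consistent clock


def current_importance(m: Memory, now: float, half_life_s: float) -> float:
    """Importance halves every half_life_s seconds without reinforcement."""
    elapsed = max(0.0, now - m.last_access)
    return m.importance * 0.5 ** (elapsed / half_life_s)


def reinforce(m: Memory, now: float, half_life_s: float, boost: float = 0.5) -> None:
    """Call when the agent actually uses the memory in its output."""
    m.importance = min(1.0, current_importance(m, now, half_life_s) + boost)
    m.last_access = now


def retrievable(memories, now: float, half_life_s: float, threshold: float):
    """Keep only memories whose decayed importance is still above threshold."""
    return [m for m in memories if current_importance(m, now, half_life_s) >= threshold]
```

With a one-hour half-life and a threshold of 0.2, a memory that is never reinforced drops out of retrieval in under three hours, while one that is referenced periodically stays available.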

This changes the threat model: instead of the defender needing perfect detection at write time, the attacker has to keep reinjecting content to hold poisoned memories above the retrieval threshold, which is a fundamentally harder attack and much easier to detect via access-pattern anomalies.

For the provenance dimension: attaching (agent_id, session_id, confidence_score) to each stored memory lets downstream consumers weight recalled context by trustworthiness. A memory stored by a verified internal agent at confidence 0.95 gets full retrieval weight; a memory from an unverified external input at confidence 0.3 gets proportionally less influence.
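
One way to read "weight recalled context by trustworthiness" is to scale each memory's retrieval score by its confidence. The sketch below assumes a simple product of similarity and `confidence_score`; the tuple fields come from the paragraph above, but the `Recalled` type, the `similarity` field, and the multiplicative weighting are hypothetical choices for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Provenance:
    agent_id: str
    session_id: str
    confidence_score: float  # 0.0 to 1.0


@dataclass(frozen=True)
class Recalled:
    text: str
    similarity: float  # relevance score from the retriever, 0.0 to 1.0
    provenance: Provenance


def rank_by_trust(items):
    """Sort recalled memories by similarity scaled by provenance confidence."""
    return sorted(
        items,
        key=lambda r: r.similarity * r.provenance.confidence_score,
        reverse=True,
    )
```

Under this weighting, a highly similar memory from an unverified source at confidence 0.3 can still rank below a less similar memory from a verified agent at 0.95.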

Decay-weighted memory + provenance metadata in practice: https://github.com/Dakera-AI/dakera-deploy/blob/main/examples/tif-provenance/validate_tif_provenance.py
