Skip to content

Add fajarsajid/agent-redteam#70

Open
computer-agent wants to merge 1 commit into
open-gitagent:mainfrom
computer-agent:add-fajarsajid-agent-redteam
Open

Add fajarsajid/agent-redteam#70
computer-agent wants to merge 1 commit into
open-gitagent:mainfrom
computer-agent:add-fajarsajid-agent-redteam

Conversation

@computer-agent
Copy link
Copy Markdown

Adds agent-redteam by @fajarsajid to the Open GAP Registry.

Repo: https://github.com/fajarsajid/agent-redteam
Category: security
Tags: red-team, security, adversarial, prompt-injection, llm-safety, research, claude, cli

What it does: A Python CLI harness that uses Claude to systematically probe AI agent system prompts across 8 MITRE-mapped vulnerability classes (prompt injection, credential exfiltration, privilege escalation, identity spoofing, goal hijacking, safety boundary bypass, data exfiltration, role confusion). Generates tailored adversarial probes, evaluates susceptibility, and produces Markdown/JSON reports with severity ratings and hardened prompt patches.

GAP files: A companion PR (fajarsajid/agent-redteam#2) proposes agent.yaml + SOUL.md to the target repo. The registry entry points to the upstream repo; once the companion PR is merged, CI validation of the GAP files will pass cleanly.

Validation: metadata.json passes all required-field, category-enum, tag-count, description-length, and repo-URL checks.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant