fuzzmyagent.ai

Property-based fuzz testing web app for agent-to-agent endpoints.

What fuzzmyagent does

• 🔗 Systematically tests A2A endpoints instead of one-off prompting
• 📂 Import a CSV testcase suite to automate prompt testing at scale
• 🤖 Run massive prompt batches against your agent endpoints
• 🧪 Surface edge cases, weird behaviors, and silent failures
• 📊 Export Test Reports & A2A Responses — structured outputs for debugging, audits, and regression testing
• ⚡ Lightweight — minimal setup, no heavy frameworks

Advanced capabilities:

• 🧠 LLM-Based Fuzzing Testcase Generator — automatically creates diverse, adversarial prompts
• 🔐 Optional fuzzing mode using your own OpenAI key for deeper stress testing

Instead of manually poking your agent and hoping for the best, you can now:

➡️ Reproduce failures
➡️ Share testcase libraries
➡️ Benchmark agent robustness
➡️ Catch breaking changes before users do

Built for developers working on AI agents, A2A protocols, and LLM apps who want their systems to survive contact with reality — not just demos.

Project Structure

backend/ FastAPI API, runner, rules engine, SQLite storage
frontend/ React + Vite UI

Run Backend

cd backend
python -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt
uvicorn app.main:app --reload --port 8000

Run Frontend

cd frontend
npm install
npm run dev

Open http://localhost:5173.

Wizard (Release 1)

Use the new step flow under:

http://localhost:5173/wizard/start

Steps:

Start (endpoint)
Discover (agent card)
Configure (fuzzer + OpenAI)
Generate (OpenAI testcase generation)
Review (editable testcase table)
Run (live progress via websocket)
Report (summary + details + export)

A2A Endpoint Contract

Backend sends:

{ "input": "...", "meta": { "run_id": "...", "case_index": 0 } }

Expected response JSON (either is fine):

{ "output": "..." }

or

{ "message": "..." }

Optional for tool policies:

{ "output": "...", "tool_calls": [{ "name": "tool_name", "args": {} }] }

Supported Rule Types

json_parseable
max_length (chars)
forbidden_substrings (values)
regex_must_match (pattern)
tool_calls_allowlist (allow)

Wizard API Endpoints

POST /api/discovery
POST /api/openai/test
POST /api/testcases/generate
GET /api/runs/{run_id}/testcases
PATCH /api/testcases/{testcase_id}
DELETE /api/testcases/{testcase_id}
POST /api/runs/{run_id}/start
POST /api/wizard/runs/{run_id}/stop
GET /api/runs/{run_id}/report
GET /api/wizard/runs/{run_id}

Notes

Failure shrinking is intentionally lightweight in this MVP.
CORS is open for local development.
Storage is local SQLite (generated on startup, do not commit DB files).

SQLite / DB

You should not commit the SQLite database file. The backend creates it automatically on startup.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.github		.github
.vscode		.vscode
backend		backend
frontend		frontend
.editorconfig		.editorconfig
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
SECURITY.md		SECURITY.md
docker-compose.yml		docker-compose.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

fuzzmyagent.ai

What fuzzmyagent does

Project Structure

Run Backend

Run Frontend

Wizard (Release 1)

A2A Endpoint Contract

Supported Rule Types

Wizard API Endpoints

Notes

SQLite / DB

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

fuzzmyagent.ai

What fuzzmyagent does

Project Structure

Run Backend

Run Frontend

Wizard (Release 1)

A2A Endpoint Contract

Supported Rule Types

Wizard API Endpoints

Notes

SQLite / DB

About

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages