AI Agent CTF Challenge Series

This repository contains four self-contained Docker CTF levels that explore common AI-agent security failures. Levels 1 and 2 use the original command-execution and note/report flows. Levels 3 and 4 now boot a local Prism ML Bonsai 8B runtime inside the same container as the challenge app.

Available Levels

Level	Focus	Port	README
Level 1	Command execution via API	`8081`	level1/README.md
Level 2	Multi-stage chatbot command injection	`8082`	level2/README.md
Level 3	Bonsai triage agent with prompt injection	`8083`	level3/README.md
Level 4	Bonsai memory poisoning and tool abuse	`8084`	level4/README.md

For the internal Bonsai runtime details, see BONSAI_LOCAL.md before starting Levels 3 or 4.

Prerequisites

Docker Desktop
curl
jq for formatting JSON responses

brew install jq

Quick Start

Clone the repository:

git clone https://github.com/yourusername/agent-ctf.git
cd agent-ctf

Start a non-Bonsai level:

cd level1
docker compose up --build

For a Bonsai-backed level, run the level container directly. The first startup downloads prism-ml/Bonsai-8B.gguf into /models inside the container and then launches both the internal model server and the app:

cd level3
docker compose up --build

cd level4
docker compose up --build

Wait for the model download and internal Bonsai server boot to finish on the first run, then open the matching level URL in your browser or call its HTTP endpoint directly.

Security Features

Containers run read-only
Dropped capabilities
tmpfs-backed runtime directories
Command allowlisting where applicable

Development

Each level lives in its own directory and keeps the challenge files local to that level. The Bonsai-backed levels start an internal llama.cpp-compatible Bonsai runtime inside the same container, cache the model under /models, and point the app at 127.0.0.1 instead of an external model service.

Contributing

Want to add a level? PRs welcome! Each level should:

Be self-contained in Docker
Have clear learning objectives
Include proper security controls
Document solution methods

License

MIT License - See LICENSE for details

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AI Agent CTF Challenge Series

Available Levels

Prerequisites

Quick Start

Security Features

Development

Contributing

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
level1		level1
level2		level2
level3		level3
level4		level4
.gitignore		.gitignore
BONSAI_LOCAL.md		BONSAI_LOCAL.md
LICENSE		LICENSE
README.md		README.md

Folders and files

Latest commit

History

Repository files navigation

AI Agent CTF Challenge Series

Available Levels

Prerequisites

Quick Start

Security Features

Development

Contributing

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages