Flight deck for AI agents. Launch from Claude Code (or any MCP client), recover results anywhere, run background missions while you sleep. We're not another agent framework β we're the roads, fuel, and traffic laws that make persistent, self-improving agents the new normal.
- Overview
- Features
- Architecture
- Quick Start
- CLI Commands & API
- Configuration
- Squadrons (Built-in Agents)
- Custom Squadrons
- Integration with commodore-protocol & Fleet
- Repository Structure
- Development
- Roadmap
- License
DeckBoss gives every MCP-native agent (Claude Code, Grok, Ollama, Cloudflare-native, ManusAI...) a persistent, globally-distributed, continually-learning backend that runs on your free Cloudflare account.
Claude handles reasoning. DeckBoss handles memory, orchestration, and background execution.
Why DeckBoss?
| Problem | DeckBoss Solution |
|---|---|
| Claude forgets everything when the laptop closes | Missions survive via Durable Objects + alarms |
| No background execution for long-running tasks | Edge-native parallel execution across 330+ locations |
| No persistent memory across sessions | Cognitive model with semantic + episodic + procedural memory |
| Context window fills up fast | Offload indexing, scraping, monitoring to agent squadrons |
| Expensive to run | Free tier: 10K AI inferences/day, 200K vectors, 5 GB D1, zero surprise bills |
- Free β 10k AI inferences/day, 200k vectors, 5 GB D1, zero surprise bills
- Persistent β Missions survive laptop closure via Durable Objects + alarms
- Parallel & Edge-Native β 330+ locations, no Docker, no local infra
- Yours β Your CF account, your data, your cognitive model forever
- General-purpose first β One universal MCP server + Director core
- Hyper-specific second β Squadrons load as dynamic plugins (no monolith bloat)
| Feature | Description | Status |
|---|---|---|
| MCP Server | Native Model Context Protocol integration with Claude Code, Windsurf, Grok, etc. | β Stable |
| Director Durable Object | Per-user stateful brain β orchestration, state, continual learning (~200 LOC) | β Stable |
| Mission Manager | Episodic memory + alarm-based scheduling for background tasks | β Stable |
| Cognitive Model | Hybrid memory: semantic (Vectorize), episodic (SQLite), procedural (runbooks), graph (relations) | β Stable |
| Memory Weaver | Continual learning loop β post-mission refinement via LLM distillation | π§ In Progress |
| Squadron Router | Embedding-based dispatch to specialized agents | π§ In Progress |
| Hybrid Search | Reciprocal Rank Fusion of D1 BM25 + Vectorize semantic search | β Stable |
| WebSocket Sync | Real-time notifications, auto-reconnect with state reconciliation | β Stable |
| Quota Governance | 80% quota warnings, graceful degradation, audit trail | π§ In Progress |
| Python CLI | Typer-based CLI with Rich output for deckboss init, launch, status |
π§ In Progress |
| A2A Routing | Agent-to-Agent protocol for cross-agent interoperability (Grok, ManusAI, etc.) | π Planned |
| Squadron Marketplace | Shareable plugin ecosystem on deckboss.ai | π Planned |
| Cognitive Graph Layer | Entity/relation knowledge graph for deep context understanding | π Planned |
- "Index my monorepo while I sleep" β Launch Archivist in background; wake up to a fully searchable semantic index of every file.
- "Monitor these 50 URLs and alert me on changes" β Sentry runs 24/7 on Cloudflare's edge, zero local compute.
- "Research this topic across 20 sources" β Scout + Archivist squadron fetches, summarizes, and cross-references in parallel.
βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β DECKBOSS AGENT EDGE OS β
βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ€
β β
β ββββββββββββββββββββ β
β β MCP Clients β Claude Code β’ Windsurf β’ Grok β
β β (Any IDE/tool) β Ollama β’ ManusAI β’ Custom β
β ββββββββββ¬ββββββββββ β
β β MCP JSON-RPC (tools/call, resources, sampling) β
β β + Streamable HTTP + SSE transport β
β βΌ β
β ββββββββββββββββββββ β
β β DeckBoss CLI β Node.js / Python bridge β
β β (MCP Adapter) β βββ WebSocket to Director β
β β β βββ Quota checking β
β β β βββ Auto-reconnect + state sync β
β ββββββββββ¬ββββββββββ β
β β WSS (WebSocket Secure) β
β βΌ β
β ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ β
β β DIRECTOR β Durable Object (per-user, handmade core) β β
β β β β
β β ββββββββββββββββββ ββββββββββββββββ βββββββββββββββ β β
β β β Mission β β Cognitive β β Memory β β β
β β β Manager β β Model β β Weaver β β β
β β β β β β β β β β
β β β β’ Queue/Exec β β β’ Episodic β β β’ Refine β β β
β β β β’ Alarms β β β’ Semantic β β β’ Distill β β β
β β β β’ Retries β β β’ Procedural β β β’ Re-embed β β β
β β β β’ Track status β β β’ Graph β β β’ Upsert β β β
β β βββββββββ¬βββββββββ ββββββββββββββββ βββββββββββββββ β β
β β β β β
β β βββββββββΌβββββββββ β β
β β β Squadron β Embedding-based routing β β
β β β Router β Cosine similarity dispatch β β
β β βββββββββ¬βββββββββ β β
β ββββββββββββΌβββββββββββββββββββββββββββββββββββββββββββββββββ β
β β HTTP POST /execute β
β βΌ β
β ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ β
β β SQUADRON PLUGINS (extensible, zero-bloat) β β
β β β β
β β ββββββββββββ ββββββββββββ ββββββββββββ ββββββββββββ β β
β β βArchivist β β Scout β βMachinist β β Sentry β β β
β β βCode indexβ βWeb fetch β βCode exec β βMonitoringβ β β
β β βRAG+searchβ βSummarize β βTransform β βAlerting β β β
β β ββββββββββββ ββββββββββββ ββββββββββββ ββββββββββββ β β
β β β β
β β ββββββββββββ ββββββββββββ ββββββββββββ β β
β β β Custom β β Custom β β Custom β Marketplace β β
β β β Agent β β Agent β β Agent β Plugins β β
β β ββββββββββββ ββββββββββββ ββββββββββββ β β
β ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ β
β β β
β βΌ β
β ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ β
β β CLOUDFLARE PRIMITIVES (free-tier first, user-owned) β β
β β β β
β β Workers AI β Vectorize β D1 SQLite β R2 β β
β β (embed/LLM) β (10M vec) β (5GB) β (10GB) β β
β β β β
β β Queues β Workflows β Durable Objects β KV β β
β ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ β
β β
βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
βββββββββββ βββββββββββββ βββββββββββ ββββββββββββββ
β Queued ββββββΆβ Scheduled ββββββΆβ Active ββββββΆβ Completed β
βββββββββββ βββββββββββββ ββββββ¬βββββ ββββββββββββββ
β² β
β βΌ
β ββββββββββββββββββββββββββββββββ
β β Failed (retry with backoff) β
β ββββββββββββββββ¬ββββββββββββββββ
β β
βββββββββββββββββββββββββββββ
max retries exceeded β Terminal failure
βββββββββββββββββββββββββββββββββββββββββββββββββββ
β COGNITIVE MODEL β
βββββββββββββββββββββββββββββββββββββββββββββββββββ€
β Episodic β Director SQLite + alarms β
β Semantic β Vectorize (BGE) + RRF + D1 BM25 β
β Procedural β Stored runbooks + distilled skillsβ
β Graph β SQLite edges (entity/relation) β
βββββββββββββββββββββββββββββββββββββββββββββββββββ€
β Memory Weaver β Continual learning loop β
β β Post-mission refinement β
β β Llama-3.2-3B distillation β
βββββββββββββββββββββββββββββββββββββββββββββββββββ
| Layer | Approach | Rationale |
|---|---|---|
| Director + Weaver | Handmade (~200 LOC) | Owns orchestration, state, continual learning β control where it matters |
| MCP Server | Glue (official SDK) | Zero custom protocol, single Worker entrypoint |
| Cloudflare Primitives | Glue (official APIs) | Vectorize, D1, R2, Workers AI β zero maintenance |
| Squadrons | Plugin (separate Workers/DOs) | Zero overhead until used β specificity without bloat |
Result: General-purpose lightness + surgical specificity. No framework tax. One wrangler deploy.
| Tool | Version | Purpose |
|---|---|---|
| Node.js | 20.x LTS | Runtime |
| pnpm | 8.x | Package manager (workspaces) |
| Wrangler CLI | 3.22+ | Cloudflare deployment |
| Git | 2.40+ | Version control |
# Option A β TypeScript (recommended for existing users)
npm install -g deckboss
# Option B β Python-first (recommended for MCP/Code Mode)
pipx install deckbossdeckboss initThis does everything in under 60 seconds:
- Opens browser to log you into Cloudflare
- Creates your personal Director Durable Object
- Provisions Vectorize index + D1 database + R2 bucket with correct bindings
- Deploys the lightweight MCP Server (Python or TS)
- Generates
~/.config/deckboss/config.json+ local.env - Prints one-line Claude Code integration command
claude mcp add deckboss -- npx deckboss mcp-server"Launch archivist to index my monorepo"
| Command | Description | Example |
|---|---|---|
deckboss init |
Link Cloudflare account, provision all resources | deckboss init --runtime python |
deckboss launch <agent> <mission> |
Execute mission immediately | deckboss launch scout fetch |
deckboss launch ... --background |
Queue for background execution | deckboss launch archivist index --background |
deckboss status <id> |
Check mission status | deckboss status 7A3F |
deckboss recover <id> |
Retrieve completed mission result | deckboss recover 7A3F |
deckboss squadron |
List active squadrons | deckboss squadron |
deckboss squadron create <name> |
Create custom squadron | deckboss squadron create my-reviewer --template scout --lang python |
deckboss mcp-server |
Run MCP server (Claude Code integration) | deckboss mcp-server --port 8000 |
deckboss deploy mcp |
Deploy MCP server to Cloudflare | deckboss deploy mcp --runtime python |
| Tool | Description | Key Parameters |
|---|---|---|
deckboss_launch |
Launch a persistent mission on the edge | agent, mission, payload, background |
deckboss_status |
Check mission status | missionId |
deckboss_query_memory |
Search cognitive model (hybrid search) | query, filters |
deckboss_generate_image |
Generate images via FLUX | prompt, style |
deckboss_code_mode_exec |
Execute compact Python against SDK | code |
# Run locally for development
deckboss mcp-server --port 8000
# Deploy as Cloudflare Python Worker (production)
deckboss deploy mcp --runtime pythonClaude discovers it automatically via /.well-known/mcp.json.
| Variable | Required | Description |
|---|---|---|
CF_ACCOUNT_ID |
Yes | Cloudflare account ID |
CF_API_TOKEN |
Yes | Token with Workers, D1, Vectorize, KV permissions |
| Path | Purpose |
|---|---|
~/.config/deckboss/config.json |
User credentials + Director URL |
./deckboss.json |
Project-level config (agents, memory settings) |
.env |
Local secrets (never committed) |
# Cloudflare resource bindings (auto-generated by deckboss init)
[ai]
binding = "AI" # Workers AI (LLM + embeddings)
[[vectorize]]
binding = "COGNITIVE_MODEL" # Semantic memory vectors
index_name = "deckboss-cognitive"
[[d1_databases]]
binding = "DB" # SQLite (missions, memory, graph)
database_name = "deckboss-db"
[r2_buckets]
binding = "ARTIFACTS" # Large file storage
bucket_name = "deckboss-artifacts"
[durable_objects]
bindings = [{ name = "DIRECTOR", class_name = "Director" }]
[[routes]]
pattern = "deckboss.yourdomain.com/mcp"
custom_domain = true # MCP discovery route| Store | Best For | Free Tier |
|---|---|---|
| Vectorize | Semantic search, similarity | 200K vectors |
| D1 | Structured data, BM25 text search | 5GB, 5M reads/day |
| KV | Fast reads, session cache, response caching | 1GB, 100K ops/day |
| R2 | Large blobs, exports, artifacts | 10GB |
| Agent | Specialty | Missions |
|---|---|---|
| Archivist | Code indexing, semantic search, pattern matching | index, search, audit, crossref |
| Scout | Web exploration, content extraction | fetch, summarize, extract, monitor |
| Machinist | Code execution, transformation, generation | run, transform, generate, sandbox |
| Sentry | Scheduled monitoring, health checks, cron jobs | watch, health, alert, cron |
Every agent exposes GET /.well-known/agent.json for agent-to-agent interoperability.
{
protocolVersion: "0.3.0",
name: "deckboss-scout",
description: "Web exploration agent. Fetches URLs, summarizes, extracts structured data.",
capabilities: { streaming: false },
skills: [
{ id: "fetch", name: "Fetch URL", description: "Retrieves raw content with caching" },
{ id: "summarize", name: "Summarize Content", description: "AI-generated summary" },
{ id: "extract", name: "Extract Structured Data", description: "Extracts per JSON schema" }
]
}# Scaffold from template
deckboss squadron create my-legal-reviewer --template scout --lang python
# Or manually:
cd packages/agents
cp -r template my-agent
# Edit src/index.ts: define SKILLS, HANDLERS, learning feedback
wrangler deploy
deckboss register my-agent --url https://my-agent.your-account.workers.devawait director.registerExternalAgent({
name: 'manus-bridge',
url: 'https://manus-bridge.example.com',
protocol: 'a2a'
});
// Use via standard interface
await director.queueMission({
agent: 'manus-bridge',
type: 'research_task',
payload: { query: '...' }
});DeckBoss operates as a vessel in the SuperInstance fleet ecosystem, following the commodore-protocol (Cocapn) standards.
| Integration | Protocol | Description |
|---|---|---|
| Git-Agent Standard v2.0 | Git-based | Every commit is a capability snapshot; repo IS the agent |
| I2I (Instance-to-Instance) | Message passing | Fleet inter-communication via bottle system |
| Dockside Exam | Certification checklist | Fleet readiness scoring (target: 30+/47) |
| Living Manual | Documentation | CHARTER.md, STATE.md, ABSTRACTION.md maintained per-vessel |
| Tender Protocol | Edge sync | Tender vessels visit remote agents, exchange bottles, carry diffs |
| File | Purpose | Status |
|---|---|---|
CHARTER.md |
Mission, type, origin, captain, refactoring plan | β |
STATE.md |
Current health, phase, fleet score | β |
ABSTRACTION.md |
Primary abstraction plane | β |
DOCKSIDE-EXAM.md |
Fleet certification checklist | β |
REFACTOR-NOTES.md |
Refactoring progress and TODOs | β |
βββββββββββββββ I2I Protocol ββββββββββββββββ
β DeckBoss βββββββββββββββββββββββββΆβ Fleet API β
β (vessel) β bottle system β (:8901) β
ββββββββ¬βββββββ ββββββββββββββββ
β
βββ from-fleet/ (inbound messages)
βββ for-fleet/ (outbound messages)
βββ DIARY/ (learning journal)
Current fleet score: 75/100 (active, OS vision written, awaiting build phase)
See STATE.md for live status and DOCKSIDE-EXAM.md for the full certification checklist.
deckboss/
βββ apps/
β βββ cli/
β βββ deckboss/ # Typer CLI + init/deploy commands (Python)
β βββ __init__.py # CLI entry: init, launch commands
β βββ pyproject.toml # Python package config
βββ packages/
β βββ core/
β β βββ src/
β β β βββ index.ts # Zod schemas + MCP-compliant types
β β β βββ __tests__/
β β β βββ deckboss.test.ts # Vitest test suite
β β βββ package.json # @deckboss/core npm package
β β βββ vitest.config.ts # Test configuration
β βββ director/
β βββ src/
β βββ index.ts # Director Durable Object (handmade core)
βββ .github/workflows/
β βββ ci.yml # GitHub Actions CI (Node 18/20)
βββ CHARTER.md # Fleet vessel charter
βββ STATE.md # Fleet vessel state
βββ ABSTRACTION.md # Abstraction plane documentation
βββ DOCKSIDE-EXAM.md # Fleet certification checklist
βββ Architecture0.1.md # Detailed architecture blueprint
βββ OS-VISION.md # Long-term OS vision
βββ Dev_Guide0.1.md # Developer guide (v1)
βββ Dev_Guide0.1.1.md # Developer guide (v1.1 β expanded)
βββ Dev_Guide_add.md # Developer guide supplement
βββ turbo.json # Turborepo build pipeline
βββ wrangler.toml # Cloudflare Worker config + bindings
βββ package.json # Root workspace config
βββ LICENSE # MIT License
Monorepo principle: Apps are user-facing. Packages are infrastructure. Turborepo handles the dependency graph.
git clone https://github.com/superinstance/deckboss.git
cd deckboss
npm install
# Authenticate with Cloudflare
npx wrangler login
# Build and test
npm run build
npm run test
# Development mode
npm run dev| Script | Description |
|---|---|
npm run build |
Build all packages (via Turborepo) |
npm run dev |
Start development servers |
npm run test |
Run test suites (Vitest) |
npm run lint |
Lint all packages |
npm run typecheck |
Type-check all packages |
npm run deploy |
Deploy to Cloudflare (wrangler deploy) |
# Run all tests
cd packages/core && npm test
# Local agent testing with Miniflare
cd packages/agents/scout
npx miniflare src/index.ts --modules
curl http://localhost:8787/.well-known/agent.json- Director + Memory Weaver v1 architecture
- CHARTER + DOCKSIDE-EXAM + fleet certification docs
- Core types + MCP tool definitions + Zod schemas
- CI pipeline (GitHub Actions)
- Full Director Durable Object implementation
- A2A routing to Grok / external agents
- Squadron marketplace launch
- Cognitive model graph layer
- Self-improving nightly Weaver jobs
- Public shared squadron registry
MIT. Your account. Your agents. Your data.
DeckBoss β The missing OS layer for the agent economy. Built for developers who want their AI to work while they sleep.
