Agents Fleet (prototype)

AI coding agents like Claude Code and Codex are powerful, but they have no built-in cost controls—one runaway session can silently burn $20–$50 with no visibility into what’s happening or when to stop. Agents Fleet gives you a local web UI to launch and monitor agent sessions and automatically stop them when they hit a token or USD budget.

Local-first “mission control” for AI coding agent CLIs (and any shell commands): launch sessions in a repo, stream live output to a web UI, stop them, and keep a persisted history.

✨ Recently Shipped

Real-time usage tracking (PR #2): Parse Claude Code status lines for accurate token/cost counting instead of estimates
Budget enforcement that actually works

This repository contains a working MVP:

pnpm workspace monorepo
React + Vite + TypeScript “Mission Control” web app
Node + Express + TypeScript server:
- SQLite persistence (data/agents_fleet.sqlite)
- session + terminal history HTTP APIs
- WebSocket streaming (/ws):
  - live PTY output for shell/CLI sessions
  - live Claude SDK chat streaming + tool events
shared TypeScript types (packages/shared)

Demo

Screenshots

New session form

Live terminal (interactive agents)

Claude Code (interactive TUI rendered via xterm.js)

OpenAI Codex (interactive)

Per-session artifacts (git diff snapshots)

Artifacts tab (changed files + diff)

Artifacts view (larger change set)

Live output vs persisted terminal history

Terminal (live)

Terminal (persisted)

Budget enforcement (auto-stop)

Token budget cutoff

USD budget cutoff

Cost estimate vs actual (CLI-reported)

SQLite persistence (debug views)

The MVP persists several tables in data/agents_fleet.sqlite:

sessions: session metadata + budgets + estimated token/cost + stop reason
pty_chunks: raw PTY stream (ANSI included) used for Terminal (persisted) replay
stdin_events: input audit trail (stored separately; not injected into replay)
session_markers: lifecycle markers like stop_requested, budget_exceeded, process_exit
session_artifacts: per-session artifacts (currently: git snapshot with changedFiles[] + combined staged/unstaged diff captured on stop/exit)

Earlier iterations used a line-based logs table. The current design persists terminal history as raw PTY chunks (pty_chunks) for xterm.js replay, which is much closer to real scrollback (especially for TUIs like Claude/Codex).

Videos

Tip: GitHub renders MP4 previews nicely in README. .mov files are ignored by default in .gitignore to avoid bloating git history.

screenshots/Agents_Fleet__Mission_Control_for_Your_Local_AI_Workers.mp4

Architecture

See ARCHITECTURE.md.

Prerequisites

Node.js 20.x
pnpm (Corepack is fine)

Setup

COREPACK_HOME="$PWD/.corepack" pnpm install

Run (dev) — one command

pnpm dev:one

On first run, this may optionally prompt you for ANTHROPIC_API_KEY and save it to .env.local (gitignored). Press Enter to skip.

This will:

install dependencies (if needed)
start apps/server + apps/web in parallel

Open: http://localhost:5173

Run (dev) — manual (two terminals)

COREPACK_HOME="$PWD/.corepack" pnpm -C apps/server dev
COREPACK_HOME="$PWD/.corepack" pnpm -C apps/web dev

Create a session

Open the web app (Vite prints the URL, typically http://localhost:5173).
Enter:
- Repo path: absolute path to a local repository (must be a directory)
- Command: any shell command to run in that repo

Example commands:

node -e "console.log('hello')"
git status
node -e "setinterval(()=>console.log('tick',Date.now()),200)"
node -e "setInterval(()=>console.log(Date.now()),200)"
claude
codex

Interactive sessions (e.g. Claude)

Claude Code / Codex (PTY)

Start a session with command claude (or codex if installed).

(Recommended) Claude Code status line for accurate budget tracking

Claude Code can run a custom status line command that receives structured JSON about the current session (context window usage, estimated cost, etc.).

For the most reliable budget tracking in Agents Fleet, configure a single-line status line that prints parse-friendly key/value pairs.

Create the script:

#!/bin/bash
input=$(cat)

CTX_IN=$(echo "$input" | jq -r '.context_window.total_input_tokens // 0')
CTX_OUT=$(echo "$input" | jq -r '.context_window.total_output_tokens // 0')
CTX_SIZE=$(echo "$input" | jq -r '.context_window.context_window_size // 0')
CTX_PCT=$(echo "$input" | jq -r '.context_window.used_percentage // 0' | cut -d. -f1)

COST=$(echo "$input" | jq -r '.cost.total_cost_usd // 0')
COST_FMT=$(printf '$%.6f' "$COST")

# Single-line, parse-friendly output:
# Use a unique prefix + delimiter to make parsing reliable even with TUI redraws.
echo "AF|ctx=${CTX_IN}/${CTX_SIZE}(${CTX_PCT}%)|in=${CTX_IN}|out=${CTX_OUT}|cost=${COST_FMT}"

Save it as ~/.claude/agents_fleet_statusline.sh and make it executable:

chmod +x ~/.claude/agents_fleet_statusline.sh

Update ~/.claude/settings.json:

{
  "statusLine": {
    "type": "command",
    "command": "~/.claude/agents_fleet_statusline.sh",
    "padding": 1,
    "refreshInterval": 1
  }
}

Notes:

Requires jq to be installed (brew install jq on macOS).
cost.total_cost_usd is an estimate computed client-side by Claude Code and may differ from your actual bill.
Type directly into the Terminal (live) pane (xterm.js).
Use Terminal (persisted) to replay and scroll through the recorded PTY output (xterm.js replay).

(Recommended) Codex status line for accurate budget tracking

Codex can also show session usage in a single-line status line. For Agents Fleet, the simplest reliable setup is to keep Codex’s built-in status line enabled and ensure it includes the usage fields below.

Update ~/.codex/config.toml:

[tui]
status_line = ["model-with-reasoning","current-dir","context-remaining","context-used","total-input-token","total-output-tokens","weekly-limit","five-hour-limit","run-state","task-progress"]
status_line_use_color = true

Make sure the output stays on one line in the Codex TUI.

Notes:

The config above matches the usage fields Agents Fleet can parse for budget tracking.
If you change the field list, keep it single-line so PTY replay remains parse-friendly.
Type directly into the Terminal (live) pane (xterm.js).
Use Terminal (persisted) to replay and scroll through the recorded PTY output (xterm.js replay).

Claude (SDK) chat (tool-calling)

Prerequisite: set ANTHROPIC_API_KEY (required). The server will reject Claude SDK requests if it’s missing.

Switch to Claude (SDK) in the UI.
Provide a repo path and chat normally.
The assistant can propose run_command tool calls; you must Approve or Reject each command.
Tool output is capped (100KB) and stored as session artifacts.

Screenshots:

Claude SDK session stopped by budget

Claude SDK tool call + output

Claude SDK tool permission gate (Approve/Reject)

Budgets (estimated)

Optional Budget USD and/or Budget tokens apply to the entire session lifetime.
Token estimation: ceil(text.length / 4).
Cost estimation:
- shell/PTY sessions use the default rates in apps/server/src/budget.ts
- Claude SDK sessions use a model-based pricing table (computeModelCostUsd) and SDK-reported usage when available.
If a budget is exceeded, the session is stopped automatically and stop_reason becomes budget_exceeded.

Note: USD cost is still an estimate unless you configure model pricing to match your account/contract.

Configure pricing via a remote API (PRICING_API_URL, must be https) or via local overrides (PRICING_JSON inline JSON / PRICING_JSON_PATH file path). See apps/server/src/pricing.ts for schema + env vars.

Stop a session

Select a running session and click Stop.
The server will attempt graceful termination first, then force-kill if needed (best-effort, cross-platform).

Per-session artifacts (git diff snapshots)

On session stop and/or exit, Agents Fleet can capture a git snapshot for the session repo and store it in SQLite.

UI: open the Artifacts tab (next to Terminal tabs) to view changed files + diff.
Storage: session_artifacts table.
Toggle: set AGENTS_FLEET_CAPTURE_GIT_ON_END=0 to disable capture.

Scripts

pnpm dev:one installs deps if needed and runs dev for all workspaces (web + server).
pnpm dev runs dev for all workspaces (web + server) in parallel.
pnpm check runs lint + typecheck + test + build.
pnpm build builds all workspaces.
pnpm typecheck runs TypeScript checks across workspaces.

Tests

COREPACK_HOME="$PWD/.corepack" pnpm -C apps/server test

Notes

If you see Corepack cache permission errors, the COREPACK_HOME="$PWD/.corepack" prefix keeps Corepack’s cache inside the repo.

Data location

SQLite DB: data/agents_fleet.sqlite (local only; do not commit).

Known limitations

PTY sessions do not preserve stdout/stderr separation.
Token/cost is an estimate unless the CLI provides actual usage.
Some TUIs (notably Claude) may clear/restore the alternate screen on exit. The persisted replay is a faithful stream replay, so end-of-session scrollback may differ from what you remember seeing just before exit.

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
.github/workflows		.github/workflows
apps		apps
data		data
packages/shared		packages/shared
screenshots		screenshots
scripts		scripts
.gitignore		.gitignore
ARCHITECTURE.md		ARCHITECTURE.md
LICENSE		LICENSE
README.md		README.md
ROADMAP.md		ROADMAP.md
package.json		package.json
pnpm-lock.yaml		pnpm-lock.yaml
pnpm-workspace.yaml		pnpm-workspace.yaml
tsconfig.base.json		tsconfig.base.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Agents Fleet (prototype)

✨ Recently Shipped

Demo

Screenshots

Videos

Architecture

Prerequisites

Setup

Run (dev) — one command

Run (dev) — manual (two terminals)

Create a session

Interactive sessions (e.g. Claude)

Claude Code / Codex (PTY)

(Recommended) Claude Code status line for accurate budget tracking

(Recommended) Codex status line for accurate budget tracking

Claude (SDK) chat (tool-calling)

Budgets (estimated)

Stop a session

Per-session artifacts (git diff snapshots)

Scripts

Tests

Notes

Data location

Known limitations

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Agents Fleet (prototype)

✨ Recently Shipped

Demo

Screenshots

Videos

Architecture

Prerequisites

Setup

Run (dev) — one command

Run (dev) — manual (two terminals)

Create a session

Interactive sessions (e.g. Claude)

Claude Code / Codex (PTY)

(Recommended) Claude Code status line for accurate budget tracking

(Recommended) Codex status line for accurate budget tracking

Claude (SDK) chat (tool-calling)

Budgets (estimated)

Stop a session

Per-session artifacts (git diff snapshots)

Scripts

Tests

Notes

Data location

Known limitations

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages