@@ -11,22 +11,22 @@ How to run Gambit, the agent harness framework, locally and observe runs.
1111 - Command help: ` deno run -A src/cli.ts help <command> ` (or
1212 ` deno run -A src/cli.ts <command> -h ` ).
1313- Run once:
14- ` deno run -A src/cli.ts run <deck> [--context <json|string>] [--message <json|string>] [--model <id>] [--model-force <id>] [--trace <file>] [--state <file>] [--stream] [--responses] [--verbose] `
14+ ` deno run -A src/cli.ts run <deck> [--context <json|string>] [--message <json|string>] [--model <id>] [--model-force <id>] [--trace <file>] [--state <file>] [--stream] [--responses] [--verbose] [--worker-sandbox|--no-worker-sandbox|--legacy-exec] `
1515- Check models: ` deno run -A src/cli.ts check <deck> `
1616- REPL: ` deno run -A src/cli.ts repl <deck> ` (defaults to
1717 ` src/decks/gambit-assistant.deck.md ` in a local checkout). Streams by default
1818 and keeps state in memory for the session.
19- - Test bot (CLI):
20- ` deno run -A src/cli.ts test-bot <root-deck> --test-deck <persona-deck> [--context <json|string>] [--bot-input <json|string>] [--message <json|string>] [--max-turns <n>] [--state <file>] [--grade <grader-deck> ...] [--trace <file>] [--responses] [--verbose] `
19+ - Scenario (CLI):
20+ ` deno run -A src/cli.ts scenario <root-deck> --test-deck <persona-deck> [--context <json|string>] [--bot-input <json|string>] [--message <json|string>] [--max-turns <n>] [--state <file>] [--grade <grader-deck> ...] [--trace <file>] [--responses] [--verbose] [--worker-sandbox|--no-worker-sandbox|--legacy-exec ] `
2121- Grade (CLI):
22- ` deno run -A src/cli.ts grade <grader-deck> --state <file> [--model <id>] [--model-force <id>] [--trace <file>] [--responses] [--verbose] `
22+ ` deno run -A src/cli.ts grade <grader-deck> --state <file> [--model <id>] [--model-force <id>] [--trace <file>] [--responses] [--verbose] [--worker-sandbox|--no-worker-sandbox|--legacy-exec] `
2323- Export bundle (CLI):
2424 ` deno run -A src/cli.ts export [<deck>] --state <file> --out <bundle.tar.gz> `
2525- Debug UI: ` deno run -A src/cli.ts serve <deck> --port 8000 ` then open
2626 http://localhost:8000/ . This serves a multi-page UI:
2727
2828 - Debug (default): ` http://localhost:8000/debug `
29- - Test: ` http://localhost:8000/test-bot `
29+ - Test: ` http://localhost:8000/test `
3030 - Calibrate: ` http://localhost:8000/calibrate `
3131
3232 The WebSocket server streams turns, traces, and status updates.
@@ -46,15 +46,24 @@ How to run Gambit, the agent harness framework, locally and observe runs.
4646- ` GAMBIT_RESPONSES_MODE=1 ` : env alternative to ` --responses ` for runtime/state.
4747- ` GAMBIT_OPENROUTER_RESPONSES=1 ` : route OpenRouter calls through the Responses
4848 API (experimental; chat remains the default path).
49+ - Worker execution defaults on for deck-executing surfaces. Use
50+ ` --no-worker-sandbox ` (or ` --legacy-exec ` ) to roll back to legacy in-process
51+ execution. ` --sandbox/--no-sandbox ` still work as deprecated aliases.
52+ - ` gambit.toml ` config equivalent:
53+ ``` toml
54+ [execution ]
55+ worker_sandbox = false # same as --no-worker-sandbox
56+ # legacy_exec = true # equivalent rollback toggle
57+ ```
4958
5059## State and tracing
5160
52- - ` --state <file> ` (run/test-bot /grade/export): load/persist messages so you can
61+ - ` --state <file> ` (run/scenario /grade/export): load/persist messages so you can
5362 continue a conversation; skips ` gambit_context ` on resume. ` grade ` writes
5463 ` meta.gradingRuns ` back into the session state, while ` export ` reads the state
5564 file to build the bundle.
5665- ` --out <file> ` (export): bundle output path (tar.gz).
57- - ` --grade <grader-deck> ` (test-bot ): can be repeated; graders run in the order
66+ - ` --grade <grader-deck> ` (scenario ): can be repeated; graders run in the order
5867 provided and append results to ` meta.gradingRuns ` in the same session state
5968 file.
6069- ` --trace <file> ` writes JSONL trace events; ` --verbose ` prints trace to
@@ -91,17 +100,17 @@ How to run Gambit, the agent harness framework, locally and observe runs.
91100 ` window.gambitFormatTrace ` hook in the page; return a string or
92101 ` {role?, summary?, details?, depth?} ` to override the entry that appears in
93102 the Traces & Tools pane.
94- - The Test page reuses the same simulator runtime but drives persona/test-bot
103+ - The Test page reuses the same simulator runtime but drives persona/scenario
95104 decks so you can batch synthetic conversations, inspect per-turn scoring, and
96105 export JSONL artifacts for later ingestion. List personas by declaring
97106 ` [[testDecks]] ` entries in your root deck (for example
98107 ` gambit/examples/advanced/voice_front_desk/decks/root.deck.md ` ). Each entry’s
99108 ` path ` should point to a persona deck (Markdown or TS) that includes
100109 ` acceptsUserTurns = true ` ; the persona deck’s own ` contextSchema ` and defaults
101- power the Scenario/Test Bot form (see
110+ power the Scenario form (see
102111 ` gambit/examples/advanced/voice_front_desk/tests/new_patient_intake.deck.md ` ).
103112 Editing those deck files is how you add/remove personas now—there is no
104- ` .gambit/test-bot .md ` override.
113+ ` .gambit/scenario .md ` override.
105114- The Calibrate page is the regroup/diagnostics view for graders that run
106115 against saved Debug/Test sessions; it currently serves as a placeholder until
107116 the grading transport lands.
0 commit comments