diff --git a/CHANGELOG.md b/CHANGELOG.md
index c896d03a..0fd95c69 100644
--- a/CHANGELOG.md
+++ b/CHANGELOG.md
@@ -2,6 +2,10 @@
 
 ## [Unreleased]
 
+### Added
+
+- `agent-tty batch <session-id>`: run an ordered sequence of input-and-`wait` steps against one session in a single invocation, supplied as a positional JSON array or `--file`. Each step is one verb (`type`, `paste`, `sendKeys`, `run`, or `wait`); every `wait` is anchored to a Wait Baseline (the Event Log sequence after the preceding input step) so it cannot match a stale screen the way a hand-written `run`/`wait`/`send-keys` loop can (ADR 0007). Fail-fast by default with a non-zero exit and a per-step `--json` envelope; `--keep-going` attempts every step. SIGINT/SIGTERM flushes a partial envelope (in-flight step `interrupted`, later steps `not-run`) ([#123](https://github.com/coder/agent-tty/issues/123)).
+
 ## [v0.3.0] - 2026-06-03
 
 ### Added
diff --git a/CONTEXT.md b/CONTEXT.md
index 595841dc..804b9f18 100644
--- a/CONTEXT.md
+++ b/CONTEXT.md
@@ -43,6 +43,18 @@ _Avoid_: Visual wait, snapshot wait
 A render condition where the visible text content of a **Semantic Snapshot** has remained unchanged for a requested duration.
 _Avoid_: Settled screen
 
+**Batch**:
+An ordered sequence of **Batch Steps** driven through one **Command Target** in a single `batch` invocation. It runs fail-fast: the first failed **Batch Step** stops the run unless the caller opts into continuing.
+_Avoid_: Pipeline, script, macro
+
+**Batch Step**:
+A single ordered action within a **Batch**: either one input or control action sent to the **Command Target** (text, paste, key chord, or a **Waited Run**), or one **Render Wait**.
+_Avoid_: Command, instruction
+
+**Wait Baseline**:
+The **Event Log** point a **Render Wait** must observe a **Semantic Snapshot** beyond before it can match, so the wait reflects screen state from that point onward rather than stale pre-step content.
+_Avoid_: afterSeq, sequence floor
+
 **Live Host Eligible Session**:
 A **Session** where callers should ask the live session host for fresh state.
 
@@ -219,6 +231,11 @@ _Avoid_: bare "agent", "Coder agent"
 - A **Waited Run** may produce one **Run Completion**, time out for its caller, or be interrupted by **Session** exit.
 - Caller timeout does not cancel the underlying **Run Completion**; it may still be observed later to keep internal completion bytes out of artifacts.
 - After **Session** exit, an unobserved **Run Completion** can no longer arrive.
+- A **Batch** is driven through exactly one **Command Target**, resolved once for the whole invocation.
+- A **Batch** is not atomic: input already applied to a **Session** cannot be undone, so a failed **Batch** leaves the **Session** in whatever state its completed **Batch Steps** produced.
+- A **Render Wait** that is a **Batch Step** is anchored to a **Wait Baseline** equal to the **Event Log** sequence recorded after the preceding input **Batch Step**, so it cannot match a **Semantic Snapshot** that predates that step.
+- A standalone **Render Wait** may be given an explicit **Wait Baseline**; without one it matches against the latest **Semantic Snapshot**.
+- A **Batch** stops at the first failed **Batch Step** — a timed-out **Render Wait**, or an input action against a **Session** that is no longer a **Command Target** — unless the caller opts into continuing.
 - A **Promoted Hero Demo** replaces the existing recursive README demo entirely; the old recursive bundle is deleted rather than maintained in parallel.
 - The **Hero Claim Boundary** narrows the README claim after that deletion: the outer TUI is presentation, while inner `agent-tty` artifacts are the product proof.
 - An **Exploratory Hero Demo** is the preferred **Hero Demo** scenario because it shows the coding-agent TUI discovering the `agent-tty` skill and CLI before producing inner `agent-tty` proof artifacts.
@@ -284,3 +301,4 @@ _Avoid_: bare "agent", "Coder agent"
 - "helper proof" was used during design discussion, but the canonical scenario is now **Exploratory Hero Demo**: success criteria and output paths are fixed, while the coding agent chooses the command flow inside a configurable fixed review window.
 - "demo" and "proof" are not interchangeable for coding-agent recordings: a **Hero Demo** optimizes for stable presentation, while a **Recursive Dogfood Proof** optimizes for self-dogfood coverage.
 - "agent" is overloaded across four referents: this project's **Triage Agent** (a Claude Code instance), Coder's **Coder workspace agent** (the SSH/exec daemon), a generic AFK implementation agent (the actor on `ready-for-agent` issues — Phase 2 of the triage pipeline), and — in **Session Dashboard** product copy only — the external client driving a **Session** (often an AI coding agent). The last sense is deliberately **not** a domain term: the **Session Dashboard** and **Live View** are defined over **Sessions**, not agents, and the **Event Log** does not record which client sent input. Do not make the dashboard agent-aware (grouping or filtering by agent identity) without first extending the domain model. Always qualify in code comments and docs.
+- "batch" is overloaded: a **Batch** (an ordered **Batch Step** sequence driven through one **Command Target** by the `batch` command) is unrelated to a **Triage Batch** (the set of issues processed by one **AFK Triage** invocation). They live in different subsystems; always rely on the qualifier.
diff --git a/README.md b/README.md
index f0e5cf2a..bf63fc40 100644
--- a/README.md
+++ b/README.md
@@ -111,7 +111,7 @@ Full reproducer, transcripts, and proof bundles are in [`dogfood/agent-uses-agen
 
 ## Command surface
 
-Every user-facing command takes `--json` and returns a stable, machine-readable envelope. The commands cover the session lifecycle (`create`, `list`, `inspect`, `destroy`, `gc`), input and control (`run`, `type`, `paste`, `send-keys`, `resize`, `signal`, `mark`), observation and capture (`wait`, `snapshot`, `screenshot`, `record export`), the live `dashboard`, and environment checks (`version`, `doctor`, `skills`).
+Every user-facing command takes `--json` and returns a stable, machine-readable envelope. The commands cover the session lifecycle (`create`, `list`, `inspect`, `destroy`, `gc`), input and control (`run`, `type`, `paste`, `send-keys`, `batch`, `resize`, `signal`, `mark`), observation and capture (`wait`, `snapshot`, `screenshot`, `record export`), the live `dashboard`, and environment checks (`version`, `doctor`, `skills`).
 
 See [`docs/USAGE.md`](./docs/USAGE.md) for the full flag reference and [`docs/TROUBLESHOOTING.md`](./docs/TROUBLESHOOTING.md) for renderer and environment issues.
 
diff --git a/docs/USAGE.md b/docs/USAGE.md
index 519e6625..ae2c3d5f 100644
--- a/docs/USAGE.md
+++ b/docs/USAGE.md
@@ -104,6 +104,80 @@ Useful flags:
 - `--exit`: wait for the process to exit.
 - `--timeout <ms>`: maximum wait time in milliseconds, with `0` meaning infinite.
 
+## `batch`
+
+Use `batch` to run an ordered sequence of input-and-`wait` steps against one session in a single invocation, instead of coordinating separate `run`/`type`/`paste`/`send-keys`/`wait` calls. Each `wait` step is anchored to a Wait Baseline — it only considers screen state produced _after_ the preceding input step — so a batch cannot race ahead and match a stale screen the way a hand-written shell loop can.
+
+```bash
+agent-tty batch <session-id> '[steps]' --json
+agent-tty batch <session-id> --file ./steps.json --json
+agent-tty batch <session-id> '[steps]' --keep-going --json
+```
+
+Steps are a JSON array; each step is exactly one verb. The shape mirrors the rest of the CLI:
+
+```json
+[
+  { "run": "nvim --clean", "noWait": true },
+  { "wait": { "screenStableMs": 1000 } },
+  { "sendKeys": ["i"] },
+  { "type": "hello" },
+  { "sendKeys": ["Escape"] },
+  { "type": ":wq" },
+  { "sendKeys": ["Enter"] },
+  { "wait": { "text": "written" } }
+]
+```
+
+- `type` / `paste`: a string of literal text.
+- `sendKeys`: a non-empty array of key names — individual named keys or single characters (e.g. `["Enter"]`, `["Ctrl+C"]`, `["Escape", "Enter"]`). Multi-character literal text such as `:wq` is not a key name; send it with a `type` step.
+- `run`: a command string, with optional `noWait` (fire-and-forget) and `timeout` (ms). A `run` step is a waited run by default.
+- `wait`: the same conditions as the `wait` command — `text`, `regex`, `screenStableMs`, `cursorRow`, `cursorCol`, and `timeout` (ms).
+
+Input source and flags:
+
+- A positional `[steps]` JSON array **xor** `--file <path>` — supply exactly one. Passing both, or neither, is an `INVALID_INPUT` error.
+- `--keep-going`: attempt every step regardless of failures. By default a batch is **fail-fast** — the first failed step (a timed-out `wait`, or input to a session that is no longer commandable) stops the run, and the remaining steps are recorded `not-run`. A batch is not atomic: already-applied input cannot be undone.
+- `--json`: emit a machine-readable command envelope.
+
+The `--json` result is a per-step envelope:
+
+```json
+{
+  "ok": true,
+  "command": "batch",
+  "result": {
+    "steps": [
+      {
+        "index": 0,
+        "kind": "run",
+        "status": "completed",
+        "seq": 4,
+        "noWait": true,
+        "runOutcome": "started",
+        "durationMs": 12
+      },
+      {
+        "index": 1,
+        "kind": "wait",
+        "status": "completed",
+        "waitBaseline": 4,
+        "matched": true,
+        "timedOut": false,
+        "capturedAtSeq": 9,
+        "durationMs": 1003
+      }
+    ],
+    "completedCount": 2,
+    "failedIndices": []
+  }
+}
+```
+
+Each step record carries its `index`, `kind`, `status` (`completed` | `failed` | `not-run` | `interrupted`), and `durationMs`. Input steps report the Event Log `seq` they produced; `wait` steps report the `waitBaseline` they were anchored to plus `matched` / `timedOut` / `matchedText` / `capturedAtSeq`. `completedCount` and `failedIndices` summarize the run. A fail-fast batch exits non-zero with the failed step's exit code (e.g. `11` for a `WAIT_TIMEOUT`); `--keep-going` exits `1` if any step failed. If the process is interrupted by SIGINT/SIGTERM, batch flushes the same envelope with the in-flight step marked `interrupted` and later steps `not-run`, then exits non-zero.
+
+The Wait Baseline fixes stale-match only. It does **not** fix echo-match: a `wait` can still match the terminal's echo of a just-typed command (the echo renders _after_ the baseline). Use a distinctive output token or a `screenStableMs` wait rather than waiting for text you just typed. Interrupting a batch mid-`wait` leaves that wait's command still running on the session (the wait is abandoned, not cancelled), exactly like a caller timeout on `run`.
+
 ## Screenshots And Recording Exports
 
 Screenshots and WebM export use the `ghostty-web` reference renderer through Playwright/Chromium.
diff --git a/docs/adr/0007-render-wait-baseline.md b/docs/adr/0007-render-wait-baseline.md
new file mode 100644
index 00000000..cc910e1c
--- /dev/null
+++ b/docs/adr/0007-render-wait-baseline.md
@@ -0,0 +1,63 @@
+---
+status: accepted
+---
+
+# Render waits accept an optional Wait Baseline
+
+## Context
+
+The `batch` command runs an ordered sequence of **Batch Steps** — input actions
+and **Render Waits** — through one **Command Target** with no human pacing
+between them. A **Render Wait** today (`waitForRender` in
+`src/host/hostMain.ts`) polls the renderer every 200 ms and matches against the
+**latest** **Semantic Snapshot**; `WaitForRenderParams`
+(`src/protocol/schemas.ts`) carries text/regex/screenStableMs/cursor/timeout and
+nothing about event-log position.
+
+That is fine for a human invoking `wait` once, but unsafe for steps that run
+back-to-back. A wait step can match the screen left by the _previous_ step
+before the current step has rendered (stale-match), and a `screenStableMs` wait
+can declare that _old_ screen "stable" before the new input even appears. The
+batch then advances on a false premise and sends later keystrokes into the wrong
+state. This is the property that separates `batch` from a hand-written shell
+loop, so it has to be correct.
+
+## Decision
+
+A **Render Wait** accepts an optional **Wait Baseline**: an **Event Log**
+sequence (`afterSeq`) it must observe a **Semantic Snapshot** _strictly beyond_
+before it may match or accrue **Screen Stability**. The `batch` executor sets
+each wait step's baseline to the **Event Log** sequence recorded after the
+preceding input **Batch Step**, so a wait only ever reflects state at or after
+its own step. The standalone `wait --after-seq <n>` exposes the same gate, since
+`snapshot` and `wait` already return a `capturedAtSeq` callers can chain.
+
+- `afterSeq` is added to `WaitForRenderParams`; the host poll and the offline
+  replay matcher reject any snapshot whose `capturedAtSeq` is not strictly
+  greater than the baseline.
+- With no baseline a **Render Wait** behaves exactly as before (matches the
+  latest snapshot), so the change is backward compatible.
+
+## Consequences
+
+- `batch` is meaningfully safer than scripting the existing commands in a loop:
+  each wait is anchored to its own step rather than racing the previous step's
+  screen.
+- The **Wait Baseline** fixes **stale-match** only. It does **not** fix
+  _echo-match_ — a `wait --text "foo"` matching the terminal's echo of a
+  just-typed `foo`, which renders _after_ the baseline. Echo-match stays the
+  caller's concern (use a distinctive output token or `screenStableMs`), exactly
+  as with the `wait` command today.
+- A small amount of protocol and matcher surface grows (one optional field plus
+  a `capturedAtSeq > afterSeq` gate in the live poll and the offline matcher).
+  Offline replay can only apply the floor against the single latest snapshot it
+  reconstructs.
+
+## Alternatives considered
+
+- **Require the visible text to change from a pre-step capture.** Rejected:
+  heuristic rather than exact, never matches a step that legitimately reproduces
+  identical text, and does not use the canonical **Event Log**.
+- **No baseline in v1 (match the latest screen, document the foot-gun).**
+  Rejected: it leaves `batch` only marginally safer than a shell loop, and the
+  stale-match failure is silent and order-dependent.
diff --git a/docs/prd/batch-command/PRD.md b/docs/prd/batch-command/PRD.md
new file mode 100644
index 00000000..dc2c0dd5
--- /dev/null
+++ b/docs/prd/batch-command/PRD.md
@@ -0,0 +1,84 @@
+# PRD: `batch` command
+
+## Problem Statement
+
+Driving a terminal through `agent-tty` today means one CLI invocation per action: `run`, then `wait`, then `send-keys`, then `wait` again, each a separate process. For an AI coding agent — or a human — scripting a multi-step interaction with a **Session**, that has two recurring hazards:
+
+- There is no safe way to express "do this, then wait for _its_ effect, then do the next thing" as one unit. A **Render Wait** issued right after input can match the screen left by the _previous_ action before the new one has rendered, so the script races ahead and sends later keystrokes into the wrong screen.
+- The per-action ceremony is verbose, and every caller has to re-implement the same failure handling (stop when a wait times out) and the same sequencing by hand in a shell loop.
+
+## Solution
+
+A new `batch` command runs an ordered sequence of **Batch Steps** against one **Command Target** in a single invocation. Each **Batch Step** is exactly one action — `type`, `paste`, `send-keys`, `run`, or a `wait` (**Render Wait**). The caller supplies the steps as a JSON array, either inline or from a file.
+
+Every `wait` step is anchored to a **Wait Baseline**: it only considers screen state produced _after_ the preceding input step, so a **Batch** is meaningfully safer than a hand-written loop — it cannot match a stale pre-step screen. A **Batch** is fail-fast by default: the first failed **Batch Step** (a timed-out **Render Wait**, or input to a **Session** that is no longer a **Command Target**) stops the run, so later steps never fire against an unexpected screen.
+
+## User Stories
+
+1. As an AI coding agent, I want to run an ordered sequence of terminal actions in one `batch` call, so that I don't coordinate many separate CLI invocations.
+2. As an AI coding agent, I want each `wait` step to observe only the screen produced after the preceding input step, so that my batch never matches a stale screen and races ahead.
+3. As an AI coding agent, I want to drive an interactive TUI — open it, wait for it to settle, send key chords, wait for a label — in a single batch, so that I can reproduce a TUI workflow deterministically.
+4. As a human automating a shell setup, I want to chain `run` and `wait` steps in one command, so that I can express a setup sequence without a sleep-and-grep shell loop.
+5. As a caller, I want a batch to stop at the first failed step by default, so that later keystrokes don't land in an unexpected screen state.
+6. As a caller, I want a `--keep-going` option, so that I can run a best-effort batch that attempts every step regardless of failures.
+7. As a caller, I want the result to tell me exactly which steps completed and which one failed, so that I can diagnose where the interaction broke, since a batch is not atomic and already-sent input cannot be undone.
+8. As a caller, I want to supply steps either as an inline positional JSON string or via `--file`, so that I can choose between quick one-offs and reusable step files.
+9. As a caller, I want a clear validation error when I pass both inline steps and `--file`, or neither, so that ambiguous input fails fast with a helpful message.
+10. As a caller, I want each step to be a single action with one verb, so that the step format is unambiguous and mirrors the rest of the CLI.
+11. As a caller, I want `wait` steps to reuse the exact conditions of the `wait` command (text, regex, screen stability, cursor, timeout), so that I don't learn a second wait vocabulary.
+12. As a caller using `--json`, I want a stable machine-readable envelope with per-step outcomes, so that I can program against the batch result.
+13. As a caller, I want a non-zero exit code when a batch fails fast, so that scripts and agents can detect failure without parsing output.
+14. As an agent, I want a `run` step to behave as a **Waited Run** by default and to support a no-wait option, so that command-completion semantics match the standalone `run`.
+15. As an agent, I want to set a per-`wait`-step timeout, so that different steps can wait for different durations.
+16. As a developer of agent-tty, I want the standalone `wait` command to also accept an explicit **Wait Baseline**, so that the primitive is reusable outside batch (for example, chaining a wait after a captured sequence from `snapshot`).
+17. As a caller, I want a batch to target one **Command Target** resolved once at the start, so that the whole sequence applies to a single consistent **Session**.
+18. As a caller, I want a batch to stop if the target **Session** exits or becomes non-commandable mid-sequence, so that I never send input to a dead session.
+19. As a caller, I want the echo-match limitation documented, so that I know a `wait` can still match the echo of a just-typed command and write distinctive waits or use screen stability instead.
+20. As an agent, I want batch to work uniformly whether I am driving a shell or a full-screen TUI, so that one mechanism covers both.
+21. As a maintainer, I want the batch orchestration logic to be testable without a real PTY or renderer, so that ordering, baseline threading, and fail-fast are covered by fast, isolated tests.
+
+## Implementation Decisions
+
+- A new `batch <sessionId>` command. Steps are supplied as a positional JSON string XOR a `--file <path>` (mutually exclusive — a validation error if both or neither are given), mirroring the existing input-source convention. No stdin source in v1.
+- A **Batch Step** is a tagged union with exactly one verb key: `type`, `paste`, `send-keys` (a list of key names), `run` (a **Waited Run** that supports a no-wait option), or `wait` (a **Render Wait** carrying the standard conditions: text, regex, screen-stable duration, cursor row/col, timeout). `send-keys` and `run` are modeled distinctly rather than as uniform text steps.
+- **Wait Baseline**: the **Render Wait** parameters gain an optional event-log sequence floor. The live host poll and the offline replay matcher reject any **Semantic Snapshot** whose captured sequence is not strictly greater than the baseline. The batch executor records the **Event Log** sequence after each input step and passes it as the next `wait` step's baseline. The standalone `wait` command also exposes this baseline as a flag. This decision is recorded in ADR 0007. It fixes stale-match; it does not fix echo-match.
+- The executor runs **client-side**: `batch` orchestrates the existing per-step input and **Render Wait** operations and threads baselines between them. Input results return their **Event Log** sequence so the executor can anchor the following wait; `run`, `send-keys`, and `mark` already return one, and the `type` and `paste` results gain it.
+- **Fail-fast** by default: the first failed **Batch Step** stops the run and yields a non-zero exit. `--keep-going` attempts every step regardless. A **Batch** is not atomic; the result reports which steps completed. This default deliberately diverges from agent-browser's continue-by-default, because terminal steps are stateful and dependent.
+- **Deep modules**: a **Batch Plan** parser (JSON to validated steps; pure) and a **Batch executor** (ordered execution, baseline threading, fail-fast, and result accumulation, driven through an injected step-driver interface so it runs without a real PTY or renderer).
+- **Result envelope** (`--json`): a per-step array recording each step's index, kind, the input sequence or wait baseline, the wait outcome (matched, timed-out, matched text), and duration; plus an overall completed-step count and the failed-step index when fail-fast triggers. The **Command Target** is resolved once for the whole invocation.
+
+## Testing Decisions
+
+Good tests assert external behavior, not implementation details.
+
+- **Batch executor (unit).** Drive the executor with a fake step-driver and assert ordering, **Wait Baseline** threading (each wait receives the prior input step's sequence), fail-fast versus `--keep-going`, and result accumulation — with no PTY or renderer. The current render-wait matcher unit tests are prior art for pure-logic coverage.
+- **Batch Plan parser (unit).** Assert the tagged-union step kinds parse, the positional-XOR-file rule is enforced, and malformed or empty plans are rejected with clear errors.
+- **Wait Baseline gate (unit).** Assert the render-wait matcher never matches a **Semantic Snapshot** at or below the baseline and can match one strictly above it, covering both the live and offline paths. This guards the ADR-0007 invariant.
+- **Batch CLI (integration).** Against an isolated `AGENT_TTY_HOME` with a real **Session**: a multi-step plan, the fail-fast exit code, and the `--json` envelope shape. The existing CLI integration tests are prior art.
+- **Batch end-to-end.** Drive a real TUI (for example `nvim --clean`) through a batch and assert the rendered result. The existing fixture-driven e2e flows are prior art.
+
+## Out of Scope
+
+- Capture steps inside a batch (taking a snapshot or screenshot, or exporting a recording, as a step). v1 batch is input plus wait; capture stays a separate command the caller runs around the batch.
+- A stdin source for steps.
+- Fixing echo-match (a `wait` matching the terminal's echo of a just-typed command). The **Wait Baseline** fixes stale-match only; echo-match stays the caller's responsibility — use a distinctive output token or a screen-stability wait — exactly as with the `wait` command today.
+- A host-side batch RPC or single-round-trip execution. v1 executes client-side; a host-side executor is a possible later optimization.
+- Inline waits attached to input steps, and any control flow (conditionals, loops, retries). v1 is a flat, linear sequence of single-action steps.
+- Atomic or transactional rollback. Already-sent input cannot be undone.
+
+## Further Notes
+
+- The `batch` verb matches the closest analog, vercel-labs/agent-browser, which uses `batch` and keeps `wait` a separate step. agent-tty's fail-fast default is the deliberate divergence.
+- The domain terms **Batch**, **Batch Step**, and **Wait Baseline** are defined in the project glossary, and the **Wait Baseline** decision is ADR 0007. The glossary terms, ADR 0007, and this PRD are on branch `feat/batch-command`.
+- A small illustrative plan (shape only):
+
+  ```json
+  [
+    { "run": "nvim --clean", "noWait": true },
+    { "wait": { "screenStableMs": 1000 } },
+    { "sendKeys": ["i"] },
+    { "type": "hello" },
+    { "sendKeys": ["Escape", ":wq", "Enter"] },
+    { "wait": { "text": "written" } }
+  ]
+  ```
diff --git a/skill-data/agent-tty/SKILL.md b/skill-data/agent-tty/SKILL.md
index cb1cee0b..d090d61b 100644
--- a/skill-data/agent-tty/SKILL.md
+++ b/skill-data/agent-tty/SKILL.md
@@ -49,6 +49,7 @@ agent-tty --home <path> run <session-id> 'command here' --json
 agent-tty --home <path> type <session-id> 'literal text' --json
 agent-tty --home <path> paste <session-id> 'multiline payload' --json
 agent-tty --home <path> send-keys <session-id> Enter Ctrl+C --json
+agent-tty --home <path> batch <session-id> '[{"run":"htop","noWait":true},{"wait":{"screenStableMs":1000}}]' --json
 
 # Observation and proof
 agent-tty --home <path> wait <session-id> --text 'ready' --json
@@ -71,15 +72,22 @@ agent-tty --home "$AGENT_HOME" snapshot "$SESSION_ID" --format text --json
 
 ### Drive an interactive CLI or TUI
 
+Use `batch` to run an ordered sequence of input-and-`wait` steps in one call instead of separate `run`/`wait`/`send-keys` invocations. Each `wait` step is anchored to a Wait Baseline — it only observes screen state produced _after_ the preceding input step, so the sequence cannot race ahead and match a stale screen. A batch stops at the first failed step by default (`--keep-going` attempts every step).
+
 ```bash
 AGENT_HOME="$(mktemp -d)"
 SESSION_ID=$(agent-tty --home "$AGENT_HOME" create --json -- /bin/bash | jq -r '.result.sessionId')
-agent-tty --home "$AGENT_HOME" run "$SESSION_ID" '<interactive-command>' --no-wait --json
-agent-tty --home "$AGENT_HOME" wait "$SESSION_ID" --screen-stable-ms 1000 --json
-agent-tty --home "$AGENT_HOME" send-keys "$SESSION_ID" Down Down Enter --json
+agent-tty --home "$AGENT_HOME" batch "$SESSION_ID" '[
+  { "run": "<interactive-command>", "noWait": true },
+  { "wait": { "screenStableMs": 1000 } },
+  { "sendKeys": ["Down", "Down", "Enter"] },
+  { "wait": { "text": "<expected-label>" } }
+]' --json
 agent-tty --home "$AGENT_HOME" screenshot "$SESSION_ID" --json
 ```
 
+A `wait` can still match the _echo_ of a just-typed command, so use a distinctive output token or `screenStableMs` rather than waiting for text you just typed.
+
 ### Export reviewer-facing artifacts
 
 ```bash
diff --git a/src/batch/executor.ts b/src/batch/executor.ts
new file mode 100644
index 00000000..d6c9155d
--- /dev/null
+++ b/src/batch/executor.ts
@@ -0,0 +1,332 @@
+import type { BatchPlan, BatchStep } from './plan.js';
+import type {
+  BatchResult,
+  BatchStepRecord,
+  RunStepRecord,
+  WaitStepRecord,
+} from './result.js';
+import type { StepDriver } from './stepDriver.js';
+
+import { CliError } from '../cli/errors.js';
+import { ERROR_CODES, makeCliError } from '../protocol/errors.js';
+import { unreachable } from '../util/assert.js';
+import { unreachedStepRecord } from './result.js';
+
+export interface ExecuteBatchOptions {
+  plan: BatchPlan;
+  driver: StepDriver;
+  keepGoing: boolean;
+  // Re-check commandability around each Render Wait (rejecting with a CliError)
+  // so a Session that dies mid-Batch fails the wait rather than racing it.
+  assertCommandable?: () => Promise<void>;
+  onStep?: (record: BatchStepRecord) => void;
+}
+
+function toBatchStepError(error: CliError): { code: string; message: string } {
+  return { code: error.code, message: error.message };
+}
+
+// Reframe a thrown non-CliError as INTERNAL_ERROR so it is recorded against the
+// step rather than escaping and discarding the per-step BatchResult.
+function reframeAsInternalError(error: unknown, index: number): CliError {
+  return makeCliError(ERROR_CODES.INTERNAL_ERROR, {
+    message: `Batch step ${String(index)} failed with an unexpected error.`,
+    details: {
+      stepIndex: index,
+      reason: error instanceof Error ? error.message : String(error),
+    },
+    cause: error,
+  });
+}
+
+function classifyRunOutcome(run: {
+  noWait: boolean;
+  completed?: boolean;
+  timedOut?: boolean;
+}): RunStepRecord['runOutcome'] {
+  if (run.noWait) {
+    return 'started';
+  }
+  if (run.completed === true) {
+    return 'completed';
+  }
+  if (run.timedOut === true) {
+    return 'timedOut';
+  }
+  // A Waited Run that neither completed nor timed out was interrupted by
+  // Session exit (the Run Completion can no longer arrive).
+  return 'sessionExited';
+}
+
+/**
+ * Run an ordered Batch through the injected StepDriver, threading each input
+ * step's Wait Baseline into the following Render Wait. Pure over the driver;
+ * fail-fast unless `keepGoing`, after which later steps are recorded `not-run`.
+ */
+export async function executeBatch(
+  opts: ExecuteBatchOptions,
+): Promise<BatchResult> {
+  const { plan, driver, keepGoing, assertCommandable, onStep } = opts;
+
+  const steps: BatchStepRecord[] = [];
+  const failedIndices: number[] = [];
+  let completedCount = 0;
+  let lastInputSeq: number | undefined;
+  let stopped = false;
+
+  const finalize = (record: BatchStepRecord): void => {
+    steps.push(record);
+    if (record.status === 'completed') {
+      completedCount += 1;
+    } else if (record.status === 'failed') {
+      failedIndices.push(record.index);
+    }
+    onStep?.(record);
+  };
+
+  for (let index = 0; index < plan.steps.length; index += 1) {
+    const step = plan.steps[index];
+    if (step === undefined) {
+      continue;
+    }
+
+    if (stopped) {
+      finalize(notRunRecord(step, index));
+      continue;
+    }
+
+    const startedAt = Date.now();
+    let record: BatchStepRecord;
+    try {
+      record = await runStep(
+        step,
+        index,
+        driver,
+        lastInputSeq,
+        startedAt,
+        assertCommandable,
+      );
+    } catch (error) {
+      const cliError =
+        error instanceof CliError
+          ? error
+          : reframeAsInternalError(error, index);
+      record = failedRecord(step, index, Date.now() - startedAt, cliError);
+    }
+
+    // A non-wait step that produced a seq advances the Wait Baseline, even a
+    // failed Waited Run (it still injected its command), so a following wait
+    // cannot stale-match the pre-run screen under --keep-going.
+    if (record.kind !== 'wait' && record.seq !== undefined) {
+      lastInputSeq = record.seq;
+    }
+
+    finalize(record);
+
+    if (record.status === 'failed' && !keepGoing) {
+      stopped = true;
+    }
+  }
+
+  return { steps, completedCount, failedIndices };
+}
+
+async function runStep(
+  step: BatchStep,
+  index: number,
+  driver: StepDriver,
+  lastInputSeq: number | undefined,
+  startedAt: number,
+  assertCommandable: (() => Promise<void>) | undefined,
+): Promise<BatchStepRecord> {
+  switch (step.kind) {
+    case 'type': {
+      const seq = await driver.type(step.text);
+      return {
+        index,
+        durationMs: Date.now() - startedAt,
+        kind: 'type',
+        status: 'completed',
+        seq,
+      };
+    }
+    case 'paste': {
+      const seq = await driver.paste(step.text);
+      return {
+        index,
+        durationMs: Date.now() - startedAt,
+        kind: 'paste',
+        status: 'completed',
+        seq,
+      };
+    }
+    case 'sendKeys': {
+      const seq = await driver.sendKeys(step.keys);
+      return {
+        index,
+        durationMs: Date.now() - startedAt,
+        kind: 'sendKeys',
+        status: 'completed',
+        seq,
+      };
+    }
+    case 'run':
+      return runRunStep(step, index, driver, startedAt);
+    case 'wait':
+      return runWaitStep(
+        step,
+        index,
+        driver,
+        lastInputSeq,
+        startedAt,
+        assertCommandable,
+      );
+    default:
+      return unreachable(step, `batch step kind dispatch at index ${index}`);
+  }
+}
+
+async function runRunStep(
+  step: Extract<BatchStep, { kind: 'run' }>,
+  index: number,
+  driver: StepDriver,
+  startedAt: number,
+): Promise<RunStepRecord> {
+  const result = await driver.run(step.command, step.noWait, step.timeoutMs);
+  const runOutcome = classifyRunOutcome({
+    noWait: step.noWait,
+    ...(result.completed === undefined ? {} : { completed: result.completed }),
+    ...(result.timedOut === undefined ? {} : { timedOut: result.timedOut }),
+  });
+
+  const failed = !step.noWait && result.completed !== true;
+  const base: Omit<RunStepRecord, 'error'> = {
+    index,
+    durationMs: Date.now() - startedAt,
+    kind: 'run',
+    status: failed ? 'failed' : 'completed',
+    seq: result.seq,
+    noWait: step.noWait,
+    ...(result.completed === undefined ? {} : { completed: result.completed }),
+    ...(result.timedOut === undefined ? {} : { timedOut: result.timedOut }),
+    runOutcome,
+  };
+  if (!failed) {
+    return base;
+  }
+  const code =
+    runOutcome === 'timedOut'
+      ? ERROR_CODES.WAIT_TIMEOUT
+      : ERROR_CODES.SESSION_NOT_RUNNING;
+  return {
+    ...base,
+    error: toBatchStepError(
+      makeCliError(code, {
+        message:
+          runOutcome === 'timedOut'
+            ? `Waited Run at step ${String(index)} timed out before completing.`
+            : `Waited Run at step ${String(index)} was interrupted by Session exit before completing.`,
+      }),
+    ),
+  };
+}
+
+async function runWaitStep(
+  step: Extract<BatchStep, { kind: 'wait' }>,
+  index: number,
+  driver: StepDriver,
+  lastInputSeq: number | undefined,
+  startedAt: number,
+  assertCommandable: (() => Promise<void>) | undefined,
+): Promise<WaitStepRecord> {
+  await assertCommandable?.();
+
+  const result = await driver.wait(
+    step.condition,
+    lastInputSeq,
+    step.timeoutMs,
+  );
+
+  const baseline =
+    lastInputSeq === undefined ? {} : { waitBaseline: lastInputSeq };
+  const matchedText =
+    result.matchedText === undefined ? {} : { matchedText: result.matchedText };
+  const observations = {
+    matched: result.matched,
+    timedOut: result.timedOut,
+    ...matchedText,
+    capturedAtSeq: result.capturedAtSeq,
+  };
+
+  // A timed-out wait (equivalently an unmatched result) is not a thrown error
+  // from the driver, so classify it here as a failed step with its own code.
+  if (result.timedOut || !result.matched) {
+    return {
+      index,
+      durationMs: Date.now() - startedAt,
+      kind: 'wait',
+      status: 'failed',
+      ...baseline,
+      ...observations,
+      error: toBatchStepError(
+        makeCliError(ERROR_CODES.WAIT_TIMEOUT, {
+          message: `Render wait at step ${String(index)} timed out before its condition was met.`,
+        }),
+      ),
+    };
+  }
+
+  // Re-check after the match; if the Session died in that window, keep the
+  // observations on the failed record rather than emitting a bare error.
+  try {
+    await assertCommandable?.();
+  } catch (error) {
+    if (error instanceof CliError) {
+      return {
+        index,
+        durationMs: Date.now() - startedAt,
+        kind: 'wait',
+        status: 'failed',
+        ...baseline,
+        ...observations,
+        error: toBatchStepError(error),
+      };
+    }
+    throw error;
+  }
+
+  return {
+    index,
+    durationMs: Date.now() - startedAt,
+    kind: 'wait',
+    status: 'completed',
+    ...baseline,
+    ...observations,
+  };
+}
+
+function failedRecord(
+  step: BatchStep,
+  index: number,
+  durationMs: number,
+  error: CliError,
+): BatchStepRecord {
+  const base = { index, durationMs, status: 'failed' as const };
+  const stepError = toBatchStepError(error);
+  switch (step.kind) {
+    case 'run':
+      return { ...base, kind: 'run', noWait: step.noWait, error: stepError };
+    case 'wait':
+      return { ...base, kind: 'wait', error: stepError };
+    case 'type':
+    case 'paste':
+    case 'sendKeys':
+      return { ...base, kind: step.kind, error: stepError };
+    default:
+      return unreachable(step, `batch failed-step kind at index ${index}`);
+  }
+}
+
+function notRunRecord(step: BatchStep, index: number): BatchStepRecord {
+  return unreachedStepRecord(step, index, 'not-run');
+}
diff --git a/src/batch/plan.ts b/src/batch/plan.ts
new file mode 100644
index 00000000..30000644
--- /dev/null
+++ b/src/batch/plan.ts
@@ -0,0 +1,225 @@
+import { z } from 'zod';
+
+import type { PreparedRenderWaitCondition } from '../renderWait/matcher.js';
+
+import { assertValidKeyName } from '../pty/keyEncoder.js';
+import { ERROR_CODES, makeCliError } from '../protocol/errors.js';
+import { prepareRenderWaitCondition } from '../renderWait/matcher.js';
+import { invariant, unreachable } from '../util/assert.js';
+
+export type BatchStep =
+  | { kind: 'type'; text: string }
+  | { kind: 'paste'; text: string }
+  | { kind: 'sendKeys'; keys: string[] }
+  | {
+      kind: 'run';
+      command: string;
+      noWait: boolean;
+      timeoutMs: number | undefined;
+    }
+  | {
+      kind: 'wait';
+      condition: PreparedRenderWaitCondition;
+      timeoutMs: number | undefined;
+    };
+
+export interface BatchPlan {
+  steps: BatchStep[];
+}
+
+const VERB_KEYS = ['type', 'paste', 'sendKeys', 'run', 'wait'] as const;
+
+type VerbKey = (typeof VERB_KEYS)[number];
+
+// Default for an omitted `wait` step timeout, matching the `wait` command; an
+// explicit `timeout: 0` still means infinite. Keeps an unattended Batch from
+// hanging forever on a wait whose condition never appears.
+const DEFAULT_WAIT_TIMEOUT_MS = 600_000;
+
+const TypeStepSchema = z.object({ type: z.string().min(1) }).strict();
+const PasteStepSchema = z.object({ paste: z.string().min(1) }).strict();
+const SendKeysStepSchema = z
+  .object({ sendKeys: z.array(z.string().min(1)).min(1) })
+  .strict();
+const RunStepSchema = z
+  .object({
+    run: z.string().min(1),
+    noWait: z.boolean().optional(),
+    timeout: z.number().int().positive().optional(),
+  })
+  .strict();
+const WaitStepSchema = z
+  .object({
+    wait: z
+      .object({
+        text: z.string().optional(),
+        regex: z.string().optional(),
+        screenStableMs: z.number().int().positive().optional(),
+        cursorRow: z.number().int().nonnegative().optional(),
+        cursorCol: z.number().int().nonnegative().optional(),
+        timeout: z.number().int().nonnegative().optional(),
+      })
+      .strict(),
+  })
+  .strict();
+
+function invalidInput(message: string, stepIndex?: number): never {
+  throw makeCliError(ERROR_CODES.INVALID_INPUT, {
+    message,
+    ...(stepIndex === undefined ? {} : { details: { stepIndex } }),
+  });
+}
+
+function isPlainObject(value: unknown): value is Record<string, unknown> {
+  return typeof value === 'object' && value !== null && !Array.isArray(value);
+}
+
+function presentVerbKeys(step: Record<string, unknown>): VerbKey[] {
+  return VERB_KEYS.filter((key) => Object.hasOwn(step, key));
+}
+
+function parseStep(rawStep: unknown, index: number): BatchStep {
+  if (!isPlainObject(rawStep)) {
+    return invalidInput(
+      `Batch step ${String(index)} must be a JSON object`,
+      index,
+    );
+  }
+
+  const verbs = presentVerbKeys(rawStep);
+  if (verbs.length === 0) {
+    return invalidInput(
+      `Batch step ${String(index)} must have exactly one of type|paste|sendKeys|run|wait; found none`,
+      index,
+    );
+  }
+  if (verbs.length > 1) {
+    return invalidInput(
+      `Batch step ${String(index)} must have exactly one of type|paste|sendKeys|run|wait; found ${verbs.join(', ')}`,
+      index,
+    );
+  }
+
+  const verb = verbs[0];
+  invariant(verb !== undefined, 'exactly one verb key is present');
+
+  switch (verb) {
+    case 'type':
+      return parseTypeStep(rawStep, index);
+    case 'paste':
+      return parsePasteStep(rawStep, index);
+    case 'sendKeys':
+      return parseSendKeysStep(rawStep, index);
+    case 'run':
+      return parseRunStep(rawStep, index);
+    case 'wait':
+      return parseWaitStep(rawStep, index);
+    default:
+      return unreachable(verb, `Batch step ${String(index)} verb dispatch`);
+  }
+}
+
+function unwrapStep<T>(
+  schema: z.ZodType<T>,
+  rawStep: unknown,
+  index: number,
+): T {
+  const result = schema.safeParse(rawStep);
+  if (!result.success) {
+    throw makeCliError(ERROR_CODES.INVALID_INPUT, {
+      message: `Batch step ${String(index)} is invalid`,
+      details: { stepIndex: index, issues: result.error.issues },
+    });
+  }
+  return result.data;
+}
+
+function parseTypeStep(
+  rawStep: Record<string, unknown>,
+  index: number,
+): BatchStep {
+  const data = unwrapStep(TypeStepSchema, rawStep, index);
+  return { kind: 'type', text: data.type };
+}
+
+function parsePasteStep(
+  rawStep: Record<string, unknown>,
+  index: number,
+): BatchStep {
+  const data = unwrapStep(PasteStepSchema, rawStep, index);
+  return { kind: 'paste', text: data.paste };
+}
+
+function parseSendKeysStep(
+  rawStep: Record<string, unknown>,
+  index: number,
+): BatchStep {
+  const data = unwrapStep(SendKeysStepSchema, rawStep, index);
+
+  const keys = data.sendKeys;
+  for (const key of keys) {
+    assertValidKeyName(key);
+  }
+  return { kind: 'sendKeys', keys };
+}
+
+function parseRunStep(
+  rawStep: Record<string, unknown>,
+  index: number,
+): BatchStep {
+  const { run, noWait, timeout } = unwrapStep(RunStepSchema, rawStep, index);
+  return {
+    kind: 'run',
+    command: run,
+    noWait: noWait ?? false,
+    timeoutMs: timeout,
+  };
+}
+
+function parseWaitStep(
+  rawStep: Record<string, unknown>,
+  index: number,
+): BatchStep {
+  const { wait } = unwrapStep(WaitStepSchema, rawStep, index);
+  const condition = prepareRenderWaitCondition({
+    text: wait.text,
+    regex: wait.regex,
+    screenStableMs: wait.screenStableMs,
+    cursorRow: wait.cursorRow,
+    cursorCol: wait.cursorCol,
+  });
+
+  // Absent -> finite default; explicit 0 -> infinite.
+  const timeoutMs =
+    wait.timeout === undefined
+      ? DEFAULT_WAIT_TIMEOUT_MS
+      : wait.timeout === 0
+        ? undefined
+        : wait.timeout;
+  return { kind: 'wait', condition, timeoutMs };
+}
+
+/**
+ * Parse a JSON string into a validated Batch Plan. Pure: no fs or rpc. Every
+ * failure throws a CliError so the whole plan is rejected before any input is
+ * sent (a Batch is not atomic).
+ */
+export function parseBatchPlan(raw: string): BatchPlan {
+  let parsed: unknown;
+  try {
+    parsed = JSON.parse(raw);
+  } catch {
+    return invalidInput('Batch steps must be valid JSON.');
+  }
+
+  if (!Array.isArray(parsed)) {
+    return invalidInput('Batch steps must be a JSON array.');
+  }
+
+  if (parsed.length === 0) {
+    return invalidInput('Batch must contain at least one step.');
+  }
+
+  const steps = parsed.map((rawStep, index) => parseStep(rawStep, index));
+  return { steps };
+}
diff --git a/src/batch/result.ts b/src/batch/result.ts
new file mode 100644
index 00000000..d8522877
--- /dev/null
+++ b/src/batch/result.ts
@@ -0,0 +1,139 @@
+import { z } from 'zod';
+
+import type { BatchPlan, BatchStep } from './plan.js';
+
+import { unreachable } from '../util/assert.js';
+
+// `interrupted` is the in-flight step abandoned by a SIGINT/SIGTERM flush (its
+// RPC is not awaited, so its outcome is unknown); `not-run` is a later step the
+// executor never reached. Only the CLI signal handler produces `interrupted`.
+export const StepStatusSchema = z.enum([
+  'completed',
+  'failed',
+  'not-run',
+  'interrupted',
+]);
+export type StepStatus = z.infer<typeof StepStatusSchema>;
+
+export const BatchStepErrorSchema = z
+  .object({
+    code: z.string(),
+    message: z.string(),
+  })
+  .strict();
+export type BatchStepError = z.infer<typeof BatchStepErrorSchema>;
+
+const NonNegativeIntSchema = z.number().int().nonnegative();
+
+export const InputStepRecordSchema = z
+  .object({
+    index: NonNegativeIntSchema,
+    durationMs: NonNegativeIntSchema,
+    kind: z.enum(['type', 'paste', 'sendKeys']),
+    status: StepStatusSchema,
+    seq: NonNegativeIntSchema.optional(),
+    error: BatchStepErrorSchema.optional(),
+  })
+  .strict();
+export type InputStepRecord = z.infer<typeof InputStepRecordSchema>;
+
+export const RunStepRecordSchema = z
+  .object({
+    index: NonNegativeIntSchema,
+    durationMs: NonNegativeIntSchema,
+    kind: z.literal('run'),
+    status: StepStatusSchema,
+    seq: NonNegativeIntSchema.optional(),
+    noWait: z.boolean(),
+    completed: z.boolean().optional(),
+    timedOut: z.boolean().optional(),
+    runOutcome: z
+      .enum(['completed', 'timedOut', 'sessionExited', 'started'])
+      .optional(),
+    error: BatchStepErrorSchema.optional(),
+  })
+  .strict();
+export type RunStepRecord = z.infer<typeof RunStepRecordSchema>;
+
+export const WaitStepRecordSchema = z
+  .object({
+    index: NonNegativeIntSchema,
+    durationMs: NonNegativeIntSchema,
+    kind: z.literal('wait'),
+    status: StepStatusSchema,
+    waitBaseline: NonNegativeIntSchema.optional(),
+    matched: z.boolean().optional(),
+    timedOut: z.boolean().optional(),
+    matchedText: z.string().optional(),
+    capturedAtSeq: NonNegativeIntSchema.optional(),
+    error: BatchStepErrorSchema.optional(),
+  })
+  .strict();
+export type WaitStepRecord = z.infer<typeof WaitStepRecordSchema>;
+
+export const BatchStepRecordSchema = z.discriminatedUnion('kind', [
+  InputStepRecordSchema,
+  RunStepRecordSchema,
+  WaitStepRecordSchema,
+]);
+export type BatchStepRecord = z.infer<typeof BatchStepRecordSchema>;
+
+export const BatchResultSchema = z
+  .object({
+    steps: z.array(BatchStepRecordSchema),
+    completedCount: NonNegativeIntSchema,
+    failedIndices: z.array(NonNegativeIntSchema),
+  })
+  .strict();
+export type BatchResult = z.infer<typeof BatchResultSchema>;
+
+// Shape a record for a Batch Step that never finalized (`not-run` or
+// `interrupted`): no seq, zero duration, no observed outcome.
+export function unreachedStepRecord(
+  step: BatchStep,
+  index: number,
+  status: 'not-run' | 'interrupted',
+): BatchStepRecord {
+  const base = { index, durationMs: 0, status };
+  switch (step.kind) {
+    case 'run':
+      return { ...base, kind: 'run', noWait: step.noWait };
+    case 'wait':
+      return { ...base, kind: 'wait' };
+    case 'type':
+    case 'paste':
+    case 'sendKeys':
+      return { ...base, kind: step.kind };
+    default:
+      return unreachable(step, `batch unreached-step kind at index ${index}`);
+  }
+}
+
+/**
+ * Build a partial BatchResult from the finalized records plus the plan: the
+ * first unreached step (in flight when the signal arrived) is `interrupted`,
+ * the rest `not-run`. Lets the signal handler flush without awaiting the RPC.
+ */
+export function buildPartialBatchResult(
+  plan: BatchPlan,
+  recorded: readonly BatchStepRecord[],
+): BatchResult {
+  const steps = [...recorded];
+  for (let index = recorded.length; index < plan.steps.length; index += 1) {
+    const step = plan.steps[index];
+    if (step === undefined) {
+      continue;
+    }
+    const status = index === recorded.length ? 'interrupted' : 'not-run';
+    steps.push(unreachedStepRecord(step, index, status));
+  }
+
+  return {
+    steps,
+    completedCount: steps.filter((record) => record.status === 'completed')
+      .length,
+    failedIndices: steps
+      .filter((record) => record.status === 'failed')
+      .map((record) => record.index),
+  };
+}
diff --git a/src/batch/stepDriver.ts b/src/batch/stepDriver.ts
new file mode 100644
index 00000000..335ab25c
--- /dev/null
+++ b/src/batch/stepDriver.ts
@@ -0,0 +1,148 @@
+import type { z } from 'zod';
+
+import type { RunResult, WaitForRenderResult } from '../protocol/messages.js';
+import type { PreparedRenderWaitCondition } from '../renderWait/matcher.js';
+
+import { sendRpc } from '../host/rpcClient.js';
+import {
+  PasteResultSchema,
+  RunResultSchema,
+  SendKeysResultSchema,
+  TypeResultSchema,
+  WaitForRenderResultSchema,
+} from '../protocol/messages.js';
+import { parseValidatedResult } from '../protocol/validation.js';
+import { invariant } from '../util/assert.js';
+
+/**
+ * The seam the Batch executor drives, injected so it runs without a real PTY or
+ * renderer. Input verbs resolve the Event Log seq they produced; `wait` takes a
+ * prior seq back as `afterSeq` (the Wait Baseline).
+ */
+export interface StepDriver {
+  type(text: string): Promise<number>;
+  paste(text: string): Promise<number>;
+  sendKeys(keys: string[]): Promise<number>;
+  run(
+    command: string,
+    noWait: boolean,
+    timeoutMs: number | undefined,
+  ): Promise<RunResult>;
+  wait(
+    condition: PreparedRenderWaitCondition,
+    afterSeq: number | undefined,
+    timeoutMs: number | undefined,
+  ): Promise<WaitForRenderResult>;
+}
+
+const DEFAULT_RUN_TIMEOUT_MS = 30_000;
+const NO_WAIT_RUN_TRANSPORT_TIMEOUT_MS = 10_000;
+const RUN_TRANSPORT_PADDING_MS = 10_000;
+const WAIT_TRANSPORT_PADDING_MS = 5_000;
+
+function parseOrThrow<T>(
+  schema: z.ZodType<T>,
+  raw: unknown,
+  method: string,
+): T {
+  return parseValidatedResult(
+    schema,
+    raw,
+    `Unexpected response shape from the session host for "${method}".`,
+  );
+}
+
+// Build the strict `waitForRender` params: omit undefined optionals, drop the
+// prepared `compiledRegex`, and take `afterSeq` from the Wait Baseline argument.
+function buildWaitParams(
+  condition: PreparedRenderWaitCondition,
+  afterSeq: number | undefined,
+  timeoutMs: number | undefined,
+  rendererName: string | undefined,
+): Record<string, unknown> {
+  return {
+    ...(condition.text === undefined ? {} : { text: condition.text }),
+    ...(condition.regex === undefined ? {} : { regex: condition.regex }),
+    ...(condition.screenStableMs === undefined
+      ? {}
+      : { screenStableMs: condition.screenStableMs }),
+    ...(condition.cursorRow === undefined
+      ? {}
+      : { cursorRow: condition.cursorRow }),
+    ...(condition.cursorCol === undefined
+      ? {}
+      : { cursorCol: condition.cursorCol }),
+    ...(afterSeq === undefined ? {} : { afterSeq }),
+    ...(timeoutMs === undefined ? {} : { timeoutMs }),
+    ...(rendererName === undefined ? {} : { rendererName }),
+  };
+}
+
+/**
+ * The production StepDriver: each verb is one RPC, with transport timeouts that
+ * pad the host's own deadline. Unlike `runWaitCommand` it does NOT fall back to
+ * offline replay — a dead host mid-Batch is a failed step, so the CliError
+ * propagates for the executor to classify.
+ */
+export function createRpcStepDriver(
+  socketPath: string,
+  rendererName?: string,
+): StepDriver {
+  invariant(socketPath.length > 0, 'socketPath must be a non-empty string');
+
+  return {
+    async type(text: string): Promise<number> {
+      const raw = await sendRpc(socketPath, 'type', { text });
+      return parseOrThrow(TypeResultSchema, raw, 'type').seq;
+    },
+
+    async paste(text: string): Promise<number> {
+      const raw = await sendRpc(socketPath, 'paste', { text });
+      return parseOrThrow(PasteResultSchema, raw, 'paste').seq;
+    },
+
+    async sendKeys(keys: string[]): Promise<number> {
+      const raw = await sendRpc(socketPath, 'sendKeys', { keys });
+      return parseOrThrow(SendKeysResultSchema, raw, 'sendKeys').seq;
+    },
+
+    async run(
+      command: string,
+      noWait: boolean,
+      timeoutMs: number | undefined,
+    ): Promise<RunResult> {
+      const params: Record<string, unknown> = { command, noWait };
+      if (!noWait && timeoutMs !== undefined) {
+        params.timeoutMs = timeoutMs;
+      }
+      const transportTimeoutMs = noWait
+        ? NO_WAIT_RUN_TRANSPORT_TIMEOUT_MS
+        : (timeoutMs ?? DEFAULT_RUN_TIMEOUT_MS) + RUN_TRANSPORT_PADDING_MS;
+      const raw = await sendRpc(socketPath, 'run', params, transportTimeoutMs);
+      return parseOrThrow(RunResultSchema, raw, 'run');
+    },
+
+    async wait(
+      condition: PreparedRenderWaitCondition,
+      afterSeq: number | undefined,
+      timeoutMs: number | undefined,
+    ): Promise<WaitForRenderResult> {
+      const params = buildWaitParams(
+        condition,
+        afterSeq,
+        timeoutMs,
+        rendererName,
+      );
+      // timeoutMs undefined means an infinite wait (0 -> infinite transport).
+      const transportTimeoutMs =
+        timeoutMs === undefined ? 0 : timeoutMs + WAIT_TRANSPORT_PADDING_MS;
+      const raw = await sendRpc(
+        socketPath,
+        'waitForRender',
+        params,
+        transportTimeoutMs,
+      );
+      return parseOrThrow(WaitForRenderResultSchema, raw, 'waitForRender');
+    },
+  };
+}
diff --git a/src/cli/commands/batch.ts b/src/cli/commands/batch.ts
new file mode 100644
index 00000000..97e67279
--- /dev/null
+++ b/src/cli/commands/batch.ts
@@ -0,0 +1,341 @@
+import assert from 'node:assert/strict';
+import { lstat, readFile, stat } from 'node:fs/promises';
+import { constants as osConstants } from 'node:os';
+import process from 'node:process';
+
+import type { CommandContext } from '../context.js';
+import type { BatchPlan } from '../../batch/plan.js';
+import type { BatchResult, BatchStepRecord } from '../../batch/result.js';
+
+import { resolveCommandTarget } from '../commandTarget.js';
+import { exitCodeForError } from '../exitCodes.js';
+import { emitSuccess, emitSuccessSync } from '../output.js';
+import { executeBatch } from '../../batch/executor.js';
+import { parseBatchPlan } from '../../batch/plan.js';
+import { buildPartialBatchResult } from '../../batch/result.js';
+import { createRpcStepDriver } from '../../batch/stepDriver.js';
+import { ERROR_CODES, makeCliError } from '../../protocol/errors.js';
+import { MAX_INPUT_FILE_SIZE } from './inputSource.js';
+
+const KEEP_GOING_FAILURE_EXIT_CODE = 1;
+
+const INTERRUPT_SIGNALS = ['SIGINT', 'SIGTERM'] as const;
+type InterruptSignal = (typeof INTERRUPT_SIGNALS)[number];
+
+// Conventional 128 + signal-number exit code (SIGINT -> 130, SIGTERM -> 143).
+function signalExitCode(signal: InterruptSignal): number {
+  const signo = osConstants.signals[signal];
+  return typeof signo === 'number' ? 128 + signo : KEEP_GOING_FAILURE_EXIT_CODE;
+}
+
+interface CommandOptions {
+  context: CommandContext;
+  json: boolean;
+  sessionId: string;
+  steps: string | undefined;
+  file?: string;
+  keepGoing: boolean;
+}
+
+const BATCH_USAGE =
+  'Usage: agent-tty batch <session-id> [steps] [--file <path>]';
+
+function invalidInput(
+  message: string,
+  details?: Record<string, unknown>,
+  cause?: unknown,
+): ReturnType<typeof makeCliError> {
+  return makeCliError(ERROR_CODES.INVALID_INPUT, {
+    message,
+    ...(details === undefined ? {} : { details }),
+    ...(cause === undefined ? {} : { cause }),
+  });
+}
+
+function isErrnoException(error: unknown): error is NodeJS.ErrnoException {
+  return error instanceof Error && 'code' in error;
+}
+
+function inputFileError(
+  filePath: string,
+  error: unknown,
+): ReturnType<typeof makeCliError> {
+  if (isErrnoException(error) && error.code === 'ENOENT') {
+    return invalidInput(
+      `Steps file "${filePath}" was not found.`,
+      { file: filePath },
+      error,
+    );
+  }
+  if (
+    isErrnoException(error) &&
+    ['EACCES', 'EPERM'].includes(error.code ?? '')
+  ) {
+    return invalidInput(
+      `Steps file "${filePath}" is not readable.`,
+      { file: filePath },
+      error,
+    );
+  }
+  return invalidInput(
+    `Failed to read steps file "${filePath}".`,
+    { file: filePath },
+    error,
+  );
+}
+
+// Resolve the raw JSON step source from the positional argument XOR --file,
+// with the same file safety as resolveCommandInputText (regular file, 10 MB cap).
+async function resolveBatchSource(options: CommandOptions): Promise<string> {
+  if (options.steps !== undefined && options.file !== undefined) {
+    throw invalidInput(
+      `Positional [steps] argument and --file are mutually exclusive. ${BATCH_USAGE}`,
+      { steps: options.steps, file: options.file },
+    );
+  }
+
+  if (options.steps === undefined && options.file === undefined) {
+    throw invalidInput(
+      `Missing steps. Provide either a positional [steps] JSON array or --file <path>. ${BATCH_USAGE}`,
+    );
+  }
+
+  if (options.steps !== undefined) {
+    if (options.steps.length === 0) {
+      throw invalidInput('Steps must not be empty.');
+    }
+    return options.steps;
+  }
+
+  const filePath = options.file;
+  assert(typeof filePath === 'string', '--file must resolve to a string path');
+  assert(filePath.length > 0, '--file path must be a non-empty string');
+
+  let fileStats: Awaited<ReturnType<typeof lstat>>;
+  try {
+    fileStats = await lstat(filePath);
+  } catch (error: unknown) {
+    throw inputFileError(filePath, error);
+  }
+
+  if (!fileStats.isFile()) {
+    throw invalidInput(
+      `Steps file "${filePath}" must be a regular file. Directories, symlinks, and device files are not supported.`,
+      { file: filePath },
+    );
+  }
+
+  const contentStats = await stat(filePath).catch((error: unknown) => {
+    throw inputFileError(filePath, error);
+  });
+  if (contentStats.size > MAX_INPUT_FILE_SIZE) {
+    throw invalidInput(
+      `Steps file "${filePath}" exceeds the 10 MB limit for --file input.`,
+      {
+        file: filePath,
+        sizeBytes: contentStats.size,
+        maxSizeBytes: MAX_INPUT_FILE_SIZE,
+      },
+    );
+  }
+
+  let content: string;
+  try {
+    content = await readFile(filePath, 'utf8');
+  } catch (error: unknown) {
+    throw inputFileError(filePath, error);
+  }
+
+  if (content.length === 0) {
+    throw invalidInput(`Steps file "${filePath}" must not be empty.`, {
+      file: filePath,
+    });
+  }
+
+  return content;
+}
+
+function stepOutcome(record: BatchStepRecord): string {
+  if (record.status === 'not-run') {
+    return 'not-run';
+  }
+  if (record.status === 'failed') {
+    return record.error === undefined
+      ? 'failed'
+      : `failed (${record.error.code})`;
+  }
+  if (record.status === 'interrupted') {
+    return 'interrupted';
+  }
+  switch (record.kind) {
+    case 'wait':
+      return record.matchedText === undefined
+        ? 'matched'
+        : `matched "${record.matchedText}"`;
+    case 'run':
+      return record.runOutcome ?? 'completed';
+    case 'type':
+    case 'paste':
+    case 'sendKeys':
+      return 'completed';
+  }
+}
+
+function firstFailedStep(result: BatchResult): BatchStepRecord | undefined {
+  const firstFailedIndex = result.failedIndices[0];
+  if (firstFailedIndex === undefined) {
+    return undefined;
+  }
+  return result.steps.find((record) => record.index === firstFailedIndex);
+}
+
+function stepFailureReason(record: BatchStepRecord): string {
+  if (record.status !== 'failed' || record.error === undefined) {
+    return 'failed';
+  }
+  return `${record.error.code}: ${record.error.message}`;
+}
+
+export function buildBatchLines(
+  result: BatchResult,
+  keepGoing = false,
+): string[] {
+  const lines = result.steps.map(
+    (record) =>
+      `[${String(record.index)}] ${record.kind} ${stepOutcome(record)} (${String(record.durationMs)}ms)`,
+  );
+  lines.push(
+    `${String(result.completedCount)}/${String(result.steps.length)} steps completed`,
+  );
+
+  if (result.failedIndices.length > 0) {
+    if (keepGoing) {
+      lines.push(`failed steps: ${result.failedIndices.join(', ')}`);
+    } else {
+      const failed = firstFailedStep(result);
+      const index = result.failedIndices[0];
+      lines.push(
+        `failed at step ${String(index)}: ${failed === undefined ? 'failed' : stepFailureReason(failed)}`,
+      );
+    }
+  }
+
+  return lines;
+}
+
+interface InterruptFlushOptions {
+  plan: BatchPlan;
+  json: boolean;
+  keepGoing: boolean;
+}
+
+/**
+ * Run the executor under SIGINT/SIGTERM handlers that flush a partial envelope
+ * (in-flight step `interrupted`, later steps `not-run`) and exit. The in-flight
+ * RPC is NOT awaited, so an interrupted Waited Run keeps running on the Session
+ * — like caller-timeout (CONTEXT.md); already-applied input cannot be undone.
+ */
+async function executeWithInterruptFlush(
+  options: InterruptFlushOptions,
+  run: (onStep: (record: BatchStepRecord) => void) => Promise<BatchResult>,
+): Promise<BatchResult> {
+  const records: BatchStepRecord[] = [];
+  let flushed = false;
+
+  const handleSignal = (signal: InterruptSignal): void => {
+    // A second signal during the flush must not re-enter and double-write.
+    if (flushed) {
+      return;
+    }
+    flushed = true;
+
+    const partial = buildPartialBatchResult(options.plan, records);
+    // Sync flush: an async write would be truncated by the process.exit below
+    // for a partial envelope larger than the OS pipe buffer.
+    emitSuccessSync({
+      command: 'batch',
+      json: options.json,
+      result: partial,
+      lines: buildBatchLines(partial, options.keepGoing),
+    });
+    process.exit(signalExitCode(signal));
+  };
+
+  const handlers = INTERRUPT_SIGNALS.map((signal) => {
+    const handler = (): void => {
+      handleSignal(signal);
+    };
+    process.on(signal, handler);
+    return { signal, handler } as const;
+  });
+
+  try {
+    return await run((record) => {
+      records.push(record);
+    });
+  } finally {
+    for (const { signal, handler } of handlers) {
+      process.off(signal, handler);
+    }
+  }
+}
+
+export async function runBatchCommand(options: CommandOptions): Promise<void> {
+  const raw = await resolveBatchSource(options);
+
+  // Parse before resolving the target: a malformed plan fails fast without a
+  // live Session, and nothing is sent until the whole plan validates.
+  const plan = parseBatchPlan(raw);
+
+  const target = await resolveCommandTarget({
+    home: options.context.home,
+    sessionId: options.sessionId,
+  });
+
+  const driver = createRpcStepDriver(
+    target.socketPath,
+    options.context.rendererDefault,
+  );
+
+  // Re-resolve the target around each Render Wait (fresh manifest read) so a
+  // Session that dies mid-Batch fails the wait rather than racing a dead one.
+  const assertCommandable = async (): Promise<void> => {
+    await resolveCommandTarget({
+      home: options.context.home,
+      sessionId: options.sessionId,
+    });
+  };
+
+  const result = await executeWithInterruptFlush(
+    { plan, json: options.json, keepGoing: options.keepGoing },
+    (onStep) =>
+      executeBatch({
+        plan,
+        driver,
+        keepGoing: options.keepGoing,
+        assertCommandable,
+        onStep,
+      }),
+  );
+
+  // Set process.exitCode then always emitSuccess (doctor pattern), so a failed
+  // Batch still emits the full per-step envelope instead of a bare error.
+  if (result.failedIndices.length > 0) {
+    if (options.keepGoing) {
+      process.exitCode = KEEP_GOING_FAILURE_EXIT_CODE;
+    } else {
+      const failed = firstFailedStep(result);
+      process.exitCode =
+        failed?.status === 'failed' && failed.error !== undefined
+          ? exitCodeForError(failed.error.code)
+          : KEEP_GOING_FAILURE_EXIT_CODE;
+    }
+  }
+
+  emitSuccess({
+    command: 'batch',
+    json: options.json,
+    result,
+    lines: buildBatchLines(result, options.keepGoing),
+  });
+}
diff --git a/src/cli/commands/paste.ts b/src/cli/commands/paste.ts
index 97a2ce03..935fff92 100644
--- a/src/cli/commands/paste.ts
+++ b/src/cli/commands/paste.ts
@@ -1,13 +1,14 @@
 import type { CommandContext } from '../context.js';
+import type { PasteResult } from '../../protocol/messages.js';
 
 import { resolveCommandTarget } from '../commandTarget.js';
 import { emitSuccess } from '../output.js';
 import { sendRpc } from '../../host/rpcClient.js';
+import { PasteResultSchema } from '../../protocol/messages.js';
+import { ERROR_CODES, makeCliError } from '../../protocol/errors.js';
 import { resolveCommandInputText } from './inputSource.js';
 
-export interface PasteResult {
-  [key: string]: never;
-}
+export type { PasteResult } from '../../protocol/messages.js';
 
 interface CommandOptions {
   context: CommandContext;
@@ -29,15 +30,22 @@ export async function runPasteCommand(options: CommandOptions): Promise<void> {
     sessionId: options.sessionId,
   });
 
-  await sendRpc(target.socketPath, 'paste', {
+  const rawResult: unknown = await sendRpc(target.socketPath, 'paste', {
     text,
   });
+  const parsedResult = PasteResultSchema.safeParse(rawResult);
+  if (!parsedResult.success) {
+    throw makeCliError(ERROR_CODES.PROTOCOL_ERROR, {
+      message: 'Unexpected response from host',
+      details: { issues: parsedResult.error.issues },
+    });
+  }
 
-  const result: PasteResult = {};
+  const result: PasteResult = { seq: parsedResult.data.seq };
   emitSuccess({
     command: 'paste',
     json: options.json,
     result,
-    lines: ['Pasted text into session.'],
+    lines: [`Pasted text into session at seq ${String(result.seq)}.`],
   });
 }
diff --git a/src/cli/commands/type.ts b/src/cli/commands/type.ts
index 8b8492c5..83f20540 100644
--- a/src/cli/commands/type.ts
+++ b/src/cli/commands/type.ts
@@ -1,13 +1,14 @@
 import type { CommandContext } from '../context.js';
+import type { TypeResult } from '../../protocol/messages.js';
 
 import { resolveCommandTarget } from '../commandTarget.js';
 import { emitSuccess } from '../output.js';
 import { sendRpc } from '../../host/rpcClient.js';
+import { TypeResultSchema } from '../../protocol/messages.js';
+import { ERROR_CODES, makeCliError } from '../../protocol/errors.js';
 import { resolveCommandInputText } from './inputSource.js';
 
-export interface TypeResult {
-  [key: string]: never;
-}
+export type { TypeResult } from '../../protocol/messages.js';
 
 interface CommandOptions {
   context: CommandContext;
@@ -32,15 +33,22 @@ export async function runTypeCommand(options: CommandOptions): Promise<void> {
     sessionId: options.sessionId,
   });
 
-  await sendRpc(target.socketPath, 'type', {
+  const rawResult: unknown = await sendRpc(target.socketPath, 'type', {
     text,
   });
+  const parsedResult = TypeResultSchema.safeParse(rawResult);
+  if (!parsedResult.success) {
+    throw makeCliError(ERROR_CODES.PROTOCOL_ERROR, {
+      message: 'Unexpected response from host',
+      details: { issues: parsedResult.error.issues },
+    });
+  }
 
-  const result: TypeResult = {};
+  const result: TypeResult = { seq: parsedResult.data.seq };
   emitSuccess({
     command: 'type',
     json: options.json,
     result,
-    lines: ['Typed text into session.'],
+    lines: [`Typed text into session at seq ${String(result.seq)}.`],
   });
 }
diff --git a/src/cli/commands/wait.ts b/src/cli/commands/wait.ts
index 9bef44d5..0df5b761 100644
--- a/src/cli/commands/wait.ts
+++ b/src/cli/commands/wait.ts
@@ -41,6 +41,7 @@ interface CommandOptions {
   screenStableMs: number | undefined;
   cursorRow: number | undefined;
   cursorCol: number | undefined;
+  afterSeq: number | undefined;
 }
 
 const DEFAULT_WAIT_TIMEOUT_MS = 600_000;
@@ -59,7 +60,8 @@ function isRenderWaitMode(options: CommandOptions): boolean {
     options.regex !== undefined ||
     options.screenStableMs !== undefined ||
     options.cursorRow !== undefined ||
-    options.cursorCol !== undefined
+    options.cursorCol !== undefined ||
+    options.afterSeq !== undefined
   );
 }
 
@@ -105,7 +107,7 @@ function buildOfflineRenderWaitResult(
 ): WaitForRenderResult {
   const match = matchRenderWaitSnapshot(preparedCondition, snapshot);
 
-  if (match.contentAndCursorMatched) {
+  if (match.contentAndCursorMatched && match.baselineMatched) {
     return {
       matched: match.matched,
       timedOut: false,
@@ -127,6 +129,7 @@ function buildOfflineRenderWaitResult(
       screenStableMs: preparedCondition.screenStableMs,
       expectedCursorRow: preparedCondition.cursorRow,
       expectedCursorCol: preparedCondition.cursorCol,
+      afterSeq: preparedCondition.afterSeq,
       capturedAtSeq: match.capturedAtSeq,
       cursorRow: match.cursorRow,
       cursorCol: match.cursorCol,
@@ -212,6 +215,16 @@ export async function runWaitCommand(options: CommandOptions): Promise<void> {
       });
     }
 
+    if (
+      options.afterSeq !== undefined &&
+      !isNonNegativeInteger(options.afterSeq)
+    ) {
+      throw makeCliError(ERROR_CODES.INVALID_INPUT, {
+        message: '--after-seq must be a non-negative integer.',
+        details: { afterSeq: options.afterSeq },
+      });
+    }
+
     if (
       options.timeout !== undefined &&
       options.timeout !== 0 &&
@@ -231,6 +244,7 @@ export async function runWaitCommand(options: CommandOptions): Promise<void> {
       screenStableMs: options.screenStableMs,
       cursorRow: options.cursorRow,
       cursorCol: options.cursorCol,
+      afterSeq: options.afterSeq,
     });
 
     const effectiveTimeout = options.timeout ?? DEFAULT_WAIT_TIMEOUT_MS;
@@ -240,6 +254,7 @@ export async function runWaitCommand(options: CommandOptions): Promise<void> {
       screenStableMs: options.screenStableMs,
       cursorRow: options.cursorRow,
       cursorCol: options.cursorCol,
+      afterSeq: options.afterSeq,
       timeoutMs: effectiveTimeout === 0 ? undefined : effectiveTimeout,
       rendererName: options.context.rendererDefault,
     };
diff --git a/src/cli/exitCodes.ts b/src/cli/exitCodes.ts
index 5c762361..eb0e7425 100644
--- a/src/cli/exitCodes.ts
+++ b/src/cli/exitCodes.ts
@@ -20,6 +20,7 @@ const EXIT_CODE_BY_ERROR_CODE: Readonly<Record<string, number>> = Object.freeze(
     [ERROR_CODES.PROTOCOL_ERROR]: 9,
     [ERROR_CODES.RPC_ERROR]: 9,
     [ERROR_CODES.REPLAY_ERROR]: 10,
+    [ERROR_CODES.WAIT_TIMEOUT]: 11,
     [ERROR_CODES.INTERNAL_ERROR]: 1,
   },
 );
diff --git a/src/cli/main.ts b/src/cli/main.ts
index 217d8ef2..adbc67c6 100644
--- a/src/cli/main.ts
+++ b/src/cli/main.ts
@@ -6,6 +6,7 @@ import { Command, CommanderError } from 'commander';
 
 import type { CommandContext } from './context.js';
 
+import { runBatchCommand } from './commands/batch.js';
 import { runCreateCommand } from './commands/create.js';
 import { runDashboardCommand } from './commands/dashboard.js';
 import { runDestroyCommand } from './commands/destroy.js';
@@ -492,6 +493,35 @@ async function main(): Promise<void> {
       ),
     );
 
+  program
+    .command('batch <session-id> [steps]')
+    .description(
+      'Run an ordered sequence of input-and-wait steps in one command',
+    )
+    .option('--file <path>', 'Read the JSON step array from a file')
+    .option('--keep-going', 'Attempt every step regardless of failures', false)
+    .option('--json', 'Emit a JSON command envelope', false)
+    .action(
+      wrapAction(
+        'batch',
+        async (
+          sessionId: string,
+          steps: string | undefined,
+          options: { file?: string; keepGoing: boolean; json: boolean },
+          context: CommandContext,
+        ) => {
+          await runBatchCommand({
+            context,
+            json: options.json,
+            sessionId,
+            steps,
+            ...(options.file !== undefined ? { file: options.file } : {}),
+            keepGoing: options.keepGoing,
+          });
+        },
+      ),
+    );
+
   program
     .command('mark <session-id> <label>')
     .description('Add a marker to a session')
@@ -776,6 +806,11 @@ async function main(): Promise<void> {
       'Wait for cursor column in rendered output (0-based)',
       parseNumberOption,
     )
+    .option(
+      '--after-seq <n>',
+      'Only match renderer snapshots produced after this Event Log sequence',
+      parseIntegerOption,
+    )
     .action(
       wrapAction(
         'wait',
@@ -791,6 +826,7 @@ async function main(): Promise<void> {
             screenStableMs?: number;
             cursorRow?: number;
             cursorCol?: number;
+            afterSeq?: number;
           },
           context: CommandContext,
         ) => {
@@ -806,6 +842,7 @@ async function main(): Promise<void> {
             screenStableMs: options.screenStableMs,
             cursorRow: options.cursorRow,
             cursorCol: options.cursorCol,
+            afterSeq: options.afterSeq,
           });
         },
       ),
diff --git a/src/cli/output.ts b/src/cli/output.ts
index e2fae9dd..07bdda1d 100644
--- a/src/cli/output.ts
+++ b/src/cli/output.ts
@@ -1,3 +1,4 @@
+import { writeSync } from 'node:fs';
 import process from 'node:process';
 
 import type { CommandEnvelope, CommandError } from '../protocol/envelope.js';
@@ -13,6 +14,8 @@ const ANSI_ESCAPE_PATTERN = new RegExp(
 
 let colorEnabled = true;
 
+const STDOUT_FD = 1;
+
 function stripAnsi(value: string): string {
   return value.replaceAll(ANSI_ESCAPE_PATTERN, '');
 }
@@ -35,18 +38,53 @@ export function writeHumanLines(lines: readonly string[]): void {
   process.stdout.write(`${formatHumanText(lines.join('\n'))}\n`);
 }
 
-export function emitSuccess(options: {
+interface SuccessOutputOptions {
   command: string;
   json: boolean;
   result: unknown;
   lines: readonly string[];
-}): void {
+}
+
+export function formatSuccessOutput(options: SuccessOutputOptions): string {
   if (options.json) {
-    writeJsonEnvelope(createSuccessEnvelope(options.command, options.result));
-    return;
+    return `${JSON.stringify(createSuccessEnvelope(options.command, options.result), null, 2)}\n`;
+  }
+
+  return `${formatHumanText(options.lines.join('\n'))}\n`;
+}
+
+export function emitSuccess(options: SuccessOutputOptions): void {
+  process.stdout.write(formatSuccessOutput(options));
+}
+
+/**
+ * Write to stdout synchronously, looping over partial writes and a full pipe
+ * buffer (EAGAIN) so the whole payload is flushed before the caller exits the
+ * process. The batch signal handler needs this: an async `process.stdout.write`
+ * followed by a synchronous `process.exit` truncates anything past the OS pipe
+ * buffer (~64 KiB), corrupting the very partial envelope it means to flush.
+ */
+export function writeStdoutSync(text: string): void {
+  const buffer = Buffer.from(text, 'utf8');
+  let offset = 0;
+  while (offset < buffer.length) {
+    try {
+      offset += writeSync(STDOUT_FD, buffer, offset);
+    } catch (error) {
+      const code = (error as NodeJS.ErrnoException).code;
+      if (code === 'EAGAIN') {
+        continue;
+      }
+      if (code === 'EPIPE') {
+        return;
+      }
+      throw error;
+    }
   }
+}
 
-  writeHumanLines(options.lines);
+export function emitSuccessSync(options: SuccessOutputOptions): void {
+  writeStdoutSync(formatSuccessOutput(options));
 }
 
 export function emitFailure(options: {
diff --git a/src/host/hostMain.ts b/src/host/hostMain.ts
index 5b28a366..f027c992 100644
--- a/src/host/hostMain.ts
+++ b/src/host/hostMain.ts
@@ -538,8 +538,8 @@ export async function runHost(sessionId: string): Promise<void> {
       invariant(typeof text === 'string', 'type text must be a string');
       pty.write(text);
       lastActivityAt = Date.now();
-      await eventLog.append('input_text', { data: text });
-      return {};
+      const seq = await eventLog.append('input_text', { data: text });
+      return { seq };
     },
     mark: async (params: unknown) => {
       const { label } = params as MarkParams;
@@ -563,8 +563,8 @@ export async function runHost(sessionId: string): Promise<void> {
       const encoded = encodePaste(text);
       pty.write(encoded);
       lastActivityAt = Date.now();
-      await eventLog.append('input_paste', { data: encoded });
-      return {};
+      const seq = await eventLog.append('input_paste', { data: encoded });
+      return { seq };
     },
     run: async (params: unknown, context) => {
       const { command, noWait, timeoutMs } = params as RunParams;
@@ -848,6 +848,7 @@ export async function runHost(sessionId: string): Promise<void> {
         screenStableMs,
         cursorRow,
         cursorCol,
+        afterSeq,
         timeoutMs,
         rendererName: requestedRendererName,
       } = params as WaitForRenderParams;
@@ -860,6 +861,7 @@ export async function runHost(sessionId: string): Promise<void> {
         screenStableMs,
         cursorRow,
         cursorCol,
+        afterSeq,
       });
       if (timeoutMs !== undefined) {
         invariant(
@@ -875,6 +877,7 @@ export async function runHost(sessionId: string): Promise<void> {
       let lastVisibleText: string | undefined;
       let lastTextChangeAt = Date.now();
       let latestCapturedAtSeq = 0;
+      let seenPostBaseline = false;
 
       const pollCondition = new Promise<WaitForRenderResult>((resolve) => {
         let pollInFlight = false;
@@ -906,13 +909,21 @@ export async function runHost(sessionId: string): Promise<void> {
               consecutiveFailures = 0;
 
               const now = Date.now();
+              const postBaseline =
+                afterSeq === undefined || capturedAtSeq > afterSeq;
+              if (!postBaseline) {
+                return;
+              }
+
               if (
+                !seenPostBaseline ||
                 lastVisibleText === undefined ||
                 visibleText !== lastVisibleText
               ) {
                 lastVisibleText = visibleText;
                 lastTextChangeAt = now;
               }
+              seenPostBaseline = true;
 
               const match = matchRenderWaitSnapshot(
                 preparedCondition,
diff --git a/src/protocol/errors.ts b/src/protocol/errors.ts
index 48797f3a..a3b73b56 100644
--- a/src/protocol/errors.ts
+++ b/src/protocol/errors.ts
@@ -22,6 +22,7 @@ export const ERROR_CODES = {
   EXPORT_ERROR: 'EXPORT_ERROR',
   REPLAY_ERROR: 'REPLAY_ERROR',
   SKILL_NOT_FOUND: 'SKILL_NOT_FOUND',
+  WAIT_TIMEOUT: 'WAIT_TIMEOUT',
   INTERNAL_ERROR: 'INTERNAL_ERROR',
 } as const;
 
@@ -47,6 +48,8 @@ export const DEFAULT_ERROR_MESSAGES: Record<ProtocolErrorCode, string> = {
   [ERROR_CODES.EXPORT_ERROR]: 'Export failed.',
   [ERROR_CODES.REPLAY_ERROR]: 'Replay failed.',
   [ERROR_CODES.SKILL_NOT_FOUND]: 'Skill not found.',
+  [ERROR_CODES.WAIT_TIMEOUT]:
+    'Render wait timed out before its condition was met.',
   [ERROR_CODES.INTERNAL_ERROR]: 'Internal error.',
 };
 
diff --git a/src/protocol/messages.ts b/src/protocol/messages.ts
index 349f903f..fe429e32 100644
--- a/src/protocol/messages.ts
+++ b/src/protocol/messages.ts
@@ -219,7 +219,9 @@ export const TypeParamsSchema = z
   .strict();
 export type TypeParams = z.infer<typeof TypeParamsSchema>;
 
-export const TypeResultSchema = EmptyObjectSchema;
+export const TypeResultSchema = z
+  .object({ seq: z.number().int().nonnegative() })
+  .strict();
 export type TypeResult = z.infer<typeof TypeResultSchema>;
 
 export const PasteParamsSchema = z
@@ -229,7 +231,9 @@ export const PasteParamsSchema = z
   .strict();
 export type PasteParams = z.infer<typeof PasteParamsSchema>;
 
-export const PasteResultSchema = EmptyObjectSchema;
+export const PasteResultSchema = z
+  .object({ seq: z.number().int().nonnegative() })
+  .strict();
 export type PasteResult = z.infer<typeof PasteResultSchema>;
 
 export const RunParamsSchema = z
diff --git a/src/protocol/schemas.ts b/src/protocol/schemas.ts
index 83a281ac..97e4d781 100644
--- a/src/protocol/schemas.ts
+++ b/src/protocol/schemas.ts
@@ -403,6 +403,7 @@ export const WaitForRenderParamsSchema = z
     screenStableMs: PositiveIntSchema.optional(),
     cursorRow: NonNegativeIntSchema.optional(),
     cursorCol: NonNegativeIntSchema.optional(),
+    afterSeq: NonNegativeIntSchema.optional(),
     timeoutMs: PositiveIntSchema.optional(),
     rendererName: RendererNameSchema.optional(),
   })
diff --git a/src/pty/keyEncoder.ts b/src/pty/keyEncoder.ts
index 284c3569..3223a308 100644
--- a/src/pty/keyEncoder.ts
+++ b/src/pty/keyEncoder.ts
@@ -1,3 +1,4 @@
+import { ERROR_CODES, makeCliError } from '../protocol/errors.js';
 import { invariant } from '../util/assert.js';
 
 interface Modifiers {
@@ -82,6 +83,22 @@ export function encodeKey(keyName: string): string {
   invariant(false, `Unknown base key: ${baseKey}`);
 }
 
+/**
+ * Assert that `encodeKey` accepts this key name, throwing INVALID_KEYS with the
+ * same message the host surfaces. Used to reject a bad key at Batch Plan parse
+ * time, before any input is sent.
+ */
+export function assertValidKeyName(key: string): void {
+  try {
+    encodeKey(key);
+  } catch (error) {
+    throw makeCliError(ERROR_CODES.INVALID_KEYS, {
+      message: error instanceof Error ? error.message : 'Invalid key sequence.',
+      cause: error,
+    });
+  }
+}
+
 function parseKeyName(keyName: string): {
   baseKey: string;
   modifiers: Modifiers;
diff --git a/src/renderWait/matcher.ts b/src/renderWait/matcher.ts
index 9f3c308c..371480d8 100644
--- a/src/renderWait/matcher.ts
+++ b/src/renderWait/matcher.ts
@@ -21,7 +21,7 @@ const NESTED_QUANTIFIER_MESSAGE =
 
 type RenderWaitCondition = Pick<
   WaitForRenderParams,
-  'text' | 'regex' | 'screenStableMs' | 'cursorRow' | 'cursorCol'
+  'text' | 'regex' | 'screenStableMs' | 'cursorRow' | 'cursorCol' | 'afterSeq'
 >;
 
 export interface PreparedRenderWaitCondition extends RenderWaitCondition {
@@ -33,6 +33,7 @@ interface RenderWaitSnapshotMatch {
   readonly textMatched: boolean;
   readonly cursorMatched: boolean;
   readonly stabilityMatched: boolean;
+  readonly baselineMatched: boolean;
   readonly contentAndCursorMatched: boolean;
   readonly matchedText?: string;
   readonly cursorRow: number;
@@ -157,7 +158,8 @@ export function safeRegexExec(
 export function prepareRenderWaitCondition(
   condition: RenderWaitCondition,
 ): PreparedRenderWaitCondition {
-  const { text, regex, screenStableMs, cursorRow, cursorCol } = condition;
+  const { text, regex, screenStableMs, cursorRow, cursorCol, afterSeq } =
+    condition;
 
   if (
     text === undefined &&
@@ -238,12 +240,19 @@ export function prepareRenderWaitCondition(
     });
   }
 
+  if (afterSeq !== undefined && (!Number.isInteger(afterSeq) || afterSeq < 0)) {
+    throw makeCliError(ERROR_CODES.INVALID_INPUT, {
+      message: 'afterSeq must be a non-negative integer',
+    });
+  }
+
   return {
     text,
     regex,
     screenStableMs,
     cursorRow,
     cursorCol,
+    afterSeq,
     ...(compiledRegex === undefined ? {} : { compiledRegex }),
   };
 }
@@ -322,13 +331,17 @@ export function matchRenderWaitSnapshot(
     condition.screenStableMs === undefined ||
     (options.stableForMs !== undefined &&
       options.stableForMs >= condition.screenStableMs);
+  const baselineMatched =
+    condition.afterSeq === undefined ||
+    snapshot.capturedAtSeq > condition.afterSeq;
   const contentAndCursorMatched = textMatched && cursorMatched;
 
   return {
-    matched: contentAndCursorMatched && stabilityMatched,
+    matched: contentAndCursorMatched && stabilityMatched && baselineMatched,
     textMatched,
     cursorMatched,
     stabilityMatched,
+    baselineMatched,
     contentAndCursorMatched,
     ...(matchedText === undefined ? {} : { matchedText }),
     cursorRow: snapshot.cursorRow,
diff --git a/test/e2e/batch-fixture.test.ts b/test/e2e/batch-fixture.test.ts
new file mode 100644
index 00000000..c4f7f9f3
--- /dev/null
+++ b/test/e2e/batch-fixture.test.ts
@@ -0,0 +1,146 @@
+import { afterEach, beforeEach, describe, expect, it } from 'vitest';
+
+import type { BatchResult } from '../../src/batch/result.js';
+
+import {
+  cleanupHome,
+  createIsolatedHome,
+  fixtureCommand,
+  normalizeTerminalOutput,
+  readOutput,
+  runCli,
+  runCliJson,
+  type SuccessEnvelope,
+} from './helpers.js';
+
+interface CreateResult {
+  sessionId: string;
+}
+
+function testEnv(home: string): Record<string, string> {
+  return { AGENT_TTY_HOME: home };
+}
+
+// These flows drive a BUNDLED fixture (never nvim) through ONE batch
+// invocation each. They require a real PTY + ghostty-web renderer, so they do
+// NOT run in the sandbox (HOST_UNREACHABLE / no browser); they are gated by
+// typecheck + static review and exercised in CI. Structure mirrors
+// hello-prompt.test.ts.
+describe('batch fixture e2e', { timeout: 30_000 }, () => {
+  let testHome = '';
+  let createdSessionIds: string[] = [];
+
+  beforeEach(async () => {
+    testHome = await createIsolatedHome();
+    createdSessionIds = [];
+  });
+
+  afterEach(async () => {
+    const env = testEnv(testHome);
+
+    for (const sessionId of createdSessionIds) {
+      runCli(['destroy', sessionId, '--force', '--json'], env);
+    }
+
+    await cleanupHome(testHome);
+  });
+
+  it('drives the alt-screen TUI fixture through one batch', async () => {
+    const env = testEnv(testHome);
+    const createEnvelope = runCliJson<SuccessEnvelope<CreateResult>>(
+      ['create', '--', ...fixtureCommand('alt-screen-demo')],
+      env,
+    );
+    expect(createEnvelope.ok).toBe(true);
+
+    const sessionId = createEnvelope.result.sessionId;
+    createdSessionIds.push(sessionId);
+
+    // One batch: settle on the main screen, advance into the alt screen, wait
+    // for its label, advance back, wait for the restored main screen. Each
+    // post-input wait is anchored to the Wait Baseline of the preceding
+    // sendKeys, so it cannot match the screen left by an earlier step.
+    const steps = JSON.stringify([
+      { wait: { screenStableMs: 1000, timeout: 10_000 } },
+      { sendKeys: ['Enter'] },
+      { wait: { text: 'ALT SCREEN ACTIVE', timeout: 10_000 } },
+      { sendKeys: ['Enter'] },
+      { wait: { text: 'BACK ON MAIN SCREEN', timeout: 10_000 } },
+    ]);
+
+    const batchEnvelope = runCliJson<SuccessEnvelope<BatchResult>>(
+      ['batch', sessionId, steps],
+      env,
+    );
+
+    expect(batchEnvelope.ok).toBe(true);
+    expect(batchEnvelope.command).toBe('batch');
+    expect(batchEnvelope.result.failedIndices).toEqual([]);
+    expect(batchEnvelope.result.completedCount).toBe(5);
+    expect(batchEnvelope.result.steps.map((step) => step.status)).toEqual([
+      'completed',
+      'completed',
+      'completed',
+      'completed',
+      'completed',
+    ]);
+
+    // The second wait (index 2) is anchored to the seq the first sendKeys
+    // produced — the Wait Baseline the executor threaded through.
+    const [, firstKeys, altWait] = batchEnvelope.result.steps;
+    if (firstKeys?.kind === 'sendKeys' && altWait?.kind === 'wait') {
+      expect(altWait.waitBaseline).toBe(firstKeys.seq);
+    }
+
+    const output = normalizeTerminalOutput(
+      await readOutput(testHome, sessionId),
+    );
+    expect(output).toContain('ALT SCREEN ACTIVE');
+    expect(output).toContain('BACK ON MAIN SCREEN');
+  });
+
+  it('drives the hello-prompt fixture through one batch (type, sendKeys, anchored wait)', async () => {
+    const env = testEnv(testHome);
+    const createEnvelope = runCliJson<SuccessEnvelope<CreateResult>>(
+      ['create', '--', ...fixtureCommand('hello-prompt')],
+      env,
+    );
+    expect(createEnvelope.ok).toBe(true);
+
+    const sessionId = createEnvelope.result.sessionId;
+    createdSessionIds.push(sessionId);
+
+    // One batch exercises type/sendKeys/wait against the prompt fixture. The
+    // leading wait matches the prompt WITHOUT its trailing space: the raw
+    // output keeps "READY> ", but the rendered snapshot trims trailing blank
+    // cells, so the grid shows "READY>". The final wait uses a distinctive
+    // token (not the literal typed text) so it anchors on the fixture's ECHO
+    // line rather than the keystroke echo — the documented echo-match limit.
+    const steps = JSON.stringify([
+      { wait: { text: 'READY>', timeout: 10_000 } },
+      { type: 'batchtoken9' },
+      { sendKeys: ['Enter'] },
+      { wait: { text: 'ECHO: batchtoken9', timeout: 10_000 } },
+    ]);
+
+    const batchEnvelope = runCliJson<SuccessEnvelope<BatchResult>>(
+      ['batch', sessionId, steps],
+      env,
+    );
+
+    expect(batchEnvelope.ok).toBe(true);
+    expect(batchEnvelope.result.failedIndices).toEqual([]);
+    expect(batchEnvelope.result.completedCount).toBe(4);
+    expect(batchEnvelope.result.steps.map((step) => step.kind)).toEqual([
+      'wait',
+      'type',
+      'sendKeys',
+      'wait',
+    ]);
+
+    const output = normalizeTerminalOutput(
+      await readOutput(testHome, sessionId),
+    );
+    expect(output).toContain('ECHO: batchtoken9');
+  });
+});
diff --git a/test/e2e/hello-prompt.test.ts b/test/e2e/hello-prompt.test.ts
index 16fe7a72..7c347253 100644
--- a/test/e2e/hello-prompt.test.ts
+++ b/test/e2e/hello-prompt.test.ts
@@ -91,13 +91,15 @@ describe('hello-prompt e2e', { timeout: 30_000 }, () => {
       ),
     ).resolves.toContain('READY> ');
 
-    const typeEnvelope = runCliJson<SuccessEnvelope<Record<string, never>>>(
+    const typeEnvelope = runCliJson<SuccessEnvelope<{ seq: number }>>(
       ['type', sessionId, 'hello world'],
       env,
     );
     expect(typeEnvelope.ok).toBe(true);
     expect(typeEnvelope.command).toBe('type');
-    expect(typeEnvelope.result).toEqual({});
+    expect(typeEnvelope.result).toEqual(
+      expect.objectContaining({ seq: expect.any(Number) as number }),
+    );
 
     const sendKeysEnvelope = runCliJson<SuccessEnvelope<SendKeysResult>>(
       ['send-keys', sessionId, 'Enter'],
@@ -146,13 +148,15 @@ describe('hello-prompt e2e', { timeout: 30_000 }, () => {
       }),
     );
 
-    const typeExitEnvelope = runCliJson<SuccessEnvelope<Record<string, never>>>(
+    const typeExitEnvelope = runCliJson<SuccessEnvelope<{ seq: number }>>(
       ['type', sessionId, 'exit'],
       env,
     );
     expect(typeExitEnvelope.ok).toBe(true);
     expect(typeExitEnvelope.command).toBe('type');
-    expect(typeExitEnvelope.result).toEqual({});
+    expect(typeExitEnvelope.result).toEqual(
+      expect.objectContaining({ seq: expect.any(Number) as number }),
+    );
 
     const sendExitEnterEnvelope = runCliJson<SuccessEnvelope<SendKeysResult>>(
       ['send-keys', sessionId, 'Enter'],
@@ -234,13 +238,15 @@ describe('hello-prompt e2e', { timeout: 30_000 }, () => {
     );
     expect(waitForReady.result.timedOut).toBe(false);
 
-    const pasteEnvelope = runCliJson<SuccessEnvelope<Record<string, never>>>(
+    const pasteEnvelope = runCliJson<SuccessEnvelope<{ seq: number }>>(
       ['paste', sessionId, 'exit-code 42'],
       env,
     );
     expect(pasteEnvelope.ok).toBe(true);
     expect(pasteEnvelope.command).toBe('paste');
-    expect(pasteEnvelope.result).toEqual({});
+    expect(pasteEnvelope.result).toEqual(
+      expect.objectContaining({ seq: expect.any(Number) as number }),
+    );
 
     const sendKeysEnvelope = runCliJson<SuccessEnvelope<SendKeysResult>>(
       ['send-keys', sessionId, 'Enter'],
diff --git a/test/e2e/resize-demo.test.ts b/test/e2e/resize-demo.test.ts
index daf22d8f..2ec785d5 100644
--- a/test/e2e/resize-demo.test.ts
+++ b/test/e2e/resize-demo.test.ts
@@ -120,13 +120,15 @@ describe('resize-demo e2e', { timeout: 30_000 }, () => {
       ),
     ).resolves.toContain('SIZE: 120x40\n');
 
-    const typeQuitEnvelope = runCliJson<SuccessEnvelope<Record<string, never>>>(
+    const typeQuitEnvelope = runCliJson<SuccessEnvelope<{ seq: number }>>(
       ['type', sessionId, 'quit'],
       env,
     );
     expect(typeQuitEnvelope.ok).toBe(true);
     expect(typeQuitEnvelope.command).toBe('type');
-    expect(typeQuitEnvelope.result).toEqual({});
+    expect(typeQuitEnvelope.result).toEqual(
+      expect.objectContaining({ seq: expect.any(Number) as number }),
+    );
 
     const sendKeysEnvelope = runCliJson<SuccessEnvelope<SendKeysResult>>(
       ['send-keys', sessionId, 'Enter'],
diff --git a/test/integration/batch.test.ts b/test/integration/batch.test.ts
new file mode 100644
index 00000000..49b33ef0
--- /dev/null
+++ b/test/integration/batch.test.ts
@@ -0,0 +1,257 @@
+import { mkdtempSync, realpathSync, writeFileSync } from 'node:fs';
+import { tmpdir } from 'node:os';
+import { join } from 'node:path';
+
+import { afterEach, beforeEach, describe, expect, it } from 'vitest';
+
+import type { BatchResult } from '../../src/batch/result.js';
+import type { CommandErrorEnvelope } from '../../src/protocol/envelope.js';
+
+import {
+  cleanupHome,
+  createSession,
+  destroySession,
+  runCli,
+  sleep,
+  type SuccessEnvelope,
+} from '../helpers.js';
+
+let testHome = '';
+let sessionId = '';
+
+function testEnv(): Record<string, string> {
+  return { AGENT_TTY_HOME: testHome };
+}
+
+describe('batch command integration', { timeout: 45_000 }, () => {
+  beforeEach(() => {
+    // oxfmt-ignore
+    testHome = realpathSync(mkdtempSync(join(tmpdir(), 'agent-tty-batch-home-')));
+  });
+
+  afterEach(async () => {
+    destroySession(testHome, sessionId);
+    sessionId = '';
+    await cleanupHome(testHome);
+    testHome = '';
+  });
+
+  // --- Parse-error cases run in-sandbox: parsing precedes target resolution,
+  // so no live Session is required. ---
+
+  it('rejects malformed JSON steps with INVALID_INPUT before resolving a target', () => {
+    const result = runCli(
+      ['batch', 'nonexistent-session', '{not json', '--json'],
+      testEnv(),
+    );
+
+    expect(result.status).toBe(2);
+    expect(result.stderr).toBe('');
+
+    const envelope = JSON.parse(result.stdout) as CommandErrorEnvelope;
+    expect(envelope.ok).toBe(false);
+    expect(envelope.error.code).toBe('INVALID_INPUT');
+  });
+
+  it('rejects a non-array top-level steps payload with INVALID_INPUT', () => {
+    const result = runCli(
+      ['batch', 'nonexistent-session', '{"type":"hi"}', '--json'],
+      testEnv(),
+    );
+
+    expect(result.status).toBe(2);
+    expect(result.stderr).toBe('');
+
+    const envelope = JSON.parse(result.stdout) as CommandErrorEnvelope;
+    expect(envelope.ok).toBe(false);
+    expect(envelope.error.code).toBe('INVALID_INPUT');
+    expect(envelope.error.message).toContain('JSON array');
+  });
+
+  it('rejects inline steps and --file together as mutually exclusive', () => {
+    const stepsPath = join(testHome, 'steps.json');
+    writeFileSync(stepsPath, JSON.stringify([{ type: 'hi' }]));
+
+    const result = runCli(
+      [
+        'batch',
+        'some-session',
+        '[{"type":"hi"}]',
+        '--file',
+        stepsPath,
+        '--json',
+      ],
+      testEnv(),
+    );
+
+    expect(result.status).toBe(2);
+    expect(result.stderr).toBe('');
+
+    const envelope = JSON.parse(result.stdout) as CommandErrorEnvelope;
+    expect(envelope.ok).toBe(false);
+    expect(envelope.error.code).toBe('INVALID_INPUT');
+    expect(envelope.error.message).toContain('mutually exclusive');
+  });
+
+  it('rejects neither inline steps nor --file with INVALID_INPUT', () => {
+    const result = runCli(['batch', 'some-session', '--json'], testEnv());
+
+    expect(result.status).toBe(2);
+    expect(result.stderr).toBe('');
+
+    const envelope = JSON.parse(result.stdout) as CommandErrorEnvelope;
+    expect(envelope.ok).toBe(false);
+    expect(envelope.error.code).toBe('INVALID_INPUT');
+  });
+
+  // --- Happy path requires a real PTY + renderer, so it does not run in this
+  // sandbox (HOST_UNREACHABLE / no browser). It is gated by typecheck + static
+  // review and exercised in CI where a live Session and renderer exist. ---
+
+  it('runs an ordered multi-step plan against a live session', async () => {
+    sessionId = createSession(testHome, [
+      '/bin/sh',
+      '-c',
+      "printf 'booting\\n'; sleep 1; printf 'Ready\\n'; exec cat",
+    ]);
+    await sleep(500);
+
+    const steps = JSON.stringify([
+      { wait: { text: 'Ready', timeout: 10_000 } },
+      { type: 'echo from-batch' },
+      { sendKeys: ['Enter'] },
+      { wait: { text: 'from-batch', timeout: 10_000 } },
+    ]);
+
+    const result = runCli(['batch', sessionId, steps, '--json'], testEnv());
+
+    expect(result.status).toBe(0);
+    expect(result.stderr).toBe('');
+
+    const envelope = JSON.parse(result.stdout) as SuccessEnvelope<BatchResult>;
+    expect(envelope.ok).toBe(true);
+    expect(envelope.result.completedCount).toBe(4);
+    expect(envelope.result.failedIndices).toEqual([]);
+    expect(envelope.result.steps).toHaveLength(4);
+
+    const [firstWait, typeStep, keysStep, secondWait] = envelope.result.steps;
+    expect(firstWait).toMatchObject({ kind: 'wait', status: 'completed' });
+    expect(typeStep).toMatchObject({ kind: 'type', status: 'completed' });
+    expect(keysStep).toMatchObject({ kind: 'sendKeys', status: 'completed' });
+    expect(secondWait).toMatchObject({ kind: 'wait', status: 'completed' });
+
+    // The second wait is anchored to the seq the preceding sendKeys produced.
+    if (keysStep?.kind === 'sendKeys' && secondWait?.kind === 'wait') {
+      expect(secondWait.waitBaseline).toBe(keysStep.seq);
+    }
+  });
+
+  it('reads the step array from --file', async () => {
+    sessionId = createSession(testHome, ['/bin/bash']);
+    await sleep(500);
+
+    const stepsPath = join(testHome, 'plan.json');
+    writeFileSync(
+      stepsPath,
+      JSON.stringify([{ run: 'echo file-driven', noWait: true }]),
+    );
+
+    const result = runCli(
+      ['batch', sessionId, '--file', stepsPath, '--json'],
+      testEnv(),
+    );
+
+    expect(result.status).toBe(0);
+    expect(result.stderr).toBe('');
+
+    const envelope = JSON.parse(result.stdout) as SuccessEnvelope<BatchResult>;
+    expect(envelope.ok).toBe(true);
+    expect(envelope.result.completedCount).toBe(1);
+    expect(envelope.result.steps[0]).toMatchObject({
+      kind: 'run',
+      status: 'completed',
+      noWait: true,
+    });
+  });
+
+  it('exits 11 when a wait times out under the default fail-fast policy', async () => {
+    sessionId = createSession(testHome, [
+      '/bin/sh',
+      '-c',
+      "printf 'Ready\\n'; exec cat",
+    ]);
+    await sleep(500);
+
+    // The first wait can never match, so it times out; fail-fast stops the
+    // batch and the trailing step is recorded not-run.
+    const steps = JSON.stringify([
+      { wait: { text: 'never-appears', timeout: 1000 } },
+      { type: 'unreached' },
+    ]);
+
+    const result = runCli(['batch', sessionId, steps, '--json'], testEnv());
+
+    // WAIT_TIMEOUT maps to exit code 11 — distinct from HOST_TIMEOUT (5) and
+    // HOST_UNREACHABLE (6).
+    expect(result.status).toBe(11);
+    expect(result.stderr).toBe('');
+
+    // The per-step envelope is still emitted (doctor pattern): a step failure
+    // never routes through emitFailure, so steps[] is preserved.
+    const envelope = JSON.parse(result.stdout) as SuccessEnvelope<BatchResult>;
+    expect(envelope.ok).toBe(true);
+    expect(envelope.result.completedCount).toBe(0);
+    expect(envelope.result.failedIndices).toEqual([0]);
+    expect(envelope.result.steps).toHaveLength(2);
+
+    const [waitStep, typeStep] = envelope.result.steps;
+    expect(waitStep).toMatchObject({
+      kind: 'wait',
+      status: 'failed',
+      timedOut: true,
+      error: { code: 'WAIT_TIMEOUT' },
+    });
+    expect(typeStep).toMatchObject({ kind: 'type', status: 'not-run' });
+  });
+
+  it('exits non-zero with a full envelope under --keep-going on failure', async () => {
+    sessionId = createSession(testHome, [
+      '/bin/sh',
+      '-c',
+      "printf 'Ready\\n'; exec cat",
+    ]);
+    await sleep(500);
+
+    // The first wait times out; --keep-going attempts every remaining step, so
+    // the trailing input still runs and the run completes.
+    const steps = JSON.stringify([
+      { wait: { text: 'never-appears', timeout: 1000 } },
+      { type: 'echo kept-going' },
+      { sendKeys: ['Enter'] },
+    ]);
+
+    const result = runCli(
+      ['batch', sessionId, steps, '--keep-going', '--json'],
+      testEnv(),
+    );
+
+    // Keep-going collapses any failure to a fixed batch-level exit code of 1.
+    expect(result.status).toBe(1);
+    expect(result.stderr).toBe('');
+
+    const envelope = JSON.parse(result.stdout) as SuccessEnvelope<BatchResult>;
+    expect(envelope.ok).toBe(true);
+    // Every step was attempted: no not-run steps, multiple records present.
+    expect(envelope.result.steps).toHaveLength(3);
+    expect(
+      envelope.result.steps.some((step) => step.status === 'not-run'),
+    ).toBe(false);
+    expect(envelope.result.failedIndices).toEqual([0]);
+    expect(envelope.result.completedCount).toBe(2);
+    expect(envelope.result.steps[0]).toMatchObject({
+      kind: 'wait',
+      status: 'failed',
+      error: { code: 'WAIT_TIMEOUT' },
+    });
+  });
+});
diff --git a/test/unit/batch/executor.test.ts b/test/unit/batch/executor.test.ts
new file mode 100644
index 00000000..c23c9e2f
--- /dev/null
+++ b/test/unit/batch/executor.test.ts
@@ -0,0 +1,664 @@
+import { describe, expect, it, vi } from 'vitest';
+
+import type { BatchPlan } from '../../../src/batch/plan.js';
+import type { StepDriver } from '../../../src/batch/stepDriver.js';
+import type {
+  RunResult,
+  WaitForRenderResult,
+} from '../../../src/protocol/messages.js';
+
+import { executeBatch } from '../../../src/batch/executor.js';
+import { parseBatchPlan } from '../../../src/batch/plan.js';
+import { makeCliError } from '../../../src/protocol/errors.js';
+
+type DriverCall =
+  | { verb: 'type'; text: string }
+  | { verb: 'paste'; text: string }
+  | { verb: 'sendKeys'; keys: string[] }
+  | { verb: 'run'; command: string; noWait: boolean }
+  | { verb: 'wait'; afterSeq: number | undefined };
+
+interface FakeDriverOptions {
+  /** Seq returned by each input verb (type/paste/sendKeys), in order. */
+  inputSeqs?: number[];
+  /** RunResult returned by each run call, in order. */
+  runResults?: RunResult[];
+  /** WaitForRenderResult returned by each wait call, in order. */
+  waitResults?: WaitForRenderResult[];
+}
+
+interface FakeDriver {
+  driver: StepDriver;
+  calls: DriverCall[];
+}
+
+const MATCHED_WAIT: WaitForRenderResult = {
+  matched: true,
+  timedOut: false,
+  capturedAtSeq: 99,
+};
+
+function plan(steps: unknown[]): BatchPlan {
+  return parseBatchPlan(JSON.stringify(steps));
+}
+
+function createFakeDriver(options: FakeDriverOptions = {}): FakeDriver {
+  const calls: DriverCall[] = [];
+  const inputSeqs = [...(options.inputSeqs ?? [])];
+  const runResults = [...(options.runResults ?? [])];
+  const waitResults = [...(options.waitResults ?? [])];
+  let inputCounter = 0;
+
+  const nextInputSeq = (): number => {
+    if (inputSeqs.length > 0) {
+      const seq = inputSeqs.shift();
+      if (seq === undefined) {
+        throw new Error('inputSeqs exhausted');
+      }
+      return seq;
+    }
+    inputCounter += 1;
+    return inputCounter;
+  };
+
+  const driver: StepDriver = {
+    type: vi.fn((text: string): Promise<number> => {
+      calls.push({ verb: 'type', text });
+      return Promise.resolve(nextInputSeq());
+    }),
+    paste: vi.fn((text: string): Promise<number> => {
+      calls.push({ verb: 'paste', text });
+      return Promise.resolve(nextInputSeq());
+    }),
+    sendKeys: vi.fn((keys: string[]): Promise<number> => {
+      calls.push({ verb: 'sendKeys', keys });
+      return Promise.resolve(nextInputSeq());
+    }),
+    run: vi.fn((command: string, noWait: boolean): Promise<RunResult> => {
+      calls.push({ verb: 'run', command, noWait });
+      const result = runResults.shift();
+      if (result === undefined) {
+        return Promise.resolve({ accepted: true, seq: nextInputSeq() });
+      }
+      return Promise.resolve(result);
+    }),
+    wait: vi.fn(
+      (
+        _condition,
+        afterSeq: number | undefined,
+      ): Promise<WaitForRenderResult> => {
+        calls.push({ verb: 'wait', afterSeq });
+        return Promise.resolve(waitResults.shift() ?? MATCHED_WAIT);
+      },
+    ),
+  };
+
+  return { driver, calls };
+}
+
+describe('executeBatch', () => {
+  it('runs every step in plan order and accumulates completedCount', async () => {
+    const { driver, calls } = createFakeDriver();
+    const result = await executeBatch({
+      plan: plan([
+        { type: 'hello' },
+        { sendKeys: ['Enter'] },
+        { wait: { text: 'done' } },
+      ]),
+      driver,
+      keepGoing: false,
+    });
+
+    expect(calls.map((call) => call.verb)).toEqual([
+      'type',
+      'sendKeys',
+      'wait',
+    ]);
+    expect(result.completedCount).toBe(3);
+    expect(result.failedIndices).toEqual([]);
+    expect(result.steps.map((step) => step.status)).toEqual([
+      'completed',
+      'completed',
+      'completed',
+    ]);
+  });
+
+  it('threads the prior input step seq into the following wait as afterSeq', async () => {
+    const { driver, calls } = createFakeDriver({ inputSeqs: [7] });
+    await executeBatch({
+      plan: plan([{ type: 'hello' }, { wait: { text: 'done' } }]),
+      driver,
+      keepGoing: false,
+    });
+
+    const waitCall = calls.find((call) => call.verb === 'wait');
+    expect(waitCall).toEqual({ verb: 'wait', afterSeq: 7 });
+  });
+
+  it('records the threaded seq as the wait step waitBaseline', async () => {
+    const { driver } = createFakeDriver({ inputSeqs: [42] });
+    const result = await executeBatch({
+      plan: plan([{ paste: 'x' }, { wait: { text: 'done' } }]),
+      driver,
+      keepGoing: false,
+    });
+
+    const waitStep = result.steps[1];
+    expect(waitStep).toMatchObject({
+      kind: 'wait',
+      waitBaseline: 42,
+      matched: true,
+    });
+  });
+
+  it('passes undefined afterSeq for a leading wait (no prior input step)', async () => {
+    const { driver, calls } = createFakeDriver();
+    const result = await executeBatch({
+      plan: plan([{ wait: { text: 'ready' } }, { type: 'hi' }]),
+      driver,
+      keepGoing: false,
+    });
+
+    expect(calls[0]).toEqual({ verb: 'wait', afterSeq: undefined });
+    expect(result.steps[0]).toMatchObject({ kind: 'wait' });
+    expect(result.steps[0]).not.toHaveProperty('waitBaseline');
+  });
+
+  it('carries the prior input seq unchanged across a wait-after-wait', async () => {
+    const { driver, calls } = createFakeDriver({ inputSeqs: [5] });
+    await executeBatch({
+      plan: plan([
+        { type: 'hello' },
+        { wait: { text: 'first' } },
+        { wait: { text: 'second' } },
+      ]),
+      driver,
+      keepGoing: false,
+    });
+
+    const waitCalls = calls.filter((call) => call.verb === 'wait');
+    expect(waitCalls).toEqual([
+      { verb: 'wait', afterSeq: 5 },
+      { verb: 'wait', afterSeq: 5 },
+    ]);
+  });
+
+  it('uses RunResult.seq as the next wait baseline', async () => {
+    const { driver, calls } = createFakeDriver({
+      runResults: [{ accepted: true, seq: 13 }],
+    });
+    await executeBatch({
+      plan: plan([
+        { run: 'nvim --clean', noWait: true },
+        { wait: { screenStableMs: 1000 } },
+      ]),
+      driver,
+      keepGoing: false,
+    });
+
+    const waitCall = calls.find((call) => call.verb === 'wait');
+    expect(waitCall).toEqual({ verb: 'wait', afterSeq: 13 });
+  });
+
+  it('records a run step outcome of started for a noWait run', async () => {
+    const { driver } = createFakeDriver({
+      runResults: [{ accepted: true, seq: 1 }],
+    });
+    const result = await executeBatch({
+      plan: plan([{ run: 'nvim --clean', noWait: true }]),
+      driver,
+      keepGoing: false,
+    });
+
+    expect(result.steps[0]).toMatchObject({
+      kind: 'run',
+      status: 'completed',
+      seq: 1,
+      noWait: true,
+      runOutcome: 'started',
+    });
+  });
+
+  it('records a Waited Run completion outcome', async () => {
+    const { driver } = createFakeDriver({
+      runResults: [
+        {
+          accepted: true,
+          seq: 4,
+          completed: true,
+          timedOut: false,
+          durationMs: 12,
+          marker: '__AT_MARKER_x__',
+        },
+      ],
+    });
+    const result = await executeBatch({
+      plan: plan([{ run: 'echo hi' }]),
+      driver,
+      keepGoing: false,
+    });
+
+    expect(result.steps[0]).toMatchObject({
+      kind: 'run',
+      status: 'completed',
+      seq: 4,
+      noWait: false,
+      completed: true,
+      timedOut: false,
+      runOutcome: 'completed',
+    });
+  });
+
+  it('does not advance the baseline across a leading run followed by another input then a wait', async () => {
+    const { driver, calls } = createFakeDriver({
+      runResults: [{ accepted: true, seq: 2 }],
+      inputSeqs: [8],
+    });
+    await executeBatch({
+      plan: plan([
+        { run: 'echo hi', noWait: true },
+        { type: 'more' },
+        { wait: { text: 'done' } },
+      ]),
+      driver,
+      keepGoing: false,
+    });
+
+    // The wait anchors to the most recent input step (the `type`, seq 8), not
+    // the earlier run (seq 2).
+    const waitCall = calls.find((call) => call.verb === 'wait');
+    expect(waitCall).toEqual({ verb: 'wait', afterSeq: 8 });
+  });
+
+  it('invokes onStep once per finalized step record in order', async () => {
+    const { driver } = createFakeDriver({ inputSeqs: [1, 2] });
+    const seen: number[] = [];
+    const result = await executeBatch({
+      plan: plan([{ type: 'a' }, { type: 'b' }, { wait: { text: 'x' } }]),
+      driver,
+      keepGoing: false,
+      onStep: (record) => {
+        seen.push(record.index);
+      },
+    });
+
+    expect(seen).toEqual([0, 1, 2]);
+    expect(result.steps).toHaveLength(3);
+  });
+
+  describe('fail-fast and keep-going', () => {
+    it('stops at the first failed wait and marks later steps not-run', async () => {
+      const { driver, calls } = createFakeDriver({
+        inputSeqs: [1],
+        waitResults: [{ matched: false, timedOut: true, capturedAtSeq: 3 }],
+      });
+      const result = await executeBatch({
+        plan: plan([
+          { type: 'go' },
+          { wait: { text: 'never' } },
+          { type: 'after' },
+        ]),
+        driver,
+        keepGoing: false,
+      });
+
+      // The `type after` step is never dispatched.
+      expect(calls.map((call) => call.verb)).toEqual(['type', 'wait']);
+      expect(result.steps.map((step) => step.status)).toEqual([
+        'completed',
+        'failed',
+        'not-run',
+      ]);
+      expect(result.failedIndices).toEqual([1]);
+      expect(result.completedCount).toBe(1);
+    });
+
+    it('attempts every step under keepGoing despite a failed wait', async () => {
+      const { driver, calls } = createFakeDriver({
+        inputSeqs: [1, 2],
+        waitResults: [{ matched: false, timedOut: true, capturedAtSeq: 3 }],
+      });
+      const result = await executeBatch({
+        plan: plan([
+          { type: 'go' },
+          { wait: { text: 'never' } },
+          { type: 'after' },
+        ]),
+        driver,
+        keepGoing: true,
+      });
+
+      expect(calls.map((call) => call.verb)).toEqual(['type', 'wait', 'type']);
+      expect(result.steps.map((step) => step.status)).toEqual([
+        'completed',
+        'failed',
+        'completed',
+      ]);
+      expect(result.failedIndices).toEqual([1]);
+      expect(result.completedCount).toBe(2);
+    });
+
+    it('records every trailing step not-run after a fail-fast stop', async () => {
+      const { driver } = createFakeDriver({
+        waitResults: [{ matched: false, timedOut: true, capturedAtSeq: 2 }],
+      });
+      const result = await executeBatch({
+        plan: plan([
+          { wait: { text: 'never' } },
+          { type: 'a' },
+          { sendKeys: ['Enter'] },
+          { wait: { text: 'also-never' } },
+        ]),
+        driver,
+        keepGoing: false,
+      });
+
+      expect(result.steps.map((step) => step.status)).toEqual([
+        'failed',
+        'not-run',
+        'not-run',
+        'not-run',
+      ]);
+      expect(result.failedIndices).toEqual([0]);
+      expect(result.completedCount).toBe(0);
+    });
+
+    it('collects multiple failed indices and zero not-run under keepGoing', async () => {
+      const { driver } = createFakeDriver({
+        waitResults: [
+          { matched: false, timedOut: true, capturedAtSeq: 2 },
+          { matched: false, timedOut: true, capturedAtSeq: 4 },
+        ],
+      });
+      const result = await executeBatch({
+        plan: plan([
+          { wait: { text: 'never-1' } },
+          { type: 'between' },
+          { wait: { text: 'never-2' } },
+        ]),
+        driver,
+        keepGoing: true,
+      });
+
+      expect(result.steps.map((step) => step.status)).toEqual([
+        'failed',
+        'completed',
+        'failed',
+      ]);
+      expect(result.failedIndices).toEqual([0, 2]);
+      expect(result.completedCount).toBe(1);
+      expect(result.steps.some((step) => step.status === 'not-run')).toBe(
+        false,
+      );
+    });
+  });
+
+  describe('wait timeout classification', () => {
+    it('marks a timed-out wait failed with a WAIT_TIMEOUT error', async () => {
+      const { driver } = createFakeDriver({
+        inputSeqs: [3],
+        waitResults: [{ matched: false, timedOut: true, capturedAtSeq: 5 }],
+      });
+      const result = await executeBatch({
+        plan: plan([{ type: 'go' }, { wait: { text: 'never' } }]),
+        driver,
+        keepGoing: false,
+      });
+
+      expect(result.steps[1]).toMatchObject({
+        kind: 'wait',
+        status: 'failed',
+        matched: false,
+        timedOut: true,
+        capturedAtSeq: 5,
+        waitBaseline: 3,
+        error: { code: 'WAIT_TIMEOUT' },
+      });
+    });
+
+    it('treats an unmatched-but-not-timedOut wait as a WAIT_TIMEOUT failure', async () => {
+      const { driver } = createFakeDriver({
+        waitResults: [{ matched: false, timedOut: false, capturedAtSeq: 1 }],
+      });
+      const result = await executeBatch({
+        plan: plan([{ wait: { text: 'never' } }]),
+        driver,
+        keepGoing: false,
+      });
+
+      expect(result.steps[0]).toMatchObject({
+        kind: 'wait',
+        status: 'failed',
+        error: { code: 'WAIT_TIMEOUT' },
+      });
+    });
+  });
+
+  describe('run completion classification', () => {
+    it('fails a Waited Run that timed out with a timedOut runOutcome', async () => {
+      const { driver } = createFakeDriver({
+        runResults: [
+          { accepted: true, seq: 5, completed: false, timedOut: true },
+        ],
+      });
+      const result = await executeBatch({
+        plan: plan([{ run: 'sleep 60' }]),
+        driver,
+        keepGoing: false,
+      });
+
+      expect(result.steps[0]).toMatchObject({
+        kind: 'run',
+        status: 'failed',
+        noWait: false,
+        completed: false,
+        timedOut: true,
+        runOutcome: 'timedOut',
+      });
+      expect(result.failedIndices).toEqual([0]);
+      expect(result.completedCount).toBe(0);
+    });
+
+    it('fails a Waited Run interrupted by Session exit with a sessionExited runOutcome', async () => {
+      const { driver } = createFakeDriver({
+        runResults: [{ accepted: true, seq: 6 }],
+      });
+      const result = await executeBatch({
+        plan: plan([{ run: 'echo hi' }]),
+        driver,
+        keepGoing: false,
+      });
+
+      expect(result.steps[0]).toMatchObject({
+        kind: 'run',
+        status: 'failed',
+        noWait: false,
+        runOutcome: 'sessionExited',
+      });
+      const runStep = result.steps[0];
+      expect(runStep?.status === 'failed' && runStep.error).toBeTruthy();
+    });
+
+    it('completes a no-wait run once accepted regardless of completion', async () => {
+      const { driver } = createFakeDriver({
+        runResults: [{ accepted: true, seq: 7 }],
+      });
+      const result = await executeBatch({
+        plan: plan([{ run: 'nvim --clean', noWait: true }]),
+        driver,
+        keepGoing: false,
+      });
+
+      expect(result.steps[0]).toMatchObject({
+        kind: 'run',
+        status: 'completed',
+        noWait: true,
+        runOutcome: 'started',
+      });
+    });
+
+    it('advances the Wait Baseline past a failed Waited Run under keep-going', async () => {
+      // A Waited Run that timed out still injected its command and carries the
+      // real input_run seq; under keep-going the following wait must anchor past
+      // it rather than stale-match the pre-run screen.
+      const { driver, calls } = createFakeDriver({
+        runResults: [
+          { accepted: true, seq: 7, completed: false, timedOut: true },
+        ],
+      });
+      const result = await executeBatch({
+        plan: plan([{ run: 'launch-tui' }, { wait: { text: 'ready' } }]),
+        driver,
+        keepGoing: true,
+      });
+
+      const waitCall = calls.find((call) => call.verb === 'wait');
+      expect(waitCall).toEqual({ verb: 'wait', afterSeq: 7 });
+      expect(result.failedIndices).toEqual([0]);
+      expect(result.steps[1]).toMatchObject({
+        kind: 'wait',
+        status: 'completed',
+        waitBaseline: 7,
+      });
+    });
+  });
+
+  describe('thrown-error handling', () => {
+    it('reframes a thrown non-CliError as an INTERNAL_ERROR step failure', async () => {
+      const driver: StepDriver = {
+        type: () => Promise.reject(new TypeError('boom')),
+        paste: () => Promise.resolve(1),
+        sendKeys: () => Promise.resolve(1),
+        run: () => Promise.resolve({ accepted: true, seq: 1 }),
+        wait: () =>
+          Promise.resolve({ matched: true, timedOut: false, capturedAtSeq: 1 }),
+      };
+
+      const result = await executeBatch({
+        plan: plan([{ type: 'hello' }, { type: 'after' }]),
+        driver,
+        keepGoing: false,
+      });
+
+      expect(result.steps[0]).toMatchObject({
+        kind: 'type',
+        status: 'failed',
+        error: { code: 'INTERNAL_ERROR' },
+      });
+      // Fail-fast still applies; the unexpected error did not escape.
+      expect(result.steps[1]).toMatchObject({ status: 'not-run' });
+      expect(result.failedIndices).toEqual([0]);
+    });
+
+    it('records a thrown CliError from an input step with its own code', async () => {
+      const driver: StepDriver = {
+        type: () =>
+          Promise.reject(
+            makeCliError('HOST_UNREACHABLE', { message: 'host gone' }),
+          ),
+        paste: () => Promise.resolve(1),
+        sendKeys: () => Promise.resolve(1),
+        run: () => Promise.resolve({ accepted: true, seq: 1 }),
+        wait: () =>
+          Promise.resolve({ matched: true, timedOut: false, capturedAtSeq: 1 }),
+      };
+
+      const result = await executeBatch({
+        plan: plan([{ type: 'hello' }]),
+        driver,
+        keepGoing: false,
+      });
+
+      expect(result.steps[0]).toMatchObject({
+        kind: 'type',
+        status: 'failed',
+        error: { code: 'HOST_UNREACHABLE', message: 'host gone' },
+      });
+    });
+  });
+
+  describe('commandability guard around waits', () => {
+    it('fails a wait step when the pre-wait commandability check rejects', async () => {
+      const { driver, calls } = createFakeDriver({ inputSeqs: [4] });
+      const assertCommandable = vi.fn(() =>
+        Promise.reject(
+          makeCliError('SESSION_NOT_RUNNING', {
+            message: 'Session "s" is not running.',
+          }),
+        ),
+      );
+
+      const result = await executeBatch({
+        plan: plan([{ type: 'go' }, { wait: { text: 'ready' } }]),
+        driver,
+        keepGoing: false,
+        assertCommandable,
+      });
+
+      // The wait never reaches the driver because the guard rejected first.
+      expect(calls.map((call) => call.verb)).toEqual(['type']);
+      expect(result.steps[1]).toMatchObject({
+        kind: 'wait',
+        status: 'failed',
+        error: { code: 'SESSION_NOT_RUNNING' },
+      });
+      expect(assertCommandable).toHaveBeenCalledTimes(1);
+    });
+
+    it('fails a matched wait when the post-match commandability check rejects', async () => {
+      const { driver, calls } = createFakeDriver();
+      let callCount = 0;
+      const assertCommandable = vi.fn(() => {
+        callCount += 1;
+        // Pass before the wait, reject after the matched result.
+        return callCount === 1
+          ? Promise.resolve()
+          : Promise.reject(
+              makeCliError('SESSION_ALREADY_DESTROYED', {
+                message: 'Session "s" is already destroyed.',
+              }),
+            );
+      });
+
+      const result = await executeBatch({
+        plan: plan([{ wait: { text: 'ready' } }]),
+        driver,
+        keepGoing: false,
+        assertCommandable,
+      });
+
+      // The wait did run, but the post-match guard turned it into a failure.
+      expect(calls.map((call) => call.verb)).toEqual(['wait']);
+      expect(result.steps[0]).toMatchObject({
+        kind: 'wait',
+        status: 'failed',
+        // Observations from the (real) match are preserved, not discarded, so
+        // the failed record is distinguishable from a wait that never matched.
+        matched: true,
+        capturedAtSeq: 99,
+        error: { code: 'SESSION_ALREADY_DESTROYED' },
+      });
+      expect(assertCommandable).toHaveBeenCalledTimes(2);
+    });
+
+    it('lets a wait complete when the commandability guard resolves', async () => {
+      const { driver } = createFakeDriver({ inputSeqs: [2] });
+      const assertCommandable = vi.fn(() => Promise.resolve());
+
+      const result = await executeBatch({
+        plan: plan([{ type: 'go' }, { wait: { text: 'ready' } }]),
+        driver,
+        keepGoing: false,
+        assertCommandable,
+      });
+
+      expect(result.steps[1]).toMatchObject({
+        kind: 'wait',
+        status: 'completed',
+        matched: true,
+      });
+      // Once before the wait, once after the match.
+      expect(assertCommandable).toHaveBeenCalledTimes(2);
+    });
+  });
+});
diff --git a/test/unit/batch/plan.test.ts b/test/unit/batch/plan.test.ts
new file mode 100644
index 00000000..c787c55a
--- /dev/null
+++ b/test/unit/batch/plan.test.ts
@@ -0,0 +1,295 @@
+import { describe, expect, it } from 'vitest';
+
+import { parseBatchPlan } from '../../../src/batch/plan.js';
+import { CliError } from '../../../src/cli/errors.js';
+import { ERROR_CODES } from '../../../src/protocol/errors.js';
+
+function parse(steps: unknown): ReturnType<typeof parseBatchPlan> {
+  return parseBatchPlan(JSON.stringify(steps));
+}
+
+function captureParseError(steps: unknown): CliError {
+  try {
+    parse(steps);
+  } catch (error) {
+    if (error instanceof CliError) {
+      return error;
+    }
+    throw error;
+  }
+  throw new Error('expected parseBatchPlan to throw');
+}
+
+describe('parseBatchPlan', () => {
+  describe('valid plans', () => {
+    it('parses a type step', () => {
+      expect(parse([{ type: 'hello' }]).steps).toEqual([
+        { kind: 'type', text: 'hello' },
+      ]);
+    });
+
+    it('parses a paste step', () => {
+      expect(parse([{ paste: 'pasted text' }]).steps).toEqual([
+        { kind: 'paste', text: 'pasted text' },
+      ]);
+    });
+
+    it('parses a sendKeys step', () => {
+      expect(
+        parse([{ sendKeys: ['Escape', 'ctrl+c', 'Enter'] }]).steps,
+      ).toEqual([{ kind: 'sendKeys', keys: ['Escape', 'ctrl+c', 'Enter'] }]);
+    });
+
+    it('parses a run step with noWait defaulting to false (Waited Run)', () => {
+      expect(parse([{ run: 'echo hi' }]).steps).toEqual([
+        {
+          kind: 'run',
+          command: 'echo hi',
+          noWait: false,
+          timeoutMs: undefined,
+        },
+      ]);
+    });
+
+    it('parses a run step honoring noWait and timeout', () => {
+      expect(
+        parse([{ run: 'nvim --clean', noWait: true, timeout: 2000 }]).steps,
+      ).toEqual([
+        {
+          kind: 'run',
+          command: 'nvim --clean',
+          noWait: true,
+          timeoutMs: 2000,
+        },
+      ]);
+    });
+
+    it('parses a wait step and prepares the condition', () => {
+      const step = parse([{ wait: { text: 'written', timeout: 5000 } }])
+        .steps[0];
+      expect(step).toMatchObject({
+        kind: 'wait',
+        timeoutMs: 5000,
+        condition: { text: 'written' },
+      });
+    });
+
+    it('normalizes a wait timeout of 0 to undefined (infinite)', () => {
+      const step = parse([{ wait: { text: 'done', timeout: 0 } }]).steps[0];
+      expect(step).toMatchObject({ kind: 'wait', timeoutMs: undefined });
+    });
+
+    it('defaults an absent wait timeout to 600s (parity with the wait command)', () => {
+      const step = parse([{ wait: { screenStableMs: 1000 } }]).steps[0];
+      expect(step).toMatchObject({ kind: 'wait', timeoutMs: 600_000 });
+    });
+
+    it('compiles a wait regex condition', () => {
+      const step = parse([{ wait: { regex: '\\d+ items' } }]).steps[0];
+      expect(step).toMatchObject({ kind: 'wait' });
+      if (step?.kind === 'wait') {
+        expect(step.condition.compiledRegex).toBeInstanceOf(RegExp);
+      }
+    });
+
+    it('parses an ordered multi-step plan preserving order', () => {
+      const { steps } = parse([
+        { run: 'nvim --clean', noWait: true },
+        { wait: { screenStableMs: 1000 } },
+        { sendKeys: ['i'] },
+        { type: 'hello' },
+        { sendKeys: ['Escape', 'Enter'] },
+        { wait: { text: 'written' } },
+      ]);
+      expect(steps.map((step) => step.kind)).toEqual([
+        'run',
+        'wait',
+        'sendKeys',
+        'type',
+        'sendKeys',
+        'wait',
+      ]);
+    });
+  });
+
+  describe('malformed top-level input', () => {
+    it('rejects invalid JSON', () => {
+      expect(() => parseBatchPlan('{not json')).toThrow(
+        expect.objectContaining({
+          code: ERROR_CODES.INVALID_INPUT,
+          message: 'Batch steps must be valid JSON.',
+        }),
+      );
+    });
+
+    it('rejects a top-level object', () => {
+      expect(() => parseBatchPlan('{}')).toThrow(
+        expect.objectContaining({
+          code: ERROR_CODES.INVALID_INPUT,
+          message: 'Batch steps must be a JSON array.',
+        }),
+      );
+    });
+
+    it('rejects a top-level number', () => {
+      expect(() => parseBatchPlan('5')).toThrow(
+        expect.objectContaining({
+          code: ERROR_CODES.INVALID_INPUT,
+          message: 'Batch steps must be a JSON array.',
+        }),
+      );
+    });
+
+    it('rejects a top-level string', () => {
+      expect(() => parseBatchPlan('"x"')).toThrow(
+        expect.objectContaining({
+          code: ERROR_CODES.INVALID_INPUT,
+          message: 'Batch steps must be a JSON array.',
+        }),
+      );
+    });
+
+    it('rejects a top-level null', () => {
+      expect(() => parseBatchPlan('null')).toThrow(
+        expect.objectContaining({
+          code: ERROR_CODES.INVALID_INPUT,
+          message: 'Batch steps must be a JSON array.',
+        }),
+      );
+    });
+
+    it('rejects an empty array', () => {
+      expect(() => parseBatchPlan('[]')).toThrow(
+        expect.objectContaining({
+          code: ERROR_CODES.INVALID_INPUT,
+          message: 'Batch must contain at least one step.',
+        }),
+      );
+    });
+  });
+
+  describe('malformed steps', () => {
+    it('rejects a non-object step with its index', () => {
+      expect(() => parse([42])).toThrow(
+        expect.objectContaining({
+          code: ERROR_CODES.INVALID_INPUT,
+          details: { stepIndex: 0 },
+        }),
+      );
+    });
+
+    it('rejects a null step', () => {
+      expect(() => parse([null])).toThrow(
+        expect.objectContaining({ code: ERROR_CODES.INVALID_INPUT }),
+      );
+    });
+
+    it('rejects an array step', () => {
+      expect(() => parse([['type', 'x']])).toThrow(
+        expect.objectContaining({ code: ERROR_CODES.INVALID_INPUT }),
+      );
+    });
+
+    it('rejects a zero-verb step', () => {
+      expect(() => parse([{ frob: 1 }])).toThrow(
+        expect.objectContaining({
+          code: ERROR_CODES.INVALID_INPUT,
+          message:
+            'Batch step 0 must have exactly one of type|paste|sendKeys|run|wait; found none',
+          details: { stepIndex: 0 },
+        }),
+      );
+    });
+
+    it('rejects a two-verb step listing the conflicting verbs', () => {
+      expect(() => parse([{ type: 'a', wait: { text: 'x' } }])).toThrow(
+        expect.objectContaining({
+          code: ERROR_CODES.INVALID_INPUT,
+          message:
+            'Batch step 0 must have exactly one of type|paste|sendKeys|run|wait; found type, wait',
+          details: { stepIndex: 0 },
+        }),
+      );
+    });
+
+    it('reports the failing step index in a multi-step plan', () => {
+      expect(() =>
+        parse([{ type: 'ok' }, { type: 'ok' }, { frob: 1 }]),
+      ).toThrow(
+        expect.objectContaining({
+          code: ERROR_CODES.INVALID_INPUT,
+          details: { stepIndex: 2 },
+        }),
+      );
+    });
+
+    it('rejects an unknown payload key on a verb step (no inline wait)', () => {
+      expect(captureParseError([{ run: 'x', text: 'y' }])).toMatchObject({
+        code: ERROR_CODES.INVALID_INPUT,
+        details: { stepIndex: 0 },
+      });
+    });
+
+    it('rejects an empty type string', () => {
+      expect(() => parse([{ type: '' }])).toThrow(
+        expect.objectContaining({ code: ERROR_CODES.INVALID_INPUT }),
+      );
+    });
+  });
+
+  describe('sendKeys validation', () => {
+    it('rejects sendKeys that is not an array', () => {
+      expect(captureParseError([{ sendKeys: 'Enter' }])).toMatchObject({
+        code: ERROR_CODES.INVALID_INPUT,
+        details: { stepIndex: 0 },
+      });
+    });
+
+    it('rejects an empty sendKeys array', () => {
+      expect(captureParseError([{ sendKeys: [] }])).toMatchObject({
+        code: ERROR_CODES.INVALID_INPUT,
+        details: { stepIndex: 0 },
+      });
+    });
+
+    it('rejects an invalid key name at parse time with INVALID_KEYS', () => {
+      expect(() => parse([{ sendKeys: ['Enter', 'BOGUS'] }])).toThrow(
+        expect.objectContaining({ code: ERROR_CODES.INVALID_KEYS }),
+      );
+    });
+  });
+
+  describe('wait condition validation', () => {
+    it('rejects a wait with text and regex together', () => {
+      expect(() => parse([{ wait: { text: 'a', regex: 'b' } }])).toThrow(
+        expect.objectContaining({
+          code: ERROR_CODES.INVALID_INPUT,
+          message:
+            'waitForRender text and regex filters are mutually exclusive',
+        }),
+      );
+    });
+
+    it('rejects a wait regex prone to catastrophic backtracking', () => {
+      expect(() => parse([{ wait: { regex: '(a+)+' } }])).toThrow(
+        expect.objectContaining({ code: ERROR_CODES.INVALID_INPUT }),
+      );
+    });
+
+    it('rejects a wait with a negative cursor', () => {
+      expect(() => parse([{ wait: { cursorRow: -1 } }])).toThrow(
+        expect.objectContaining({ code: ERROR_CODES.INVALID_INPUT }),
+      );
+    });
+
+    it('rejects a wait with no condition', () => {
+      expect(() => parse([{ wait: {} }])).toThrow(
+        expect.objectContaining({
+          code: ERROR_CODES.INVALID_INPUT,
+          message:
+            'waitForRender requires at least one of text, regex, screenStableMs, cursorRow, or cursorCol',
+        }),
+      );
+    });
+  });
+});
diff --git a/test/unit/batch/result.test.ts b/test/unit/batch/result.test.ts
new file mode 100644
index 00000000..60e97b1a
--- /dev/null
+++ b/test/unit/batch/result.test.ts
@@ -0,0 +1,134 @@
+import { describe, expect, it } from 'vitest';
+
+import type { BatchStep } from '../../../src/batch/plan.js';
+import type { BatchStepRecord } from '../../../src/batch/result.js';
+
+import { parseBatchPlan } from '../../../src/batch/plan.js';
+import {
+  BatchResultSchema,
+  buildPartialBatchResult,
+  unreachedStepRecord,
+} from '../../../src/batch/result.js';
+
+const PLAN = parseBatchPlan(
+  JSON.stringify([
+    { type: 'hello' },
+    { sendKeys: ['Enter'] },
+    { wait: { text: 'done' } },
+    { run: 'echo hi', noWait: true },
+  ]),
+);
+
+function stepAt(index: number): BatchStep {
+  const step = PLAN.steps[index];
+  if (step === undefined) {
+    throw new Error(`no plan step at index ${String(index)}`);
+  }
+  return step;
+}
+
+function completed(index: number, kind: 'type' | 'sendKeys'): BatchStepRecord {
+  return { index, durationMs: 3, kind, status: 'completed', seq: index + 1 };
+}
+
+describe('unreachedStepRecord', () => {
+  it('shapes a not-run record per step kind without seq or duration', () => {
+    expect(unreachedStepRecord(stepAt(0), 0, 'not-run')).toEqual({
+      index: 0,
+      durationMs: 0,
+      kind: 'type',
+      status: 'not-run',
+    });
+    expect(unreachedStepRecord(stepAt(3), 3, 'not-run')).toEqual({
+      index: 3,
+      durationMs: 0,
+      kind: 'run',
+      status: 'not-run',
+      noWait: true,
+    });
+  });
+
+  it('shapes an interrupted record with the requested status', () => {
+    expect(unreachedStepRecord(stepAt(2), 2, 'interrupted')).toEqual({
+      index: 2,
+      durationMs: 0,
+      kind: 'wait',
+      status: 'interrupted',
+    });
+  });
+});
+
+describe('buildPartialBatchResult', () => {
+  it('marks the first unreached step interrupted and the rest not-run', () => {
+    const recorded = [completed(0, 'type'), completed(1, 'sendKeys')];
+    const partial = buildPartialBatchResult(PLAN, recorded);
+
+    expect(partial.steps.map((step) => step.status)).toEqual([
+      'completed',
+      'completed',
+      'interrupted',
+      'not-run',
+    ]);
+    expect(partial.steps.map((step) => step.kind)).toEqual([
+      'type',
+      'sendKeys',
+      'wait',
+      'run',
+    ]);
+  });
+
+  it('recomputes completedCount and failedIndices from the synthesized steps', () => {
+    const recorded: BatchStepRecord[] = [
+      completed(0, 'type'),
+      {
+        index: 1,
+        durationMs: 5,
+        kind: 'sendKeys',
+        status: 'failed',
+        error: { code: 'HOST_UNREACHABLE', message: 'gone' },
+      },
+    ];
+    const partial = buildPartialBatchResult(PLAN, recorded);
+
+    expect(partial.completedCount).toBe(1);
+    expect(partial.failedIndices).toEqual([1]);
+    // The interrupted and not-run steps contribute to neither count.
+    expect(partial.steps).toHaveLength(4);
+  });
+
+  it('produces a schema-valid envelope (interrupted is an accepted status)', () => {
+    const partial = buildPartialBatchResult(PLAN, [completed(0, 'type')]);
+    expect(() => BatchResultSchema.parse(partial)).not.toThrow();
+  });
+
+  it('returns the recorded steps unchanged when the whole plan finalized', () => {
+    const recorded: BatchStepRecord[] = [
+      completed(0, 'type'),
+      completed(1, 'sendKeys'),
+      {
+        index: 2,
+        durationMs: 10,
+        kind: 'wait',
+        status: 'completed',
+        matched: true,
+      },
+      {
+        index: 3,
+        durationMs: 1,
+        kind: 'run',
+        status: 'completed',
+        noWait: true,
+        seq: 5,
+        runOutcome: 'started',
+      },
+    ];
+    const partial = buildPartialBatchResult(PLAN, recorded);
+
+    expect(partial.steps).toHaveLength(4);
+    expect(partial.steps.every((step) => step.status === 'completed')).toBe(
+      true,
+    );
+    expect(partial.completedCount).toBe(4);
+    expect(partial.failedIndices).toEqual([]);
+  });
+});
diff --git a/test/unit/cli/exitCodes.test.ts b/test/unit/cli/exitCodes.test.ts
index 0e49bf08..55b6b580 100644
--- a/test/unit/cli/exitCodes.test.ts
+++ b/test/unit/cli/exitCodes.test.ts
@@ -35,6 +35,16 @@ describe('CLI exit codes', () => {
     expect(exitCodeForError(ERROR_CODES.REPLAY_ERROR)).toBe(10);
   });
 
+  it('maps a render-wait timeout to a distinct exit code 11', () => {
+    expect(exitCodeForError(ERROR_CODES.WAIT_TIMEOUT)).toBe(11);
+    expect(exitCodeForError(ERROR_CODES.WAIT_TIMEOUT)).not.toBe(
+      exitCodeForError(ERROR_CODES.HOST_TIMEOUT),
+    );
+    expect(exitCodeForError(ERROR_CODES.WAIT_TIMEOUT)).not.toBe(
+      exitCodeForError(ERROR_CODES.HOST_UNREACHABLE),
+    );
+  });
+
   it('maps internal errors to the generic exit code 1', () => {
     expect(exitCodeForError(ERROR_CODES.INTERNAL_ERROR)).toBe(1);
   });
diff --git a/test/unit/commands/batch.test.ts b/test/unit/commands/batch.test.ts
new file mode 100644
index 00000000..83bf7907
--- /dev/null
+++ b/test/unit/commands/batch.test.ts
@@ -0,0 +1,35 @@
+import { describe, expect, it } from 'vitest';
+
+import type { BatchResult } from '../../../src/batch/result.js';
+
+import { buildBatchLines } from '../../../src/cli/commands/batch.js';
+
+describe('buildBatchLines', () => {
+  it('labels an interrupted step as interrupted, not as a success', () => {
+    // The SIGINT/SIGTERM partial flush marks the in-flight step `interrupted`
+    // and later steps `not-run`; the human-readable line must not render those
+    // as completed/matched (the bug: an interrupted run printed "completed").
+    const result: BatchResult = {
+      steps: [
+        { index: 0, durationMs: 5, kind: 'type', status: 'completed', seq: 1 },
+        {
+          index: 1,
+          durationMs: 0,
+          kind: 'run',
+          status: 'interrupted',
+          noWait: false,
+        },
+        { index: 2, durationMs: 0, kind: 'wait', status: 'not-run' },
+      ],
+      completedCount: 1,
+      failedIndices: [],
+    };
+
+    const lines = buildBatchLines(result);
+
+    expect(lines[0]).toBe('[0] type completed (5ms)');
+    expect(lines[1]).toBe('[1] run interrupted (0ms)');
+    expect(lines[2]).toBe('[2] wait not-run (0ms)');
+    expect(lines.at(-1)).toBe('1/3 steps completed');
+  });
+});
diff --git a/test/unit/commands/golden-envelopes.test.ts b/test/unit/commands/golden-envelopes.test.ts
index 059b0c96..9f9ea408 100644
--- a/test/unit/commands/golden-envelopes.test.ts
+++ b/test/unit/commands/golden-envelopes.test.ts
@@ -1,6 +1,7 @@
 import { afterEach, beforeEach, describe, expect, it, vi } from 'vitest';
 import { z } from 'zod';
 
+import { BatchResultSchema } from '../../../src/batch/result.js';
 import { buildVersionResult } from '../../../src/cli/commands/version.js';
 import {
   SkillGetResultSchema,
@@ -637,6 +638,213 @@ const goldenResultContracts: readonly GoldenResultContractCase[] = [
       exitCode: 0,
     },
   },
+  {
+    name: 'batch',
+    command: 'batch',
+    schema: BatchResultSchema,
+    validResult: {
+      steps: [
+        {
+          index: 0,
+          durationMs: 12,
+          kind: 'run',
+          status: 'completed',
+          seq: 4,
+          noWait: true,
+          runOutcome: 'started',
+        },
+        {
+          index: 1,
+          durationMs: 1003,
+          kind: 'wait',
+          status: 'completed',
+          waitBaseline: 4,
+          matched: true,
+          timedOut: false,
+          matchedText: 'Ready',
+          capturedAtSeq: 9,
+        },
+        {
+          index: 2,
+          durationMs: 3,
+          kind: 'type',
+          status: 'completed',
+          seq: 11,
+        },
+      ],
+      completedCount: 3,
+      failedIndices: [],
+    },
+    invalidResult: {
+      steps: [
+        {
+          index: 0,
+          durationMs: 5,
+          kind: 'frob',
+          status: 'completed',
+        },
+      ],
+      completedCount: 1,
+      failedIndices: [],
+    },
+    extraFieldResult: {
+      steps: [
+        {
+          index: 0,
+          durationMs: 5,
+          kind: 'type',
+          status: 'completed',
+          seq: 1,
+        },
+      ],
+      completedCount: 1,
+      failedIndices: [],
+      failedCount: 0,
+    },
+  },
+  {
+    name: 'batch (fail-fast)',
+    command: 'batch',
+    schema: BatchResultSchema,
+    validResult: {
+      steps: [
+        {
+          index: 0,
+          durationMs: 3,
+          kind: 'type',
+          status: 'completed',
+          seq: 2,
+        },
+        {
+          index: 1,
+          durationMs: 10_000,
+          kind: 'wait',
+          status: 'failed',
+          waitBaseline: 2,
+          matched: false,
+          timedOut: true,
+          capturedAtSeq: 5,
+          error: {
+            code: 'WAIT_TIMEOUT',
+            message:
+              'Render wait at step 1 timed out before its condition was met.',
+          },
+        },
+        {
+          index: 2,
+          durationMs: 0,
+          kind: 'sendKeys',
+          status: 'not-run',
+        },
+      ],
+      completedCount: 1,
+      failedIndices: [1],
+    },
+    invalidResult: {
+      steps: [
+        {
+          index: 1,
+          durationMs: 10,
+          kind: 'wait',
+          status: 'failed',
+          error: {
+            code: 'WAIT_TIMEOUT',
+          },
+        },
+      ],
+      completedCount: 0,
+      failedIndices: [1],
+    },
+    extraFieldResult: {
+      steps: [
+        {
+          index: 0,
+          durationMs: 3,
+          kind: 'wait',
+          status: 'failed',
+          matched: false,
+          timedOut: true,
+          capturedAtSeq: 5,
+          error: {
+            code: 'WAIT_TIMEOUT',
+            message: 'timed out',
+            retryable: false,
+          },
+        },
+      ],
+      completedCount: 0,
+      failedIndices: [0],
+    },
+  },
+  {
+    name: 'batch (keep-going)',
+    command: 'batch',
+    schema: BatchResultSchema,
+    validResult: {
+      steps: [
+        {
+          index: 0,
+          durationMs: 10_000,
+          kind: 'wait',
+          status: 'failed',
+          matched: false,
+          timedOut: true,
+          capturedAtSeq: 3,
+          error: {
+            code: 'WAIT_TIMEOUT',
+            message:
+              'Render wait at step 0 timed out before its condition was met.',
+          },
+        },
+        {
+          index: 1,
+          durationMs: 4,
+          kind: 'type',
+          status: 'completed',
+          seq: 6,
+        },
+        {
+          index: 2,
+          durationMs: 30_000,
+          kind: 'run',
+          status: 'failed',
+          seq: 8,
+          noWait: false,
+          completed: false,
+          timedOut: true,
+          runOutcome: 'timedOut',
+          error: {
+            code: 'WAIT_TIMEOUT',
+            message: 'Waited Run at step 2 timed out before completing.',
+          },
+        },
+      ],
+      completedCount: 1,
+      failedIndices: [0, 2],
+    },
+    invalidResult: {
+      steps: [],
+      completedCount: 0,
+      failedIndices: ['0'],
+    },
+    extraFieldResult: {
+      steps: [
+        {
+          index: 0,
+          durationMs: 1,
+          kind: 'type',
+          status: 'failed',
+          error: {
+            code: 'INTERNAL_ERROR',
+            message: 'boom',
+          },
+        },
+      ],
+      completedCount: 0,
+      failedIndices: [0],
+      keepGoing: true,
+    },
+  },
   {
     name: 'send-keys',
     command: 'send-keys',
diff --git a/test/unit/commands/paste.test.ts b/test/unit/commands/paste.test.ts
index 78a3a478..23a3f343 100644
--- a/test/unit/commands/paste.test.ts
+++ b/test/unit/commands/paste.test.ts
@@ -50,6 +50,7 @@ describe('paste command', () => {
     vi.clearAllMocks();
     mocks.resolveCommandTarget.mockResolvedValue(COMMAND_TARGET);
     mocks.resolveCommandInputText.mockResolvedValue('hello from paste');
+    mocks.sendRpc.mockResolvedValue({ seq: 9 });
   });
 
   it('pastes resolved input into a command target', async () => {
@@ -77,8 +78,25 @@ describe('paste command', () => {
     expect(mocks.emitSuccess).toHaveBeenCalledWith({
       command: 'paste',
       json: false,
-      result: {},
-      lines: ['Pasted text into session.'],
+      result: { seq: 9 },
+      lines: ['Pasted text into session at seq 9.'],
     });
   });
+
+  it('rejects malformed paste RPC responses', async () => {
+    mocks.sendRpc.mockResolvedValueOnce({ seq: -1 });
+
+    await expect(
+      runPasteCommand({
+        context: TEST_CONTEXT,
+        json: false,
+        sessionId: 'session-01',
+        text: 'hello from paste',
+      }),
+    ).rejects.toMatchObject({
+      code: 'PROTOCOL_ERROR',
+      message: 'Unexpected response from host',
+    });
+    expect(mocks.emitSuccess).not.toHaveBeenCalled();
+  });
 });
diff --git a/test/unit/commands/type.test.ts b/test/unit/commands/type.test.ts
index 52bdb07a..d0b181cf 100644
--- a/test/unit/commands/type.test.ts
+++ b/test/unit/commands/type.test.ts
@@ -50,6 +50,7 @@ describe('type command', () => {
     vi.clearAllMocks();
     mocks.resolveCommandTarget.mockResolvedValue(COMMAND_TARGET);
     mocks.resolveCommandInputText.mockResolvedValue('hello');
+    mocks.sendRpc.mockResolvedValue({ seq: 7 });
   });
 
   it('appends exactly one newline to positional text when requested', async () => {
@@ -78,8 +79,8 @@ describe('type command', () => {
     expect(mocks.emitSuccess).toHaveBeenCalledWith({
       command: 'type',
       json: false,
-      result: {},
-      lines: ['Typed text into session.'],
+      result: { seq: 7 },
+      lines: ['Typed text into session at seq 7.'],
     });
   });
 
@@ -169,8 +170,25 @@ describe('type command', () => {
     expect(mocks.emitSuccess).toHaveBeenCalledWith({
       command: 'type',
       json: true,
-      result: {},
-      lines: ['Typed text into session.'],
+      result: { seq: 7 },
+      lines: ['Typed text into session at seq 7.'],
     });
   });
+
+  it('rejects malformed type RPC responses', async () => {
+    mocks.sendRpc.mockResolvedValueOnce({ seq: -1 });
+
+    await expect(
+      runTypeCommand({
+        context: TEST_CONTEXT,
+        json: false,
+        sessionId: 'session-01',
+        text: 'hello',
+      }),
+    ).rejects.toMatchObject({
+      code: 'PROTOCOL_ERROR',
+      message: 'Unexpected response from host',
+    });
+    expect(mocks.emitSuccess).not.toHaveBeenCalled();
+  });
 });
diff --git a/test/unit/commands/wait.test.ts b/test/unit/commands/wait.test.ts
index 623709d0..22ffc885 100644
--- a/test/unit/commands/wait.test.ts
+++ b/test/unit/commands/wait.test.ts
@@ -90,6 +90,7 @@ function createOptions(
     screenStableMs: undefined,
     cursorRow: undefined,
     cursorCol: undefined,
+    afterSeq: undefined,
     ...overrides,
   };
 }
@@ -260,6 +261,7 @@ describe('wait command', () => {
         screenStableMs: undefined,
         cursorRow: undefined,
         cursorCol: undefined,
+        afterSeq: undefined,
         timeoutMs: undefined,
         rendererName: 'ghostty-web',
       },
@@ -396,6 +398,7 @@ describe('wait command', () => {
         screenStableMs: undefined,
         cursorRow: undefined,
         cursorCol: undefined,
+        afterSeq: undefined,
         timeoutMs: 600_000,
         rendererName: 'ghostty-web',
       },
@@ -429,6 +432,7 @@ describe('wait command', () => {
         screenStableMs: undefined,
         cursorRow: undefined,
         cursorCol: undefined,
+        afterSeq: undefined,
         timeoutMs: 600_000,
         rendererName: 'ghostty-web',
       },
@@ -465,6 +469,7 @@ describe('wait command', () => {
         screenStableMs: undefined,
         cursorRow: 3,
         cursorCol: 4,
+        afterSeq: undefined,
         timeoutMs: 600_000,
         rendererName: 'ghostty-web',
       },
@@ -478,6 +483,86 @@ describe('wait command', () => {
     );
   });
 
+  it('threads --after-seq into the render wait RPC params', async () => {
+    const result = {
+      matched: true,
+      timedOut: false,
+      matchedText: 'hello',
+      capturedAtSeq: 12,
+    };
+    mocks.sendRpc.mockResolvedValue(result);
+
+    await runWaitCommand(createOptions({ text: 'hello', afterSeq: 5 }));
+
+    expect(mocks.sendRpc).toHaveBeenCalledWith(
+      '/tmp/agent-tty/sessions/session-01/rpc.sock',
+      'waitForRender',
+      {
+        text: 'hello',
+        regex: undefined,
+        screenStableMs: undefined,
+        cursorRow: undefined,
+        cursorCol: undefined,
+        afterSeq: 5,
+        timeoutMs: 600_000,
+        rendererName: 'ghostty-web',
+      },
+      605_000,
+    );
+    expect(mocks.emitSuccess).toHaveBeenCalledWith(
+      expect.objectContaining({
+        command: 'wait',
+        result,
+      }),
+    );
+  });
+
+  it('enters render mode for an --after-seq-only invocation and requires a match condition', async () => {
+    await expect(
+      runWaitCommand(createOptions({ afterSeq: 5 })),
+    ).rejects.toMatchObject({
+      code: ERROR_CODES.INVALID_INPUT,
+      message:
+        'waitForRender requires at least one of text, regex, screenStableMs, cursorRow, or cursorCol',
+    });
+    expect(mocks.sendRpc).not.toHaveBeenCalled();
+  });
+
+  it('rejects negative --after-seq values', async () => {
+    await expect(
+      runWaitCommand(createOptions({ text: 'hello', afterSeq: -1 })),
+    ).rejects.toMatchObject({
+      code: ERROR_CODES.INVALID_INPUT,
+      details: { afterSeq: -1 },
+    });
+    expect(mocks.sendRpc).not.toHaveBeenCalled();
+  });
+
+  it('throws a replay error offline when the snapshot is at or below the wait baseline', async () => {
+    mocks.sendRpc.mockRejectedValue(
+      makeCliError(ERROR_CODES.HOST_UNREACHABLE, {
+        message: 'Session host is unreachable.',
+      }),
+    );
+    mockOfflineReplaySnapshot({
+      capturedAtSeq: 5,
+      visibleLines: [{ row: 0, text: 'offline hello output' }],
+    });
+
+    const promise = runWaitCommand(
+      createOptions({ text: 'hello', afterSeq: 5 }),
+    );
+
+    await expect(promise).rejects.toMatchObject({
+      code: ERROR_CODES.REPLAY_ERROR,
+      details: {
+        afterSeq: 5,
+        capturedAtSeq: 5,
+      },
+    });
+    expect(mocks.emitSuccess).not.toHaveBeenCalled();
+  });
+
   it('falls back to offline replay when render wait host becomes unreachable and the snapshot matches', async () => {
     mocks.sendRpc.mockRejectedValue(
       makeCliError(ERROR_CODES.HOST_UNREACHABLE, {
@@ -500,6 +585,7 @@ describe('wait command', () => {
         screenStableMs: undefined,
         cursorRow: undefined,
         cursorCol: undefined,
+        afterSeq: undefined,
         timeoutMs: 600_000,
         rendererName: 'ghostty-web',
       },
diff --git a/test/unit/pty/keyEncoder.test.ts b/test/unit/pty/keyEncoder.test.ts
index d1d8d611..a11e4304 100644
--- a/test/unit/pty/keyEncoder.test.ts
+++ b/test/unit/pty/keyEncoder.test.ts
@@ -1,6 +1,7 @@
 import { describe, expect, it } from 'vitest';
 
-import { encodeKey } from '../../../src/pty/keyEncoder.js';
+import { assertValidKeyName, encodeKey } from '../../../src/pty/keyEncoder.js';
+import { ERROR_CODES } from '../../../src/protocol/errors.js';
 
 describe('encodeKey', () => {
   it('encodes Enter', () => {
@@ -143,3 +144,33 @@ describe('encodeKey', () => {
     expect(() => encodeKey('ctrl+ctrl+a')).toThrow();
   });
 });
+
+describe('assertValidKeyName', () => {
+  const validKeys = ['Enter', 'Escape', 'Space', 'ctrl+c', 'F12', 'Up', 'a'];
+  const invalidKeys = ['BOGUS', '', 'ctrl+ctrl+a', 'ctrl+Enter'];
+
+  it('returns for a valid key name', () => {
+    expect(() => assertValidKeyName('ctrl+c')).not.toThrow();
+  });
+
+  it('throws INVALID_KEYS for an invalid key name', () => {
+    expect(() => assertValidKeyName('BOGUS')).toThrow(
+      expect.objectContaining({ code: ERROR_CODES.INVALID_KEYS }),
+    );
+  });
+
+  it.each(validKeys)('accepts valid key %s (parity with encodeKey)', (key) => {
+    expect(() => encodeKey(key)).not.toThrow();
+    expect(() => assertValidKeyName(key)).not.toThrow();
+  });
+
+  it.each(invalidKeys)(
+    'rejects invalid key %s (parity with encodeKey)',
+    (key) => {
+      expect(() => encodeKey(key)).toThrow();
+      expect(() => assertValidKeyName(key)).toThrow(
+        expect.objectContaining({ code: ERROR_CODES.INVALID_KEYS }),
+      );
+    },
+  );
+});
diff --git a/test/unit/renderWait/matcher.test.ts b/test/unit/renderWait/matcher.test.ts
index f3dedca2..e244573f 100644
--- a/test/unit/renderWait/matcher.test.ts
+++ b/test/unit/renderWait/matcher.test.ts
@@ -28,6 +28,7 @@ describe('render wait matcher', () => {
       textMatched: true,
       cursorMatched: true,
       stabilityMatched: true,
+      baselineMatched: true,
       contentAndCursorMatched: true,
       matchedText: 'Ready',
       cursorRow: 1,
@@ -37,6 +38,69 @@ describe('render wait matcher', () => {
     });
   });
 
+  it('rejects snapshots at or below the wait baseline even when text matches', () => {
+    const condition = prepareRenderWaitCondition({
+      text: 'Ready',
+      afterSeq: 12,
+    });
+    const snapshot = createTestSemanticSnapshot({
+      visibleLines: [{ row: 0, text: 'Ready' }],
+      capturedAtSeq: 12,
+    });
+
+    expect(matchRenderWaitSnapshot(condition, snapshot)).toMatchObject({
+      matched: false,
+      textMatched: true,
+      cursorMatched: true,
+      stabilityMatched: true,
+      baselineMatched: false,
+      contentAndCursorMatched: true,
+    });
+  });
+
+  it('matches snapshots strictly beyond the wait baseline', () => {
+    const condition = prepareRenderWaitCondition({
+      text: 'Ready',
+      afterSeq: 12,
+    });
+    const snapshot = createTestSemanticSnapshot({
+      visibleLines: [{ row: 0, text: 'Ready' }],
+      capturedAtSeq: 13,
+    });
+
+    expect(matchRenderWaitSnapshot(condition, snapshot)).toMatchObject({
+      matched: true,
+      textMatched: true,
+      baselineMatched: true,
+      contentAndCursorMatched: true,
+      matchedText: 'Ready',
+    });
+  });
+
+  it('treats an undefined wait baseline as always satisfied', () => {
+    const condition = prepareRenderWaitCondition({ text: 'Ready' });
+    const snapshot = createTestSemanticSnapshot({
+      visibleLines: [{ row: 0, text: 'Ready' }],
+      capturedAtSeq: 0,
+    });
+
+    expect(matchRenderWaitSnapshot(condition, snapshot)).toMatchObject({
+      matched: true,
+      baselineMatched: true,
+    });
+  });
+
+  it('rejects non-integer afterSeq baselines', () => {
+    expect(() =>
+      prepareRenderWaitCondition({ text: 'Ready', afterSeq: 1.5 }),
+    ).toThrow(
+      expect.objectContaining({
+        code: ERROR_CODES.INVALID_INPUT,
+        message: 'afterSeq must be a non-negative integer',
+      }),
+    );
+  });
+
   it('matches regexes and reports the matched substring', () => {
     const condition = prepareRenderWaitCondition({ regex: '\\d+ items' });
     const snapshot = createTestSemanticSnapshot({