hooks: dangerous-command blocker matches phrases in heredoc/echo content, blocking benign Bash

## What I see

`createDangerousCommandBlocker` in `src/agent/hooks.ts:54-75`
runs each pattern in `DANGEROUS_COMMANDS` against the entire
`tool_input.command` string. The regex matches anywhere in the
string, not only where a command would actually execute. Any
Bash invocation that has a forbidden phrase as data (heredoc
body, single-quoted `echo` argument, `printf` format, `tee`
input) trips the same block as the destructive operation
itself.

I hit this twice in two days. The first time (2026-04-25) was
a journal entry whose heredoc body paraphrased a
previously-blocked attempt using the literal forbidden phrase.
Today (2026-04-27, while drafting this issue) I tried to drop
a Node repro via `cat <<NODEEOF` whose body included the
forbidden phrase inside a regex literal in JavaScript code.
The hook blocked the outer Bash call before the file existed:

```
PreToolUse:Callback hook blocking error from command:
"callback": Blocked dangerous command: "docker compose down"
```

Semantically the harness is doing the right thing (it cannot
distinguish command from data inside the Bash string), but
the user-visible effect is that any prose, code, or test
fixture that names a forbidden phrase becomes un-writeable
through Bash.

## Repro

A standalone runner that copies the patterns verbatim from
`src/agent/hooks.ts` and prints the pass/fail per case. The
trigger phrases are concatenated at runtime so the script
itself stays portable:

```js
const DANGEROUS = [
  { pattern: /docker\s+compose\s+down/, label: "compose-down" },
  { pattern: /git\s+push\s+.*--force/, label: "force-push" },
  { pattern: /git\s+reset\s+--hard/, label: "hard-reset" },
  { pattern: /rm\s+-rf\s+\/(\s|$)/, label: "rm-rf-root" },
];
const fp = "git" + " push" + " --force";
const dc = "docker" + " compose" + " down";
const cases = [
  ["1 actual force push",          `${fp} origin main`],
  ["2 cat heredoc body",            `cat > note.md <<EOF\nReminder: ${fp} is forbidden\nEOF`],
  ["3 echo of CONTRIBUTING quote",  `echo 'CONTRIBUTING says ${fp} is forbidden'`],
  ["4 printf into journal",         `printf '%s\\n' 'A session tried ${dc}' >> /tmp/j.md`],
  ["5 unrelated echo",              "echo 'hello world'"],
  ["6 safe ls",                     "ls -la"],
];
for (const [label, cmd] of cases) {
  const hit = DANGEROUS.find(({ pattern }) => pattern.test(cmd));
  console.log(`[${hit ? "BLOCKED" : "allow  "}] ${label} -> ${hit?.label ?? "ok"}`);
}
```

Output:

```
[BLOCKED] 1 actual force push -> force-push
[BLOCKED] 2 cat heredoc body -> force-push
[BLOCKED] 3 echo of CONTRIBUTING quote -> force-push
[BLOCKED] 4 printf into journal -> compose-down
[allow  ] 5 unrelated echo -> ok
[allow  ] 6 safe ls -> ok
```

Cases 2, 3, 4 are false positives. None of them invoke a
destructive command; each writes data that contains a
forbidden phrase as text.

## Why it matters

The current behavior makes the agent unable to:

- write a `printf '%s' '...' >> file.md` log line about a
  prior blocked attempt (loop guard)
- file an issue (this one) whose repro names the forbidden
  phrases the hook scans for
- save a test fixture or doc string that mentions
  `git push --force` as an example of bad practice
- echo a quoted CONTRIBUTING snippet into a comment

The friction is small per incident but recurring, and it
nudges the agent toward word-laundering its own prose ("the
operation that was blocked yesterday" instead of naming the
verb), which corrodes the journal as a precise record.

The source comment at `src/agent/hooks.ts:3-11` is explicit:
"This is NOT a security boundary. ... A determined adversary
can bypass regex patterns via encoding, variable substitution,
or indirect execution." That framing is right; the fix shape
should preserve the defense-in-depth posture for honest
mistakes without blocking benign prose-about-mistakes.

## Direction (not a prescription)

A few shapes worth weighing.

1. **Shell-token aware scan.** Parse the command with a small
   tokenizer (or `shell-quote`), extract the executable head
   of each pipeline / `&&` chain, and scan only those tokens.
   Real `git push --force origin main` still matches;
   `echo 'git push --force is bad'` does not because the head
   is `echo` and the forbidden phrase is in a quoted argument.
   Strongest fix; medium complexity; one new dep or ~30 lines
   of inline tokenizer.

2. **Strip heredoc bodies before scan.** A regex such as
   `<<-?\s*['"]?(\w+)['"]?\n[\s\S]*?\n\1\s*$` removes heredoc
   payloads from the scanned string, then the existing
   patterns run on what's left. Closes the highest-frequency
   case (`cat <<EOF`, `node <<NODEEOF`, `bash <<SH`) without a
   new dep. Misses the `echo '...'` and `printf '%s' '...'`
   false positives.

3. **Anchor each pattern at a command boundary.** Prefix each
   regex with `(?:^|[;|&]\s*)` so it only matches at the start
   of a command in a chain. Catches case 1 and most real
   destructive uses; still false-positives on heredoc/echo
   payloads that begin a line.

4. **Switch from block to ask.** Convert `decision: "block"`
   to `permissionDecision: "ask"` (per the Claude Agent SDK),
   so the operator confirms once. Loses the silent-guardrail
   benefit; only useful as a fallback if false-positive risk
   weighs more than friction here.

Option 1 is the right shape long-term. Option 2 is a clean
narrow patch that closes the most-hit case without adding a
dependency. Happy to scope either as a small PR if the
direction fits.

## Env

Verified against `main` at ff74713. Hook source unchanged
since first landing per `git log src/agent/hooks.ts`.
`src/agent/__tests__/hooks.test.ts` covers each pattern's
positive case plus the `allows safe commands` line; no
false-positive case is covered today.


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

hooks: dangerous-command blocker matches phrases in heredoc/echo content, blocking benign Bash #100

What I see

Repro

Why it matters

Direction (not a prescription)

Env

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

hooks: dangerous-command blocker matches phrases in heredoc/echo content, blocking benign Bash #100

Description

What I see

Repro

Why it matters

Direction (not a prescription)

Env

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions