feat: v6.3.0 pipeline hardening — auth/authz bug-class, pr-monitoring pattern, completion trust-boundary, hook stdout-JSON discipline, reminder dedup (#41/#58/#59/#60/#61) by intel352 · Pull Request #62 · GoCodeAlone/autonomous-dev-kit

intel352 · 2026-06-01T05:52:34Z

Summary

v6.3.0 pipeline-hardening release closing 5 recurring gate-miss / context-waste issues (one coherent PR, full pipeline: 3 design + 2 plan adversarial cycles, alignment PASS, scope locked, two-stage review APPROVED).

Closes #41, #58, #59, #60, #61.

Design / Plan / ADR

Design: docs/plans/2026-06-01-pipeline-hardening-4issues-design.md (adversarial PASS @ cycle 3)
Plan: docs/plans/2026-06-01-pipeline-hardening-4issues.md (plan-phase PASS @ cycle 2; alignment PASS; scope locked)
ADR 0003 — Implement-N completion is a lead-verified trust boundary, not a hook-blocked invariant (the hard-block is infeasible).

Changes

adversarial-design-review (plan phase): add auth/authz chain-composition bug class #59 adversarial-design-review — new plan-phase auth/authz chain-composition bug-class: walk the design's auth chain vs the plan's wiring; flag any gate enforced by a client-asserted value instead of server-side against an authenticated principal.
pr-monitoring: background monitor agent exits after first cycle instead of looping #60 pr-monitoring — sanctioned, host-scoped bash poll-loop CI-wait (claude-code: bounded run_in_background sleep-loop that blocks + re-invokes the lead once; codex/cursor: self-poll fallback). The prior background-Agent monitor early-exited ~6×/run.
subagent-driven-development: enforce 'only code-reviewer flips Implement-N to completed' #58 subagent-driven-development + team-conventions — completion trust-boundary: a flipped Implement: N is a claim, not evidence; the lead runs verification-before-completion before trusting it. Hard-block infeasible (pre-tool payload lacks task subject + caller) → ADR 0003.
PreCompact hook can emit invalid JSON #41 run-hook.cmd — stdout JSON discipline: capture each hook's stdout, emit only valid-JSON-or-empty, recover a block decision even when a locale/diagnostic warning precedes it (route noise to stderr; jq-absent passthrough). New tests/hook-stdout-discipline.sh.
pretool-pr-review-reminder: emit gh-version/Copilot guidance once per session, not per PR #61 pretool-pr-review-reminder — emit the gh/Copilot reminder once per session (quote-strip match so a quoted --body mentioning gh pr create doesn't trip it; deduped via .claude/autodev-state marker), reset by pre-compact-snapshot so it re-emits once post-compaction.
CI — new hooks-check.yml runs the hook contract + stdout-discipline tests on any hooks//test change (so these fixes are regression-gated; a test that never runs is theater).
Version 6.2.2 → 6.3.0.

Verification

tests/hook-stdout-discipline.sh — 0 failures (4 cases: warning+block-JSON recovered, noise→stderr, clean passthrough, jq-absent passthrough).
tests/hook-contracts.sh — all pass (real hooks through the new wrapper; pretool-pr-review-reminder: emit gh-version/Copilot guidance once per session, not per PR #61 dedup/reset/false-positive/no-transcript).
tests/skill-content-grep.sh / skill-cross-refs.sh / version-check.sh — PASS.
Two-stage code review APPROVED (fixed grep -vxF full-line diagnostic routing + atomic marker rewrite).

🤖 Generated with Claude Code

… I1/I2/I3 + m1/m2/m3) + add #61 reminder dedup

…ct clear placement before early-exit)

…n_key guard)

/#61)

… I2 jq-absent test + I3 trap/no-transcript + m1)

…mirrors pre-tool-scope-guard)

…n-phase bug-class (#59)

…t-scoped (#60)

…ement-N (#58, ADR 0003)

…sions behind warnings (#41)

#41/#61)

…write (PR review)

Copilot

Pull request overview

v6.3.0 “pipeline hardening” release that addresses recurring autonomy pipeline issues by strengthening hook robustness (stdout JSON discipline), reducing repeated reminder noise (session dedup + reset), and documenting new review/process guardrails (auth/authz chain composition, CI wait pattern, completion trust boundary). It also adds CI coverage for hook regression tests and bumps plugin version + release notes.

Changes:

Harden hooks/run-hook.cmd to emit valid-JSON-or-empty on stdout (recover block JSON behind warnings) + add a dedicated stdout-discipline regression test.
Add once-per-session dedup for pretool-pr-review-reminder, reset on pre-compact-snapshot, and expand hook contract tests accordingly.
Document new plan-phase auth/authz chain-composition bug-class, pr-monitoring CI-wait polling pattern, and Implement-N completion trust boundary; add CI workflow to run hook tests; bump to 6.3.0 with release notes.

Reviewed changes

Copilot reviewed 18 out of 18 changed files in this pull request and generated 1 comment.

Show a summary per file

File	Description
tests/hook-stdout-discipline.sh	New regression test verifying wrapper stdout JSON discipline and jq-absent passthrough.
tests/hook-contracts.sh	Adds contract coverage for pr-review reminder dedup + post-compaction reset behavior.
skills/subagent-driven-development/SKILL.md	Documents “completion is not trusted until lead-verified” trust boundary.
skills/pr-monitoring/SKILL.md	Documents host-scoped, sanctioned CI-wait polling pattern (bash loop vs background agent).
skills/adversarial-design-review/SKILL.md	Adds plan-phase auth/authz chain-composition bug-class row.
RELEASE-NOTES.md	Adds v6.3.0 release notes summarizing the hardening changes.
hooks/run-hook.cmd	Captures hook stdout and enforces valid-JSON-or-empty output when jq is available.
hooks/pretool-pr-review-reminder	Adds quote-stripped matching and once-per-session reminder dedup marker.
hooks/pre-compact-snapshot	Clears pr-reminder marker for the current session prior to early-exit, enabling re-emit post-compaction.
docs/plans/2026-06-01-pipeline-hardening-4issues.md.scope-lock	Adds scope-lock hash for the v6.3.0 plan.
docs/plans/2026-06-01-pipeline-hardening-4issues.md	Adds implementation plan detailing tasks, verification, and rollout.
docs/plans/2026-06-01-pipeline-hardening-4issues-design.md	Adds design doc covering goals, non-goals, and rationale/ADR references.
decisions/0003-implement-n-completion-trust-boundary.md	Adds ADR documenting why completion is a trust boundary (lead verification) vs hook-enforced invariant.
agents/team-conventions.md	Updates role rules for Implement-N completion discipline + lead verification gate.
.github/workflows/hooks-check.yml	Adds CI workflow intended to run hook contract + stdout-discipline tests on relevant changes.
.cursor-plugin/plugin.json	Version bump to 6.3.0.
.claude-plugin/plugin.json	Version bump to 6.3.0.
.claude-plugin/marketplace.json	Version bump to 6.3.0.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

+  pull_request:
+    paths:
+      - 'hooks/**'
+      - 'tests/hook-contracts.sh'
+      - 'tests/hook-stdout-discipline.sh'
+jobs:
+  hooks:
+    runs-on: ubuntu-latest
+    steps:
+      - uses: actions/checkout@v4
+      - name: Install jq
+        run: sudo apt-get update && sudo apt-get install -y jq
+      - name: Hook contract tests
+        run: bash tests/hook-contracts.sh
+      - name: Hook stdout discipline tests
+        run: bash tests/hook-stdout-discipline.sh


…phase bug-class (#63)

…dup + re-enable hook-contracts CI (#64)

(session-start Linux stat), user-approved (ADR 0004)

…filter on PR (Copilot/CodeQL review)

intel352 · 2026-06-01T06:04:29Z

Review addressed in c19accf:

CodeQL (no permissions): added an explicit permissions: contents: read block. ✅
Copilot (pull_request path filter): added .github/workflows/hooks-check.yml to the pull_request.paths so changes to the workflow itself trigger it on PRs. ✅
Copilot (jobs: indented under on: → invalid YAML): respectfully not actioned — jobs: is at column 0 (top-level) and the workflow ran green on this PR (9/9 checks, the hooks jobs passed), which is direct evidence the YAML is valid. python3 -c 'yaml.safe_load(...)' also parses it clean.

Copilot

Copilot encountered an error and was unable to review this pull request. You can try again by re-requesting a review.

Jon Langevin added 16 commits June 1, 2026 01:04

docs: design + ADR 0003 for v6.3.0 pipeline hardening (#41/#58/#59/#60)

f5bde9f

docs: revise v6.3.0 design per adversarial cycle 1 (C1 JSON-extract +…

d686975

… I1/I2/I3 + m1/m2/m3) + add #61 reminder dedup

docs: revise v6.3.0 design per adversarial cycle 2 (I-NEW-1 pre-compa…

0a23755

…ct clear placement before early-exit)

docs: v6.3.0 design PASS adversarial cycle-3 (converged; empty-sessio…

85f58ac

…n_key guard)

docs: implementation plan for v6.3.0 pipeline hardening (#41/#58/#59/#60

8875c9b

/#61)

docs: revise v6.3.0 plan per adversarial plan-phase (I1 quote-strip +…

97c4208

… I2 jq-absent test + I3 trap/no-transcript + m1)

docs: plan PASS adversarial plan-phase cycle-2 (converged; sed-order …

7a1c35c

…mirrors pre-tool-scope-guard)

chore: lock scope for v6.3.0 pipeline hardening (alignment passed)

9a32eb7

feat(adversarial-design-review): add auth/authz chain-composition pla…

54bf62a

…n-phase bug-class (#59)

docs(pr-monitoring): sanction the bash poll-loop CI-wait pattern, hos…

cd1fbfe

…t-scoped (#60)

docs(subagent-driven-development): completion trust-boundary for Impl…

7714901

…ement-N (#58, ADR 0003)

fix(run-hook.cmd): enforce stdout JSON discipline, recover block deci…

ff3489a

…sions behind warnings (#41)

fix(hooks): pr-review reminder once-per-session + PreCompact reset (#61)

2b5ddc4

ci: run hook contract + stdout-discipline tests on hooks/tests changes (

64bdda9

#41/#61)

chore: bump version to 6.3.0 (#41/#58/#59/#60/#61)

0a14316

fix(hooks): grep -vxF full-line diagnostic routing + atomic marker re…

308cecf

…write (PR review)

Copilot AI review requested due to automatic review settings June 1, 2026 05:52

Copilot started reviewing on behalf of intel352 June 1, 2026 05:52 View session

github-advanced-security AI found potential problems Jun 1, 2026

View reviewed changes

Comment thread .github/workflows/hooks-check.yml Fixed

Copilot AI reviewed Jun 1, 2026

View reviewed changes

Jon Langevin added 4 commits June 1, 2026 01:58

feat(adversarial-design-review): add Artifact-class precedent design-…

5d00fec

…phase bug-class (#63)

fix(session-start): GNU stat -c %Y before BSD -f %m for Linux time-de…

4854f42

…dup + re-enable hook-contracts CI (#64)

chore: amend v6.3.0 scope — fold in #63 (artifact-class precedent) + #64

7f49e0e

(session-start Linux stat), user-approved (ADR 0004)

docs: add #63 + #64 to v6.3.0 release notes

38dcb5e

github-advanced-security AI found potential problems Jun 1, 2026

View reviewed changes

Comment thread .github/workflows/hooks-check.yml Fixed

ci(hooks-check): add explicit permissions block + workflow-self path …

c19accf

…filter on PR (Copilot/CodeQL review)

Copilot AI review requested due to automatic review settings June 1, 2026 06:04

Copilot started reviewing on behalf of intel352 June 1, 2026 06:04 View session

intel352 merged commit c556629 into main Jun 1, 2026
8 of 9 checks passed

intel352 deleted the feat/pipeline-hardening-4issues-v6.3.0 branch June 1, 2026 06:06

intel352 mentioned this pull request Jun 1, 2026

subagent-driven-development: enforce 'only code-reviewer flips Implement-N to completed' #58

Closed

Copilot AI reviewed Jun 1, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: v6.3.0 pipeline hardening — auth/authz bug-class, pr-monitoring pattern, completion trust-boundary, hook stdout-JSON discipline, reminder dedup (#41/#58/#59/#60/#61)#62

feat: v6.3.0 pipeline hardening — auth/authz bug-class, pr-monitoring pattern, completion trust-boundary, hook stdout-JSON discipline, reminder dedup (#41/#58/#59/#60/#61)#62
intel352 merged 21 commits into
mainfrom
feat/pipeline-hardening-4issues-v6.3.0

intel352 commented Jun 1, 2026

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

intel352 commented Jun 1, 2026

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

intel352 commented Jun 1, 2026

Summary

Design / Plan / ADR

Changes

Verification

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

intel352 commented Jun 1, 2026

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants