Pipeline evidence + doc-sync hardening (#69 #70 #71 #72) → v6.4.0 by intel352 · Pull Request #73 · GoCodeAlone/autonomous-dev-kit

intel352 · 2026-06-03T17:25:51Z

Summary

Fixes four pipeline self-consistency issues with zero new skills/scripts and no heuristic scanner (per the user's anti-bloat / anti-trap / anti-circular-logic constraint). Net change ≈ +85 lines across 3 skills + a 63-line regression test + 3-manifest version bump.

adversarial-design-review should emit a committed, structured findings report that post-merge-retrospective can score #69 — adversarial-design-review now mandates committing its findings report to the repo's existing <stem>-design-review.md / <stem>-plan-review.md convention, with stable finding IDs (D1/P1, sequential across all severities) + an optional end-state Resolution. This was a phantom dependency: post-merge-retrospective already read "adversarial-review reports committed in docs/plans/" but nothing wrote them — fixed by systematizing the ad-hoc practice (2 such files already existed) rather than inventing a new artifact.
post-merge-retrospective depends on a project-local skill-activation-audit script that isn't shipped; activation evidence too weak to confirm gates ran #70 — post-merge-retrospective now reads .claude/autodev-state/in-progress.jsonl (written by the shipped record-activity PostToolUse hook in any repo) as the primary activation source, demotes the kit-dev-only tests/skill-activation-audit.sh, and degrades gracefully ("activation log unavailable", never "script does not exist"). Three edit sites fixed together (Step 5 + the ## Missed skill activations format template + the **Reads:** bullet) so the broken instruction isn't re-embedded in every future retro.
alignment-check/scope-lock should scope-check committed docs & examples, not just plan tasks (forward-referencing docs slip through) #71 + Add a design-sync gate: design docs drift from implementation within locked scope (config keys, examples, formats) with no reconciliation step #72 — finishing-a-development-branch gains Step 1e: Doc-Reconciliation Check (pre-PR), wired into both the body and the Autonomous Mode list (so it fires in autonomous runs), checking committed docs/examples for (a) unlabeled forward-references → must be Planned (PR #N) [alignment-check/scope-lock should scope-check committed docs & examples, not just plan tasks (forward-referencing docs slip through) #71] and (b) identifier drift vs the built code [Add a design-sync gate: design docs drift from implementation within locked scope (config keys, examples, formats) with no reconciliation step #72]. It's a checklist gate (not a scanner — the user's LIGHT choice) with a PR-body Doc-reconciliation: accountability token consumed by the retro. Plus adversarial-design-review gains one plan-phase Identifier / naming-convention match row (catch drift before code), explicitly distinct from the existing Config-validation row.

Design / Plan / Adversarial review reports

Design: docs/plans/2026-06-03-pipeline-evidence-doc-sync-design.md
Plan: docs/plans/2026-06-03-pipeline-evidence-doc-sync.md
Design-phase review (3 cycles → PASS): docs/plans/2026-06-03-pipeline-evidence-doc-sync-design-review.md
Plan-phase review (PASS): docs/plans/2026-06-03-pipeline-evidence-doc-sync-plan-review.md

These two committed *-review.md files are themselves the first dogfood of #69 (the new committed-report convention).

Scope Manifest

PR Count: 1 · Tasks: 7 · Status: Locked 2026-06-03T17:10:43Z (sha256 6cb6ebd9…)

PR #	Title	Tasks	Branch
1	Pipeline evidence + doc-sync hardening (#69 #70 #71 #72) → v6.4.0	Task 1–7	feat/pipeline-evidence-doc-sync

Changes (per task)

tests/pipeline-evidence-doc-sync.sh — grep-assertion regression guard for all 4 contracts (RED→GREEN TDD; verified genuinely RED on origin/main).
adversarial-design-review — committed-report mandate + deterministic path rule + stable IDs + Resolution + **Writes:**.
adversarial-design-review — plan-phase naming-convention checklist row.
post-merge-retrospective — jsonl-primary + 3 edit sites + Step-1e token consumer + back-compat.
finishing-a-development-branch — Step 1e body + Autonomous Mode list item + accountability token.
.github/workflows/skill-content-check.yml — run the new test and skill-cross-refs.sh in CI (the latter was local-only before).
Version bump to v6.4.0 across the 3 manifests.

Verification

tests/pipeline-evidence-doc-sync.sh → Results: 0 failure(s) (revert-restore proven: each assertion goes RED when its contract is removed)
tests/skill-content-grep.sh → PASS (no host-token leaks) · tests/skill-cross-refs.sh → PASS (all refs resolve)
tests/version-check.sh → PASS (3 manifests = 6.4.0) · tests/hook-contracts.sh → PASS (no regression)
plan-scope-check --verify-lock → PASS (manifest untouched)
Runtime-launch (Step 1b): the kit ships markdown skills (no binary); its runtime check is the skill-content/cross-ref/version test suite above — all green. No demo artifacts shipped.
Adversarial code review: APPROVE, 0 Critical; the 1 Important (finding-ID numbering ambiguity) + 1 Minor (test OR-branch too broad) were fixed in a90b…→ the review commit.

Doc-reconciliation: clean (Step 1e dogfooded on this PR — docs describe exactly this single PR's scope; identifiers match the built skills).

Generated by the autodev autonomous pipeline (brainstorming → adversarial design×3 → plan → adversarial plan → alignment → scope-lock → execute → adversarial code review). Copilot review is welcome but not blocking.

🤖 Generated with Claude Code

Adopt existing <stem>-design-review.md/-plan-review.md convention (D1); complete D2 retro fix incl format-template:99 + Reads bullet; narrow Step 1e trigger + add PR-body accountability token (D6); reframe Resolution optional/ end-state with retro consumer (D5); clarify phase attribution (D4); dogfood caveat (D3). Commits the design-phase review report itself (dogfoods #69).

…st site, wired token consumer) N1 (Critical): replace ambiguous stem-derivation prose with deterministic design/plan rule + both worked examples (D1<->D2 path contract). N2: Step 1e gains a second edit site in finishing Autonomous Mode list. N3: wire Doc-reconciliation token to retro Step 5 missed-activation row. N4/N5: scalpel Reads edit; trigger reword.

…s clarified

…state, wire cross-refs CI)

…nner/append out (alignment PASS drift)

) [red]

… IDs (#69)

…72)

…al script (#70)

…ist (#71 #72)

…rtion (M1)

Copilot

Pull request overview

This PR hardens the autonomous pipeline’s evidence trail and doc/implementation synchronization by making adversarial review outputs and activation evidence reliably consumable by the retrospective, and by adding a lightweight pre-PR doc reconciliation gate. It also adds a regression test + CI wiring to prevent these contracts from silently regressing, and bumps the kit version to v6.4.0.

Changes:

Require adversarial-design-review to produce a committed, deterministically-named review report with stable finding IDs (plus a plan-phase naming/identifier consistency check).
Update post-merge-retrospective to treat .claude/autodev-state/in-progress.jsonl as the primary activation evidence source, gracefully degrading when absent, and to consume the Step 1e accountability token.
Add finishing-a-development-branch Step 1e (Doc-Reconciliation Check) and a regression test (tests/pipeline-evidence-doc-sync.sh) wired into CI; bump manifests to 6.4.0.

Reviewed changes

Copilot reviewed 13 out of 13 changed files in this pull request and generated 3 comments.

Show a summary per file

File	Description
`tests/pipeline-evidence-doc-sync.sh`	Adds grep-based regression assertions to lock in the #69–#72 contracts.
`skills/adversarial-design-review/SKILL.md`	Mandates writing+committing the review report, adds stable IDs guidance, and adds a plan-phase identifier/naming-convention checklist row.
`skills/post-merge-retrospective/SKILL.md`	Switches activation scoring to use `in-progress.jsonl` as primary evidence; adds deterministic report-path derivation and Step 1e token consumption.
`skills/finishing-a-development-branch/SKILL.md`	Adds Step 1e Doc-Reconciliation gate and wires it into Autonomous Mode control flow.
`.github/workflows/skill-content-check.yml`	Runs the new regression test and `tests/skill-cross-refs.sh` in CI, and updates path filters accordingly.
`.claude-plugin/plugin.json`	Version bump to 6.4.0.
`.claude-plugin/marketplace.json`	Version bump to 6.4.0.
`.cursor-plugin/plugin.json`	Version bump to 6.4.0.
`docs/plans/2026-06-03-pipeline-evidence-doc-sync-design.md`	Adds the design doc for the change set (dogfooding the new workflow artifacts).
`docs/plans/2026-06-03-pipeline-evidence-doc-sync-design-review.md`	Adds the design-phase adversarial review report artifact.
`docs/plans/2026-06-03-pipeline-evidence-doc-sync.md`	Adds the implementation plan doc for the change set.
`docs/plans/2026-06-03-pipeline-evidence-doc-sync-plan-review.md`	Adds the plan-phase adversarial review report artifact.
`docs/plans/2026-06-03-pipeline-evidence-doc-sync.md.scope-lock`	Adds the scope-lock hash for the plan.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

   - Any ADRs cited from the design or plan

 2. **Score each adversarial-review finding.**
+   Derive the report path by the **same deterministic rule as D1**: take the artifact filename, drop `.md`, then design → append `-review.md`, plan → append `-plan-review.md` (e.g. `…-doc-sync-design.md` → `…-doc-sync-design-review.md`; `2026-06-03-…-doc-sync.md` → `2026-06-03-…-doc-sync-plan-review.md`). Read the committed `…-design-review.md` / `…-plan-review.md` report(s). For each finding, use its stable ID; read the optional `Resolution` column as a scoring hint, **falling back to downstream evidence (code-review threads, CI) when blank or when the report is an old no-ID format**. If the report is absent → "no committed review report; reconstructed from revision history" (most pre-v6.4.0 features have none).


 | brainstorming | yes | |
 | adversarial-design-review (design) | yes | |
 | adversarial-design-review (plan) | no | <why — e.g., manual override; deferred to alignment-check> |
+| finishing Step 1e (doc-reconciliation) | yes/unverified | only when the diff touched docs/examples |


 **Findings (Critical):**
- [class] [section/line]: <description>. Recommendation: <concrete fix>.
+- `D1` [class] [section/line]: <description>. Recommendation: <concrete fix>. _Resolution: <optional — filled once at end-state: commit SHA / `accepted — reason` / `false-positive`; omit if open>._

 **Findings (Important):**
- [class] [section/line]: <description>. Recommendation: <concrete fix>.
+- `D2` [class] [section/line]: <description>. Recommendation: <concrete fix>. _Resolution: <optional>._

 **Findings (Minor):**
- [class] [section/line]: <description>. Recommendation: <concrete fix>.
+- `D3` [class] [section/line]: <description>. Recommendation: <concrete fix>. _Resolution: <optional>._


Jon Langevin added 16 commits June 3, 2026 12:43

design: pipeline evidence + doc-sync hardening (#69 #70 #71 #72)

bc6b805

design: PASS @ adversarial cycle 3 (converged); M3 overwrite-semantic…

39cf438

…s clarified

plan: pipeline evidence + doc-sync hardening (7 tasks, 1 PR → v6.4.0)

e72c7aa

plan: PASS @ plan-phase adversarial; resolve P1/P2 (tighten test RED …

36d2932

…state, wire cross-refs CI)

design: clarify Non-goal — regression test (CI harness) in-scope, sca…

2bfd92d

…nner/append out (alignment PASS drift)

chore: lock scope for pipeline-evidence-doc-sync (alignment passed)

f4faf23

test: regression guard for pipeline evidence + doc-sync (#69 #70 #71 #72

617e0e0

) [red]

feat(adversarial-review): mandate committed findings report w/ stable…

1b1601b

… IDs (#69)

feat(adversarial-review): plan-phase naming-convention checklist row (#…

366326e

…72)

feat(retro): read committed report + activation jsonl; demote kit-loc…

227e766

…al script (#70)

feat(finishing): Step 1e doc-reconciliation gate, body + autonomous l…

779d646

…ist (#71 #72)

ci: run pipeline-evidence-doc-sync contract test on skill changes

bba3527

chore(release): bump to v6.4.0 (#69 #70 #71 #72)

c928fa6

review: clarify finding-ID numbering (I1) + tighten D1<->D2 test asse…

b0ce92f

…rtion (M1)

Copilot AI review requested due to automatic review settings June 3, 2026 17:25

Copilot started reviewing on behalf of intel352 June 3, 2026 17:26 View session

intel352 merged commit f9f0eb7 into main Jun 3, 2026
8 checks passed

Copilot AI reviewed Jun 3, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Pipeline evidence + doc-sync hardening (#69 #70 #71 #72) → v6.4.0#73

Pipeline evidence + doc-sync hardening (#69 #70 #71 #72) → v6.4.0#73
intel352 merged 16 commits into
mainfrom
feat/pipeline-evidence-doc-sync

intel352 commented Jun 3, 2026

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

intel352 commented Jun 3, 2026

Summary

Design / Plan / Adversarial review reports

Scope Manifest

Changes (per task)

Verification

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants