lwalden
diff --git a/‎.claude/aiagentminder-version‎
Lines changed: 1 addition & 1 deletion b/‎.claude/aiagentminder-version‎
Lines changed: 1 addition & 1 deletion
diff --git a/‎.claude/commands/aam-checkup.md‎
Lines changed: 40 additions & 1 deletion b/‎.claude/commands/aam-checkup.md‎
Lines changed: 40 additions & 1 deletion
diff --git a/‎.claude/commands/aam-grill.md‎
Lines changed: 85 additions & 0 deletions b/‎.claude/commands/aam-grill.md‎
Lines changed: 85 additions & 0 deletions
diff --git a/‎.claude/commands/aam-milestone.md‎
Lines changed: 1 addition & 1 deletion b/‎.claude/commands/aam-milestone.md‎
Lines changed: 1 addition & 1 deletion
diff --git a/‎.claude/commands/aam-retrospective.md‎
Lines changed: 14 additions & 9 deletions b/‎.claude/commands/aam-retrospective.md‎
Lines changed: 14 additions & 9 deletions
diff --git a/‎.claude/commands/aam-tdd.md‎
Lines changed: 100 additions & 0 deletions b/‎.claude/commands/aam-tdd.md‎
Lines changed: 100 additions & 0 deletions
diff --git a/‎.claude/commands/aam-triage.md‎
Lines changed: 105 additions & 0 deletions b/‎.claude/commands/aam-triage.md‎
Lines changed: 105 additions & 0 deletions
@@ -1 +1 @@
-1→1.3.0
+2.1.0
@@ -58,7 +58,41 @@ Read `.claude/aiagentminder-version`.
 - **INFO:** Show installed version (e.g., "v1.0.0")
 - **WARN if missing:** "No version stamp found. Run /aam-update to write one."
 
-### 7. Git Status
+### 7. PR Pipeline (conditional)
+
+Run this check only if `.claude/commands/aam-pr-pipeline.md` OR `.claude/hooks/pr-pipeline-trigger.js` exists in the project — indicating the pipeline was installed.
+
+**7a. Hook script:**
+
+Check that `.claude/hooks/pr-pipeline-trigger.js` exists.
+
+- **PASS:** Found
+- **FAIL:** Missing — "PR pipeline command is installed but hook script is missing. Run /aam-update to restore it."
+
+**7b. Hook configuration:**
+
+Read `.claude/settings.json` and check for a `PostToolUse` entry with `matcher: "Bash"` referencing `pr-pipeline-trigger.js`.
+
+- **PASS:** Entry found
+- **WARN:** Missing — "PostToolUse hook entry not in settings.json — the pipeline will not auto-trigger after `gh pr create`. Run /aam-update to restore it."
+
+**7c. Config file:**
+
+Check that `.pr-pipeline.json` exists at the project root.
+
+- **PASS:** Found
+- **WARN:** Missing — "`.pr-pipeline.json` not found — pipeline will use default settings. Copy it from the AIAgentMinder template or run /aam-update."
+
+**7d. `gh` CLI:**
+
+```bash
+gh --version
+```
+
+- **PASS:** Found — show version
+- **WARN:** Not found — "`gh` CLI not installed. The PR pipeline requires it. Install from https://cli.github.com."
+
+### 8. Git Status
 
 Check git status in the project directory:
 
@@ -85,6 +119,11 @@ AIAgentMinder Health Check — v{version}
 ✓ .claude/settings.json: valid JSON, hook entry present (compact-reorient)
 ✓ .claude/hooks/compact-reorient.js: found
 ✓ Git: branch main, remote origin configured
+[if PR pipeline installed:]
+✓ .claude/hooks/pr-pipeline-trigger.js: found
+✓ .claude/settings.json: PostToolUse:Bash hook entry present
+✓ .pr-pipeline.json: found
+✓ gh CLI: v2.45.0
 
 Status: Healthy (1 warning)
 ```
 
@@ -0,0 +1,85 @@
+# /aam-grill - Plan Interrogation
+
+Stress-test a plan or design by walking every branch of the decision tree. This is the intensive counterpart to `approach-first.md` — use it when a design is non-obvious, high-stakes, or involves multiple interdependent decisions.
+
+---
+
+## Step 1: Scope the Interrogation
+
+Read the plan or design being questioned. Sources may include:
+
+- An approach statement from `approach-first.md`
+- A feature description from `docs/strategy-roadmap.md`
+- A freeform plan the user describes
+- An existing design doc or PR description
+
+Also read:
+
+- `DECISIONS.md` — for prior decisions that constrain the design space
+- `docs/strategy-roadmap.md` — for quality tier and scope context
+
+---
+
+## Step 2: Map the Decision Tree
+
+Identify every decision branch in the plan — any point where two or more reasonable approaches exist. Present the branches as a numbered list, noting dependencies between them.
+
+Example:
+
+> Decision branches identified:
+> 1. Auth storage: session vs JWT
+> 2. Rate limiting: middleware vs API gateway (depends on #1)
+> 3. Error format: structured vs freeform
+
+---
+
+## Step 3: Walk Each Branch
+
+For each decision branch, one at a time:
+
+1. **If the codebase can answer it** — explore the code first. Check existing patterns, conventions, and constraints. Do not ask the user questions that the code can answer.
+2. **If the user needs to decide** — present:
+   - The options available
+   - Tradeoffs of each
+   - Reversal cost: **High** (hard to undo), **Medium** (costly but possible), or **Low** (easy to change later)
+   - Downstream dependencies: "If you choose X, then Y becomes constrained"
+
+Resolve one branch before moving to the next. If branches have dependencies, resolve the upstream branch first.
+
+**Continue until the user says they are satisfied or all branches are resolved.**
+
+---
+
+## Step 4: Decision Summary
+
+After all branches are resolved, produce a structured summary:
+
+```
+Grill Summary — {plan name}
+
+Decisions made:
+1. {topic}: {choice}. Alternatives: {what was considered}. Reversal cost: {H/M/L}. Rationale: {why}.
+2. ...
+
+Open questions (deferred):
+- {question} — deferred because {reason}
+
+Constraints discovered:
+- {constraint found during codebase exploration}
+```
+
+Ask the user: "Log these decisions to DECISIONS.md? (y/n)"
+
+If yes, append each decision to DECISIONS.md in the project's existing format, including alternatives considered.
+
+---
+
+## When to Use This
+
+- **Use `/aam-grill`** when a design is non-obvious, high-stakes, or has multiple interdependent decisions.
+- **Use `approach-first.md`** for routine check-ins — state intent, confirm, proceed.
+- For Rigorous+ quality tiers, consider running `/aam-grill` before architecture changes touching more than 5 files.
+
+---
+
+*Adapted from [mattpocock/skills/grill-me](https://github.com/mattpocock/skills) (MIT license).*
@@ -12,7 +12,7 @@ Read the following:
 
 1. `docs/strategy-roadmap.md` — current phase, MVP features, out-of-scope items, phase timeline
 2. `DECISIONS.md` — original stack and architecture decisions; also read the `## Known Debt` section if present
-3. `SPRINT.md` — current sprint status (if active); archived sprint lines for velocity data
+3. `SPRINT.md` — current sprint status (if active); archived sprint lines for sprint sizing data
 4. Use TaskList to get current task states
 5. Recent git log: `git log --oneline -20` — what has been merged recently
 6. File count and largest files:
 
@@ -66,20 +66,25 @@ Patterns:
 
 Read prior archived sprint lines from `SPRINT.md` (lines starting with `S{n} archived`).
 
-From each archived line, extract the velocity percentage (the `{velocity}%` value in the archive format).
+From each archived line and from the current sprint's metrics (Step 2), identify **stress indicators**:
+- **Scope churn**: any scope additions or removals occurred during the sprint
+- **Blocked issues**: any issues ended the sprint in `blocked` status
+- **Context pressure**: the sprint had 7+ planned issues
 
 **Recommendation logic:**
 
-- **Sprint 1 or 2 (insufficient data):** "Not enough sprint history for sizing recommendations yet."
-- **Sprint 3+:** Compute the median completion rate across available archived sprints.
-  - Median ≥ 90%: "You're completing nearly everything planned. Consider planning {planned + 1} to {planned + 2} issues next sprint."
-  - Median 70–89%: "Completion rate is healthy. Current sprint size of ~{planned} issues looks right."
-  - Median 50–69%: "Consistently completing about {median%} of planned work. Consider planning {recommended range} issues rather than {planned}."
-  - Median < 50%: "Sprint scope is regularly exceeding capacity. Recommend planning {recommended_max} issues or fewer next sprint."
+- **Sprint 1 (no history):** "First sprint — recommend 4–5 issues next sprint to establish a baseline. Fit whole features; avoid splitting a feature across sprints when it can fit in one."
+- **Sprint 2+:** Start from the previous sprint's planned issue count (or 5 if unavailable). Apply adjustments:
+  - No stress indicators in the most recent sprint: hold steady. Do not increase.
+  - 1 stress indicator in the most recent sprint: reduce max by 1.
+  - 2+ stress indicators in the most recent sprint: reduce both min and max by 1.
+  - Stress indicators present in 2+ of the last 3 sprints: reduce both min and max by 1 (cumulative with above).
 
-Write the recommendation as the `<!-- sizing: {min}-{max} -->` comment in the SPRINT.md archive line (see sprint-workflow.md Sprint Completion). This comment persists for the next sprint planning step to read.
+**Hard boundaries:** The recommendation must fall within 3–7 issues. Clamp to this range after all adjustments. Never recommend more than 7 regardless of history.
+
+**Feature coherence:** Always append: "Prefer fitting whole features over hitting an issue count. If a feature needs more issues than the range, plan the feature — but confirm with the user that context will stay manageable."
 
-Only surface this when there is a pattern (2+ sprints in the same completion band). Do not speculate on a single data point.
+Write the recommendation as the `<!-- sizing: {min}-{max} -->` comment in the SPRINT.md archive line (see sprint-workflow.md Sprint Completion). This comment persists for the next sprint planning step to read.
 
 ---
 
 
@@ -0,0 +1,100 @@
+# /aam-tdd - Test-Driven Development
+
+Guided TDD workflow for implementing features through red-green-refactor cycles. This is the full methodology behind `code-quality.md`'s one-liner: "Write a failing test first. Implement the minimal solution. Refactor after green."
+
+**Core principle:** Tests verify behavior through public interfaces, not implementation details. A good test survives internal refactors because it doesn't care about internal structure.
+
+---
+
+## Step 0: Read Context
+
+Read `docs/strategy-roadmap.md` and find the **Quality Tier** section.
+
+- **Standard tier and above:** TDD is the expected workflow. Proceed.
+- **Lightweight tier:** TDD is optional. Ask: "Quality tier is Lightweight — TDD is optional. Proceed with TDD anyway? (y/n)"
+
+Also read `.claude/rules/code-quality.md` if it exists — the skill complements that rule, not replaces it.
+
+---
+
+## Step 1: Planning
+
+Before writing any code:
+
+1. **Read existing tests** in the project to learn the test framework, naming conventions, file organization, and assertion style. Match the project's patterns.
+2. **Identify the public interface** — what functions, endpoints, or APIs will callers use? Aim for a small interface with deep implementation (fewer methods, simpler parameters, complexity hidden inside).
+3. **List 3-7 behaviors to test** — describe what the system does, not how. Each behavior should be observable through the public interface.
+   - Good: "returns 404 when user not found"
+   - Bad: "calls findById with the user ID"
+4. **Design for testability** — accept dependencies as parameters instead of creating them internally. Return results instead of producing side effects. Keep the surface area small.
+5. **Present the test plan** to the user for approval.
+
+You cannot test everything. Focus testing effort on critical paths and complex logic, not every edge case. Confirm priorities with the user.
+
+---
+
+## Step 2: Tracer Bullet
+
+Write ONE test that confirms ONE thing about the system — the simplest vertical slice of the feature.
+
+```
+RED:   Write the test → it fails (confirms test infrastructure works)
+GREEN: Write minimal code to pass → it passes
+```
+
+This is the tracer bullet — it proves the end-to-end path works and establishes the pattern for subsequent cycles.
+
+**Vertical slices, not horizontal layers.** Your first test should touch the real entry point and produce a real result, even if simplified. Do not write all tests first then all implementation — that produces tests of imagined behavior.
+
+---
+
+## Step 3: Incremental Loop
+
+For each remaining behavior from the test plan:
+
+```
+RED:   Write the next test → it fails
+GREEN: Write minimal code to pass → it passes
+```
+
+Rules:
+- One test at a time
+- Only enough code to pass the current test
+- Do not anticipate future tests
+- Keep tests focused on observable behavior
+
+**Mocking guidance:** Mock at system boundaries only — network calls, filesystem, clock, randomness. Do not mock your own modules or internal collaborators. Use dependency injection to swap external implementations in tests. Prefer specific mock functions per operation over a single generic mock.
+
+**Test quality check per cycle:**
+- [ ] Test describes behavior, not implementation
+- [ ] Test uses the public interface only
+- [ ] Test would survive an internal refactor
+- [ ] Code is minimal for this test
+- [ ] No speculative features added
+
+---
+
+## Step 4: Refactor
+
+All tests are green. Now look for refactor candidates:
+
+- **Duplication** — extract shared logic into functions
+- **Long methods** — break into private helpers (keep tests on the public interface)
+- **Shallow modules** — if a module's interface is as complex as its implementation, combine or deepen it
+- **Feature envy** — logic that uses another module's data more than its own belongs in that other module
+- **Primitive obsession** — raw strings or numbers where a domain type would add clarity
+- **Existing code** the new code reveals as problematic
+
+Run tests after each refactor step. Never refactor while RED — get to GREEN first.
+
+---
+
+## When to Use This
+
+- **Use `/aam-tdd`** when starting a new feature or when the test plan is non-obvious.
+- **Use `code-quality.md`** (loaded automatically at Standard+ tiers) for day-to-day TDD discipline without the full structured workflow.
+- Pairs well with `/aam-triage` — triage produces a fix plan as RED-GREEN cycles that this skill can execute.
+
+---
+
+*Adapted from [mattpocock/skills/tdd](https://github.com/mattpocock/skills) (MIT license).*
@@ -0,0 +1,105 @@
+# /aam-triage - Bug Triage
+
+Systematically investigate a bug: reproduce it, diagnose the root cause, design a durable fix plan, and create a GitHub issue with the analysis. This is the structured start to debugging — complementing `debug-checkpoint.md` which handles the structured pause when a fix stalls.
+
+---
+
+## Step 1: Capture the Problem
+
+Get a clear bug description from the user (or from the skill argument). Clarify:
+
+- **Expected behavior** — what should happen?
+- **Actual behavior** — what happens instead?
+- **Steps to reproduce** — how to trigger the bug
+- **Error output** — error message, stack trace, or unexpected output
+
+If any of these are unclear, ask before proceeding.
+
+---
+
+## Step 2: Explore and Diagnose
+
+Use the Agent tool to spawn an exploration subagent (`subagent_type: "Explore"`). The subagent's job:
+
+1. Trace the code path from the entry point (the user action or API call that triggers the bug) to the failure point.
+2. Read relevant source files, tests, and configuration — not the entire codebase.
+3. Report back:
+   - The code path involved
+   - Where the failure occurs
+   - A root cause hypothesis
+
+**Durability principle:** Describe the root cause in terms of behaviors and contracts, not line numbers or internal variable names. The diagnosis should remain useful even if the code is refactored.
+
+---
+
+## Step 3: Validate Root Cause
+
+Attempt to confirm the root cause hypothesis:
+
+- Run relevant tests to see if the failure is captured
+- Reproduce the bug if possible (run the failing path)
+- Check whether the hypothesis explains all symptoms
+
+If the hypothesis doesn't hold after 2 iterations, trigger the `debug-checkpoint.md` pattern: stop and present a structured checkpoint to the user with what's been tried, the current hypothesis, and what information would unblock progress.
+
+---
+
+## Step 4: Design Fix Plan
+
+Design the fix as a series of RED-GREEN test cycles:
+
+1. **RED** — a failing test that captures the bug behavior ("When X happens, the system should Y but currently does Z")
+2. **GREEN** — the minimal code change to make the test pass
+
+Write durable descriptions that target behaviors and contracts:
+- Good: "When an expired token is submitted, the system should return 401 and clear the session"
+- Bad: "Change line 47 in auth.js to check token.exp"
+
+If the fix involves multiple behaviors, plan multiple RED-GREEN cycles.
+
+---
+
+## Step 5: Create GitHub Issue
+
+Use `gh issue create` to create an issue with the analysis:
+
+```
+Title: [Bug]: {concise description}
+
+## Root Cause
+
+{findings from Steps 2-3 — what causes the bug and why}
+
+## Fix Plan
+
+{RED-GREEN cycles from Step 4}
+
+## Reproduction
+
+{steps from Step 1}
+```
+
+Add labels: `bug`, `triage`.
+
+If `SPRINT.md` has an active sprint, note the sprint ID in the issue body under a **Sprint Context** heading.
+
+---
+
+## Step 6: Log Findings
+
+If the root cause reveals a significant architectural insight (e.g., "auth middleware doesn't validate expired tokens", "error handling is inconsistent across API endpoints"), ask the user: "This finding may be worth recording in DECISIONS.md. Log it? (y/n)"
+
+If yes, append to DECISIONS.md in the project's existing format.
+
+---
+
+## When to Use This
+
+- **Use `/aam-triage`** when a bug needs structured investigation — not a quick fix, but root cause analysis.
+- **Use `debug-checkpoint.md`** (triggers automatically) when you're mid-fix and stuck after 3 attempts on the same error.
+- Created issues can be pulled into the next sprint via `/aam-sync-issues`.
+- Fix plans produce RED-GREEN cycles that `/aam-tdd` can execute.
+
+---
+
+*Adapted from [mattpocock/skills/triage-issue](https://github.com/mattpocock/skills) (MIT license).*