Improve SKILL.md v1.1.0: token budgeting, model allocation, Mermaid, language-agnostic

yelban · claude · yelban · commit d06c01a795bd · 2026-02-16T21:55:01.000+08:00
Compared schematic with the Cartographer skill and identified four areas where
Cartographer's engineering rigor could strengthen schematic while preserving its
core advantages (semantic grouping, deep inference, cross-validation).

Changes to SKILL.md:

- Phase 1: Add token budget estimation step (wc -c on full diff) with tiered
  agent scaling strategy (<50k → 1 agent, 50k-200k → 2-3, 200k+ → 3-4)
- Phase 2: Explicitly specify Sonnet subagents for file reading/analysis with
  Opus orchestrating (plan, synthesize, infer) — cost/capability optimization
- Phase 3: Replace hardcoded '*.ts' '*.tsx' with language-agnostic
  git diff --name-only, removing TypeScript/React assumption
- Phase 4: Replace ASCII diagram instructions with Mermaid (graph TB for
  system diagrams, sequenceDiagram for data lifecycle) — native GitHub rendering
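
The Phase 1 tiering above can be sketched as a small shell helper. This is only
an illustration: `suggest_agents` is a hypothetical name that does not appear in
SKILL.md, and the chars / 4 estimate is the rough heuristic from the commit, not
a real tokenizer. Note that `wc -c` reports bytes, which approximates characters
only for mostly-ASCII diffs.

```bash
# Hypothetical helper (not part of SKILL.md): map a diff size in bytes,
# as reported by `wc -c`, to the agent tier described in Phase 1.
suggest_agents() {
  # Rough heuristic from the commit: about 4 characters per token.
  tokens=$(( $1 / 4 ))
  if [ "$tokens" -lt 50000 ]; then
    echo "1"      # <50k tokens: a single agent reads all diffs
  elif [ "$tokens" -lt 200000 ]; then
    echo "2-3"    # 50k-200k tokens
  else
    echo "3-4"    # 200k+ tokens, with roughly 150k tokens per agent at most
  fi
}

# Sample call with a made-up size: 400000 bytes is about 100k tokens.
suggest_agents 400000
```

Inside a repository the argument would come from the diff itself, for example
`suggest_agents "$(git diff <base>...HEAD | wc -c)"`, with `<base>` replaced by
the base branch.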

Also adds:
- CLAUDE.md: Project summary for Claude Code context
- docs/CODEBASE_MAP.md: Full codebase map (Cartographer output)
- docs/COMPARISON_schematic_vs_cartographer.md: Detailed skill comparison

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
diff --git a/CLAUDE.md b/CLAUDE.md
@@ -0,0 +1,8 @@
+# Schematic
+
+A Claude Code / Codex skill that reverse-engineers detailed product & technical spec documents from git branch implementations. Uses 2-4 parallel agents to analyze changed files, cross-checks for gaps, and outputs an 11-section structured specification.
+
+**Stack**: Markdown-driven skill (no executable code) — SKILL.md is the "program"
+**Structure**: `SKILL.md` (core workflow) + `README.md` (user guide) + `.claude/settings.local.json` (permissions)
+
+For detailed architecture, see [docs/CODEBASE_MAP.md](docs/CODEBASE_MAP.md).
diff --git a/SKILL.md b/SKILL.md
@@ -9,8 +9,8 @@ description: |
   branch does". Produces a structured markdown spec covering problem statement, product requirements,
   architecture, technical design, file inventories, testing strategy, rollout plan, and risks.
 author: Codex
-version: 1.0.0
-date: 2026-02-15
+version: 1.1.0
+date: 2026-02-16
 tags: [documentation, git, branch-analysis, spec, reverse-engineering]
 ---
 
@@ -46,8 +46,16 @@ git diff --stat <base>...HEAD
 
 # 3. Count the scale
 git diff --stat <base>...HEAD | tail -1
+
+# 4. Estimate diff token budget (bytes / 4 ≈ tokens; wc -c counts bytes)
+git diff <base>...HEAD | wc -c
 ```
 
+**Agent scaling by token budget:**
+- **<50k tokens** → single agent (read all diffs directly)
+- **50k–200k tokens** → 2–3 agents
+- **200k+ tokens** → 3–4 agents, max ~150k tokens per agent
+
 From the diff stats, categorize files into groups:
 - **Core implementation** (new modules, business logic)
 - **Integration points** (modified selectors, reducers, hooks, components)
@@ -60,6 +68,11 @@ From the diff stats, categorize files into groups:
 Launch 2-4 parallel exploration agents, each focused on a different file group. This is
 critical for efficiency — reading 50+ files sequentially is too slow.
 
+**Model allocation:** Use `subagent_type: "Explore"` with `model: "sonnet"` for all
+exploration agents. Sonnet handles file reading and analysis (best cost/capability ratio).
+The orchestrating model (Opus) plans assignments, synthesizes reports, and infers product
+motivation — it should never read diff files directly.
+
 **Agent 1: Core Implementation**
 - All new files (the heart of the feature)
 - Focus on: purpose, key types, exported functions, data flow, inter-module connections
@@ -87,8 +100,8 @@ Each agent prompt should ask for:
 After agents return, diff the analyzed files against the full file list:
 
 ```bash
-# List all non-test changed files
-git diff --stat <base>...HEAD -- '*.ts' '*.tsx' | awk '{print $1}' | sort
+# List all changed files (language-agnostic)
+git diff --name-only <base>...HEAD | sort
 
 # Show small diffs for any files not yet analyzed
 git diff <base>...HEAD -- <uncovered-files>
@@ -128,10 +141,12 @@ What is and isn't included.
 
 ## 4. Architecture
 ### 4.1 System Diagram
-ASCII diagram showing component relationships and data flow.
+Mermaid `graph TB` diagram showing component relationships and data flow.
+(Mermaid renders natively on GitHub/GitLab — prefer over ASCII.)
 
 ### 4.2 Data Lifecycle
-Step-by-step flow from initial state through steady state.
+Mermaid `sequenceDiagram` or step-by-step description showing flow
+from initial state through steady state.
 
 ## 5. Technical Design
 Subsections for each major design decision:
diff --git a/docs/CODEBASE_MAP.md b/docs/CODEBASE_MAP.md
@@ -0,0 +1,146 @@
+---
+last_mapped: 2026-02-16T13:39:48Z
+total_files: 4
+total_tokens: 2334
+---
+
+# Codebase Map
+
+> Auto-generated by Cartographer. Last mapped: 2026-02-16T13:39:48Z
+
+## System Overview
+
+Schematic is a Claude Code / Codex **skill** that reverse-engineers detailed product & technical spec documents from git branch diffs. It spawns 2-4 parallel agents to read changed files, cross-checks for gaps, and outputs an 11-section structured specification.
+
+```mermaid
+graph TB
+    subgraph Trigger
+        User["User Request"]
+    end
+
+    subgraph "Phase 1: Scope"
+        Git["git log / git diff --stat"]
+        Classify["File Classification"]
+    end
+
+    subgraph "Phase 2: Parallel Exploration"
+        A1["Agent 1: Core Implementation"]
+        A2["Agent 2: Integration Points"]
+        A3["Agent 3: Tests"]
+        A4["Agent 4: Config & Infra"]
+    end
+
+    subgraph "Phase 3-5: Synthesis"
+        CrossCheck["Cross-Check Gaps"]
+        WriteSpec["Write 11-Section Spec"]
+        Verify["Verify Completeness"]
+    end
+
+    Output["docs/ spec document"]
+
+    User --> Git --> Classify
+    Classify --> A1 & A2 & A3 & A4
+    A1 & A2 & A3 & A4 --> CrossCheck --> WriteSpec --> Verify --> Output
+```
+
+## Directory Structure
+
+```
+schematic/
+├── .claude/
+│   └── settings.local.json  # Claude Code permissions (allows uv run:*)
+├── docs/
+│   └── CODEBASE_MAP.md      # This file
+├── LICENSE                   # MIT License
+├── README.md                 # User-facing docs: install & usage
+└── SKILL.md                  # Core skill definition (11-section spec workflow)
+```
+
+## File Guide
+
+| File | Purpose | Tokens |
+|------|---------|--------|
+| `.claude/settings.local.json` | Execution permissions — allows `uv run:*` Bash commands | 26 |
+| `LICENSE` | MIT License (Copyright 2026) | 217 |
+| `README.md` | User-facing install/usage guide for Claude Code & Codex | 380 |
+| `SKILL.md` | Full skill specification: 5-phase workflow, 11-section output template | 1,711 |
+
+### SKILL.md — Core Skill Definition
+
+**Purpose**: The "source code" of this project. Defines the complete workflow Claude Code/Codex follows when triggered.
+
+**Key Sections**:
+- **Metadata**: name, version (1.0.0), tags
+- **Trigger conditions**: phrases like "analyze this branch", "reverse engineer a spec"
+- **5-Phase workflow**:
+  1. Scope the branch (git diff --stat, file classification)
+  2. Parallel deep exploration (2-4 agents reading file groups)
+  3. Cross-check for gaps (compare analyzed vs full file list)
+  4. Write the spec (11 structured sections)
+  5. Verify completeness
+- **11 Output Sections**: Problem Statement, Solution Overview, Product Requirements, Architecture, Technical Design, New Files, Modified Files, Testing Strategy, Rollout Strategy, Risks, Summary
+- **Validation criteria**: every changed file documented, architecture diagrams match data flow, product requirements match test assertions
+
+**File Classification Categories**:
+- Core implementation (new modules, business logic)
+- Integration points (selectors, reducers, hooks, components)
+- Tests (unit, integration, e2e)
+- Configuration (feature flags, env vars, types)
+- Incidental (formatting, imports, minor refactors)
+
+### README.md — User Guide
+
+**Purpose**: Quick-start guide for installing and using the skill.
+
+**Install paths**:
+- Claude Code: `~/.claude/skills/schematic`
+- Codex: `~/.codex/skills/schematic`
+
+**Trigger phrases**: "Reverse engineer a spec", "Analyze this branch", "Write a spec from the code", "Document what this branch does"
+
+## Data Flow
+
+```mermaid
+sequenceDiagram
+    participant U as User
+    participant CC as Claude Code/Codex
+    participant Git as Git CLI
+    participant Agents as Parallel Agents
+
+    U->>CC: "Analyze this branch"
+    CC->>Git: git log --oneline base..HEAD
+    CC->>Git: git diff --stat base...HEAD
+    Git-->>CC: Diff stats & file list
+    CC->>CC: Classify files into groups
+    CC->>Agents: Spawn 2-4 parallel read agents
+    Agents->>Agents: Read & analyze assigned files
+    Agents-->>CC: Analysis reports
+    CC->>Git: git diff base...HEAD -- uncovered-files
+    Git-->>CC: Remaining diffs
+    CC->>CC: Synthesize 11-section spec
+    CC->>CC: Verify all files documented
+    CC-->>U: Spec document in docs/
+```
+
+## Conventions
+
+- **No executable code**: The entire project is markdown-driven; SKILL.md serves as the "program" for Claude Code/Codex
+- **Parallel-first**: Design principle — always prefer multi-agent parallel analysis over sequential
+- **Table-heavy output**: File lists, requirements, risks all use markdown tables for scannability
+- **Three-dot diff**: Uses `git diff base...HEAD` (merge-base) for branch comparison
+- **Structured output**: Fixed 11-section template ensures consistency across runs
+
+## Gotchas
+
+1. **No Python scripts exist** despite `settings.local.json` allowing `uv run:*` — this permission is reserved for potential future use or scanner scripts
+2. **Examples assume TypeScript/React/Redux** — file classification categories (selectors, reducers, hooks, components) are framework-specific; other stacks may need adapted categories
+3. **No error handling defined** — SKILL.md doesn't specify what to do when git commands fail or agents return incomplete data
+4. **Linear git history assumed** — complex merge histories may produce confusing diffs
+5. **Branch context required** — the skill assumes the user is on the correct branch when triggered
+
+## Navigation Guide
+
+**To modify the skill workflow**: Edit `SKILL.md` — all 5 phases and the 11-section output template are defined there
+**To change execution permissions**: Edit `.claude/settings.local.json`
+**To update user-facing docs**: Edit `README.md`
+**To add a new output section**: Add to the Phase 4 section template in `SKILL.md`
diff --git a/docs/COMPARISON_schematic_vs_cartographer.md b/docs/COMPARISON_schematic_vs_cartographer.md
@@ -0,0 +1,68 @@
+# Schematic vs Cartographer Comparison
+
+## Overview
+
+| Dimension | **Schematic** | **Cartographer** |
+|------|--------------|-----------------|
+| **Goal** | Reverse-engineer a product/technical spec document from a git branch diff | Map the architecture and file purposes of an entire codebase |
+| **Input** | Git branch (`git diff base...HEAD`) | Entire codebase (all files) |
+| **Output** | 11-section spec document (Problem, Architecture, Risks...) | `docs/CODEBASE_MAP.md` + updated `CLAUDE.md` |
+| **Unit of analysis** | **What changed** (diff-centric) | **What exists now** (snapshot-centric) |
+| **Trigger phrases** | "analyze this branch", "reverse engineer a spec" | "map this codebase", "cartographer" |
+
+## Workflow Comparison
+
+| Stage | **Schematic (5 phases)** | **Cartographer (8 steps)** |
+|------|--------------------------|----------------------------|
+| 1 | Scope branch (git diff --stat) | Check existing map (incremental-update detection) |
+| 2 | Parallel exploration (2-4 agents by file type) | Scan codebase (Python script counts tokens) |
+| 3 | Cross-check gaps (find missed files) | Plan subagent assignments (grouped by token budget) |
+| 4 | Write spec (11 sections) | Spawn Sonnet subagents |
+| 5 | Verify completeness | Synthesize → Write map → Update CLAUDE.md |
+
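+## Differences in Parallelization Strategy
+
+**Schematic** groups files by **semantic role**:
+- Agent 1: Core implementation (new files)
+- Agent 2: Integration points (modified hooks/selectors)
+- Agent 3: Tests
+- Agent 4: Config & infra
+
+**Cartographer** groups files by **directory + token budget**:
+- Each agent ≤150k tokens
+- Grouped by directory/module to keep related code together
+- Enforces the rule that **Opus does not read files; Sonnet does**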
+
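+## Differences in Design Philosophy
+
+| Aspect | **Schematic** | **Cartographer** |
+|------|--------------|-----------------|
+| **Inference depth** | Infers the "why" in depth (product motivation from tests/comments) | Describes the "what" (file purposes, exports, dependencies) |
+| **Tooling** | Pure git commands, no external scripts | Ships its own Python scanning script (`scan-codebase.py`, counts tokens with tiktoken) |
+| **Incremental updates** | None (analyzes the whole branch every time) | Yes (detects git changes since `last_mapped` and updates only what changed) |
+| **Model selection** | Does not specify which model to use | Explicitly specifies Sonnet subagents (cost/capability balance) |
+| **Validation** | Phase 3 cross-check + Phase 5 completeness verification | Relies on merging subagent reports; no explicit validation step |
+| **Output format** | Fixed 11-section template, emphasizing inference | Mermaid diagrams + tables + Navigation Guide, emphasizing navigation |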
+
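+## Complementarity
+
+The two solve different problems and complement rather than compete with each other:
+
+- **Cartographer**: "What does this codebase look like?" → a panoramic map, suited to onboarding
+- **Schematic**: "What did this branch do?" → change analysis, suited to PR review / after-the-fact documentation
+
+A typical workflow: first use Cartographer to build a panoramic understanding, then use Schematic to analyze the changes on a specific branch.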
+
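+## What Schematic Can Borrow
+
+1. **Incremental update mechanism**: Cartographer's `last_mapped` + git log detection is very practical
+2. **Python scanning script**: token-budget planning is more precise than assigning agents blindly
+3. **Explicit subagent model selection**: controls cost
+4. **Mermaid diagrams in the output**: better looking than ASCII diagrams and natively supported by GitHub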
+
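+## What Cartographer Can Borrow
+
+1. **Semantic grouping strategy**: grouping by role (core/integration/tests) rather than purely by directory yields more insightful analysis
+2. **Cross-validation step**: an explicit gap-finding phase avoids omissions
+3. **Inferring the "why"**: not just describing what the code does, but also inferring product motivation
+4. **Completeness-verification checklist**: ensures the output maps one-to-one to the input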