Skip to content

Commit 4b22559

Browse files
committed
Merge remote-tracking branch 'origin/main' into garrytan/gstack-eval-optimization
2 parents 64bbbb2 + 9d47619 commit 4b22559

29 files changed

Lines changed: 1528 additions & 831 deletions

File tree

CHANGELOG.md

Lines changed: 24 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -13,6 +13,29 @@
1313
- `test:evals` and `test:e2e` now auto-select based on diff (was: all-or-nothing)
1414
- New `test:evals:all` and `test:e2e:all` scripts for explicit full runs
1515

16+
## 0.6.1 — 2026-03-17 — Boil the Lake
17+
18+
Every gstack skill now follows the **Completeness Principle**: always recommend the
19+
full implementation when AI makes the marginal cost near-zero. No more "Choose B
20+
because it's 90% of the value" when option A is 70 lines more code.
21+
22+
Read the philosophy: https://garryslist.org/posts/boil-the-ocean
23+
24+
- **Completeness scoring**: every AskUserQuestion option now shows a completeness
25+
score (1-10), biasing toward the complete solution
26+
- **Dual time estimates**: effort estimates show both human-team and CC+gstack time
27+
(e.g., "human: ~2 weeks / CC: ~1 hour") with a task-type compression reference table
28+
- **Anti-pattern examples**: concrete "don't do this" gallery in the preamble so the
29+
principle isn't abstract
30+
- **First-time onboarding**: new users see a one-time introduction linking to the
31+
essay, with option to open in browser
32+
- **Review completeness gaps**: `/review` now flags shortcut implementations where the
33+
complete version costs <30 min CC time
34+
- **Lake Score**: CEO and Eng review completion summaries show how many recommendations
35+
chose the complete option vs shortcuts
36+
- **CEO + Eng review dual-time**: temporal interrogation, effort estimates, and delight
37+
opportunities all show both human and CC time scales
38+
1639
## 0.6.0.1 — 2026-03-17
1740

1841
- **`/gstack-upgrade` now catches stale vendored copies automatically.** If your global gstack is up to date but the vendored copy in your project is behind, `/gstack-upgrade` detects the mismatch and syncs it. No more manually asking "did we vendor it?" — it just tells you and offers to update.
@@ -76,7 +99,7 @@
7699
## 0.5.2 — 2026-03-17
77100

78101
- **Your design consultant now takes creative risks.** `/design-consultation` doesn't just propose a safe, coherent system — it explicitly breaks down SAFE CHOICES (category baseline) vs. RISKS (where your product stands out). You pick which rules to break. Every risk comes with a rationale for why it works and what it costs.
79-
- **See the competition before you choose.** When you opt into research, the agent browses competitor sites with screenshots and accessibility tree analysis — not just web search results. You see what the landscape looks like before making design decisions.
102+
- **See the landscape before you choose.** When you opt into research, the agent browses real sites in your space with screenshots and accessibility tree analysis — not just web search results. You see what's out there before making design decisions.
80103
- **Preview pages that look like your product.** The preview page now renders realistic product mockups — dashboards with sidebar nav and data tables, marketing pages with hero sections, settings pages with forms — not just font swatches and color palettes.
81104

82105
## 0.5.1 — 2026-03-17

CLAUDE.md

Lines changed: 17 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -130,6 +130,23 @@ CHANGELOG.md is **for users**, not contributors. Write it like product release n
130130
- No jargon: say "every question now tells you which project and branch you're in" not
131131
"AskUserQuestion format standardized across skill templates via preamble resolver."
132132

133+
## AI effort compression
134+
135+
When estimating or discussing effort, always show both human-team and CC+gstack time:
136+
137+
| Task type | Human team | CC+gstack | Compression |
138+
|-----------|-----------|-----------|-------------|
139+
| Boilerplate / scaffolding | 2 days | 15 min | ~100x |
140+
| Test writing | 1 day | 15 min | ~50x |
141+
| Feature implementation | 1 week | 30 min | ~30x |
142+
| Bug fix + regression test | 4 hours | 15 min | ~20x |
143+
| Architecture / design | 2 days | 4 hours | ~5x |
144+
| Research / exploration | 1 day | 3 hours | ~3x |
145+
146+
Completeness is cheap. Don't recommend shortcuts when the complete implementation
147+
is a "lake" (achievable) not an "ocean" (multi-quarter migration). See the
148+
Completeness Principle in the skill preamble for the full philosophy.
149+
133150
## Local plans
134151

135152
Contributors can store long-range vision docs and design documents in `~/.gstack-dev/plans/`.

CONTRIBUTING.md

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -54,6 +54,10 @@ the issue, fix it, and open a PR.
5454
This is the best way to contribute: fix gstack while doing your real work, in the
5555
project where you actually felt the pain.
5656

57+
### Session awareness
58+
59+
When you have 3+ gstack sessions open simultaneously, every question tells you which project, which branch, and what's happening. No more staring at a question thinking "wait, which window is this?" The format is consistent across all 13 skills.
60+
5761
## Working on gstack inside the gstack repo
5862

5963
When you're editing gstack skills and want to test them by actually using gstack

0 commit comments

Comments
 (0)