Commit ff46bdb
feat(results): git-native storage — design doc + implementation (#1261)
* docs: design plan for git-native results storage (#1259)
Captures the agreed architecture before implementation:
- Git is the canonical store; local clone is the working copy
- No separate index file — git tree IS the index
- Eval writes directly to clone working tree (not project-local .agentv/results/)
- Reads via git ls-tree + git cat-file --batch (no checkout)
- Pagination via cursor
- mode: github explicit in config (extension point)
Supersedes closed PR #1260. See docs/plans/git-native-results.md for full design.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
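To make the read path in the design bullets concrete, here is a minimal sketch of parsing `git ls-tree -r` output and paging through it with a cursor. It is an illustration under stated assumptions, not the code in this commit: `parseLsTree`, `paginate`, the `TreeEntry` shape, the example file names, and the choice of "last path returned" as the cursor are all hypothetical. Only the `<mode> <type> <oid>\t<path>` line format comes from git itself.

```typescript
// Hypothetical sketch: function names, types, and the cursor encoding are
// assumptions for illustration, not the project's actual API.
interface TreeEntry {
  mode: string;
  type: string;
  oid: string;
  path: string;
}

// Parse `git ls-tree -r <ref>` output, one "<mode> <type> <oid>\t<path>" per line.
function parseLsTree(output: string): TreeEntry[] {
  const entries: TreeEntry[] = [];
  for (const line of output.split("\n")) {
    const tab = line.indexOf("\t");
    if (tab === -1) continue; // blank or unrecognized line
    const [mode = "", type = "", oid = ""] = line.slice(0, tab).split(" ");
    entries.push({ mode, type, oid, path: line.slice(tab + 1) });
  }
  return entries;
}

interface Page {
  items: TreeEntry[];
  nextCursor: string | null;
}

// Cursor pagination: the cursor is the last path already returned, and the
// next page starts at the first entry that sorts strictly after it.
function paginate(entries: TreeEntry[], cursor: string | null, limit: number): Page {
  if (limit < 1) throw new Error("limit must be at least 1");
  const sorted = [...entries].sort((a, b) =>
    a.path < b.path ? -1 : a.path > b.path ? 1 : 0,
  );
  let start = 0;
  if (cursor !== null) {
    const idx = sorted.findIndex((e) => e.path > cursor);
    start = idx === -1 ? sorted.length : idx;
  }
  const items = sorted.slice(start, start + limit);
  const hasMore = start + items.length < sorted.length;
  const last = items[items.length - 1];
  return { items, nextCursor: hasMore && last ? last.path : null };
}
```

Because the cursor is a position in sort order rather than an offset, a page request stays stable even if new runs are committed between requests.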
* feat(results): Pass 1 — config schema + path renames
- Add `mode: 'github'` as required field to ResultsConfig
- Repurpose `results.path` as optional local filesystem path for clone
(default: ~/.agentv/results/<slug>/); reject old-style subdir values
(e.g. 'runs') with a migration message
- Rename ResultsRepoCachePaths → ResultsRepoLocalPaths
- Rename getResultsRepoCachePaths → getResultsRepoLocalPaths
- Rename cache_dir → local_dir in ResultsRepoStatus wire format
- normalizeResultsConfig: fill default path, expand ~, include mode
- Remove redundant local normalizeResultsConfig copy in remote.ts
- Update config-validator.ts to enforce mode and filesystem-path rule
- Update tests for new schema
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
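The normalization rules listed for Pass 1 could look roughly like the sketch below. Only `mode`, `path`, and the `~/.agentv/results/<slug>/` default are taken from the commit message. The `repo` field, the slug rule, the error wording, and the name `normalizeResultsConfigSketch` are assumptions made so the example is self-contained, and "old-style subdir value" is interpreted here as any path that is neither absolute nor `~`-prefixed.

```typescript
import * as os from "node:os";
import * as path from "node:path";

// Hypothetical shapes: `repo` and the slug derivation are assumptions.
interface ResultsConfigInput {
  mode?: string;
  repo: string; // assumed to look like "owner/name"
  path?: string;
}

interface NormalizedResultsConfig {
  mode: "github";
  repo: string;
  path: string;
}

function normalizeResultsConfigSketch(
  input: ResultsConfigInput,
  homeDir: string = os.homedir(),
): NormalizedResultsConfig {
  if (input.mode !== "github") {
    throw new Error("results.mode is required and must be 'github'");
  }
  // Assumed slug rule: collapse anything outside [A-Za-z0-9._-] to "-".
  const slug = input.repo.replace(/[^A-Za-z0-9._-]+/g, "-");
  const raw = input.path ?? `~/.agentv/results/${slug}`;
  const isHomeRelative = raw === "~" || raw.startsWith("~/");
  // Reject old-style subdirectory values such as 'runs'.
  if (!isHomeRelative && !path.isAbsolute(raw)) {
    throw new Error(
      `results.path must be an absolute or ~-prefixed filesystem path, got '${raw}'`,
    );
  }
  // Expand a leading "~" to the home directory.
  const expanded = isHomeRelative ? path.join(homeDir, raw.slice(1)) : raw;
  return { mode: "github", repo: input.repo, path: path.normalize(expanded) };
}
```

Passing `homeDir` as a parameter keeps the function deterministic in tests while still defaulting to the real home directory.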
* fix(results): fix lint + update resolveResultsRepoRunsDir + serve tests
- Fix biome string-concat lint error (single template literal)
- resolveResultsRepoRunsDir: use normalized.path directly (new design)
- getResultsRepoStatus: check existsSync(normalized.path) for available,
set local_dir to normalized.path
- serve.test.ts: update two tests to use mode:github schema and new
default path layout (~/.agentv/results/<slug>/runs/...)
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
* wip: initial git-native listing skeleton + implementation goal
- Added listGitRuns() using git ls-tree + cat-file --batch
- Improved batch parser
- Saved implementation goal document
This is early progress toward the full git-native results implementation.
More to come in follow-up commits.
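For readers unfamiliar with the batch format, the sketch below shows one way a parser for `git cat-file --batch` output might work. It is not the parser added in this commit. `parseCatFileBatch` and `BatchObject` are hypothetical names. The framing it relies on is git's documented default: a `<oid> <type> <size>` header line, then exactly `<size>` bytes of content, then a trailing newline, with `<name> missing` for objects that cannot be found.

```typescript
import { Buffer } from "node:buffer";

// Hypothetical result shape for this sketch.
interface BatchObject {
  name: string;
  type: string | null; // null when git reports the object as missing
  content: Buffer | null;
}

// Parse the stdout of `git cat-file --batch`. Sizes are byte counts, so the
// parser works on a Buffer instead of a decoded string.
function parseCatFileBatch(stdout: Buffer): BatchObject[] {
  const objects: BatchObject[] = [];
  let offset = 0;
  while (offset < stdout.length) {
    const newline = stdout.indexOf(0x0a, offset);
    if (newline === -1) throw new Error("truncated header");
    const header = stdout.subarray(offset, newline).toString("utf8");
    offset = newline + 1;

    if (header.endsWith(" missing")) {
      const name = header.slice(0, -" missing".length);
      objects.push({ name, type: null, content: null });
      continue;
    }

    const parts = header.split(" ");
    const [name = "", type = "", sizeText = ""] = parts;
    const size = Number.parseInt(sizeText, 10);
    if (parts.length !== 3 || !Number.isInteger(size) || size < 0) {
      throw new Error(`unexpected header: ${header}`);
    }
    if (offset + size > stdout.length) throw new Error("truncated content");

    objects.push({ name, type, content: stdout.subarray(offset, offset + size) });
    offset += size + 1; // skip the content and the newline git appends after it
  }
  return objects;
}
```

Reading the content by byte length rather than scanning for a delimiter matters because blob contents may themselves contain newlines.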
* fix: remove duplicate execFileAsync declaration
* feat(results): improve git-native listing metadata shape
- Enrich GitListedRun with display_name, test_count, avg_score, size_bytes
- Update remote.ts mapping to populate ResultFileMeta fields
- Read path now returns data Studio can render
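As an illustration of the enriched metadata, the sketch below derives the four fields named above from a run's raw content. The field names come from the commit message. Everything else is an assumption for the example: that a run is stored as JSONL with one record per test, that each record carries a numeric `score`, that the display name is the last path segment, and the name `summarizeRun`.

```typescript
import { Buffer } from "node:buffer";

// Field names follow the commit message; how they are computed is assumed.
interface GitListedRunSketch {
  display_name: string;
  test_count: number;
  avg_score: number | null; // null when no record has a numeric score
  size_bytes: number;
}

function summarizeRun(runPath: string, jsonl: string): GitListedRunSketch {
  let testCount = 0;
  const scores: number[] = [];
  for (const line of jsonl.split("\n")) {
    if (line.trim() === "") continue;
    let record: unknown;
    try {
      record = JSON.parse(line);
    } catch {
      continue; // skip malformed lines instead of failing the whole listing
    }
    testCount += 1;
    if (typeof record === "object" && record !== null) {
      const score = (record as { score?: unknown }).score;
      if (typeof score === "number" && Number.isFinite(score)) scores.push(score);
    }
  }
  const total = scores.reduce((sum, s) => sum + s, 0);
  return {
    // Assumed convention: show the last path segment as the run's name.
    display_name: runPath.split("/").filter(Boolean).pop() ?? runPath,
    test_count: testCount,
    avg_score: scores.length > 0 ? total / scores.length : null,
    size_bytes: Buffer.byteLength(jsonl, "utf8"),
  };
}
```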
* chore: update implementation goal + docker ownership fix
- Add user: ${UID}:${GID} to docker-compose for mounted repo permissions
- Update goal document with current status
- Reinstall dependencies in worktree
* fix(results): restore git-native run listing
Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
* chore(results): satisfy lint
Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
* fix(test): stabilize git subprocess checks
Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
* chore(test): satisfy lint and timeouts
Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
* feat(results): finish git-native results flow
Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
* fix(results): complete remote-only studio flow
Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
* seed repo
* fix(test): isolate git env in serve regression
Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
* fix(test): restore readme after temp repo setup
Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
* fix(test): trim low-value flaky coverage
Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
* fix(results): materialize synced remote runs
Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
* fix(results): atomically materialize synced runs
Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
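The commit message does not say how materialization is made atomic. One common approach, sketched here purely as an assumption, is to write the run into a staging directory next to the target and then rename it into place, so readers never see a half-written run. `materializeAtomically` and its `files` parameter are hypothetical.

```typescript
import * as fs from "node:fs";
import * as path from "node:path";

// Hypothetical sketch: write files to a sibling staging directory, then
// rename it over the target. The staging directory sits in the same parent
// so the rename stays on one filesystem.
function materializeAtomically(targetDir: string, files: Record<string, string>): void {
  const parent = path.dirname(targetDir);
  fs.mkdirSync(parent, { recursive: true });
  const staging = fs.mkdtempSync(path.join(parent, `.${path.basename(targetDir)}.tmp-`));
  try {
    for (const [relativePath, content] of Object.entries(files)) {
      const dest = path.join(staging, relativePath);
      fs.mkdirSync(path.dirname(dest), { recursive: true });
      fs.writeFileSync(dest, content);
    }
    // Remove any previous copy, then move the finished directory into place.
    // There is a short window in which the target does not exist, but it is
    // never visible in a partially written state.
    fs.rmSync(targetDir, { recursive: true, force: true });
    fs.renameSync(staging, targetDir);
  } catch (err) {
    fs.rmSync(staging, { recursive: true, force: true });
    throw err;
  }
}
```

If a write fails partway through, the staging directory is removed and the previously materialized run, if any, is left untouched.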
* docs(studio): clarify remote results behavior
Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
* fix(cli): treat AGENTV_HOME log as info
Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
* docs(studio): refresh remote results screenshots
Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
---------
Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
Co-authored-by: Test User <test@example.com>
1 parent af118c6 commit ff46bdb
26 files changed
Lines changed: 2032 additions & 435 deletions
File tree
- apps
- cli
- src/commands/results
- test
- commands
- eval/pipeline
- results
- studio/src
- components
- lib
- routes
- projects
- web/src
- assets/screenshots
- content/docs/docs/tools
- docs/plans
- packages/core
- src
- evaluation
- loaders
- validation
- test
- evaluation
- loaders
- validation