docs: reconcile README/site claims with verified artifacts (E2 claim-audit, R5) by brownjuly2003-code · Pull Request #121 · brownjuly2003-code/agentflow

brownjuly2003-code · 2026-06-30T11:04:03Z

What

E2 claim-audit (road-to-9.8 R5): a line-by-line pass over every capability claim in README.md and site/, checking each maps to a runnable test/artifact, with non-goals stated without budget-framing.

Verified-correct (left as-is)

Claim	Backing artifact
1.06 s p50 / 1.99 s p95 freshness, 238 ms tuned, ~15 s TTL	`docs/freshness-benchmark.md` (measured arms)
"all six metrics" + "2.5 s p95 staleness budget"	`catalog.py` (6 `MetricDefinition`) + `config/contracts/metric.*.v1.yaml` (`p95_staleness_budget_seconds: 2.5`)
38 DV2 tables (8 hubs / 8 links / 22 sats)	`dv2-multi-branch/architecture.md` + `demo_evidence.md`
3 dbt mart models	`warehouse/.../dbt/models/marts/`
v1.5.0 published on PyPI + npm	live registry query: `agentflow-runtime` / `agentflow-client` / `@yuliaedomskikh/agentflow-client` all `1.5.0`
42 referenced doc/script paths	all exist (0 broken links)

Fixed drift

README — 12 required status checks → 13 + added build-smoke (branch protection's required_status_checks has 13 contexts; the prose was missing one). 960+ unit tests → 1,500+ (1501 verified). Dropped budget-framing paid from the Scope non-goal (rather than a paid managed cloud → a managed cloud).
site/index.html — the performance-baseline panel (56 / 260 / 330 ms, 27.8 RPS, 0%) cited docs/benchmark-baseline.json, which has since been regenerated with CI-runner numbers (140 / 610 / 23000 ms). Re-pointed the source to docs/release-readiness.md, which holds that exact aggregate run (569 requests, 0 failures).
docs/dv2-multi-branch/RELEASE_STATUS.md (the artifact the v1.5_published README badge links to) — was stale at v1.4.0; refreshed to v1.5.0 (header, status line verified 2026-06-30 via live registries, registry-table rows, tag-state row, re-verify pins; tag c99d094). Full v1.5.0 release mechanics remain for the formal release cut (E1).

Verify

Doc-coupled tests green (test_examples, test_release_artifacts, test_contract_dependencies); no code touched. Remaining residual drift out of this PR's README/site scope: docs/glossary.md carries the same stale benchmark-baseline.json citation (noted for a later docs pass).

🤖 Generated with Claude Code

…audit, R5) Line-by-line audit of every capability claim in README.md and site/ against its backing artifact. Verified-correct and left as-is: 1.06 s / 1.99 s freshness (docs/freshness-benchmark.md), 6 metrics + 2.5 s p95 staleness budget (config/contracts/metric.*.v1.yaml), 38 DV2 tables = 8 hubs / 8 links / 22 sats (dv2-multi-branch/architecture.md + demo_evidence.md), 3 dbt marts, all 42 referenced doc/script paths exist, and v1.5.0 published on PyPI + npm (confirmed via live registry query). Fixed the drift the audit found: - README: "12 required status checks" -> "13" and added build-smoke to the list (branch protection's required_status_checks has 13 contexts, build-smoke was missing from the prose); "960+ unit tests" -> "1,500+" (1501 verified this cycle); dropped the budget-framing "paid" from the Scope non-goal ("rather than a paid managed cloud" -> "rather than a managed cloud"). - site/index.html: the performance-baseline panel (56 / 260 / 330 ms, 27.8 RPS, 0%) cited docs/benchmark-baseline.json, which has since been regenerated with CI-runner numbers (140 / 610 / 23000 ms). Re-pointed the source to docs/release-readiness.md, which holds that exact aggregate run (569 requests, 0 failures). - docs/dv2-multi-branch/RELEASE_STATUS.md (the artifact the "v1.5_published" README badge links to): refreshed from v1.4.0 to v1.5.0 — header, status line (verified 2026-06-30 via live registries), registry table rows, tag-state row, and re-verify pins. PyPI agentflow-runtime/agentflow-client 1.5.0 and npm @yuliaedomskikh/agentflow-client 1.5.0, tag c99d094. Full v1.5.0 release mechanics remain for the formal release cut (E1). Verify: doc-coupled tests (test_examples, test_release_artifacts, test_contract_dependencies) green; no code touched. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

github-actions · 2026-06-30T11:04:19Z

DORA Metrics

Window: last 30 days
Branch: main
Deployment frequency: 132 total / 30.8 per week
Lead time for changes: avg 0.32h / median 0.0h
Change failure rate: 78.79% (104/132)
MTTR: 0.25h across 3 incident(s)

brownjuly2003-code merged commit 46ad065 into main Jun 30, 2026
18 of 19 checks passed

brownjuly2003-code deleted the docs/e2-claim-audit branch June 30, 2026 11:23

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs: reconcile README/site claims with verified artifacts (E2 claim-audit, R5)#121

docs: reconcile README/site claims with verified artifacts (E2 claim-audit, R5)#121
brownjuly2003-code merged 1 commit into
mainfrom
docs/e2-claim-audit

brownjuly2003-code commented Jun 30, 2026

Uh oh!

github-actions Bot commented Jun 30, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

brownjuly2003-code commented Jun 30, 2026

What

Verified-correct (left as-is)

Fixed drift

Verify

Uh oh!

github-actions Bot commented Jun 30, 2026

DORA Metrics

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants