Article Relevance Audit — Week of 2026-03-13 #41

**Articles scanned:** 35 evergreen | **Run date:** 2026-03-13 | **New since last audit:** 2 articles

> Items marked ⚠️ **Persistent** appeared in the 2026-03-11 audit and remain unresolved.

---

## 🔗 Link Health

**Articles scanned:** 35 | **Internal links checked:** 94 | **Issues found:** 1

### Broken Internal Links

| Article | Link Text | Broken URL | Correct URL |
|---------|-----------|------------|-------------|
| `agentic-ai-transforming-dev-teams` | "GitHub Copilot CLI worktrees" | `/articles/copilot-fleet-mode-video-pipeline` | `/articles/video-pipeline-with-fleet-mode` |

⚠️ **Persistent** — flagged in 2026-03-11 audit, not yet fixed. Located in `## The Circle Replaces the Hierarchy` section, line 31.

**Fix:** In `src/content/articles/agentic-ai-transforming-dev-teams.mdx`, change:
```
[GitHub Copilot CLI worktrees](/articles/copilot-fleet-mode-video-pipeline)
```
to:
```
[GitHub Copilot CLI worktrees](/articles/video-pipeline-with-fleet-mode)
```
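
A check like the one that caught this link can be approximated with a short script. The sketch below is a hypothetical illustration in Python, not the audit workflow's actual implementation: the function name `find_broken_links`, the regex, and the sample slug set are all assumptions. It extracts `/articles/<slug>` targets from markdown links and reports any slug missing from a known set.

```python
import re

# Matches markdown links whose target is an internal article path, e.g.
# [text](/articles/some-slug). Captures the link text and the slug.
ARTICLE_LINK = re.compile(r"\[([^\]]+)\]\(/articles/([a-z0-9-]+)\)")


def find_broken_links(body: str, known_slugs: set[str]) -> list[tuple[str, str]]:
    """Return (link text, slug) pairs whose slug is not a known article."""
    return [
        (text, slug)
        for text, slug in ARTICLE_LINK.findall(body)
        if slug not in known_slugs
    ]


# Hypothetical inputs: one line of article body and a partial slug set.
body = "[GitHub Copilot CLI worktrees](/articles/copilot-fleet-mode-video-pipeline)"
known_slugs = {"video-pipeline-with-fleet-mode", "agentic-ai-transforming-dev-teams"}

broken = find_broken_links(body, known_slugs)
```

With the corrected URL in place, the same call returns an empty list. Paths such as `/articles/*.png` are not matched, because the pattern only accepts slug characters before the closing parenthesis.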

### External Links

Unable to verify all 522 external links due to sandbox network restrictions. One URL returned a fetch error (possible timeout/blocking — not confirmed 404):

| Article | URL | Status |
|---------|-----|--------|
| `agent-harnesses-controlling-ai-agents-2026` | `(www.anthropic.com/redacted)` | Unreachable (fetch failed) |

Note: Image paths using `/articles/*.png` in `github-agentic-workflows-hands-on-guide` and `github-copilot-visual-studio-guide` resolve to confirmed files in `public/articles/` — not broken.
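
The distinction drawn above between an unreachable URL and a confirmed 404 can be made explicit in code. This is a minimal sketch assuming the checker fetches with Python's standard `urllib`; the function name `classify_fetch_error` and the status labels are hypothetical and only mirror the wording used in this report.

```python
from urllib.error import HTTPError, URLError


def classify_fetch_error(exc: Exception) -> str:
    """Map a fetch exception to an audit status label (labels are illustrative)."""
    # HTTPError subclasses URLError, so it has to be checked first.
    if isinstance(exc, HTTPError):
        return "Broken (404)" if exc.code == 404 else f"HTTP error ({exc.code})"
    # No HTTP response at all: DNS failure, blocked network, or timeout.
    if isinstance(exc, (URLError, TimeoutError)):
        return "Unreachable (fetch failed)"
    # Anything else is unexpected and should surface to the caller.
    raise exc
```

Only the first branch is evidence that a link is actually dead. A sandbox that blocks outbound traffic lands in the second branch, which is why that result is reported as unconfirmed.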

---

## 📊 Factual Freshness

**Articles scanned:** 35 | **Stale references found:** 5

⚠️ **Persistent** — all items below appeared in the 2026-03-11 audit.

### Potentially Stale References

| Article | What It Says | Current Reality | Severity | Suggested Update |
|---------|-------------|----------------|----------|------------------|
| `ai-fixes-its-own-bugs` | "SWE-agent solves **12.47%** of real-world bugs on the SWE-bench evaluation set" (from NeurIPS 2024 paper) | Top models now solve 50%+ on the SWE-bench Verified subset; the field has moved dramatically since the NeurIPS 2024 baseline | 🔴 High | Add context: "Since the NeurIPS 2024 paper, the field has advanced rapidly: top models on SWE-bench Verified now exceed 50% resolution rates." The 12.47% figure is historically accurate but reads as the current state. |
| `ai-fixes-its-own-bugs` | "combines CodeQL's semantic analysis engine with GPT-4o" | GitHub's 2024 blog post describes GPT-4o; the underlying model for CodeQL Autofix has likely been updated since Feb 2024 | 🔴 High | Verify current model with GitHub's latest Autofix docs; update to "Copilot" or current model name if GPT-4o is no longer accurate. |
| `copilot-developer-fulfillment` | "With 20 million GitHub Copilot users..." | GitHub surpassed 30M+ Copilot users by late 2025/early 2026 | 🟡 Medium | Update to reflect current user count with a citation to GitHub's latest figures |
| `copilot-developer-fulfillment` | References "the 2024 DORA report" | 2025 DORA report is now available — note: `stanford-study-ai-roi-in-engineering` already references the 2025 version | 🟡 Medium | Update citation to 2025 DORA report; the findings directionally align but the newer report includes AI-specific data |
| `choosing-the-right-ai-sdk` | Table shows `GitHub Copilot SDK \| Technical Preview (Jan 2026)` | ~7 weeks have passed since the Technical Preview launch; GA status may have changed | 🟡 Medium | Verify current SDK status at https://github.blog/changelog/ — update table row and any "preview" language if it has reached GA |

---

## 🔀 Cross-Linking Opportunities

**New zero-inbound articles (published since last audit):** 2 | **Carryover zero-inbound:** 8

### 🆕 NEW: `stop-pulling-your-agentic-unicorns-off-the-field`
**File:** `src/content/articles/stop-pulling-your-agentic-unicorns-off-the-field.mdx`
**Published:** 2026-03-12 | **Inbound links:** 0 | **Related articles not linking to it:**

1. **Link from:** `stanford-study-ai-roi-in-engineering`
   - **Suggested section:** `## What Engineering Leaders Should Actually Do`
   - **Insert after:** "Teams that push through this curve compound their gains. Teams that don't fall further behind."
   - **Suggested text:** `The compounding effect is exactly what I laid out in [why you shouldn't pull your agentic unicorns off the field](/articles/stop-pulling-your-agentic-unicorns-off-the-field) — that 5% of power users delivering 30%+ capacity expansion are precisely the engineers who pushed through the learning curve. Protect them.`

2. **Link from:** `agentic-ai-transforming-dev-teams`
   - **Suggested section:** `## The Circle Replaces the Hierarchy`
   - **Insert near:** "...tested against real user needs, and either validated or killed."
   - **Suggested text:** `But to keep that loop spinning, you have to resist the temptation to pull your best agentic contributors off the field to run training sessions. I wrote about [why that's the most expensive resourcing mistake in engineering today](/articles/stop-pulling-your-agentic-unicorns-off-the-field).`

3. **Link from:** `turning-ai-skeptics-into-believers`
   - **Suggested section:** `## The Bottom Line`
   - **Insert at the end of the closing section**
   - **Suggested text:** `Once you've converted them, resist the urge to deploy your power users as trainers. I wrote about [keeping your agentic unicorns on the field](/articles/stop-pulling-your-agentic-unicorns-off-the-field) — the data on why that trade-off rarely pays off.`

### 🆕 NEW: `github-copilot-vs-the-world`
**File:** `src/content/articles/github-copilot-vs-the-world.mdx`
**Published:** 2026-03-12 | **Inbound links:** 0 | **Related articles not linking to it:**

1. **Link from:** `choosing-the-right-ai-sdk`
   - **Suggested section:** The GitHub Copilot SDK section (around line 41–43)
   - **Insert after:** "I built my SRE agent, I went from zero to a working prototype with MCP server integration in two hours."
   - **Suggested text:** `If you want the full competitive picture on why GitHub's platform approach is pulling ahead of standalone tools, I broke down [why the AI coding war is already over](/articles/github-copilot-vs-the-world) — and what that means for your SDK decision.`

2. **Link from:** `copilot-developer-fulfillment`
   - **Suggested section:** `## The Skeptic's Corner`
   - **Insert near the skeptic objections section**
   - **Suggested text:** `For a deeper look at where Copilot stands relative to Cursor, Codeium, and the rest of the field right now, I laid that out in [my breakdown of the AI coding competitive landscape](/articles/github-copilot-vs-the-world).`

### ⚠️ Persistent Carryover: `vibe-testing-when-ai-agents-goodhart-your-test-suite`
**Published:** 2026-03-08 | **Inbound links:** 0

1. **Link from:** `tests-are-everything-agentic-ai`
   - **Suggested section:** `## The Testing Reality for Agentic Teams`
   - **Insert after:** "Once I understood this, building comprehensive test suites became way simpler. Not easier — simpler."
   - **Suggested text:** `What I didn't fully appreciate then: AI agents can game that simplicity. I went deep on what happens when agents optimize for coverage metrics instead of correctness in [my vibe testing breakdown](/articles/vibe-testing-when-ai-agents-goodhart-your-test-suite) — Goodhart's Law with a test runner.`

2. **Link from:** `test-enforcement-architecture-ai-agents`
   - **Suggested section:** Intro / first section
   - **Suggested text:** `The failure mode this architecture prevents is what I'm calling [vibe testing](/articles/vibe-testing-when-ai-agents-goodhart-your-test-suite) — when AI agents generate tests that inflate coverage metrics while verifying nothing. Read that first if you want the context for why structural enforcement matters.`
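
Zero-inbound detection of the kind reported in this section reduces to a small graph computation. The sketch below is illustrative only: the function name `zero_inbound` and the sample link graph are assumptions, and the edges shown are hypothetical rather than taken from the site.

```python
def zero_inbound(outbound: dict[str, set[str]]) -> list[str]:
    """Return slugs that no other article links to, sorted for stable output."""
    linked: set[str] = set()
    for source, targets in outbound.items():
        linked |= targets - {source}  # a self-link does not count as inbound
    return sorted(slug for slug in outbound if slug not in linked)


# Hypothetical link graph: article slug -> slugs it links out to.
outbound = {
    "tests-are-everything-agentic-ai": {"test-enforcement-architecture-ai-agents"},
    "test-enforcement-architecture-ai-agents": {"tests-are-everything-agentic-ai"},
    "vibe-testing-when-ai-agents-goodhart-your-test-suite": {
        "tests-are-everything-agentic-ai"
    },
}

orphans = zero_inbound(outbound)
```

In this sample graph only the vibe-testing article is returned. Adding either suggested link above would give it an inbound edge and remove it from the result.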

---

## 🌿 Evergreen Maintenance

**Refresh candidates found:** 5 | All have `updatedDate: none` and are 21–28 days old.

⚠️ **Persistent** — all candidates below appeared in the 2026-03-11 audit and have not been updated.

| Article | Published | Topic Area | Why Refresh? | Priority |
|---------|-----------|------------|--------------|----------|
| `context-engineering-key-to-ai-development` | 2026-02-14 | Context Engineering | 22 inbound links make this a pillar piece; MCP has evolved, new Copilot CLI context features released | 🔴 High |
| `ai-fixes-its-own-bugs` | 2026-02-14 | Autonomous Bug Fixing | SWE-bench leaderboard moved dramatically; CodeQL Autofix model likely updated; needs factual refresh (see above) | 🔴 High |
| `choosing-the-right-ai-sdk` | 2026-02-14 | AI SDKs | SDK landscape evolving fast; Copilot SDK preview status may have changed; OpenAI Agents SDK updates | 🟡 Medium |
| `building-the-future-with-ai` | 2026-02-13 | AI Development Overview | 9 inbound links; broad intro piece published earliest — should reference newer articles added since (stop-pulling, github-copilot-vs-the-world, vibe-testing, specs-equal-tests) | 🟡 Medium |
| `agentic-ai-transforming-dev-teams` | 2026-02-20 | Agentic Dev Teams | Has a broken internal link (see Link Health above); 41% AI code stat sourced from LinkedIn post, not primary research | 🟡 Medium |
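
The 21–28 day range can be reproduced from the published dates in the table and the 2026-03-13 run date. The sketch below assumes a refresh rule of "never updated and at least 21 days old"; that threshold is inferred from the range stated above and is not a documented setting, and the helper names are hypothetical.

```python
from datetime import date
from typing import Optional

RUN_DATE = date(2026, 3, 13)  # run date of this audit


def age_in_days(published: date, run_date: date = RUN_DATE) -> int:
    """Whole days between publication and the audit run."""
    return (run_date - published).days


def needs_refresh(
    published: date,
    updated: Optional[date],
    min_age_days: int = 21,  # assumed threshold, inferred from the 21-28 day range
    run_date: date = RUN_DATE,
) -> bool:
    """Flag articles that were never updated and are at least min_age_days old."""
    return updated is None and age_in_days(published, run_date) >= min_age_days


# Published dates taken from the table above; none of these has an updatedDate.
published_dates = {
    "context-engineering-key-to-ai-development": date(2026, 2, 14),
    "building-the-future-with-ai": date(2026, 2, 13),
    "agentic-ai-transforming-dev-teams": date(2026, 2, 20),
}

ages = {slug: age_in_days(d) for slug, d in published_dates.items()}
```

For these three rows the ages come out to 27, 28, and 21 days respectively, which matches the range quoted at the top of this section.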

### Detailed Refresh: `context-engineering-key-to-ai-development`
**File:** `src/content/articles/context-engineering-key-to-ai-development.mdx`
- **What changed since publication:** MCP (Model Context Protocol) ecosystem has grown significantly — more servers, broader IDE support. Copilot CLI added new context management features in its Jan 2026 changelog. Several new htek.dev articles reference this piece as a pillar but it doesn't reference them back.
- **Suggested updates:**
  - Add section on MCP as the emerging standard for context injection
  - Reference [`agent-hooks-controlling-ai-codebase`](/articles/agent-hooks-controlling-ai-codebase) and [`vibe-testing-when-ai-agents-goodhart-your-test-suite`](/articles/vibe-testing-when-ai-agents-goodhart-your-test-suite) as applied examples
  - Update any references to Copilot CLI context features with Jan 2026 release notes
- **Sources:** [Copilot CLI Jan 2026 changelog](https://github.blog/changelog/2026-01-14-github-copilot-cli-enhanced-agents-context-management-and-new-ways-to-install/), [MCP official site](modelcontextprotocol.io/redacted)

### Detailed Refresh: `ai-fixes-its-own-bugs`
**File:** `src/content/articles/ai-fixes-its-own-bugs.mdx`
- **What changed since publication:** SWE-bench top scores moved from 12–14% (early 2024) to 50%+ on SWE-bench Verified (early 2026). The article presents 12.47% as if it were the current benchmark figure, which understates the state of the art by ~4×.
- **Suggested updates:**
  - Add a callout box: "Since the NeurIPS 2024 paper, SWE-bench Verified scores have climbed to 50%+. The 12.47% figure is the 2024 baseline, not the current ceiling."
  - Verify and update CodeQL Autofix model reference (GPT-4o → current)
  - Reference the 2025 SWE-bench leaderboard analysis paper already cited inline




> Generated by [Article Relevance Audit — Weekly Health Check](https://github.com/htekdev/htek-dev-site/actions/runs/23073210251) · [◷](https://github.com/search?q=repo%3Ahtekdev%2Fhtek-dev-site+is%3Aissue+%22gh-aw-workflow-call-id%3A+htekdev%2Fhtek-dev-site%2Farticle-relevance-audit%22&type=issues)
