[claude-hackernews] Reply draft: Open Bias Show HN, LLM-call vs tool-call seam (id=47938497) by NiveditJain · Pull Request #52 · exospherehost/claude-hackernews

NiveditJain · 2026-05-04T09:32:17Z

Summary

Reply draft for Open Bias Show HN (id=47938497, 21 points / 4 comments / 5 days old). Open Bias is an LLM-call-layer proxy that judges/steers model input/output against a single RULES.md. The OP (algomaniac) elaborates a clear design rationale in the top comment (id=47938512) and is soliciting design discussion.

The reply leads with substantive engagement on the layer choice (LLM call vs tool call) and ties one named FailProof policy (warn-destructive-sql) to one concrete tool-call-shaped failure (a DROP DATABASE the LLM judged correct). One disclosure link, no install command, no policy comma-list, no three-scope talk, no second link, ~143 words. ASCII-only punctuation.

Discovery + thread URLs

Discovery: https://hn.algolia.com/?q=Show+HN+agent&dateRange=pastWeek&type=story&page=0 while sweeping for adjacent agent-runtime-control launches not yet covered by an open PR.
Thread: https://news.ycombinator.com/item?id=47938497
Parent comment (OP's design rationale): https://news.ycombinator.com/item?id=47938512

Three-surface duplicate check

grep -rl "item?id=47938497" drafts/ comments/ -> no hits
gh pr list --search "47938497" -> no hits

Thread is uncovered.

Status

Draft, pending manual post. Per CLAUDE.md "Comments via PR (never direct post)", no submit click was performed; this PR is the review-and-approval gate.

Test plan

Sanity-check the reply text against CLAUDE.md "Tone for discussing it on HN" (Thread-fit gate, Length and shape, Forbidden in any reply).
Skim the rest of the Open Bias thread for any new comments or sub-threads that change the framing before posting.
Post manually to HN as a reply to id=47938512.
After posting, ask the agent to append the comment-permalink to the HN: line and re-commit.
Merge this PR after the comment is logged.

Summary by CodeRabbit

Documentation
- Added internal draft content capturing discussion and planning notes.

…call seam (id=47938497)

coderabbitai · 2026-05-04T09:32:28Z

📝 Walkthrough

Walkthrough

A new Markdown reply draft is added for a Hacker News "Show HN" thread about an "Open Bias" behavioral proxy. The file captures target links, OP framing, the reply text with disclosure and technical comparison, and internal operational notes for the FailProof team, enforcing ASCII-only formatting constraints.

Changes

Reply Draft for Open Bias Thread

Layer / File(s)	Summary
Metadata & Framing `drafts/2026-05-04T093024Z.md` (lines 1–24)	HN thread links, story context, discovery notes, OP framing summary, and gate-qualification checklist explaining why the thread passes the FailProof-mention criteria.
Reply Content & Constraints `drafts/2026-05-04T093024Z.md` (lines 25–37)	Complete reply text including disclosure, LLM-call vs tool-call comparison argument, concrete examples, and follow-up question; ASCII-only and word-count constraints enforced.
Internal Insights & Operations `drafts/2026-05-04T093024Z.md` (lines 41–56)	Composability positioning notes for FailProof team, RULES.md trade-off framing, quote-worthy OP line, audience considerations, and runtime/editor probing observations with duplicate-guard rationale.

Estimated code review effort

🎯 1 (Trivial) | ⏱️ ~3 minutes

Possibly related PRs

[claude-hackernews] Restore drafts/ vs comments/ split: route writes to drafts/ #6: Switches agent and docs to write drafts/ instead of comments/, establishing the drafts-based workflow this PR follows.
[claude-hackernews] Switch HN automation to drafts-only mode #2: Implements drafts-only workflow and tracking, foundational to the artifacts and structure used here.
[claude-hackernews] Densify cron prompt to mirror working claude-reddit prompt #5: Enforces ASCII-only formatting and draft-file generation (comments/.md), directly related to this draft's constraints and structure.

Poem

🐰 A draft takes shape, with care and thought,
To share what knowledge bravely brought—
Through FailProof's lens, the agents play,
While ASCII keeps the chaos at bay! ✨

🚥 Pre-merge checks | ✅ 5

✅ Passed checks (5 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title accurately describes the pull request as a reply draft for a specific Hacker News thread, with the item ID and the key technical discussion point (LLM-call vs tool-call seam).
Docstring Coverage	✅ Passed	No functions found in the changed files to evaluate docstring coverage. Skipping docstring coverage check.
Linked Issues check	✅ Passed	Check skipped because no linked issues were found for this pull request.
Out of Scope Changes check	✅ Passed	Check skipped because no linked issues were found for this pull request.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

_{Review rate limit: 4/5 reviews remaining, refill in 12 minutes.}

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

Actionable comments posted: 1

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@drafts/2026-05-04T093024Z.md`:
- Around line 27-35: The fenced reply block containing the reviewer quote
(starting with "(disclosure: I work on FailProof AI:
https://github.com/exospherehost/failproofai)") lacks a language tag, which
triggers MD040; update the opening fence from ``` to ```text (or another
appropriate language identifier) to silence the lint warning and keep the
content unchanged so the block compiles correctly.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: 3e6b5d2b-98af-4319-a617-e7f49aadeadc

📥 Commits

Reviewing files that changed from the base of the PR and between ebbce06 and ee37014.

📒 Files selected for processing (1)

drafts/2026-05-04T093024Z.md

coderabbitai · 2026-05-04T09:33:57Z

+```
+(disclosure: I work on FailProof AI: https://github.com/exospherehost/failproofai)
+
+Interesting choice to put the seam at the LLM call. We landed on the tool call, and the two layers catch different failure modes. Steering the model handles output drift; where it tends to miss is when the reasoning was fine but the tool input still does the wrong thing (a DROP DATABASE in a migration the LLM judged correct, an rm -rf with a path built from a stale env var, a force push that looked plausible to the proposer). At that point the LLM judge has already let it through; you want a hook running against the literal argv before exec.
+
+For coding agents we ship that as a small PreToolUse policy, e.g. warn-destructive-sql fires on a Bash whose argv contains DROP TABLE or DROP DATABASE.
+
+Curious whether RULES.md compiles to anything tool-call-shaped, or stays at the LLM seam?
+```


⚠️ Potential issue | 🟡 Minor | ⚡ Quick win

Add a language tag to the fenced reply block (Line 27).

The code fence is missing a language identifier, which triggers MD040 and creates avoidable lint noise.

Suggested patch

-``` +```text (disclosure: I work on FailProof AI: https://github.com/exospherehost/failproofai) ... Curious whether RULES.md compiles to anything tool-call-shaped, or stays at the LLM seam?

</details>  <details> <summary>📝 Committable suggestion</summary> > ‼️ **IMPORTANT** > Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements. ```suggestion

🧰 Tools

🪛 markdownlint-cli2 (0.22.1)

[warning] 27-27: Fenced code blocks should have a language specified

(MD040, fenced-code-language)

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@drafts/2026-05-04T093024Z.md` around lines 27 - 35, The fenced reply block containing the reviewer quote (starting with "(disclosure: I work on FailProof AI: https://github.com/exospherehost/failproofai)") lacks a language tag, which triggers MD040; update the opening fence from ``` to ```text (or another appropriate language identifier) to silence the lint warning and keep the content unchanged so the block compiles correctly.

[claude-hackernews] draft: Open Bias Show HN reply, LLM-call vs tool-…

ee37014

…call seam (id=47938497)

coderabbitai Bot reviewed May 4, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[claude-hackernews] Reply draft: Open Bias Show HN, LLM-call vs tool-call seam (id=47938497)#52

[claude-hackernews] Reply draft: Open Bias Show HN, LLM-call vs tool-call seam (id=47938497)#52
NiveditJain wants to merge 1 commit intomainfrom
luv-61

NiveditJain commented May 4, 2026 •

edited by coderabbitai Bot

Loading

Uh oh!

coderabbitai Bot commented May 4, 2026 •

edited

Loading

Walkthrough

Changes

Estimated code review effort

Possibly related PRs

Poem

Uh oh!

coderabbitai Bot left a comment

Uh oh!

coderabbitai Bot May 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

NiveditJain commented May 4, 2026 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Discovery + thread URLs

Three-surface duplicate check

Status

Test plan

Summary by CodeRabbit

Uh oh!

coderabbitai Bot commented May 4, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Estimated code review effort

Possibly related PRs

Poem

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot May 4, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

NiveditJain commented May 4, 2026 •

edited by coderabbitai Bot

Loading

coderabbitai Bot commented May 4, 2026 •

edited

Loading