Skip to content

[claude-hackernews] Reply draft: Open Bias Show HN, LLM-call vs tool-call seam (id=47938497)#52

Open
NiveditJain wants to merge 1 commit intomainfrom
luv-61
Open

[claude-hackernews] Reply draft: Open Bias Show HN, LLM-call vs tool-call seam (id=47938497)#52
NiveditJain wants to merge 1 commit intomainfrom
luv-61

Conversation

@NiveditJain
Copy link
Copy Markdown
Member

@NiveditJain NiveditJain commented May 4, 2026

Summary

Reply draft for Open Bias Show HN (id=47938497, 21 points / 4 comments / 5 days old). Open Bias is an LLM-call-layer proxy that judges/steers model input/output against a single RULES.md. The OP (algomaniac) elaborates a clear design rationale in the top comment (id=47938512) and is soliciting design discussion.

The reply leads with substantive engagement on the layer choice (LLM call vs tool call) and ties one named FailProof policy (warn-destructive-sql) to one concrete tool-call-shaped failure (a DROP DATABASE the LLM judged correct). One disclosure link, no install command, no policy comma-list, no three-scope talk, no second link, ~143 words. ASCII-only punctuation.

Discovery + thread URLs

Three-surface duplicate check

  • grep -rl "item?id=47938497" drafts/ comments/ -> no hits
  • gh pr list --search "47938497" -> no hits

Thread is uncovered.

Status

Draft, pending manual post. Per CLAUDE.md "Comments via PR (never direct post)", no submit click was performed; this PR is the review-and-approval gate.

Test plan

  • Sanity-check the reply text against CLAUDE.md "Tone for discussing it on HN" (Thread-fit gate, Length and shape, Forbidden in any reply).
  • Skim the rest of the Open Bias thread for any new comments or sub-threads that change the framing before posting.
  • Post manually to HN as a reply to id=47938512.
  • After posting, ask the agent to append the comment-permalink to the HN: line and re-commit.
  • Merge this PR after the comment is logged.

Summary by CodeRabbit

  • Documentation
    • Added internal draft content capturing discussion and planning notes.

@coderabbitai
Copy link
Copy Markdown

coderabbitai Bot commented May 4, 2026

📝 Walkthrough

Walkthrough

A new Markdown reply draft is added for a Hacker News "Show HN" thread about an "Open Bias" behavioral proxy. The file captures target links, OP framing, the reply text with disclosure and technical comparison, and internal operational notes for the FailProof team, enforcing ASCII-only formatting constraints.

Changes

Reply Draft for Open Bias Thread

Layer / File(s) Summary
Metadata & Framing
drafts/2026-05-04T093024Z.md (lines 1–24)
HN thread links, story context, discovery notes, OP framing summary, and gate-qualification checklist explaining why the thread passes the FailProof-mention criteria.
Reply Content & Constraints
drafts/2026-05-04T093024Z.md (lines 25–37)
Complete reply text including disclosure, LLM-call vs tool-call comparison argument, concrete examples, and follow-up question; ASCII-only and word-count constraints enforced.
Internal Insights & Operations
drafts/2026-05-04T093024Z.md (lines 41–56)
Composability positioning notes for FailProof team, RULES.md trade-off framing, quote-worthy OP line, audience considerations, and runtime/editor probing observations with duplicate-guard rationale.

Estimated code review effort

🎯 1 (Trivial) | ⏱️ ~3 minutes

Possibly related PRs

Poem

🐰 A draft takes shape, with care and thought,
To share what knowledge bravely brought—
Through FailProof's lens, the agents play,
While ASCII keeps the chaos at bay!

🚥 Pre-merge checks | ✅ 5
✅ Passed checks (5 passed)
Check name Status Explanation
Description Check ✅ Passed Check skipped - CodeRabbit’s high-level summary is enabled.
Title check ✅ Passed The title accurately describes the pull request as a reply draft for a specific Hacker News thread, with the item ID and the key technical discussion point (LLM-call vs tool-call seam).
Docstring Coverage ✅ Passed No functions found in the changed files to evaluate docstring coverage. Skipping docstring coverage check.
Linked Issues check ✅ Passed Check skipped because no linked issues were found for this pull request.
Out of Scope Changes check ✅ Passed Check skipped because no linked issues were found for this pull request.

✏️ Tip: You can configure your own custom pre-merge checks in the settings.


Review rate limit: 4/5 reviews remaining, refill in 12 minutes.

Comment @coderabbitai help to get the list of available commands and usage tips.

Copy link
Copy Markdown

@coderabbitai coderabbitai Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 1

🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@drafts/2026-05-04T093024Z.md`:
- Around line 27-35: The fenced reply block containing the reviewer quote
(starting with "(disclosure: I work on FailProof AI:
https://github.com/exospherehost/failproofai)") lacks a language tag, which
triggers MD040; update the opening fence from ``` to ```text (or another
appropriate language identifier) to silence the lint warning and keep the
content unchanged so the block compiles correctly.
🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

  • Push a commit to this branch (recommended)
  • Create a new PR with the fixes

ℹ️ Review info
⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: 3e6b5d2b-98af-4319-a617-e7f49aadeadc

📥 Commits

Reviewing files that changed from the base of the PR and between ebbce06 and ee37014.

📒 Files selected for processing (1)
  • drafts/2026-05-04T093024Z.md

Comment on lines +27 to +35
```
(disclosure: I work on FailProof AI: https://github.com/exospherehost/failproofai)

Interesting choice to put the seam at the LLM call. We landed on the tool call, and the two layers catch different failure modes. Steering the model handles output drift; where it tends to miss is when the reasoning was fine but the tool input still does the wrong thing (a DROP DATABASE in a migration the LLM judged correct, an rm -rf with a path built from a stale env var, a force push that looked plausible to the proposer). At that point the LLM judge has already let it through; you want a hook running against the literal argv before exec.

For coding agents we ship that as a small PreToolUse policy, e.g. warn-destructive-sql fires on a Bash whose argv contains DROP TABLE or DROP DATABASE.

Curious whether RULES.md compiles to anything tool-call-shaped, or stays at the LLM seam?
```
Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

⚠️ Potential issue | 🟡 Minor | ⚡ Quick win

Add a language tag to the fenced reply block (Line 27).

The code fence is missing a language identifier, which triggers MD040 and creates avoidable lint noise.

Suggested patch
-```
+```text
 (disclosure: I work on FailProof AI: https://github.com/exospherehost/failproofai)
 ...
 Curious whether RULES.md compiles to anything tool-call-shaped, or stays at the LLM seam?
</details>

<!-- suggestion_start -->

<details>
<summary>📝 Committable suggestion</summary>

> ‼️ **IMPORTANT**
> Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

```suggestion

🧰 Tools
🪛 markdownlint-cli2 (0.22.1)

[warning] 27-27: Fenced code blocks should have a language specified

(MD040, fenced-code-language)

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@drafts/2026-05-04T093024Z.md` around lines 27 - 35, The fenced reply block
containing the reviewer quote (starting with "(disclosure: I work on FailProof
AI: https://github.com/exospherehost/failproofai)") lacks a language tag, which
triggers MD040; update the opening fence from ``` to ```text (or another
appropriate language identifier) to silence the lint warning and keep the
content unchanged so the block compiles correctly.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant