Skip to content

routine: constitution-reminder-tool (2026-05-27)#18

Draft
jim4226 wants to merge 1 commit into
mainfrom
claude/daily-2026-05-27-constitution-reminder-tool
Draft

routine: constitution-reminder-tool (2026-05-27)#18
jim4226 wants to merge 1 commit into
mainfrom
claude/daily-2026-05-27-constitution-reminder-tool

Conversation

@jim4226
Copy link
Copy Markdown
Owner

@jim4226 jim4226 commented May 27, 2026

Summary

Adds Constitution.reminder() — a pure-read method that formats the active constitutional constraints as a plain-text block for mid-task injection into agent prompts. Six regression tests added; no existing behaviour changed.

Source

  • URL: https://www.anthropic.com/news/widening-conversation-ai
  • Published: 2026-05-19
  • Relevant quote: "Claude was given a tool it could call mid-task that returned a brief reminder of its own ethical commitments" — and internal evals showed this reduced misaligned behaviour.

Theme

Theme 3 — Constitutional / safety primitives. The article describes a new safety primitive: an on-demand agent-callable tool that surfaces ethical commitments mid-task. Constitution.reminder() is CSIS's equivalent.

What changed

File Change
csis/safety/constitution.py Added _CONSTRAINT_CATEGORIES tuple (7 human-readable labels mapping 1-to-1 with DISALLOWED_PATTERNS) and Constitution.reminder() -> str method
tests/test_safety.py 6 new tests: non-empty output, Phase-0 tag present, all base categories in text, operator-pattern count note, no false operator-note on default, purity of read

Test plan

python -m pytest tests/test_safety.py -v
# expect: 12 passed (6 pre-existing + 6 new)

Size / Risk

  • Size: trivial (88 LOC added, 0 deleted)
  • Risk: low — additive only; no chokepoints touched (Coordinator.__init__, _BackendTracker, writer_iteration_id, promotion CAS all untouched)

Generated by Claude Code

…lection

Anthropic's 2026-05-19 "Widening the conversation on frontier AI" post
describes an experimental safety primitive: giving an agent a tool it
can call mid-task to get a brief reminder of its ethical commitments.
Internal evals showed this reduced misaligned behavior.

Constitution.reminder() is the CSIS equivalent: a pure-read method that
formats the active constraint categories (7 base + any operator-added
patterns) as a plain-text block. The Coordinator or any agent loop can
prepend this to a prompt when injecting a constraint-awareness nudge.

The implementation is additive — no existing methods or chokepoints
touched. Six new regression tests cover: non-empty output, Phase-0 tag,
all base categories present, operator-pattern counting, no false
operator-note on a default Constitution, and purity of the read.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants