liminate-session-contracts

A small file that tracks what an agent has verified, what it inferred, and what's still open — written in Liminate so the interpreter can check it.

The note is short, diff-able, and runnable. The bound is the point.

Part of the Prosecode family — a set of tools for writing, verifying, and transferring structured reasoning.

What it does

A session contract travels with a working session. It records — in plain words — whether the primary source has been read, what the current claims are based on, which decisions have been locked, which questions are still open, and which corrections the user has given about how the model should engage.

Before any consequential claim, the contract should cite the claim text against the source. The interpreter checks whether the cited text actually appears in the source. The model doesn't declare verification — the interpreter verifies it.

Example

remember a source called design-doc with verified
cite "uses Redis Streams" from design-doc
remember a decision called auth-method with locked
add "open: rate-limit strategy" to questions

When the source is gone and only the contract remains, the model retrieves or discloses — it does not fabricate. 252 cross-session continuity probes, zero fabrications across four rounds of benchmarking on Opus 4.7 and Sonnet 4.6.

Built by Liminate

Liminate is a prose-as-syntax language where plain English sentences execute directly. These five repos form a system for writing, verifying, and transferring structured reasoning.

	Repo	What it does
	liminate	The language and interpreter. Bounded vocabulary, deterministic execution, domain packs.
← this repo	liminate-session-contracts	Tracks verified sources, inferred claims, locked decisions, and user corrections as executable `.limn` contracts.
	prosecode-prompt-compiler	Compiles user prompts into structured intent before the agent responds. Seven verbs, twenty-four slots.
	prosecode-context-pager	Scores conversation history against current intent. Decides what to keep, summarize, or drop.
	prosecode-handoff-packet	Packages a working session for another agent to continue — preserving what was verified and what wasn't.

→ liminate.dev

Install

This skill follows the agentskills.io SKILL.md standard. Any compliant agent can load it.

# Claude Code — all projects
git clone https://github.com/rmichaelthomas/liminate-session-contracts.git ~/.claude/skills/liminate-session-contracts

# Claude Code — one project
git clone https://github.com/rmichaelthomas/liminate-session-contracts.git .claude/skills/liminate-session-contracts

# Codex CLI
git clone https://github.com/rmichaelthomas/liminate-session-contracts.git ~/.codex/skills/liminate-session-contracts

# Gemini CLI
git clone https://github.com/rmichaelthomas/liminate-session-contracts.git ~/.gemini/skills/liminate-session-contracts

# Any SKILL.md-compatible agent
git clone https://github.com/rmichaelthomas/liminate-session-contracts.git .agents/skills/liminate-session-contracts

The Liminate interpreter is optional. Install it if you want the agent to check the contract as it writes:

pip install liminate
liminate path/to/session-contract.limn --pack references/session_pack.json

How it works

Use

Ask the agent to start a contract at the beginning of a session:

"Start a session contract for this design review."

The agent responds in two channels. Channel 1 is the prose answer — the work itself. Channel 2 is a fenced limn code block at the end of the response containing only contract mutations. The prose never narrates contract updates; the contract block never contains prose.

When the user corrects the model's approach — "don't defer," "check the actual code," "give me everything" — the correction is recorded in the contract as a session correction. The model consults the corrections list before every subsequent response. Corrections persist across sessions: the next model that reads the contract starts with the calibration already applied.

At the end of the session, the accumulated .limn file is yours. Save it, diff it, hand it to another agent, run it through the interpreter.

The contracts this skill produces can be scanned at receipts.liminate.dev — paste a .limn contract (or open a session-end permalink) and Receipts runs it through the interpreter, rendering reasoning state, citation checks, tracked decisions, open questions, and an annotated source view. That's the one-click path from a working session to a rendered inspection.

The session pack

The session pack adds 5 domain words to Liminate's 35-word base vocabulary:

Word	Type	What it does
`claim`	noun	Descriptor for verified or inferred assertions
`source`	noun	Descriptor for primary sources
`decision`	noun	Descriptor for locked or open decisions
`cite`	verb	`cite <text> from <source>` — runtime substring check, errors if the text isn't in the source
`verify`	verb	`verify <claim> from <source>` — structural comparison, flags match/mismatch with divergence details

cite is the constraining primitive. The Liminate interpreter gate was active on 101 of the benchmarked turns. It never fired. The models prefer to omit a citation rather than fabricate one. The gate is a working safety net waiting for the case the instruction fails to prevent.

Tiers

Tier	Available	Behavior
1	Conversation only	Emit contract delta as `limn` code block in each response.
2	File tools + Liminate installed	Write the contract to disk. Run each delta through the interpreter.
3	Persistent storage + session pack	Load the session pack. Use `cite` and `verify`. Persist across sessions.

Where session contracts add value

Sessions that span multiple turns, or multiple sessions where the source won't be present later
Tracking what was verified versus inferred
Accumulating decisions as locked, open, or deferred
Acting on facts established in earlier sessions without re-providing the source

Where they don't

On coding tasks, the Karpathy CLAUDE.md (132k stars) outperforms session contracts — 2.38 vs 2.00 overall on a head-to-head bench with 8 tasks across 4 failure modes. Karpathy wins on catching wrong assumptions (3.00 vs 2.00); the two tie at ceiling on overcomplication and orthogonal edits; both fail on unverified execution.

Session contracts and Karpathy-style instruction sets solve different problems. One enforces cross-session continuity through executable constraints. The other enforces coding discipline through natural-language principles. A user who needs both uses both.

Known limitations

Citation engagement varies by model and scenario. Opus 4.7 emits few or no cite statements; Sonnet 4.6 engages selectively, primarily on multi-source scenarios with explicit structure. Retrieval rate ranges from 0% (Opus) to 35% (Sonnet). Fabrication rate is zero regardless.
Hard-prior single-turn tasks show small regressions. On tasks where the source contradicts the model's training data, the skill condition shows +1–3 fabrications versus baseline (n=9 per round). The interpreter gate is designed to catch these but requires a contract delta, which single-turn Q&A tasks typically don't produce.
Cross-agent portability is untested. Benchmarked on Claude models only. Codex, Gemini, and Copilot have not been tested.
The gate's catch behavior is unmeasured. Across 101 gated turns, the interpreter gate never fired. Correctly designed infrastructure, unexercised revision path.
Lists must be remembered before add. The interpreter rejects add "X" to <list> with ERROR_SEMANTIC if <list> was not previously remembered. In Receipts this renders as an empty Tracked decisions / Open questions / Session corrections section even though the contract source contains the add statements. SKILL.md (as of c89642f, May 2026) requires the agent to prepend a baseline preamble declaring the standard lists before saving any contract. The template ships with the declarations; ad-hoc contracts must include them explicitly. The benchmarks/bench_list_seeding.py guard catches regressions of this behavior.
Verification primitives check grounding, not truth. A passing cite proves the text exists in the declared source — not that the claim is true, the source is authentic, or the excerpt is representative. A passing verify proves two records agree — not that either is correct. A passing measure proves a nearby number exists in the source — not that it's the right number. The receipt is an audit trail, not a truth oracle. See docs/TRUST-BOUNDARY.md for the full honest-claims analysis.

Local-only mode

Session contracts run entirely on your machine with no server dependency. See docs/LOCAL-ONLY.md for a complete walkthrough: install the interpreter, write a contract, run it, read the output — no account, no network, no data leaving your device.

Security and data flow

See docs/TRUST-BOUNDARY.md for a description of what data moves where across the three usage modes (local-only, Receipts save, fragment-encoded inspection), what the server can and cannot see, and what is not yet built.

License

Apache 2.0. See LICENSE.

Name		Name	Last commit message	Last commit date
Latest commit History 71 Commits
.githooks		.githooks
.github/workflows		.github/workflows
benchmarks		benchmarks
docs		docs
examples		examples
hooks		hooks
references		references
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
SKILL.md		SKILL.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

liminate-session-contracts

What it does

Example

Built by Liminate

Install

How it works

Use

The session pack

Tiers

Where session contracts add value

Where they don't

Known limitations

Local-only mode

Security and data flow

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

liminate-session-contracts

What it does

Example

Built by Liminate

Install

How it works

Use

The session pack

Tiers

Where session contracts add value

Where they don't

Known limitations

Local-only mode

Security and data flow

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages