ResonantForge

ResonantForge is a Python CLI that generates synthetic two-party conversation corpora with planted quality signals — built for testing and evaluating systems that score, classify, route, or analyze structured conversations. Point it at a profile, give it a seed, and it produces a realistic corpus of dialogue transcripts complete with quality-signal annotations, source-document citations, participant trajectories, and planted human review corrections. The simulation layer is fully deterministic from --seed; in --dry-run mode the entire output is bit-identical across runs, and in live mode the structure, metadata, and signal annotations are reproducible while the LLM-generated prose itself varies per call.

The shipped profiles model customer-support conversations (SaaS and professional services verticals), but the architecture is domain-agnostic. Subclassing Profile lets you generate corpora for any two-party conversation domain — sales calls, onboarding flows, technical interviews, advisory sessions, intake conversations, or anything else that fits a two-participant structure.

v0.2.0 — initial public release. Output formats and APIs may evolve in future versions; pin to a specific version if you need stability. Feedback welcome via GitHub Issues.

Who it's for

ResonantForge is built for engineers and teams building conversation evaluation systems — anyone who needs realistic, controlled, reproducible dialogue data to test scoring pipelines, validate classifiers, or benchmark routing logic without exposing real user conversations. The deterministic core makes it practical to use as a stable CI fixture. The profile system makes it adaptable to whatever conversation domain you actually work in.

Requirements

Python 3.11+
pip
(Optional) An Anthropic API key for live prose generation

60-second quickstart

No API key is needed to start: `--dry-run` skips all LLM calls and generates deterministic placeholder prose.

git clone https://github.com/ResonantIQ/resonantforge
cd resonantforge
pip install -e .
rforge --help
rforge generate --dry-run --smoke

This produces a smoke corpus (~100 conversations) under the default output directory ./corpus/saas/. Inspect what was generated:

rforge stats corpus --profile saas
rforge inspect corpus --profile saas

For live corpus generation (calls the Anthropic API; a smoke run costs a few cents, a full run costs a few dollars at current Haiku pricing):

export ANTHROPIC_API_KEY=your-key-here
rforge generate --smoke

You can override the model with RFORGE_MODEL if you want to use something other than the current default (claude-haiku-4-5-20251001).

CLI overview

| Command | What it does |
| --- | --- |
| `rforge generate` | Run the corpus generation pipeline. Produces JSONL artifacts under `./corpus/<profile>/`. |
| `rforge validate <corpus_dir> --profile <name>` | Verify a generated corpus against its manifest. |
| `rforge inspect <corpus_dir> --profile <name>` | Print sample records from each corpus artifact. |
| `rforge stats <corpus_dir> --profile <name>` | Print manifest statistics, skip rates, and determinism hashes. |
| `rforge replay extract-envelopes` | Freeze validator inputs into replay envelopes. One LLM call per conversation; run once. |
| `rforge replay run` | Run all validators against frozen envelopes at zero LLM cost. Use this for validator iteration. |
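
Because `rforge generate` writes JSONL artifacts, a generated corpus can also be sanity-checked with a few lines of Python alongside `rforge stats`. The sketch below is a minimal example, not part of ResonantForge: the helper name `count_jsonl_records` is hypothetical, and since the artifact file names are not listed here, it simply discovers every `*.jsonl` file under the corpus directory.

```python
import json
from collections import Counter
from pathlib import Path


def count_jsonl_records(corpus_dir: Path) -> Counter:
    """Count records per *.jsonl file under corpus_dir (searched recursively).

    Hypothetical helper: file names are discovered by glob, not assumed.
    """
    counts: Counter = Counter()
    for path in sorted(corpus_dir.rglob("*.jsonl")):
        key = path.relative_to(corpus_dir).as_posix()
        counts[key] = 0  # make empty files visible in the result
        with path.open(encoding="utf-8") as fh:
            for line in fh:
                if line.strip():
                    json.loads(line)  # raises ValueError on a malformed record
                    counts[key] += 1
    return counts
```

Calling `count_jsonl_records(Path("corpus/saas"))` after the quickstart should return one entry per artifact file, and a malformed line fails loudly instead of being skipped.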

Key `generate` flags

| Flag | Default | Description |
| --- | --- | --- |
| `--profile` | `saas` | Profile name: `saas` or `professional_services` |
| `--seed` | `42` | Deterministic PRNG seed. Same seed → identical simulation structure. |
| `--smoke` | off | Smoke mode: ~100 conversations instead of the full default. |
| `--dry-run` | off | Skip LLM calls; use placeholder prose. No API key required. |
| `--accounts N` | profile default | Number of synthetic accounts to simulate. |
| `--months N` | profile default | Duration of simulation in months. |
| `--out-root PATH` | `./corpus` | Root directory for output. Corpus lands at `<out-root>/<profile>/`. |
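
Taken together, `--seed` and `--dry-run` mean that two dry runs with the same seed should produce byte-identical output, which is what makes the corpus usable as a CI fixture. The following is a minimal sketch of one way to check that from a test, assuming `rforge` is on `PATH`. It uses only the flags listed above; the function names `digest_tree` and `dry_run_digest` are hypothetical and not part of ResonantForge.

```python
import hashlib
import subprocess
from pathlib import Path


def digest_tree(root: Path) -> str:
    """Return one SHA-256 hex digest covering every file under root."""
    h = hashlib.sha256()
    for path in sorted(p for p in root.rglob("*") if p.is_file()):
        # Hash the relative path too, so a renamed file changes the digest.
        h.update(path.relative_to(root).as_posix().encode("utf-8"))
        h.update(b"\0")
        h.update(path.read_bytes())
        h.update(b"\0")
    return h.hexdigest()


def dry_run_digest(out_root: Path, seed: int = 42) -> str:
    """Run a dry-run smoke generation into out_root and digest the result."""
    subprocess.run(
        ["rforge", "generate", "--dry-run", "--smoke",
         "--seed", str(seed), "--out-root", str(out_root)],
        check=True,  # fail the check if the CLI exits non-zero
    )
    return digest_tree(out_root)
```

A CI test could then assert that `dry_run_digest(Path("run_a"))` equals `dry_run_digest(Path("run_b"))`. In live mode the prose varies per call, so a whole-tree digest like this is only meaningful for dry runs.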

Extending ResonantForge

To generate corpora for a new domain, subclass Profile from resonantforge/profiles/base.py. A profile defines the brand voice variants the simulated participants use, the knowledge base chunks the corpus can cite, the lexicons that drive rule-based signal extraction, the volume of planted-quality conversations to inject, and the correction patterns used to model human reviewer noise. The abstract base class enforces the interface — unimplemented methods raise NotImplementedError until overridden, so gaps surface immediately rather than silently producing empty outputs.
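
To make the "gaps surface immediately" behavior concrete, here is a self-contained sketch of the pattern. It does not import ResonantForge: the `Profile` class below is a stand-in, and every method name (`brand_voices`, `knowledge_base_chunks`, `signal_lexicons`) is a hypothetical placeholder. The real interface is whatever `resonantforge/profiles/base.py` defines.

```python
class Profile:
    """Stand-in base class; NOT the real resonantforge Profile interface."""

    name = "base"

    def brand_voices(self) -> list[str]:
        raise NotImplementedError

    def knowledge_base_chunks(self) -> list[dict]:
        raise NotImplementedError

    def signal_lexicons(self) -> dict[str, list[str]]:
        raise NotImplementedError


class SalesCallProfile(Profile):
    """Partially implemented profile for an illustrative sales-call domain."""

    name = "sales_calls"

    def brand_voices(self) -> list[str]:
        return ["consultative", "direct"]

    def knowledge_base_chunks(self) -> list[dict]:
        return [{"id": "pricing-001", "text": "Annual plans are billed upfront."}]

    # signal_lexicons is deliberately left unimplemented.


def unimplemented(profile: Profile) -> list[str]:
    """List the (hypothetical) interface methods that still raise."""
    missing = []
    for method in ("brand_voices", "knowledge_base_chunks", "signal_lexicons"):
        try:
            getattr(profile, method)()
        except NotImplementedError:
            missing.append(method)
    return missing
```

Here `unimplemented(SalesCallProfile())` reports `signal_lexicons` as the remaining gap, which is the kind of early failure the base class is described as enforcing.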

The current pipeline assumes a two-party dialogue structure (two participants alternating turns). Adapting ResonantForge to multi-party conversations, monologues, or non-dialogue formats would require pipeline-level changes beyond a Profile subclass.

See resonantforge/profiles/base.py for the full interface and resonantforge/profiles/saas.py for a complete worked example.

Prior art

ResonantForge sits within a small wave of recent work on synthetic ground-truth corpora for AI evaluation. Two projects directly shaped it:

gbrain-evals (Garry Tan) was the project that first showed me that fictional ground-truth corpora — synthetic worlds with planted facts and perturbations — were a viable foundation for evaluating AI systems. The approach of seeding deterministic fictional content with controlled perturbations to test what a system can actually detect runs through both projects, though our specific architectures differ.

OrgForge (Flynt, 2026) introduced the physics-cognition boundary that ResonantForge implements directly: a deterministic engine maintains the SimEvent ground-truth bus, and LLMs operate only at designated injection points, generating surface prose from validated proposals rather than mutating state directly. ResonantForge's QualityPlanInjector and SimEvent registry are this pattern applied to two-party conversations rather than multi-artifact organizational corpora.
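
The boundary described above can be illustrated with a toy sketch. This is not ResonantForge's or OrgForge's code: the `SimEvent` fields, the signal names, and the functions `simulate`, `placeholder_prose`, and `render` are all invented for illustration. The point is only the shape of the pattern, in which a seeded engine owns the ground truth and the prose writer can read an event but has no way to alter it.

```python
import random
from dataclasses import dataclass
from typing import Callable


@dataclass(frozen=True)
class SimEvent:
    """Immutable ground-truth record (fields are illustrative only)."""
    conversation_id: int
    planted_signal: str


def simulate(seed: int, n: int) -> list[SimEvent]:
    """Deterministic engine: the same seed yields the same events."""
    rng = random.Random(seed)
    signals = ["missed_escalation", "wrong_citation", "clean"]  # made-up labels
    return [SimEvent(i, rng.choice(signals)) for i in range(n)]


def placeholder_prose(event: SimEvent) -> str:
    """Dry-run stand-in for an LLM call; reads the event, cannot change it."""
    return f"[conversation {event.conversation_id}: {event.planted_signal}]"


def render(events: list[SimEvent],
           write_prose: Callable[[SimEvent], str]) -> list[dict]:
    """Attach prose at the injection point; annotations come from the engine."""
    return [
        {"id": e.conversation_id, "signal": e.planted_signal,
         "prose": write_prose(e)}
        for e in events
    ]
```

Swapping `placeholder_prose` for a function that calls a model changes only the `prose` field. The ids and planted signals stay fixed for a given seed, which mirrors the dry-run versus live-mode reproducibility split described at the top of this README.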

If you're working in this space, both are worth your time.
