
Breaking Change Detection - Design Progress

Current State: Implemented

We have a complete implementation of breaking change detection in src/db/diff/breaking.rs. This module analyzes schema diffs and classifies changes as either safe (can deploy directly) or breaking (requires a mitigation strategy).

What Was Built

  • MitigationStrategy enum with four strategies: DualWrite, Backfill, Ratchet, Destructive
  • BreakingChangeKind enum with 17 specific change types, each mapped to a mitigation strategy
  • BreakingChange struct with kind, mitigation strategy, and human-readable description
  • BreakingChangeAnalysis aggregator with query methods (is_safe(), by_mitigation(), count_by_mitigation())
  • analyze_breaking_changes() function that walks a NamespaceDiff
  • Type change classification (safe widening vs breaking narrowing)
  • 94 passing tests (unit + integration)
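The widening-vs-narrowing classification can be pictured as a small comparison of value ranges. The following is an illustrative sketch, not the actual code in breaking.rs; the `ScalarType` variants and the `is_safe_widening` name are assumptions for the example:

```rust
/// Hypothetical scalar types, for illustration only.
#[derive(Clone, Copy, PartialEq)]
enum ScalarType {
    SmallInt, // 16-bit
    Integer,  // 32-bit
    BigInt,   // 64-bit
}

/// A type change is safe if every value of `from` fits in `to` (widening);
/// narrowing can truncate or reject data and is therefore breaking.
fn is_safe_widening(from: ScalarType, to: ScalarType) -> bool {
    fn bits(t: ScalarType) -> u8 {
        match t {
            ScalarType::SmallInt => 16,
            ScalarType::Integer => 32,
            ScalarType::BigInt => 64,
        }
    }
    bits(to) >= bits(from)
}

fn main() {
    assert!(is_safe_widening(ScalarType::Integer, ScalarType::BigInt)); // widen: safe
    assert!(!is_safe_widening(ScalarType::BigInt, ScalarType::Integer)); // narrow: breaking
    println!("widening checks pass");
}
```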

API Overview

```rust
use tern::db::diff::{diff_namespaces, NamespaceDiff};
use tern::db::diff::breaking::{analyze_breaking_changes, MitigationStrategy};

let diff = diff_namespaces(&source, &target);
let analysis = analyze_breaking_changes(&diff);

if analysis.is_safe() {
    println!("Migration is safe to apply directly");
} else {
    println!("Found {} breaking changes:", analysis.len());
    for change in analysis.iter() {
        println!("  [{}] {}", change.mitigation.as_str(), change.description);
    }

    // Query by mitigation strategy
    let ratchet_count = analysis.count_by_mitigation(MitigationStrategy::Ratchet);
    println!("Changes requiring NOT VALID pattern: {}", ratchet_count);
}
```

Design Evolution

Original Design (Rejected)

The original implementation used a ChangeSeverity enum with three levels:

  • NonBreaking - Safe changes
  • Warning - Might fail depending on data
  • Breaking - Definitely problematic

The "Warning" category was flawed. If a migration might fail, it IS breaking. You cannot deploy it with confidence.

Current Design: Mitigation Strategies

Rather than classifying by severity, we classify by what kind of process is required to safely execute the change:

| Strategy | Description | Examples |
|---|---|---|
| DualWrite | Requires parallel structures with synchronized writes | Rename column/table, change column type |
| Backfill | Requires populating data before completion | Add NOT NULL to existing column |
| Ratchet | Requires NOT VALID + backfill + VALIDATE pattern | Add UNIQUE/CHECK/FK/PK constraint |
| Destructive | Intentionally removes data/structure (irreversible) | Drop table/column, remove enum value |
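The table amounts to a total function from change kind to strategy. A minimal sketch, using a hypothetical subset of the 17 `BreakingChangeKind` variants (the variant names here are assumptions, not the actual ones in breaking.rs):

```rust
#[derive(Debug, PartialEq, Clone, Copy)]
enum MitigationStrategy { DualWrite, Backfill, Ratchet, Destructive }

/// A hypothetical subset of the kinds described in the table above.
#[derive(Clone, Copy)]
enum BreakingChangeKind {
    RenameColumn,
    ChangeColumnType,
    AddNotNull,
    AddUniqueConstraint,
    AddCheckConstraint,
    DropTable,
    DropColumn,
}

/// Every kind maps to exactly one strategy; the exhaustive match
/// guarantees no kind is left unclassified.
fn mitigation_for(kind: BreakingChangeKind) -> MitigationStrategy {
    use BreakingChangeKind::*;
    use MitigationStrategy::*;
    match kind {
        RenameColumn | ChangeColumnType => DualWrite,
        AddNotNull => Backfill,
        AddUniqueConstraint | AddCheckConstraint => Ratchet,
        DropTable | DropColumn => Destructive,
    }
}

fn main() {
    assert_eq!(mitigation_for(BreakingChangeKind::RenameColumn), MitigationStrategy::DualWrite);
    assert_eq!(mitigation_for(BreakingChangeKind::DropTable), MitigationStrategy::Destructive);
}
```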

Why This Is Better

  1. Binary safety: A change is either Safe or it requires mitigation - no ambiguous middle ground
  2. Actionable: Each strategy implies a specific decomposition pattern
  3. Pattern-based: Maps directly to known PostgreSQL migration patterns
  4. Time-aware: Acknowledges that some changes fundamentally cannot be atomic

The NOT VALID Ratchet Pattern

PostgreSQL provides a mechanism to safely add CHECK and FOREIGN KEY constraints (NOT VALID is not accepted on UNIQUE or PRIMARY KEY constraints; those use a concurrently built index instead, as shown in the Ratchet pattern below):

```sql
-- Step 1: Add constraint without validating existing data (instant, brief lock only)
ALTER TABLE users ADD CONSTRAINT users_email_not_empty
    CHECK (email <> '') NOT VALID;

-- Step 2: New inserts/updates are now validated (the "ratchet" is engaged)
-- Meanwhile, fix any existing violations through backfill/cleanup

-- Step 3: Once all data complies, validate the constraint
ALTER TABLE users VALIDATE CONSTRAINT users_email_not_empty;
```

This pattern creates a ratchet: once engaged, it prevents new violations while giving you time to fix existing ones.
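A decomposition generator could emit these statements mechanically. A sketch under assumptions (the function name and the two-statement shape are hypothetical; the real tern codebase may structure this differently):

```rust
/// Emit the NOT VALID ratchet steps for a CHECK constraint.
/// The backfill/cleanup between the two statements is data-dependent
/// and left to the operator.
fn ratchet_check_steps(table: &str, name: &str, expr: &str) -> [String; 2] {
    [
        format!("ALTER TABLE {table} ADD CONSTRAINT {name} CHECK ({expr}) NOT VALID;"),
        format!("ALTER TABLE {table} VALIDATE CONSTRAINT {name};"),
    ]
}

fn main() {
    let steps = ratchet_check_steps("users", "users_email_not_empty", "email <> ''");
    assert!(steps[0].contains("NOT VALID"));
    assert!(steps[1].starts_with("ALTER TABLE users VALIDATE CONSTRAINT"));
    for s in &steps {
        println!("{s}");
    }
}
```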

Mitigation Pattern Details

DualWrite (Rename Pattern)

To rename users.email → users.email_address:

  1. Add new column email_address (Safe)
  2. Deploy application that writes to BOTH columns
  3. Backfill: UPDATE users SET email_address = email WHERE email_address IS NULL
  4. Deploy application that reads from new column
  5. Deploy application that writes ONLY to new column
  6. Drop old column email (Destructive, but now safe because nothing uses it)
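The six steps above alternate between schema changes and application deploys, and their order is what makes the rename safe. A sketch that models the plan as data (the `Actor`/`Step` names are hypothetical, for illustration only):

```rust
/// Who performs a step: a database migration or an application deploy.
#[derive(Debug, PartialEq)]
enum Actor { Schema, App }

struct Step {
    actor: Actor,
    action: &'static str,
}

/// The dual-write rename plan, in order. The destructive drop is last,
/// once nothing reads or writes the old column.
fn rename_column_plan() -> Vec<Step> {
    vec![
        Step { actor: Actor::Schema, action: "add column email_address" },
        Step { actor: Actor::App,    action: "write to both columns" },
        Step { actor: Actor::Schema, action: "backfill email_address from email" },
        Step { actor: Actor::App,    action: "read from email_address" },
        Step { actor: Actor::App,    action: "write only to email_address" },
        Step { actor: Actor::Schema, action: "drop column email" },
    ]
}

fn main() {
    let plan = rename_column_plan();
    assert_eq!(plan.len(), 6);
    assert_eq!(plan[0].actor, Actor::Schema);
    assert_eq!(plan.last().unwrap().action, "drop column email");
}
```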

Backfill (NOT NULL Pattern)

To add NOT NULL to users.email:

  1. Add CHECK constraint with NOT VALID: CHECK (email IS NOT NULL) NOT VALID (Ratchet)
  2. Backfill any NULL values
  3. Validate: VALIDATE CONSTRAINT ...
  4. Add actual NOT NULL: ALTER COLUMN email SET NOT NULL
  5. Drop the CHECK constraint (now redundant)
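The five steps above can likewise be emitted as a SQL sequence. This is a sketch with hypothetical identifier naming; note that PostgreSQL 12+ uses the validated CHECK constraint to skip the full-table scan when executing SET NOT NULL, which is why step 4 follows step 3:

```rust
/// Emit the NOT NULL backfill sequence for a column.
/// The backfill itself (step 2) is data-dependent and shown as a comment.
fn not_null_steps(table: &str, column: &str) -> Vec<String> {
    let check = format!("{table}_{column}_not_null");
    vec![
        format!("ALTER TABLE {table} ADD CONSTRAINT {check} CHECK ({column} IS NOT NULL) NOT VALID;"),
        format!("-- backfill NULL {column} values here"),
        format!("ALTER TABLE {table} VALIDATE CONSTRAINT {check};"),
        format!("ALTER TABLE {table} ALTER COLUMN {column} SET NOT NULL;"),
        format!("ALTER TABLE {table} DROP CONSTRAINT {check};"),
    ]
}

fn main() {
    let steps = not_null_steps("users", "email");
    assert_eq!(steps.len(), 5);
    assert!(steps[0].contains("NOT VALID"));
    assert!(steps[3].contains("SET NOT NULL"));
}
```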

Ratchet (Constraint Pattern)

To add UNIQUE(email) (PostgreSQL accepts NOT VALID only on CHECK and FOREIGN KEY constraints, so the UNIQUE ratchet is built from a concurrently created index):

  1. Fix any existing duplicates (application-specific logic)
  2. Build the index without blocking writes: CREATE UNIQUE INDEX CONCURRENTLY users_email_key ON users (email) (fails if duplicates remain)
  3. Attach the constraint instantly: ALTER TABLE users ADD CONSTRAINT users_email_key UNIQUE USING INDEX users_email_key

For CHECK and FOREIGN KEY constraints, the NOT VALID, backfill, VALIDATE sequence shown earlier applies directly.
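Since PostgreSQL's NOT VALID applies only to CHECK and FOREIGN KEY constraints, a UNIQUE ratchet is usually assembled from a concurrent index. A sketch of a generator for those statements (names hypothetical; note that CREATE INDEX CONCURRENTLY cannot run inside a transaction block, which any real generator must account for):

```rust
/// Emit the PostgreSQL unique-constraint ratchet: build the index
/// concurrently (non-blocking), then attach it as a constraint (instant).
/// CREATE INDEX CONCURRENTLY must run outside a transaction block.
fn unique_via_index_steps(table: &str, column: &str) -> [String; 2] {
    let idx = format!("{table}_{column}_key");
    [
        format!("CREATE UNIQUE INDEX CONCURRENTLY {idx} ON {table} ({column});"),
        format!("ALTER TABLE {table} ADD CONSTRAINT {idx} UNIQUE USING INDEX {idx};"),
    ]
}

fn main() {
    let steps = unique_via_index_steps("users", "email");
    assert!(steps[0].contains("CONCURRENTLY"));
    assert!(steps[1].contains("USING INDEX"));
}
```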

Destructive

Drop operations are fundamentally different:

  • They're often intentional (cleaning up unused structures)
  • They can't be "decomposed" - they're the end state
  • But they must be verified safe (nothing references the dropped object)

For drops, the decomposition is temporal:

  1. Stop using the object in application code
  2. Wait for all old application instances to drain
  3. Perform the drop

Open Questions for Future Work

  1. How do we handle changes that combine multiple categories?

    • Example: Rename column AND change type simultaneously
    • Likely answer: Decompose into separate changes, each with its own category
  2. Should "Destructive" be further subdivided?

    • "Intentional removal" vs "Data loss risk"
    • Dropping an unused table is different from dropping a table with data
  3. How do we represent the decomposed migration steps?

    • This module detects breaking changes
    • A separate module would generate the decomposition
    • What's the interface between them?
  4. What about lock-related concerns?

    • Some operations require ACCESS EXCLUSIVE locks
    • Adding an index without CONCURRENTLY blocks writes
    • Is this a separate axis of classification?

References