Skip to content

feature: add --redact option to infrahub-backup for anonymised database exports #98

@minitriga

Description

@minitriga

Summary

Add a --redact (or similar) flag to infrahub-backup that anonymises all sensitive data values before export. This would allow customers to share database backups for debugging without exposing proprietary data.

Proposed approach

  1. Customer uses infrahub-backup to back up their production environment
  2. Restores the backup to a staging or local environment (customers already do this)
  3. Runs infrahub-backup again with the --redact flag, which executes a Cypher query to replace all value properties on AttributeValue nodes with UUIDs
  4. The resulting dump contains the full graph structure (nodes, relationships, schema, hierarchy) but no real data values

Why this matters

Having access to a customer's full graph structure (with anonymised values) would dramatically improve our ability to debug performance issues, merge corruption, and other problems that are difficult to reproduce without real-world scale and topology.

Open questions

  • Exact naming of the flag (--redact, --anonymize, etc.)
  • Whether the Cypher query should be bundled into infrahub-backup directly or provided as a separate utility
  • Whether any non-AttributeValue data also needs redaction (e.g. node names, labels)

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions