A specialized C++ forensic tool designed to audit and "break" improperly redacted PDF documents. Redact Breaker analyzes PDF structures to identify leaked metadata, hidden layers, and poorly implemented pixelation/blur blocks that can be reversed to reveal sensitive information.
The tool performs a multi-stage audit of the target PDF file to identify security vulnerabilities.
graph TD
PDF[Input PDF] --> Meta[Metadata Extraction]
PDF --> Layer[Layer Dissection]
PDF --> Pixel[Pixelation Analysis]
Meta --> Report[Audit Report]
Layer --> Hidden[Detect Hidden Objects]
Pixel --> Reverse[Calculate Potential Reversal]
Hidden --> Report
Reverse --> Report
- Object Tree Inspection: Deep traversal of the PDF object graph to find discarded but still present text elements.
- Redaction Verification: Validates if black boxes are actual vector objects or just visual overlays that can be moved.
- Heuristic Reconstruction: Uses statistical analysis to attempt reconstruction of redacted text based on font metrics and layout remains.
- Batch Processing: High-performance C++ engine capable of auditing thousands of documents per minute.
- Language: C++20
- Libraries: Poppler / PoDoFo (Internal forks for forensic access)
- Formatting: JSON/MD export for forensic reporting.
Important
This tool is part of the Sentinel Data Solutions suite for government and private security auditors.
Sentinel Data Solutions | Precision PDF Forensics Developed by Zeca