This repository will host the source code for a GitHub app designed to detect sensitive data within repositories. The app automatically scans files for personal, health, and other critical data types, ensuring compliance with privacy regulations and reducing the risk of data exposure.
- Automated Scanning: Identifies sensitive data patterns in text, PDFs, images, and more.
- GitHub Integration: Triggered by repository events, such as file uploads, commits, or pull requests.
- Detailed Reports: Provides actionable insights and flags files containing sensitive information, and how to resolve them.
- Text files (
.txt) - PDF documents (
.pdf) - Word files (
.docx) - Images (
.png,.jpg,.jpeg) - Comma Seperated Value files (
.csv) - Other file types (expandable based on requirements)
- Enhance and prevent sensitive data concerns for GitHub repositories.
- Assist developers in identifying and mitigating risks related to sensitive data exposure.
- Comply with privacy standards like GDPR, HIPAA, and others.
- TBD
echo "Workflow test trigger" >> README.md
To contribute to this repository, you must fork this repo, clone your fork and develop on your own forked repository, then PR into this. do not clone from this repository directly
This repository is licensed under [your chosen license]. See the LICENSE file for more details.