EduCheck is a lightweight academic submission analysis and similarity detection platform that allows faculty to upload structured student ZIP submissions, automatically extract files, detect duplicate or highly similar work, and generate detailed CSV reports with real-time progress tracking and processing controls.
- Upload a ZIP file containing structured student submissions.
- Automatically extract and analyze all files recursively.
- Detect duplicate or highly similar practical submissions.
- Generate detailed CSV reports with full file paths.
- Preview CSV results directly inside the browser.
- Track real-time processing progress.
- Pause, resume, or cancel analysis anytime.
Faculty often spend hours manually checking practical files and identifying copied submissions. Students commonly rename files or modify roll numbers before resubmitting duplicated work. EduCheck automates this workflow by extracting submissions, analyzing document similarities, and generating structured reports instantly.
EduCheck/
│
├── static/
│ ├── favicon.ico
│ ├── style.css
│ └── script.js
│
├── templates/
│ └── index.html
│
├── uploads/
├── extracted/
├── reports/
│
├── app.py
├── detect_similarities.py
├── progress_manager.py
├── requirements.txt
├── README.md
└── LICENSE
- Backend: FastAPI + Python
- Frontend: HTML, CSS, Vanilla JavaScript
- Similarity detection: Jaccard similarity algorithm
- PDF handling: PyPDF2
- CSV generation: Pandas
- Processing: Lightweight threaded background execution
- User uploads a ZIP file containing student submission folders.
- EduCheck extracts and scans all documents recursively.
- Document contents are read and compared for similarities.
- Duplicate or highly similar files are identified.
- A detailed CSV report is generated and previewed in-browser.
Basic steps to run locally:
# Install dependencies
pip install -r requirements.txt
# Run the FastAPI server
python -m uvicorn app:app --host 0.0.0.0 --port 8000
# Open the interface in your browser
http://localhost:8000
- Real-time progress tracking
- ZIP validation & upload protection
- Pause / Resume / Cancel processing
- Browser completion notifications
- CSV preview before download
- Full file path preservation
- Lightweight and optimized architecture
- Designed to remain lightweight without heavy frontend frameworks.
- Optimized for structured academic submission folders.
- Similarity results should be manually reviewed before academic decisions.
Licensed under the Apache License 2.0.