SnapSummarize is a productivity tool designed to streamline your note-taking workflow. It automatically monitors your MacBook for new screenshots in a specified directory, extracts text from the images using OCR, and summarizes the extracted text using Ollama gemma-1b model. The summarized text is then saved as a text file, making it easier to organize and reference your notes.
Useful when you're researching online, capturing snippets of information, or simply saving time by avoiding manual transcription.
- Set Up the Watch Directory: Specify the folder where your screenshots are saved (e.g.,
~/Desktop/screenshots). - Monitor for New Screenshots: SnapSummarize continuously watches the directory for new files.
- Extract Text: When a new screenshot is detected, the app uses OCR to extract text from the image.
- Summarize the Text: The extracted text is summarized using the
gemma-1bmodel. - Save the Summary: The summarized text is saved as a
.txtfile in your output directory (e.g.,~/Desktop/screenshot_notes).
-
Clone the repo
-
Install the required dependencies:
pip install -r requirements.txt
-
Set up Ollama and ensure the gemma-1b model is pulled.
-
Run the application:
python app/main.py`
🖥️ Example Workflow
- Take a screenshot (e.g., Cmd + Shift + 4 on Mac).
- SnapSummarize detects the new screenshot in your watch directory.
- The app extracts text from the image using OCR.
- The extracted text is summarized using AI.
- The summary is saved as a .txt file in your output directory.