hari7261/GemmaOCR-APP
GemmaOCR-APP 🔎

A polished, local-first OCR web app powered by Gemma-3 Vision and Streamlit.

GemmaOCR-APP lets you upload an image, run OCR with a multimodal LLM, and receive well-structured Markdown output (headings, lists, code blocks, and more), all in a clean interface.


✨ Features

  • Local OCR pipeline with Ollama + gemma3:12b
  • Simple Streamlit UI with:
    • image upload (png, jpg, jpeg)
    • one-click text extraction
    • clear/reset action
  • Structured output, not plain text dump
  • Runs on your machine (no external OCR SaaS required)

🧱 Tech Stack

  • Python
  • Streamlit (UI)
  • Ollama Python client (model inference)
  • Gemma-3 Vision model (gemma3:12b)
  • Pillow (image handling)

📂 Project Structure

GemmaOCR-APP/
├── app.py        # Streamlit application
├── README.md     # Project documentation
└── LICENSE       # License file

Note: app.py references ./assets/gemma3.png for the header icon. Add this file if it is missing in your local copy.


✅ Prerequisites

Before running the app, ensure:

  1. Python 3.9+ is installed
  2. Ollama is installed and running
  3. The Gemma model is pulled locally:

     ollama pull gemma3:12b

🚀 Quick Start

1) Clone the repository

git clone https://github.com/<your-username>/GemmaOCR-APP.git
cd GemmaOCR-APP

2) Create a virtual environment

python -m venv .venv
source .venv/bin/activate    # macOS/Linux
# .venv\Scripts\activate     # Windows PowerShell

3) Install dependencies

pip install streamlit ollama pillow

4) Run the app

streamlit run app.py

Then open the local URL shown by Streamlit (usually http://localhost:8501).


🖼️ How It Works

  1. Upload an image from the sidebar.
  2. Click Extract Text 🔍.
  3. The app sends your image and OCR prompt to gemma3:12b via Ollama.
  4. The model response is rendered as Markdown in the main panel.
  5. Click Clear 🗑️ to reset the current output.
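The call in step 3 can be sketched with the Ollama Python client, which accepts raw image bytes or file paths in the `images` field of a chat message. This is an illustrative sketch, not the app's exact code; the prompt text and function names are mine:

```python
from typing import Dict, List

OCR_PROMPT = (
    "Extract all text from this image and format it as clean Markdown, "
    "preserving headings, lists, and code blocks."
)

def build_messages(image_bytes: bytes) -> List[Dict]:
    """One user message carrying the OCR prompt and the raw image bytes."""
    return [{"role": "user", "content": OCR_PROMPT, "images": [image_bytes]}]

def extract_text(image_bytes: bytes, model: str = "gemma3:12b") -> str:
    """Send the image to the local Ollama server and return the Markdown reply."""
    import ollama  # imported lazily so the payload helper also works without the client installed
    response = ollama.chat(model=model, messages=build_messages(image_bytes))
    return response["message"]["content"]
```

In the Streamlit app, `image_bytes` would come from the upload widget (e.g. `uploaded_file.getvalue()`), and the returned string would be rendered with `st.markdown`.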

🔐 Security & Credentials

This project is designed to run locally and does not require usernames, passwords, or API keys by default.

For security reasons, do not place real personal credentials in source code, README files, or Git history.

If you later add external integrations (e.g., cloud storage, paid APIs), use environment variables and a .env file that is excluded from version control.
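A minimal pattern for that case, reading secrets from the environment rather than from source code (the variable name `STORAGE_API_KEY` is a placeholder, not part of this app):

```python
import os
from typing import Optional

def get_credential(name: str) -> Optional[str]:
    """Read a secret from the environment; never hard-code it in source.

    Returns None when the variable is unset so the caller can fail gracefully.
    """
    return os.environ.get(name)

# Example with a hypothetical variable name:
api_key = get_credential("STORAGE_API_KEY")
if api_key is None:
    print("STORAGE_API_KEY not set; external upload disabled.")
```

With a `.env` file, a loader such as `python-dotenv` can populate the environment at startup; keep `.env` listed in `.gitignore`.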


🛠️ Troubleshooting

Error processing image: ...

  • Confirm Ollama is running:
    ollama list
  • Ensure gemma3:12b is installed:
    ollama pull gemma3:12b
  • Re-launch Streamlit after model download.

Missing icon error (./assets/gemma3.png)

Create assets/ and add gemma3.png, or adjust the image path in app.py.
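A defensive alternative is to skip the icon when the file is absent instead of crashing. A sketch (the helper name is mine, not from app.py):

```python
from pathlib import Path

ICON_PATH = Path("assets") / "gemma3.png"

def resolve_icon(path: Path = ICON_PATH):
    """Return the icon path as a string if it exists, else None so the UI can skip it."""
    return str(path) if path.is_file() else None

# In app.py one might then guard the call:
# icon = resolve_icon()
# if icon:
#     st.image(icon, width=64)
```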


🗺️ Roadmap Ideas

  • Batch OCR for multiple images
  • Download results as .md / .txt
  • Bounding-box OCR visualization overlay
  • Language selection and prompt presets
  • Dockerized deployment profile

🤝 Contributing

Contributions are welcome.

  1. Fork the repo
  2. Create a feature branch
  3. Commit changes
  4. Open a pull request with a clear description

📄 License

This project is licensed under the terms in LICENSE.

About

This project leverages Gemma-3 vision capabilities and Streamlit to create a 100% locally running computer vision app that performs OCR and extracts structured text from images.
