Gemma OCR Assistant

A Streamlit application that leverages the multimodal capabilities of Gemma 4B to perform OCR and image analysis. The application runs locally and connects to an Ollama instance to process images and extract text.

Features

Image upload and processing
Custom prompt input for tailored analysis
Clean and intuitive user interface
Real-time text extraction and analysis
Detailed explanations of image content

Prerequisites

Python 3.8 or higher
Ollama installed and running locally
Gemma3 4B model pulled in Ollama

Setup

Clone this repository:

git clone <repository-url>
cd <repository-name>

Install dependencies:

pip install -r requirements.txt

Ensure Ollama is running with Gemma3 4B model:

ollama pull gemma3:4b

Usage

Start the Streamlit application:

streamlit run app.py

Open your web browser and navigate to the provided URL (typically http://localhost:8501)
Upload an image containing text
(Optional) Enter a custom prompt to guide the analysis
Click "Analyze Image" to process

Custom Prompts

You can customize the analysis by providing specific prompts. Example prompts:

"Extract and list all text from the image"
"Describe the layout and formatting of text"
"Analyze the context and meaning of the text"
"Focus on specific sections or types of text"

Notes

The application expects Ollama to be running on port 11434 (default)
Supported image formats: PNG, JPG, JPEG
For best results, ensure images are clear and text is readable

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
README.md		README.md
app.py		app.py
ollama_utils.py		ollama_utils.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Gemma OCR Assistant

Features

Prerequisites

Setup

Usage

Custom Prompts

Notes

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

PromptEngineer48/OCR_Ollama

Folders and files

Latest commit

History

Repository files navigation

Gemma OCR Assistant

Features

Prerequisites

Setup

Usage

Custom Prompts

Notes

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages