🧠 LLM OCR Extraction System

A robust Laravel web application dedicated to performing high-accuracy Optical Character Recognition (OCR) using local Large Language Models (LLMs).

📖 About the Project

This project leverages modern web architecture to process images and extract textual data with the high precision characteristic of Vision Language Models (VLMs). Designed with a focus on performance and reliability, the application offloads the heavy lifting of AI inferences to background queues, ensuring a fast, non-blocking user experience on the frontend.

By communicating with a local AI server (such as LM Studio running glm-ocr), the application guarantees data privacy, avoiding third-party cloud API costs and providing a fully self-hosted solution for text extraction, document parsing, and sanitization.

✨ Key Features

Asynchronous Processing: Long-running AI inference tasks are dispatched to isolated background Job queues, preventing HTTP request timeouts.
Real-Time UX (Polling): The UI seamlessly polls the backend for processing updates without requiring page reloads, transitioning states from 'pending' to 'completed'.
Clean Architecture: Built over solid engineering principles, featuring Actions (invokables) for single-responsibility logic routing, decoupling business rules from controllers.
Local AI Integration: Designed specifically to interact with Local LLMs via REST APIs, fully capturing, sanitizing, and filtering zero-width spaces or artifacts from AI responses.
Robust Automated Testing: A comprehensive test suite using Pest PHP covering HTTP request faking, queue state transitions, JSON structural validation, and fallback mechanisms.

🛠️ Stack & Technologies

Backend: PHP 8.4+, Laravel 13
Database / Queue: SQLite (relational records) & Database Queue Driver
AI Backend / Integration: LM Studio API (Local VLM processing)
Testing: Pest PHP (Feature & Integration tests)
Frontend: Vanilla JS & Blade Templates

🚀 Getting Started

To run this application locally, you will need PHP 8.4+, Composer, and LM Studio server running a compatible Vision Language Model locally on port 1234.

1. Installation

Clone the repository and install dependencies:

composer install

Prepare your environment file:

cp .env.example .env
php artisan key:generate

2. Database & Storage Setup

# Create SQLite Database (If using sqlite)
touch database/database.sqlite

# Run migrations
php artisan migrate

# Link local storage for image uploads
php artisan storage:link

3. Execution

You will need two terminal windows to run the application fully (due to the async architecture).

Terminal 1 - Web Server:

php artisan serve

Terminal 2 - Queue Worker:

php artisan queue:work

4. Running the Local AI (LM Studio)

Ensure LM Studio is running in the background and the Local Inference Server is started on http://127.0.0.1:1234. The model recommended for OCR tasks is glm-ocr or similar vision-capable models.

🧪 Testing

The codebase relies on Pest PHP for highly expressive and documented tests ensuring the application behaves accurately without needing to spin up a real AI server each time.

# Run the entire test suite
./vendor/bin/pest

Developed to showcase modern asynchronous architecture and Local AI integration within the Laravel ecosystem.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
.agents/skills		.agents/skills
app		app
bootstrap		bootstrap
config		config
database		database
public		public
resources		resources
routes		routes
storage		storage
tests		tests
.editorconfig		.editorconfig
.env.example		.env.example
.gitattributes		.gitattributes
.gitignore		.gitignore
.npmrc		.npmrc
AGENTS.md		AGENTS.md
README.md		README.md
artisan		artisan
boost.json		boost.json
composer.json		composer.json
composer.lock		composer.lock
opencode.json		opencode.json
package-lock.json		package-lock.json
package.json		package.json
phpunit.xml		phpunit.xml
vite.config.js		vite.config.js

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🧠 LLM OCR Extraction System

📖 About the Project

✨ Key Features

🛠️ Stack & Technologies

🚀 Getting Started

1. Installation

2. Database & Storage Setup

3. Execution

4. Running the Local AI (LM Studio)

🧪 Testing

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🧠 LLM OCR Extraction System

📖 About the Project

✨ Key Features

🛠️ Stack & Technologies

🚀 Getting Started

1. Installation

2. Database & Storage Setup

3. Execution

4. Running the Local AI (LM Studio)

🧪 Testing

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages