Parallelize OCR block processing and add optional CUDA preprocessing by zeetee1235 · Pull Request #8 · zeetee1235/DocStruct

zeetee1235 · 2026-03-12T12:21:20Z

Motivation

Reduce wall-clock OCR latency without changing fusion or parser decision logic by running independent per-block Tesseract calls concurrently.
Provide an optional fast preprocessing path that can leverage OpenCV CUDA when available to accelerate grayscale/threshold operations.

Description

Added a configurable worker pool to ocr/bridge/ocr_bridge.py to process OCR blocks in parallel using ThreadPoolExecutor and DOCSTRUCT_OCR_WORKERS to control concurrency while preserving deterministic output ordering.
Introduced CUDA-aware preprocessing helpers _to_grayscale and _otsu_threshold that use OpenCV CUDA if DOCSTRUCT_OCR_USE_CUDA=1 is set and CUDA bindings are available, otherwise silently fall back to CPU implementations.
Kept the original multi-pass OCR candidate scoring, block classification, post-processing, and LaTeX (pix2tex) handling intact and re-applied them to the ordered results to preserve existing OCR logic.
Documented the new runtime environment variables in README.md (DOCSTRUCT_OCR_WORKERS, DOCSTRUCT_OCR_USE_CUDA, and existing DOCSTRUCT_BRIDGE/DOCSTRUCT_PYTHON).

Testing

Ran Python syntax check with python3 -m py_compile ocr/bridge/ocr_bridge.py, which completed successfully.
Ran Rust test suite with cargo test -q, and the tests completed successfully (all automated tests passed).

Codex Task

…cess

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: df510233a3

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-03-12T12:24:41Z

ocr/bridge/ocr_bridge.py

-    
-    for block in blocks:
+
+    def process_block(block: dict) -> dict | None:


Avoid Python 3.10-only union syntax in bridge

The new annotation -> dict | None in run_ocr uses PEP 604 syntax, which is only valid on Python 3.10+, so the bridge now fails to import with a SyntaxError on Python 3.9 runtimes that previously worked; this is a functional regression because the OCR process cannot start at all in those environments (README still documents Python 3.8+ support).

Useful? React with 👍 / 👎.

Improve OCR throughput with parallel workers and optional CUDA prepro…

df51023

…cess

zeetee1235 added the codex label Mar 12, 2026 — with ChatGPT Codex Connector

chatgpt-codex-connector bot reviewed Mar 12, 2026

View reviewed changes

zeetee1235 merged commit 63749b5 into main Mar 12, 2026
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Parallelize OCR block processing and add optional CUDA preprocessing#8

Parallelize OCR block processing and add optional CUDA preprocessing#8
zeetee1235 merged 1 commit intomainfrom
codex/improve-ocr-speed-with-gpu-acceleration

zeetee1235 commented Mar 12, 2026

Uh oh!

chatgpt-codex-connector bot left a comment

Uh oh!

chatgpt-codex-connector bot Mar 12, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant


		for block in blocks:

		def process_block(block: dict) -> dict \| None:

Conversation

zeetee1235 commented Mar 12, 2026

Motivation

Description

Testing

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector bot Mar 12, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant