A tensorflow siamese network implementation. Illustrated using singature recognition/identification.
-
Updated
Jun 25, 2019 - Python
A tensorflow siamese network implementation. Illustrated using singature recognition/identification.
Online-handwritten version of the George Washington Dataset.
Code and procdures for handwriting object detection and recognition
Distorted Document Images dataset (DDI-100).
Creates synthetic degraded image documents that could be used to train Neural Networks
ScrabbleGAN: Semi-Supervised Varying Length Handwritten Text Generation (CVPR20)
A selection of test lines of several early printed books as well as the corresponding individual OCRopus models and mixed models.
A repository with anonymized invoices
~1000 book pages + OpenCV + python = page regions identified as paragraphs, lines, images, captions, etc.
A synthetic data generator for text recognition
Tools necessary to perform a multi-fold pretrained voting approach utlizing OCRopus.
Ground truth line annotations for the Berliner Börsen-Zeitung
Dataset for scene text removal
This Web application crawls PDFs from governement websites, performs table detection and displays advanced statistics.
Generate text images for training deep learning ocr model
Total Text Dataset - ICDAR 2017. It consists of 1555 images with more than 3 different text orientations: Horizontal, Multi-Oriented, and Curved, one of a kind.
EATEN: Entity-aware Attention for Single Shot Visual Text Extraction
Add a description, image, and links to the aniketdata topic page so that developers can more easily learn about it.
To associate your repository with the aniketdata topic, visit your repo's landing page and select "manage topics."