Skip to content

Tooltips and highlighting does not work correctly with Tesseract hOCR output #25

@stweil

Description

@stweil

Compare https://digi.bib.uni-mannheim.de/periodika/reichsanzeiger/ocr/film/abbyy/hocr/001-7920/0007.hocr?overlay=yes with https://digi.bib.uni-mannheim.de/periodika/reichsanzeiger/ocr/film/tesseract-4.0.0-20181201/001-7920/0007.hocr?overlay=yes.

Tooltips and highlighting works with the hOCR file created from ABBYY ALTO (using ocr-transform), but for the Tesseract hOCR it always shows the bbox of the whole image.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions