diff --git a/1.2/index.bs b/1.2/index.bs
index 59ce6d4..a5f63da 100644
--- a/1.2/index.bs
+++ b/1.2/index.bs
@@ -656,10 +656,34 @@ Issue: [ocr_carea vs ocrx_block](https://github.com/kba/hocr-spec/issues/28)
 ### `ocrx_line`
-Issue: [ocr_line vs ocrx_line](https://github.com/kba/hocr-spec/issues/19)
-
- * any kind of "line" returned by an OCR system that differs from the standard ocr_line above
+ * any kind of "line" returned by an OCR system that differs from [[#ocr_line]]
  * might be some kind of "logical" line
+ * examples include line continuations and rowspan in tables
+
+
+ocrx_lineany kind of "line" returned by an OCR system that differs from the standard ocr_line above
+any kind of "line" returned by an OCR system that differs from §6.1.4 ocr_line
might be some kind of "logical" line
+examples include line continuations and rowspan in tables
+Consider the following snippet, containing a wide-spaced heading broken over
+two physical lines:
+
+An OCR engine could produce the following output, indicating the two physical
+lines that form a single logical line:
+...
+<span class="ocrx_line">
+  <span class='ocr_line' title="bbox 16 16 860 47">Aus den Gewinn- und Verlust-</span>
+  <span class='ocr_line' title="bbox 302 62 603 98">rechnungen</span>
+</span>
+...
+
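To make the relationship between the physical and the logical line concrete, here is a minimal Python sketch of how a consumer might rebuild the logical line from the output above. It is an illustration, not part of the specification. The names `LogicalLineParser`, `parse_bbox`, `join_lines` and `logical_lines` are hypothetical. The sketch assumes each span carries exactly one class and that `ocr_line` spans are direct text containers. Merging a trailing hyphen with the next line is a naive heuristic assumed here; the spec does not prescribe it.

```python
from html.parser import HTMLParser

# The example output from above, without the surrounding "..." markers.
HOCR = """
<span class="ocrx_line">
  <span class='ocr_line' title="bbox 16 16 860 47">Aus den Gewinn- und Verlust-</span>
  <span class='ocr_line' title="bbox 302 62 603 98">rechnungen</span>
</span>
"""


def parse_bbox(title):
    """Return the bbox property of a title attribute as a tuple of ints, or None."""
    for prop in title.split(";"):
        parts = prop.split()
        if parts and parts[0] == "bbox":
            return tuple(int(v) for v in parts[1:5])
    return None


def join_lines(texts):
    """Join physical lines; a trailing hyphen is merged with the next line (assumed heuristic)."""
    out = ""
    for text in texts:
        text = text.strip()
        if out.endswith("-"):
            out = out[:-1] + text
        elif out:
            out += " " + text
        else:
            out = text
    return out


class LogicalLineParser(HTMLParser):
    """Collects the ocr_line children of every ocrx_line span."""

    def __init__(self):
        super().__init__()
        self.open_spans = []  # class of each currently open <span>
        self.lines = []       # one list of physical lines per ocrx_line

    def handle_starttag(self, tag, attrs):
        if tag != "span":
            return
        attrs = dict(attrs)
        cls = attrs.get("class") or ""
        inside_logical = "ocrx_line" in self.open_spans
        self.open_spans.append(cls)
        if cls == "ocrx_line":
            self.lines.append([])
        elif cls == "ocr_line" and inside_logical:
            bbox = parse_bbox(attrs.get("title") or "")
            self.lines[-1].append({"bbox": bbox, "text": ""})

    def handle_endtag(self, tag):
        if tag == "span" and self.open_spans:
            self.open_spans.pop()

    def handle_data(self, data):
        # Keep text only while an ocr_line nested inside an ocrx_line is open.
        if "ocr_line" in self.open_spans and "ocrx_line" in self.open_spans:
            if self.lines and self.lines[-1]:
                self.lines[-1][-1]["text"] += data


def logical_lines(hocr):
    """Return one dict (joined text, enclosing bbox) per ocrx_line element."""
    parser = LogicalLineParser()
    parser.feed(hocr)
    parser.close()
    result = []
    for physical in parser.lines:
        boxes = [p["bbox"] for p in physical if p["bbox"]]
        bbox = None
        if boxes:
            bbox = (min(b[0] for b in boxes), min(b[1] for b in boxes),
                    max(b[2] for b in boxes), max(b[3] for b in boxes))
        result.append({"text": join_lines(p["text"] for p in physical), "bbox": bbox})
    return result


print(logical_lines(HOCR))
# [{'text': 'Aus den Gewinn- und Verlustrechnungen', 'bbox': (16, 16, 860, 98)}]
```

The enclosing box is computed as the union of the two physical boxes, because the `ocrx_line` element in the example carries no `bbox` of its own.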
ocrx_word
+