     description: 'Extract text from a PDF or image document. Returns text with page:line number prefixes (e.g. "0:5 some text") so lines can be referenced for highlighting. '
       +'Handles text-native and image-based PDFs (via OCR). For large documents, returns text in chunks — check "hasMore" and use "startPage" to get the next chunk. '
-      +'A companion data file can be provided to use existing OCR data instead of re-running recognition.',
+      +'A companion data file can be provided to use existing OCR data instead of re-running recognition. '
+      +'Use parAnnots and/or footnoteAnnots to add document structure annotations to the output.',
     inputSchema: {
       type: 'object',
       properties: {
@@ -583,7 +670,20 @@ const TOOLS = [
         },
         preserveSpacing: {
           type: 'boolean',
-          description: 'Preserve horizontal spacing from the document layout by padding words with spaces based on their position. Makes table columns visually aligned in the output. Default: false.',
+          description: 'Preserve horizontal spacing from the document layout by padding words with spaces based on their position. '
+            +'Makes table columns visually aligned in the output. Default: false.',
+        },
+        parAnnots: {
+          type: 'boolean',
+          description: 'Annotate each group of lines with its paragraph ID and type '
+            +'(body, title, or footnote), e.g. "--- par:abc123 [body] ---". '
+            +'Use this to identify which lines belong to the same paragraph. Default: false.',
+        },
+        footnoteAnnots: {
+          type: 'boolean',
+          description: 'Include footnote cross-reference annotations. Words that reference a footnote are annotated with '
+            +'[footnote "word" → par:ID], and footnote paragraphs show which line/word they are linked from. '
+            +'Best used together with parAnnots. Default: false.',
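
The new `parAnnots` description implies that a caller can group output lines by paragraph. A minimal sketch of that grouping follows, assuming only the two line shapes quoted in the descriptions (`--- par:abc123 [body] ---` and `0:5 some text`). The helper name `groupByParagraph` and the sample text are hypothetical, and the real output may contain other line shapes (for example the footnote annotations) that this sketch simply skips.

```javascript
// Hypothetical helper, not part of the commit: groups annotated lines by paragraph.
// Assumed line shapes, taken from the tool descriptions:
//   "--- par:abc123 [body] ---"  paragraph marker (emitted when parAnnots is true)
//   "0:5 some text"              page:line prefix followed by the line text
function groupByParagraph(text) {
  const parMarker = /^--- par:(\S+) \[(body|title|footnote)\] ---$/;
  const linePrefix = /^(\d+):(\d+) (.*)$/;
  const paragraphs = [];
  let current = null;

  for (const raw of text.split('\n')) {
    const line = raw.trimEnd();

    // A marker line starts a new paragraph group.
    const par = parMarker.exec(line);
    if (par) {
      current = { id: par[1], type: par[2], lines: [] };
      paragraphs.push(current);
      continue;
    }

    // A prefixed text line is attached to the most recent marker, if any.
    const m = linePrefix.exec(line);
    if (m && current) {
      current.lines.push({ page: Number(m[1]), line: Number(m[2]), text: m[3] });
    }
  }
  return paragraphs;
}

// Made-up sample in the assumed format.
const sample = [
  '--- par:abc123 [title] ---',
  '0:0 Annual Report',
  '--- par:def456 [body] ---',
  '0:1 Revenue grew in the',
  '0:2 second quarter.',
].join('\n');

const paragraphs = groupByParagraph(sample);
console.log(paragraphs.map((p) => `${p.type}:${p.lines.length}`).join(' '));
```

Keeping the page and line numbers on each entry preserves the ability to reference lines for highlighting, which the main description names as the purpose of the prefixes.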
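
The main description tells callers to check `hasMore` and pass `startPage` to fetch the next chunk. One way a client loop might look is sketched below. Everything about the response shape is an assumption: the diff does not show where `hasMore` lives or how the next `startPage` is chosen, so this sketch assumes a `{ text, hasMore }` result and derives the next page from the last `page:line` prefix in the chunk. `fakeExtractText` is a hypothetical in-memory stand-in for the tool, and the loop is synchronous only to keep the example self-contained; a real tool call would likely be awaited.

```javascript
// Hypothetical stand-in for the tool: serves a 5-page document, two pages per chunk.
// The result shape { text, hasMore } is an assumption, not taken from the commit.
function fakeExtractText({ startPage = 0 }) {
  const totalPages = 5;
  const pagesPerChunk = 2;
  const end = Math.min(startPage + pagesPerChunk, totalPages);
  const lines = [];
  for (let p = startPage; p < end; p++) lines.push(`${p}:0 text of page ${p}`);
  return { text: lines.join('\n'), hasMore: end < totalPages };
}

// Returns the highest page number found in the "page:line" prefixes, or -1 if none.
function lastPageIn(text) {
  let last = -1;
  for (const line of text.split('\n')) {
    const m = /^(\d+):\d+ /.exec(line);
    if (m) last = Math.max(last, Number(m[1]));
  }
  return last;
}

// Requests chunks until hasMore is false and returns the concatenated text.
function extractAll(extract) {
  const chunks = [];
  let startPage = 0;
  for (;;) {
    const result = extract({ startPage });
    chunks.push(result.text);
    if (!result.hasMore) break;

    // Assumption: the next chunk starts on the page after the last one seen.
    const last = lastPageIn(result.text);
    if (last < startPage) throw new Error('chunk made no progress');
    startPage = last + 1;
  }
  return chunks.join('\n');
}

const fullText = extractAll(fakeExtractText);
console.log(fullText.split('\n').length); // 5
```

The progress check guards against an endless loop if a chunk comes back without any recognizable page prefix.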