Skip to content

Commit ac1ed82

Browse files
committed
docs: improve readme overview
1 parent 4693144 commit ac1ed82

1 file changed

Lines changed: 16 additions & 2 deletions

File tree

README.md

Lines changed: 16 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1,8 +1,22 @@
11
# Nutrient PDF to Markdown
22

3-
Standalone CLI wrapper and product documentation for Nutrient's PDF-to-Markdown extractor.
3+
Turn PDFs into structured, semantic Markdown for AI workflows.
44

5-
This repo is the shareable entrypoint for the CLI. The extraction engine itself is distributed as a proprietary prebuilt binary and runs locally on the user's machine. The `nutrient-skills` marketplace repo remains the right place to install the agent skill.
5+
Nutrient's PDF-to-Markdown extractor is built for Claude, Codex, RAG pipelines, and document-heavy automation where raw PDF text is not good enough. The goal is simple: start with clean Markdown instead of noisy extraction that burns context window, adds cleanup work, and makes downstream results less reliable.
6+
7+
This repo is the shareable CLI home for that workflow. The extraction engine itself is distributed as a proprietary signed binary and runs locally on the user's machine. The `nutrient-skills` marketplace repo remains the right place to install the agent skill.
8+
9+
## Overview
10+
11+
Good PDF-to-Markdown extraction is not just text scraping. For AI workflows, it needs to preserve reading order, headings, lists, and tables well enough that the output can be used directly in prompts, indexing pipelines, or agent tasks.
12+
13+
Nutrient focuses on the tradeoff the website calls out most clearly: speed and accuracy should not fight each other. The current benchmark snapshot shows strong structure quality, very fast extraction time per page, and a large speed advantage over several popular alternatives.
14+
15+
The trust model is also intentionally simple:
16+
17+
- documents stay in your local workflow and are not uploaded to Nutrient by this extractor
18+
- the CLI is free for up to `1,000` documents per calendar month
19+
- the public repo stays small and reviewable because the proprietary engine is shipped separately as signed binaries
620

721
## Why This Repo Exists
822

0 commit comments

Comments
 (0)