Skip to content

Discussion: Benchmarking, Scraping Performance, shared pre-scraped and indexed #371

@xuanswe

Description

@xuanswe

Hello,

Thank you for creating such an impressive tool.

Since I couldn't find a dedicated discussion space, I'll share my questions here:

Have you benchmarked this tool against others (e.g., Context7, Nia, Ref.Tools)? It might be interesting to create a YouTube video where you set up identical environments, run the same tasks with Claude Code, and compare the results.

Scraping large documentation sites (like Flutter or React) is quite slow when dealing with thousands of pages. Would it be more efficient to reuse something like Context7's llms.txt (e.g., https://context7.com/websites/flutter_dev/llms.txt?tokens=100000) instead of scraping the entire site (e.g., flutter.dev)? Which approach tends to work better in practice?

Given the slowness of scraping, what do you think about maintaining a public GitHub repository with pre-scraped and indexed docs? That way, local docs-mcp-server could simply download and use them immediately. It should also allow working with any private Git repositories, of course.

Sorry for putting multiple questions in a single issue.

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or requestquestionFurther information is requested

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions