From c52afc6eb36374795908aec57ee131f71e8379a6 Mon Sep 17 00:00:00 2001 From: "github-actions[bot]" Date: Mon, 23 Mar 2026 19:44:41 +0000 Subject: [PATCH] sync firecrawl skill from firecrawl/cli (8c3b8eb) --- skills/firecrawl/SKILL.md | 34 +++++++++++++++++----------------- 1 file changed, 17 insertions(+), 17 deletions(-) diff --git a/skills/firecrawl/SKILL.md b/skills/firecrawl/SKILL.md index eecdfbf..5b1dba1 100644 --- a/skills/firecrawl/SKILL.md +++ b/skills/firecrawl/SKILL.md @@ -1,7 +1,7 @@ --- name: firecrawl description: | - Web scraping, search, crawling, and browser automation via the Firecrawl CLI. Use this skill whenever the user wants to search the web, find articles, research a topic, look something up online, scrape a webpage, grab content from a URL, extract data from a website, crawl documentation, download a site, or interact with pages that need clicks or logins. Also use when they say "fetch this page", "pull the content from", "get the page at https://", or reference scraping external websites. This provides real-time web search with full page content extraction and cloud browser automation — capabilities beyond what Claude can do natively with built-in tools. Do NOT trigger for local file operations, git commands, deployments, or code editing tasks. + Web scraping, search, crawling, and page interaction via the Firecrawl CLI. Use this skill whenever the user wants to search the web, find articles, research a topic, look something up online, scrape a webpage, grab content from a URL, extract data from a website, crawl documentation, download a site, or interact with pages that need clicks or logins. Also use when they say "fetch this page", "pull the content from", "get the page at https://", or reference scraping external websites. This provides real-time web search with full page content extraction and interact capabilities — beyond what Claude can do natively with built-in tools. Do NOT trigger for local file operations, git commands, deployments, or code editing tasks. allowed-tools: - Bash(firecrawl *) - Bash(npx firecrawl *) @@ -9,7 +9,7 @@ allowed-tools: # Firecrawl CLI -Web scraping, search, and browser automation CLI. Returns clean markdown optimized for LLM context windows. +Web scraping, search, and page interaction CLI. Returns clean markdown optimized for LLM context windows. Run `firecrawl --help` or `firecrawl --help` for full option details. @@ -42,25 +42,25 @@ Follow this escalation pattern: 2. **Scrape** - Have a URL. Extract its content directly. 3. **Map + Scrape** - Large site or need a specific subpage. Use `map --search` to find the right URL, then scrape it. 4. **Crawl** - Need bulk content from an entire site section (e.g., all /docs/). -5. **Browser** - Scrape failed because content is behind interaction (pagination, modals, form submissions, multi-step navigation). +5. **Interact** - Scrape first, then interact with the page (pagination, modals, form submissions, multi-step navigation). -| Need | Command | When | -| --------------------------- | ---------- | --------------------------------------------------------- | -| Find pages on a topic | `search` | No specific URL yet | -| Get a page's content | `scrape` | Have a URL, page is static or JS-rendered | -| Find URLs within a site | `map` | Need to locate a specific subpage | -| Bulk extract a site section | `crawl` | Need many pages (e.g., all /docs/) | -| AI-powered data extraction | `agent` | Need structured data from complex sites | -| Interact with a page | `browser` | Content requires clicks, form fills, pagination, or login | -| Download a site to files | `download` | Save an entire site as local files | +| Need | Command | When | +| --------------------------- | --------------------- | --------------------------------------------------------- | +| Find pages on a topic | `search` | No specific URL yet | +| Get a page's content | `scrape` | Have a URL, page is static or JS-rendered | +| Find URLs within a site | `map` | Need to locate a specific subpage | +| Bulk extract a site section | `crawl` | Need many pages (e.g., all /docs/) | +| AI-powered data extraction | `agent` | Need structured data from complex sites | +| Interact with a page | `scrape` + `interact` | Content requires clicks, form fills, pagination, or login | +| Download a site to files | `download` | Save an entire site as local files | -For detailed command reference, use the individual skill for each command (e.g., `firecrawl-search`, `firecrawl-browser`) or run `firecrawl --help`. +For detailed command reference, run `firecrawl --help`. -**Scrape vs browser:** +**Scrape vs interact:** - Use `scrape` first. It handles static pages and JS-rendered SPAs. -- Use `browser` when you need to interact with a page, such as clicking buttons, filling out forms, navigating through a complex site, infinite scroll, or when scrape fails to grab all the content you need. -- Never use browser for web searches - use `search` instead. +- Use `scrape` + `interact` when you need to interact with a page, such as clicking buttons, filling out forms, navigating through a complex site, infinite scroll, or when scrape fails to grab all the content you need. +- Never use interact for web searches - use `search` instead. **Avoid redundant fetches:** @@ -116,7 +116,7 @@ firecrawl scrape "" -o .firecrawl/3.md & wait ``` -For browser, launch separate sessions for independent tasks and operate them in parallel via `--session `. +For interact, scrape multiple pages and interact with each independently using their scrape IDs. ## Credit Usage