acp poc#79

Open
developersdigest wants to merge 63 commits into main from
agent-client-protocol-poc

Conversation

@developersdigest (Member)

No description provided.

new `firecrawl agent` interactive mode that auto-detects locally
installed ACP providers (Claude Code, Codex, OpenCode) and launches
them to gather and structure web data into CSV, JSON, or markdown.

- src/utils/acp.ts: provider detection + session persistence in ~/.firecrawl/sessions/
- src/commands/agent-interactive.ts: interactive flow with suggestions, format selection, schema confirmation
- src/index.ts: wire interactive mode into agent command (no prompt = interactive, or -i flag)

existing `firecrawl agent "prompt"` API mode is unchanged.
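the session-persistence piece might look roughly like this sketch — `SessionMeta`, `sessionDir`, and `saveSession` are illustrative names, not the actual src/utils/acp.ts API:

```typescript
// Hypothetical sketch of session persistence under ~/.firecrawl/sessions/.
// The metadata shape and function names are assumptions for illustration.
import * as fs from "node:fs";
import * as os from "node:os";
import * as path from "node:path";

interface SessionMeta {
  id: string;
  provider: string;              // e.g. "claude-code", "codex", "opencode"
  format: "csv" | "json" | "markdown";
  prompt: string;
  createdAt: string;             // ISO timestamp
}

// every session lives under ~/.firecrawl/sessions/<id>/
export function sessionDir(id: string, home: string = os.homedir()): string {
  return path.join(home, ".firecrawl", "sessions", id);
}

export function saveSession(meta: SessionMeta, home?: string): string {
  const dir = sessionDir(meta.id, home);
  fs.mkdirSync(dir, { recursive: true });
  fs.writeFileSync(path.join(dir, "session.json"), JSON.stringify(meta, null, 2));
  return dir;
}
```

keeping one JSON file per session directory makes the later "resume a session" and "open session folder" features cheap to build on top.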

replace subprocess-spawn model with proper Agent Client Protocol
integration using @agentclientprotocol/sdk. firecrawl now acts as
an ACP client that can connect to any ACP-compatible agent (Claude
Code, Codex, Gemini CLI, Copilot, etc.) via JSON-RPC over stdio.

- src/acp/client.ts: ACP client using ClientSideConnection from SDK
  handles session updates, permission requests, file system ops
- src/acp/registry.ts: detect installed ACP agents on PATH
- src/commands/agent-interactive.ts: rewired to use ACP client
  instead of launchAgent() subprocess spawn
- package.json: add @agentclientprotocol/sdk dependency

no API keys needed — each agent handles its own auth.

claude-agent-acp and codex-acp are the actual ACP-compatible
binaries (from @zed-industries packages), not the raw CLI tools.

after each agent turn, prompt the user for follow-up input so they
can confirm schemas, refine results, or continue the conversation.
type "done", "exit", or press enter to end the session.

parse rawInput from ACP tool calls to show real commands:
  🔥 searching "AI startups"
  🔥 scraping https://example.com/pricing
  🔥 writing output.csv
  ✅ done

instead of opaque "Terminal..." / "Read File..." labels.
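a minimal sketch of that mapping — the rawInput shape and `describeToolCall` name are assumptions, not the real code:

```typescript
// Illustrative: turn a raw ACP tool call into a human-readable label
// instead of the opaque "Terminal..." / "Read File..." titles.
interface ToolCall {
  title?: string;                    // e.g. "Terminal", "Read File"
  rawInput?: { command?: string };   // e.g. firecrawl search "AI startups"
}

export function describeToolCall(call: ToolCall): string {
  const cmd = call.rawInput?.command ?? "";
  let m: RegExpMatchArray | null;
  if ((m = cmd.match(/firecrawl\s+search\s+"([^"]+)"/))) return `searching "${m[1]}"`;
  if ((m = cmd.match(/firecrawl\s+scrape\s+(\S+)/))) return `scraping ${m[1]}`;
  if ((m = cmd.match(/>\s*(\S+\.(?:csv|json|md))/))) return `writing ${m[1]}`;
  return call.title ?? "working";    // fall back to the opaque label
}
```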

proper casing, aligned columns, professional formatting:
  🔥 Search  "AI startups"
     Done
  🔥 Scrape  https://example.com/pricing
     Done
  🔥 Write   output.csv
     Done

only show firecrawl operations (search, scrape, map, crawl) and
session output writes. hide internal operations (reads, grep,
python scripts, help lookups, temp files). animated spinner for
in-progress work, checkmark on completion.

  ⠹ Searching "AI startups"
  ✓ Searching "AI startups"
  ⠼ Scraping forbes.com/lists/ai50
  ✓ Scraping forbes.com/lists/ai50
  ✓ Writing output.csv

each concurrent operation gets its own spinner line instead of
collapsing into "(+N more)". lines clear and re-render as
operations complete. ANSI escape sequences for smooth updates.

- replace broken ANSI multi-line spinner with simple start/done
  line pairs that don't fight with streaming text
- fix search query parsing to strip pipes and redirects
  (was showing `"query" 2>&1 | head -2000`)
- group unavailable agents at bottom of selection with separator
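the pipe/redirect stripping could be done with a few regexes like this (`displayQuery` is an illustrative helper, not the real function):

```typescript
// Sketch of cleaning a shell command before showing the search query:
// drop stderr redirects, pipelines, and output redirects, then unquote.
export function displayQuery(command: string): string {
  return command
    .replace(/\s*2>&1.*$/, "")      // drop "2>&1" and everything after it
    .replace(/\s*\|.*$/, "")        // drop any pipeline (| head -2000, ...)
    .replace(/\s*>\s*\S+\s*$/, "")  // drop a trailing output redirect
    .replace(/^"(.*)"$/, "$1")      // unwrap surrounding quotes
    .trim();
}
```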

ink-based terminal UI that shows:
- animated spinners per active firecrawl operation
- checkmarks on completion
- persistent status bar: session | agent | credits | time | format
- unmounts between turns for clean user input prompts

also: suppress agent stderr noise (hook warnings), enable JSX
in tsconfig for .tsx support, add ink + react deps.

ink/yoga requires top-level await, which breaks tsx/commonjs.
replaced with zero-dependency ANSI status bar:
- pinned to bottom of terminal via scroll region
- animated spinner when operations are active
- shows: session | agent | credits | elapsed | format
- pauses for user input, resumes for next turn
- no npm dependencies added

no scroll regions or ANSI cursor tricks. status prints inline
at natural breakpoints (between turns, on completion). works in
any terminal, pipeable, agent-friendly.

  · Scraping forbes.com/lists/ai50
  ✓ Scraping forbes.com/lists/ai50

── xy90n9zy · Claude Code · 12 credits · 34s · CSV
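the status line above is simple enough to sketch directly — field names here are taken from the example, the function name is illustrative:

```typescript
// Sketch of the inline status line printed at natural breakpoints.
interface Status {
  sessionId: string;
  agent: string;
  credits: number;
  elapsedSec: number;
  format: string;
}

export function statusLine(s: Status): string {
  return `── ${s.sessionId} · ${s.agent} · ${s.credits} credits · ${s.elapsedSec}s · ${s.format}`;
}
```

because it is just a printed line (no cursor tricks), the output stays correct when piped to a file or consumed by another agent.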

categorize every tool call into prominent (firecrawl ops, output
writes) or background (processing, reading, spawning agents).
prominent calls show start + done. background calls show once
per label (deduplicated) in dim text so the user always knows
the agent is working.

  · Searching "AI startups"              ← prominent
  · Processing                           ← background (dim)
  · Spawning agents                      ← background (dim)
  · Scraping forbes.com/lists/ai50       ← prominent
  ✓ Scraping forbes.com/lists/ai50
  · Reading sources                      ← background (dim)
  · Processing data                      ← background (dim)
  ✓ Writing output.csv

- deduplicate by URL: scraping the same URL 5x shows one start
  line and one done line, not ten
- hide background operations entirely (agent text provides context)
- visual spacing between agent text and tool call blocks
- reset dedup between conversation turns
- status line at each turn boundary with credits + elapsed
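the prominent/background split and the per-URL dedup might look like this sketch (`categorize` and `Dedup` are illustrative names; the real categorization also inspects tool titles, per a later commit):

```typescript
// Sketch: firecrawl ops and session output writes are prominent,
// everything else (reads, grep, scripts, temp files) is background.
export function categorize(command: string, outputFile: string): "prominent" | "background" {
  if (/firecrawl\s+(search|scrape|map|crawl)\b/.test(command)) return "prominent";
  if (command.includes(`> ${outputFile}`)) return "prominent"; // session output write
  return "background";
}

// Deduplicate by key (e.g. "scrape:<url>"): only the first start line
// and first done line per key print; reset between conversation turns.
export class Dedup {
  private seen = new Set<string>();
  shouldPrint(key: string): boolean {
    if (this.seen.has(key)) return false;
    this.seen.add(key);
    return true;
  }
  reset(): void { this.seen.clear(); }
}
```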

two modes for running the agent:
- ACP: structured protocol with progress tracking and session mgmt
- Pipe: inject directly into Claude Code/Codex (full terminal, interactive)

user picks mode after selecting the agent. pipe mode falls back
to the original launchAgent() subprocess approach.

also rewrites system prompt to:
- work in clear phases (plan → discover → extract → write)
- stop after plan phase and wait for user confirmation
- report progress with numbers (found 4 sources, extracted 50 records)
- use bullet points not tables (renders better in terminals)
- keep user informed at every step

after the conversation ends, show a menu with all session output
files (with sizes) plus the session folder. selecting one opens
it with the system default handler (open/xdg-open/start).

the main agent thread should only orchestrate — all heavy
scraping and parsing is delegated to subagents that return
just structured JSON records. keeps the main context window
clean for planning and consolidation.

removes the doubled-up · start / ✓ done lines. now each
operation prints once when it finishes — clean single line.
the agent's own text provides context for what's in progress.

complete TUI rewrite:
- auto-detected phase sections (Planning → Discovering → Extracting → Output)
  with ━━━ header lines based on tool call patterns
- wire ACP usage_update events for token count + cost tracking
- status line shows tokens used + cost + elapsed time
- summary footer at session end
- remove pipe mode (commit to ACP)
- remove old buildCallbacks/describeToolCall (moved to tui.ts)
- clean header: "🔥 Firecrawl Agent / Claude Code · CSV · Session xyz"

ACP agents handle their own billing — don't show dollar amounts.
status line now shows: "25k tokens · 12 credits · 47s"
also stop printing status before every → prompt — only at session end.

when the agent is doing non-firecrawl work (reading files,
running scripts, spawning subagents), show a dim "..." on the
current line. clears when agent text arrives or a firecrawl
operation completes. prevents the dead cursor stare.

replace FIRECRAWL_TOOLS_BLOCK with an explicit command list that
excludes browser/interact tools. agent should use scrape with
--wait-for for JS-rendered pages instead of browser sessions.

- display output file path (dimmed) above each → prompt
- add "follow-up suggestions" instruction to system prompt:
  agent proposes 2-3 specific next steps after completing output,
  like perplexity's suggested questions

system prompt now tells the agent its session directory path and
output file path. agent can write intermediate files, build data
incrementally, and preserve progress if interrupted.

the module:"preserve" and moduleResolution:"bundler" changes were
for ink, which was removed. reverted to commonjs + node.

track whether the last stdout text ended with \n. if not, write a
newline before any tool call results, phase headers, or status
lines to prevent them from overwriting partial agent text.
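the trailing-newline guard can be sketched in a few lines — `writeAgentText` and `writeBlock` are illustrative names:

```typescript
// Sketch: remember whether the last streamed chunk ended with \n,
// and pad before structured output so it never clobbers partial text.
let lastEndedWithNewline = true;

export function writeAgentText(chunk: string, out: (s: string) => void): void {
  out(chunk);
  if (chunk.length > 0) lastEndedWithNewline = chunk.endsWith("\n");
}

export function writeBlock(line: string, out: (s: string) => void): void {
  if (!lastEndedWithNewline) out("\n"); // finish the partial agent line first
  out(line.endsWith("\n") ? line : line + "\n");
  lastEndedWithNewline = true;
}
```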

first run shows agent picker, saves selection. subsequent runs
skip the prompt and use the saved default. --provider flag
always overrides.

firecrawl agent "prompt" now uses ACP locally instead of the
firecrawl API. prints session path as JSON so other agents can
poll for results. use --api flag to opt into the old API path.

output: { sessionId, sessionDir, output, agent, status }
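that handoff shape could be typed and printed like this — the field set matches the line above, but `AgentRunResult` and `printHandoff` are illustrative:

```typescript
// Sketch of the machine-readable JSON printed so other agents can
// pick up the session path and poll for results.
export interface AgentRunResult {
  sessionId: string;
  sessionDir: string;   // session directory to poll
  output: string;       // path to the output file
  agent: string;        // e.g. "Claude Code"
  status: string;       // e.g. "started"
}

export function printHandoff(result: AgentRunResult): string {
  const line = JSON.stringify(result);
  console.log(line);    // single line: easy for callers to parse
  return line;
}
```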

fork a detached child process for the ACP session so the CLI
exits immediately. caller gets session path to poll for results.

logs agent text, tool calls, and completions to the session
directory so callers can tail -f for live progress.

tell the agent to run firecrawl --help first, show practical
command examples with useful flags (--only-main-content,
--wait-for, --limit, -o), and save to temp files for parsing.

tool calls now show · label... when starting and ✓ label when
done. background work still shows animated spinner. deduped so
each operation only shows start/done once.

agent now saves raw scraped markdown alongside the output file,
organized by hostname/path like firecrawl download does. session
folder becomes a self-contained package of output + sources.

categorize tool calls by title in addition to rawInput.command.
catches mcp__firecrawl__firecrawl_scrape, WebSearch, WebFetch,
and all other tool invocation patterns the agent might use.

- always show agent selection (no auto-skip), default to last-used
- detect firecrawl MCP tools and WebSearch/WebFetch in TUI display
- stronger system prompt: must use firecrawl CLI via Bash only,
  no MCP tools, no WebSearch, no WebFetch, no curl

spinner provides feedback while running, ✓ line on completion.
no more doubled · start / ✓ done lines.

shows all sessions sorted by date with prompt, agent, format,
and age. select one to resume, open the output, or open the
session folder.

fork() doesn't survive tsx exit. use spawn() with process.execPath
and process.execArgv to preserve the tsx loader. pass args via a
temp file (system prompts are too large for argv).
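that fix can be sketched as below — `buildSpawnArgs` and `launchDetached` are illustrative helpers, and the worker-script path is hypothetical:

```typescript
// Sketch: relaunch through process.execPath with process.execArgv so any
// tsx loader flags survive, and pass oversized payloads via a temp file.
import * as fs from "node:fs";
import * as os from "node:os";
import * as path from "node:path";
import { spawn } from "node:child_process";

export function buildSpawnArgs(workerScript: string, payload: object): { cmd: string; args: string[]; argsFile: string } {
  const argsFile = path.join(fs.mkdtempSync(path.join(os.tmpdir(), "fc-args-")), "args.json");
  fs.writeFileSync(argsFile, JSON.stringify(payload));   // too large for argv
  return {
    cmd: process.execPath,                               // same node binary
    args: [...process.execArgv, workerScript, argsFile], // keep loader flags
    argsFile,
  };
}

export function launchDetached(workerScript: string, payload: object): void {
  const { cmd, args } = buildSpawnArgs(workerScript, payload);
  const child = spawn(cmd, args, { detached: true, stdio: "ignore" });
  child.unref(); // CLI can exit immediately; caller polls the session dir
}
```

the worker reads and deletes its args file on startup, so nothing sensitive lingers longer than the handoff.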

no more dead cursor after typing a follow-up — the working
spinner with rotating messages starts instantly on every turn,
including the first one and after resume.
