Browser automation CLI built for AI agents. Get past anti-bot walls, hand off to humans across platforms when stuck, run parallel tasks without cross-contamination, and isolate multiple accounts in independent browsers.
The browser an AI agent needs has to reach places standard tools can't, let a human seamlessly take over when the agent is stuck, keep parallel tasks from cross-contaminating, and be designed for LLM reasoning — not human-written scripts. A browser for agents must get four things right.
1. Break through blocks — three progressive layers
- Environment layer — stealth fingerprint spoofing, TLS rotation, proxy switching. The vast majority of blocks never trigger.
- Execution layer — `solve-captcha` auto-solves CAPTCHAs; `stealth-extract` pulls protected pages in one command.
- Human layer — `remote-assist` generates a live URL; the user takes over from any device, and the agent continues seamlessly when done.
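
The three layers can be read as an escalation order. The sketch below is one hedged way to script that order, not the tool's documented behavior. It assumes each command exits non-zero on failure and that `solve-captcha` and `remote-assist` accept the `--session` flag; neither assumption is confirmed here. The `escalate` function and the `BROWSER_ACT` variable are hypothetical names introduced for illustration.

```shell
# Sketch only: try each layer in turn and stop at the first success.
# ASSUMPTIONS: commands exit non-zero on failure; solve-captcha and
# remote-assist can be scoped with --session (not confirmed by this README).
BROWSER_ACT="${BROWSER_ACT:-browser-act}"   # hypothetical override hook

escalate() {
  local session="$1"
  local browser_id="$2"
  local url="$3"

  # Environment + execution layer: one-command protected extraction.
  if "$BROWSER_ACT" stealth-extract "$url"; then
    return 0
  fi

  # Fall back to a full browser session on the same URL.
  "$BROWSER_ACT" --session "$session" browser open "$browser_id" "$url" || return 1

  # Execution layer: attempt the automatic CAPTCHA solver.
  if "$BROWSER_ACT" --session "$session" solve-captcha; then
    return 0
  fi

  # Human layer: hand off to the user through a live URL.
  "$BROWSER_ACT" --session "$session" remote-assist
}
```

Calling `escalate my-task <id> https://example.com` would run the steps in that order; in practice the agent decides when to escalate, so treat the fixed order as a simplification.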
2. Three browser modes — by real-world scenario
| Mode | Scenario | Key trait |
|---|---|---|
| `chrome` | Reuse local Chrome login state | Profile import or CDP attach |
| `stealth` privacy mode | Frictionless batch scraping without login | Fresh fingerprint per session + proxy rotation, zero residue |
| `stealth` fixed identity | Logged-in accounts · multi-browser parallel | Stable fingerprint + stable IP, stable account identity, not flagged as bots |
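
The table can be read as a small decision rule. The helper below is a hypothetical illustration of that rule and is not part of the `browser-act` CLI. It only returns the mode label from the table and assumes two yes/no inputs that the table implies but does not name.

```shell
# Hypothetical helper (not a browser-act command): map a scenario to a mode label.
# $1: "yes" if the task should reuse the local Chrome login state
# $2: "yes" if the task runs against logged-in accounts
choose_mode() {
  if [ "$1" = "yes" ]; then
    echo "chrome"
  elif [ "$2" = "yes" ]; then
    echo "stealth fixed identity"
  else
    echo "stealth privacy mode"
  fi
}
```

For example, `choose_mode no yes` prints `stealth fixed identity`, matching the logged-in multi-account row.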
3. Zero-interference concurrency — every agent in its own lane
- Cross-browser parallel — independent cookies, fingerprints, proxies. Sites cannot correlate them.
- Same-browser multi-session — shared login state, independent execution, tasks don't block each other.
- Privacy mode — fresh fingerprint and empty profile per session, zero residue when done.
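
As a rough sketch of same-browser multi-session use, the function below opens several URLs in parallel, each under its own explicitly named session. It reuses only the `--session` and `browser open <id>` forms shown in this README. The `run_parallel` function, the `task-N` naming scheme, and the `BROWSER_ACT` variable are assumptions made for illustration, and whether one browser accepts these calls concurrently in exactly this form is not verified here.

```shell
# Sketch only: one named session per URL, all launched in the background.
BROWSER_ACT="${BROWSER_ACT:-browser-act}"   # hypothetical override hook

run_parallel() {
  local browser_id="$1"
  shift
  local n=0
  local url
  for url in "$@"; do
    n=$((n + 1))
    # Explicit, unique session name so tasks never share a session.
    "$BROWSER_ACT" --session "task-$n" browser open "$browser_id" "$url" &
  done
  # Block until every background task has finished.
  wait
}
```

A call such as `run_parallel <id> https://example.com/a https://example.com/b` would start two sessions, `task-1` and `task-2`, against the same browser.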
4. Designed for agent reasoning — not human scripts
- Compact text output — indexed text format, several times more token-efficient than JSON or HTML.
- Indexed interaction — `state` returns an indexed list; `click 3` / `input 2 "..."`. No DOM parsing required.
- Semantic memory — every browser carries a `desc`, matched to tasks by meaning.
- Concurrency-safe — session ownership + explicit naming. Multi-agent operation never conflicts.
Security: confirmation gating — sensitive operations (browser create / delete, Profile import, proxy changes, security and privacy toggles) require explicit user approval. Prior approvals do not carry over. Enforced at the Skill layer, not a configuration toggle.
- Better headless — Default headless without disrupting users; stealth headless that isn't detected.
- Cross-platform remote handoff — Any device opens the link to take over, and the agent continues seamlessly.
Tell your AI agent:
Install browser-act. Skill source: https://github.com/browser-act/skills/tree/main/browser-act . Verify it works after installation.
# Extract protected page content (zero config)
browser-act stealth-extract https://example.com
# Full browser automation
browser-act --session my-task browser open <id> https://example.com
browser-act --session my-task state # See clickable elements
browser-act --session my-task click 3 # Click by index
browser-act --session my-task input 2 "hi" # Type into a field

The agent runs `get-skills` at the start of each session to get environment state, the browser list, and commands in one call:

browser-act get-skills core --skill-version 2.0.2

How agents discover and use BrowserAct →
OS: Windows, macOS, Linux
Agents: Claude Code · Cursor · VS Code · OpenCode · OpenClaw · Codex · Gemini CLI — works with any agent that can execute shell commands and load Skills.
Full documentation covers anti-blocking, browser modes, sessions and concurrency, headless and remote handoff, agent design, the Skills system, and the complete command reference.
Need to extract data from the same website repeatedly at scale? Don't write scrapers by hand. Skill Forge explores a site once, discovers its APIs and data patterns, generates a deploy-ready Skill package, then runs reliably without re-exploration — 500 or 5,000 records through the same stable path.
Any website. Any data. One command to start:
Install browser-act-skill-forge. Skill source: https://github.com/browser-act/skills/tree/main/browser-act-skill-forge . Verify it works after installation.
Then tell your agent what you need:
"Forge a Skill that extracts job listings from LinkedIn — title, company, salary, URL. I'll run 300 keywords later."
30+ pre-built Skills already generated by Skill Forge, ready to install and run. Covers Amazon, Google Maps, YouTube, Reddit, WeChat, Zhihu, and more.
Browse the full Solutions Catalog →
Can't find what you need above? Generate a custom Skill for any website in minutes — no coding required. Just describe what data you want or what action to perform, and Skill Forge handles the rest.
BrowserAct Skills is free and open source. If it saves you time, please give us a ⭐ Star — it keeps the project alive and helps us ship more skills.
🎁 Bonus: Once you star the repository, you can join our Discord and post in the #claim-500-credits channel to receive 500 free credits!
Built with ❤️ by the BrowserAct Team