Objective
Develop a new Tool for the Agents4Gov (LABIC – ICMC/USP) project that enables controlled web navigation and data extraction through automated browser interaction.
The tool will be built on top of the browser-use framework and designed for secure, auditable exploration of public websites to collect structured information that can later be processed by LLMs or other Agents4Gov components.
Description
The Web Navigation Tool will provide an interface for Agents4Gov agents to:
- Open and interact with web pages.
- Click buttons, follow links, and fill forms automatically.
- Extract relevant content such as text, tables, or metadata.
This capability will support public-sector use cases such as:
- Monitoring official portals (e.g., procurement, transparency, environment, or health).
- Gathering open data from regulatory agencies.
- Validating publication or update events on public databases.
Functional Requirements
Input
- A target URL or search query.
- Optional navigation instructions (e.g., “click the first result”, “extract all table rows”).
- Optional configuration for maximum depth or page limit.
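The three inputs above could be carried in a small validated container. This is a minimal sketch only: the class name `NavigationRequest` and its field names are hypothetical and not defined by this specification.

```python
from dataclasses import dataclass, field
from typing import List, Optional
from urllib.parse import urlparse


@dataclass
class NavigationRequest:
    """Hypothetical input container; names are illustrative, not part of the spec."""

    target: str                                   # URL or free-text search query
    instructions: List[str] = field(default_factory=list)
    max_depth: Optional[int] = None               # None means "no explicit limit"
    max_pages: Optional[int] = None

    def __post_init__(self) -> None:
        if not self.target.strip():
            raise ValueError("target must be a non-empty URL or search query")
        for name in ("max_depth", "max_pages"):
            value = getattr(self, name)
            if value is not None and value < 1:
                raise ValueError(f"{name} must be a positive integer")

    @property
    def is_url(self) -> bool:
        """True when the target parses as an absolute HTTP(S) URL."""
        parsed = urlparse(self.target.strip())
        return parsed.scheme in ("http", "https") and bool(parsed.netloc)
```

Anything that does not parse as an HTTP(S) URL would be treated as a search query under this sketch.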
Browser Session
- Use browser-use headless automation with secure sandboxing.
- Each session must be ephemeral and auditable (temporary cache, isolated context).
- Log every action (URL visited, element clicked, text extracted).
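One way to picture the ephemeral, auditable session is a wrapper that owns a temporary working directory and appends every action to a JSON Lines file. The sketch below uses only the Python standard library and does not call browser-use itself; the class `AuditedSession`, the file name `audit.jsonl`, and the action names are assumptions made for illustration.

```python
import json
import tempfile
from datetime import datetime, timezone
from pathlib import Path


class AuditedSession:
    """Hypothetical wrapper: a throwaway working directory plus an action log."""

    def __init__(self) -> None:
        # The directory and everything in it is deleted when the session closes.
        self._tmp = tempfile.TemporaryDirectory(prefix="webnav_")
        self.workdir = Path(self._tmp.name)
        self.actions = []  # in-memory copy that survives cleanup

    def log(self, action: str, **details) -> dict:
        """Record one action (e.g. visit, click, extract) with a UTC timestamp."""
        entry = {
            "timestamp": datetime.now(timezone.utc).isoformat(),
            "action": action,
            **details,
        }
        self.actions.append(entry)
        with open(self.workdir / "audit.jsonl", "a", encoding="utf-8") as fh:
            fh.write(json.dumps(entry, ensure_ascii=False) + "\n")
        return entry

    def close(self) -> None:
        self._tmp.cleanup()

    def __enter__(self) -> "AuditedSession":
        return self

    def __exit__(self, *exc_info) -> None:
        self.close()
```

A real implementation would presumably export the log to the audit store before cleanup; here only the in-memory `actions` list outlives the session.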
Output
Expected Behavior (User Flow)
- The user opens Open WebUI → Tools → Web Navigation Agent.
- The user provides a URL or query and an instruction.
- The tool runs a browser-use session, navigating according to the instructions.
- The user receives:
- A structured JSON summary of the navigation.
- A clear message about the number of pages visited and any warnings.
- Optional LLM-generated textual summary.
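The result returned to the user could be assembled as below. This is a sketch under assumptions: the function name `build_result` and the keys `pages_visited`, `pages`, `warnings`, `message`, and `summary` are hypothetical, since the output schema is not fixed by this specification.

```python
import json
from typing import Dict, List, Optional


def build_result(
    pages: List[Dict],
    warnings: List[str],
    llm_summary: Optional[str] = None,
) -> Dict:
    """Combine per-page data into one JSON-serializable summary (illustrative schema)."""
    message = f"Visited {len(pages)} page(s)"
    if warnings:
        message += f" with {len(warnings)} warning(s)."
    else:
        message += " with no warnings."

    result = {
        "pages_visited": len(pages),
        "pages": pages,
        "warnings": warnings,
        "message": message,
    }
    # The LLM-generated text is optional, so the key is added only when present.
    if llm_summary is not None:
        result["summary"] = llm_summary
    return result


if __name__ == "__main__":
    demo = build_result([{"url": "https://example.org", "text": "Hello"}], [])
    print(json.dumps(demo, indent=2))
```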
Configuration
- Valve name: llm_web_analyzer (optional)
- Dependencies: browser-use, requests, beautifulsoup4, playwright (or selenium)
- Security: run inside a sandboxed environment with network whitelisting (only HTTP/HTTPS).
- Logs: automatically store navigation logs and extracted text snippets in temporary storage for auditing.
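The HTTP/HTTPS-only restriction can be checked before any navigation step. The sketch below is one possible guard, not the project's actual mechanism: `is_allowed` is a hypothetical helper, and the optional host allowlist goes beyond what the specification states.

```python
from urllib.parse import urlparse

ALLOWED_SCHEMES = {"http", "https"}


def is_allowed(url: str, allowed_hosts: frozenset = frozenset()) -> bool:
    """Return True if the URL uses HTTP(S) and, when a host list is given, matches it."""
    parsed = urlparse(url)
    if parsed.scheme not in ALLOWED_SCHEMES:
        return False  # rejects file://, ftp://, javascript:, etc.

    host = (parsed.hostname or "").lower()
    if not host:
        return False
    if not allowed_hosts:
        return True  # assumption: an empty list means "any HTTP(S) host"

    # Accept an exact host match or a subdomain of an allowed host.
    return any(host == h or host.endswith("." + h) for h in allowed_hosts)
```

For example, `is_allowed("file:///etc/passwd")` is rejected on scheme alone, while a host list such as `frozenset({"gov.br"})` would additionally confine navigation to that domain and its subdomains.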
Deliverables
- tools/browser-use/main.py – orchestration of browser-use session and data extraction
- README.md – usage, examples, safety guidelines
- requirements.txt – dependencies
- test_navigation.py – mock site tests
- docs/README.md – updated to include this tool and usage notes
Acceptance Criteria
browser-use.