Agent Browser
A real browser your agent can drive with snapshots, element refs, live human handoff, and persistent sign-in state.
Tools
| Name | Description |
|---|---|
| `snapshot` | Capture the page accessibility tree with element refs. Read this, then act by ref. |
| `read_text` | Get the visible text of the page or a target element. |
| `read_html` | Get the HTML of the page or a target element. |
| `screenshot` | Take a screenshot of the page or an element. Returns inline image content and a download URL when Filesystem is available. |
| `current_page` | Get the current URL and title. |
| `navigate` | Go to a URL. |
| `navigate_back` | Go back to the previous page. |
| `click` | Click an element by ref from snapshot or selector. |
| `type` | Type text into an element by ref or selector. Optionally submit with Enter. |
| `fill_form` | Fill multiple fields at once. Each field has a target, type, and value. |
| `select_option` | Select one or more options in a dropdown. |
| `hover` | Hover over an element. |
| `drag` | Drag one element onto another. |
| `press_key` | Press a keyboard key, such as `Enter`, `Backspace`, `ArrowDown`, or `a`. |
| `file_upload` | Set a file on an upload input. |
| `evaluate` | Run a JavaScript function on the page or a target element and return its result. |
| `wait_for` | Wait for text to appear/disappear with an optional timeout in seconds. With only time, sleep for that many seconds. |
| `handle_dialog` | Accept or dismiss a pending JavaScript dialog. |
| `tabs` | List, create, close, or select browser tabs. |
| `request_human` | Hand control to a human for login, MFA, or CAPTCHA. Returns a viewer URL. |
| `resume` | Take control back from the human and continue automating. |
| `session_status` | Get the current handoff state and page. |
| `start_session` | Start a fresh browser session. Returns the session ID and a human viewer link. |
| `end_session` | End the browser session and snapshot its profile. |
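The "read the snapshot, then act by ref" pattern from the table can be sketched roughly as follows. This is a minimal illustration, not the SDK's documented usage: the `Execute` type only mirrors the call shape shown in the Quick Start, `browserCall` and `snapshotThenClick` are hypothetical helpers, and the `ref` input field name and the snapshot result shape are assumptions, since the tool schemas are not listed here.

```typescript
// Call shape mirrored from the Quick Start; the SDK client is injected so the
// sketch stays self-contained. Input field names below are assumptions.
type ExecuteOptions = {
  workspaceId: string
  integrationAlias: string
  input: Record<string, unknown>
}
type Execute = (integration: string, action: string, options: ExecuteOptions) => Promise<unknown>

// Hypothetical helper: build the options object for one Agent Browser tool call.
function browserCall(workspaceId: string, input: Record<string, unknown>): ExecuteOptions {
  return { workspaceId, integrationAlias: 'agent-browser', input }
}

// Snapshot first, let the caller pick a ref out of the result, then click it.
async function snapshotThenClick(
  execute: Execute,
  workspaceId: string,
  pickRef: (snapshot: unknown) => string, // snapshot result shape is not documented here
): Promise<unknown> {
  const snapshot = await execute('agent-browser', 'snapshot', browserCall(workspaceId, {}))
  const ref = pickRef(snapshot)
  // `ref` as the input field name is an assumption, not a documented schema.
  return execute('agent-browser', 'click', browserCall(workspaceId, { ref }))
}
```

With a real client, `execute` could be a thin arrow function that forwards its three arguments to `weavz.actions.execute`. Leaving `pickRef` to the caller keeps the sketch independent of how refs actually appear in the snapshot output.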
Quick Start
import { WeavzClient } from '@weavz/sdk'
const weavz = new WeavzClient({ apiKey: 'wvz_...' })
const result = await weavz.actions.execute('agent-browser', 'snapshot', {
  workspaceId: '550e8400-e29b-41d4-a716-446655440000',
  integrationAlias: 'agent-browser',
  input: { /* ... */ },
})
Frequently Asked Questions
How do I connect Agent Browser to my app?
Agent Browser does not require an external account connection. Install the SDK, add the integration to a workspace, then execute tools programmatically.
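As a rough illustration of executing tools programmatically, the sketch below strings several tools from the table into one session with a human handoff for login. It is a sketch under stated assumptions: `Call` stands in for a thin wrapper around the Quick Start `execute` call, `withSession`, `loginWithHuman`, and `waitForHuman` are hypothetical names, and the `url` input field is assumed rather than taken from a documented schema.

```typescript
// Stand-in for a wrapper around the Quick Start execute call (assumed shape).
type Call = (action: string, input: Record<string, unknown>) => Promise<unknown>

// Hypothetical wrapper: run `work` inside a session and always end the session.
async function withSession<T>(call: Call, work: (call: Call) => Promise<T>): Promise<T> {
  await call('start_session', {})
  try {
    return await work(call)
  } finally {
    // Runs on success and on error, so the session is ended either way.
    await call('end_session', {})
  }
}

// Hypothetical flow: open a login page, hand off to a human, then take control back.
async function loginWithHuman(
  call: Call,
  loginUrl: string,
  waitForHuman: (handoff: unknown) => Promise<void>, // e.g. share the viewer URL and wait
): Promise<unknown> {
  return withSession(call, async (c) => {
    await c('navigate', { url: loginUrl }) // `url` field name is an assumption
    const handoff = await c('request_human', {})
    await waitForHuman(handoff)
    await c('resume', {})
    return c('current_page', {})
  })
}
```

The `try`/`finally` is the main design choice here: since `end_session` is what snapshots the profile, the sketch ends the session even when a step in the middle throws.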
What tools does Agent Browser support?
Agent Browser supports 24 tools, including `snapshot`, `read_text`, `read_html`, `screenshot`, and `current_page`. The Tools table lists all of them.
Can I use Agent Browser with AI agents?
Yes. Add Agent Browser to an MCP server in Tool Mode or Code Mode. Code Mode provides 3 meta-tools with on-demand API discovery so agents can compose stateful workflows without loading every action schema up front.
Can Agent Browser be used in stateful agent workflows?
Yes. Agents can combine Agent Browser tools with Filesystem, State KV, Human Gates, input partials, and other workspace integrations while keeping execution scoped to the selected workspace or end user.
What authentication does Agent Browser use?
Agent Browser does not require external authentication.
How much does the Agent Browser integration cost?
The Agent Browser integration is included on all Weavz plans, including the free tier, which comes with 20,000 actions per month. No credit card required.