Agent Browser

Agent Browser

A real browser your agent can drive with snapshots, element refs, live human handoff, and persistent sign-in state.

24 tools

Tools

Name
snapshot

Capture the page accessibility tree with element refs. Read this, then act by ref.

read_text

Get the visible text of the page or a target element.

read_html

Get the HTML of the page or a target element.

screenshot

Take a screenshot of the page or an element. Returns inline image content and a download URL when Filesystem is available.

current_page

Get the current URL and title.

navigate

Go to a URL.

navigate_back

Go back to the previous page.

click

Click an element by ref from snapshot or selector.

type

Type text into an element by ref or selector. Optionally submit with Enter.

fill_form

Fill multiple fields at once. Each field has a target, type, and value.

select_option

Select one or more options in a dropdown.

hover

Hover over an element.

drag

Drag one element onto another.

press_key

Press a keyboard key, such as Enter, Backspace, ArrowDown, or a.

file_upload

Set a file on an upload input.

evaluate

Run a JavaScript function on the page or a target element and return its result.

wait_for

Wait for text to appear/disappear with an optional timeout in seconds. With only time, sleep for that many seconds.

handle_dialog

Accept or dismiss a pending JavaScript dialog.

tabs

List, create, close, or select browser tabs.

request_human

Hand control to a human for login, MFA, or CAPTCHA. Returns a viewer URL.

resume

Take control back from the human and continue automating.

session_status

Get the current handoff state and page.

start_session

Start a fresh browser session. Returns the session ID and a human viewer link.

end_session

End the browser session and snapshot its profile.

Quick Start

TypeScript
import { WeavzClient } from '@weavz/sdk'

const weavz = new WeavzClient({ apiKey: 'wvz_...' })

const result = await weavz.actions.execute('agent-browser', 'snapshot', {
  workspaceId: '550e8400-e29b-41d4-a716-446655440000',
  integrationAlias: 'agent-browser',
  input: { /* ... */ },
})

Related Integrations

Works well with

Frequently Asked Questions

How do I connect Agent Browser to my app?

Agent Browser does not require an external account connection. Install the SDK, add the integration to a workspace, then execute tools programmatically.

What tools does Agent Browser support?

Agent Browser supports 24 tools including Snapshot Page, Read Text, Read HTML, Screenshot, Current Page, and more.

Can I use Agent Browser with AI agents?

Yes. Add Agent Browser to an MCP server in Tool Mode or Code Mode. Code Mode provides 3 meta-tools with on-demand API discovery so agents can compose stateful workflows without loading every action schema up front.

Can Agent Browser be used in stateful agent workflows?

Yes. Agents can combine Agent Browser tools with Filesystem, State KV, Human Gates, input partials, and other workspace integrations while keeping execution scoped to the selected workspace or end user.

What authentication does Agent Browser use?

Agent Browser does not require external authentication.

How much does the Agent Browser integration cost?

The Agent Browser integration is included on all Weavz plans including the free tier with 20,000 actions per month. No credit card required.

Connect Agent Browser in minutes

Get 20,000 free action executions every month. No credit card required.