GitHub - actionbook/actionbook: Browser action engine for AI agents. 10× faster, resilient by design.



Actionbook


Browser Action Engine for AI Agents
Actionbook provides up-to-date action manuals built for the modern web,
so your agent operates any website instantly. One tab or dozens, concurrently.

Website · GitHub · X · Discord


❌ Without Actionbook

  • Slow. Agents take a snapshot after every single step, parse the page, then decide what to do next. Searching for one room on Airbnb takes 15 minutes.
  • Brittle. Modern websites use virtual DOMs, streaming components, and SPAs. Agents don't understand these rendering mechanisms, so they fail to interact with dropdowns, date pickers, and dynamic content.
  • One at a time. Your agent finishes one page before it can start the next. Need to check 30 company websites? That's 30 rounds, one after another.

✅ With Actionbook

  • 10x faster. Action manuals tell agents exactly what to do. No parsing, no guessing.
  • Accurate. Built for virtual DOMs, SPAs, and streaming components. Agents operate reliably.
  • Concurrent. Stateless architecture. Operate dozens of tabs in parallel.

Watch an agent visit 192 First Round portfolio company websites and collect their taglines in 3 minutes. (The video is not sped up or edited.)

actionbook-demo-unedited.mp4

Installation

Install via npm:

npm install -g @actionbookdev/cli

Or build from source:

cargo install --git https://github.com/actionbook/actionbook --path packages/cli --locked

The Rust-based CLI uses your existing system browser (Chrome, Brave, Edge, Arc, Chromium), so no extra browser install step is required.
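
After either install path, it can help to confirm the CLI is reachable before going further. The sketch below assumes the installed binary is named `actionbook`, as the Quick Start commands suggest; `have_cmd` is a hypothetical helper, not part of Actionbook.

```shell
#!/bin/sh
# Sketch: report whether a command is available on PATH.
# Assumption: the installed binary is named "actionbook".

# Hypothetical helper: succeeds if the named command can be resolved.
have_cmd() {
  command -v "$1" >/dev/null 2>&1
}

if have_cmd actionbook; then
  echo "actionbook found at: $(command -v actionbook)"
else
  echo "actionbook not found on PATH; re-run one of the install commands above"
fi
```

If the npm install succeeded but the command is not found, the npm global bin directory is probably missing from `PATH`.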

Quick Start

actionbook browser start

# Open tabs
actionbook browser open https://stripe.com --session s1
actionbook browser open https://linear.app --session s1
actionbook browser open https://vercel.com --session s1

# Operate all tabs concurrently, then wait for the background jobs to finish
actionbook browser snapshot --session s1 --tab t1 &
actionbook browser snapshot --session s1 --tab t2 &
actionbook browser snapshot --session s1 --tab t3 &
wait

# Interact with each tab using refs from its snapshot
actionbook browser click @e5 --session s1 --tab t1
actionbook browser fill @e3 "hello" --session s1 --tab t2
actionbook browser click @e8 --session s1 --tab t3
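
The same open-then-snapshot pattern can be scripted for a list of URLs. This is a minimal sketch, not an official workflow: it uses only the `browser open` and `browser snapshot` commands shown above, and it assumes tabs are named `t1`, `t2`, ... in the order they were opened. The `run` helper and the `DRY_RUN` switch are hypothetical additions so the script can be previewed without touching a browser.

```shell
#!/bin/sh
# Sketch only: open a list of URLs in one session, then snapshot every tab in parallel.
# Assumption: tabs are named t1, t2, ... in the order they were opened, as in the Quick Start.
set -eu

SESSION="s1"
DRY_RUN="${DRY_RUN:-1}"   # hypothetical switch: 1 prints commands, 0 executes them

# Print the command in dry-run mode, otherwise execute it.
run() {
  if [ "$DRY_RUN" = "1" ]; then
    echo "$*"
  else
    "$@"
  fi
}

# The URLs to visit, held in the positional parameters.
set -- "https://stripe.com" "https://linear.app" "https://vercel.com"

# Open one tab per URL, sequentially so the tab numbering stays predictable.
for url in "$@"; do
  run actionbook browser open "$url" --session "$SESSION"
done

# Start one snapshot per tab in the background and remember each PID.
pids=""
i=1
while [ "$i" -le "$#" ]; do
  run actionbook browser snapshot --session "$SESSION" --tab "t$i" &
  pids="$pids $!"
  i=$((i + 1))
done

# Block until every snapshot has finished; count the ones that failed.
failed=0
for pid in $pids; do
  wait "$pid" || failed=$((failed + 1))
done
echo "snapshots finished, failures: $failed"
```

By default the script only prints the commands it would run; set `DRY_RUN=0` to execute them. Waiting on each PID separately, rather than calling a bare `wait`, lets the script notice when an individual snapshot fails.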

When working with any AI coding assistant (Claude Code, Cursor, etc.), add this to your prompt:

Use Actionbook to understand and operate the web page.

The agent will automatically use the CLI to fetch action manuals and execute browser operations.

AI Agent Skills

Actionbook ships with Agent Skills that teach your AI agent how to use the CLI. Add them with one command:

npx skills add actionbook/actionbook

Examples

Explore real-world examples in the Examples Documentation.

Available Tools

Actionbook provides tools for searching and retrieving action manuals. See the CLI Reference for the full command list. If you're using the MCP integration, see the MCP Tools Reference.

Documentation

For comprehensive guides, API references, and tutorials, visit our documentation site:

actionbook.dev/docs

Stay tuned

We move fast. Star Actionbook on GitHub to support the project and get the latest updates.


Join the community on Discord and X.

Contributing

License

See LICENSE for the license details.