A curated list of tools, frameworks, and resources for building AI agents that can browse and interact with the web.
About Steel
Steel is an open-source browser API built specifically for AI agents. We make it easy to build AI applications that can effectively interact with the web.
✨ Get started for free here.
Contents
Autonomous Web Agents
AI agents that autonomously navigate and interact with the web through a user-friendly interface. (a.k.a Browser Agents)
- Surf.new - An open-source playground for chatting with different web agents.
- OpenAI Operator - OpenAI's AI agents that can browse the web for you.
- Browser-Use - SOTA agent and framework that makes the web LLM-friendly.
- Skyvern-AI - Framework to automate browser-based workflows.
- Google Project Mariner - A research prototype exploring the future of human-agent interaction, starting with your browser.
- Sentience API - A tool for building more deterministic and explainable web agents using semantic geometry on web content.
- Runner H - State-of-the-art AI agent that helps automate complex, cumbersome, multi-step tasks without repetitive manual input.
- WebVoyager (Agent) - Vision-enabled web agent.
- AgentGPT - Deploy autonomous AI agents in your browser.
- Agent-E - Agent & framework with HTML DOM distillation.
- Manus - A general AI agent that can execute long running tasks across tools like browsers, terminals, and text editors.
- doBrowser - An AI-powered Chrome extension that understands natural language and takes actions in your browser on your behalf.
- WebSurfer (Autogen) - MultimodalWebSurfer is a multimodal agent that can search the web and visit web pages.
- Magentic-One - A generalist multi-agent system for solving complex tasks including surfing the web via Autogen's MultimodalWebSurfer.
- Harpa.ai - An AI-powered Chrome extension & browser agent that understands natural language and takes actions on your behalf.
- Yutori - A multi-agent system that executes browser-based tasks in parallel given a natural language prompt.
- Automina - AI browser automation tool with natural language control.
- rtrvr.ai - AI web agent Chrome extension that autonomously does tasks, scrapes to Sheets, and calls APIs with prompts in your own browser.
- Nanobrowser - An open-source & local-first AI web agent Chrome extension with flexible LLM options and multi-agent system.
- Browserable - An open-source & self-hostable browser automation library for AI agents.
- Tongyi WebAgent - WebAgent for information seeking built by Tongyi Lab, Alibaba Group.
- Openwork - An MIT-licensed, open alternative to Anthropic's Cowork built with Opencode and dev-browser. Supports multiple LLM providers for launching computer-use agents to automate browser workflows.
- Dassi - An AI coworking agent in your browser that automates tasks, navigates pages, and works with files and 2000+ apps from a side panel.
Computer-use Agents
AI Web Automation Tools
Tools, frameworks and libraries that translate natural language instructions into web interactions.
- Asteroid.ai - Hosted browser agents for SMEs to automate complex workflows.
- PulsarRPA - AI-powered browser automation for data extraction.
- VimGPT - Experimental project using GPT-4 Vision to browse the web via the Vimium extension.
- Cekura.io - An AI browser agent that helps companies maintain up-to-date documentation.
- Dex by Dexterity - An AI coworker embedding into and controlling your browser.
- Autobrowser - A free, experimental Chrome extension that leverages Claude Computer Use to automate tasks in your browser.
- Bytebot - AI-powered scraping automations that evolve with your target sites.
- Runcopycat - A no-code browser automation platform that turns screen recordings into reusable automated workflows.
- Bardeen.ai - A Chrome extension that enables AI-powered browser automations, allowing users to automate tasks and workflows directly within the browser.
- Starizon.ai - Browser assistant for web task automation.
- BrowserGPT - Browser extension for page summaries and Q&A.
- Browse.ai - Chrome extension webscraping that can leverage AI for structured data extraction.
- Strawberry Browser - A personal assistant that sits in your browser, automates repetitive web actions, learns your workflows.
- Deta.surf - An integrated platform that combines a browser, file manager, and AI assistant with browser-level context.
- Comet by Perplexity - An AI-powered browser by Perplexity. Not much more details out yet.
- Dia Browser - AI-first web browser envisioned by The Browser Company (Arc).
- Reworkd - No-code web data extraction solution using agentic AI.
- Onpiste - Chrome extension that uses AI to control and read webpages, including auto summaries, web automation, scraping, and MCP support.
Dev Tools
AI Web Scrapers/Crawlers
Web crawlers & scrapers that leverage AI to navigate websites and extract content.
Web Search & Query Tools
Utilities that help agents search the web or query web data via natural language.
- AgentQL - A query language and toolkit that makes the web AI-ready.
- SerpAPI - Search API that provides Google Search results for your agents.
- Serper.dev - Performant and cost effective search API that provides Google Search results for your agents.
- Jina.ai - Neural search platform for web data.
- Exa.ai - The fastest and most accurate web search API for AI agents.
- Not Human Search - Search engine that indexes 1,750+ agent-first tools ranked by agentic readiness. Available as an MCP server with tools for searching, scoring, and monitoring agent infrastructure.
Benchmarks & Research
Datasets, benchmarks, and notable research efforts for evaluating and advancing web-capable AI agents.
Tutorials & Guides
Resources for learning how to build, deploy, or utilize AI web agents.
- LangGraph WebVoyager Tutorial - Tutorial demonstrating how to build a web navigation agent using LangGraph Agents, Vision Models, and Web Voyager.
- Build an AI Browser Agent - Step-by-step guide to create an AI that browses the web using Playwright and the Browser-Use library.
- Install & Run Browser-Use Locally - Instructions on installing the open-source Browser-Use agent with a local LLM.
- Build a Browser Agent with DeepSeek - Walks through deploying a Browser-Use web UI agent powered by the DeepSeek model on a cloud VM.
Archive
Historical or inactive projects are tracked in ARCHIVE.md.
Interested in implementing Steel?
Feel free to reach out at team@steel.dev or on Discord.
Steel is an open-source browser API built specifically for AI agents. Get started for free here.
Join the Community
- Follow @steeldotdev on X.
- Join the Discord community.
- Feel free to reach out to us at team@steel.dev
Contributing
Contributions of any kind welcome, just follow the guidelines!