BT

Browser Tools MCP – Model Context Protocol Server for Cursor

free

A Model Context Protocol (MCP) integration that enables AI agents to control and interact with real web browsers. Supports page navigation, DOM interaction, form filling, clicking, scrolling, screenshots, and content extraction for automation, research, and agent workflows.

Curated by AI Stack · Platform pick
Installation Instructions →
Category: automationCompany: AgentDesk
Compatible Tools:
Cursor (Primary)ClaudeGitHub CopilotReplit AgentWindsurf

Featured on AI Stack

Add this badge to your README or site so visitors know this MCP is listed in our directory.

Listed on AI Stack MCP Directory
<a href="https://ai-stack.dev/mcps/browser-tools-mcp" target="_blank" rel="noopener noreferrer" style="display:inline-block;padding:6px 12px;background:#1a1f27;color:#93c5fd;border:1px solid #2d323a;border-radius:6px;font-size:12px;text-decoration:none;font-family:system-ui,sans-serif;">Listed on AI Stack MCP Directory</a>

About Browser Tools MCP MCP Server

Quick overview of why teams use it, how it fits into AI workflows, and key constraints.

AgentDesk in AI Workflows Without Context Switching

Developers and product teams often have to constantly switch between different dashboards, scripts, and APIs to gather the necessary information and take actions in their daily workflows. This context switching can be time-consuming and inefficient, leading to lost productivity and increased risk of errors.

The Browser Tools Model Context Protocol (MCP) integration provides a seamless way for AI agents like Cursor to directly interact with and gather data from real web browsers, without the need for manual navigation. By enabling the agent to control the browser programmatically, it can pull the right data or perform the required actions on behalf of the user, significantly streamlining workflows and eliminating the need for constant context switching.

How Browser Tools MCP Improves AI‑Assisted Workflows

The Browser Tools MCP integration allows AI agents to perform a wide range of tasks directly within the browser, including:

  • Incident response and investigation (e.g., capturing screenshots, logs, and network activity for a problematic page)
  • Automated reporting and monitoring (e.g., regularly checking key performance metrics and KPIs on a dashboard)
  • Content summarization and extraction (e.g., pulling specific data points or text from a web application)
  • Integration with other tools and systems (e.g., seamlessly pulling information from a web-based tool into a chatbot or AI assistant)

Architecture and Data Flow

The Browser Tools MCP integration consists of three main components: a Chrome extension, a local Node.js server, and the MCP server. The Chrome extension captures browser data such as console logs, network activity, and screenshots, and sends this information to the local Node.js server. The MCP server then exposes a set of standardized tools and APIs that AI agents can use to interact with the browser data, without needing to manage the low-level details of the Chrome extension or the local server.

The local Node.js server acts as a middleware, handling the communication between the Chrome extension and the MCP server. It processes requests from the MCP server, sending commands to the Chrome extension to capture the necessary data, and then returning the results in a structured format. This approach helps to ensure that sensitive data, such as cookies and headers, are properly sanitized before being sent to the MCP server, which is then accessible to the AI agent.

When Browser Tools MCP Is Most Useful

  • AI-assisted incident investigation: The agent can quickly gather relevant logs, screenshots, and network activity to help diagnose and resolve issues.
  • Automated summarization and reporting: The agent can regularly check key performance metrics and KPIs, and generate concise summaries or reports for stakeholders.
  • Release health checks: The agent can run a series of audits to evaluate the quality, performance, and accessibility of a newly deployed web application or feature.
  • Integrating monitoring into AI workflows: The agent can pull data from web-based monitoring tools and incorporate it into its decision-making and analysis.
  • Browser-based automation and research: The agent can navigate web pages, interact with forms and UI elements, and extract data for a variety of automation and research tasks.
  • Accessibility and compliance checks: The agent can run WCAG compliance tests and report on any issues, helping to ensure web applications are accessible to all users.

Limitations and Operational Constraints

The Browser Tools MCP integration has a few key limitations and operational constraints to be aware of:

  • API key requirements: The integration requires an API key to authenticate with the MCP server, which must be securely managed by the user or organization.
  • Rate limits: The MCP server and underlying services may have rate limits in place to prevent abuse, which could impact the number of requests an AI agent can make within a given time frame.
  • Platform/host restrictions: The Chrome extension and local Node.js server must be installed and running on the same machine as the browser being monitored, which may limit the deployment options in some environments.
  • Environment/network setup: The integration requires a specific network configuration to ensure the Chrome extension, local server, and MCP server can communicate effectively, which may require additional setup in some environments.
  • Model/tooling compatibility: The integration is designed to work with a variety of MCP-compatible AI agents, but the specific capabilities and integration points may vary depending on the agent and its underlying model.

Example Configurations

For stdio Server (Browser Tools MCP Example):
https://browsertools.agentdesk.ai/installation
For SSE Server:
URL: http://example.com:8080/sse

Browser Tools MCP Specific Instructions

# 1. Clone the repo
git clone https://github.com/agentdesk/browser-tools
# 2. Install dependencies
cd browser-tools && npm install
# 3. Set up environment
cp .env.example .env
# 4. configure BROWSER_PROVIDER & credentials
# 5. Start the browser MCP server
npm start

Usage Notes

Help other developers understand when this MCP works best and where to be careful.

Best used for AI-driven browser automation, dynamic scraping, testing and intelligent workflows
Works with agent platforms that support MCP
Supports authentication, navigation, DOM selection, clicks & form submissions
Not ideal for large scale scraping without proxies or bot-management
May be blocked by CAPTCHAs or bot protection

Community field notes and related MCPs load below.