mcp-chrome
MCP ServerFreeChrome MCP Server is a Chrome extension-based Model Context Protocol (MCP) server that exposes your Chrome browser functionality to AI assistants like Claude, enabling complex browser automation, content analysis, and semantic search.
Capabilities14 decomposed
mcp protocol bridging via native messaging
Medium confidenceExposes Chrome browser capabilities to external AI clients (Claude, etc.) through a Fastify-based Node.js server (mcp-chrome-bridge) running on port 12306 that implements the Model Context Protocol. Uses bidirectional JSON-RPC over Chrome native messaging to communicate between the extension and Node.js process, with Server-Sent Events (SSE) for streaming responses and STDIO as an alternative transport mechanism for clients that don't support HTTP.
Operates within the user's existing Chrome session (preserving login states and environment) rather than launching isolated browser instances like Playwright; uses native messaging for low-latency bidirectional communication between extension and Node.js server, enabling real-time tool execution without context serialization overhead
Faster and more stateful than Playwright-based solutions because it reuses the user's authenticated browser session and avoids the overhead of launching new browser instances per request
browser interaction recording and replay
Medium confidenceCaptures user interactions (clicks, typing, navigation) in real-time and stores them as executable workflows in IndexedDB, enabling playback and modification through a visual workflow builder. Uses a transaction-based system to batch DOM mutations and event captures, with a flow data model that represents sequences of actions as nodes in a directed graph that can be executed, edited, and scheduled.
Uses a transaction-based batch apply system with shadow DOM isolation to capture interactions without interfering with page functionality; stores workflows as a node-based graph model (not linear scripts) enabling visual editing, conditional branching, and AI-assisted modification
More user-friendly than Selenium/Playwright scripts because workflows are visual and editable; preserves browser session state unlike headless automation tools, reducing flakiness from login/session timeouts
network monitoring and request interception
Medium confidenceCaptures and analyzes network requests made by the page, enabling workflows to wait for specific API calls, extract data from responses, or modify requests. Uses Chrome DevTools Protocol (CDP) to intercept network traffic, stores request/response metadata in the workflow context, and provides tools for conditional logic based on network events.
Uses Chrome DevTools Protocol to intercept network traffic at the browser level, enabling workflows to wait for specific API calls and extract data from responses without modifying page code; integrates with the workflow system to enable conditional logic based on network events
More reliable than polling for data because it reacts to actual network events; more complete than mocking because it captures real API responses
offscreen document compute for ai inference and media encoding
Medium confidenceDelegates compute-intensive operations (transformer model inference, GIF encoding, image processing) to an offscreen document that runs in a separate execution context, preventing blocking of the main UI thread. Uses Web Workers or offscreen document APIs to parallelize computation, with message passing to communicate results back to the main extension.
Offloads compute-intensive operations to an offscreen document context, preventing UI blocking; uses message passing for result communication, enabling responsive UIs even during heavy inference or encoding tasks
More responsive than running inference on the main thread; more efficient than external API calls because computation stays local to the browser
cli interface for headless workflow execution
Medium confidenceProvides a command-line interface for executing recorded workflows in headless mode, enabling integration with CI/CD pipelines and server-side automation. Wraps the Node.js server with CLI commands for workflow execution, result reporting, and error handling, with support for parameterized workflows and output formatting.
Provides a CLI wrapper around the Node.js server that enables headless workflow execution without a GUI, integrating with standard Unix tools and CI/CD systems; supports parameterized workflows and multiple output formats for easy integration
More flexible than Selenium/Playwright CLIs because workflows are visual and editable; easier to integrate into existing automation pipelines than writing custom scripts
multi-tab and multi-window coordination
Medium confidenceEnables automation workflows to coordinate actions across multiple browser tabs and windows, with shared state management and cross-tab messaging. Uses Chrome extension message passing to synchronize state between tabs, enabling workflows that require interaction with multiple pages simultaneously or sequentially.
Implements cross-tab messaging and state synchronization through the background service worker, enabling workflows to coordinate actions across multiple tabs without requiring manual tab switching; uses a shared state store to maintain consistency
More flexible than single-tab automation because it can handle complex multi-page workflows; more reliable than manual tab switching because coordination is automated
vision-based browser control via computertool
Medium confidenceEnables AI agents to control the browser using visual perception by capturing screenshots, analyzing page layout, and executing actions (click, type, scroll) based on visual coordinates rather than DOM selectors. Implements a ComputerTool base class that accepts screenshot input, performs vision-based reasoning, and translates visual instructions into precise browser actions, supporting multi-step visual workflows.
Implements a ComputerTool abstraction that bridges vision-language models directly to browser actions, allowing agents to reason about visual layout and execute coordinate-based interactions without DOM knowledge; integrates with ONNX Runtime for local vision inference when needed
More flexible than selector-based automation for dynamic UIs; enables AI agents to handle visual elements (images, charts) that DOM selectors cannot target; slower than DOM-based tools but more robust to UI changes
semantic similarity search with onnx-based embeddings
Medium confidenceProvides vector-based semantic search over page content using transformer models (ONNX Runtime) running locally in the browser's offscreen document. Embeds page text into vector space using a pre-loaded model, stores vectors in an HNSW (Hierarchical Navigable Small World) index, and enables fast approximate nearest-neighbor search for finding relevant content without keyword matching.
Runs transformer-based embeddings locally in the browser using ONNX Runtime (no external API calls), enabling privacy-preserving semantic search; uses HNSW for efficient approximate nearest-neighbor search over large document collections without requiring a separate vector database
Faster and more private than cloud-based semantic search APIs (no data leaves the browser); more accurate than keyword search for understanding meaning; eliminates dependency on external vector databases like Pinecone or Weaviate
real-time agent chat with streaming tool execution
Medium confidenceImplements a streaming conversation interface where AI agents (Claude, etc.) can invoke browser tools in real-time and receive results within the same conversation flow. Uses a message processing pipeline that routes tool calls to the appropriate browser automation tools, captures results, and streams them back to the agent for multi-turn reasoning without waiting for full workflow completion.
Implements a message processing pipeline with a timeline-based conversation model that tracks both agent reasoning and tool execution results; uses streaming SSE to send partial results back to the agent in real-time, enabling adaptive multi-step workflows where the agent can adjust strategy based on intermediate outcomes
More interactive than batch automation because the agent sees results immediately and can adapt; preserves full conversation history for debugging and auditing unlike ephemeral tool-calling patterns
visual web editor with shadow dom isolation
Medium confidenceProvides an in-browser visual editor overlay (built with React/Vue) that allows editing page content without interfering with the original page functionality. Uses shadow DOM to isolate the editor UI from page styles, implements a transaction-based batch apply system to commit edits atomically, and supports undo/redo through a state management system that tracks all mutations.
Uses shadow DOM to completely isolate editor UI from page styles, preventing CSS conflicts; implements a transaction-based batch apply system that commits all edits atomically, reducing flakiness from partial DOM updates
More robust than direct DOM manipulation because shadow DOM isolation prevents style leakage; transaction-based commits are more reliable than incremental mutations for complex page edits
workflow builder with node-based flow editor
Medium confidenceProvides a visual node-based interface for constructing automation workflows where each node represents a browser action or decision point. Implements a flow data model that stores workflows as directed graphs with layout algorithms for automatic node positioning, supports conditional branching and loops, and integrates with the recording system to auto-generate nodes from captured interactions.
Implements a node-based flow model (not linear scripts) with automatic layout algorithms, enabling visual editing and conditional branching; integrates bidirectionally with the recording system so recorded interactions can be auto-converted to workflow nodes and vice versa
More flexible than linear script recording because the graph model supports loops and conditionals; more user-friendly than code-based automation because the visual interface requires no programming knowledge
content script injection and dom manipulation
Medium confidenceInjects content scripts into web pages to capture user interactions, monitor DOM changes, and execute browser automation commands at the page level. Uses a message passing architecture to communicate between content scripts and the background service worker, enabling real-time event capture (clicks, typing, navigation) and DOM mutations without blocking page execution.
Uses a bidirectional message passing architecture between content scripts and background worker to enable real-time interaction capture and command execution without blocking page JavaScript; implements event deduplication to avoid capturing redundant interactions
More efficient than polling for page changes because it uses event listeners; lower latency than external automation tools because commands execute in-page rather than through external APIs
project and session management with sqlite persistence
Medium confidenceManages multiple automation projects and conversation sessions using a SQLite database with Drizzle ORM, enabling users to organize workflows, save conversation history, and switch between different automation contexts. Stores project metadata, workflow definitions, execution logs, and conversation transcripts in a structured relational schema accessible through the Node.js server.
Uses Drizzle ORM with SQLite for type-safe schema management, enabling structured storage of projects, workflows, and conversations; integrates with the Node.js server to provide REST/MCP endpoints for querying and managing persistent data
More reliable than in-memory storage because data persists across server restarts; more flexible than file-based storage because SQL queries enable complex filtering and aggregation
trigger system and workflow scheduling
Medium confidenceEnables scheduling and triggering of recorded workflows based on time intervals, events, or external conditions. Implements a trigger registry that maps conditions (cron expressions, webhook events, page changes) to workflow execution, with a scheduler that manages timing and ensures reliable execution even if the browser is closed.
Implements a trigger registry that decouples trigger conditions from workflow execution, enabling flexible scheduling patterns (time-based, event-based, webhook-based) without modifying workflow definitions; uses a persistent scheduler in the Node.js server to ensure reliability
More flexible than simple cron scheduling because it supports event-based triggers; more reliable than browser-based scheduling because the Node.js server runs independently of the browser
Capabilities are decomposed by AI analysis. Each maps to specific user intents and improves with match feedback.
Related Artifactssharing capabilities
Artifacts that share capabilities with mcp-chrome, ranked by overlap. Discovered automatically through the match graph.
llm-analysis-assistant
** <img height="12" width="12" src="https://raw.githubusercontent.com/xuzexin-hz/llm-analysis-assistant/refs/heads/main/src/llm_analysis_assistant/pages/html/imgs/favicon.ico" alt="Langfuse Logo" /> - A very streamlined mcp client that supports calling and monitoring stdio/sse/streamableHttp, and ca
inspector
Visual testing tool for MCP servers
@modelcontextprotocol/inspector
Model Context Protocol inspector
chrome-devtools-mcp
MCP server for Chrome DevTools
@mcp-use/inspector
MCP Inspector - A tool for inspecting and debugging MCP servers
@modelcontextprotocol/inspector-server
Server-side application for the Model Context Protocol inspector
Best For
- ✓AI agent developers building Claude-integrated browser automation workflows
- ✓Teams deploying AI assistants that need persistent browser session access
- ✓Developers migrating from REST APIs to MCP-based tool orchestration
- ✓Non-technical users automating repetitive browser tasks
- ✓QA teams creating regression test scripts visually
- ✓Business process automation teams building RPA workflows
- ✓QA teams testing API integrations through the UI
- ✓Developers debugging network issues in automation workflows
Known Limitations
- ⚠Node.js server must run continuously on port 12306; no built-in clustering or load balancing
- ⚠Native messaging adds ~50-100ms latency per round-trip vs direct extension APIs
- ⚠STDIO transport requires manual process management; HTTP/SSE is recommended for production
- ⚠Recording captures DOM state at interaction time; dynamic content loaded after recording may not replay correctly
- ⚠Complex JavaScript-driven interactions (drag-and-drop, custom gestures) may not record accurately
- ⚠Workflows stored in IndexedDB are browser-local; no built-in cloud sync or cross-device sharing
Requirements
Input / Output
UnfragileRank
UnfragileRank is computed from adoption signals, documentation quality, ecosystem connectivity, match graph feedback, and freshness. No artifact can pay for a higher rank.
Repository Details
Last commit: Jan 6, 2026
About
Chrome MCP Server is a Chrome extension-based Model Context Protocol (MCP) server that exposes your Chrome browser functionality to AI assistants like Claude, enabling complex browser automation, content analysis, and semantic search.
Categories
Alternatives to mcp-chrome
Are you the builder of mcp-chrome?
Claim this artifact to get a verified badge, access match analytics, see which intents users search for, and manage your listing.
Get the weekly brief
New tools, rising stars, and what's actually worth your time. No spam.
Data Sources
Looking for something else?
Search →