What can mcp-chrome do?

mcp protocol bridging via native messaging, browser interaction recording and replay, network monitoring and request interception, offscreen document compute for ai inference and media encoding, cli interface for headless workflow execution, multi-tab and multi-window coordination, vision-based browser control via computertool, semantic similarity search with onnx-based embeddings, real-time agent chat with streaming tool execution, visual web editor with shadow dom isolation, workflow builder with node-based flow editor, content script injection and dom manipulation, project and session management with sqlite persistence, trigger system and workflow scheduling

mcp-chrome

MCP ServerFree

Chrome MCP Server is a Chrome extension-based Model Context Protocol (MCP) server that exposes your Chrome browser functionality to AI assistants like Claude, enabling complex browser automation, content analysis, and semantic search.

Open Source

/ 100

14 capabilities

Capabilities14 decomposed

mcp protocol bridging via native messaging

Medium confidence

Exposes Chrome browser capabilities to external AI clients (Claude, etc.) through a Fastify-based Node.js server (mcp-chrome-bridge) running on port 12306 that implements the Model Context Protocol. Uses bidirectional JSON-RPC over Chrome native messaging to communicate between the extension and Node.js process, with Server-Sent Events (SSE) for streaming responses and STDIO as an alternative transport mechanism for clients that don't support HTTP.

Solves for

Connect Claude or other MCP-compatible AI assistants to my Chrome browser for automationExpose browser state and capabilities as tools to external AI agentsStream real-time browser events and responses back to AI clients

Best for

AI agent developers building Claude-integrated browser automation workflows

Teams deploying AI assistants that need persistent browser session access

Developers migrating from REST APIs to MCP-based tool orchestration

Requires

Chrome browser with extension installed

Node.js 16+ for mcp-chrome-bridge

MCP-compatible AI client (Claude, etc.)

Limitations

Node.js server must run continuously on port 12306; no built-in clustering or load balancing

Native messaging adds ~50-100ms latency per round-trip vs direct extension APIs

STDIO transport requires manual process management; HTTP/SSE is recommended for production

What makes it unique

Operates within the user's existing Chrome session (preserving login states and environment) rather than launching isolated browser instances like Playwright; uses native messaging for low-latency bidirectional communication between extension and Node.js server, enabling real-time tool execution without context serialization overhead

vs alternatives

Faster and more stateful than Playwright-based solutions because it reuses the user's authenticated browser session and avoids the overhead of launching new browser instances per request

browser interaction recording and replay

Medium confidence

Captures user interactions (clicks, typing, navigation) in real-time and stores them as executable workflows in IndexedDB, enabling playback and modification through a visual workflow builder. Uses a transaction-based system to batch DOM mutations and event captures, with a flow data model that represents sequences of actions as nodes in a directed graph that can be executed, edited, and scheduled.

Solves for

Record my browser interactions and replay them as automated workflowsCreate reusable automation scripts without writing codeSchedule recorded workflows to run at specific times or on triggers

Best for

Non-technical users automating repetitive browser tasks

QA teams creating regression test scripts visually

Business process automation teams building RPA workflows

Requires

Chrome extension installed and running

IndexedDB support in Chrome (enabled by default)

Sufficient disk space for workflow storage (typically <1MB per workflow)

Limitations

Recording captures DOM state at interaction time; dynamic content loaded after recording may not replay correctly

Complex JavaScript-driven interactions (drag-and-drop, custom gestures) may not record accurately

Workflows stored in IndexedDB are browser-local; no built-in cloud sync or cross-device sharing

What makes it unique

Uses a transaction-based batch apply system with shadow DOM isolation to capture interactions without interfering with page functionality; stores workflows as a node-based graph model (not linear scripts) enabling visual editing, conditional branching, and AI-assisted modification

vs alternatives

More user-friendly than Selenium/Playwright scripts because workflows are visual and editable; preserves browser session state unlike headless automation tools, reducing flakiness from login/session timeouts

network monitoring and request interception

Medium confidence

Captures and analyzes network requests made by the page, enabling workflows to wait for specific API calls, extract data from responses, or modify requests. Uses Chrome DevTools Protocol (CDP) to intercept network traffic, stores request/response metadata in the workflow context, and provides tools for conditional logic based on network events.

Solves for

Wait for a specific API call to complete before proceeding with automationExtract data from API responses for use in subsequent workflow stepsVerify that expected network requests are being made during automation

Best for

QA teams testing API integrations through the UI

Developers debugging network issues in automation workflows

Teams building data extraction workflows that depend on API responses

Requires

Chrome DevTools Protocol (CDP) support

Chrome extension with network monitoring permissions

Sufficient memory for storing request/response data

Limitations

Network interception only works for requests made by the page; cannot intercept requests from extensions or other sources

Storing large response bodies in workflow context may consume significant memory

Request modification is limited to headers and body; cannot intercept HTTPS traffic without certificate installation

What makes it unique

Uses Chrome DevTools Protocol to intercept network traffic at the browser level, enabling workflows to wait for specific API calls and extract data from responses without modifying page code; integrates with the workflow system to enable conditional logic based on network events

vs alternatives

More reliable than polling for data because it reacts to actual network events; more complete than mocking because it captures real API responses

offscreen document compute for ai inference and media encoding

Medium confidence

Delegates compute-intensive operations (transformer model inference, GIF encoding, image processing) to an offscreen document that runs in a separate execution context, preventing blocking of the main UI thread. Uses Web Workers or offscreen document APIs to parallelize computation, with message passing to communicate results back to the main extension.

Solves for

Run AI inference (embeddings, vision) without blocking the UIEncode screenshots as GIFs for workflow playback without performance impactProcess large images or documents in the background

Best for

Extension developers needing non-blocking compute

Teams running heavy ML models in the browser

Developers building responsive automation UIs

Requires

Chrome 109+ for offscreen document API

Manifest v3 extension

Sufficient system memory for parallel execution

Limitations

Offscreen documents have limited API access; cannot access DOM or some Chrome APIs

Message passing between main and offscreen context adds latency (~5-10ms per message)

Memory is not shared; large data structures must be serialized/deserialized

What makes it unique

Offloads compute-intensive operations to an offscreen document context, preventing UI blocking; uses message passing for result communication, enabling responsive UIs even during heavy inference or encoding tasks

vs alternatives

More responsive than running inference on the main thread; more efficient than external API calls because computation stays local to the browser

cli interface for headless workflow execution

Medium confidence

Provides a command-line interface for executing recorded workflows in headless mode, enabling integration with CI/CD pipelines and server-side automation. Wraps the Node.js server with CLI commands for workflow execution, result reporting, and error handling, with support for parameterized workflows and output formatting.

Solves for

Run automation workflows from CI/CD pipelinesExecute workflows on a server without a GUIIntegrate browser automation into command-line tools and scripts

Best for

DevOps teams integrating automation into CI/CD

Developers building command-line tools for browser automation

Teams running scheduled automation on servers

Requires

Node.js 16+

mcp-chrome-bridge running

Chrome or Chromium browser

Limitations

Headless execution requires a display server (Xvfb on Linux) or headless Chrome; not all pages render correctly in headless mode

CLI interface is limited to simple parameter passing; complex workflows may require code

Error messages from headless execution may be less informative than interactive debugging

What makes it unique

Provides a CLI wrapper around the Node.js server that enables headless workflow execution without a GUI, integrating with standard Unix tools and CI/CD systems; supports parameterized workflows and multiple output formats for easy integration

vs alternatives

More flexible than Selenium/Playwright CLIs because workflows are visual and editable; easier to integrate into existing automation pipelines than writing custom scripts

multi-tab and multi-window coordination

Medium confidence

Enables automation workflows to coordinate actions across multiple browser tabs and windows, with shared state management and cross-tab messaging. Uses Chrome extension message passing to synchronize state between tabs, enabling workflows that require interaction with multiple pages simultaneously or sequentially.

Solves for

Automate workflows that require switching between multiple tabsCoordinate actions across multiple browser windowsShare data between automation steps running in different tabs

Best for

Teams automating complex multi-page workflows

Developers building workflows that require tab switching

QA teams testing multi-window interactions

Requires

Chrome extension with background service worker

Message passing infrastructure for cross-tab communication

Careful state management to avoid race conditions

Limitations

Cross-tab messaging adds latency; workflows may be slower than single-tab automation

State synchronization can be complex; race conditions may occur if multiple tabs modify shared state

Closed tabs lose their state; no automatic recovery for failed tab operations

What makes it unique

Implements cross-tab messaging and state synchronization through the background service worker, enabling workflows to coordinate actions across multiple tabs without requiring manual tab switching; uses a shared state store to maintain consistency

vs alternatives

More flexible than single-tab automation because it can handle complex multi-page workflows; more reliable than manual tab switching because coordination is automated

vision-based browser control via computertool

Medium confidence

Enables AI agents to control the browser using visual perception by capturing screenshots, analyzing page layout, and executing actions (click, type, scroll) based on visual coordinates rather than DOM selectors. Implements a ComputerTool base class that accepts screenshot input, performs vision-based reasoning, and translates visual instructions into precise browser actions, supporting multi-step visual workflows.

Solves for

Let Claude see and interact with my browser visually without needing DOM selectorsAutomate complex UIs that use dynamic or obfuscated HTMLEnable AI agents to handle visual elements like images, charts, and custom components

Best for

AI agents automating legacy or third-party web applications with unstable DOM

Teams building visual RPA workflows that don't require DOM-level precision

Developers integrating vision-language models (Claude's vision) with browser automation

Requires

Vision-capable AI model (Claude 3.5+, GPT-4V, etc.)

Screenshot capability in browser (native Chrome API)

Sufficient API quota for vision inference

Limitations

Vision-based control is slower than DOM-based interaction (requires screenshot + inference per action)

Coordinate-based clicking is fragile if page layout changes between screenshot and execution

Vision model must be able to interpret the visual content; fails on heavily obfuscated or non-standard UIs

What makes it unique

Implements a ComputerTool abstraction that bridges vision-language models directly to browser actions, allowing agents to reason about visual layout and execute coordinate-based interactions without DOM knowledge; integrates with ONNX Runtime for local vision inference when needed

vs alternatives

More flexible than selector-based automation for dynamic UIs; enables AI agents to handle visual elements (images, charts) that DOM selectors cannot target; slower than DOM-based tools but more robust to UI changes

semantic similarity search with onnx-based embeddings

Medium confidence

Provides vector-based semantic search over page content using transformer models (ONNX Runtime) running locally in the browser's offscreen document. Embeds page text into vector space using a pre-loaded model, stores vectors in an HNSW (Hierarchical Navigable Small World) index, and enables fast approximate nearest-neighbor search for finding relevant content without keyword matching.

Solves for

Search for content on a page using semantic meaning, not just keywordsFind similar text passages across multiple pages or documentsEnable AI agents to locate relevant information without DOM selectors

Best for

AI agents analyzing large documents or multi-page websites semantically

Teams building semantic search features without external vector databases

Developers needing privacy-preserving search (all inference local, no API calls)

Requires

ONNX Runtime JavaScript library

Pre-trained transformer model (e.g., all-MiniLM-L6-v2)

Sufficient browser memory for model + index (typically 200-500MB)

Limitations

ONNX model inference adds 100-500ms latency per embedding depending on text length

HNSW index is in-memory; no persistence across browser sessions without manual export

Model size (typically 50-200MB) must fit in browser memory; larger models may cause slowdowns

What makes it unique

Runs transformer-based embeddings locally in the browser using ONNX Runtime (no external API calls), enabling privacy-preserving semantic search; uses HNSW for efficient approximate nearest-neighbor search over large document collections without requiring a separate vector database

vs alternatives

Faster and more private than cloud-based semantic search APIs (no data leaves the browser); more accurate than keyword search for understanding meaning; eliminates dependency on external vector databases like Pinecone or Weaviate

real-time agent chat with streaming tool execution

Medium confidence

Implements a streaming conversation interface where AI agents (Claude, etc.) can invoke browser tools in real-time and receive results within the same conversation flow. Uses a message processing pipeline that routes tool calls to the appropriate browser automation tools, captures results, and streams them back to the agent for multi-turn reasoning without waiting for full workflow completion.

Solves for

Have a streaming conversation with Claude where it controls my browser in real-timeLet the AI agent see results of each action and adapt its next steps accordinglyBuild multi-step workflows where the agent reasons about intermediate results

Best for

AI agent developers building interactive automation workflows

Teams using Claude for complex, multi-step browser tasks requiring reasoning

Developers building chat interfaces for browser automation

Requires

MCP-compatible AI client with streaming support

mcp-chrome-bridge running and accessible

API key for AI model (Claude, etc.)

Limitations

Streaming adds complexity to error handling; tool failures mid-stream may require conversation recovery

Agent reasoning latency compounds with tool execution time; complex workflows may take minutes

No built-in retry logic for failed tool calls; agent must explicitly request retry

What makes it unique

Implements a message processing pipeline with a timeline-based conversation model that tracks both agent reasoning and tool execution results; uses streaming SSE to send partial results back to the agent in real-time, enabling adaptive multi-step workflows where the agent can adjust strategy based on intermediate outcomes

vs alternatives

More interactive than batch automation because the agent sees results immediately and can adapt; preserves full conversation history for debugging and auditing unlike ephemeral tool-calling patterns

visual web editor with shadow dom isolation

Medium confidence

Provides an in-browser visual editor overlay (built with React/Vue) that allows editing page content without interfering with the original page functionality. Uses shadow DOM to isolate the editor UI from page styles, implements a transaction-based batch apply system to commit edits atomically, and supports undo/redo through a state management system that tracks all mutations.

Solves for

Edit page content visually without breaking page functionalityTest content changes before committing them to the pageBuild visual editing workflows that can be recorded and replayed

Best for

Content creators editing web pages visually

QA teams testing page modifications without code

Developers building no-code page customization tools

Requires

Chrome extension with visual editor entrypoint

React or Vue for UI rendering

Shadow DOM support (all modern browsers)

Limitations

Shadow DOM isolation prevents some CSS inheritance; custom styles may not apply correctly

Batch apply system requires careful transaction management; concurrent edits may conflict

Undo/redo is limited to the editor session; no persistence across page reloads without manual save

What makes it unique

Uses shadow DOM to completely isolate editor UI from page styles, preventing CSS conflicts; implements a transaction-based batch apply system that commits all edits atomically, reducing flakiness from partial DOM updates

vs alternatives

More robust than direct DOM manipulation because shadow DOM isolation prevents style leakage; transaction-based commits are more reliable than incremental mutations for complex page edits

workflow builder with node-based flow editor

Medium confidence

Provides a visual node-based interface for constructing automation workflows where each node represents a browser action or decision point. Implements a flow data model that stores workflows as directed graphs with layout algorithms for automatic node positioning, supports conditional branching and loops, and integrates with the recording system to auto-generate nodes from captured interactions.

Solves for

Build complex automation workflows visually without writing codeCreate conditional logic (if/then) in automation workflowsConvert recorded interactions into editable, reusable workflows

Best for

Non-technical users building RPA workflows

Business analysts designing process automation

Teams migrating from linear scripts to graph-based workflows

Requires

Chrome extension with workflow builder UI

React or similar framework for node editor

IndexedDB for workflow storage

Limitations

Node layout algorithm may produce suboptimal layouts for very large workflows (100+ nodes)

Conditional logic is limited to simple branching; complex decision trees require nested nodes

No built-in version control; workflows are stored in IndexedDB without git-like history

What makes it unique

Implements a node-based flow model (not linear scripts) with automatic layout algorithms, enabling visual editing and conditional branching; integrates bidirectionally with the recording system so recorded interactions can be auto-converted to workflow nodes and vice versa

vs alternatives

More flexible than linear script recording because the graph model supports loops and conditionals; more user-friendly than code-based automation because the visual interface requires no programming knowledge

content script injection and dom manipulation

Medium confidence

Injects content scripts into web pages to capture user interactions, monitor DOM changes, and execute browser automation commands at the page level. Uses a message passing architecture to communicate between content scripts and the background service worker, enabling real-time event capture (clicks, typing, navigation) and DOM mutations without blocking page execution.

Solves for

Capture user interactions on any web page for recordingExecute automation commands (click, type, scroll) on the current pageMonitor page changes and trigger workflows based on DOM mutations

Best for

Browser extension developers building automation tools

Teams needing real-time page interaction capture

Developers building event-driven automation workflows

Requires

Chrome extension manifest with content_scripts and host_permissions

Target website must not have restrictive CSP

Background service worker for message routing

Limitations

Content scripts cannot access cross-origin iframes; automation is limited to same-origin content

Message passing between content script and background worker adds ~10-20ms latency per command

DOM mutation monitoring can be expensive on pages with frequent updates; may cause performance degradation

What makes it unique

Uses a bidirectional message passing architecture between content scripts and background worker to enable real-time interaction capture and command execution without blocking page JavaScript; implements event deduplication to avoid capturing redundant interactions

vs alternatives

More efficient than polling for page changes because it uses event listeners; lower latency than external automation tools because commands execute in-page rather than through external APIs

project and session management with sqlite persistence

Medium confidence

Manages multiple automation projects and conversation sessions using a SQLite database with Drizzle ORM, enabling users to organize workflows, save conversation history, and switch between different automation contexts. Stores project metadata, workflow definitions, execution logs, and conversation transcripts in a structured relational schema accessible through the Node.js server.

Solves for

Organize multiple automation workflows into projectsSave and resume conversations with AI agents across sessionsTrack execution history and logs for debugging and auditing

Best for

Teams managing multiple automation projects

Developers building persistent AI agent applications

Organizations requiring audit trails for automated processes

Requires

Node.js 16+ with SQLite support

Drizzle ORM installed

Disk space for database (typically <100MB per 1000 workflows)

Limitations

SQLite is single-writer; concurrent access from multiple processes may cause lock contention

Database is local to the machine; no built-in cloud sync or multi-device access

Drizzle ORM adds abstraction overhead; complex queries may be slower than raw SQL

What makes it unique

Uses Drizzle ORM with SQLite for type-safe schema management, enabling structured storage of projects, workflows, and conversations; integrates with the Node.js server to provide REST/MCP endpoints for querying and managing persistent data

vs alternatives

More reliable than in-memory storage because data persists across server restarts; more flexible than file-based storage because SQL queries enable complex filtering and aggregation

trigger system and workflow scheduling

Medium confidence

Enables scheduling and triggering of recorded workflows based on time intervals, events, or external conditions. Implements a trigger registry that maps conditions (cron expressions, webhook events, page changes) to workflow execution, with a scheduler that manages timing and ensures reliable execution even if the browser is closed.

Solves for

Schedule workflows to run at specific times (e.g., daily reports)Trigger workflows when specific events occur (page change, webhook)Automate repetitive tasks without manual intervention

Best for

Teams automating scheduled tasks (daily reports, data collection)

Developers building event-driven automation

Organizations needing reliable workflow scheduling

Requires

mcp-chrome-bridge running continuously

Cron expression library (e.g., node-cron)

Webhook endpoint for event-based triggers (optional)

Limitations

Scheduling requires the Node.js server to be running continuously; no built-in high availability

Cron-based scheduling is limited to time-based triggers; complex event logic requires custom code

Workflow execution is not guaranteed if the browser is closed or network is unavailable

What makes it unique

Implements a trigger registry that decouples trigger conditions from workflow execution, enabling flexible scheduling patterns (time-based, event-based, webhook-based) without modifying workflow definitions; uses a persistent scheduler in the Node.js server to ensure reliability

vs alternatives

More flexible than simple cron scheduling because it supports event-based triggers; more reliable than browser-based scheduling because the Node.js server runs independently of the browser

Capabilities are decomposed by AI analysis. Each maps to specific user intents and improves with match feedback.

Related Artifactssharing capabilities

Artifacts that share capabilities with mcp-chrome, ranked by overlap. Discovered automatically through the match graph.

Repository27

llm-analysis-assistant

** <img height="12" width="12" src="https://raw.githubusercontent.com/xuzexin-hz/llm-analysis-assistant/refs/heads/main/src/llm_analysis_assistant/pages/html/imgs/favicon.ico" alt="Langfuse Logo" /> - A very streamlined mcp client that supports calling and monitoring stdio/sse/streamableHttp, and ca

transport-agnostic request/response capture and replaymcp client with multi-transport protocol supportrequest-response logging and inspection dashboard

3 shared capabilities

MCP Server36

inspector

Visual testing tool for MCP servers

mcp server protocol bridging via express proxyreal-time protocol message logging and inspection

2 shared capabilities

MCP Server21

@modelcontextprotocol/inspector

Model Context Protocol inspector

interactive mcp protocol debugging and request/response inspectionmcp protocol event streaming and real-time monitoring

2 shared capabilities

MCP Server46

chrome-devtools-mcp

MCP server for Chrome DevTools

network-request-interception-and-monitoring

1 shared capability

MCP Server36

@mcp-use/inspector

MCP Inspector - A tool for inspecting and debugging MCP servers

mcp protocol message inspection and logging

1 shared capability

MCP Server20

@modelcontextprotocol/inspector-server

Server-side application for the Model Context Protocol inspector

mcp protocol message logging and request/response tracing

1 shared capability

Best For

✓AI agent developers building Claude-integrated browser automation workflows
✓Teams deploying AI assistants that need persistent browser session access
✓Developers migrating from REST APIs to MCP-based tool orchestration
✓Non-technical users automating repetitive browser tasks
✓QA teams creating regression test scripts visually
✓Business process automation teams building RPA workflows
✓QA teams testing API integrations through the UI
✓Developers debugging network issues in automation workflows

Known Limitations

⚠Node.js server must run continuously on port 12306; no built-in clustering or load balancing
⚠Native messaging adds ~50-100ms latency per round-trip vs direct extension APIs
⚠STDIO transport requires manual process management; HTTP/SSE is recommended for production
⚠Recording captures DOM state at interaction time; dynamic content loaded after recording may not replay correctly
⚠Complex JavaScript-driven interactions (drag-and-drop, custom gestures) may not record accurately
⚠Workflows stored in IndexedDB are browser-local; no built-in cloud sync or cross-device sharing

Requirements

Chrome browser with extension installedNode.js 16+ for mcp-chrome-bridgeMCP-compatible AI client (Claude, etc.)Port 12306 available on localhostChrome extension installed and runningIndexedDB support in Chrome (enabled by default)Sufficient disk space for workflow storage (typically <1MB per workflow)Chrome DevTools Protocol (CDP) support

Input / Output

Accepts: JSON-RPC method calls, Tool invocation requests with parameters, MCP protocol messages, User interactions (mouse events, keyboard input, navigation), DOM state snapshots, Page screenshots, Network request URLs, Request/response patterns, API response data, Screenshots (for encoding), Text (for embedding), Images (for processing), Workflow ID, Parameters (JSON), Output format (JSON, CSV, etc.), Tab IDs, Window IDs, Shared state data, Cross-tab messages, Screenshots (PNG/JPEG), Natural language instructions, Visual coordinates, Text content from page, Query strings (natural language), Page URLs or DOM elements, Tool invocation requests, Conversation history, DOM elements to edit, Text content, CSS styles, Recorded interactions, Manual node creation, Workflow templates, DOM events (click, input, change), Page navigation events, Automation commands (click, type, scroll), Project metadata (name, description), Workflow definitions, Conversation messages, Execution logs, Cron expressions, Webhook payloads, Event conditions, Workflow IDs

Produces: JSON-RPC responses, Server-Sent Events (SSE) streams, Tool execution results, Workflow JSON (flow data model), Execution logs with timestamps, Screenshots from replay execution, Request/response logs, Extracted data from responses, Network timing information, Encoded GIFs, Vector embeddings, Processed images, Execution results (JSON, CSV, etc.), Exit codes, Logs, Execution results from multiple tabs, Shared state snapshots, Coordination logs, Click/type/scroll actions, Visual analysis results, Execution status with visual feedback, Vector embeddings (float arrays), Ranked search results with similarity scores, HNSW index data, Streaming text responses, Conversation timeline with actions, Modified DOM, Edit transaction logs, Undo/redo state, Executable workflow definition, Visual diagram, Event logs with timestamps, DOM state snapshots, Command execution results, Project list, Workflow history, Conversation transcripts, Execution reports, Execution logs, Trigger status, Workflow results

UnfragileRank

Adoption35%(30% weight)

Quality38%(25% weight)

Ecosystem40%(25% weight)

Match Graph10%(15% weight)

Freshness75%(5% weight)

UnfragileRank is computed from adoption signals, documentation quality, ecosystem connectivity, match graph feedback, and freshness. No artifact can pay for a higher rank.

Type: MCP Server

14 capabilities

Visit mcp-chrome→

Repository Details

11,276

Stars

998

Forks

TypeScript

Language

MIT

License

Last commit: Jan 6, 2026

About

Alternatives to mcp-chrome

wink-embeddings-sg-100d24Repository

100-dimensional English word embeddings for wink-nlp

Compare →

voyage-ai-provider30API

Voyage AI Provider for running Voyage AI models with Vercel AI SDK

Compare →

@vibe-agent-toolkit/rag-lancedb27Agent

LanceDB implementation of RAG interfaces for vibe-agent-toolkit

Compare →

vectra41Repository

A lightweight, file-backed vector database for Node.js and browsers with Pinecone-compatible filtering and hybrid BM25 search.

Compare →

Are you the builder of mcp-chrome?

Claim this artifact to get a verified badge, access match analytics, see which intents users search for, and manage your listing.

Claim this artifact →Verification via email

Get the weekly brief

New tools, rising stars, and what's actually worth your time. No spam.

Data Sources

mcp registry

Looking for something else?

Search →

Capabilities14 decomposed

mcp protocol bridging via native messaging

Medium confidence

Solves for

Best for

AI agent developers building Claude-integrated browser automation workflows

Teams deploying AI assistants that need persistent browser session access

Developers migrating from REST APIs to MCP-based tool orchestration

Requires

Chrome browser with extension installed

Node.js 16+ for mcp-chrome-bridge

MCP-compatible AI client (Claude, etc.)

Limitations

Node.js server must run continuously on port 12306; no built-in clustering or load balancing

Native messaging adds ~50-100ms latency per round-trip vs direct extension APIs

STDIO transport requires manual process management; HTTP/SSE is recommended for production

What makes it unique

vs alternatives

Faster and more stateful than Playwright-based solutions because it reuses the user's authenticated browser session and avoids the overhead of launching new browser instances per request

browser interaction recording and replay

Medium confidence

Solves for

Record my browser interactions and replay them as automated workflowsCreate reusable automation scripts without writing codeSchedule recorded workflows to run at specific times or on triggers

Best for

Non-technical users automating repetitive browser tasks

QA teams creating regression test scripts visually

Business process automation teams building RPA workflows

Requires

Chrome extension installed and running

IndexedDB support in Chrome (enabled by default)

Sufficient disk space for workflow storage (typically <1MB per workflow)

Limitations

Recording captures DOM state at interaction time; dynamic content loaded after recording may not replay correctly

Complex JavaScript-driven interactions (drag-and-drop, custom gestures) may not record accurately

Workflows stored in IndexedDB are browser-local; no built-in cloud sync or cross-device sharing

What makes it unique

vs alternatives

network monitoring and request interception

Medium confidence

Solves for

Best for

QA teams testing API integrations through the UI

Developers debugging network issues in automation workflows

Teams building data extraction workflows that depend on API responses

Requires

Chrome DevTools Protocol (CDP) support

Chrome extension with network monitoring permissions

Sufficient memory for storing request/response data

Limitations

Network interception only works for requests made by the page; cannot intercept requests from extensions or other sources

Storing large response bodies in workflow context may consume significant memory

Request modification is limited to headers and body; cannot intercept HTTPS traffic without certificate installation

What makes it unique

vs alternatives

More reliable than polling for data because it reacts to actual network events; more complete than mocking because it captures real API responses

offscreen document compute for ai inference and media encoding

Medium confidence

Solves for

Run AI inference (embeddings, vision) without blocking the UIEncode screenshots as GIFs for workflow playback without performance impactProcess large images or documents in the background

Best for

Extension developers needing non-blocking compute

Teams running heavy ML models in the browser

Developers building responsive automation UIs

Requires

Chrome 109+ for offscreen document API

Manifest v3 extension

Sufficient system memory for parallel execution

Limitations

Offscreen documents have limited API access; cannot access DOM or some Chrome APIs

Message passing between main and offscreen context adds latency (~5-10ms per message)

Memory is not shared; large data structures must be serialized/deserialized

What makes it unique

vs alternatives

More responsive than running inference on the main thread; more efficient than external API calls because computation stays local to the browser

cli interface for headless workflow execution

Medium confidence

Solves for

Run automation workflows from CI/CD pipelinesExecute workflows on a server without a GUIIntegrate browser automation into command-line tools and scripts

Best for

DevOps teams integrating automation into CI/CD

Developers building command-line tools for browser automation

Teams running scheduled automation on servers

Requires

Node.js 16+

mcp-chrome-bridge running

Chrome or Chromium browser

Limitations

Headless execution requires a display server (Xvfb on Linux) or headless Chrome; not all pages render correctly in headless mode

CLI interface is limited to simple parameter passing; complex workflows may require code

Error messages from headless execution may be less informative than interactive debugging

What makes it unique

vs alternatives

More flexible than Selenium/Playwright CLIs because workflows are visual and editable; easier to integrate into existing automation pipelines than writing custom scripts

multi-tab and multi-window coordination

Medium confidence

Solves for

Automate workflows that require switching between multiple tabsCoordinate actions across multiple browser windowsShare data between automation steps running in different tabs

Best for

Teams automating complex multi-page workflows

Developers building workflows that require tab switching

QA teams testing multi-window interactions

Requires

Chrome extension with background service worker

Message passing infrastructure for cross-tab communication

Careful state management to avoid race conditions

Limitations

Cross-tab messaging adds latency; workflows may be slower than single-tab automation

State synchronization can be complex; race conditions may occur if multiple tabs modify shared state

Closed tabs lose their state; no automatic recovery for failed tab operations

What makes it unique

vs alternatives

More flexible than single-tab automation because it can handle complex multi-page workflows; more reliable than manual tab switching because coordination is automated

vision-based browser control via computertool

Medium confidence

Solves for

Best for

AI agents automating legacy or third-party web applications with unstable DOM

Teams building visual RPA workflows that don't require DOM-level precision

Developers integrating vision-language models (Claude's vision) with browser automation

Requires

Vision-capable AI model (Claude 3.5+, GPT-4V, etc.)

Screenshot capability in browser (native Chrome API)

Sufficient API quota for vision inference

Limitations

Vision-based control is slower than DOM-based interaction (requires screenshot + inference per action)

Coordinate-based clicking is fragile if page layout changes between screenshot and execution

Vision model must be able to interpret the visual content; fails on heavily obfuscated or non-standard UIs

What makes it unique

vs alternatives

semantic similarity search with onnx-based embeddings

Medium confidence

Solves for

Best for

AI agents analyzing large documents or multi-page websites semantically

Teams building semantic search features without external vector databases

Developers needing privacy-preserving search (all inference local, no API calls)

Requires

ONNX Runtime JavaScript library

Pre-trained transformer model (e.g., all-MiniLM-L6-v2)

Sufficient browser memory for model + index (typically 200-500MB)

Limitations

ONNX model inference adds 100-500ms latency per embedding depending on text length

HNSW index is in-memory; no persistence across browser sessions without manual export

Model size (typically 50-200MB) must fit in browser memory; larger models may cause slowdowns

What makes it unique

vs alternatives

real-time agent chat with streaming tool execution

Medium confidence

Solves for

Best for

AI agent developers building interactive automation workflows

Teams using Claude for complex, multi-step browser tasks requiring reasoning

Developers building chat interfaces for browser automation

Requires

MCP-compatible AI client with streaming support

mcp-chrome-bridge running and accessible

API key for AI model (Claude, etc.)

Limitations

Streaming adds complexity to error handling; tool failures mid-stream may require conversation recovery

Agent reasoning latency compounds with tool execution time; complex workflows may take minutes

No built-in retry logic for failed tool calls; agent must explicitly request retry

What makes it unique

vs alternatives

More interactive than batch automation because the agent sees results immediately and can adapt; preserves full conversation history for debugging and auditing unlike ephemeral tool-calling patterns

visual web editor with shadow dom isolation

Medium confidence

Solves for

Edit page content visually without breaking page functionalityTest content changes before committing them to the pageBuild visual editing workflows that can be recorded and replayed

Best for

Content creators editing web pages visually

QA teams testing page modifications without code

Developers building no-code page customization tools

Requires

Chrome extension with visual editor entrypoint

React or Vue for UI rendering

Shadow DOM support (all modern browsers)

Limitations

Shadow DOM isolation prevents some CSS inheritance; custom styles may not apply correctly

Batch apply system requires careful transaction management; concurrent edits may conflict

Undo/redo is limited to the editor session; no persistence across page reloads without manual save

What makes it unique

vs alternatives

More robust than direct DOM manipulation because shadow DOM isolation prevents style leakage; transaction-based commits are more reliable than incremental mutations for complex page edits

workflow builder with node-based flow editor

Medium confidence

Solves for

Build complex automation workflows visually without writing codeCreate conditional logic (if/then) in automation workflowsConvert recorded interactions into editable, reusable workflows

Best for

Non-technical users building RPA workflows

Business analysts designing process automation

Teams migrating from linear scripts to graph-based workflows

Requires

Chrome extension with workflow builder UI

React or similar framework for node editor

IndexedDB for workflow storage

Limitations

Node layout algorithm may produce suboptimal layouts for very large workflows (100+ nodes)

Conditional logic is limited to simple branching; complex decision trees require nested nodes

No built-in version control; workflows are stored in IndexedDB without git-like history

What makes it unique

vs alternatives

content script injection and dom manipulation

Medium confidence

Solves for

Capture user interactions on any web page for recordingExecute automation commands (click, type, scroll) on the current pageMonitor page changes and trigger workflows based on DOM mutations

Best for

Browser extension developers building automation tools

Teams needing real-time page interaction capture

Developers building event-driven automation workflows

Requires

Chrome extension manifest with content_scripts and host_permissions

Target website must not have restrictive CSP

Background service worker for message routing

Limitations

Content scripts cannot access cross-origin iframes; automation is limited to same-origin content

Message passing between content script and background worker adds ~10-20ms latency per command

DOM mutation monitoring can be expensive on pages with frequent updates; may cause performance degradation

What makes it unique

vs alternatives

More efficient than polling for page changes because it uses event listeners; lower latency than external automation tools because commands execute in-page rather than through external APIs

project and session management with sqlite persistence

Medium confidence

Solves for

Organize multiple automation workflows into projectsSave and resume conversations with AI agents across sessionsTrack execution history and logs for debugging and auditing

Best for

Teams managing multiple automation projects

Developers building persistent AI agent applications

Organizations requiring audit trails for automated processes

Requires

Node.js 16+ with SQLite support

Drizzle ORM installed

Disk space for database (typically <100MB per 1000 workflows)

Limitations

SQLite is single-writer; concurrent access from multiple processes may cause lock contention

Database is local to the machine; no built-in cloud sync or multi-device access

Drizzle ORM adds abstraction overhead; complex queries may be slower than raw SQL

What makes it unique

vs alternatives

More reliable than in-memory storage because data persists across server restarts; more flexible than file-based storage because SQL queries enable complex filtering and aggregation

trigger system and workflow scheduling

Medium confidence

Solves for

Schedule workflows to run at specific times (e.g., daily reports)Trigger workflows when specific events occur (page change, webhook)Automate repetitive tasks without manual intervention

Best for

Teams automating scheduled tasks (daily reports, data collection)

Developers building event-driven automation

Organizations needing reliable workflow scheduling

Requires

mcp-chrome-bridge running continuously

Cron expression library (e.g., node-cron)

Webhook endpoint for event-based triggers (optional)

Limitations

Scheduling requires the Node.js server to be running continuously; no built-in high availability

Cron-based scheduling is limited to time-based triggers; complex event logic requires custom code

Workflow execution is not guaranteed if the browser is closed or network is unavailable

What makes it unique

vs alternatives

More flexible than simple cron scheduling because it supports event-based triggers; more reliable than browser-based scheduling because the Node.js server runs independently of the browser

Capabilities are decomposed by AI analysis. Each maps to specific user intents and improves with match feedback.

Alternatives to mcp-chrome

wink-embeddings-sg-100d24Repository

100-dimensional English word embeddings for wink-nlp

Compare →

voyage-ai-provider30API

Voyage AI Provider for running Voyage AI models with Vercel AI SDK

Compare →

@vibe-agent-toolkit/rag-lancedb27Agent

LanceDB implementation of RAG interfaces for vibe-agent-toolkit

Compare →

vectra41Repository

A lightweight, file-backed vector database for Node.js and browsers with Pinecone-compatible filtering and hybrid BM25 search.

Compare →

mcp-chrome

Capabilities14 decomposed

mcp protocol bridging via native messaging

browser interaction recording and replay

network monitoring and request interception

offscreen document compute for ai inference and media encoding

cli interface for headless workflow execution

multi-tab and multi-window coordination

vision-based browser control via computertool

semantic similarity search with onnx-based embeddings

real-time agent chat with streaming tool execution

visual web editor with shadow dom isolation

workflow builder with node-based flow editor

content script injection and dom manipulation

project and session management with sqlite persistence

trigger system and workflow scheduling

Related Artifactssharing capabilities

llm-analysis-assistant

inspector

@modelcontextprotocol/inspector

chrome-devtools-mcp

@mcp-use/inspector

@modelcontextprotocol/inspector-server

Best For

Known Limitations

Requirements

Input / Output

UnfragileRank

Repository Details

About

Categories

Alternatives to mcp-chrome

Are you the builder of mcp-chrome?

Get the weekly brief

Data Sources

mcp-chrome

Capabilities14 decomposed

mcp protocol bridging via native messaging

browser interaction recording and replay

network monitoring and request interception

offscreen document compute for ai inference and media encoding

cli interface for headless workflow execution

multi-tab and multi-window coordination

vision-based browser control via computertool

semantic similarity search with onnx-based embeddings

real-time agent chat with streaming tool execution

visual web editor with shadow dom isolation

workflow builder with node-based flow editor

content script injection and dom manipulation

project and session management with sqlite persistence

trigger system and workflow scheduling

Related Artifactssharing capabilities

llm-analysis-assistant

inspector

@modelcontextprotocol/inspector

chrome-devtools-mcp

@mcp-use/inspector

@modelcontextprotocol/inspector-server

Best For

Known Limitations

Requirements

Input / Output

UnfragileRank

Repository Details

About

Categories

Alternatives to mcp-chrome

Are you the builder of mcp-chrome?

Get the weekly brief

Data Sources