OpenChrome

Harness-Engineered Browser Automation
The MCP server that guides AI agents.

How OpenChrome compares

	OpenChrome	Playwright MCP	Chrome DevTools MCP	Vercel agent-browser
Architecture	MCP → CDP (direct)	MCP → Playwright → CDP	MCP → Puppeteer → CDP	CLI → Daemon → Playwright → CDP
RAM (20 parallel)	~300 MB	~5 GB+	impractical	impractical
Bot detection	invisible (real Chrome)	detected (TLS fingerprint)	detected (CDP signals)	detected (local) / cloud only
Chrome login reuse	built-in	extension mode only	manual	manual state files
LLM hang prevention	hint engine (30+ rules)	none	none	error rewrite (5 patterns)
Reliability mechanisms	49 (8-layer defense)	~3	~3	~5
Token compression	15x (DOM serializer)	none	none	none
Outcome classification	yes (DOM delta)	none	none	none
Cross-session learning	yes (domain memory)	none	none	none
Circuit breaker	3-level	none	none	none
Shadow DOM	all types (open + closed)	open only	invisible	invisible
MCP native	yes	yes	yes	no (CLI only)
Parallel sessions	1 Chrome, N tabs	N browsers	manual tabs	N daemons

tl;dr — OpenChrome talks directly to Chrome via CDP with zero middleware, reuses your real login sessions, and is the only browser MCP server with harness engineering — 27 intelligent subsystems that guide, protect, and optimize the AI agent at every step.

What is OpenChrome?

Imagine 20+ parallel Playwright sessions, but already logged in to everything, invisible to bot detection, and sharing one Chrome process at ~300 MB. That's OpenChrome.

Search across 20 sites simultaneously. Crawl authenticated dashboards in seconds. Debug production UIs with real user sessions. Connect to OpenClaw and give your AI agent browser superpowers across Telegram, Discord, or any chat platform.

You: oc compare "AirPods Pro" prices across Amazon, eBay, Walmart,
     Best Buy, Target, Costco, B&H, Newegg — find the lowest

AI:  [8 parallel workers, all sites simultaneously]
     Best Buy:  $179 ← lowest (sale)
     Amazon:    $189
     Costco:    $194 (members)
     ...
     Time: 2.8s | All prices from live pages, already logged in.

	Traditional	OpenChrome
5-site task	~250s (login each)	~3s (parallel)
Memory	~2.5 GB (5 browsers)	~300 MB (1 Chrome)
Auth	Every time	Never
Bot detection	Flagged	Invisible

Harness-Engineered, Not Just Automated

Traditional browser automation exposes raw APIs. When the AI agent fails, it's on its own — burning tokens guessing, retrying, and wandering. Harness engineering means the tool itself wraps intelligence around those APIs: preventing mistakes, recovering from errors, and guiding the agent toward efficient behavior.

The bottleneck in browser automation isn't the browser — it's the LLM thinking between each step. Every tool call costs 5–15 seconds of inference time. When an AI agent guesses wrong, it doesn't just fail — it spends another 10 seconds thinking about why, then another 10 seconds trying something else.

Playwright agent checking prices on 5 sites:

  Site 1:  launch browser           3s
           navigate                  2s
           ⚡ bot detection          LLM thinks... 12s → retry with UA
           ⚡ CAPTCHA                LLM thinks... 10s → stuck, skip
           navigate to login         2s
           ⚡ no session             LLM thinks... 12s → fill credentials
           2FA prompt               LLM thinks... 10s → stuck
           ...
           finally reaches product   after ~20 LLM calls, ~4 minutes

  × 5 sites, sequential  =  ~100 LLM calls,  ~20 minutes,  ~$2.00

  Actual work: 5 calls.  Wasted on wandering: 95 calls.

OpenChrome eliminates this entirely — your Chrome is already logged in, and the hint engine corrects mistakes before they cascade:

OpenChrome agent checking prices on 5 sites:

  All 5 sites in parallel:
    navigate (already authenticated)     1s
    read prices                          2s
    ⚡ stale ref on one site
      └─ Hint: "Use read_page for fresh refs"    ← no guessing
    read_page → done                     1s

  = ~20 LLM calls,  ~15 seconds,  ~$0.40

The hint engine watches every tool call across 9 categories — error recovery, blocking page detection, composite suggestions, repetition detection, sequence detection, pagination detection, learned patterns, success guidance, and setup hints. When it sees the same error→recovery pattern 3+ times, it promotes it to a permanent rule across sessions via the Pattern Learner.

	Playwright	OpenChrome	Savings
LLM calls	~100	~20	80% fewer
Wall time	~20 min	~15 sec	80x faster
Token cost	~$2.00	~$0.40	5x cheaper
Wasted calls	~95%	~0%
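The Savings column follows from simple ratios of the illustrative figures in the two traces above. A minimal sketch of that arithmetic, where the `RunCost` type and `compare` helper are hypothetical names and not part of OpenChrome:

```typescript
// Illustrative figures taken from the two traces above; all names are hypothetical.
interface RunCost {
  llmCalls: number;
  wallSeconds: number;
  costCents: number;
}

const playwright: RunCost = { llmCalls: 100, wallSeconds: 20 * 60, costCents: 200 };
const openchrome: RunCost = { llmCalls: 20, wallSeconds: 15, costCents: 40 };

function compare(before: RunCost, after: RunCost) {
  return {
    // Fraction of LLM calls eliminated
    callReduction: (before.llmCalls - after.llmCalls) / before.llmCalls,
    // How many times faster the run completes
    speedup: before.wallSeconds / after.wallSeconds,
    // How many times cheaper the run is
    costRatio: before.costCents / after.costCents,
  };
}

const savings = compare(playwright, openchrome);
console.log(savings); // { callReduction: 0.8, speedup: 80, costRatio: 5 }
```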

27 Harness Features Across 7 Categories

OpenChrome isn't just a browser API — it's an intelligent harness with 27 subsystems that work together:

Category	Key Features	What It Does
Guidance	Hint Engine (30+ rules, 9 types), Progress Tracker, Usage Guide	Prevents mistakes before they cascade
Resilience	Ralph Engine (7-strategy waterfall), Auto-Reconnect, Ref Self-Healing	Recovers from failures automatically
Protection	3-Level Circuit Breaker, Rate Limiter, Domain Guard	Stops runaway token waste
Feedback	Outcome Classifier, DOM Delta, Visual Summary, Hit Detection	Reports what actually happened
Learning	Pattern Learner, Strategy Learner, Domain Memory	Gets smarter across sessions
Optimization	DOM Mode (15x compression), Adaptive Screenshot, Snapshot Delta	Minimizes token consumption
Detection	Auth Redirect Detection, Blocking Page, Pagination Detector	Identifies situations early

Feature highlights

Hint Engine — 30+ rules across 9 categories (error recovery, blocking page detection, repetition loops, pagination, composite suggestions, sequence optimization, learned patterns, success guidance, setup hints). Escalates from info → warning → critical as patterns repeat. The Progress Tracker detects stuck agents within 3-5 tool calls.
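The escalation behavior can be pictured as a per-pattern counter that maps repeat counts to a severity. This is only a sketch: the source states that severity escalates as patterns repeat but gives no thresholds, so the counts below and the `HintEscalator` class are assumptions for illustration.

```typescript
type Severity = "info" | "warning" | "critical";

// Hypothetical thresholds: first occurrence is info, second is warning,
// third and later are critical. The real rules may differ.
function severityFor(repeatCount: number): Severity {
  if (repeatCount >= 3) return "critical";
  if (repeatCount === 2) return "warning";
  return "info";
}

class HintEscalator {
  private counts = new Map<string, number>();

  // Record one more occurrence of a pattern and return the severity to emit.
  observe(patternKey: string): Severity {
    const next = (this.counts.get(patternKey) ?? 0) + 1;
    this.counts.set(patternKey, next);
    return severityFor(next);
  }
}
```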

Ralph Engine — When an interaction fails, Ralph automatically tries 7 strategies in sequence: AX tree click → CSS discovery → CDP coordinate dispatch → JS injection → Keyboard navigation → Raw CDP mouse events → Human-in-the-loop escalation. Each attempt is classified by the Outcome Classifier (SUCCESS / SILENT_CLICK / WRONG_ELEMENT).
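A strategy waterfall of this kind can be sketched as a loop that stops at the first attempt classified as SUCCESS. The `Strategy` interface and `runWaterfall` function are hypothetical, and the attempts are synchronous stubs to keep the example short; real browser interactions would be asynchronous.

```typescript
type Outcome = "SUCCESS" | "SILENT_CLICK" | "WRONG_ELEMENT";

interface Strategy {
  name: string;
  attempt: () => Outcome;
}

interface WaterfallResult {
  winner: string | null;
  tried: string[];
}

// Try each strategy in order; stop at the first one classified as SUCCESS.
function runWaterfall(strategies: Strategy[]): WaterfallResult {
  const tried: string[] = [];
  for (const strategy of strategies) {
    tried.push(strategy.name);
    if (strategy.attempt() === "SUCCESS") {
      return { winner: strategy.name, tried };
    }
  }
  return { winner: null, tried };
}

const names = [
  "AX tree click", "CSS discovery", "CDP coordinate dispatch", "JS injection",
  "Keyboard navigation", "Raw CDP mouse events", "Human-in-the-loop escalation",
];
// Stub attempts: pretend only the fourth strategy succeeds.
const strategies: Strategy[] = names.map((name, i) => ({
  name,
  attempt: (): Outcome => (i === 3 ? "SUCCESS" : "SILENT_CLICK"),
}));

const demo = runWaterfall(strategies);
console.log(demo.winner);       // "JS injection"
console.log(demo.tried.length); // 4
```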

3-Level Circuit Breaker — Element level (3 failures → skip, 2min reset), Page level (5 distinct failures → suggest reload), Global level (10 failures in 5min → pause all). Prevents agents from burning tokens on permanently broken elements.
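The element and global levels can be sketched with two counters and an injectable clock. This is an illustration under assumptions, not OpenChrome's implementation: the class name is hypothetical, the 2-minute reset is measured from the last failure (the source does not say), and the page level is omitted for brevity.

```typescript
const ELEMENT_FAILURE_LIMIT = 3;
const ELEMENT_RESET_MS = 2 * 60 * 1000;
const GLOBAL_FAILURE_LIMIT = 10;
const GLOBAL_WINDOW_MS = 5 * 60 * 1000;

class CircuitBreakerSketch {
  private elementFailures = new Map<string, { count: number; lastFailure: number }>();
  private globalFailures: number[] = [];

  recordFailure(elementKey: string, now: number): void {
    const entry = this.elementFailures.get(elementKey);
    // Start a fresh count if the previous failures are older than the reset window.
    if (!entry || now - entry.lastFailure >= ELEMENT_RESET_MS) {
      this.elementFailures.set(elementKey, { count: 1, lastFailure: now });
    } else {
      entry.count += 1;
      entry.lastFailure = now;
    }
    this.globalFailures.push(now);
  }

  // Element level: skip after 3 failures until 2 minutes have passed.
  shouldSkipElement(elementKey: string, now: number): boolean {
    const entry = this.elementFailures.get(elementKey);
    if (!entry) return false;
    if (now - entry.lastFailure >= ELEMENT_RESET_MS) return false;
    return entry.count >= ELEMENT_FAILURE_LIMIT;
  }

  // Global level: pause everything after 10 failures within 5 minutes.
  shouldPauseAll(now: number): boolean {
    this.globalFailures = this.globalFailures.filter((t) => now - t < GLOBAL_WINDOW_MS);
    return this.globalFailures.length >= GLOBAL_FAILURE_LIMIT;
  }
}
```

Passing `now` explicitly (for example `Date.now()`) keeps the sketch deterministic and easy to test.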

Pattern Learner — When a hint rule misses, the learner observes the next 3 tool calls. If a different tool succeeds, it records the error→recovery correlation. After 3 occurrences at 60%+ confidence, it promotes the pattern to a permanent rule that fires in future sessions.
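The promotion rule can be sketched as a pair of counters per error. How OpenChrome computes confidence is not stated in the source; here it is assumed to be the share of observed recoveries for an error that used a given tool. The class and method names are hypothetical.

```typescript
const MIN_OCCURRENCES = 3;
const MIN_CONFIDENCE = 0.6;

class PatternLearnerSketch {
  // errorKey -> (recoveryTool -> times that tool succeeded after the error)
  private recoveries = new Map<string, Map<string, number>>();
  // errorKey -> total observed recoveries of any kind
  private totals = new Map<string, number>();

  record(errorKey: string, recoveryTool: string): void {
    const perTool = this.recoveries.get(errorKey) ?? new Map<string, number>();
    perTool.set(recoveryTool, (perTool.get(recoveryTool) ?? 0) + 1);
    this.recoveries.set(errorKey, perTool);
    this.totals.set(errorKey, (this.totals.get(errorKey) ?? 0) + 1);
  }

  // Returns the recovery tool to promote to a permanent rule, or null.
  promoted(errorKey: string): string | null {
    const perTool = this.recoveries.get(errorKey);
    const total = this.totals.get(errorKey) ?? 0;
    if (!perTool) return null;
    let winner: string | null = null;
    perTool.forEach((count, tool) => {
      const confident = count / total >= MIN_CONFIDENCE;
      if (winner === null && count >= MIN_OCCURRENCES && confident) winner = tool;
    });
    return winner;
  }
}
```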

DOM Mode — Serializes the full DOM into a compact text format: strips SCRIPT/STYLE/SVG, keeps only 18 actionable attributes, deduplicates repetitive siblings, collapses nested wrapper chains. Benchmarked: ~12K tokens vs ~180K tokens for the same page (15x compression).
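To make the compaction steps concrete, here is a rough sketch over a simplified tree. It covers tag stripping, attribute filtering, and wrapper collapsing; sibling deduplication is left out. The `DomNode` type is hypothetical, and `KEPT_ATTRS` is an assumed subset because the source does not list the 18 attributes.

```typescript
interface DomNode {
  id: number; // stands in for backendNodeId
  tag: string;
  attrs: Record<string, string>;
  text?: string;
  children: DomNode[];
}

const SKIPPED_TAGS = new Set(["script", "style", "svg"]);
// Assumed subset; the actual list of 18 attributes is not given in the source.
const KEPT_ATTRS = ["type", "href", "placeholder", "aria-label", "name", "value"];

function serialize(node: DomNode, depth = 0, out: string[] = []): string[] {
  const tag = node.tag.toLowerCase();
  if (SKIPPED_TAGS.has(tag)) return out;

  const attrs = KEPT_ATTRS
    .filter((name) => name in node.attrs)
    .map((name) => ` ${name}="${node.attrs[name]}"`)
    .join("");

  // Collapse wrappers: no kept attributes, no text, exactly one child.
  const isWrapper = attrs === "" && !node.text && node.children.length === 1;
  if (isWrapper) return serialize(node.children[0], depth, out);

  out.push(`${"  ".repeat(depth)}[${node.id}]<${tag}${attrs}/>${node.text ?? ""}`);
  for (const child of node.children) serialize(child, depth + 1, out);
  return out;
}

const page: DomNode = {
  id: 1, tag: "DIV", attrs: { class: "wrap" }, children: [
    { id: 2, tag: "div", attrs: {}, children: [
      { id: 3, tag: "button", attrs: { type: "submit", class: "btn" }, text: "Search", children: [] },
    ] },
  ],
};
console.log(serialize(page).join("\n")); // [3]<button type="submit"/>Search
```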

Desktop App (Beta)

OpenChrome is also available as a desktop app — a one-click installer that runs the MCP server locally without requiring Node.js, npm, or any command-line setup. Designed for non-developers who want browser automation without the terminal.

Note: These are unsigned builds. See installation notes below.

Download

Platform	Download
macOS (Apple Silicon)	OpenChrome_0.1.0_aarch64.dmg
macOS (Intel)	OpenChrome_0.1.0_x64.dmg
Windows (EXE)	OpenChrome_0.1.0_x64-setup.exe
Windows (MSI)	OpenChrome_0.1.0_x64_en-US.msi
Linux	Coming soon (deb/rpm available in Releases)

Get Started (non-developers)

1. Download the installer for your platform from the Releases page.
2. Install: open the .dmg, run the .exe installer, or make the .AppImage executable and launch it.
3. Connect: the app starts the MCP server automatically. Point your MCP client (Claude, Cursor, etc.) to the local server address shown in the app.

Installation Notes

macOS: The app is not notarized. On first launch, macOS will block it. To fix:

xattr -cr /Applications/OpenChrome.app

Or right-click the app → Open → Open.

Windows: SmartScreen will show "Windows protected your PC". Click "More info" → "Run anyway".

Linux: No additional steps needed. Download the AppImage, make it executable (chmod +x), and run.

Note: The desktop app and the CLI (openchrome-mcp on npm) are separate distributions with independent version numbers. You do not need both — use whichever fits your workflow. See desktop/RELEASING.md for the desktop release process.

Quick Start

Claude Code

npx openchrome-mcp setup

Codex CLI

npx openchrome-mcp setup --client codex

One command configures the MCP server for the selected client. Restart your MCP client after setup completes.

Manual config

Claude Code:

claude mcp add openchrome -- npx -y openchrome-mcp@latest serve --auto-launch

VS Code / Copilot (.vscode/mcp.json):

{
  "servers": {
    "openchrome": {
      "type": "stdio",
      "command": "npx",
      "args": ["-y", "openchrome-mcp@latest", "serve", "--auto-launch"]
    }
  }
}

Codex CLI (~/.codex/mcp.json):

{
  "mcpServers": {
    "openchrome": {
      "command": "npm",
      "args": ["exec", "--yes", "--prefer-online", "openchrome-mcp@latest", "--", "serve", "--auto-launch"]
    }
  }
}

Cursor / Windsurf / Other stdio MCP clients:

{
  "mcpServers": {
    "openchrome": {
      "command": "npx",
      "args": ["-y", "openchrome-mcp@latest", "serve", "--auto-launch"]
    }
  }
}

Some stdio clients wrap npx through npm exec and may parse flags differently. If your client misinterprets -y, prefer a client-specific command shape (for example the Codex CLI config above) or run a locally installed openchrome binary directly.

Examples

Parallel monitoring:

oc screenshot AWS billing, GCP console, Stripe, and Datadog — all at once
→ 4 workers, 3.1s, already authenticated everywhere

Multi-account:

oc check orders on personal and business Amazon accounts simultaneously
→ 2 workers, isolated sessions, same site different accounts

Competitive intelligence:

oc compare prices for "AirPods Pro" across Amazon, eBay, Walmart, Best Buy
→ 4 workers, 4 sites, 2.4s, works past bot detection

46 Tools

Category	Tools
Navigate & Interact	`navigate`, `interact`, `fill_form`, `find`, `computer`
Read & Extract	`read_page`, `page_content`, `javascript_tool`, `selector_query`, `xpath_query`
Environment	`emulate_device`, `geolocation`, `user_agent`, `network`
Storage & Debug	`cookies`, `storage`, `console_capture`, `performance_metrics`, `request_intercept`
Parallel Workflows	`workflow_init`, `workflow_collect`, `worker_create`, `batch_execute`
Memory	`memory_record`, `memory_query`, `memory_validate`

Full tool list (46)

navigate interact computer read_page find form_input fill_form javascript_tool page_reload page_content page_pdf wait_for user_agent geolocation emulate_device network selector_query xpath_query cookies storage console_capture performance_metrics request_intercept drag_drop file_upload http_auth worker_create worker_list worker_update worker_complete worker_delete tabs_create tabs_context tabs_close workflow_init workflow_status workflow_collect workflow_collect_partial workflow_cleanup execute_plan batch_execute lightweight_scroll memory_record memory_query memory_validate oc_stop

CLI

oc setup                    # Auto-configure
oc serve --auto-launch      # Start server
oc serve --headless-shell   # Headless mode
oc doctor                   # Diagnose issues

Cross-Platform

Platform	Status
macOS	Full support
Windows	Full support (taskkill process cleanup)
Linux	Full support (Snap paths, `CHROME_PATH` env, `--no-sandbox` for CI)

DOM Mode (Token Efficient)

read_page supports three output modes:

Mode	Output	Tokens	Use Case
`ax` (default)	Accessibility tree with `ref_N` IDs	Baseline	Screen readers, semantic analysis
`dom`	Compact DOM with `backendNodeId`	~5-10x fewer	Click, fill, extract — most tasks
`css`	CSS diagnostic info (variables, computed styles, framework detection)	Minimal	Debugging styles, Tailwind detection

DOM mode example:

read_page tabId="tab1" mode="dom"

[page_stats] url: https://example.com | title: Example | scroll: 0,0 | viewport: 1920x1080

[142]<input type="search" placeholder="Search..." aria-label="Search"/> ★
[156]<button type="submit"/>Search ★
[289]<a href="/home"/>Home ★
[352]<h1/>Welcome to Example

DOM mode outputs [backendNodeId] as stable identifiers — they persist for the lifetime of the DOM node, unlike ref_N IDs which are cleared on each AX-mode read_page call.

Stable Selectors

Action tools that accept a ref parameter (form_input, computer, etc.) support three identifier formats:

Format	Example	Source
`ref_N`	`ref_5`	From `read_page` AX mode (ephemeral)
Raw integer	`142`	From `read_page` DOM mode (stable)
`node_N`	`node_142`	Explicit prefix form (stable)

Backward compatible — existing ref_N workflows work unchanged. DOM mode's backendNodeId eliminates "ref not found" errors caused by stale references.
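The three identifier formats in the table can be told apart with two regular expressions. The `parseRef` function and `ParsedRef` type below are hypothetical and only illustrate the distinction; they are not OpenChrome's actual parser.

```typescript
type ParsedRef =
  | { kind: "ax"; index: number }            // ref_N, ephemeral
  | { kind: "backendNode"; nodeId: number }; // raw integer or node_N, stable

function parseRef(ref: string | number): ParsedRef {
  const text = String(ref).trim();
  let match = /^ref_(\d+)$/.exec(text);
  if (match) return { kind: "ax", index: Number(match[1]) };
  // A bare integer and the node_ prefix form both name a backendNodeId.
  match = /^(?:node_)?(\d+)$/.exec(text);
  if (match) return { kind: "backendNode", nodeId: Number(match[1]) };
  throw new Error(`Unrecognized ref format: ${text}`);
}

console.log(parseRef("ref_5"));    // { kind: "ax", index: 5 }
console.log(parseRef(142));        // { kind: "backendNode", nodeId: 142 }
console.log(parseRef("node_142")); // { kind: "backendNode", nodeId: 142 }
```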

Session Persistence

Headless mode (--headless-shell) doesn't persist cookies across restarts. Enable storage state persistence to maintain authenticated sessions:

oc serve --persist-storage                         # Enable persistence
oc serve --persist-storage --storage-dir ./state    # Custom directory

Cookies and localStorage are saved atomically every 30 seconds and restored on session creation.

Anti-Bot & Turnstile Support

OpenChrome includes built-in defenses against Cloudflare Turnstile and similar anti-bot systems. See Turnstile Guide for details.

3-Tier Auto-Fallback for CDN/WAF Blocks

When a navigation is blocked by CDN/WAF systems (Akamai, Cloudflare, etc.), OpenChrome automatically escalates through three tiers:

Tier	Mode	What It Bypasses
1	Headless Chrome	Normal navigation — works for most sites
2	Stealth + Headless	JS-level anti-bot (PerimeterX, Turnstile, basic fingerprinting)
3	Headed Chrome	TLS/UA-level blocking (Akamai CDN, network security filters)

Tier 3 launches a real headed Chrome window with a genuine user-agent (Chrome/... instead of HeadlessChrome/...) and a different TLS fingerprint, bypassing binary-level detection that no JavaScript injection can fix.

Parameters:

autoFallback: false — disable all automatic retry
headed: true — skip directly to Tier 3 (headed Chrome)
stealth: true — use stealth mode (Tier 2) explicitly

Environment: Tier 3 requires a display (macOS/Windows desktop, or Linux with $DISPLAY). In server/container environments without a display, Tier 3 is gracefully skipped.
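How the three parameters and the display requirement interact can be sketched as a function that returns the ordered tiers to try. The `planTiers` function is hypothetical. It assumes that `stealth: true` starts at Tier 2 and may still escalate, and that `headed: true` without a display yields an empty plan; the source does not specify either behavior.

```typescript
type Tier = 1 | 2 | 3;

interface NavigateOptions {
  autoFallback?: boolean; // treated as true when omitted
  headed?: boolean;
  stealth?: boolean;
}

// Returns the ordered list of tiers to try for one navigation.
function planTiers(options: NavigateOptions, hasDisplay: boolean): Tier[] {
  let start: Tier = 1;
  if (options.headed) start = 3;
  else if (options.stealth) start = 2;

  const all: Tier[] = [1, 2, 3];
  let tiers = all.filter((tier) => tier >= start);
  // With auto-fallback disabled, only the starting tier is attempted.
  if (options.autoFallback === false) tiers = tiers.slice(0, 1);
  // Tier 3 needs a display; skip it where none is available.
  if (!hasDisplay) tiers = tiers.filter((tier) => tier !== 3);
  return tiers;
}

console.log(planTiers({}, true));                // [1, 2, 3]
console.log(planTiers({ stealth: true }, true)); // [2, 3]
console.log(planTiers({}, false));               // [1, 2]
```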

Known Limitations

CAPTCHA-protected sites (e.g., Reddit): Auto-fallback correctly detects the block and escalates through all tiers. However, some sites serve CAPTCHA challenges ("Prove your humanity") to all automated clients regardless of headless or headed mode, and these require a human to solve. Interactive CAPTCHAs are outside auto-fallback's scope, which targets network-level CDN/WAF blocking (TLS fingerprint, user-agent detection).

FAQ: Comparison with Other Browser MCPs

Common questions from users evaluating OpenChrome against Chrome DevTools MCP, Firefox DevTools MCP, and similar tools (see #612).

Can multiple MCP clients share tabs safely?

Yes — tabs cannot clobber each other across clients.

OpenChrome identifies every tab by its CDP targetId — a stable, browser-assigned string — not by a visible 1/2/3 index. On top of stable IDs, two layers of isolation are specifically designed for multi-client scenarios:

workerId — logical tab groups per client or parallel lane. A new tab under one workerId is invisible to another and never replaces one.
profileDirectory — launches a fully separate Chrome instance bound to that profile, giving OS-level cookie / storage / extension isolation.

If client A opens five tabs and client B opens five tabs, all ten tabIds are distinct and stable; a new tab from A can never displace B's tab #3.
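The isolation idea can be sketched as a registry keyed by workerId that stores CDP targetIds rather than positional indexes. The `TabRegistrySketch` class is hypothetical and shows the bookkeeping only, not OpenChrome's actual session manager.

```typescript
class TabRegistrySketch {
  // workerId -> set of targetIds owned by that worker
  private tabsByWorker = new Map<string, Set<string>>();

  register(workerId: string, targetId: string): void {
    const tabs = this.tabsByWorker.get(workerId) ?? new Set<string>();
    tabs.add(targetId);
    this.tabsByWorker.set(workerId, tabs);
  }

  // A worker only ever sees its own tabs.
  list(workerId: string): string[] {
    const tabs = this.tabsByWorker.get(workerId);
    return tabs ? Array.from(tabs) : [];
  }

  owns(workerId: string, targetId: string): boolean {
    return this.tabsByWorker.get(workerId)?.has(targetId) ?? false;
  }
}

const registry = new TabRegistrySketch();
registry.register("clientA", "target-a1");
registry.register("clientB", "target-b1");
registry.register("clientA", "target-a2"); // does not affect clientB's tabs
console.log(registry.list("clientB")); // ["target-b1"]
```

Because tabs are looked up by ID and never by position, adding a tab for one worker cannot shift or replace another worker's tabs.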

How do I handle sites that require interactive login (password, 2FA, CAPTCHA)?

Two mechanisms, used together in practice:

1. Persistent-profile headless — log in once, automate forever. Point OpenChrome at a persistent userDataDir (+ optional profileDirectory) and cookies / localStorage / IndexedDB survive across runs. After the first login, subsequent headless runs stay logged in until the site invalidates the session.

2. Headed fallback for the initial login or WAF-blocked sites. When a human action is genuinely required (first-time login, 2FA, CAPTCHA, WebAuthn) or when a Tier-1/Tier-2 headless attempt is blocked by a CDN/WAF, OpenChrome lazy-launches a separate headed Chrome on a different debug port. Cookies are synced from the real Chrome profile before launch, and the headed page is registered back into the same logical session so the surrounding workflow continues seamlessly.

Typical pattern: bootstrap the login once via the headed path → the persistent profile carries the cookies → all subsequent automation runs headless without a window.

Does OpenChrome steal focus with popup windows?

No — the "recurring popup interruptions" problem does not occur in OpenChrome.

The headed-browser focus-stealing pattern that users encounter with some MCP servers (cross-Space jumps on macOS, un-minimizable popups, per-tool-call window raises) comes from designs where the MCP drives a user-visible browser and creates OS windows as it works. OpenChrome is architected differently:

Default mode is headless — no window exists, so there is nothing to take focus.
tabs_create opens a tab, not an OS window. New tabs are created via CDP inside the already-running Chrome, and OpenChrome never calls page.bringToFront() anywhere in the codebase.
The headed fallback is lazy and reused. When it is needed, it launches once per server lifetime; every later navigation/tab runs inside that same instance. At worst you see one focus grab the very first time the fallback activates — not one per action, never one per tab.
The headed fallback is optional. If persistent-profile headless is sufficient for your sites, you will never see a browser window.

The only scenario in which a window appears at all is the first time the Tier-3 headed fallback activates in a given session.

Benchmarks

Measure token efficiency and parallel performance:

npm run benchmark                                    # Stub mode: AX vs DOM token efficiency (interactive)
npm run benchmark:ci                                 # Stub mode: AX vs DOM with JSON + regression detection
npm run benchmark -- --mode real                     # Real mode: actual MCP server (requires Chrome)
npx ts-node tests/benchmark/run-parallel.ts          # Stub mode: all parallel benchmark categories
npx ts-node tests/benchmark/run-parallel.ts --mode real --category batch-js --runs 1  # Real mode
npx ts-node tests/benchmark/run-parallel.ts --mode real --category realworld --runs 1  # Real-world benchmarks

By default, benchmarks run in stub mode — measuring protocol correctness and tool-call counts with mock responses. Use --mode real to spawn an actual MCP server subprocess and measure real performance (requires Chrome to be available).

Parallel benchmark categories:

Category	What It Measures
Multi-step interaction	Form fill + click sequences across N parallel pages
Batch JS execution	N × `javascript_tool` vs 1 × `batch_execute`
Compiled plan execution	Sequential agent tool calls vs single `execute_plan`
Streaming collection	Blocking vs `workflow_collect_partial`
Init overhead	Sequential `tabs_create` vs batch `workflow_init`
Fault tolerance	Circuit breaker recovery speed
Scalability curve	Speedup efficiency at 1–50x concurrency
Real-world	Multi-site crawl, heavy JS, pipeline, scalability with public websites (`httpbin.org`, `jsonplaceholder`, `example.com`) — NOT included in `all`, requires network

Server / Headless Deployment

OpenChrome works on servers and in CI/CD pipelines without Chrome login. All 46 tools function with unauthenticated Chrome — navigation, scraping, screenshots, form filling, and parallel workflows all work in clean sessions.

Quick start

# Single flag for optimal server defaults
openchrome serve --server-mode

--server-mode automatically does the following:

Auto-launches Chrome in headless mode
Skips cookie bridge scanning (~5s faster per page creation)
Applies optimal defaults for server environments

What works without login

Category	Tools
Navigation & scraping	`navigate`, `read_page`, `page_content`, `javascript_tool`
Interaction	`interact`, `fill_form`, `drag_drop`, `file_upload`
Parallel workflows	`workflow_init` with multiple workers, `batch_execute`
Screenshots & PDF	`computer(screenshot)`, `page_pdf`
Network & performance	`request_intercept`, `performance_metrics`, `console_capture`

Important: MCP client required

OpenChrome is an MCP server — it responds to tool calls, not standalone scripts. Server-side usage requires an MCP client (e.g., Claude API, Claude Code, or a custom MCP client) to drive it:

MCP Client (LLM) → stdio → OpenChrome (--server-mode) → Chrome

For standalone scraping scripts without an LLM, use Playwright or Puppeteer directly.

Docker

A production-ready Dockerfile is included in the repository:

docker build -t openchrome .
docker run openchrome

Environment variables

Variable	Description
`CHROME_PATH`	Path to Chrome/Chromium binary (used by launcher)
`CHROME_BINARY`	Path to Chrome binary (used by `--chrome-binary` CLI flag)
`CHROME_USER_DATA_DIR`	Custom profile directory
`CI`	Detected automatically; adds `--no-sandbox`
`DOCKER`	Detected automatically; adds `--no-sandbox`
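A sketch of how these variables might translate into launch settings follows. The `launchConfigFromEnv` function and `LaunchConfig` type are hypothetical, and treating any non-empty `CI` or `DOCKER` value as "detected" is an assumption; the source does not state the exact detection rule.

```typescript
interface LaunchConfig {
  chromePath?: string;
  userDataDir?: string;
  args: string[];
}

function launchConfigFromEnv(env: Record<string, string | undefined>): LaunchConfig {
  const args: string[] = [];
  // Assumption: any non-empty CI or DOCKER value counts as detected.
  if (env.CI || env.DOCKER) args.push("--no-sandbox");
  return {
    chromePath: env.CHROME_PATH,
    userDataDir: env.CHROME_USER_DATA_DIR,
    args,
  };
}

const ciConfig = launchConfigFromEnv({ CI: "true", CHROME_PATH: "/usr/bin/chromium" });
console.log(ciConfig.args); // ["--no-sandbox"]
```

In a Node.js process the same function could be called with `process.env`.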

Individual flags

For fine-grained control, use individual flags instead of --server-mode:

openchrome serve \
  --auto-launch \
  --headless-shell \
  --port 9222

Flag	Default	Description
`--auto-launch`	`false`	Auto-launch Chrome if not running
`--headless-shell`	`false`	Use chrome-headless-shell binary
`--visible`	`false`	Show Chrome window (disables headless)
`--server-mode`	`false`	Compound flag for server deployment

Authentication

OpenChrome supports per-tenant API keys, JWT/OAuth, a legacy shared token, and an unauthenticated mode. See docs/auth.md for the full guide including quickstart, scope table, key rotation, revocation, and troubleshooting.

Under the Hood: 8-Layer Reliability

OpenChrome has 49 distinct reliability mechanisms across 8 defense layers — ensuring no single failure can hang the MCP server.

┌─────────────────────────────────────────────────────────────┐
│  Layer 7: MCP Gateway                                       │
│  Rate limiter · Tool timeout (120s) · Error recovery hints  │
├─────────────────────────────────────────────────────────────┤
│  Layer 6: Session Management                                │
│  TTL cleanup · Memory pressure · Target reconciliation      │
├─────────────────────────────────────────────────────────────┤
│  Layer 5: Request Queue                                     │
│  Per-session FIFO · Per-item timeout (120s)                 │
├─────────────────────────────────────────────────────────────┤
│  Layer 4: Circuit Breaker                                   │
│  Element (3 fails) · Page (5 fails) · Global (10/5min)      │
├─────────────────────────────────────────────────────────────┤
│  Layer 3: CDP Client                                        │
│  Adaptive heartbeat · Stale target guard · Page defenses    │
├─────────────────────────────────────────────────────────────┤
│  Layer 2: Reconnection Engine                               │
│  Auto-reconnect (5 retries) · Exponential backoff · Cookie  │
│  restore · Sleep/wake detection                             │
├─────────────────────────────────────────────────────────────┤
│  Layer 1: Self-Healing                                      │
│  Chrome watchdog · Tab health monitor · Event loop monitor  │
│  Disk monitor · Health endpoint (/health, /metrics)         │
├─────────────────────────────────────────────────────────────┤
│  Layer 0: Process Lifecycle                                 │
│  Graceful shutdown · Orphan cleanup · Atomic file writes    │
└─────────────────────────────────────────────────────────────┘

32 configurable timeouts cover every operation from CDP commands (15s) to tool execution (120s) to Chrome launch (60s). Every timeout is independently tunable via src/config/defaults.ts.
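As an illustration of the Layer 2 reconnection schedule, the sketch below computes exponential backoff delays for the five retries. Only the retry count comes from the diagram; the 500 ms base delay, the 8 s cap, and the `backoffDelays` helper are assumptions.

```typescript
const MAX_RETRIES = 5; // from the Layer 2 description

// Base delay and cap are assumptions for illustration.
function backoffDelays(baseMs = 500, capMs = 8000): number[] {
  const delays: number[] = [];
  for (let attempt = 0; attempt < MAX_RETRIES; attempt++) {
    // Double the delay on each attempt, never exceeding the cap.
    delays.push(Math.min(capMs, baseMs * 2 ** attempt));
  }
  return delays;
}

console.log(backoffDelays()); // [500, 1000, 2000, 4000, 8000]
```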

Element Intelligence

OpenChrome finds elements from natural-language descriptions instead of CSS selectors:

"Submit button" → normalizeQuery → parseQueryForAX → AX Tree Resolution
                                                          │
                                                     match found?
                                                     /         \
                                                   yes          no
                                                    │            │
                                              [AX result]   CSS Fallback
                                                             + Shadow DOM
                                                             + Scoring

AX-first: Uses Chrome's accessibility tree — framework-agnostic across React, Angular, Vue, Web Components
Cascading filter: 4-level deterministic priority (exact role+name → role+contains → exact name → partial)
3-tier Shadow DOM: Open roots (JS) + closed roots (CDP) + user-agent roots
Hit detection: After clicking, reports what was actually hit + nearest interactive element
i18n: Korean role keywords built-in ("버튼" → button, "링크" → link, "드롭다운" → combobox)
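The 4-level cascading filter can be sketched as an ordered list of predicates, returning the matches from the first level that has any. The `AxNode` type and `cascadeMatch` function are hypothetical, and case-insensitive name comparison is an assumption.

```typescript
interface AxNode {
  role: string;
  name: string;
}

// Returns matches from the highest-priority level that has any.
function cascadeMatch(nodes: AxNode[], role: string | null, name: string): AxNode[] {
  const want = name.toLowerCase();
  const levels: Array<(n: AxNode) => boolean> = [
    // 1. exact role + exact name
    (n) => role !== null && n.role === role && n.name.toLowerCase() === want,
    // 2. exact role + name contains the query
    (n) => role !== null && n.role === role && n.name.toLowerCase().includes(want),
    // 3. exact name, any role
    (n) => n.name.toLowerCase() === want,
    // 4. partial name, any role
    (n) => n.name.toLowerCase().includes(want),
  ];
  for (const level of levels) {
    const hits = nodes.filter(level);
    if (hits.length > 0) return hits;
  }
  return [];
}

const axNodes: AxNode[] = [
  { role: "button", name: "Submit order" },
  { role: "link", name: "Submit" },
  { role: "button", name: "Cancel" },
];
console.log(cascadeMatch(axNodes, "button", "submit")); // [{ role: "button", name: "Submit order" }]
console.log(cascadeMatch(axNodes, null, "submit"));     // [{ role: "link", name: "Submit" }]
```

Because the levels are checked in a fixed order, the same query against the same tree always resolves to the same element set.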

Development

git clone https://github.com/shaun0927/openchrome.git
cd openchrome
npm install && npm run build && npm test

License

MIT
