cmux Browser Automation Skill

Give your AI coding agent eyes. Let it open a browser, see what's on the page, click buttons, fill forms, read console errors — and fix what's broken — all without leaving the terminal.

Works with Claude Code, Codex, and any CLI-based agent that can run shell commands.

Requires cmux — a free, open-source macOS terminal with a built-in WebKit browser. Download cmux

What your agent can do

Build and verify in one shot — The agent writes a component, opens the browser, snapshots the page, and confirms it renders correctly. No more "it works on my machine" — the agent checks its own work.

Debug visually — Page blank after a deploy? The agent opens it, reads console errors and JS exceptions, traces the bug to the source, fixes it, reloads, and confirms the fix. All in one loop.

Drive full user flows — Login, checkout, onboarding — the agent fills forms, clicks through steps, waits for redirects, and verifies the final state. Like an E2E test, but ad hoc and in natural language.

Inspect anything — Accessibility tree snapshots, screenshots, element queries, computed styles, cookie values, localStorage — the agent can read whatever it needs from the page.

Run JavaScript — Eval expressions, inject scripts, add styles. The agent can poke at window state, trigger app-level functions, or hide debug banners.

What's in the skill file

browser/SKILL.md is a single self-contained file with the complete CLI reference:

Category	Commands
Navigation	`open`, `open-split`, `navigate`, `back`, `forward`, `reload`
Waiting	`wait --load-state`, `--selector`, `--text`, `--url-contains`, `--function`
DOM interaction	`click`, `dblclick`, `hover`, `fill`, `type`, `press`, `select`, `check`, `scroll`
Inspection	`snapshot`, `screenshot`, `get`, `is`, `find`, `highlight`
JavaScript	`eval`, `addscript`, `addinitscript`, `addstyle`
Session state	`cookies`, `storage`, `state save/load`
Tabs & frames	`tab list/new/switch/close`, `frame`
Console & errors	`console list`, `errors list`
Dialogs & downloads	`dialog accept/dismiss`, `download`
General cmux	workspaces, splits, notifications, sidebar status

Use cases

"Build this component and verify it renders"

You: Build a signup form with email and password fields.

Agent: (writes the component, then) Let me verify it renders correctly... (opens http://localhost:3000/signup in a split, takes a snapshot, confirms the form fields are present, fills them in, submits, and checks for the success message)

"Fix this broken page"

You: The dashboard is showing a blank screen after the last deploy.

Agent: Let me check what's happening... (opens the page, runs errors list and console list, finds a TypeError: Cannot read properties of undefined, traces it to the code, fixes it, reloads, and confirms it renders)

"Test the login flow end to end"

You: Make sure the login flow works after your auth changes.

Agent: I'll walk through it... (opens the login page, fills email and password, clicks submit, waits for the redirect, snapshots the dashboard to confirm the user is logged in)

"Check what this page looks like"

You: What does the pricing page look like?

Agent: (opens /pricing in a split, takes a screenshot, and reads the accessibility tree to describe the layout and content)

Setup

Option 1: Ask your AI agent

Open Claude Code (or Codex, or any agent) and tell it:

Download https://raw.githubusercontent.com/darkspock/cmux-skill/main/browser.md and save it to ~/.claude/skills/browser/SKILL.md

For Codex, tell it to save it as browser.md in your project root and reference it in AGENTS.md.

Important: Restart your agent after installing. Skills are loaded at startup, so you need a fresh session for /browser to become available.

If you want it scoped to a single project (shared with your team), save it to .claude/skills/browser/SKILL.md in the project root and commit it.

Option 2: Manual installation

Global (all projects):

mkdir -p ~/.claude/skills/browser && curl -fsSL https://raw.githubusercontent.com/darkspock/cmux-skill/main/browser.md -o ~/.claude/skills/browser/SKILL.md

Project-scoped (shared with your team):

mkdir -p .claude/skills/browser && curl -fsSL https://raw.githubusercontent.com/darkspock/cmux-skill/main/browser.md -o .claude/skills/browser/SKILL.md

Important: Restart your agent after installing for the skill to be available.

Reference

Full command reference: SKILL.md

Official cmux docs: cmux.dev/docs/browser-automation

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
README.md		README.md
browser.md		browser.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

cmux Browser Automation Skill

What your agent can do

What's in the skill file

Use cases

"Build this component and verify it renders"

"Fix this broken page"

"Test the login flow end to end"

"Check what this page looks like"

Setup

Option 1: Ask your AI agent

Option 2: Manual installation

Reference

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

cmux Browser Automation Skill

What your agent can do

What's in the skill file

Use cases

"Build this component and verify it renders"

"Fix this broken page"

"Test the login flow end to end"

"Check what this page looks like"

Setup

Option 1: Ask your AI agent

Option 2: Manual installation

Reference

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Packages