CI (main) Live Reports Python Install License
Let's Bring Brain To Robots
Visible household-robot demos driven by MCP tools, reusable skills, and live-agent SDK/direct runtimes.
Roboclaws is a thin demo repo for making AI-driven robotics behavior reviewable: frames, maps, tool traces, scores, and public/private evaluation boundaries are published as HTML reports instead of buried in terminal logs.
It answers three practical questions:
- How can an AI agent drive a robot?
- What context and tools does the agent need?
- What did the agent actually do in the simulated or robot-backed world?
Roboclaws treats reusable robot behavior as skills first and MCP tools as a bounded public robot capability surface.
The short version:
- Public runs use
just run::surfacewithsurface=..., optionalpreset=..., and natural-languageprompt=.... - Skills own task strategy such as map-build, cleanup, and open household goals.
- MCP exposes bounded robot capabilities such as observe, navigate, pick, place, and done.
- Private evaluator truth stays out of agent inputs and public profile metadata.
The detailed profile and skill reference is docs/human/mcp-skills-and-semantic-profiles.md.
Install the project once:
uv sync --extra devThe dev extra includes the standard MolmoSpaces/MuJoCo CPU runtime used by
local cleanup demos. Isaac Lab is scoped to the B1 / Map 12 digital-twin route
and generic local runtime proof; keep it isolated in .venv-isaaclab/ and do
not treat it as part of normal MolmoSpaces demos.
The public command grammar is named-parameter only. Public household launches name the operator-facing surface, world or scene, backend runtime, optional task preset, and agent engine separately:
just run::surface surface=<surface> agent_engine=<engine> [world=<world>] [backend=<backend>] [preset=<preset>] [prompt=<goal>] [key=value ...]For full command routing, profiles, and maintainer-only recipes, read just/README.md.
To monitor and launch the supported local SDK household routes from a standalone browser console, run:
just console::runThe console uses the same world/backend/preset/agent-engine catalog for local SDK/direct runs; it does not accept arbitrary browser-submitted shell commands.
GitHub Actions publishes the report site at
miaodx.com/roboclaws. If a link looks stale,
check the CI workflow:
Pages republishes from successful main runs.
| Demo | Run it locally | Report |
|---|---|---|
| Map build | just run::surface surface=household-world world=molmospaces/val_0 backend=mujoco preset=map-build agent_engine=openai-agents-sdk provider_profile=codex-router-responses evidence_lane=camera-grounded-labels camera_labeler=grounding-dino seed=7 scenario_setup=baseline |
Local artifact today. Use agent_engine=direct-runner only for deterministic contract baselines. |
| Household cleanup | just run::surface surface=household-world world=molmospaces/val_0 backend=mujoco preset=cleanup agent_engine=direct-runner evidence_lane=world-public-labels seed=7 scenario_setup=relocate-cleanup-related-objects relocation_count=5 |
Molmo live index |
| Open household goal | just run::surface surface=household-world world=molmospaces/val_0 backend=mujoco agent_engine=openai-agents-sdk provider_profile=codex-router-responses prompt="find something useful to drink" |
Local artifact today. |
| Planner proof | just run::surface surface=planner-proof world=planner-proof/default backend=mujoco intent=planner-proof agent_engine=direct-runner mode=dry-run |
Local artifact today. |
| Operator console | just console::run |
Local-only operator surface. |
| Maintainer gate | just agent::verify mock |
CI status: workflow |
See ARCHITECTURE.md for the code map and the full operating mode contract.
| Need | Read |
|---|---|
| Code map and operating modes | ARCHITECTURE.md |
| Human setup/runbooks/domain docs | docs/human/README.md |
| Detailed MCP profile reference | docs/human/mcp-skills-and-semantic-profiles.md |
| Eval suites and validation | docs/human/evaluation.md |
| Skill library convention | skills/README.md |
| Public command grammar | just/README.md |
| Local keys and report artifacts | docs/human/local-runtime.md |
| MolmoSpaces settings | docs/human/molmospaces-settings.md |
| Current project focus | STATUS.md |
| Agent operating rules | AGENTS.md |
MIT