hyperbox

Plug-and-play secure execution runtime for AI agent harnesses.

Hyperbox gives you one control plane for running untrusted code with explicit network policy, reusable sessions, managed background processes, and transparent isolation behavior.

Why Hyperbox

Fast iterative loop: reusable sessions avoid recreate/destroy overhead.
Security-first defaults: network=none unless you opt in.
Explicit policy controls: none, allowlist, full.
Managed process lifecycle: detach, list, stream logs, wait, cancel.
Harness-ready interface: CLI + gRPC + stdio adapter mode.
Clear policy introspection: --explain shows backend + enforcement state.

Enterprise Fit

| Control area | Hyperbox behavior today |
| --- | --- |
| Network boundary | Locked by default (`network=none`) |
| Policy expansion | Explicit opt-in (`allowlist` / `full`) |
| Failure semantics | Invalid/unsupported policy combinations fail clearly |
| Isolation transparency | `--explain` prints effective backend and enforcement |
| Repeatability | Deterministic reusable sessions + snapshot lifecycle |
| Long-running tasks | First-class managed process commands (`ps/logs/wait/cancel`) |

Hyperbox is the execution/control layer. Org-specific approval, identity, and compliance workflows are typically layered on top by your host platform.

Quickstart

cargo build -p hyperbox-cli
export PATH="$(pwd)/target/debug:$PATH"

# Locked networking by default
hyperbox run --cmd "python3 -c 'print(2 + 2)'"

# Reuse environment across commands
hyperbox run --profile full --cmd "python3 -m pip install pytest"
hyperbox run --profile full --cmd "pytest -q"

Core Workflows

Reusable sandbox session

hyperbox create --name demo --workspace "$PWD" --template auto
hyperbox run --name demo --cmd "python3 -V"
hyperbox destroy --name demo

Managed background process

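# Start a detached process and capture its JSON description
OUT=$(hyperbox run --cmd "sleep 30" --detach --json)
# Extract the process id from the JSON output
PROC_ID=$(python3 -c 'import json,sys;print(json.loads(sys.argv[1])["process"]["process_id"])' "$OUT")
hyperbox ps                      # list managed processes
hyperbox logs "$PROC_ID"         # stream/read process logs
hyperbox wait "$PROC_ID" --json  # wait for process completion
hyperbox cancel "$PROC_ID"       # cancel the managed process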
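
If your harness is written in Python, the process id can be extracted in-process rather than through a `python3 -c` one-liner. The sketch below assumes only the `process.process_id` shape used in the shell example above; the helper name `extract_process_id` and the sample payload are illustrative, and real output may carry additional fields.

```python
import json


def extract_process_id(detach_output: str) -> str:
    """Return the process id from `hyperbox run --detach --json` output."""
    payload = json.loads(detach_output)
    try:
        return str(payload["process"]["process_id"])
    except (KeyError, TypeError) as exc:
        # Fail loudly instead of passing an empty id to ps/logs/wait/cancel.
        raise ValueError("no process.process_id in detach output") from exc


# Hypothetical sample payload, for illustration only.
sample = '{"process": {"process_id": "proc-123"}}'
print(extract_process_id(sample))
```

The returned id can then be passed to `hyperbox logs`, `hyperbox wait`, or `hyperbox cancel`.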

Network allowlist (exact hosts and wildcard subdomains)

hyperbox run --network allowlist --allow example.com --cmd "python3 -c \"import urllib.request; print(urllib.request.urlopen('https://example.com', timeout=8).status)\""
hyperbox run --network allowlist --allow '*.example.com' --cmd "python3 -c \"import socket; print(socket.gethostbyname('www.example.com'))\""
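
To make the two entry forms concrete, here is a minimal sketch of exact-host and wildcard-subdomain matching. It illustrates the idea only and is not Hyperbox's enforcement code. The function name `host_allowed` is hypothetical, and the sketch assumes that `*.example.com` matches subdomains at any depth but not the bare `example.com`.

```python
def host_allowed(host: str, allowlist: list[str]) -> bool:
    """Check a hostname against exact and `*.`-wildcard allowlist entries."""
    host = host.lower().rstrip(".")
    for entry in allowlist:
        entry = entry.lower().rstrip(".")
        if entry.startswith("*."):
            suffix = entry[1:]  # "*.example.com" -> ".example.com"
            # Assumption: a wildcard covers subdomains only, not the apex.
            if host.endswith(suffix) and len(host) > len(suffix):
                return True
        elif host == entry:
            return True
    return False


rules = ["example.com", "*.example.org"]
print(host_allowed("example.com", rules))
print(host_allowed("www.example.org", rules))
print(host_allowed("example.org", rules))
```

Under these assumptions, allowing both the apex and its subdomains takes two entries, for example `--allow example.com --allow '*.example.com'`.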

Template Auto-Detection

--template auto resolves runtime from command/workspace hints.
run, create, and adapter mode default to auto.
Explicit override is always available (--template python:3.12).

Built-in template families include Python, Node, Go, Rust, and Ubuntu base.
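
As a rough mental model of hint-based resolution, the sketch below guesses a template family from the first word of the command, then from manifest files in the workspace. The hint tables, the precedence order, the fallback to the Ubuntu base, and the name `guess_template_family` are all assumptions made for illustration; the actual detection rules may differ.

```python
import tempfile
from pathlib import Path

# Hypothetical hint tables; the real detection rules are not documented here.
COMMAND_HINTS = {
    "python": "python", "python3": "python", "pip": "python", "pytest": "python",
    "node": "node", "npm": "node",
    "go": "go",
    "cargo": "rust",
}
MANIFEST_HINTS = {
    "pyproject.toml": "python",
    "requirements.txt": "python",
    "package.json": "node",
    "go.mod": "go",
    "Cargo.toml": "rust",
}


def guess_template_family(command: str, workspace: str) -> str:
    """Guess a template family from the command first, then the workspace."""
    tokens = command.split()
    if tokens and tokens[0] in COMMAND_HINTS:
        return COMMAND_HINTS[tokens[0]]
    root = Path(workspace)
    for manifest, family in MANIFEST_HINTS.items():
        if (root / manifest).is_file():
            return family
    return "ubuntu"  # assumed fallback when nothing matches


command_guess = guess_template_family("python3 -V", ".")
with tempfile.TemporaryDirectory() as tmp:
    empty_guess = guess_template_family("ls -la", tmp)
    Path(tmp, "go.mod").write_text("module demo\n")
    manifest_guess = guess_template_family("ls -la", tmp)
print(command_guess, empty_guess, manifest_guess)
```

When a guess like this would be ambiguous for your project, the explicit `--template` override avoids relying on detection at all.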

Security Model

Default profile is locked (network=none).
Allowlist validates host entries and rejects invalid forms.
Unsupported policy/backend combos fail instead of silently downgrading.
Local backend is explicitly non-isolated dev mode.
--workspace intentionally maps host files into sandbox context.
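
A harness can pre-validate allowlist entries before calling Hyperbox so that bad input fails early on the caller's side. The sketch below is one plausible set of rules, not the validation Hyperbox itself performs: it accepts a plain hostname or a single leading `*.` wildcard and rejects schemes, ports, paths, and malformed labels. The name `is_valid_allow_entry` is hypothetical.

```python
import re

# One DNS label: letters, digits, hyphens; no leading or trailing hyphen.
_LABEL = re.compile(r"(?!-)[A-Za-z0-9-]{1,63}(?<!-)")


def is_valid_allow_entry(entry: str) -> bool:
    """Accept `host.example` or `*.host.example`; reject everything else."""
    name = entry[2:] if entry.startswith("*.") else entry
    if not name or len(name) > 253:
        return False
    # Every dot-separated label must be well formed; this also rejects
    # URLs, ports, paths, and wildcards in any other position.
    return all(_LABEL.fullmatch(label) for label in name.split("."))


for candidate in ["example.com", "*.example.com", "https://example.com", "example.com:443"]:
    print(candidate, is_valid_allow_entry(candidate))
```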

Platform Matrix

| Platform | Backend | Status | Notes |
| --- | --- | --- | --- |
| macOS (Apple Silicon) | `apple` (`auto`) | Supported | Backend selected automatically; allowlist availability depends on runtime path |
| Linux (KVM-capable) | `firecracker` (`auto`) | Supported | VM backend with firewall-based policy controls |
| Any OS | `local` | Dev fallback | Non-isolated; intended for development/testing |

CLI Surface

run: execute command in sandbox (--detach for managed process mode)
create: create persistent sandbox
destroy: destroy by id or affinity name
list: list active sandboxes
inspect: inspect sandbox metadata
shell: interactive shell attach/create
templates: list templates
ps: list managed processes
logs: stream/read process logs
wait: wait for process completion
cancel: cancel managed process
snapshot: create, restore, list
serve, probe, setup: host/runtime operations

For command flags:

hyperbox --help
hyperbox run --help
hyperbox create --help
hyperbox ps --help
hyperbox snapshot --help

Real E2E Validation

Run the real-user smoke matrix:

./scripts/e2e_real_smoke.sh

This validates:

create/destroy and session reuse
template detection (manifest + command hints)
ensure-once behavior
key edge failures (policy misconfiguration and invalid templates)

Agent Runtime Patterns

Pattern A: Agent uses Hyperbox as isolated VM execution substrate

Your agent process stays on the host. Every tool call goes into a Hyperbox sandbox/VM.

One-shot task sandbox (create + run bash + write/read + destroy):

cat <<'EOF' | hyperbox proxy --workspace "$PWD" --template auto --network none
{"op":"write","path":"task.sh","content":"#!/usr/bin/env bash\nset -euo pipefail\necho from-sandbox > result.txt\n"}
{"op":"exec","cmd":"bash task.sh","timeout":60}
{"op":"read","path":"result.txt"}
{"op":"destroy"}
EOF
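
When the op stream is generated by an agent rather than typed by hand, building it with a JSON encoder avoids manual escaping of newlines and quotes. The sketch below uses only the four ops and fields shown in the heredoc above (`write`, `exec`, `read`, `destroy`); the helper name `build_proxy_ops` is illustrative.

```python
import json


def build_proxy_ops(script_name: str, script_body: str,
                    output_path: str, timeout: int = 60) -> str:
    """Return newline-delimited JSON ops: write, exec, read, destroy."""
    ops = [
        {"op": "write", "path": script_name, "content": script_body},
        {"op": "exec", "cmd": f"bash {script_name}", "timeout": timeout},
        {"op": "read", "path": output_path},
        {"op": "destroy"},
    ]
    # One JSON object per line, as in the heredoc example.
    return "".join(json.dumps(op) + "\n" for op in ops)


payload = build_proxy_ops(
    "task.sh",
    "#!/usr/bin/env bash\nset -euo pipefail\necho from-sandbox > result.txt\n",
    "result.txt",
)
print(payload, end="")
```

The resulting string can be piped to `hyperbox proxy` on stdin, for example with `subprocess.run([...], input=payload, text=True)`.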

Reusable sandbox for multi-step agent workflows:

hyperbox create --name agent-job --workspace "$PWD" --template auto --network none
hyperbox run --name agent-job --cmd "bash scripts/step1.sh"
hyperbox run --name agent-job --cmd "bash scripts/step2.sh"
hyperbox run --name agent-job --cmd "cat outputs/final.txt"
hyperbox destroy --name agent-job

Pattern B: Agent runtime itself runs inside Hyperbox

Run the full agent process inside a sandbox/VM and only expose the policy you want:

hyperbox run \
  --template python:3.12 \
  --workspace "$PWD" \
  --network allowlist \
  --allow api.openai.com \
  --allow pypi.org \
  --allow files.pythonhosted.org \
  --cmd "python3 agents/main.py"
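
If the host launches this command programmatically, assembling the argument list in code keeps the allowlist in one place. The sketch below uses only the flags shown above; the helper name `build_run_argv` is hypothetical, and omitting `--network` when no hosts are given relies on the documented locked default.

```python
import shlex


def build_run_argv(cmd: str, template: str, workspace: str,
                   allow_hosts: list[str]) -> list[str]:
    """Build a `hyperbox run` argv with one --allow flag per host."""
    argv = ["hyperbox", "run", "--template", template, "--workspace", workspace]
    if allow_hosts:
        argv += ["--network", "allowlist"]
        for host in allow_hosts:
            argv += ["--allow", host]
    # With no hosts, --network is omitted and the locked default applies.
    argv += ["--cmd", cmd]
    return argv


argv = build_run_argv(
    "python3 agents/main.py",
    template="python:3.12",
    workspace=".",
    allow_hosts=["api.openai.com", "pypi.org", "files.pythonhosted.org"],
)
print(shlex.join(argv))
```

Passing `argv` as a list to `subprocess.run` avoids shell quoting issues in the agent command.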

SDK Invocation Flow (Python, host agent controlling sandbox lifecycle)

pip install -e hyperbox-py

from hyperbox import HyperboxClient

with HyperboxClient("127.0.0.1:50051") as client:
    sandbox = client.create_sandbox(
        affinity_name="agent-job",
        template="auto",
        workspace=".",
        network="none",
    )
    client.write_file(
        sandbox.id,
        "task.sh",
        "#!/usr/bin/env bash\nset -euo pipefail\necho from-sdk > result.txt\n",
    )
    run = client.run(
        sandbox_id=sandbox.id,
        command="bash task.sh",
    )
    result = client.read_file(sandbox.id, "result.txt").decode("utf-8")
    print(run.process.status, result)
    client.destroy_sandbox(sandbox.id)
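
In the flow above, the sandbox is left behind if any step raises before `destroy_sandbox` runs. One way to guard against that is a small context manager around the two lifecycle calls. This is a sketch, not part of the SDK: `sandbox_session` is a hypothetical helper, and `FakeClient` is a stand-in so the example runs without a control plane. It relies only on `create_sandbox`, `destroy_sandbox`, and `sandbox.id` as used above.

```python
from contextlib import contextmanager
from types import SimpleNamespace


@contextmanager
def sandbox_session(client, **create_kwargs):
    """Create a sandbox and always destroy it, even if a step fails."""
    sandbox = client.create_sandbox(**create_kwargs)
    try:
        yield sandbox
    finally:
        client.destroy_sandbox(sandbox.id)


class FakeClient:
    """Stand-in for HyperboxClient, used only to demonstrate cleanup."""

    def __init__(self):
        self.destroyed = []

    def create_sandbox(self, **kwargs):
        return SimpleNamespace(id="sb-1", **kwargs)

    def destroy_sandbox(self, sandbox_id):
        self.destroyed.append(sandbox_id)


fake = FakeClient()
try:
    with sandbox_session(fake, affinity_name="agent-job", template="auto",
                         workspace=".", network="none") as sandbox:
        raise RuntimeError("step failed")
except RuntimeError:
    pass
print(fake.destroyed)  # ['sb-1']
```

With a real client, the body of the `with` block would hold the `write_file`, `run`, and `read_file` calls.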

Environment Variables

| Variable | Purpose |
| --- | --- |
| `HYPERBOX_BACKEND` | Force backend (`auto`, `apple`, `firecracker`, `local`) |
| `HYPERBOX_APPLE_RUNTIME` | macOS runtime preference (`containerization` or `virtualization`) |
| `HYPERBOX_APPLE_HELPER` | Override helper command |
| `HYPERBOX_AGENT_ENDPOINT` | Agent endpoint for Firecracker/agent-stream paths |
| `HYPERBOX_AGENT_AUTOSTART` | Disable auto-start sidecar when set to `0`/`false` |
| `HYPERBOX_NETWORK_DRY_RUN` | Firewall dry-run behavior for Firecracker network enforcement |
| `HYPERBOX_LOCAL_ALLOW_UNENFORCED_NETWORK` | Allow non-`none` network in local backend (dev-only) |

Troubleshooting

# Restart local control plane
pkill -f hyperbox || true
./target/debug/hyperbox list

# Start control plane manually
./target/debug/hyperbox serve --addr 127.0.0.1:50051

Development

cargo fmt --all
cargo build --workspace
cargo test --workspace

License

MIT
