EurekaLab

Budget‑aware sandbox for autonomous scientific discovery with provenance.

EurekaLab (sandbox_science) wraps untrusted, agent‑generated code in a budget‑aware execution environment. Before a run starts, it estimates the cost and enforces the budget. It also captures every run in a Git‑backed provenance store and audits results for reward hacking. The execution environment itself becomes a first‑class, reusable abstraction.
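To make "estimate and enforce before running" concrete, here is a minimal standalone sketch of a budget gate. It does not use or describe the sandbox_science internals: `CostModel`, `LineCountCostModel`, `BudgetExceeded`, and `run_within_budget` are hypothetical names chosen for illustration, and the pricing formula is a toy assumption.

```python
from dataclasses import dataclass
from typing import Callable, Protocol


class CostModel(Protocol):
    """Hypothetical interface: anything that can price a run up front."""

    def estimate(self, code: str, timeout_seconds: float) -> float: ...


@dataclass
class LineCountCostModel:
    """Toy model (an assumption): cost grows with source length and allowed runtime."""

    cost_per_line: float = 0.1
    cost_per_second: float = 0.5

    def estimate(self, code: str, timeout_seconds: float) -> float:
        lines = len(code.splitlines()) or 1
        return lines * self.cost_per_line + timeout_seconds * self.cost_per_second


class BudgetExceeded(Exception):
    """Raised when the estimate is above the budget, before anything runs."""


def run_within_budget(
    code: str,
    budget: float,
    timeout_seconds: float,
    cost_model: CostModel,
    runner: Callable[[str], object],
) -> object:
    """Price the run first; call the runner only if the estimate fits the budget."""
    estimate = cost_model.estimate(code, timeout_seconds)
    if estimate > budget:
        raise BudgetExceeded(f"estimated cost {estimate:.2f} exceeds budget {budget:.2f}")
    return runner(code)
```

Because the gate only depends on the `estimate` method, any object with that method can be swapped in as the cost model, which is one way a pluggable cost model could be wired up.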

Installation

pip install git+https://github.com/Lumi-node/eureka-lab.git

Requires Python ≥ 3.10. To work on the project locally:

git clone https://github.com/Lumi-node/eureka-lab.git
cd eureka-lab
pip install -e ".[dev]"
pytest -q

Quick Start

import tempfile
from sandbox_science import Sandbox, ExperimentRequest

# A sandbox enforces a cost budget and records provenance
sandbox = Sandbox(workspace=tempfile.mkdtemp())

request = ExperimentRequest(
    code="print('hello from the sandbox')",
    budget=10.0,
    timeout_seconds=5,
    memory_limit_mb=256,
)

result = sandbox.submit(request)
print(result.success)                  # True
print(result.run_log.stdout.strip())   # hello from the sandbox
print(result.cost_actual.total_cost)   # measured cost, <= budget
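For readers curious about what a limit like `timeout_seconds` implies, the sketch below runs a code string in a separate Python process with a wall-clock timeout, using only the standard library. It is an illustration under stated assumptions, not EurekaLab's executor: `RunOutcome` and `run_isolated` are hypothetical names, memory limits are not handled, and a plain subprocess is not a security boundary for untrusted code.

```python
import subprocess
import sys
from dataclasses import dataclass


@dataclass
class RunOutcome:
    """Hypothetical result record; not the library's own result type."""

    success: bool
    stdout: str
    stderr: str
    timed_out: bool


def run_isolated(code: str, timeout_seconds: float) -> RunOutcome:
    """Run `code` in a fresh interpreter and stop it if it exceeds the timeout."""
    try:
        completed = subprocess.run(
            # -I starts Python in isolated mode (ignores env vars and user site dirs).
            [sys.executable, "-I", "-c", code],
            capture_output=True,
            text=True,
            timeout=timeout_seconds,
        )
    except subprocess.TimeoutExpired:
        # subprocess.run kills the child before re-raising, so nothing is left running.
        return RunOutcome(success=False, stdout="", stderr="", timed_out=True)
    return RunOutcome(
        success=completed.returncode == 0,
        stdout=completed.stdout,
        stderr=completed.stderr,
        timed_out=False,
    )


if __name__ == "__main__":
    outcome = run_isolated("print('hello from the sandbox')", timeout_seconds=5)
    print(outcome.success, outcome.stdout.strip())
```

Running the code in a child process keeps a crash or an infinite loop in the submitted code from taking down the caller, which is why the timeout is applied from the outside rather than inside the code being run.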

Features

Budget‑aware execution — estimate and enforce cost limits before a run starts
Git‑backed provenance — every run committed for full reproducibility
Reward‑hacking auditor with cross‑validation
Pluggable cost model and policy engine
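As background for the Git‑backed provenance feature, the sketch below shows how Git derives a content‑addressed ID for a blob, applied to a serialised run record. The source does not say how EurekaLab lays out its store, so `run_record` and its fields are hypothetical; `git_blob_id` follows Git's default SHA‑1 object format (repositories created with the SHA‑256 object format produce different IDs).

```python
import hashlib
import json


def git_blob_id(content: bytes) -> str:
    """Return the SHA-1 object ID Git assigns to a blob with this content."""
    # Git hashes a header ("blob <size>\0") followed by the raw bytes.
    header = b"blob " + str(len(content)).encode("ascii") + b"\x00"
    return hashlib.sha1(header + content).hexdigest()


def run_record(code: str, stdout: str, total_cost: float) -> bytes:
    """Serialise a run deterministically (hypothetical schema) so equal runs give equal bytes."""
    record = {"code": code, "stdout": stdout, "total_cost": total_cost}
    return json.dumps(record, sort_keys=True, separators=(",", ":")).encode("utf-8")


if __name__ == "__main__":
    record = run_record("print('hi')", "hi\n", 0.3)
    print(git_blob_id(record))
```

Content addressing is what makes this style of provenance useful for reproducibility: two runs with identical code, output, and cost map to the same ID, and any change to the record yields a different one.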

Modules

Module	Description
`auditor`	—
`cost_model`	—
`executor`	—
`policy`	—
`provenance`	—
`sandbox`	—

Documentation

📖 Full documentation: https://lumi-node.github.io/eureka-lab/

📄 Technical paper: see paper/ for the LaTeX source and compiled PDF.

This is a reference implementation produced by an autonomous research pipeline. It is not published to PyPI; install from source as shown above.

