senpai

Autonomous ML research loop powered by Claude Code agents coordinated through GitHub PRs. Point it at a problem, deploy advisor + student agents on k8s, and let them iterate.

How it works

An advisor pod creates experiment PRs and assigns them to student GPU pods. Students implement, train, and report; the advisor merges winners and closes dead ends. GitHub labels route work, and W&B tracks metrics.

senpai is problem-agnostic. The pod entrypoint clones the configured problem-package repo into target/, so agent commits and PRs land in that external repo, not here.

Problem packages

Repo	Status	Notes
`morganmcg1/tandemfoil2`	Active	TandemFoilSet velocity prediction, branch `kagent_royal_rumble`
`morganmcg1/icml2026`	Archive	ICML 2026 CFD multi-dataset harness
`morganmcg1/cfd_tandemfoil_v1`	Archive	Original v1 TandemFoil package

W&B Dashboard

Architecture

graph TD
    subgraph K8s["Kubernetes Cluster"]
        A["Advisor Pod<br/>(Claude Code, no GPU)<br/>Creates hypothesis PRs<br/>Reviews results, merges/closes"]
        subgraph Students["Student Deployments (one per GPU node)"]
            S1["frieren<br/>8x GPU"]
            S2["fern<br/>8x GPU"]
            S3["tanjiro<br/>8x GPU"]
            S4["..."]
        end
        A -->|"GitHub PRs<br/>(draft → review → merge/close)"| Students
    end
    K8s --> GH["GitHub<br/>PRs = hypotheses<br/>Labels = routing"]
    K8s --> WB["Weights & Biases<br/>Metrics, runs, groups"]

PR lifecycle

graph TD
    A["Advisor creates draft PR"] -->|"student:name + status:wip"| B["Student picks up PR"]
    B --> C["Implements hypothesis, runs experiments"]
    C -->|"status:review"| D["Advisor reviews"]
    D -->|Merge| E["Improvement lands on advisor branch"]
    D -->|Request changes| F["status:wip — student iterates"]
    D -->|Close| G["Dead end, branch deleted"]
    F --> B

Repo layout

senpai/
├── senpai.yaml                    # Project config: problem-package repo/branch + launch defaults
├── target/                        # Problem package clone (empty by default)
│   ├── train.py                   #   Training script + model
│   ├── program.md                 #   Research context, metrics, constraints
│   ├── data.py / data/            #   Data pipeline
│   └── instructions/              #   Task-specific Claude Code prompt templates
│       ├── prompt-advisor.md
│       └── prompt-student.md
├── system_instructions/           # System-level Claude Code instructions (run the role)
│   ├── CLAUDE-ADVISOR.md
│   └── CLAUDE-STUDENT.md
├── k8s/                           # Kubernetes deployment (problem-agnostic)
│   ├── launch.py                  #   Deploy advisor + student pods
│   ├── advisor-deployment.yaml
│   ├── student-deployment.yaml
│   ├── entrypoint-advisor.sh
│   └── entrypoint-student.sh
├── Dockerfile
└── .claude/                       # Claude Code skills and agents

Important: agent commits and PRs land in the problem-package repo, never in wandb/senpai.

Configuration

All project settings live in senpai.yaml:

problem_dir: target/
repo_url: https://github.com/wandb/senpai.git
repo_branch: main
target_repo_url: https://github.com/morganmcg1/tandemfoil2.git
target_repo_branch: main
advisor_branch: schmidhuber
gh_history_scope: branch
human_issues: true
image: ghcr.io/wandb/senpai:latest
pvc_claim_name: new-pvc
pvc_mount_path: /mnt/new-pvc
wandb_entity: wandb-applied-ai-team
wandb_project: senpai-v1
timeout_minutes: 30.0
max_epochs: 50
n_students: 4
student_prefix: ""
gpus_per_student: 8
cpu_per_gpu: 15
memory_gi_per_gpu: 120
preflight_only: false

launch.py reads this via simple_parsing — every field can be overridden on the CLI.

Launch credentials

launch.py resolves and preflights these for real launches and --preflight_only, then writes them to a per-tag Secret named senpai-launch-secrets-<tag>:

Env var	Pod env	Resolution
`GITHUB_TOKEN`	`GITHUB_TOKEN`	shell env -> `.env` -> `gh auth token`
`ANTHROPIC_API_KEY`	`ANTHROPIC_API_KEY`	shell env -> `.env`
`EXA_API_KEY`	`EXA_API_KEY`	shell env -> `.env`

Use example.env for local setup:

cp example.env .env
# edit .env and set GITHUB_TOKEN, ANTHROPIC_API_KEY, and EXA_API_KEY

Notes: --dry_run renders redacted manifests without resolving or preflighting credentials. Real launches pass the Secret manifest to kubectl apply via stdin, but Kubernetes Secrets are still readable to anyone with namespace Secret read access. Delete launch resources when done: kubectl delete deployments,configmaps,secrets -l research-tag=<tag>.

GitHub token requirements: use a PAT with repo and read:org; it must clone target_repo_url and push/open PRs there.

GitHub History Scope

--target_repo_branch is the branch in the problem-package repo used as the base when --advisor_branch does not already exist. Leave it empty to use the target repo's default branch.

--gh_history_scope branch is the default: pods clone only the advisor branch while keeping that branch's history. Use --gh_history_scope repo to clone the full target repo, or --gh_history_scope fresh for a shallow single-branch clone. Use --extra_instructions for any agent-facing guidance about what history to use or ignore.

Research Modes

Isolated ablation: use a unique --tag, unique --advisor_branch, --gh_history_scope fresh, --student_prefix, and --nohuman_issues. Agents see only the routed branch/PR stream unless explicitly told otherwise.
Normal branch memory: use --gh_history_scope branch with human issues enabled. Agents keep continuity on the active advisor branch while routine PR/issue polling stays scoped to the target repo.
Deliberate exploration: use --gh_history_scope repo or targeted --extra_instructions that ask the advisor/researcher-agent to inspect other branches, PRs, issues, W&B runs, or repos. Senpai helpers stay scoped to GH_REPO, but explicit gh --repo owner/repo ... reads are available when credentials allow.

Shared cluster secret (`senpai-secrets`)

Every pod also reads WANDB_API_KEY from the shared senpai-secrets Secret:

# Create
kubectl create secret generic senpai-secrets --from-literal=wandb-api-key="$WANDB_API_KEY"

# Rotate
kubectl patch secret senpai-secrets --type=merge -p "{\"stringData\":{\"wandb-api-key\":\"$WANDB_API_KEY\"}}"

launch.py does not preflight W&B; missing keys crash-loop pods.

Running

# Clone the active problem-package repo into target/ (one-time, for local dev)
git clone -b kagent_royal_rumble https://github.com/morganmcg1/tandemfoil2.git target/

# Train locally (inside the active problem package; copy exact flags from --help)
cd target/ && python train.py --help
cd target/ && python train.py --wandb_name "<name>/<description>"

# Deploy to k8s (reads defaults from senpai.yaml, only --tag is required)
python k8s/launch.py --tag <research-tag> --advisor

python k8s/launch.py --tag <research-tag> --advisor --n_students 7 --pvc_mount_path "/mnt/pai-amf1-cfd"
python k8s/launch.py --tag <research-tag> --n_students 7 --dry_run
python k8s/launch.py --tag <research-tag> --advisor --extra_instructions "Only consider optimizer changes."
python k8s/launch.py --tag <research-tag> --advisor --target_repo_branch icml-appendix-charlie --advisor_branch icml-appendix-charlie-rerun-r1 --gh_history_scope fresh --extra_instructions no-history.md

# Parallel launches: use unique tags, plus --student_prefix when runs share student names
python k8s/launch.py --tag <tag-a> --advisor --student_prefix a
python k8s/launch.py --tag <tag-b> --advisor --student_prefix b

# Stop a launch
kubectl delete deployments,configmaps,secrets -l research-tag=<research-tag>

Adding a new problem

Create a new public repo (e.g. myorg/my_problem) with the minimum problem-package layout:
- train.py — training script + model (entry point for students)
- data.py or data/ — data pipeline
- program.md — research context, metrics, constraints, file-edit boundaries
- instructions/prompt-advisor.md, instructions/prompt-student.md
- a working branch (e.g. main or royal_rumble) that advisors merge into

Point senpai's config at it — the pod entrypoint will clone it for you:

# edit senpai.yaml:
#   target_repo_url: https://github.com/myorg/my_problem.git
#   target_repo_branch: <base-branch>
#   advisor_branch: <advisor-branch>
git add senpai.yaml && git commit -m "Point senpai at my_problem"

Or pass on the CLI: --target_repo_url ... --target_repo_branch ... --advisor_branch ....

Deploy as usual — python k8s/launch.py --tag <tag> --advisor. Agent commits/PRs will land in myorg/my_problem, not senpai.

References

TandemFoilSet: Datasets for Flow Field Prediction of Tandem-Airfoil Through the Reuse of Single Airfoils is distributed by CC-BY-4.0.

@inproceedings{
lim2026tandemfoilset,
title={{TandemFoilSet}: Datasets for Flow Field Prediction of Tandem-Airfoil Through the Reuse of Single Airfoils},
author={Wei Xian Lim and Loh Sher En Jessica and Zenong Li and Thant Zin Oo and Wai Lee Chan and Adams Wai-Kin Kong},
booktitle={The Fourteenth International Conference on Learning Representations},
year={2026},
url={https://openreview.net/forum?id=4Z0P4Nbosn}
}

Name		Name	Last commit message	Last commit date
Latest commit History 353 Commits
.agents/skills		.agents/skills
.claude		.claude
.codex		.codex
.github/workflows		.github/workflows
.reuse/templates		.reuse/templates
analysis		analysis
article/kagent		article/kagent
cla/signatures/version1		cla/signatures/version1
k8s		k8s
papers		papers
plugins/senpai		plugins/senpai
system_instructions		system_instructions
tests		tests
.gitignore		.gitignore
.mcp.json		.mcp.json
AGENTS.md		AGENTS.md
CLAUDE.md		CLAUDE.md
CONTRIBUTING.md		CONTRIBUTING.md
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
animated_chart.gif		animated_chart.gif
cla.md		cla.md
example.env		example.env
pyproject.toml		pyproject.toml
senpai.yaml		senpai.yaml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

senpai

How it works

Problem packages

Architecture

PR lifecycle

Repo layout

Configuration

Launch credentials

GitHub History Scope

Research Modes

Shared cluster secret (`senpai-secrets`)

Running

Adding a new problem

References

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

senpai

How it works

Problem packages

Architecture

PR lifecycle

Repo layout

Configuration

Launch credentials

GitHub History Scope

Research Modes

Shared cluster secret (senpai-secrets)

Running

Adding a new problem

References

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Shared cluster secret (`senpai-secrets`)

Packages