One objective in, a verified result out.
Give it a goal; it runs the full gated pipeline with expert subagents and refuses to declare success until a machine-checkable gate passes.
No extra install: clone the repo, symlink it into your skills directory, then /supergoal <objective>.
Best starting point: the landing page (bilingual English / 한국어, 3-step quickstart).
A Claude Code skill that takes a single objective through a full, gated development process using expert subagents, then refuses to declare success until a machine-checkable gate passes.
Gated lanes, a single shared vault, an untrusted claims.md re-verified by an adversary, and a
literal-bash delivery gate that is never edited to pass. Each role's persona is a bundled file in
agents/, so dispatch is harness-agnostic: it runs the same under Claude Code, Codex, agy, and
other coding CLIs (the orchestrator spawns the persona via the harness's sub-agent mechanism, or runs
it inline where none exists). Nothing to install but the skill itself. (Workflow inspired by
oh-my-symphony.)
New here? Start with the landing page: cskwork.github.io/supergoal-skill. A bilingual (English / 한국어) walkthrough with a 3-step quickstart, the three modes, how the builder-vs-verifier split catches real bugs, and the evidence it produces. Best onboarding path before you clone.
/supergoal detects the mode from your objective:
| Objective looks like | Mode | Pipeline |
|---|---|---|
| "build / ship a new app/tool" | GREENFIELD | Intake -> Validate (market/demand) -> Plan -> Human Feedback -> Build -> Verify -> QA -> Deliver |
| "fix / broken / failing / why does" | DEBUG | Intake -> Reproduce -> Diagnose -> Human Feedback -> Fix -> Verify -> Deliver |
| "add X to our existing/legacy code" | LEGACY | Intake -> Explore -> Plan -> Human Feedback -> Build -> Verify -> QA -> Deliver |
| "explain / understand / teach me X" (learn, no code) | LEARN | Intake -> Source -> Bridge -> Teach loop -> Check (explain-back) -> Journal |
/supergoal build a habit-tracker app and ship it
/supergoal the checkout page hangs intermittently in prod. fix it
/supergoal add SSO to our legacy Django monolith
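The mapping in the table can be pictured as a keyword match on the objective. The sketch below is illustrative only: `detect_mode` is a hypothetical function, the keyword list is copied from the table, and the precedence order (LEARN, then DEBUG, then LEGACY, then GREENFIELD) is an assumption. The skill's real detection lives in SKILL.md and may weigh the objective quite differently.

```bash
# Hypothetical sketch of keyword-based mode detection (not the skill's actual logic).
detect_mode() {
  local objective
  # Lowercase the objective so matching is case-insensitive.
  objective=$(printf '%s' "$1" | tr '[:upper:]' '[:lower:]')
  case "$objective" in
    # Assumed precedence: the first matching row wins.
    *explain*|*understand*|*"teach me"*) echo LEARN ;;
    *fix*|*broken*|*failing*|*"why does"*) echo DEBUG ;;
    *existing*|*legacy*) echo LEGACY ;;
    *build*|*ship*) echo GREENFIELD ;;
    *) echo UNKNOWN ;;
  esac
}

detect_mode "build a habit-tracker app and ship it"
detect_mode "the checkout page hangs intermittently in prod. fix it"
detect_mode "add SSO to our legacy Django monolith"
```

Plain substring matching is deliberately naive: "prefix" would match `fix`, for example. Treat this as a way to read the table, not as a classifier to rely on.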
A single agent given a big objective drifts: it skips validation, trusts its own "done", and leaves
unverified claims. /supergoal imposes the discipline a senior team would (see DESIGN.md and docs/research-brief.md):
- Topology, not preference, picks the architecture. Fan out for wide-and-shallow work (validation, scaffolding); single-driver for deep-and-narrow work (one bug, one feature).
- Builder != Verifier. The agent that writes code never approves it. A fresh adversarial Verify agent re-runs every run-to-prove from a clean state. (claims.md is untrusted.)
- Human Feedback before implementation. After intake/repro/diagnosis/planning, the skill pauses with two briefs: plain language first, then a novice-dev-friendly technical brief with term definitions.
- Two-layer done-gate. Hard gate (tests/lint/build, deterministic) plus a soft committee (architect + security + code-review). The rubric can never override a failing test.
- Gate on the project's own suite (run in the workspace; the Verify agent independently re-runs from a clean state). Never benchmarks, never self-report.
- Bounded retry + circuit breaker. Same error 3x trips the circuit breaker: stop, root-cause, escalate. No infinite loops.
1. Validate-before-build (GREENFIELD).
2. Plan freezes scope.
3. Human Feedback approval.
4. Builder != Verifier.
5. Multi-expert review before deliver.
6. Literal delivery gate (templates/delivery-gate.sh exits 0).
7. Bounded retry + circuit breaker.
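Two of these rules are easy to misread, so here is a minimal sketch of each. Both functions are hypothetical and are not taken from the repo. `done_gate` assumes the hard gate reports an exit code and the committee reports a single verdict string. `run_with_breaker` assumes "same error 3x" means three consecutive failures with identical output.

```bash
# Hypothetical: two-layer done-gate. A failing hard gate blocks delivery
# no matter what the soft committee says.
done_gate() {
  local hard_exit="$1" committee="$2"
  if [ "$hard_exit" -ne 0 ]; then
    echo "blocked: hard gate failed"
    return 1
  fi
  if [ "$committee" != "approve" ]; then
    echo "blocked: committee did not approve"
    return 1
  fi
  echo "done"
}

# Hypothetical: bounded retry with a circuit breaker.
# Returns 0 on success, 1 when the retry budget runs out,
# 2 when the same error output repeats 3 times in a row.
run_with_breaker() {
  local max_attempts="$1"
  shift
  local attempt=1 last_error="" same_count=0 output
  while [ "$attempt" -le "$max_attempts" ]; do
    if output=$("$@" 2>&1); then
      echo "passed on attempt $attempt"
      return 0
    fi
    if [ "$output" = "$last_error" ]; then
      same_count=$((same_count + 1))
    else
      last_error="$output"
      same_count=1
    fi
    if [ "$same_count" -ge 3 ]; then
      echo "circuit breaker tripped: same error 3x; stop, root-cause, escalate"
      return 2
    fi
    attempt=$((attempt + 1))
  done
  echo "retry budget exhausted after $max_attempts attempts"
  return 1
}
```

For example, `done_gate 1 approve` is blocked even though the committee approved, and `run_with_breaker 5 false` stops after the third identical failure instead of using all five attempts.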
This repo is the skill. Put it where Claude Code finds skills:
git clone https://github.com/cskwork/supergoal-skill.git
# then either symlink or copy it into your global skills dir:
ln -s "$(pwd)/supergoal-skill" ~/.claude/skills/supergoal
# or: cp -R supergoal-skill ~/.claude/skills/supergoal
Then in Claude Code: /supergoal <your objective>.
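To confirm the symlink or copy landed where expected, a quick check like the one below may help. `check_skill_install` is a hypothetical helper, not part of the repo. It only assumes that SKILL.md sits at the root of the skill directory, as in the layout listed in this README.

```bash
# Hypothetical helper: verify that the skill directory contains SKILL.md.
# Defaults to the global skills path used in the install commands.
check_skill_install() {
  local skill_dir="${1:-$HOME/.claude/skills/supergoal}"
  # -f follows symlinks, so this works for both the ln -s and cp -R routes.
  if [ -f "$skill_dir/SKILL.md" ]; then
    echo "ok: $skill_dir/SKILL.md found"
  else
    echo "missing: $skill_dir/SKILL.md (check the symlink or copy)"
    return 1
  fi
}
```

Run `check_skill_install` with no arguments after installing. A "missing" result usually points to a symlink created from the wrong working directory.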
SKILL.md thin spine: mode detection, gates, reference map
agents/ one persona file per role (system prompt), harness-agnostic dispatch source of truth
reference/ pipeline · experts · vault · market-research · quality-gates · debugging · qa · domain-rules · plan-grounding · learn
reference/ui-ux.md UI/UX overlay -> taste-skill v2 (reference/taste-skill-v2.md, vendored) as design authority
learn/ LEARN-mode session journals (one file per session) + README template
templates/ delivery-gate.sh · validate-gate.sh · human-feedback-gate.mjs · state.json
DESIGN.md research -> decision mapping (cited)
docs/ research-brief.md · e2e-test-plan.md · changelog/ · index.html (landing)
examples/url-shortener/ a real service the harness built/debugged/extended (audit trail in docs/changelog/)
The three coding modes (GREENFIELD, DEBUG, LEGACY) were each run end-to-end on a real, production-grade service: a zero-dependency URL shortener with 68 tests (see examples/url-shortener/). The audit trail for
each run is in examples/url-shortener/docs/changelog/ (these early run records predate the file-set consolidation).
- GREENFIELD. The adversarial Verify caught 2 real SSRF bypasses ([::ffff:127.0.0.1], localhost.) and an unauth-500 that all passed the builder's own green tests, before shipping.
- DEBUG. Given only a symptom ("hits undercount under load"), it reproduced (200 concurrent -> 1/200), root-caused a lost-update race, stopped at Human Feedback for approval, fixed, and re-verified with anti-flake concurrency runs (0 lost across 10 trials).
- LEGACY. Added link-expiry (TTL) with zero regressions (backward-compatible with records that predate the field), committee-approved, gate-green.
Adversarial verification caught a real defect in 2 of 3 runs.
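The anti-flake re-verification in the DEBUG run amounts to repeating a check and counting failures. A minimal sketch, assuming the check is any command that exits non-zero on failure; `count_failed_trials` is a hypothetical name, and the actual harness and concurrency test are recorded in the changelog, not reproduced here:

```bash
# Hypothetical helper: run a check N times and print how many trials failed.
count_failed_trials() {
  local trials="$1"
  shift
  local i=1 failed=0
  while [ "$i" -le "$trials" ]; do
    # Discard the check's output; only its exit status matters here.
    "$@" >/dev/null 2>&1 || failed=$((failed + 1))
    i=$((i + 1))
  done
  echo "$failed"
}
```

A race-condition fix would be accepted only when a call such as `count_failed_trials 10 <your concurrency test>` prints 0. One green run says little about a flaky bug.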
A separate evidence-only private-codebase benchmark compared plain Codex CLI, /supergoal, and
Codex Goal mode on the same hard backend task with the same hidden scorer. See
docs/experiments/2026-05-30-private-codebase-comparison/.
- /supergoal: passed all hidden checks, focused regressions, neighbor checks, git diff --check, and the delivery gate.
- Codex Goal mode: fixed the main code path and passed focused checks, but missed one hidden fallback/preservation coverage check.
- Plain Codex CLI: produced no usable result (idle run, no solution diff, no final output).
Concept and workflow adapted from oh-my-symphony by cskwork (https://github.com/cskwork/oh-my-symphony). Built for Claude Code.
MIT. See LICENSE.