Skip to content

cskwork/supergoal-skill

Repository files navigation

/supergoal

One objective in, a verified result out. Give it a goal; it runs the full gated pipeline with expert subagents and refuses to declare success until a machine-checkable gate passes. No extra install: clone the repo, symlink it into your skills directory, then /supergoal <objective>. Best starting point: the landing page (bilingual English / 한국어, 3-step quickstart).

A Claude Code skill that takes a single objective through a full, gated development process using expert subagents, then refuses to declare success until a machine-checkable gate passes.

Gated lanes, a single shared vault, an untrusted claims.md re-verified by an adversary, and a literal-bash delivery gate that is never edited to pass. Each role's persona is a bundled file in agents/, so dispatch is harness-agnostic: it runs the same under Claude Code, Codex, agy, and other coding CLIs (the orchestrator spawns the persona via the harness's sub-agent mechanism, or runs it inline where none exists). Nothing to install but the skill itself. (Workflow inspired by oh-my-symphony.)

New here? Start with the landing page -> cskwork.github.io/supergoal-skill A bilingual (English / 한국어) walkthrough with a 3-step quickstart, the three modes, how the builder-vs-verifier split catches real bugs, and the evidence it produces. Best onboarding path before you clone.

Modes

/supergoal detects the mode from your objective:

Objective looks like Mode Pipeline
"build / ship a new app/tool" GREENFIELD Intake -> Validate (market/demand) -> Plan -> Human Feedback -> Build -> Verify -> QA -> Deliver
"fix / broken / failing / why does" DEBUG Intake -> Reproduce -> Diagnose -> Human Feedback -> Fix -> Verify -> Deliver
"add X to our existing/legacy code" LEGACY Intake -> Explore -> Plan -> Human Feedback -> Build -> Verify -> QA -> Deliver
"explain / understand / teach me X" (learn, no code) LEARN Intake -> Source -> Bridge -> Teach loop -> Check (explain-back) -> Journal
/supergoal build a habit-tracker app and ship it
/supergoal the checkout page hangs intermittently in prod. fix it
/supergoal add SSO to our legacy Django monolith

Why it exists

A single agent given a big objective drifts: it skips validation, trusts its own "done", and leaves unverified claims. /supergoal imposes the discipline a senior team would (see DESIGN.md and docs/research-brief.md):

  • Topology, not preference, picks the architecture. Fan out for wide-and-shallow work (validation, scaffolding); single-driver for deep-and-narrow work (one bug, one feature).
  • Builder != Verifier. The agent that writes code never approves it. A fresh adversarial Verify agent re-runs every run-to-prove from a clean state. (claims.md is untrusted.)
  • Human Feedback before implementation. After intake/repro/diagnosis/planning, the skill pauses with two briefs: plain language first, then a novice-dev-friendly technical brief with term definitions.
  • Two-layer done-gate. Hard gate (tests/lint/build, deterministic) plus a soft committee (architect + security + code-review). The rubric can never override a failing test.
  • Gate on the project's own suite (run in the workspace; the Verify agent independently re-runs from a clean state). Never benchmarks, never self-report.
  • Bounded retry + circuit breaker. Same error 3x trips the circuit breaker: stop, root-cause, escalate. No infinite loops.

The non-negotiable gates

  1. Validate-before-build (GREENFIELD). 2. Plan freezes scope. 3. Human Feedback approval.
  2. Builder != Verifier. 5. Multi-expert review before deliver.
  3. Literal delivery gate (templates/delivery-gate.sh exits 0). 7. Bounded retry + circuit breaker.

Install

This repo is the skill. Put it where Claude Code finds skills:

git clone https://github.com/cskwork/supergoal-skill.git
# then either symlink or copy it into your global skills dir:
ln -s "$(pwd)/supergoal-skill" ~/.claude/skills/supergoal
# or: cp -R supergoal-skill ~/.claude/skills/supergoal

Then in Claude Code: /supergoal <your objective>.

Layout

SKILL.md            thin spine: mode detection, gates, reference map
agents/             one persona file per role (system prompt), harness-agnostic dispatch source of truth
reference/          pipeline · experts · vault · market-research · quality-gates · debugging · qa · domain-rules · plan-grounding · learn
reference/ui-ux.md  UI/UX overlay -> taste-skill v2 (reference/taste-skill-v2.md, vendored) as design authority
learn/              LEARN-mode session journals (one file per session) + README template
templates/          delivery-gate.sh · validate-gate.sh · human-feedback-gate.mjs · state.json
DESIGN.md           research -> decision mapping (cited)
docs/               research-brief.md · e2e-test-plan.md · changelog/ · index.html (landing)
examples/url-shortener/   a real service the harness built/debugged/extended (audit trail in docs/changelog/)

Proof it works (live validation)

All three modes were run end-to-end on a real, production-grade service (a zero-dependency URL shortener, see examples/url-shortener/, 68 tests). The audit trail for each run is in examples/url-shortener/docs/changelog/ (these early run records predate the file-set consolidation).

  • GREENFIELD. The adversarial Verify caught 2 real SSRF bypasses ([::ffff:127.0.0.1], localhost.) and an unauth-500 that all passed the builder's own green tests, before shipping.
  • DEBUG. Given only a symptom ("hits undercount under load"), it reproduced (200 concurrent -> 1/200), root-caused a lost-update race, stopped at Human Feedback for approval, fixed, and re-verified with anti-flake concurrency runs (0 lost across 10 trials).
  • LEGACY. Added link-expiry (TTL) with zero regressions (backward-compatible with records that predate the field), committee-approved, gate-green.

Adversarial verification caught a real defect in 2 of 3 runs.

A separate evidence-only private-codebase benchmark compared plain Codex CLI, /supergoal, and Codex Goal mode on the same hard backend task with the same hidden scorer. See docs/experiments/2026-05-30-private-codebase-comparison/.

  • /supergoal: passed all hidden checks, focused regressions, neighbor checks, git diff --check, and the delivery gate.
  • Codex Goal mode: fixed the main code path and passed focused checks, but missed one hidden fallback/preservation coverage check.
  • Plain Codex CLI: produced no usable result: idle run, no solution diff, no final output.

Credit

Concept and workflow adapted from oh-my-symphony by cskwork (https://github.com/cskwork/oh-my-symphony). Built for Claude Code.

License

MIT. See LICENSE.

About

One objective in, a verified result out. A Claude Code skill that runs a full, gated dev process with expert subagents and refuses to declare done until a machine-checkable gate passes. Bilingual (EN/한국어) onboarding & live walkthrough: https://cskwork.github.io/supergoal-skill/

Resources

License

Stars

Watchers

Forks

Packages

 
 
 

Contributors