Umbrella: merge guard end-to-end + hook contract documentation + governance meta-pattern

> **Status (2026-05-11)**: Phase 1 + 2 + 5 ✅ DONE. Phase 4 tracked as #704. Phase 3 still future work. See [status comment](https://github.com/Synaptic-Labs-AI/PACT-Plugin/issues/677#issuecomment-4417064272) for full follow-up linkage.

## Why this exists

The merge-guard mechanism has two distinct bugs that prevent it from working end-to-end as designed:

- **#665** — `merge_guard_pre._GH_PR_NUMBER_RE` greedy-quantifier picks the wrong PR number from heredoc-style `--body` arguments (also broken by trailing `2>&1`). Produces `"Authorization token exists but does not match this operation"` even with a valid token.
- **#676** — `merge_guard_post.py` reads stdin field `tool_output`; Claude Code sends `tool_response`. **Authorization tokens have never been written automatically by the post-hook.** Every observably-clean PACT merge has gone around the guard (web UI, manual `touch`, or operator typing `gh pr merge` outside the hook surface).

Either bug alone breaks merges. Together they make the merge guard a no-op security theater: when it does block, the failure is misleading; when it appears to work, it's because operators learned workarounds.

This umbrella issue bundles **Option B** — fix both #665 and #676 in a single PR — plus the surrounding gaps that fixing the immediate bugs does NOT address.

## Scope

### Phase 1 — Pair the two bug fixes (the merge guard works end-to-end) ✅ DONE (PR #697)

- [x] **#676**: rename `tool_output` → `tool_response` in `merge_guard_post.py`. Update the test fixtures in `test_merge_guard.py`. Update the docstring header.
- [x] **#665**: fix `merge_guard_pre._GH_PR_NUMBER_RE` greedy-quantifier so it matches the FIRST PR-number-shaped digit sequence after `gh pr` (or use a stricter parse that locates the subcommand and reads its first positional arg). Add tests covering: heredoc bodies with version numbers, dates, test counts; trailing `2>&1`; cross-flag positions.
- [x] **#676 audit cleanup**: fix the 4 misleading docstring headers (`auditor_reminder.py`, `file_size_check.py`, `track_files.py`, `teachback_check.py`) where `"tool_output"` is in the docstring template but the code only reads `tool_input` + `tool_name`.
- [x] **task_lifecycle_gate.py:321** — keep the defensive `tool_response or tool_output or {}` fallback (no harm), add comment explaining WHY + extract to shared `extract_tool_response()` helper threaded into all 4 PostToolUse hooks (delivered in PR #697 cycle-2 commit `e6ae33ee`).

After Phase 1: the merge guard works as designed. AskUserQuestion + "Yes, merge" + `gh pr merge` flow succeeds without manual intervention. ✅ verified in v4.1.8 ship.

### Phase 2 — Behavioral-change communication ✅ DONE (PR #697 + v4.1.8 release)

- [x] Release notes call out the behavioral change explicitly. Operators with muscle memory of "just type `gh pr merge` directly" will hit unexpected denials. (v4.1.8 release notes shipped: https://github.com/Synaptic-Labs-AI/PACT-Plugin/releases/tag/v4.1.8)
- [x] CLAUDE.md memory pins for #665 + #676 retired (no longer apply post-merge).
- [x] Operator-impact comms in release notes: compound destructive commands cannot be authorized atomically; eval+heredoc combinations denied; canonical token schema; API/curl variants require canonical-form authorization.

### Phase 3 — Captured-fixture hardening for ALL hooks ⏳ NOT STARTED

The root cause of #676 is that `test_merge_guard.py`'s 13+ fixtures use `tool_output` — a fictional shape — and the code matches the fixtures. **Synthetic fixtures don't catch platform-shape drift.** Only fixtures captured from real Claude Code stdin during actual hook execution will detect when the platform schema changes underneath us.

- [ ] Apply the `#612` logging-shim pattern (already proven in `bootstrap_marker_writer`'s captured-fixture follow-up #672) to every PostToolUse hook in `pact-plugin/hooks/`:
  - merge_guard_post.py
  - wake_lifecycle_emitter.py
  - task_lifecycle_gate.py
  - auditor_reminder.py
  - file_size_check.py
  - track_files.py
  - teachback_check.py
  - file_tracker.py
- [ ] Each hook gets a captured-from-production fixture in `pact-plugin/tests/fixtures/{hook_name}_stdin.json` representing the actual stdin shape Claude Code sends. The `#612` shim runs on a separate scratch branch, captures during a fresh-startup PACT session, then is removed.
- [ ] Each hook's test suite gets at least one parametrized "fixture-load + dispatch" test that exercises the hook against the captured fixture.
- [ ] CI guards: a parametrized integration test over every PostToolUse hook that fails if a hook reads a stdin field that's not in the captured fixture's shape.

This is the only durable defense against the next silent platform-schema drift. **Needs separate plan-mode + scratch branches per hook; cannot run inside the same session as the fix.**

### Phase 4 — Hook contract documentation ⏳ NOT STARTED — tracked as #704

The convention "PostToolUse hooks read `tool_response`" lives implicitly in `wake_lifecycle_emitter.py`'s docstring. The 4 docstring-drift cases (`auditor_reminder.py` etc.) all carried the same incorrect template line — proving the convention propagates by template-copying without verification. PR #697 swept the immediate drift but the convention itself still needs canonical documentation.

**See #704 for full Phase 4 scope** (includes additions surfaced in PR #697: shared `extract_tool_response` helper pattern, SECURITY/PLANNING_SCAN_PATH_EXCLUDES bootstrap-self-block pattern, strict cross-op match enforcement, deny-compound-destructive pattern, eval+heredoc pre-strip detection).

### Phase 5 — Governance / meta-pattern ✅ DONE (audit comment 2026-05-07)

The merge_guard_post bug and the lead-owns-commits violation pattern (#675) and the persona §2 /clear factual error (closed in PR #671) are all instances of the same meta-pattern: **rules documented in prose with no mechanical enforcement, where the runtime cost of the violation is invisible to operators until something else surfaces it**.

- [x] Inventory current PACT rules that are prose-only (delivered as audit comment 2026-05-07 — categorized 30+ rules into A/B/C/D enforcement-state buckets)
- [x] For each, propose either a test, a hook, or an explicit acceptance test (delivered in audit comment)
- [x] File any net-new mechanical-enforcement issues that fall out of the inventory (C1 + C2 follow-ups recommended; C3 folded into C2)

## Cross-references

**Closed by PR #697 (Phase 1)**:
- **#665** — merge_guard_pre regex bug. CLOSED via PR #697 commit `73410afb` + cycle-1 expansion `04e63dd1`.
- **#676** — merge_guard_post field-name bug + audit findings. CLOSED via PR #697 commit `b74ec540` + sibling docstring sweep `d771af27`.

**Closed by PR #697 (cycle-2 + cycle-3 security remediation)**:
- **#699** F-1 cross-operation token-class mismatch. CLOSED via PR #697 cycle-3 commit `43640e45` (symmetric WRITE+READ + strict-match).
- **#700** F-2 sparse-context wildcard tokens. CLOSED via PR #697 cycle-2 commit `e1ba7ea0`.
- **#701** F-3 compound-command bypass. CLOSED via PR #697 cycle-2 commit `e1ba7ea0` (was IMPROVED-but-RESIDUAL post-cycle-1, now FULLY CLOSED).
- **#702** F-4 eval+heredoc strip-pipeline ordering. CLOSED via PR #697 cycle-2 commit `e1ba7ea0`.

**Open follow-ups from PR #697 review**:
- **#703** F-8 defense-in-depth gaps inventory (TRACKING — TTL, actor binding, HMAC, audit log, rate limiting; deferred-by-design under same-user-trust threat model)
- **#704** Phase 4 SHARED_CONVENTIONS.md hook contract documentation (next umbrella PR)
- **#696** Phase B planning-artifacts bootstrap-aware design (deferred from PR #697 cycle-2; separate from this umbrella's Phase 3/4)
- **#698** Architect HANDOFF test-strategy field + orchestrator concurrent test-engineer dispatch (PACT planning-process improvement; separate from umbrella)
- **#705** Defense #5 hallucination_gate PreToolUse hook (separate; spawned from #685; small standalone PR)

**Original cross-references (pre-PR #697)**:
- **#672** — captured-from-production fixture for `bootstrap_marker_writer` (Phase 3 sets the precedent).
- **#674** — SendMessage-then-TaskUpdate ordering invariant for `task_lifecycle_gate._has_paired_sendmessage`. Phase 5 candidate (covered in audit comment).
- **#675** — mechanically enforce lead-owns-commits via PreToolUse Bash hook. Phase 5 candidate (covered in audit comment).
- **PR #671** (merged) — closes #664, originated this audit.

## Suggested PR sequencing

1. ✅ **PR #1**: Phase 1 + Phase 2. **DONE** as PR #697 → v4.1.8. Original scope expanded mid-flight via 4 cycles of peer-review remediation to also close 6 of 8 security findings (F-1 through F-7 except F-6) — became 14 commits.
2. ⏳ **PR #2**: Phase 4 (`SHARED_CONVENTIONS.md`). **NEXT — tracked as #704.** Zero code change; pure doc.
3. ⏳ **PR #3**: Phase 3 (captured fixtures + integration tests). Larger; needs separate scratch branches per hook for capture, then a consolidation PR with all fixtures. This is the biggest scope and deserves its own architecture-phase planning.
4. ✅ **Phase 5** delivered as audit comment 2026-05-07. C1 + C2 follow-ups recommended in that comment.

## Acceptance for this umbrella

- [x] Phase 1 PR merged; merge guard demonstrably works end-to-end (verified by running `gh pr merge` after AskUserQuestion + "Yes" without manual token-touching).
- [x] Phase 2 release notes shipped on the version bump (v4.1.8 release: https://github.com/Synaptic-Labs-AI/PACT-Plugin/releases/tag/v4.1.8).
- [ ] Phase 3 capture procedure documented and at least the merge_guard_post fixture captured + integration-tested.
- [ ] Phase 4 `SHARED_CONVENTIONS.md` written and pinned. **Tracked as #704.**
- [x] Phase 5 inventory delivered as a comment on this issue with linked follow-up issues.
- [x] Close #665, #676 on Phase 1 PR merge.
- [ ] Close this umbrella when Phases 3 + 4 land OR explicitly deferred-with-justification.

## Out of scope

- Other plugins or external code that may have copied PACT's hook templates. We control PACT; downstream consumers are their own concern.
- Cross-platform Claude Code behavior verification (e.g., does the same bug exist on Linux/Windows? Probably yes since the field name is platform-side, but verifying is out of scope for the immediate fix).
- HMAC / cryptographic upgrades to the merge-guard token (out of scope per the same-user threat model documented in #663's pin).

## Background

This umbrella was filed during the post-merge investigation following PR #671 (#664 hook-driven bootstrap marker write). The merge-guard bug was discovered when the merge for #671 was blocked even after a textually-perfect AskUserQuestion + affirmative answer. Manual token write salvaged the merge; investigation traced the post-hook field-name mismatch and surfaced the broader docstring drift + the meta-pattern of prose-rules-without-mechanical-enforcement that has been visible in the merge guard, in lead-owns-commits (#675), in the SendMessage-then-TaskUpdate ordering invariant (#674), and in the persona §2 /clear factual error (closed in PR #671).


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Umbrella: merge guard end-to-end + hook contract documentation + governance meta-pattern #677

Why this exists

Scope

Phase 1 — Pair the two bug fixes (the merge guard works end-to-end) ✅ DONE (PR #697)

Phase 2 — Behavioral-change communication ✅ DONE (PR #697 + v4.1.8 release)

Phase 3 — Captured-fixture hardening for ALL hooks ⏳ NOT STARTED

Phase 4 — Hook contract documentation ⏳ NOT STARTED — tracked as #704

Phase 5 — Governance / meta-pattern ✅ DONE (audit comment 2026-05-07)

Cross-references

Suggested PR sequencing

Acceptance for this umbrella

Out of scope

Background

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Umbrella: merge guard end-to-end + hook contract documentation + governance meta-pattern #677

Description

Why this exists

Scope

Phase 1 — Pair the two bug fixes (the merge guard works end-to-end) ✅ DONE (PR #697)

Phase 2 — Behavioral-change communication ✅ DONE (PR #697 + v4.1.8 release)

Phase 3 — Captured-fixture hardening for ALL hooks ⏳ NOT STARTED

Phase 4 — Hook contract documentation ⏳ NOT STARTED — tracked as #704

Phase 5 — Governance / meta-pattern ✅ DONE (audit comment 2026-05-07)

Cross-references

Suggested PR sequencing

Acceptance for this umbrella

Out of scope

Background

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions