fix(test): update check-orchestrator spec for 4-check baseline#225

Merged
operatoruplift merged 1 commit into master from fix/check-orchestrator-spec-count on Apr 28, 2026

@operatoruplift
Owner

Summary

Adding fabrication-rot as the 4th grep-guard in #224 broke the orchestrator acceptance spec. The test scaffolds fake check scripts and asserts "X passed, Y failed, Z skipped" totals; those counts shifted with the new check.

Three failing scenarios fixed:

| scenario | was | now |
| --- | --- | --- |
| fail-fast | "0 passed, 1 failed, 2 not run" | "3 not run" |
| --all | "1 passed, 2 failed" | "2 passed, 2 failed" |
| mix | "1 passed, 1 failed, 1 skipped" | "1 passed, 1 failed, 2 skipped" |

Each scenario that previously scaffolded 3 scripts now scaffolds 4, adding a fabrication-rot-check entry. The skips-missing-scripts test also asserts the new [fabrication-rot] SKIP line.

The orchestrator contract (run, skip-if-missing, fail-propagate, fail-fast) is unchanged; only the count of registered checks moves.
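
To make the count shift concrete, here is a minimal sketch of the contract described above (run, skip-if-missing, fail-propagate, fail-fast). It is not the repository's orchestrator: the `Check`, `Summary`, `runChecks`, and `formatSummary` names are hypothetical, script execution is replaced by an in-memory callback, and the rule that zero "skipped" and "not run" segments are omitted is only inferred from the three summary strings quoted in this PR.

```typescript
// Hypothetical model of the orchestrator contract; all names are illustrative.
type Outcome = "pass" | "fail";

interface Check {
  name: string;        // e.g. "fabrication-rot"
  exists: boolean;     // stands in for "the check script is present on disk"
  run: () => Outcome;  // stands in for spawning the script and reading its exit code
}

interface Summary {
  passed: number;
  failed: number;
  skipped: number;
  notRun: number;
}

// Runs checks in order. Without `all`, the first failure stops the run
// (fail-fast) and every remaining check is counted as "not run".
function runChecks(checks: Check[], all: boolean): Summary {
  const summary: Summary = { passed: 0, failed: 0, skipped: 0, notRun: 0 };
  let stopped = false;
  for (const check of checks) {
    if (stopped) {
      summary.notRun += 1;
    } else if (!check.exists) {
      summary.skipped += 1; // skip-if-missing
    } else if (check.run() === "pass") {
      summary.passed += 1;
    } else {
      summary.failed += 1; // fail-propagate: a caller would exit non-zero if failed > 0
      stopped = !all;
    }
  }
  return summary;
}

// Formats the totals. Omitting zero "skipped" / "not run" segments is an
// assumption inferred from the strings quoted in the PR description.
function formatSummary(s: Summary): string {
  const parts = [`${s.passed} passed`, `${s.failed} failed`];
  if (s.skipped > 0) parts.push(`${s.skipped} skipped`);
  if (s.notRun > 0) parts.push(`${s.notRun} not run`);
  return parts.join(", ");
}

const pass = (): Outcome => "pass";
const fail = (): Outcome => "fail";
const names = ["copy-check", "capability-check", "trust-gate", "fabrication-rot"];

// Fail-fast: the first of four registered checks fails, so three are never run.
const failFast = runChecks(
  names.map((name) => ({ name, exists: true, run: fail })),
  false,
);
console.log(formatSummary(failFast)); // "0 passed, 1 failed, 3 not run"
```

With three registered checks the same fail-fast scenario yields "2 not run", which is why only the expected totals had to change while the contract itself stayed the same.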

Test plan

The orchestrator test scaffolds fake check scripts and asserts the
"X passed, Y failed, Z skipped" summary. Adding fabrication-rot as the
4th check in #224 shifted those counts and broke 3 of 5 tests:

  fail-fast:   "0 passed, 1 failed, 2 not run" -> 3 not run
  --all:       "1 passed, 2 failed"            -> 2 passed, 2 failed
  mix:         "1 passed, 1 failed, 1 skipped" -> 1 passed, 1 failed, 2 skipped

Each scenario that previously scaffolded 3 scripts now scaffolds 4,
adding a fabrication-rot-check entry, and the expected counts are
adjusted to match. The skips-missing-scripts test now also asserts the
new SKIP line for fabrication-rot.

The orchestrator contract (run, skip-if-missing, fail-propagate,
fail-fast) is unchanged; only the count of registered checks moves.
@vercel

vercel Bot commented Apr 28, 2026

The latest updates on your projects. Learn more about Vercel for GitHub.

| Project | Deployment | Actions | Updated (UTC) |
| --- | --- | --- | --- |
| website | Ready | Preview, Comment | Apr 28, 2026 9:24am |

@operatoruplift operatoruplift merged commit 18c3a0a into master Apr 28, 2026
4 checks passed
operatoruplift added a commit that referenced this pull request Apr 28, 2026
…sion net (#226)

CLAUDE.md still listed 3 grep-guards and only 4 dashboard pages covered
by the honesty net. After this batch's PRs (#194-#225), the actual
state is:

- 4 grep-guards: copy-check, capability-check, trust-gate, fabrication-rot.
  The new one is described inline so a reader who runs into a
  fabrication-rot failure can jump to scripts/fabrication-rot-check.mjs
  for the rule + PR pointer.
- dashboard-honesty.spec.ts now covers 7 surfaces (added /memory,
  /integrations, /agents/builder, /settings on top of the original
  /app, /notifications, /workflows).
- request-id-runtime.spec.ts is at 17 probes including the new 410-Gone
  endpoints from #209 (/api/risk, /api/gold).

Pure docs update. pnpm check passes (4/4).
operatoruplift added a commit that referenced this pull request Apr 28, 2026
)

Update the "Honesty regression net" section to accurately describe
what's locked in after the local-machine claim cleanups landed in
PRs #225-#240:

- PR range bumped from #147-#224 to #147-#240
- consumer-copy.spec.ts banned-phrase list documented in full,
  including the 8 retired local-machine claims added in #240
- fabrication-rot-check.mjs documented as 20 rules (was 10), with
  the new local-machine patterns enumerated alongside the original
  retired-fabrication catalog (Gold Agent, Webacy, x402, random-
  vector, expires-in-30-days, fake-install)

This is documentation-only, with no source changes. pnpm check still
passes 4/4 after the description is brought up to date.
operatoruplift added a commit that referenced this pull request Apr 28, 2026
After PR #242 retired "Runs on your computer" from meta description,
Open Graph, Twitter card, and the JSON-LD SoftwareApplication block,
add explicit assertions to consumer-copy.spec.ts so a future PR
cannot silently revert the change.

OG metadata test:
- meta description must NOT contain "Runs on your computer" (was
  the centerpiece fabrication PR #242 retired)
- og:description meta tag asserted not to contain it either
- twitter:description meta tag asserted not to contain it either

JSON-LD test:
- ld+json blob must NOT contain "Runs on your computer"
- ld+json blob MUST contain "WebApplication" (the @type was changed
  from SoftwareApplication, since the web app is browser-based, not
  a Windows/macOS/Linux installer today)

Both tests already validated "drafts your email" presence + banned
sci-fi phrases; this extends them to also catch a regression of the
local-machine fabrication that was the largest meta-level surface
in the PR #225-#242 sweep.

Verification:
- pnpm exec playwright test tests/e2e/consumer-copy.spec.ts: 8/8 pass
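
The assertions listed above can be sketched without a browser, as plain string checks over a page's HTML. This is an illustration only: the real tests live in consumer-copy.spec.ts and run under Playwright, whereas `metaContent`, `jsonLdBlobs`, and `checkMetadata` below are hypothetical helpers, the regexes assume `name`/`property` appears before `content` in each tag, and the sample page is a made-up fixture.

```typescript
// Phrase retired by PR #242 (taken from the commit message above).
const BANNED = "Runs on your computer";

// Returns the content attribute of a <meta> tag, or null if the tag is absent.
// Assumes the name/property attribute precedes content within the tag.
function metaContent(html: string, attr: "name" | "property", key: string): string | null {
  const re = new RegExp(`<meta[^>]*\\b${attr}="${key}"[^>]*\\bcontent="([^"]*)"`, "i");
  const match = html.match(re);
  return match ? match[1] : null;
}

// Collects the raw text of every <script type="application/ld+json"> block.
function jsonLdBlobs(html: string): string[] {
  const re = /<script[^>]*type="application\/ld\+json"[^>]*>([\s\S]*?)<\/script>/gi;
  const blobs: string[] = [];
  let match: RegExpExecArray | null;
  while ((match = re.exec(html)) !== null) {
    blobs.push(match[1]);
  }
  return blobs;
}

// Returns a list of human-readable problems; an empty list means the page is clean.
// A missing meta tag is not reported here, only a tag that contains the banned phrase.
function checkMetadata(html: string): string[] {
  const problems: string[] = [];
  const metas: Array<["name" | "property", string]> = [
    ["name", "description"],
    ["property", "og:description"],
    ["name", "twitter:description"],
  ];
  for (const [attr, key] of metas) {
    const content = metaContent(html, attr, key);
    if (content !== null && content.includes(BANNED)) {
      problems.push(`${key} contains "${BANNED}"`);
    }
  }
  const ld = jsonLdBlobs(html).join("\n");
  if (ld.includes(BANNED)) problems.push(`ld+json contains "${BANNED}"`);
  if (!ld.includes("WebApplication")) problems.push("ld+json is missing WebApplication");
  return problems;
}

// Made-up fixture, not the site's actual markup.
const page = `
<meta name="description" content="Drafts your email and waits for your tap.">
<meta property="og:description" content="Drafts your email and waits for your tap.">
<meta name="twitter:description" content="Drafts your email and waits for your tap.">
<script type="application/ld+json">{"@type":"WebApplication"}</script>`;

console.log(checkMetadata(page)); // []
```

Reverting the @type to SoftwareApplication in the fixture makes `checkMetadata` report the missing WebApplication marker, which is the regression the JSON-LD test is meant to catch.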
operatoruplift added a commit that referenced this pull request Apr 28, 2026
…d state (#247)

The repo README's headline claim contradicted the line two sections
below it ("This is a Next.js 16 app deployed on Vercel"). Having two
internally inconsistent claims about hosting on the same page is the
exact pattern the homepage honesty sweep (#225-#246) was retiring.

Three changes:

1. Top pitch paragraph:
   - "It runs on your computer (not a cloud), and every action waits
     for your tap." was replaced with "The web app pauses for your
     tap before any action and emits a signed receipt afterward. The
     desktop+Ollama build on the roadmap removes the provider hop
     entirely for users who need full on-device inference."
   - This matches the homepage Privacy Policy + Compliance + blog
     post copy that landed in the sweep.

2. "Pick any AI" bullet:
   - "Claude, ChatGPT, Gemini, Grok, or a model running on your
     laptop via Ollama" -> "Claude, ChatGPT, Gemini, Grok per turn
     (the desktop build adds Ollama on the roadmap)"
   - Presenting Ollama as a roadmap item is honest; framing it as
     already shipped was not.

3. CI checks section:
   - "grep-guards: copy-check, capability-check, trust-gate" ->
     adds fabrication-rot (the 4th guard, covering 20 anchored
     patterns of retired fabrications).
   - The honesty-regression description now lists local-machine
     claims (Runs on your computer, AES-256 encrypted local storage,
     Your agent ran locally, Encrypted on your computer) alongside
     LLM Council / Gold Agent / dev-jargon items.

Verification: pnpm check 4/4 passes (20 fabrication-rot rules clean,
no source-text changes that would trip a guard).
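
For readers unfamiliar with grep-guards, the sketch below shows the general shape of such a check: a list of rules, each pairing an id with a pattern, scanned line by line over source text. It does not reproduce scripts/fabrication-rot-check.mjs. The `Rule`, `SourceFile`, `Violation`, and `scan` names are hypothetical, only two illustrative rules are included (built from phrases quoted in the commit message above), and the real catalog's 20 patterns and their anchoring are not shown in this thread.

```typescript
interface Rule {
  id: string;
  pattern: RegExp;
}

interface SourceFile {
  path: string;
  content: string;
}

interface Violation {
  ruleId: string;
  path: string;
  line: number; // 1-based line number of the match
  text: string;
}

// Two illustrative rules; the real catalog has 20 and may use different patterns.
const rules: Rule[] = [
  { id: "runs-on-your-computer", pattern: /Runs on your computer/i },
  { id: "aes-256-local-storage", pattern: /AES-256 encrypted local storage/i },
];

// Scans every line of every file against every rule and collects the matches.
// The patterns carry no `g` flag, so RegExp.test keeps no state between calls.
function scan(files: SourceFile[], ruleset: Rule[]): Violation[] {
  const violations: Violation[] = [];
  for (const file of files) {
    file.content.split("\n").forEach((text, index) => {
      for (const rule of ruleset) {
        if (rule.pattern.test(text)) {
          violations.push({ ruleId: rule.id, path: file.path, line: index + 1, text: text.trim() });
        }
      }
    });
  }
  return violations;
}

// Made-up input reusing the README sentence that this commit replaced.
const hits = scan(
  [{ path: "README.md", content: "# README\nIt runs on your computer (not a cloud)." }],
  rules,
);
for (const v of hits) {
  console.log(`[fabrication-rot] ${v.path}:${v.line} ${v.ruleId}`);
}
// prints "[fabrication-rot] README.md:2 runs-on-your-computer"
```

A guard built this way would exit non-zero whenever `hits` is non-empty, so a retired claim cannot re-enter the source without failing pnpm check.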