docs(README): update real-app precision to 4/4 = 100% after v0.53.1 validation#270

Merged
cunninghambe merged 1 commit into
mainfrom
docs/v0.53.1-real-app-100-percent
May 16, 2026

@cunninghambe

Owner

Summary

Spoonworks v0.53.1 validation (run `si4uxdehzj5v6od6027pkt3o`, 2026-05-16) emitted 4 clusters, all real `vulnerable_dependency_high` findings confirmed against `npm audit`. The `missing_state_change` false positive that lingered in v0.52 cleared after PR #269 restored the `execute.ts` mutation-count threading that PR #268's squash merge silently dropped.

Trajectory now

| run | clusters | precision |
| --- | --- | --- |
| 2026-05-11 baseline | 77 | 6/77 = 7.8 % |
| 2026-05-14 v0.51 | 25 | ~16 % |
| 2026-05-14 v0.52 | 5 | 4/5 = 80 % |
| 2026-05-16 v0.53.1 | 4 | 4/4 = 100 % |
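
For anyone re-checking the arithmetic, the precision column is confirmed-real clusters divided by emitted clusters. The following is a minimal sketch of that calculation, not BugHunter code: the `precision_pct` helper and the `RUNS` list are hypothetical names introduced here, and the v0.51 row is left out because only an approximate figure (~16 %) is given for it, not a confirmed-cluster count.

```python
# Hypothetical helper for re-deriving the precision column; not part of BugHunter.
# Each entry: (run label, clusters emitted, clusters confirmed real).
# v0.51 is omitted: the PR reports only "~16 %", with no confirmed count.
RUNS = [
    ("2026-05-11 baseline", 77, 6),
    ("2026-05-14 v0.52", 5, 4),
    ("2026-05-16 v0.53.1", 4, 4),
]


def precision_pct(confirmed: int, emitted: int) -> float:
    """Return confirmed / emitted as a percentage rounded to one decimal."""
    if emitted <= 0:
        raise ValueError("emitted must be positive")
    if not 0 <= confirmed <= emitted:
        raise ValueError("confirmed must be between 0 and emitted")
    return round(100 * confirmed / emitted, 1)


if __name__ == "__main__":
    for label, emitted, confirmed in RUNS:
        pct = precision_pct(confirmed, emitted)
        print(f"{label}: {confirmed}/{emitted} = {pct} %")
```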

Caveat (kept in the README)

The v0.53.1 run hit a camofox cache issue mid-execution (`~/.cache/camoufox/version.json` was missing → `camoufox fetch` required) and stopped at `max_infra_failures` with 3047/3338 UI tests completed. Precision is over what completed; the four real-bug clusters were captured before the env issue triggered.

This is environment state, not a BugHunter issue. Future runs from a clean camofox cache should complete the full test plan and confirm the number sticks.
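
To put the caveat in numbers, the completed share of the UI test plan can be derived from the two counts quoted above. This is a small illustrative sketch only; `completion_summary` is a hypothetical helper, not something BugHunter exposes.

```python
# Hypothetical helper; the two counts come from the caveat above.
COMPLETED_UI_TESTS = 3047
PLANNED_UI_TESTS = 3338


def completion_summary(completed: int, planned: int) -> tuple[int, float]:
    """Return (tests not run, percent of plan completed rounded to one decimal)."""
    if planned <= 0 or not 0 <= completed <= planned:
        raise ValueError("expected 0 <= completed <= planned and planned > 0")
    not_run = planned - completed
    pct_done = round(100 * completed / planned, 1)
    return not_run, pct_done


if __name__ == "__main__":
    not_run, pct_done = completion_summary(COMPLETED_UI_TESTS, PLANNED_UI_TESTS)
    print(f"{pct_done} % of the plan completed, {not_run} UI tests not run")
```

So the 4/4 figure covers roughly nine tenths of the planned UI tests; the remaining tests are what a clean-cache rerun would add.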

🤖 Generated with Claude Code

Spoonworks v0.53.1 validation (run si4uxdehzj5v6od6027pkt3o) emitted 4
clusters, all real `vulnerable_dependency_high`. The missing_state_change
FP that lingered in v0.52 cleared after PR #269 restored the execute.ts
mutation-count threading dropped by PR #268's squash merge.

Honest framing: includes the camofox cache failure caveat (3047/3338
tests completed). The 4 real-bug clusters were captured before the env
issue triggered `max_infra_failures`.
@github-actions


BugHunter Calibration | | 2026-05-16

Overall: tp=0 fp=0 fn=0 precision=1 recall=1 f1=0

BugKind Precision Recall F1 Status

@cunninghambe cunninghambe merged commit 4e32813 into main May 16, 2026
2 checks passed