Harden token savings advisory reports by ictechgy · Pull Request #200 · ictechgy/context-guard

ictechgy · 2026-06-14T12:35:30Z

Summary

Add cache-score amortization advisory fields using explicit user-supplied cache write/read multipliers only.
Extend tool-prune defer-report with gross/net deferred-schema char/4 proxy accounting and retrieval-required boundaries.
Add a benchmark report measurement-baseline contract that documents captured fields, claim-eligible fields, proxy-only fields, and future run-identity gaps without changing CSV schema.
Refresh README/plugin/kit docs and changelog while preserving no-provider-call/no-hosted-savings-claim boundaries.

Validation

PYTHONDONTWRITEBYTECODE=1 python3 -m py_compile context-guard-kit/cache_score.py context-guard-kit/tool_schema_pruner.py context-guard-kit/benchmark_runner.py tests/test_context_guard_kit.py
PYTHONDONTWRITEBYTECODE=1 python3 -m unittest tests.test_context_guard_kit.ClaudeTokenKitTests -k cache_score
PYTHONDONTWRITEBYTECODE=1 python3 -m unittest tests.test_context_guard_kit.ClaudeTokenKitTests -k tool_prune
PYTHONDONTWRITEBYTECODE=1 python3 -m unittest tests.test_context_guard_kit.BenchmarkRunnerTests -k benchmark_report
python3 scripts/sync_plugin_copies.py --check
git diff --check
PYTHONDONTWRITEBYTECODE=1 python3 scripts/prepublish_check.py --skip-tests
PYTHONDONTWRITEBYTECODE=1 python3 scripts/release_smoke.py --timeout 20
PYTHONDONTWRITEBYTECODE=1 python3 scripts/prepublish_check.py — 691 tests OK

Claim boundary

This PR is advisory/local-only. It does not add provider calls, bundled pricing defaults, native provider tool-search configuration, lossy compression, or hosted API token/cost savings claims.

ictechgy · 2026-06-14T13:44:07Z

Quad review + validation evidence

Local validation before PR / after R2 fix:

python3 scripts/sync_plugin_copies.py --check — OK
git diff --check — OK
PYTHONDONTWRITEBYTECODE=1 python3 -m py_compile context-guard-kit/cache_score.py context-guard-kit/tool_schema_pruner.py context-guard-kit/benchmark_runner.py tests/test_context_guard_kit.py — OK
PYTHONDONTWRITEBYTECODE=1 python3 -m unittest tests.test_context_guard_kit.ClaudeTokenKitTests -k cache_score — 3 tests OK
PYTHONDONTWRITEBYTECODE=1 python3 -m unittest tests.test_context_guard_kit.ClaudeTokenKitTests -k tool_prune — 14 tests OK
PYTHONDONTWRITEBYTECODE=1 python3 -m unittest tests.test_context_guard_kit.BenchmarkRunnerTests -k benchmark_report — 13 tests OK
PYTHONDONTWRITEBYTECODE=1 python3 scripts/prepublish_check.py --skip-tests — OK
PYTHONDONTWRITEBYTECODE=1 python3 scripts/release_smoke.py --timeout 20 — OK
PYTHONDONTWRITEBYTECODE=1 python3 scripts/prepublish_check.py — 691 tests OK

Quad review loop:

Codex R1: REQUEST_CHANGES, MEDIUM cache amortization risk accounting issue.
Forge R1: APPROVE with LOW note on same cache read-premium risk; accepted and fixed with the Codex MEDIUM.
Agy R1: APPROVE.
Claude R1: full-diff run produced no usable output; re-run on R2 fix diff.
R2 fix: cache-score now compares expected cached vs uncached relative cost and returns no_read_discount/high for write-cheaper/read-more-expensive negative-savings cases; measurement baseline now lists primary_tokens_measured.
Codex R2: APPROVE.
Claude R2: APPROVE.
Forge R2: APPROVE.
Agy R2: APPROVE.

PR CI:

test-and-prepublish (3.11) — pass
test-and-prepublish (3.12) — pass
test-and-prepublish (macos-latest, 3.12) — pass

No unresolved CRITICAL/HIGH blockers and no accepted unresolved MEDIUM blockers remain.

Follow-up to PR #200 final review: clarifies read-premium cache-score amortization by removing monotonic break-even semantics, adding positive-only max_profitable_reuses, and covering exact/decimal break-even plus zero-read cases. Validation: - sync_plugin_copies.py --check - py_compile changed files - cache_score unittest subset (3 tests) - prepublish_check.py --skip-tests - release_smoke.py --timeout 20 - full prepublish_check.py (691 tests) - PR CI green - final code review/architect review clear

ictechgy added 2 commits June 14, 2026 21:35

Harden token savings batch one advisories

ceddbe5

Fix cache amortization risk accounting

41994db

ictechgy merged commit adf79b7 into main Jun 14, 2026
3 checks passed

ictechgy deleted the ultragoal/token-savings-batch1-followup branch June 14, 2026 13:44

ictechgy mentioned this pull request Jun 14, 2026

Clarify cache-score read-premium amortization #201

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Harden token savings advisory reports#200

Harden token savings advisory reports#200
ictechgy merged 2 commits into
mainfrom
ultragoal/token-savings-batch1-followup

ictechgy commented Jun 14, 2026

Uh oh!

ictechgy commented Jun 14, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

ictechgy commented Jun 14, 2026

Summary

Validation

Claim boundary

Uh oh!

ictechgy commented Jun 14, 2026

Quad review + validation evidence

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant