chore(scripts): add verify-prompt-cache.mjs diagnostic #8

Merged: wusijian007 merged 1 commit into main from chore/probe-cache-script on May 14, 2026
@wusijian007 (Owner)

Standalone Node script that probes whether the configured Anthropic endpoint actually supports prompt caching. It issues three back-to-back non-streaming requests with identical system content, padded past 1024 tokens and marked with cache_control: ephemeral. It prints per-call input/output/cache token counts and a final OK/WARNING line.
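
A minimal sketch of what such a probe could look like, for readers who have not opened the script. This is not the contents of scripts/verify-prompt-cache.mjs. The function names (`buildProbeBody`, `verdict`, `runProbe`), the 4-characters-per-token padding heuristic, the `x-api-key` auth header, and the rule that decides OK versus WARNING are all assumptions made for illustration. The model ID is left as a parameter because the minimum cacheable prompt length differs between models.

```javascript
// Hypothetical sketch; names and thresholds are illustrative assumptions,
// not taken from scripts/verify-prompt-cache.mjs.

// Build one probe request body. The body is deterministic, so every call
// sends a byte-identical system prefix, which is what caching keys on.
// Assumption: ~4 characters per token, so 8000 characters should sit well
// past a 1024-token minimum.
function buildProbeBody(model, padChars = 8000) {
  const filler = "You are a careful assistant. ";
  const padded = filler.repeat(Math.ceil(padChars / filler.length));
  return {
    model,
    max_tokens: 16,
    system: [
      { type: "text", text: padded, cache_control: { type: "ephemeral" } },
    ],
    messages: [{ role: "user", content: "Reply with the single word: ok" }],
  };
}

// Assumed verdict rule: "OK" if any call reports a cache read, else "WARNING".
// A read on the first call is accepted too (the cache may already be warm).
function verdict(usages) {
  const anyRead = usages.some((u) => (u.cache_read_input_tokens ?? 0) > 0);
  return anyRead ? "OK" : "WARNING";
}

// Issue `calls` back-to-back non-streaming requests and print per-call usage.
// Requires Node 18+ for the global fetch. Some gateways expect a different
// auth header than x-api-key; adjust as needed.
async function runProbe({ baseUrl, apiKey, model, calls = 3 }) {
  const usages = [];
  for (let i = 0; i < calls; i++) {
    const res = await fetch(`${baseUrl}/v1/messages`, {
      method: "POST",
      headers: {
        "content-type": "application/json",
        "x-api-key": apiKey,
        "anthropic-version": "2023-06-01",
      },
      body: JSON.stringify(buildProbeBody(model)),
    });
    if (!res.ok) throw new Error(`HTTP ${res.status}: ${await res.text()}`);
    const { usage = {} } = await res.json();
    usages.push(usage);
    console.log(
      `call ${i + 1}: input=${usage.input_tokens ?? 0} ` +
        `output=${usage.output_tokens ?? 0} ` +
        `cache_creation=${usage.cache_creation_input_tokens ?? 0} ` +
        `cache_read=${usage.cache_read_input_tokens ?? 0}`
    );
  }
  console.log(verdict(usages));
  return usages;
}
```

Against an endpoint that honors caching, one would expect the first call to report cache creation and later calls to report cache reads. A gateway that strips cache_control would leave both fields at zero on every call.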

Surfaced during M1.5b verification:

  • The agent loop (streaming) reported all-zero usage against the claude.proai.love gateway, which made it look like our caching plumbing was broken.
  • A non-streaming probe showed that the same gateway does return real input/output token counts in non-streaming responses, but cache_creation_input_tokens and cache_read_input_tokens stay at zero: the gateway silently drops cache_control on outbound requests to Anthropic.
  • Both quirks are documented in scripts/README.md so the next person to hit "myagent usage shows 0s" knows where to look.
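
The two quirks above can be told apart mechanically. In the Anthropic streaming format, usage arrives in the `message_start` event (under `message.usage`) and in `message_delta` events (under `usage`), so a gateway that strips those fields produces all-zero totals even when the request itself succeeded. The sketch below is a hypothetical illustration, not code from this PR. `accumulateStreamUsage` and `diagnose` are invented names, and the finding labels are made up for the example.

```javascript
// Hypothetical helpers; not part of this PR.

const USAGE_KEYS = [
  "input_tokens",
  "output_tokens",
  "cache_creation_input_tokens",
  "cache_read_input_tokens",
];

// Fold usage out of already-parsed streaming events. Counts in message_delta
// are cumulative, so keep the largest value seen for each field.
function accumulateStreamUsage(events) {
  const total = Object.fromEntries(USAGE_KEYS.map((k) => [k, 0]));
  for (const ev of events) {
    const u =
      ev.type === "message_start" ? ev.message?.usage
      : ev.type === "message_delta" ? ev.usage
      : undefined;
    if (!u) continue;
    for (const key of USAGE_KEYS) {
      if (typeof u[key] === "number") total[key] = Math.max(total[key], u[key]);
    }
  }
  return total;
}

// Compare usage from a streaming call and a non-streaming call that both sent
// the same cacheable prefix. Returns a list of assumed finding labels.
function diagnose(streamUsage, nonStreamUsage) {
  const allZero = (u) => USAGE_KEYS.every((k) => !(u[k] > 0));
  const findings = [];
  if (allZero(streamUsage) && !allZero(nonStreamUsage)) {
    findings.push("streaming-usage-missing");
  }
  const cacheZero =
    !(nonStreamUsage.cache_creation_input_tokens > 0) &&
    !(nonStreamUsage.cache_read_input_tokens > 0);
  if (nonStreamUsage.input_tokens > 0 && cacheZero) {
    findings.push("cache-fields-zero");
  }
  return findings;
}
```

The `cache-fields-zero` finding is only meaningful when the request carried a cache_control marker on a prefix long enough to be cacheable. An ordinary short request reports zero cache tokens on any endpoint.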

Use:
node scripts/verify-prompt-cache.mjs

Not added to npm test because it makes real API calls.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
@wusijian007 merged commit 1137e86 into main on May 14, 2026
3 checks passed
@wusijian007 deleted the chore/probe-cache-script branch on May 14, 2026 09:00