fix: classify Cloudflare AI errors with error codes and retry guidance by whoabuddy · Pull Request #63 · aibtcdev/x402-api

whoabuddy · 2026-03-03T00:05:43Z

Summary

Replace generic "Chat completion failed" 500 with classified errors that distinguish timeout, rate limit, model not found, and internal errors
Include error_code, retryable, and retry_after_seconds in error responses so agent consumers know whether to retry

Context

Production log analysis (Feb 28 - Mar 2) identified that all Cloudflare AI failures returned a generic 500 with no error classification, making it impossible for AI agent consumers to distinguish transient vs permanent failures.

Closes #61

Changes

chat.ts: Added a classifyCloudflareAIError() helper that inspects the error message and name and categorizes the failure as TIMEOUT (504), RATE_LIMIT (429), MODEL_NOT_FOUND (404), or INTERNAL_ERROR (502). The catch block now calls the classifier and passes error_code, retryable, and the optional retry_after_seconds via the existing extra parameter on errorResponse().

Test plan

Verify TypeScript compiles cleanly (npm run check)
Trigger timeout error — confirm 504 with error_code: "TIMEOUT", retryable: true
Trigger rate limit — confirm 429 with error_code: "RATE_LIMIT", retryable: true, retry_after_seconds: 30
Trigger unknown error — confirm 502 with error_code: "INTERNAL_ERROR", retryable: false
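
For agent consumers, the fields checked in the test plan above could drive a retry decision along these lines. This is a minimal client-side sketch, not code from this PR: the `ClassifiedErrorBody` type, the `retryDelayMs` helper, the attempt cap, and the exponential fallback are all hypothetical, and it assumes the three fields appear at the top level of the JSON error body.

```typescript
// Hypothetical view of the error body; only these three field names come from the PR.
interface ClassifiedErrorBody {
  error_code?: string;
  retryable?: boolean;
  retry_after_seconds?: number;
}

// Returns how long to wait (in ms) before the next attempt, or null to give up.
// `attempt` is 1-based; the cap of 3 attempts is an assumption for illustration.
function retryDelayMs(
  body: ClassifiedErrorBody,
  attempt: number,
  maxAttempts = 3,
): number | null {
  if (!body.retryable || attempt >= maxAttempts) {
    return null;
  }
  // Prefer the server's hint; otherwise fall back to exponential backoff (assumed policy).
  const seconds = body.retry_after_seconds ?? 2 ** attempt;
  return seconds * 1000;
}
```

A caller would parse the JSON body of a non-2xx response, pass it to `retryDelayMs`, and sleep for the returned duration before retrying, or surface the error when the result is `null`.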

🤖 Generated with Claude Code

…try guidance

Replace generic "Chat completion failed" 500 error with classified error responses that give AI agent consumers actionable information:

- TIMEOUT (504): AbortError, "Request timed out", or error code 3046
  retryable: true, retry_after_seconds: 30
- RATE_LIMIT (429): "Rate limit exceeded" or 429 in message
  retryable: true, retry_after_seconds: 60
- MODEL_NOT_FOUND (404): "Model not found" or 404 in message
  retryable: false
- INTERNAL_ERROR (502): all other upstream failures
  retryable: false

Each error response now includes error_code and retryable fields (and optionally retry_after_seconds) via the existing errorResponse() extra parameter. Log entries also include error_code and status for observability.

Resolves: #61

Co-Authored-By: Claude <noreply@anthropic.com>
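
The classification rules in the commit message above can be sketched roughly as follows. This is an illustration under stated assumptions, not the code merged in chat.ts: the PR names `classifyCloudflareAIError()` and `CloudflareAIErrorClassification` but does not show them, so the field layout, the case-insensitive matching, and the order of the checks here are guesses.

```typescript
// Assumed field layout; the PR only names the interface.
interface CloudflareAIErrorClassification {
  status: 404 | 429 | 502 | 504;
  error_code: "TIMEOUT" | "RATE_LIMIT" | "MODEL_NOT_FOUND" | "INTERNAL_ERROR";
  retryable: boolean;
  retry_after_seconds?: number;
}

function classifyCloudflareAIError(err: unknown): CloudflareAIErrorClassification {
  // Non-Error values are stringified so the message checks still apply.
  const name = err instanceof Error ? err.name : "";
  const message = err instanceof Error ? err.message : String(err);

  // Timeout: AbortError, "Request timed out", or error code 3046.
  if (name === "AbortError" || /request timed out/i.test(message) || message.includes("3046")) {
    return { status: 504, error_code: "TIMEOUT", retryable: true, retry_after_seconds: 30 };
  }
  // Rate limit: "Rate limit exceeded" or 429 in the message.
  if (/rate limit exceeded/i.test(message) || message.includes("429")) {
    return { status: 429, error_code: "RATE_LIMIT", retryable: true, retry_after_seconds: 60 };
  }
  // Model not found: "Model not found" or 404 in the message.
  if (/model not found/i.test(message) || message.includes("404")) {
    return { status: 404, error_code: "MODEL_NOT_FOUND", retryable: false };
  }
  // Everything else is treated as a non-retryable upstream failure.
  return { status: 502, error_code: "INTERNAL_ERROR", retryable: false };
}
```

Matching on substrings such as "429" or "404" is inherently loose, since an unrelated message containing those digits would be classified the same way, so a stricter pattern may be preferable if the upstream error format is known.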

cloudflare-workers-and-pages · 2026-03-03T00:05:48Z

Deploying with Cloudflare Workers

The latest updates on your project. Learn more about integrating Git with Workers.

Status	Name	Latest Commit	Updated (UTC)
✅ Deployment successful! View logs	x402-api-staging	`1b3edb4`	Mar 03 2026, 12:05 AM

cloudflare-workers-and-pages · 2026-03-03T00:05:48Z

Deploying with Cloudflare Workers

The latest updates on your project. Learn more about integrating Git with Workers.

Status	Name	Latest Commit	Updated (UTC)
✅ Deployment successful! View logs	x402-api-production	`1b3edb4`	Mar 03 2026, 12:05 AM

Copilot

Pull request overview

This PR addresses a production issue (issue #61) where all Cloudflare AI chat errors returned a generic 500 "Chat completion failed", making it impossible for AI agent consumers to distinguish transient from permanent failures. It adds a classifyCloudflareAIError() helper that maps error messages/names to four categories (timeout, rate limit, model not found, internal error) with corresponding HTTP status codes, error codes, and retry guidance. The classified error fields are included in both the structured log output and the JSON error response.

Changes:

Added CloudflareAIErrorClassification interface and classifyCloudflareAIError() function that uses message/name inspection to categorize errors
Catch block now logs the classified error code and status, and returns a classified error response with error_code, retryable, and optional retry_after_seconds fields
OpenAPI schema updated to document the four new error responses (404, 429, 502, 504)


Copilot · 2026-03-03T00:09:25Z