Approx usage for interrupted streaming requests by KeremTurgutlu · Pull Request #55 · AnswerDotAI/fastllm

KeremTurgutlu · 2026-06-19T17:04:32Z

Streaming wrappers now build an interrupted Completion when a stream is cancelled or closed before provider usage is returned. The wrapper estimates prompt/output tokens, assumes 80% of input tokens were cached, normalizes the synthetic usage through the provider’s existing norm_usage, and tracks it with the normal AsyncChat usage accounting.
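As a rough illustration of the estimate described above, here is a minimal sketch. It reuses the character-count heuristic from the diff in this PR; the `ApproxUsage` container, the `approx_usage` name, and the `cached_frac` parameter are assumptions made for this example and are not fastllm's actual types or signatures.

```python
from dataclasses import dataclass

def approx_text_tokens(s):
    # Character-count heuristic taken from the diff in this PR
    return (len(s or '') + 2) // 3

@dataclass
class ApproxUsage:
    # Hypothetical container for illustration; not fastllm's usage type
    input_tokens: int
    cached_tokens: int
    output_tokens: int

def approx_usage(prompt, partial_output, cached_frac=0.8):
    # Estimate tokens for the prompt and for whatever output streamed before the interrupt
    inp = approx_text_tokens(prompt)
    out = approx_text_tokens(partial_output)
    # Assumption stated in the PR description: 80% of input tokens were cached
    cached = round(inp * cached_frac)
    return ApproxUsage(inp, cached, out)
```

A cancellation before the first streamed token corresponds to an empty `partial_output`, which this sketch counts as zero output tokens.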

Providers can now register approx_raw_usage hooks, so approximate usage keeps the same provider-shaped raw usage and cost path as real responses. This adds hooks for OpenAI Responses, OpenAI Chat, Anthropic, and Gemini.
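One way such hooks could be organised is a small registry keyed by API name, sketched below. The registry, the decorator, the `'openai_chat'` and `'anthropic'` keys, and the `(inp, cached, out)` hook signature are all hypothetical; only the usage field names are meant to mirror the providers' public usage payloads, and they should be checked against the current provider docs.

```python
approx_raw_usage_hooks = {}  # hypothetical registry: api name -> hook

def register_approx_raw_usage(api_name):
    # Decorator that stores a hook under the given api name
    def deco(f):
        approx_raw_usage_hooks[api_name] = f
        return f
    return deco

@register_approx_raw_usage('openai_chat')
def _openai_chat_usage(inp, cached, out):
    # Chat Completions style usage: cached tokens are a subset of prompt_tokens
    return {'prompt_tokens': inp, 'completion_tokens': out, 'total_tokens': inp + out,
            'prompt_tokens_details': {'cached_tokens': cached}}

@register_approx_raw_usage('anthropic')
def _anthropic_usage(inp, cached, out):
    # Messages style usage: cache reads are reported separately from uncached input_tokens
    return {'input_tokens': inp - cached, 'cache_read_input_tokens': cached,
            'output_tokens': out}

def approx_raw_usage(api_name, inp, cached, out):
    # Build a provider-shaped usage dict that an existing normalizer could consume
    return approx_raw_usage_hooks[api_name](inp, cached, out)
```

Because the synthetic usage has the same shape as a real response, it can go through the same normalization and cost path, which is the point made in the description above.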

This lets callers such as Solveit show and log approximate token usage/cost for interrupted prompts, including cancellations before the first streamed token.

interrupted_usage_half.mov

KeremTurgutlu · 2026-06-19T17:10:43Z

+def approx_text_tokens(s): return (len(s or '') + 2)//3
+
+def approx_obj_tokens(o):
+    try: s = json.dumps(obj2dict(o), ensure_ascii=False, default=str)
+    except Exception: s = str(o)
+    return approx_text_tokens(s)


@jph00 Should we instead use the tiktoken based estimator from solveit?

Probably overkill IMO

Why json.dumps instead of str here btw?

KeremTurgutlu · 2026-06-19T17:15:10Z

+                                api_name=api_name,
+                                vendor_name=vendor_name,
+                                usage=usage)
+        chat._track(self.value)


A cancelled request exits AsyncChat._call while yielding chunks (async for chunk in res: yield chunk # exits here) and never reaches the rest of the code:

# AsyncChat._call()
...
if stream:
    if self.prefill: yield _mk_prefill(self.prefill)
    res = astream_with_complete(self, res, postproc=postproc)
    async for chunk in res: yield chunk # exits here
    res = res.value

So we manually call chat._track(self.value) here to set c.use for the interrupted request.
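The behaviour can be reproduced outside fastllm with a toy async generator. In this sketch `fake_stream`, `stream_with_tracking`, and the `tracked` list are hypothetical stand-ins for the provider stream, the streaming wrapper, and `chat._track`; it only shows that code after the `async for` loop is skipped on early close, so the tracking has to happen in the `except` branch.

```python
import asyncio

async def fake_stream():
    # Stand-in for a provider stream
    for tok in ['Hel', 'lo', ' wor', 'ld']:
        await asyncio.sleep(0)
        yield tok

async def stream_with_tracking(src, tracked):
    parts = []
    try:
        async for chunk in src:
            parts.append(chunk)
            yield chunk  # an early close or cancellation is raised here
        # Reached only when the stream is fully consumed
        tracked.append({'text': ''.join(parts), 'finish_reason': 'stop'})
    except (GeneratorExit, asyncio.CancelledError):
        # Record what was streamed so far, then let the exception propagate
        tracked.append({'text': ''.join(parts), 'finish_reason': 'interrupted'})
        raise

async def demo(interrupt):
    tracked = []
    gen = stream_with_tracking(fake_stream(), tracked)
    async for chunk in gen:
        if interrupt: break
    await gen.aclose()  # closing the generator raises GeneratorExit at the yield
    return tracked
```

Running `asyncio.run(demo(True))` records a single entry with `finish_reason` set to `'interrupted'`, while `asyncio.run(demo(False))` records a normal `'stop'` entry.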

KeremTurgutlu · 2026-06-19T17:19:12Z

-def mk_client(model=None, vendor_name=None, api_name=None, api_key=None, base_url=None, xtra_hdrs=None,
-    timeout=httpx.Timeout(connect=30, read=300, write=30, pool=10)):
+# %% ../nbs/06_acomplete.ipynb #c714601e
+def resolve_api_vendor(model=None, vendor_name=None, api_name=None, api_key=None, base_url=None):


Factored this out to be able to use it during interrupted Completion construction.

KeremTurgutlu · 2026-06-19T17:20:55Z

+                yield postproc(chunk)
+        self.value = chunk
+    except (GeneratorExit, asyncio.CancelledError):
+        api_name,vendor_name,*_ = resolve_api_vendor(chat.model, chat.vendor_name, chat.api_name, chat.api_key, chat.base_url)


api_name and vendor_name are inferred in acomplete inside mk_client and not stored in AsyncChat, so we resolve them here using the new helper.

jph00 · 2026-06-29T01:48:52Z

+FinishReason = str_enum('finish_reason', 'stop', 'tool_calls', 'length', 'content_filter', 'interrupted')
+
+# %% ../nbs/00_types.ipynb #c5a88e6f
+def approx_text_tokens(s): return (len(s or '') + 2)//3


Shouldn't this be len((s or '').split()... ?

And then *3/2 ?

This is some heuristic AI came up with, I couldn't find our pre-tiktoken estimator in the git history. If you have that available that would be awesome. Was it something like:

def str_tokens(s): return int(len(s)/3.4) + 1

from https://github.com/AnswerDotAI/solveit/blob/b3d4b09dbef1f6a7437ca1c79a81d796f9ac50ed/00_db.ipynb ?

KeremTurgutlu · 2026-06-29T14:55:30Z

@jph00 I've simplified the token approx logic down to approx_str_tokens (from solveit history) which works both for objects like chat.turn_msgs and strings like chat.turn_sysp:

def approx_str_tokens(o): return int(len(str(o))/3.4) + 1
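For comparison, the two estimators discussed in this thread can be put side by side. Both definitions are copied from the snippets above; the `compare` helper is only for this example.

```python
def approx_text_tokens(s): return (len(s or '') + 2)//3       # first version in this PR
def approx_str_tokens(o): return int(len(str(o))/3.4) + 1     # simplified version

def compare(o):
    # The first version expects a string (or None); the second stringifies any object
    first = approx_text_tokens(o) if isinstance(o, (str, type(None))) else None
    return first, approx_str_tokens(o)

print(compare('a' * 20))
print(compare(''))
print(compare([{'role': 'user', 'content': 'hi'}]))
```

Two differences are worth noting: the simplified version never returns 0, even for an empty string, and it accepts message lists directly because it calls `str` on its argument.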

KeremTurgutlu self-assigned this Jun 19, 2026

KeremTurgutlu added the enhancement New feature or request label Jun 19, 2026

KeremTurgutlu marked this pull request as draft June 19, 2026 17:04

KeremTurgutlu commented Jun 19, 2026


KeremTurgutlu marked this pull request as ready for review June 27, 2026 10:13

AnswerDotAI deleted a comment from KeremTurgutlu Jun 29, 2026

jph00 reviewed Jun 29, 2026


approx usage for interrupted

6ca958e

KeremTurgutlu force-pushed the interrupted-usage branch from a65b4c5 to 6ca958e Compare June 29, 2026 14:52

KeremTurgutlu requested a review from jph00 June 29, 2026 14:56

KeremTurgutlu wants to merge 1 commit into main from interrupted-usage
