feat: improve Antigravity token extraction and sync metrics#661
Conversation
roborev: Combined Review (
|
a3f544c to
adc0b8e
Compare
roborev: Combined Review (
|
ec97a0c to
794a891
Compare
|
Thanks for digging into this area — the Token block semantics: what real archives showThe PR changes I cross-checked 15 real CLI sessions (June 2–11, spanning several agy releases) that have both a
Example: block Two corollaries:
Happy to share the verification test (~150 lines against the Smaller points
Since the metric is independent of the parser questions, splitting |
794a891 to
1bc395e
Compare
|
Thanks I'm looking into it. agree can separate the sidecarRejected piece into a separate PR. |
roborev: Combined Review (
|
1bc395e to
a5d67dd
Compare
roborev: Combined Review (
|
a5d67dd to
54d2c5a
Compare
- Limit tool matching to a known whitelist to prevent false positives. - Preserve tool-only steps instead of dropping them. - Correct protobuf token field mapping based on sidecar verification. - Update CLI usage pricing and tests accordingly.
54d2c5a to
29170a8
Compare
roborev: Combined Review (
|
|
Re-ran the sidecar cross-validation harness against The tool-call allowlist, pre-guard extraction for tool-only steps, intervening-tool checks on the ID/JSON windows, dataVersion 41, and the cached>0 end-to-end tests all look right to me as well. Thanks for the quick and thorough turnaround. On roborev's remaining note (f2 required, rejecting a fully-cache-served generation where uncached input is 0): I'd treat that as an accepted tradeoff rather than a change. Every real turn carries at least the new user message uncached — all 645 observed blocks have f2 present — and with f5 now optional, f2 is the anchor that keeps decoy blocks (latency counters etc.) from passing the plausibility check. Loosening it would trade a theoretical gap for a real false-positive surface. Might be worth a one-line comment in Thanks, Vikram! |
|
Thank you for your help |
Summary
Changed
internal/parser/antigravity.go: Enhanced token extraction logic and tool-call analysisinternal/parser/antigravity_test.go: 333 lines of new/improved test casesinternal/sync/engine.go/progress.go: Added sync metric hooks