fix(pipeline): deprecate hits/misses grader format, use assertions natively by christso · Pull Request #858 · EntityProcess/agentv

christso · 2026-03-30T00:04:51Z

Fixes grader output handling in pipeline grade/bench to support the deprecated hits/misses format while graders transition to emitting assertions natively.

grade.ts: adds TODO comment marking the hits/misses fallback for future removal
bench.ts: reads LLM grader results from disk to avoid context-window loss across batches
SKILL.md: documents write-to-disk approach for LLM grader subagents

Note: @agentv/studio has pre-existing build errors unrelated to this change.

…tively

cloudflare-workers-and-pages · 2026-03-30T00:05:42Z

Deploying agentv with Cloudflare Pages

Latest commit:	`c6feef1`
Status:	✅ Deploy successful!
Preview URL:	https://3d55a48a.agentv.pages.dev
Branch Preview URL:	https://fix-deprecate-hits-misses-gr.agentv.pages.dev

View logs

…k only bench now reads LLM grader results exclusively from llm_grader_results/<name>.json per test. Removes the --llm-scores flag, stdin reading, and readStdin() — simplifying the interface to a single positional arg. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

fix(pipeline): deprecate hits/misses grader format, use assertions na…

0de813e

…tively

christso and others added 2 commits March 30, 2026 01:03

style(pipeline): fix biome formatting on type annotation

c6feef1

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

christso merged commit 290b13d into main Mar 30, 2026
2 checks passed

christso deleted the fix/deprecate-hits-misses-grader-fallback branch March 30, 2026 01:08

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(pipeline): deprecate hits/misses grader format, use assertions natively#858

fix(pipeline): deprecate hits/misses grader format, use assertions natively#858
christso merged 3 commits intomainfrom
fix/deprecate-hits-misses-grader-fallback

christso commented Mar 30, 2026

Uh oh!

cloudflare-workers-and-pages bot commented Mar 30, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

christso commented Mar 30, 2026

Uh oh!

cloudflare-workers-and-pages bot commented Mar 30, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Deploying agentv with Cloudflare Pages

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

cloudflare-workers-and-pages bot commented Mar 30, 2026 •

edited

Loading