Scale skill evals: compare mode, per-prompt budgets, efficiency grader (uv + shared lib)#44
Open
Bwvolleyball wants to merge 57 commits into
Open
Scale skill evals: compare mode, per-prompt budgets, efficiency grader (uv + shared lib)#44Bwvolleyball wants to merge 57 commits into
Bwvolleyball wants to merge 57 commits into
Commits
Commits on May 29, 2026
- andcommitted
- andcommitted
- andcommitted
- andcommitted
- andcommitted
- andcommitted
- andcommitted
- andcommitted
- andcommitted
- andcommitted
- andcommitted
- andcommitted
- andcommitted
- andcommitted
- andcommitted
- andcommitted
- andcommitted
- andcommitted
- andcommitted
- andcommitted
- andcommitted
- andcommitted
- andcommitted
- andcommitted
- andcommitted
- andcommitted
- andcommitted
- andcommitted
- andcommitted
- andcommitted
- andcommitted
- andcommitted
- andcommitted
- andcommitted
Commits on Jun 1, 2026
- andcommitted
- andcommitted
- andcommitted
- andcommitted
- andcommitted
- andcommitted
- andcommitted
- andcommitted
- andcommitted
- andcommitted
- andcommitted
- andcommitted
- andcommitted
- andcommitted
- andcommitted
Commits on Jun 2, 2026
- committed
- andcommitted
- andcommitted
- andcommitted
- andcommitted
- andcommitted
- andcommitted
- andcommitted