docs(research): add Agent Bench Lab evaluation handoff#5
Conversation
There was a problem hiding this comment.
💡 Codex Review
Here are some automated review suggestions for this pull request.
Reviewed commit: 70f5b018d5
ℹ️ About Codex in GitHub
Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you
- Open a pull request for review
- Mark a draft as ready
- Comment "@codex review".
If Codex has suggestions, it will comment; otherwise it will react with 👍.
Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".
| "agent_bench_lab_fit": "", | ||
| "agent_bench_lab_eval_handoff": "", | ||
| "agent_bench_lab_blockers": "", |
There was a problem hiding this comment.
Ensure the collector emits the new handoff fields
When run_daily.py promotes any item with score >= 85, build_experiment_candidate() still returns only the old keys (title, source_url, source_type, hypothesis, minimal_reversible_change, evaluation_plan, stop_condition, and reason_not_to_implement_immediately). With these new required template fields, automated daily JSON/Markdown reports for that scenario will violate the updated Research Radar contract and omit the Agent Bench Lab fit/blocker that the guardrails now require, so the generator should be updated alongside the template.
Useful? React with 👍 / 👎.
Summary
Issue
Test plan
python3 -m json.tool research-radar/templates/digest.json >/dev/nullgit diff --checkscripts/run-deterministic-tests.sh