research-claw

Here are 2 public repositories matching this topic...

ResearchClawBench: Evaluating AI Agents for Automated Research from Re-Discovery to New-Discovery

agent science benchmark ai end-to-end evaluation discovery openai codex claude ai-agent ai4science llm ai-scientist claude-code clawdbot openclaw auto-research research-claw

Measure AI agents’ performance with standardized tests across 314 tasks, 33 domains, and 4 difficulty levels for clear, reproducible comparison.

agent benchmark ai discovery openai benchmarks codex claude ai4science llm ai-scientist clawdbot openclaw agentic-evaluation auto-research research-claw

Add a description, image, and links to the research-claw topic page so that developers can more easily learn about it.

To associate your repository with the research-claw topic, visit your repo's landing page and select "manage topics."