ripl/manipulation_benchmark_audit

What Are We Actually Benchmarking in Robot Manipulation?

Lean static project page for the paper "What Are We Actually Benchmarking in Robot Manipulation?"

Authors: Tianchong Jiang, Xiangshan Tan, Samuel Wheeler, Luzhe Sun, Tewodros W. Ayalew, Matthew Walter.

Affiliations: TTIC, University of Chicago, Argonne National Laboratory.

Public website path: https://ripl.github.io/manipulation_benchmark_audit/

Files

  • index.html: dependency-free static project page with inline CSS.
  • figures/benchmark-reported-results-stacked-area.png: figure from the paper showing counts of reported benchmark results, used on the page.
  • figures/diagnostic-*.png: paper-result figures used in the four diagnostic cards.

Links

Scope

The page summarizes diagnostics for shortcut solvability, statistical significance, creeping overfitting, and data-source dependence across LIBERO, CALVIN, SimplerEnv, RoboCasa, and RoboTwin 2.0.
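To make the statistical-significance concern concrete, here is a minimal sketch of how uncertainty in a reported success rate could be estimated. It is not the paper's procedure: it assumes each evaluation is a set of independent pass/fail rollouts, and it uses the Wilson score interval as one common choice. The function names `wilson_interval` and `intervals_overlap` and the rollout counts are hypothetical and chosen only for illustration.

```python
import math


def wilson_interval(successes: int, trials: int, z: float = 1.96) -> tuple[float, float]:
    """Wilson score interval for a binomial success rate (z=1.96 is roughly 95%)."""
    if trials <= 0:
        raise ValueError("trials must be positive")
    if not 0 <= successes <= trials:
        raise ValueError("successes must lie between 0 and trials")
    p = successes / trials
    denom = 1 + z * z / trials
    center = (p + z * z / (2 * trials)) / denom
    half = z * math.sqrt(p * (1 - p) / trials + z * z / (4 * trials * trials)) / denom
    # Clamp to [0, 1] to guard against floating-point drift at the extremes.
    return max(0.0, center - half), min(1.0, center + half)


def intervals_overlap(a: tuple[float, float], b: tuple[float, float]) -> bool:
    """Crude check only: overlapping intervals do not prove the rates are equal."""
    return a[0] <= b[1] and b[0] <= a[1]


# Hypothetical numbers: two policies evaluated on 50 rollouts each.
policy_a = wilson_interval(45, 50)  # 90% observed success
policy_b = wilson_interval(47, 50)  # 94% observed success
print(policy_a, policy_b, intervals_overlap(policy_a, policy_b))
```

With only 50 rollouts per policy, the two intervals in this example overlap substantially, so a four-point gap in observed success rate would not by itself separate the policies under these assumptions.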

About

Project page for the manipulation benchmark audit paper.
