PPO vs. SAC benchmarking for Unitree H1 humanoid standing in MuJoCo. PPO converges 7x faster; both achieve stable 20-second episodes after reward engineering fixes.
-
Updated
Mar 24, 2026 - Python
PPO vs. SAC benchmarking for Unitree H1 humanoid standing in MuJoCo. PPO converges 7x faster; both achieve stable 20-second episodes after reward engineering fixes.
Add a description, image, and links to the unitree-h1 topic page so that developers can more easily learn about it.
To associate your repository with the unitree-h1 topic, visit your repo's landing page and select "manage topics."