Reinforcement Learning on a rust simulation that only gives delayed rewards.
-
Updated
Jan 25, 2026 - Python
Reinforcement Learning on a rust simulation that only gives delayed rewards.
Sample implementation of gridworld problem in pygame with dynamic obstacles using Q-learning and PPO algorithm
Add a description, image, and links to the skrl topic page so that developers can more easily learn about it.
To associate your repository with the skrl topic, visit your repo's landing page and select "manage topics."