This repository contains practical projects in reinforcement learning. Key concepts like Multi-Armed Bandits, FMDPs, and both Model-Based and Model-Free algorithms (DP, MC, TD) are implemented. The goal is to gain hands-on experience and a deeper understanding of various RL methods.
-
- Multi-Armed bandit (MAB)
-
- Finite markov decision processes (FMDP)
-
- Dynamic programming (DP) (Model-Based)
-
- Monte carlo (MC) (Model-Free)
-
- Temporal difference (TD) (Model-Free)