Reinforcement learning (RL)

A Markov Decision Process (MDP) with unknown dynamics, i.e. unknown state transition and reward functions, is a reinforcement learning problem. Learning through trial and error and the concept of delayed rewards are important features of RL problems.
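
To make "unknown dynamics", "trial and error" and "delayed rewards" concrete, here is a minimal sketch. Everything in it is an assumption made for illustration: the 5-state `ChainEnv`, its `reset`/`step` interface, the hyperparameters, and the choice of tabular Q-learning as the learner (one possible method among many). The agent never reads the transition or reward rules; it only observes the samples returned by `step`, and the single reward arrives at the far end of the chain.

```python
import random


class ChainEnv:
    """Hypothetical 5-state chain. The learner only sees what step() returns."""

    def __init__(self, n_states=5):
        self.n_states = n_states
        self.state = 0

    def reset(self):
        self.state = 0
        return self.state

    def step(self, action):
        # Action 1 moves right, action 0 moves left (clipped at state 0).
        move = 1 if action == 1 else -1
        self.state = max(0, self.state + move)
        done = self.state == self.n_states - 1
        # Delayed reward: nothing until the last state is reached.
        reward = 1.0 if done else 0.0
        return self.state, reward, done


def q_learning(env, episodes=500, alpha=0.1, gamma=0.9,
               epsilon=0.1, max_steps=100, seed=0):
    """Tabular Q-learning with epsilon-greedy exploration (illustrative only)."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(env.n_states)]
    for _ in range(episodes):
        s = env.reset()
        for _ in range(max_steps):
            if rng.random() < epsilon:
                a = rng.randrange(2)  # explore: try a random action
            else:
                best = max(q[s])      # exploit, breaking ties at random
                a = rng.choice([i for i in range(2) if q[s][i] == best])
            s_next, r, done = env.step(a)
            # Bootstrapped target built from the sampled transition only.
            target = r if done else r + gamma * max(q[s_next])
            q[s][a] += alpha * (target - q[s][a])
            s = s_next
            if done:
                break
    return q


q = q_learning(ChainEnv())
# Greedy action in each non-terminal state.
policy = [max(range(2), key=lambda a: q[s][a]) for s in range(4)]
```

After training, `policy` holds the greedy action for each non-terminal state, and the learned values shrink with distance from the rewarding state because the reward is discounted by `gamma` at every step back along the chain.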

There are 2 main problems in RL:

Below are some general methods to approach RL problems:

Other taxonomies include:

Resources
