Partially Observable Markov Decision Processes (POMDPs)

A Partially Observable Markov Decision Process (POMDP) is a 7 tuple \((S,A,O,P_a,R_a,Z,\gamma)\) where

It is an MDP with hidden states, or equivalently a Hidden Markov Model (HMM) with actions.

Resources

Emacs 30.1.90 (Org mode 9.7.11)