Partially Observable Markov Decision Processes (POMDPs)

A Partially Observable Markov Decision Process (POMDP) is a 7 tuple \((S,A,O,P_a,R_a,Z,\gamma)\) where

It is an MDP with hidden states, or equivalently a Hidden Markov Model (HMM) with actions.

Resources

Emacs 29.4 (Org mode 9.6.15)