EPISODE · Jan 26, 2018 · 20 MIN
[MINI] Markov Decision Processes
from Data Skeptic · host Kyle Polich and Linh Da Tran
Formally, an MDP is defined as the tuple containing states, actions, the transition function, and the reward function. This podcast examines each of these and presents them in the context of simple examples. Despite MDPs suffering from the curse of dimensionality, they're a useful formalism and a basic concept we will expand on in future episodes.
NOW PLAYING
[MINI] Markov Decision Processes
No transcript for this episode yet
Similar Episodes
May 11, 2026 ·66m
May 11, 2026 ·67m
May 5, 2026 ·4m
May 4, 2026 ·4m