EPISODE · Feb 9, 2018 · 23 MIN
[MINI] Reinforcement Learning
from Data Skeptic · host Kyle Polich and Linh Da Tran
In many real world situations, a person/agent doesn't necessarily know their own objectives or the mechanics of the world they're interacting with. However, if the agent receives rewards which are correlated with the both their actions and the state of the world, then reinforcement learning can be used to discover behaviors that maximize the reward earned.
NOW PLAYING
[MINI] Reinforcement Learning
No transcript for this episode yet
Similar Episodes
May 11, 2026 ·66m
May 11, 2026 ·67m
May 5, 2026 ·4m
May 4, 2026 ·4m