275: Machine Learning Through Reinforcement & Contextual Bandits

from Super Data Science: ML & AI Podcast with Jon Krohn · host Jon Krohn

In this episode of the SuperDataScience Podcast, I chat with the Machine Learning Research Scientist, John Langford. You will hear about unsupervised, supervised learning and reinforcement learning, and the differences between the three. You will learn about applications of contextual bandits and reinforcement learning in general, YOLO style algorithms versus simulator algorithms, technics for avoiding local optimums. You will also learn about the balance between exploration and exploitation, learning to search and active learning. If you enjoyed this episode, check out show notes, resources, and more at www.superdatascience.com/275

Episode metadata supplied by the publisher feed · Published Jul 3, 2019

Embed this episode

Attribution link and audio player

NOW PLAYING

275: Machine Learning Through Reinforcement & Contextual Bandits

0:00 1:01:54

1×

No transcript for this episode yet

We transcribe on demand. Request one and we'll notify you when it's ready — usually under 10 minutes.

Share this episode

Similar Episodes

No similar episodes found.

Similar Podcasts

No similar podcasts found.

Frequently Asked Questions

How long is this episode of Super Data Science: ML & AI Podcast with Jon Krohn?

This episode is 1 hour and 1 minute long.

When was this Super Data Science: ML & AI Podcast with Jon Krohn episode published?

This episode was published on July 3, 2019.

Can I download this Super Data Science: ML & AI Podcast with Jon Krohn episode?

Yes. Use the download control on the episode player to save the publisher-provided media file.

URL copied to clipboard!