PodParley PodParley

What is wrong with reinforcement learning? (Ep. 82)

Episode 78 of the Data Science at Home podcast, hosted by Francesco Gadaleta, titled "What is wrong with reinforcement learning? (Ep. 82)" was published on October 15, 2019 and runs 21 minutes.

October 15, 2019 ·21m · Data Science at Home

0:00 / 0:00

Join the discussion on our Discord server   After reinforcement learning agents doing great at playing Atari video games, Alpha Go, doing financial trading, dealing with language modeling, let me tell you the real story here.In this episode I want to shine some light on reinforcement learning (RL) and the limitations that every practitioner should consider before taking certain directions. RL seems to work so well! What is wrong with it?   Are you a listener of Data Science at Home podcast? A reader of the Amethix Blog? Or did you subscribe to the Artificial Intelligence at your fingertips newsletter? In any case let’s stay in touch! https://amethix.com/survey/     References Emergence of Locomotion Behaviours in Rich Environments https://arxiv.org/abs/1707.02286 Rainbow: Combining Improvements in Deep Reinforcement Learning https://arxiv.org/abs/1710.02298 AlphaGo Zero: Starting from scratch https://deepmind.com/blog/article/alphago-zero-starting-scratch

Join the discussion on our Discord server

 

After reinforcement learning agents doing great at playing Atari video games, Alpha Go, doing financial trading, dealing with language modeling, let me tell you the real story here. In this episode I want to shine some light on reinforcement learning (RL) and the limitations that every practitioner should consider before taking certain directions. RL seems to work so well! What is wrong with it?

 

Are you a listener of Data Science at Home podcast? A reader of the Amethix Blog?  Or did you subscribe to the Artificial Intelligence at your fingertips newsletter? In any case let’s stay in touch!  https://amethix.com/survey/

 

 

References
The Analytics Engineering Podcast dbt Labs, Inc. Tristan Handy has been curating the Analytics Engineering Roundup newsletter since 2015, pulling together the internet's best data science & analytics articles.Tristan and co-host Julia Schottenstein now bring the Roundup to real life, hosting biweekly conversations with data practitioners inventing the future of analytics engineering.You can view full episode summaries and read back issues of the Roundup newsletter at https://roundup.getdbt.com.The podcast is sponsored by dbt labs, makers of the data transformation framework dbt. To reach our team, drop a note to [email protected]. Explicit STEM.queer() Vera Sativa Machine learning, data science, feminismo y queer anarquismo.Episodios cada 2 semanas. Explicit 天方烨谈 基因频道 华大基因专业团队倾情打造,基因科普娓娓道来! Explicit Explorers Wanted 5d20 Media, LLC We are an actual play podcast using the Numenera (http://numenera.com) Discovery and Destiny rules. Set one billion years in the future, we journey across the Ninth World. There have been eight worlds before this, where civilizations rose to intergalactic heights only to fall into ashes, leaving a world of strange relics behind them. Join our ragtag crew of messy adventurers as they navigate weird ruins, contend with criminal intrigue, and ignore their own better judgment... Repeatedly.See more (https://www.explorerswanted.fm/about)Become a Patron!Campaign Two: Hearts in Orbit<img src="https://files.fireside.fm/file/fireside-uploads/images/2/213fef3d-303d-4053-8ec2-96e695eef9f5/mDJd_g4e.png" alt="Three figures, from left: Ezr Explicit
URL copied to clipboard!