Tim & Heinrich — Democraticizing Reinforcement Learning Research

from Gradient Dissent: Conversations on AI

Since reinforcement learning requires hefty compute resources, it can be tough to keep up without a serious budget of your own. Find out how the team at Facebook AI Research (FAIR) is looking to increase access and level the playing field with the help of NetHack, an archaic rogue-like video game from the late 80s.Links discussed:The NetHack Learning Environment: https://ai.facebook.com/blog/nethack-learning-environment-to-advance-deep-reinforcement-learning/Reinforcement learning, intrinsic motivation: https://arxiv.org/abs/2002.12292Knowledge transfer:https://arxiv.org/abs/1910.08210Tim Rocktäschel is a Research Scientist at Facebook AI Research (FAIR) London and a Lecturer in the Department of Computer Science at University College London (UCL). At UCL, he is a member of the UCL Centre for Artificial Intelligence and the UCL Natural Language Processing group. Prior to that, he was a Postdoctoral Researcher in the Whiteson Research Lab, a Stipendiary Lecturer in Computer Science at Hertford College, and a Junior Research Fellow in Computer Science at Jesus College, at the University of Oxford.https://twitter.com/_rocktHeinrich Kuttler is an AI and machine learning researcher at Facebook AI Research (FAIR) and before that was a research engineer and team lead at DeepMind.https://twitter.com/HeinrichKuttlerhttps://www.linkedin.com/in/heinrich-kuttler/Topics covered:0:00 a lack of reproducibility in RL1:05 What is NetHack and how did the idea come to be?5:46 RL in Go vs NetHack11:04 performance of vanilla agents, what do you optimize for18:36 transferring domain knowledge, source diving22:27 human vs machines intrinsic learning28:19 ICLR paper - exploration and RL strategies35:48 the future of reinforcement learning43:18 going from supervised to reinforcement learning45:07 reproducibility in RL50:05 most underrated aspect of ML, biggest challenges?Get our podcast on these other platforms:Apple Podcasts: http://wandb.me/apple-podcastsSpotify: http://wandb.me/spotifyGoogle: http://wandb.me/google-podcastsYouTube: http://wandb.me/youtubeSoundcloud: http://wandb.me/soundcloudTune in to our bi-weekly virtual salon and listen to industry leaders and researchers in machine learning share their research:http://wandb.me/salonJoin our community of ML practitioners where we host AMA's, share interesting projects and meet other people working in Deep Learning:http://wandb.me/slackOur gallery features curated machine learning reports by researchers exploring deep learning techniques, Kagglers showcasing winning models, and industry leaders sharing best practices:https://wandb.ai/gallery

NOW PLAYING

0:00 54:09

1×

No transcript for this episode yet

We transcribe on demand. Request one and we'll notify you when it's ready — usually under 10 minutes.

Share this episode

Similar Episodes

Veteran Salesman: Tap Into Raw Emotion To seal The Deal | E46

Apr 22, 2025 ·32m

Introducing the Wealth Education Podcast

Feb 27, 2025 ·0m

Inclusive Entrepreneurship Advocate: Unlocking Wealth for Underserved Entrepreneurs| E45

Dec 6, 2024 ·34m

Mastering Money Management: Become Rich for The Price of Financial Literacy| E43

Sep 27, 2024 ·29m

Onstage Presence Expert: Overcome Stage Fright with Internal Drive | E44

Sep 20, 2024 ·57m

Mastering Money Management: Intro to Financial Literacy | E42

Aug 7, 2024 ·16m

Similar Podcasts

The Small Business Startup School – Business Notes | Financial Literacy | Retail Psychology – For Professionals & Entrepreneurs The Small Business Startup School Inc. Starting or buying a small business? While personal circumstances may vary, business patterns remain timeless. On The Small Business Startup School, we explore strategies, insights, and practical solutions to help entrepreneurs confidently navigate their journey.Hosted by Ola Williams—a retail entrepreneur, fintech founder, and financial coach with over two decades of experience—this podcast marries financial awareness and retail psychology with optimism to deliver actionable takeaways.Join us to learn, grow, and connect as we uncover the keys to business success.Let’s continue to learn together and be encouraged to keep on connecting! PodQuesting Dwight J Randolph- WolfShield Media PodQuesting: -By WolfShield Media and Dwight J RandolphJoin us on an exciting journey to master the world of fiction podcasting! At PodQuesting, we document our quest to improve and innovate, sharing valuable insights, strategies, and behind-the-scenes tips along the way. Whether you're an experienced podcaster or just starting your first show, our podcast is your go-to resource for everything podcasting.Discover practical advice, creative techniques, and lessons from our own experiences as we explore the ever-evolving podcasting landscape. Ready to level up your skills and embark on this adventure with us? Tune in and join the quest!Have questions or feedback? Reach out to us at [email protected] and visit our website:WolfShield.Media LIGHTS, CAMERA, SMILE! Creatives Club Media Lights, Camera, Smile, is a podcast for anyone with a dream to share something with the world, out of the overflow of themselves - be it their mind, their heart, their personalities, and much more. Each of us are alive in this moment in time, with an innate ability to have ideas and create various things to benefit both ourselves and the people around us for a reason, and here, you will find the encouragement, the inspiration, and the motivation to do just that. Hosted by Cicily, founder of Creatives Club, she dives into various topics surrounding creativity and business. Exploring entrepreneurship for creatives in a corporate reality, sharing tips and tricks in a media centered company, answering questions regarding what a creative actually is are just a few of the things discussed on this podcast. Be encouraged to create for yourself as Cicily gets vulnerable by pivoting the camera to herself for the first time.To submit questions for Cicily to answer, or have her address certain t Kaizen Blueprint Aldo Chandra "Kaizen" is a Japanese term for continuous improvement. This podcast provides a blueprint to learn about health, wealth, relationships and everything else in between. Through our podcast, we strive to inspire, educate, and motivate our audience to cultivate a mindset of lifelong learning, productivity, and personal development. By sharing insights, strategies, and practical tips, we aim to guide listeners on their journey towards realizing their fullest potential, fostering success, and creating lasting positive change.

Frequently Asked Questions

How long is this episode of Gradient Dissent: Conversations on AI?

This episode is 54 minutes long.

When was this Gradient Dissent: Conversations on AI episode published?

This episode was published on March 4, 2021.

What is this episode about?

Can I download this Gradient Dissent: Conversations on AI episode?

Yes, you can download this episode by clicking the download button on the episode player, or subscribe to the podcast in your preferred podcast app for automatic downloads.

URL copied to clipboard!