Leveling Up AI: Reinforcement Learning with Human Feedback (Ep. 222)

from Data Science at Home · host Francesco Gadaleta <frag>

In this episode, we dive into the not-so-secret sauce of ChatGPT, and what makes it a different model than its predecessors in the field of NLP and Large Language Models.We explore how human feedback can be used to speed up the learning process in reinforcement learning, making it more efficient and effective.Whether you're a machine learning practitioner, researcher, or simply curious about how machines learn, this episode will give you a fascinating glimpse into the world of reinforcement learning with human feedback. SponsorsThis episode is supported by How to Fix the Internet, a cool podcast from the Electronic Frontier Foundation and Bloomberg, global provider of financial news and information, including real-time and historical price data, financial data, trading news, and analyst coverage. ReferencesLearning through human feedbackhttps://www.deepmind.com/blog/learning-through-human-feedback Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedbackhttps://arxiv.org/abs/2204.05862 This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit datascienceathome.substack.com

NOW PLAYING

0:00 24:39

1×

No transcript for this episode yet

We transcribe on demand. Request one and we'll notify you when it's ready — usually under 10 minutes.

Share this episode

Similar Episodes

No similar episodes found.

Similar Podcasts

Real Construction Talk Compass Leadership Real Construction Talk is a podcast for leaders in the construction industry. The truth is that "as the leader goes, so goes the company." RCT's goal is to open dialog about what really happens on the job site and in the office to help owners and leaders grow, deal with hard situations and fix leadership problems. More info on RCT can be found at http://www.realconstructiontalk.com and is powered by Compass Leadership LLC: http://www.compassleadership.coach. Explicit Eavesdrop on Us Jessica Terzakis The honest business podcast YOU NEED IN YOUR LIFE! We talk about what it's really like to be in business: the good, the frustrating, the "am I the only one going through this?!" kind of topics. You're in the right place if you're looking for less "how to's" and more real conversations about what you're going through as an entrepreneur.Come eavesdrop on our conversations-it'll be like joining us at the kitchen table after working a full day in your business. Explicit Big Old Life: Heather Blackbird interviews people on planet earth. Heather Blackbird loves asking questions. This podcast is a learning experience. Join me, Heather Blackbird, as I talk to people about their lives. Frequency of new episodes is a little all over the place and I'm learning as I go. Big Old Life is a small way of talking about the vastness of life, one person at a time. If you are reading this or found this podcast it's probably because someone you know gave you a link to it. :) Explicit The Truth About You Ali Knight The Truth About You is the podcast that helps you peel back the layers of conditioned thinking and tune into who you really are at soul level.Hosted by Ali Knight, Intuitive Soul Coach and Empowerment Alchemist, this is the podcast for you if you are willing to question everything, release the conditioning that holds you back, and really create the life you came here to love.Expect deep questions and real insights from Ali's lived journey and client work about how it is to be human in this game we call life.Ali is an empowerment coach, truth-seeker, energy healer and changemaker who has spent 30 years working with real humans in the fields of mental health, coaching, energy healing, and spirituality. Find out more at www.aliknightcoaching.com or on instagram @aliknightcoaching Explicit

URL copied to clipboard!

Share this episode

Similar Episodes

Similar Podcasts

Age Verification