Reward engineering for better content recommendation [Netflix]
An episode of the Snacks Weekly on Data Science podcast, hosted by Pan Wu, titled "Reward engineering for better content recommendation [Netflix]" was published on December 23, 2024 and runs 11 minutes.
December 23, 2024 ·11m · Snacks Weekly on Data Science
Summary
In this episode, we will explore Netflix’s approach to content recommendation using contextual bandits and reward engineering. We will also discuss the important role of proxy reward functions and how Netflix leverages offline machine learning models to predict delayed customer feedback, enabling them to continuously improve their recommendation engine and deliver a more personalized viewing experience. For more details, you can refer to their published tech blog, linked here for your reference: https://netflixtechblog.com/recommending-for-long-term-member-satisfaction-at-netflix-ac15cada49ef
Episode Description
In this episode, we will explore Netflix’s approach to content recommendation using contextual bandits and reward engineering. We will also discuss the important role of proxy reward functions and how Netflix leverages offline machine learning models to predict delayed customer feedback, enabling them to continuously improve their recommendation engine and deliver a more personalized viewing experience.
For more details, you can refer to their published tech blog, linked here for your reference: https://netflixtechblog.com/recommending-for-long-term-member-satisfaction-at-netflix-ac15cada49ef
Similar Episodes
Jun 19, 2025 ·46m
Jun 13, 2025 ·40m
May 20, 2025 ·80m
May 13, 2025 ·74m
May 7, 2025 ·64m