PodParley PodParley

Reward engineering for better content recommendation [Netflix]

An episode of the Snacks Weekly on Data Science podcast, hosted by Pan Wu, titled "Reward engineering for better content recommendation [Netflix]" was published on December 23, 2024 and runs 11 minutes.

December 23, 2024 ·11m · Snacks Weekly on Data Science

0:00 / 0:00

In this episode, we will explore Netflix’s approach to content recommendation using contextual bandits and reward engineering. We will also discuss the important role of proxy reward functions and how Netflix leverages offline machine learning models to predict delayed customer feedback, enabling them to continuously improve their recommendation engine and deliver a more personalized viewing experience. For more details, you can refer to their published tech blog, linked here for your reference: https://netflixtechblog.com/recommending-for-long-term-member-satisfaction-at-netflix-ac15cada49ef

In this episode, we will explore Netflix’s approach to content recommendation using contextual bandits and reward engineering. We will also discuss the important role of proxy reward functions and how Netflix leverages offline machine learning models to predict delayed customer feedback, enabling them to continuously improve their recommendation engine and deliver a more personalized viewing experience.


For more details, you can refer to their published tech blog, linked here for your reference: https://netflixtechblog.com/recommending-for-long-term-member-satisfaction-at-netflix-ac15cada49ef

URL copied to clipboard!