EPISODE · Dec 11, 2024 · 12 MIN
(Voiceover) OpenAI's Reinforcement Finetuning and RL for the masses
from Interconnects · host Nathan Lambert
Original post: https://www.interconnects.ai/p/openais-reinforcement-finetuningChapters00:00 Introduction04:19 The impact of reinforcement finetuning’s existence07:29 Hypotheses on reinforcement finetuning’s implementationFiguresFig. 1, Yann’s CakeFig. 2, Grader configFig. 3, RLVR learning curves This is a public episode. If you'd like to discuss this with other subscribers or get access to bonus episodes, visit www.interconnects.ai/subscribe
NOW PLAYING
(Voiceover) OpenAI's Reinforcement Finetuning and RL for the masses
No transcript for this episode yet
Similar Episodes
May 20, 2026 ·8m
May 12, 2026 ·4m
Apr 28, 2026 ·7m
Apr 22, 2026 ·8m