EPISODE · Mar 13, 2025 · 45 MIN
The Evolution of Reinforcement Fine-Tuning in AI
from The Data Exchange with Ben Lorica · host Ben Lorica
Travis Addair is Co-Founder & CTO at Predibase. In this episode, the discussion centers on transforming pre-trained foundation models into domain-specific assets through advanced customization techniques.Subscribe to the Gradient Flow Newsletter 📩 https://gradientflow.substack.com/Support our work by leaving a small tip 💰 https://buymeacoffee.com/gradientflowSubscribe: Apple · Spotify · Overcast · Pocket Casts · AntennaPod · Podcast Addict · Amazon · RSS.Detailed show notes - with links to many references - can be found on The Data Exchange web site.
What this episode covers
Travis Addair is Co-Founder & CTO at Predibase. In this episode, the discussion centers on transforming pre-trained foundation models into domain-specific assets through advanced customization techniques. Subscribe to the Gradient Flow Newsletter 📩 https://gradientflow.substack.com/ Support our work by leaving a small tip 💰 https://buymeacoffee.com/gradientflow Subscribe: Apple · Spotify · Overcast · Pocket Casts · AntennaPod · Podcast Addict · Amazon · RSS. Detailed show no...
NOW PLAYING
The Evolution of Reinforcement Fine-Tuning in AI
No transcript for this episode yet
Similar Episodes
Mar 26, 2026 ·1m
Mar 19, 2026 ·34m
Feb 18, 2026 ·11m
Feb 11, 2026 ·45m