EPISODE · Jun 13, 2024 · 35 MIN
Fine-tuning and Preference Alignment in a Single Streamlined Process
from The Data Exchange with Ben Lorica · host Ben Lorica
Jiwoo Hong and Noah Lee of KAIST AI are co-authors of ORPO: Monolithic Preference Optimization without Reference Model. Subscribe to the Gradient Flow Newsletter: https://gradientflow.substack.com/Subscribe: Apple • Spotify • Overcast • Pocket Casts • AntennaPod • Podcast Addict • Amazon • RSS.Detailed show notes can be found on The Data Exchange web site.
NOW PLAYING
Fine-tuning and Preference Alignment in a Single Streamlined Process
No transcript for this episode yet
Similar Episodes
No similar episodes found.