Fine-tuning and Preference Alignment in a Single Streamlined Process episode artwork

EPISODE · Jun 13, 2024 · 35 MIN

Fine-tuning and Preference Alignment in a Single Streamlined Process

from The Data Exchange with Ben Lorica · host Ben Lorica

Jiwoo Hong and  Noah Lee of KAIST AI are co-authors of ORPO: Monolithic Preference Optimization without Reference Model. Subscribe to the Gradient Flow Newsletter:  https://gradientflow.substack.com/Subscribe: Apple • Spotify • Overcast • Pocket Casts • AntennaPod • Podcast Addict • Amazon •  RSS.Detailed show notes can be found on The Data Exchange web site.

Jiwoo Hong and Noah Lee of KAIST AI are co-authors of ORPO: Monolithic Preference Optimization without Reference Model. Subscribe to the Gradient Flow Newsletter: https://gradientflow.substack.com/ Subscribe: Apple • Spotify • Overcast • Pocket Casts • AntennaPod • Podcast Addict • Amazon • RSS. Detailed show notes can be found on The Data Exchange web site.

NOW PLAYING

Fine-tuning and Preference Alignment in a Single Streamlined Process

0:00 35:32

No transcript for this episode yet

We transcribe on demand. Request one and we'll notify you when it's ready — usually under 10 minutes.

Frequently Asked Questions

How long is this episode of The Data Exchange with Ben Lorica?

This episode is 35 minutes long.

When was this The Data Exchange with Ben Lorica episode published?

This episode was published on June 13, 2024.

What is this episode about?

Jiwoo Hong and  Noah Lee of KAIST AI are co-authors of ORPO: Monolithic Preference Optimization without Reference Model. Subscribe to the Gradient Flow Newsletter:  https://gradientflow.substack.com/Subscribe: Apple • Spotify • Overcast • Pocket...

Can I download this The Data Exchange with Ben Lorica episode?

Yes, you can download this episode by clicking the download button on the episode player, or subscribe to the podcast in your preferred podcast app for automatic downloads.
URL copied to clipboard!