[39] On Reinforcement Learning in Language Model Training: RLHF with Mike

What this episode covers

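These days it is hard to walk down the street without hearing someone tell a friend about ChatGPT or LLMs. One of the innovations in its training, introduced in fact with InstructGPT, was the use of reinforcement learning based on human-labeled data in the sampling process. We discuss the RLHF algorithm and how it is integrated into large language models (LLMs).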

Frequently Asked Questions

How long is this episode of ExplAInable?

This episode is 55 minutes long.

When was this ExplAInable episode published?

This episode was published on June 13, 2023.

What is this episode about?