EPISODE · Sep 24, 2025 · 15 MIN

Teaching Machines to Say "I Don't Know"—The AI Hallucination Problem

from AI Rounds by the Cumming School of Medicine · host Office of Faculty Development, Cumming School of Medicine, University of Calgary

Why do GenAI systems confidently state incorrect medical facts instead of saying "I don't know"? Groundbreaking research from OpenAI and Georgia Tech reveals that AI hallucinations aren't bugs to be fixed; they're inevitable consequences of how these systems are trained. This episode explores the "singleton problem" that makes AI systematically unreliable on rare facts, connects to our previous discussion of AI benchmark saturation (Episode 9), and explains why the same evaluation methods that produce impressive test scores actually reward confident guessing over appropriate uncertainty. For medical faculty evaluating AI tools, understanding these statistical realities is crucial for teaching students, conducting research, and developing institutional policies that account for AI's fundamental limitations.

Links from this episode:
https://openai.com/index/why-language-models-hallucinate
