193: Entropy as a Lie Detector for Radiology

from Digital Pathology Podcast · host Aleksandra Zuraw, DVM, PhD

Paper Discussed in this AI Journal Club:
Wienholt, P., Caselitz, S., Siepmann, R. et al. Hallucination filtering in radiology vision-language models using discrete semantic entropy. Eur Radiol (2026). https://doi.org/10.1007/s00330-026-12384-z

Episode Summary: In this deep dive, we strip away the marketing hype surrounding medical AI and confront the "black box" problem of Vision Language Models (VLMs) like GPT-4o. We examine a groundbreaking 2026 study published in European Radiology that tackles a terrifying clinical issue: these AI models are incredibly confident, articulate, and often completely wrong. We explore a clever new mathematical wrapper designed to catch the AI in a lie, forcing us to ask: how do we stop the AI from hallucinating with dangerous authority, and can we actually teach it to say "I don't know"?

In This Episode, We Cover:
• The Confident Liar Problem (The Baseline): Why generalist VLMs are fundamentally different from traditional, narrow medical AI. They are probabilistic engines designed to predict the next word, resulting in a dangerous baseline accuracy of just 51.7% on real-world clinical data, essentially a coin flip.
• The Mathematical Lie Detector (Discrete Semantic Entropy): How turning up the AI's "temperature" to 1.0 and asking the exact same question 15 times forces the model to brainstorm, revealing its hidden uncertainties.
• Semantic Clustering (Cutting through the Noise): If the AI says "pneumonia" and then "lung infection," human clinicians know it means the same thing. We discuss how the DSE algorithm groups these answers by their underlying clinical meaning to calculate whether the AI is confidently consistent (low entropy) or randomly guessing (high entropy).
• The Coverage Cost vs. Accuracy Trade-Off: The dramatic results of applying a strict DSE filter. GPT-4o's accuracy jumped from roughly 51% to over 76%, but with a massive catch: it remained completely silent on over half the cases, answering only 47.3% of the clinical questions.
• The Danger Zone (Where AI Fails): Breaking down the performance across modalities. While the AI shone at identifying organs and surprisingly excelled at angiography, it completely fell flat on abnormality detection. On complex 3D CT scans, the filter had to reject over 90% of the questions because the model was fundamentally confused.
• The Trap of the "Confident Hallucination": Why DSE measures consistency, not truth. We explore the nightmare scenario where an AI stubbornly hallucinates the exact same lie 15 times in a row, slipping past the safety filter and creating a massive risk for "automation bias" among clinicians.
• Clinical Feasibility: The surprising practicality of running 15 parallel queries in a real hospital workflow. Because they run simultaneously via an API, the safety check takes only 6 seconds and costs roughly $0.72 per question.

Key Takeaway: Building safer AI might paradoxically risk creating riskier doctors. While Discrete Semantic Entropy successfully filters out the AI's digital noise and confusion (transforming a failing model into a somewhat reliable, albeit very quiet, assistant), it leaves us with a critical human factors challenge. If the system flawlessly cherry-picks the easy cases and stays silent on the hard ones, we must ensure our own diagnostic muscles don't atrophy from over-trusting the machine.
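
The sampling, clustering and filtering steps listed in the episode notes can be sketched in a few lines of Python. This is a minimal illustration, not the paper's implementation. The sampled answers are hard-coded rather than drawn from a model, the SYNONYMS table is a hypothetical stand-in for real semantic clustering, the entropy is computed in bits (the log base is an assumption), answering with the majority cluster is an assumption, and the 0.6 threshold is invented for the demo.

```python
import math
from collections import Counter

# Hypothetical stand-in for semantic clustering: surface forms that share a
# clinical meaning map to one label. A real system needs a proper
# equivalence check; this table exists only for illustration.
SYNONYMS = {"lung infection": "pneumonia"}


def cluster_label(answer: str) -> str:
    """Normalize an answer and map it to its meaning cluster."""
    key = answer.strip().lower()
    return SYNONYMS.get(key, key)


def discrete_semantic_entropy(answers: list[str]) -> float:
    """Shannon entropy (bits) over the share of answers in each cluster."""
    counts = Counter(cluster_label(a) for a in answers)
    total = sum(counts.values())
    return sum((c / total) * math.log2(total / c) for c in counts.values())


def filtered_answer(answers: list[str], max_entropy: float):
    """Return the majority cluster, or None (abstain) if entropy is too high."""
    if discrete_semantic_entropy(answers) > max_entropy:
        return None
    counts = Counter(cluster_label(a) for a in answers)
    return counts.most_common(1)[0][0]


def coverage_and_accuracy(cases, max_entropy: float):
    """cases: list of (sampled_answers, ground_truth) pairs.

    Coverage is the fraction of questions answered; accuracy is measured
    only on the answered ones (None if nothing was answered).
    """
    answered = []
    for answers, truth in cases:
        prediction = filtered_answer(answers, max_entropy)
        if prediction is not None:
            answered.append(prediction == cluster_label(truth))
    coverage = len(answered) / len(cases)
    accuracy = sum(answered) / len(answered) if answered else None
    return coverage, accuracy


# Three invented questions, 15 sampled answers each (as at temperature 1.0).
consistent = ["pneumonia"] * 9 + ["Lung infection"] * 6   # one meaning
confused = ["fracture"] * 5 + ["normal"] * 5 + ["effusion"] * 5
stubborn_wrong = ["normal"] * 15                          # consistent, but wrong

cases = [
    (consistent, "pneumonia"),
    (confused, "fracture"),
    (stubborn_wrong, "nodule"),
]

MAX_ENTROPY = 0.6  # hypothetical threshold, not taken from the paper

if __name__ == "__main__":
    for answers, truth in cases:
        print(truth, discrete_semantic_entropy(answers),
              filtered_answer(answers, MAX_ENTROPY))
    print(coverage_and_accuracy(cases, MAX_ENTROPY))
```

The third case is the "confident hallucination" from the notes: fifteen identical wrong answers have zero entropy, so the filter lets them through. Raising or lowering the threshold trades coverage against accuracy on the answered questions, but it cannot catch an error the model repeats consistently.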



Frequently Asked Questions

How long is this episode of Digital Pathology Podcast?

This episode is 23 minutes long.

When was this Digital Pathology Podcast episode published?

This episode was published on March 4, 2026.
