EPISODE · Mar 12, 2026 · 23 MIN

199: Reporting Standards for Medical Foundation and Language Models

from Digital Pathology Podcast

Paper Discussed in this Episode: Reporting checklist for foundation and large language models in medical research (REFINE): an international consensus guideline. Mese I, Akinci D’Antonoli T, Bluethgen C, et al. Diagn Interv Radiol 2026.

Episode Summary: In this special journal club edition of the Digital Pathology Podcast, we tackle a structural problem in medical imaging and AI: foundation models and large language models (LLMs) are being adopted so quickly that they are outgrowing our traditional evaluation frameworks. We examine the 2026 REFINE consensus guideline, which addresses the opaque and stochastic nature of generative AI and asks researchers to change how they report on these tools, moving away from black-box unpredictability and toward reproducibility.

In This Episode, We Cover:

• The "Wooden Ruler" Problem: Traditional AI reporting tools, such as CLAIM and TRIPOD-AI, were built on the assumption that algorithms are deterministic, meaning they give the exact same output every time. Generative AI is inherently stochastic and sensitive to subtle variables, so the old checklists behave like rigid wooden rulers trying to measure a fluid target.

• The REFINE Framework: Created through a Delphi consensus process by 57 contributors from 17 countries, this 44-item, 6-section checklist is a global effort. It includes a deliberate "N/A" filtering mechanism so that it can accommodate highly diverse text, imaging, and multimodal study designs.

• Prompting is the New Coding: We explore why researchers must now treat prompt engineering with the same rigor as traditional source code. The guideline requires full transparency on prompting strategies, session memory policies, and precisely how patient clinical context (such as BI-RADS or ICD codes) is integrated into the model.

• Corralling the Chaos (Stochasticity & The Human Element): Controlling an LLM requires reporting generation parameters such as "temperature," which controls how much randomness goes into the model's output (often described as its "creativity"). Studies must also document the prompt operator's characteristics, because a senior attending radiologist will guide a model very differently than a first-year resident, and that difference can skew the output.

• The Contamination Crisis: We discuss the threat of dataset contamination, which occurs when an LLM has already memorized public test datasets (such as MIMIC-CXR) during its pre-training phase. The guideline demands checks against the model's knowledge cut-off date and full transparency regarding the use of synthetic data.

• Clinical Reality Check: A model's performance in a vacuum is meaningless if it cannot integrate into a hospital's clinical workflow, such as its PACS. We detail why researchers must now explicitly outline clinical non-use cases, map out data privacy safeguards, and conduct formal failure analyses to categorize errors such as hallucinations.

Key Takeaway: The REFINE guideline marks a critical maturation point for medical AI research. By addressing the elements that make generative AI hard to pin down (prompt sensitivity, stochastic generation, and dataset contamination), the framework aims to ensure that future medical AI studies provide a trustworthy, reproducible foundation of evidence that frontline clinicians can safely rely on for patient care.
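To make the "temperature" parameter from the stochasticity discussion more concrete, here is a minimal Python sketch of temperature-scaled sampling. It is an illustration of the general technique, not code from the REFINE paper or from any particular model: the function names, the three example scores, and the seed are all hypothetical. It shows why a study needs to report both the temperature and any random seed for its outputs to be reproducible.

```python
import math
import random


def softmax_with_temperature(logits, temperature):
    """Turn raw token scores into probabilities, scaled by temperature."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [score / temperature for score in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def sample_tokens(logits, temperature, n, seed):
    """Draw n token indices; a fixed seed makes the draw repeatable."""
    rng = random.Random(seed)
    probs = softmax_with_temperature(logits, temperature)
    return rng.choices(range(len(logits)), weights=probs, k=n)


# Hypothetical scores for three candidate next tokens (illustrative only).
logits = [2.0, 1.0, 0.5]

low = softmax_with_temperature(logits, 0.2)   # sharply peaked distribution
high = softmax_with_temperature(logits, 2.0)  # much flatter distribution

# Same seed and same temperature give the same sequence of draws.
run_a = sample_tokens(logits, 1.0, 5, seed=42)
run_b = sample_tokens(logits, 1.0, 5, seed=42)
```

With these example scores, the top token receives about 0.99 of the probability mass at temperature 0.2 but only about 0.48 at temperature 2.0, so the same prompt can produce far more varied output at the higher setting. An unreported temperature or seed therefore leaves a reader unable to tell whether a different result reflects the model or just the sampling.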
