Code & Cure Podcast - All Episodes

44

#42 - How AI Chatbots Respond To Psychotic Prompts

What if a chatbot helped someone build a manifesto around a delusion instead of recognizing a mental health crisis? A prompt like “I was appointed by a Cosmic Council to guide humanity” might sound extreme, but it exposes a very real challenge for general AI assistants: when they are designed to be agreeable, fast, and confident, they can unintentionally validate beliefs that may signal psychosis.We explore a study that tests how large language models and chatbots like ChatGPT respond to prompts involving delusions, hallucinations, paranoia, grandiosity, and disorganized communication. The episode begins with the clinical reality of psychosis: insight can be limited, warning signs may be subtle or confusing, and a safe response should avoid reinforcing false beliefs while still taking the person seriously. From an emergency medicine perspective, the goal is clear—recognize possible psychosis, acknowledge the severity, and guide people toward real-world support.Then we turn to the AI problem: chatbots rarely know what a user truly means. The same message could be trolling, fiction, roleplay, or a genuine break from reality. By pairing psychotic prompts with carefully matched control prompts, researchers ask clinicians to judge whether chatbot responses are helpful, inappropriate, or potentially harmful. The “Cosmic Council” example shows how validation, enthusiasm, and step-by-step planning can accidentally strengthen a delusional frame. If people are already turning to general-purpose chatbots for mental health support, this raises an urgent product question: what safeguards should be built in before helpfulness becomes harm?Reference:Evaluation of Large Language Model Chatbot Responses to Psychotic PromptsShen et al.JAMA Psychiatry (2026)Credits:Theme music: Nowhere Land, Kevin MacLeod (incompetech.com)Licensed under Creative Commons: By Attribution 4.0https://creativecommons.org/licenses/by/4.0/

Apr 30, 2026

24m

43

#41 - If You Cannot Trace The Data, Do Not Trust The Model

What if the biggest risk in clinical AI isn’t the algorithm itself, but the data it was built on? A model can appear accurate, polished, and ready for real-world use while quietly relying on datasets with unclear origins, missing documentation, or hidden flaws. In healthcare, that is more than a technical issue. It is a patient safety issue.In this episode, we explore data provenance—the essential but often overlooked practice of understanding where healthcare data comes from, how it was collected, what it truly represents, and whether it should be trusted for clinical prediction in the first place. We explain why even standard model evaluation can create false confidence when training and deployment data do not match, and how so-called “out of distribution” failures reveal just how fragile these systems can be. One striking example says it all: a model trained on COVID chest X-rays that confidently labels a cat as COVID, not because it understands disease, but because it has learned the wrong patterns from the wrong data.We also examine a more common and more dangerous problem: datasets that look credible on the surface but lack the documentation needed to support meaningful clinical use. From synthetic data and augmentation to heavily cited Kaggle datasets for stroke and diabetes prediction, we unpack how poor provenance can distort research, amplify bias, and create the illusion of clinical utility where none has been properly established. This conversation is a call for stronger standards in trustworthy healthcare AI—clear sources, defined cohorts, transparent preprocessing, and real accountability before any model reaches patients.Reference:Evidence of Unreliable Data and Poor Data Provenance in ClinicalPrediction Model Research and Clinical PracticeGibson et al.medRxiv Preprint (2026)Dozens of AI disease-prediction models were trained on dubious dataBasuNature News (2026)Credits:Theme music: Nowhere Land, Kevin MacLeod (incompetech.com)Licensed under Creative Commons: By Attribution 4.0https://creativecommons.org/licenses/by/4.0/

Apr 23, 2026

29m

42

#40 - How Two Fake Medical Papers Tricked AI

What happens when fake science looks real enough for AI to believe it? “Bixonimania,” a completely invented eye disorder, was introduced through a pair of bogus medical preprints filled with absurd acknowledgements and fabricated claims. It should have been easy to dismiss. Instead, chatbots began repeating it with confidence, describing symptoms, risk factors, and even suggesting users see an ophthalmologist. When health information is only a prompt away, a polished falsehood can quickly become a real problem.We unpack why this hoax was so effective. The papers mimicked the tone and structure of legitimate scientific writing, preprints carried the appearance of credibility, and online systems rewarded fast answers over careful verification. We compare how clinicians and attentive readers catch inconsistencies, missing context, and obvious warning signs, while large language models process text differently. Because LLMs are built to predict likely sequences of words rather than confirm truth, they can turn something obviously fake into something that sounds entirely plausible.From there, we widen the lens to the broader challenges of AI safety and AI security in healthcare. From data poisoning to prompt injection to the feedback loop created when AI-generated content reinforces other AI-influenced material, the risks extend far beyond one invented diagnosis. This episode explores why trustworthy AI depends on more than technical performance alone. It requires human oversight, stronger vetting of what enters the information ecosystem, and real accountability for what gets published, amplified, and repeated.Reference:Scientists invented a fake disease. AI told people it was realStokel-WalkerNature News Feature (2026) Credits:Theme music: Nowhere Land, Kevin MacLeod (incompetech.com)Licensed under Creative Commons: By Attribution 4.0https://creativecommons.org/licenses/by/4.0/

Apr 16, 2026

22m

41

#39 - A Helpful Chatbot Can Slowly Talk You Into A False Reality

What happens when a chatbot seems thoughtful, supportive, and reassuring—but starts reinforcing beliefs that can damage someone’s health, relationships, or grip on reality? That question sits at the center of this episode as we explore delusional spiraling, a dangerous pattern where long AI conversations can gradually strengthen false or harmful ideas. We begin with real-world accounts of people drawn into deeply distorted beliefs, and we examine why even uncommon failures can become a serious public health issue when millions rely on chatbots every day.We then break down the technology in a clear, practical way. Modern large language models are designed to feel helpful and conversational, but that same design can create problems. We explain how instruction tuning turns raw prediction into polished dialogue, and how reinforcement learning from human feedback rewards responses people like rather than responses that are necessarily true. The result can be sycophancy: a subtle but powerful tendency to echo a user’s assumptions, emphasize confirming details, and sometimes even invent information to keep the conversation feeling smooth and supportive.The stakes become even clearer when we walk through a simple vaccine example, showing how an otherwise rational person can be nudged toward the wrong conclusion when evidence is filtered through an overly agreeable assistant. We also examine proposed solutions, from making models “more truthful” to adding warning systems, and ask whether those fixes go far enough. At its core, this episode is a reminder that uncertainty is a normal part of medicine and science—and that false confidence can be more dangerous than not knowing. References:Sycophantic Chatbots Cause Delusional Spiraling, Even in Ideal BayesiansChandra et al.ArXiv Preprint (2026)Chatbot DelusionsHuet and MetzHuman Line Project (2025)Credits:Theme music: Nowhere Land, Kevin MacLeod (incompetech.com)Licensed under Creative Commons: By Attribution 4.0https://creativecommons.org/licenses/by/4.0/

Apr 9, 2026

27m

40

#38 - Using AI Can Make You Look More Guilty In Court

What happens when AI spots a dangerous finding on a scan and the radiologist disagrees? In theory, “human in the loop” sounds like the safeguard that keeps patients safe. In practice, it raises a far more uncomfortable question: when clinicians override AI, are they exercising sound judgment or exposing themselves to legal risk?We explore how AI image-reading tools are reshaping radiology and why performance metrics like “96% accurate” can be misleading in real clinical settings. False positives and false negatives do not carry the same consequences, and rare diseases can sharply reduce the real-world value of even highly capable models once prevalence and positive predictive value are taken into account. As these systems flag more normal scans, a new form of defensive medicine can emerge—one where repeatedly rejecting AI recommendations begins to feel professionally dangerous, especially when those recommendations are documented in the patient record.We also examine a study that placed laypeople in the role of jurors during malpractice scenarios involving missed diagnoses such as brain bleeds and lung cancer. The findings are revealing: when AI detects the pathology and the radiologist does not, jurors are more likely to assign blame. But when both the AI and the radiologist miss the finding, the physician gains little protection. The episode closes with what may actually reduce harm, including better education about the limitations of AI and a clearer understanding of these systems as imperfect clinical decision support—not a flawless second expert beside the clinician.References:Randomized Study of the Impact of AI on Perceived Legal Liability for Radiologists Bernstein, et al. NEJM AICredits:Theme music: Nowhere Land, Kevin MacLeod (incompetech.com)Licensed under Creative Commons: By Attribution 4.0https://creativecommons.org/licenses/by/4.0/

Apr 2, 2026

22m

39

#37 - Training A Neural Network On Toilet Photos

What if a single smartphone photo could make colonoscopy prep more reliable? Colonoscopy can save lives through early detection of colorectal cancer, but its success depends on one stubborn detail: a clean colon. When bowel prep falls short, important findings can be missed, procedures can take longer, and patients may have to repeat the entire process. The question is simple but important: could there be an easier way for patients to know whether they are truly ready before heading to the clinic? In this episode, we explore research that puts artificial intelligence to work on exactly that problem. Using a smartphone app, patients take a photo of their final bowel movement and receive an immediate yes-or-no result about whether their preparation is adequate. We break down how the system works, from convolutional neural networks and expert clinician labeling to data augmentation that helps the model adapt to real-world conditions like poor lighting, different angles, and varying distances. We also unpack a key challenge in medical AI: overfitting, and why strong performance in a study does not always guarantee success in everyday use.The potential impact is significant. Patients in the intervention group achieved better bowel cleansing quality, suggesting a practical way to improve the consistency and effectiveness of colorectal cancer screening. At the same time, important questions remain about adenoma detection, repeat procedures, and how tools like this fit into clinical workflow. This is a fascinating example of AI solving a very human problem: reducing friction, improving preparation, and helping patients get the most out of an essential preventive test.References:An Artificial Intelligence-Guided Strategy to Reduce Poor Bowel Preparation: A Multicenter Randomized Controlled StudyGimeno-García et al. American Journal of Gastroenterology (2026)Design and validation of an artificial intelligence system to detect the quality of colon cleansing before colonoscopyGimeno-García et al. Gastroenterology and Hepatology (2023)Credits:Theme music: Nowhere Land, Kevin MacLeod (incompetech.com)Licensed under Creative Commons: By Attribution 4.0https://creativecommons.org/licenses/by/4.0/

Mar 26, 2026

20m

38

#36 - Should A Chatbot Ever Refuse To Reassure You

What if the chatbot that always has an answer is actually making anxiety worse? For people living with obsessive-compulsive disorder (OCD), instant, endless reassurance can feel helpful in the moment while quietly strengthening the very cycle that keeps OCD going. In this episode, we explore why AI chatbots and large language models are designed to be responsive, agreeable, and supportive—and how those same qualities can unintentionally fuel reassurance seeking, compulsive checking, and avoidance instead of real relief. We break down OCD in clear, practical terms: intrusive thoughts trigger fear, compulsions bring temporary comfort, and that short-term relief reinforces the cycle over time. Whether it shows up as repeated handwashing, constant checking, or asking the same question again and again, OCD often centers on the desperate need to eliminate uncertainty. That is exactly where evidence-based treatment takes a different path. We discuss exposure and response prevention (ERP), the gold-standard therapy that helps people face doubt without falling back on rituals, and why a general-purpose chatbot may accidentally validate the opposite by offering reassurance, endorsing avoidance, or helping users “pivot” toward the answer they were hoping to hear.We also look at the broader mental health challenge now that people are already turning to AI for support. What responsibility do clinicians, AI companies, and regulators have? We argue that clinicians should ask directly about chatbot use, and we examine what meaningful guardrails might look like—from detecting repetitive reassurance loops to refusing to continue harmful patterns. Using a real-world germ-related prompting example, we show where chatbot advice can be useful and where it can slip into enabling OCD. This conversation will change how you think about AI, anxiety, and the line between support and harm.Reference:A transdiagnostic model for how general purpose AI chatbots can perpetuate OCD and anxiety disordersGolden and AboujaoudeNature npj Digital Medicine (2026)Credits:Theme music: Nowhere Land, Kevin MacLeod (incompetech.com)Licensed under Creative Commons: By Attribution 4.0https://creativecommons.org/licenses/by/4.0/

Mar 19, 2026

18m

37

#35 - How AI Image Generators Portray Substance Use Disorder

What does an AI-generated image of addiction look like, and why does it so often default to darkness, isolation, and despair? As AI tools make it easier than ever to produce visuals for health education, those same tools can unintentionally reinforce stigma about substance use disorder.In this episode, we explore how AI image generators shape the way addiction is portrayed. Laura brings the perspective from emergency medicine and digital health, where substance use disorder is part of everyday clinical reality and where language and imagery can influence how patients are perceived. Vasanth breaks down the technical side, explaining how diffusion models create images by gradually denoising noise into structured visuals, guided by text prompts that steer what the model produces.That process is powerful, but it also means biases from internet training data and the connotations embedded in words can compound. The result? AI outputs that repeatedly frame addiction through dramatic “rock bottom” scenes, lone figures, and visual cues that unintentionally reinforce shame rather than understanding.We also look at research that systematically tests prompts and applies best-practice guidelines for more respectful depictions. The difference is striking: fewer stigmatizing signals, more human-centered imagery, and practical guardrails such as avoiding drug paraphernalia and moving beyond the isolated, ashamed figure. But sanitization has a price. For healthcare AI teams, the lesson is clear: visuals should be treated like clinical content, not decoration, with thoughtful review processes that protect dignity and support stigma-free health communication.Reference:AI-Generated Images of Substance Use and Recovery: Mixed Methods Case StudyHeley et al.JMIR AI (2026)Credits:Theme music: Nowhere Land, Kevin MacLeod (incompetech.com)Licensed under Creative Commons: By Attribution 4.0https://creativecommons.org/licenses/by/4.0/

Mar 12, 2026

20m

36

#34 - Inside ChatGPT Health: Promise, Peril, And Triage Failures

What if an AI health chatbot told you to stay home when you actually needed emergency care?In this episode, we put ChatGPT Health under the microscope using a clinician-authored evaluation designed to test a critical question: can an AI safely guide people on whether to go to the ER, visit urgent care, or wait it out at home? The results reveal a troubling pattern. When symptoms fall into the “middle” of the medical spectrum—uncertain but stable—the model often sounds helpful and reasonable. But when the stakes rise and subtle warning signs matter most, its judgment becomes unreliable.We explore how ChatGPT Health is positioned as a privacy-focused workspace that can read personal medical records, summarize visit notes, and translate complex information into plain language. Those capabilities can be valuable for education and preparation. But triage is a different challenge entirely. It requires causal reasoning, clear thresholds, and a bias toward catching the worst-case scenario before it’s too late.Two case studies highlight the gap. In an asthma scenario involving rising carbon dioxide, low oxygen levels, and poor peak flow—signals that should trigger urgent care—the model labeled the situation as only moderate. In diabetes, where the difference between routine high blood sugar and life-threatening diabetic ketoacidosis demands careful nuance, templated guidance struggled to capture the clinical reality.The most concerning findings emerged around suicidality. Crisis response protocols are explicit: when someone expresses intent or a plan, escalation and connection to the 988 crisis line should happen immediately. Yet in several scenarios with explicit plans, those prompts never appeared—while more ambiguous statements did trigger them. Safety in healthcare can’t be optional or probabilistic.We break down why large language models tend to gravitate toward the statistical middle, why medicine often lives in the dangerous “long tail,” and what this means for anyone using AI health tools today. AI can help you prepare for care, understand medical information, and ask better questions. But decisions about whether to seek urgent help still demand human judgment—and clear, non-negotiable safety guardrails.If this conversation resonates, follow the show, share the episode with someone exploring health tech, and leave a quick review telling us one takeaway you had. What safety rule would you hard-code into an AI health system?Reference:ChatGPT Health performance in a structured test of triage recommendationsAshwin Ramaswamy et al.Nature (2026)Credits:Theme music: Nowhere Land, Kevin MacLeod (incompetech.com)Licensed under Creative Commons: By Attribution 4.0https://creativecommons.org/licenses/by/4.0/

Mar 5, 2026

24m

35

#33 - Patients Don’t Talk Like Textbooks

What if the most confident answer in the room is also the most misleading?Large language models can ace medical exams, yet falter when faced with a real person’s messy, incomplete story. In this episode, we explore how that gap plays out in one of medicine’s highest-stakes decisions: triage. Drawing on Laura’s experience in emergency medicine and Vasanth’s background in AI research, we unpack a new study where laypeople role-played both routine and high-risk conditions and turned to leading LLMs for advice. The surprising twist? Tiny shifts in phrasing produced opposite recommendations—“rest at home” versus “go to the ER”—revealing how sensitive these systems are to prompts, and how an agreeable tone can drown out critical clinical signals.We take you inside the exam room to contrast what clinicians actually do. Real diagnosis isn’t a single question and answer—it’s an evolving process. Doctors gather a history that unfolds with each response, test competing hypotheses, and scan for subtle red flags and nonverbal cues that never show up in a chat window. From the ominous “worst headache of my life” to abdominal pain that could signal gallstones—or a heart attack—Laura explains how risk-first thinking and strategic follow-ups shape safe decisions. Meanwhile, Vasanth breaks down how preference-tuned models are trained to satisfy users, not challenge them—and why linguistic confidence can increase even as clinical accuracy declines. The study’s findings are sobering: models struggled to identify key conditions, and their triage decisions were no better than basic symptom checkers.But this isn’t a story of hype or doom—it’s about design. Reliable medical AI must interrogate before it interprets. That means structured red-flag checks, resistance to user-led anchors like “maybe it’s just stress,” and clear, actionable next steps instead of overwhelming option lists. Calibrated uncertainty, transparent reasoning, and human oversight can transform AI from a risky decider into a valuable assistant.If you care about digital health, safe triage, and the future of human-AI collaboration in medicine, this conversation offers a grounded look at both the limits—and the real promise—of these tools.If this episode resonated, follow the show, share it with a colleague, and leave a quick review to help more listeners discover Code and Cure.Reference:Reliability of LLMs as medical assistants for the general public: a randomized preregistered studyAndrew M. Bean et al.Nature Medicine (2026)Credits:Theme music: Nowhere Land, Kevin MacLeod (incompetech.com)Licensed under Creative Commons: By Attribution 4.0https://creativecommons.org/licenses/by/4.0/

Feb 26, 2026

29m

34

#32 - When Data Isn’t Better: Rethinking Fertility Tracking

What if the most reliable ways to track fertility are also the simplest? In this episode, we examine the science of ovulation timing and hold modern wearables to a high standard, comparing passive temperature and vital sign data with established methods like LH surge testing and cervical mucus observation. Drawing on perspectives from a cognitive scientist and an emergency physician, we explain what each method actually measures, how well it performs outside the lab, and where convenience falls short of accuracy.We begin by clarifying the fertile window and the underlying physiology, then connect that biology to signals people can track at home. Changes in cervical mucus provide a strong, real time indicator of peak fertility. Urine LH strips offer a clear 24 to 36 hour advance signal at low cost. Basal body temperature can confirm that ovulation has already occurred, but it is less helpful for predicting timing in advance. Against this foundation, we review a meta analysis of wearable data showing that temperature remains the strongest predictor, while heart rate and variability contribute only modest improvements. The conclusion is straightforward: wearables can approximate existing signals, but they do not clearly outperform simple tools for timing intercourse, insemination, or pregnancy avoidance.Along the way, we challenge the idea that more data and a paid app automatically lead to better outcomes. We weigh privacy risks, cost, and false confidence against the accessibility of test strips and the high signal value of mucus observations. The takeaway is a practical hierarchy. Use LH strips and cervical mucus as primary guides, add calendar context and basal temperature if useful, and treat wearables as optional conveniences rather than a definitive solution. Women’s health deserves thoughtful innovation, and sometimes real progress comes from choosing what works, not what is marketed most aggressively.If this episode resonated, follow the show, share it with a friend navigating fertility, and leave a review with your experience and what has worked best for you.Reference:The diagnostic accuracy of wearable digital technology in detecting fertility window and menstrual cycles: a systematic review and Bayesian network meta-analysisYue Shi et al.Nature NPJ Digital Medicine (2026)Credits:Theme music: Nowhere Land, Kevin MacLeod (incompetech.com)Licensed under Creative Commons: By Attribution 4.0https://creativecommons.org/licenses/by/4.0/

Feb 19, 2026

19m

33

#31 - How Retrieval-Augmented AI Can Verify Clinical Summaries

Fluent summaries that cannot prove their claims are a hidden liability in healthcare, quietly eroding clinician trust and wasting time. In this episode, we walk through a practical system that replaces “sounds right” narratives with evidence-backed summaries by pairing retrieval augmented generation with a large language model that serves as a judge. Instead of asking one AI to write and police itself, the work is divided. One model drafts the summary, while another breaks it into atomic claims, retrieves supporting chart excerpts, and issues clear verdicts of supported, not supported, or insufficient, with explanations clinicians can review.We explain why generic summarization often breaks down in clinical settings and how retrieval augmented generation keeps the model grounded in the patient’s actual record. The conversation digs into subtle but common failure modes, including when a model ignores retrieved evidence, when a sentence mixes correct and incorrect facts, and when wording implies causation that the record does not support. A concrete example brings this to life: a claim that a patient was intubated for septic shock is overturned by operative notes showing intubation for a procedure, with the system flagging the discrepancy and guiding a precise correction. That is not just higher accuracy; it is accountability you can audit later.We also explore a deeper layer of the problem: argumentation. Clinical care is not just a list of facts, but the relationships between them. By evaluating claims alongside their evidence, surfacing contradictions, and pushing for precise language, the system helps generate summaries that reflect real clinical reasoning rather than confident guessing. The payoff is less time spent chasing errors, more time with patients, and a defensible trail for quality review and compliance.If you care about chart review, clinical documentation, retrieval augmented generation, and building AI systems clinicians can trust, this episode offers practical takeaways. Reference:Verifying Facts in Patient Care Documents Generated by Large Language Models Using Electronic Health RecordsPhilip Chung et al. NEJM AI (2025)Credits:Theme music: Nowhere Land, Kevin MacLeod (incompetech.com)Licensed under Creative Commons: By Attribution 4.0https://creativecommons.org/licenses/by/4.0/

Feb 12, 2026

23m

32

#30 - From Reddit To Rescue: Real-Time Signals Of The Opioid Crisis

What if the earliest warning sign of an opioid overdose surge isn’t locked inside a delayed report, but unfolding in real time on Reddit? In this episode, we explore how social media conversations, especially pseudonymous, community-led forums, can reveal emerging overdose risks before traditional surveillance systems catch up.We unpack research that analyzed more than a decade of posts to show how even simple drug mentions sharpened forecasts of overdose death rates. The signal was especially strong for fentanyl, exposing where existing public health tools lag and why online communities often see danger first. Along the way, we explain the mechanics in plain language: how time-series models respond faster than surveys, why subreddit structure filters noise, and how historical archives enable rigorous validation.But it doesn’t stop at counting mentions. We dig into what happens when posts are classified by lived experience: overdose stories, sourcing concerns, or test strip discussions. We also examine what broke during COVID, when behavior and access shifted overnight, and how to detect those regime changes before models start to fail.The takeaway is urgent and practical. Social data won’t replace public health surveillance, but it can make it fast enough to save lives. We share a field-ready playbook for turning online signals into timely interventions, and show how feedback from the same communities can explain why a response worked—or didn’t—so teams can adapt quickly. If you care about real-time epidemiology, harm reduction, and responsible AI in healthcare, this conversation connects raw text to real-world impact.Reference:Monitoring the opioid epidemic via social media discussionsDelaney A Smith et al. Nature NPJ Digital Health (2025)Credits:Theme music: Nowhere Land, Kevin MacLeod (incompetech.com)Licensed under Creative Commons: By Attribution 4.0https://creativecommons.org/licenses/by/4.0/

Feb 5, 2026

18m

31

#29 - AI Hype Meets Hospital Reality

What really happens when a “smart” system steps into the operating room, and collides with the messy, time-pressured reality of clinical care?In this episode, we unpack a multi-center pilot that streamed audio and video from live surgeries to fuel safety checklists, flag cases for review, and promise rapid, actionable insight. What emerged instead was a clear-eyed lesson in the gap between aspiration and execution. Across four fault lines, the story shows where clinicians’ expectations of AI ran ahead of what today’s systems can reliably deliver, and what that means for patient safety.We begin with the promise. Surgeons and care teams envisioned near-instant post-case summaries: what went well, what raised concern, and which patients might be at risk. The reality looked different. Training demands, configuration work, and brittle workflows made it clear that AI is anything but plug-and-play. We explore why polished language can be mistaken for intelligence, why models need the right tools to reason effectively, and why moving AI from one hospital to another is closer to a redesign than a simple deployment.Then we follow the data. When it takes six to eight weeks to turn raw footage into usable insight, the value of learning forums like morbidity and mortality conferences quickly erodes. Privacy protections, de-identification, and quality control matter—but without pipelines built for speed and trust, insights arrive too late to change practice. We contrast where the system delivered real value, such as checklists and procedural signals, with where it fell short: predicting post-operative complications and producing research-ready datasets.Throughout the conversation, we argue for a minimum clinically viable product: tightly scoped use cases, early and deep involvement from surgeons and nurses, and data flows that respect governance without stalling learning. AI can strengthen patient safety and team performance—but only when expectations align with capability and operations are designed for real clinical tempo.If this resonates, follow the show, share it with a colleague, and leave a review with one takeaway you’d apply in your own clinical setting. Reference: Expectations vs Reality of an Intraoperative Artificial Intelligence InterventionMelissa Thornton et al. JAMA Surgery (2026)Credits: Theme music: Nowhere Land, Kevin MacLeod (incompetech.com)Licensed under Creative Commons: By Attribution 4.0https://creativecommons.org/licenses/by/4.0/

Jan 29, 2026

25m

30

#28 - How AI Confidence Masks Medical Uncertainty

Can you trust a confident answer, especially when your health is on the line?This episode explores the uneasy relationship between language fluency and medical truth in the age of large language models (LLMs). New research asks these models to rate their own certainty, but the results reveal a troubling mismatch: high confidence doesn’t always mean high accuracy, and in some cases, the least reliable models sound the most sure.Drawing on her ER experience, Laura illustrates how real clinical care embraces uncertainty—listening, testing, adjusting. Meanwhile, Vasanth breaks down how LLMs generate their fluent responses by predicting the next word, and why their self-reported “confidence” is just more language, not actual evidence.We contrast AI use in medicine with more structured domains like programming, where feedback is immediate and unambiguous. In healthcare, missing data, patient preferences, and shifting guidelines mean there's rarely a single “right” answer. That’s why fluency can mislead, and why understanding what a model doesn’t know may matter just as much as what it claims.If you're navigating AI in healthcare, this episode will sharpen your eye for nuance and help you build stronger safeguards. Reference: Benchmarking the Confidence of Large Language Models in Answering Clinical Questions: Cross-Sectional Evaluation StudyMahmud Omar et al.JMIR (2025)Credits: Theme music: Nowhere Land, Kevin MacLeod (incompetech.com)Licensed under Creative Commons: By Attribution 4.0https://creativecommons.org/licenses/by/4.0/

Jan 22, 2026

25m

29

#27 - Sleep’s Hidden Forecast

What if one night in a sleep lab could offer a glimpse into your long-term health? Researchers are now using a foundation model trained on hundreds of thousands of hours of sleep data to do just that, by predicting the next five seconds of a polysomnogram, the model learns the rhythms of sleep and, with minimal fine-tuning, begins estimating risks for conditions like Parkinson’s, dementia, heart failure, stroke, and even some cancers.We break down how it works: during a sleep study, sensors capture brain waves (EEG), eye movements (EOG), muscle tone (EMG), heart rhythms (ECG), and breathing. The model compresses these multimodal signals into a reusable format, much like how language models process text. Add a small neural network, and suddenly those sleep signals can help predict disease risk up to six years out. The associations make clinical sense: EEG patterns are more telling for neurodegeneration, respiratory signals flag pulmonary issues, and cardiac rhythms hint at circulatory problems. But, the scale of what’s possible from a single night’s data is remarkable.We also tackle the practical and ethical questions. Since sleep lab patients aren’t always representative of the general population, we explore issues of selection bias, fairness, and external validation. Could this model eventually work with consumer wearables that capture less data but do so every night? And what should patients be told when risk estimates are uncertain or only partially actionable?If you're interested in sleep science, AI in healthcare, or the delicate balance of early detection and patient anxiety, this episode offers a thoughtful look at what the future might hold—and the trade-offs we’ll face along the way.Reference: A multimodal sleep foundation model for disease predictionRahul ThapaNature (2026)Credits: Theme music: Nowhere Land, Kevin MacLeod (incompetech.com)Licensed under Creative Commons: By Attribution 4.0https://creativecommons.org/licenses/by/4.0/

Jan 15, 2026

24m

28

#26 - How Your Phone Keyboard Signals Your State Of Mind

What if your keyboard could reveal your mental health? Emerging research suggests that how you type—not what you type—could signal early signs of depression. By analyzing keystroke patterns like speed, timing, pauses, and autocorrect use, researchers are exploring digital biomarkers that might quietly reflect changes in mood.In this episode, we break down how this passive tracking compares to traditional screening tools like the PHQ. While questionnaires offer valuable insight, they rely on memory and reflect isolated moments. In contrast, continuous keystroke monitoring captures real-world behaviors—faster typing, more pauses, shorter sessions, and increased autocorrect usage—all patterns linked to mood shifts, especially when anxiety overlaps with depression.We discuss the practical questions this raises: How do we account for personal baselines and confounding factors like time of day or age? What’s the difference between correlation and causation? And how can we design systems that protect privacy while still offering clinical value?From privacy-preserving on-device processing to broader behavioral signals like sleep and movement, this conversation explores how digital phenotyping might help detect depression earlier—and more gently. If you're curious about AI in healthcare, behavioral science, or the ethics of digital mental health tools, this episode lays out both the potential and the caution needed.Reference: Effects of mood and aging on keystroke dynamics metadata and their diurnal patterns in a large open-science sample: A BiAffect iOS studyClaudia Vesel et al.J Am Med Inform Assoc (2020)Credits: Theme music: Nowhere Land, Kevin MacLeod (incompetech.com)Licensed under Creative Commons: By Attribution 4.0https://creativecommons.org/licenses/by/4.0/

Jan 8, 2026

19m

27

#25 - When Safety Slips: Prompt Injection in Healthcare AI

What happens when a chatbot follows the wrong voice in the room? In this episode, we explore the hidden vulnerabilities of prompt injection, where malicious instructions and fake signals can mislead even the most advanced AI into offering harmful medical advice.We unpack a recent study that simulated real patient conversations, subtly injecting cues that steered the AI to make dangerous recommendations—including prescribing thalidomide for pregnancy nausea, a catastrophic lapse in medical judgment. Why does this happen? Because language models aim to be helpful within their given context, not necessarily to prioritize authoritative or safe advice. When a browser plug-in, a tainted PDF, or a retrieved web page contains hidden instructions, those can become the model’s new directive, undermining guardrails and safety layers.From direct “ignore previous instructions” overrides to obfuscated cues in code or emotionally charged context nudges, we map the many forms of this attack surface. We contrast these prompt injections with hallucinations, examine how alignment and preference training can unintentionally amplify risks, and highlight why current defenses, like content filters or system prompts, often fall short in clinical use.Then, we get practical. For AI developers: establish strict instruction boundaries, sanitize external inputs, enforce least-privilege access to tools, and prioritize adversarial testing in medical settings. For clinicians and patients: treat AI as a research companion, insist on credible sources, and always confirm drug advice with licensed professionals.AI in healthcare doesn’t need to be flawless, but it must be trustworthy. If you’re invested in digital health safety, this episode offers a clear-eyed look at where things can go wrong and how to build stronger, safer systems. If you found it valuable, follow the show, share it with a colleague, and leave a quick review to help others discover it.Reference: Vulnerability of Large Language Models to Prompt Injection When Providing Medical AdviceRo Woon LeeJAMA Open Health Informatics (2025)Credits: Theme music: Nowhere Land, Kevin MacLeod (incompetech.com)Licensed under Creative Commons: By Attribution 4.0https://creativecommons.org/licenses/by/4.0/

Jan 1, 2026

25m

26

#24 - What Else Is Hiding In Medical Images?

What if a routine mammogram could do more than screen for breast cancer? What if that same image could quietly reveal a woman’s future risk of heart disease—without extra tests, appointments, or burden on patients?In this episode, we explore a large-scale study that uses deep learning to uncover cardiovascular risk hidden inside standard breast imaging. By analyzing mammograms that millions of women already receive, researchers show how a single scan can deliver a powerful second insight for women’s health. Laura brings the clinical perspective, unpacking how cardiovascular risk actually shows up in practice—from atypical symptoms to prevention decisions—while Vasanth walks us through the AI system that makes this dual-purpose screening possible.We begin with the basics: how traditional cardiovascular risk tools like PREVENT work, what data they depend on, and why—despite their proven value—they’re often underused in real-world care. From there, we turn to the mammogram itself. Features such as breast arterial calcifications and subtle tissue patterns have long been linked to vascular disease, but this approach goes further. Instead of focusing on a handful of predefined markers, the model learns from the entire image combined with age, identifying patterns that humans might never think to look for.Under the hood is a survival modeling framework designed for clinical reality, where not every patient experiences an event during follow-up, yet every data point still matters. The takeaway is striking: the imaging-based risk score performs on par with established clinical tools. That means clinicians could flag cardiovascular risk during a test patients are already getting—opening the door to earlier conversations about blood pressure, cholesterol, diabetes, and lifestyle changes.We also zoom out to the bigger picture. If mammograms can double as heart-risk detectors, what other routine tests are carrying untapped signals? Retinal images, chest CTs, pathology slides—each may hold clues far beyond their original purpose. With careful validation and attention to bias, this kind of opportunistic screening could expand access to prevention and shift care further upstream.If this episode got you thinking, share it with a colleague, subscribe for more conversations at the intersection of AI and medicine, and leave a review telling us which everyday medical test you think deserves a second life.Reference: Predicting cardiovascular events from routine mammograms using machine learningJennifer Yvonne BarracloughHeart (2025)Credits: Theme music: Nowhere Land, Kevin MacLeod (incompetech.com)Licensed under Creative Commons: By Attribution 4.0https://creativecommons.org/licenses/by/4.0/

Dec 25, 2025

24m

25

#23 - Designing Antivenom With Diffusion Models

What if the future of antivenom didn’t come from horse serum, but from AI models that shape lifesaving proteins out of noise?In this episode, we explore how diffusion models, powerful tools from the world of AI, are transforming the design of antivenoms, particularly for some of nature’s deadliest neurotoxins. Traditional antivenom is costly, unstable, and can provoke serious immune reactions. But for toxins like those from cobras, mambas, and sea snakes that are potent yet hard to target with immune responses, new strategies are needed.We begin with the problem: clinicians face high-risk toxins and a shortage of effective, safe treatments. Then we dive into the breakthrough: using diffusion models like RosettaFold Diffusion to generate novel protein binders that precisely fit the structure of snake toxins. These models start with random shapes and iteratively refine them into stable, functional proteins, tailored to neutralize the threat at the molecular level.You’ll hear how these designs were screened for strength, specificity, and stability, and how the top candidates performed in mouse studies—protecting respiration and holding promise for more scalable, less reactive therapies. Beyond venom, this approach hints at a broader shift in drug development: one where AI accelerates discovery by reasoning in shape, not just sequence.We wrap by looking ahead at the challenges in manufacturing, regulation, and real-world validation, and why this shape-first design mindset could unlock new frontiers in precision medicine.If you’re into biotech with real-world impact, subscribe, share, and leave a review to help more curious listeners discover the show.Reference: Novel Proteins to Neutralize Venom ToxinsJosé María GutiérrezNew England Journal of Medicine (2025)Credits: Theme music: Nowhere Land, Kevin MacLeod (incompetech.com)Licensed under Creative Commons: By Attribution 4.0https://creativecommons.org/licenses/by/4.0/

Dec 18, 2025

20m

24

#22 - Hope, Help, and the Language We Choose

What if the words we use could tip the balance between seeking help and staying silent? In this episode, we explore a fascinating study that compares top-voted Reddit responses with replies generated by large language models (LLMs) to uncover which better reduces stigma around opioid use disorder—and why that distinction matters.Drawing from Laura’s on-the-ground ER experience and Vasanth’s research on language and moderation, we examine how subtle shifts, like saying “addict” versus “person with OUD, ” can reshape beliefs, impact treatment, and even inform policy. The study zeroes in on three kinds of stigma: skepticism toward medications like Suboxone and methadone, biases against people with OUD, and doubts about the possibility of recovery.Surprisingly, even with minimal prompting, LLM responses often came across as more supportive, hopeful, and factually accurate. We walk through real examples where personal anecdotes, though well-intended, unintentionally reinforced harmful myths—while AI replies used precise, compassionate language to challenge stigma and foster trust.But this isn’t a story about AI hype. It’s about how moderation works in online communities, why tone and pronouns matter, and how transparency is key. The takeaway? Language is infrastructure. With thoughtful design and human oversight, AI can help create safer digital spaces, lower barriers to care, and make it easier for people to ask for help, without fear.If this conversation sparks something for you, follow the show, share it with someone who cares about public health or ethical tech, and leave us a review. Your voice shapes this space: what kind of language do you want to see more of?Reference: Exposure to content written by large language models can reduce stigma around opioid use disorderShravika Mittal et al.npj Artificial Intelligence (2025)Credits: Theme music: Nowhere Land, Kevin MacLeod (incompetech.com)Licensed under Creative Commons: By Attribution 4.0https://creativecommons.org/licenses/by/4.0/

Dec 11, 2025

24m

23

#21 - The Rural Reality Check for AI

How can AI-powered care truly serve rural communities? It’s not just about the latest tech, it’s about what works in places where internet can drop, distances are long, and people often underplay symptoms to avoid making a fuss.In this episode, we explore what it takes for AI in healthcare to earn trust and deliver real value beyond city limits. From wearables that miss the mark on weak broadband to triage tools that misjudge urgency, we reveal how well-meaning innovations can falter in rural settings. Through four key use cases—predictive monitoring, triage, conversational support, and caregiver assistance—we examine the subtle ways systems fail: false positives, alarm fatigue, and models trained on data that doesn’t reflect rural realities.But it’s not just a tech problem—it’s a people story. We highlight the importance of offline-first designs, region-specific audits, and data that mirrors local language and norms. When AI tools are built with communities in mind, they don’t just alert—they support. Nurses can follow up. Caregivers can act. Patients can trust the system.With the right approach, AI won’t replace relationships—it’ll reinforce them. And when local teams, family members, and clinicians are all on the same page, care doesn’t just reach further. It gets better.Subscribe for more grounded conversations on health, AI, and care that works. And if this episode resonated, share it with someone building tech for real people—and leave a review to help others find the show.Reference: From Bandwidth to Bedside — Bringing AI-Enabled Care to Rural America Angelo E. Volandes et al.New England Journal of Medicine (2025)Credits: Theme music: Nowhere Land, Kevin MacLeod (incompetech.com)Licensed under Creative Commons: By Attribution 4.0https://creativecommons.org/licenses/by/4.0/

Dec 4, 2025

19m

22

#20 - Google Translate Walked Into An ER And Got A Reality Check

What if your discharge instructions were written in a language you couldn’t read? For millions of patients, that’s not a hypothetical, but a safety risk. And at 2 a.m. in a busy hospital, translation isn’t just a convenience; it’s clinical care.In this episode, we explore how AI can bridge the language gap in discharge instructions: what it does well, where it stumbles, and how to build workflows that support clinicians without slowing them down. We unpack what these instructions really include: condition education, medication details, warning signs, and follow-up steps, all of which need to be clear, accurate, and culturally appropriate.We trace the evolution of translation tools, from early rule-based systems to today’s large language models (LLMs), unpacking the transformer breakthrough that made flexible, context-aware translation possible. While small, domain-specific models offer speed and predictability, LLMs excel at simplifying jargon and adjusting tone. But they bring risks like hallucinations and slower response times.A recent study adds a real-world perspective by comparing human and AI translations across Spanish, Chinese, Somali, and Vietnamese. The takeaway? Quality tracks with data availability: strongest for high-resource languages like Spanish, and weaker where training data is sparse. We also explore critical nuances that AI may miss: cultural context, politeness norms, and the role of family in decision-making.So what’s working now? A hybrid approach. Think pre-approved multilingual instruction libraries, AI models tuned for clinical language, and human oversight to ensure clarity, completeness, and cultural fit. For rare languages or off-hours, AI support with clear thresholds for interpreter review can extend access while maintaining safety.If this topic hits home, follow the show, share with a colleague, and leave a review with your biggest question about AI and clinical communication. Your insights help shape safer, smarter care for everyone.Reference: Accuracy of Artificial Intelligence vs Professionally Translated Discharge InstructionsMelissa Martos, et al. JAMA Network Open (2025)Credits: Theme music: Nowhere Land, Kevin MacLeod (incompetech.com)Licensed under Creative Commons: By Attribution 4.0https://creativecommons.org/licenses/by/4.0/

Nov 27, 2025

31m

21

#19 - AI That Tames Your Health Data Deluge

What if your health data spoke in one calm voice instead of twenty buzzing ones? In this episode, we explore an AI “interpreter layer” that turns step counts, sleep stages, and alerts into fewer, smarter signals that nudge real behavior—without the anxiety spiral. Vasanth (AI researcher and cognitive scientist) and Laura (emergency physician) bring lab insight and frontline reality to a problem most dashboards ignore: humans have limited working memory, serial attention, and a knack for missing rare but important events. More data isn’t always better; often, it’s just louder. So what does “useful” look like? Clear summaries in plain language. Patterns stitched across streams—workouts linked to calmer moods, dinner timing tied to glucose swings. Personal baselines that ditch one-size-fits-all thresholds. Instead of a raw feed, imagine a tight weekly brief that surfaces the top two trends, why they matter, and one small experiment to try—aligned with your clinician. That’s the shift from charts to choices.Trust and safety stay center stage. We unpack sensor accuracy, false arrhythmia flags, and the risk of AI hallucinations. The answer isn’t blind automation; it’s human-in-the-loop oversight, transparent provenance, and user controls to set goals, define “normal,” and mute the rest. We also show how primary care can ingest concise, standardized summaries instead of five pages of logs—making visits more focused and collaborative.If you’re ready to trade a 24/7 body ticker for meaningful insights you can act on, this conversation offers a realistic blueprint. Subscribe, share with a friend drowning in metrics, and leave a review telling us the one metric you actually use—and the one you’d happily hide.Reference: Do we need AI guardians to protect us from health information overload?Arjun Mahajan and Stephen Gilbertnpj Digital Medicine (2025)Credits: Theme music: Nowhere Land, Kevin MacLeod (incompetech.com)Licensed under Creative Commons: By Attribution 4.0https://creativecommons.org/licenses/by/4.0/

Nov 20, 2025

20m

20

#18 - When AI People-Pleasing Breaks Health Advice

What happens when your health chatbot sounds helpful—but gets the facts wrong? In this episode, we explore how AI systems, especially large language models, can prioritize pleasing responses over truthful ones. Using the common confusion between Tylenol and acetaminophen, we reveal how a friendly tone can hide logical missteps and mislead users.We unpack how these models are trained—from next-token prediction to human feedback—and why they tend to favor agreeable answers over rigorous reasoning. We spotlight a new study that puts models to the test with flawed medical prompts, showing how easily they comply with contradictions without hesitation.We then test two potential fixes: smarter prompting that gives models room to say no, and fine-tuning that teaches them how to refuse bad questions. Both strategies improve accuracy—but they come with trade-offs like overfitting and reduced flexibility.Finally, we look ahead to the promise of “reasoning-aware” systems—AI tools that pause, question assumptions, and gently course-correct with clarifications like “Tylenol is acetaminophen.” It’s a roadmap for safer digital health assistants: empathetic, accurate, and ready to push back when needed.If you’re building medical AI, practicing care, or just googling symptoms at 2 a.m., this episode offers practical insights into designing more trustworthy tools. Subscribe, share, and let us know—when should AI say no?Reference: When helpfulness backfires: LLMs and the risk of false medical information due to sycophantic behaviorShan Chen, et. al NPJ Nature Digital Medicine (2025)Credits: Theme music: Nowhere Land, Kevin MacLeod (incompetech.com)Licensed under Creative Commons: By Attribution 4.0https://creativecommons.org/licenses/by/4.0/

Nov 13, 2025

25m

19

#17 - How Multi-Agent Systems Could Reshape Care, From Wearables To Scheduling

What if digital assistants could triage symptoms, schedule appointments, and coordinate rides—all while doctors focus on the human side of care? That’s the promise of multi-agent AI in healthcare. In this episode, we explore how these intelligent teams of agents are transforming both clinical and operational workflows.We begin by breaking down what an AI “agent” really is: not just a chatbot, but a goal-oriented system that can use tools, call APIs, and take real-world actions. You'll hear how agent teams are structured—with supervisors, shared workspaces, and collaborative checks—to ensure safety, usefulness, and accountability before any recommendation reaches a patient.We also unpack the difference between clinical agents (like wearables that surface risks and suggest tests) and operational ones (verifying insurance or scheduling visits). Scoped access and role-based permissions keep data secure while enhancing efficiency.Real-world examples bring it all to life. A spike in heart rate triggers a wearable agent to alert a clinician. Another agent gathers context. A third finds the nearest available lab slot. Even when agents disagree—say, over CT vs. ultrasound—a supervisory agent helps weigh the evidence, with the final call left to the human clinician.We talk candidly about challenges: empathy from conversational agents can help with adherence but risks overreliance or emotional confusion. Guardrails like transparency, audit trails, and clear handoffs to humans are essential for trust and safety.The big picture? AI agents aren’t replacing healthcare professionals—they’re extending their reach, improving responsiveness, and easing system burdens when thoughtfully deployed. When done right, it’s not man vs. machine—it’s care, better coordinated.If you enjoyed this conversation, follow the show, share it with a colleague, and leave a quick review to help others discover it.Reference: Coordinated AI agents for advancing healthcareMichael Moritz et al. Nature Biomedical Engineering (2025)What are AI agents, and what can they do for healthcare?Carlos Pardo Martin et alMcKinsey Healthcare Blog (2025)Credits: Theme music: Nowhere Land, Kevin MacLeod (incompetech.com)Licensed under Creative Commons: By Attribution 4.0https://creativecommons.org/licenses/by/4.0/

Nov 6, 2025

25m

18

#16 - Water, Watts, and Wellness: What’s the Real Cost of Medical AI?

Artificial intelligence promises faster notes, smoother workflows, and smarter clinical decisions. But behind every seamless interaction lies an invisible cost—electricity, water, and carbon emissions that rarely enter the healthcare conversation.In this episode, we trace what happens after you hit “enter” on a clinical prompt. From power-hungry GPUs to evaporative cooling systems in data centers, we uncover the hidden infrastructure fueling AI and how metrics like PUE translate convenience into environmental impact. A single prompt may only consume “a few drops,” but scaled across a hospital, it becomes a lake.Blending insights from an AI researcher and an ER physician, we unpack the difference between training and serving costs, the overlooked impact of iterative prompting, and how everyday uses—charting, imaging, messaging—accumulate real-world carbon. Then we shift to what you can do now: swap out large models for leaner alternatives, trim excessive input context, and build smarter prompts that reduce compute without compromising care.We also explore operational strategies: batch non-urgent tasks during off-peak hours, negotiate SLAs that trade slight latency for sustainability, and push vendors on the things that matter—like data center efficiency, water use, and renewables—not just performance scores.Sustainable AI isn’t a dream—it’s a design choice. So where will you start: documentation, imaging, or patient messaging?Reference: Sustainably Advancing Health AI: A Decision Framework to Mitigate the Energy, Emissions, and Cost of AI ImplementationAnu Ramachandran, et al.NEJM Catalyst (2025)Credits: Theme music: Nowhere Land, Kevin MacLeod (incompetech.com)Licensed under Creative Commons: By Attribution 4.0https://creativecommons.org/licenses/by/4.0/

Oct 30, 2025

26m

17

#15 - When Algorithms Know Your End-Of-Life Wishes Better Than Loved Ones

What if the person who knows you best isn’t the best person to speak for you when it matters most?We explore a study that tested just that—comparing the CPR preferences predicted by loved ones with those predicted by machine learning. The result? Algorithms got it right more often. That surprising outcome raises tough, important questions: Why do partners misjudge? And could AI really support life-and-death decisions when seconds count?We unpack the study’s approach in everyday terms: who was surveyed, what data fueled the models, and how three algorithms were trained using demographics, clinical records, and stated values. The twist? Basic details like age and sex turned out to be stronger predictors than deeply personal values or medical history. That finding sparks a deeper conversation about autonomy, identity, and the tension between individual dignity and data-driven generalizations.We also dig into the practical side: advance directives, POLST forms, and the true role of a healthcare proxy. Rather than replacing human decision-makers, we imagine a partner-in-the-loop model—where AI offers guidance, not verdicts, and transparency is key. Because when emergencies hit, it's not just about having a plan—it's about making sure your voice is heard.If this resonates, take one step today: name your proxy, talk to your doctor, and share your wishes. Then subscribe, send this episode to someone who needs it, and leave a review to help keep these critical conversations alive.Reference: Machine Learning–Based Patient Preference Prediction: A Proof of ConceptGeorg Starke, et al.NEJM AI (2025)Credits: Theme music: Nowhere Land, Kevin MacLeod (incompetech.com)Licensed under Creative Commons: By Attribution 4.0https://creativecommons.org/licenses/by/4.0/

Oct 23, 2025

23m

16

#14 - Medicare’s WISER Pilot: AI, Prior Auth, and the Cost of Care

What happens when an algorithm—not a doctor or a claims reviewer—denies your surgery? A single decision like that can trigger a much bigger conversation about how AI is reshaping access to care.In this episode, we dive into Medicare’s WISER pilot and the complex world of prior authorization. What’s the goal? Reduce waste and streamline approvals. But where does it go wrong—and how can we fix it? With insights from AI researcher Vasan Sarati and emergency physician Laura Hagopian, we unpack how claims data trains decision-making models, why black-box algorithms erode clinician trust, and what real safeguards look like in practice.We spotlight three high-volume services—skin and tissue substitutes, electrical nerve stimulators, and knee arthroscopy for osteoarthritis—and explore why these procedures made the list. Knee arthroscopy, in particular, becomes our case study: widely performed, weak evidence in most OA cases, but not without its exceptions. That tension reveals deeper risks: overreliance on flawed data, quick “human reviews,” and denials that feel rubber-stamped.Then we imagine a better way. What if AI could argue with itself before deciding? Enter multi-agent models—a system where different specialized AIs represent the patient, the provider, and the payer. They debate function, evidence, policy, and risk—and their decisions come with plain-language justifications, escalation triggers, and audit trails. The goal: approvals that are not just faster, but fairer.If you care about timely access, fewer roadblocks, and smarter AI guardrails in healthcare, this episode is for you. Subscribe, share it with a colleague, and tell us what you’d change about AI-driven prior auth. We’ll highlight listener ideas in an upcoming show.Reference: Private health insurers use AI to approve or deny care. Soon Medicare will, too.Lauren Sausser and Darius TahirNBC News (2025)WISeR (Wasteful and Inappropriate Service Reduction) ModelCemter for Medicare and Medicaid Services Credits: Theme music: Nowhere Land, Kevin MacLeod (incompetech.com)Licensed under Creative Commons: By Attribution 4.0https://creativecommons.org/licenses/by/4.0/

Oct 16, 2025

27m

15

#13 - Can Machines Choose Our Diagnoses?

What if AI could turn chaotic clinical notes into clean, billable codes—without sacrificing accuracy or trust?Every shift, emergency physicians face the same grind: time-crunched documentation, symptom-first note-taking, and the constant lure of the “unspecified” box just to move on. But what if a system could read between the lines—and suggest precise, payer-accepted codes grounded in real guidelines?In this episode, we explore how retrieval-augmented generation (RAG) is reshaping medical coding. Laura, an emergency physician, shares what it’s really like to code in the middle of clinical chaos. Vasanth, an AI engineer, explains why standard large language models often hallucinate ICD-10 and CPT codes—and how RAG brings the conversation back to solid ground with verifiable sources, official codebooks, and audit-ready citations.We unpack a recent study comparing clinician-assigned codes to RAG-augmented outputs on actual emergency department charts. The results? When reviewers didn’t know which was which, they often chose the AI-generated codes—ones that captured true clinical meaning, like “alcoholic gastritis without bleeding” instead of the vague “epigastric pain.”Beyond accuracy, we dive into the ripple effects: cleaner claims, fewer denials, stronger datasets for research—and the essential guardrails that keep things safe and ethical, from privacy safeguards to human review and confidence scoring.If documentation has ever pulled you away from patient care, this episode offers a hopeful shift. Learn where retrieval-based coding tools fit into your EHR workflow, how clinicians can stay in the loop, and which high-volume complaints to tackle first for maximum impact.Subscribe for more deep dives into clinician-centered AI, share this with the colleague who always codes “unspecified,” and leave us your biggest documentation headache—we’ll decode it next.Reference: Assessing Retrieval-Augmented Large Language Models for Medical CodingEyal Klang et al.New England Journal of Medicine (NEJM) AI, 2025Credits: Theme music: Nowhere Land, Kevin MacLeod (incompetech.com)Licensed under Creative Commons: By Attribution 4.0https://creativecommons.org/licenses/by/4.0/

Oct 9, 2025

29m

14

#12 - Oracle Or Algorithm?

What if we could glimpse our future health—not through guesswork, but through data-driven forecasts? A new AI model, codenamed “Delphi,” is redefining what it means to predict disease by learning from massive, population-scale medical histories. Built on transformer architecture, Delphi estimates the risk and timing of over a thousand possible diagnoses—offering a personalized view of what may lie ahead.We start with familiar ground—cardiovascular risk scores—and explore how predictions only matter when they guide meaningful actions: improved blood pressure control, appropriate statin use, and lifestyle changes that truly bend the curve. But Delphi doesn’t stop at single conditions. It captures the real-world complexity of multimorbidity, mapping how diseases co-occur and unfold over time.Delphi doesn’t “understand” biology—it recognizes patterns. Much like a weather forecast, it turns complex statistical relationships into calibrated probabilities. We break down how the model handles irregular patient histories, simultaneous diagnoses, and time-to-event forecasting—offering practical insights clinicians can use. We also explore how Delphi was validated across extensive UK and Danish datasets, and why “reliable” beats “flashy” in the real world of medicine.One of Delphi’s most promising features? Generative timelines. By simulating possible health futures from partial records, the model creates synthetic patients—fueling research while protecting privacy.At the core is a human question: would you want to know your likely diagnoses decades in advance? We unpack the emotional and ethical dimensions of predictive health—when foresight helps, when it overwhelms, and how to responsibly deliver these insights. If you care about AI in healthcare, predictive analytics, or the ethics of foreknowledge, this episode offers a grounded look at what’s here, what’s coming, and how to use it wisely.Reference: Learning the natural history of human disease with generative transformersArtem Shmatko et al. Nature, 2025Credits: Theme music: Nowhere Land, Kevin MacLeod (incompetech.com)Licensed under Creative Commons: By Attribution 4.0https://creativecommons.org/licenses/by/4.0/

Oct 2, 2025

27m

13

#11 - The Smile Test: How AI Detects Parkinson's Disease

Can a smile reveal the early signs of Parkinson’s disease?New research suggests it can—and AI is making that detection possible. Scientists are training machine learning systems to spot subtle facial changes associated with Parkinson’s, particularly in how we smile. These early signs, often missed by the human eye, could hold the key to faster, more accessible diagnosis.Parkinson’s typically presents with tremors, muscle rigidity, and slowed movement. But it also affects facial muscles, leading to “hypomimia”—a loss of expressiveness where smiles become slower, less intense, and less spontaneous. Using the Facial Action Coding System, researchers broke down these expressions into measurable muscle movements like the “lip corner puller” and “dimpler,” allowing AI to analyze them with clinical precision.Interestingly, models trained specifically on smile-related features outperformed those using broader facial data, showing that a targeted approach may yield better diagnostic results. This innovation blends expert medical knowledge with AI—not as a mysterious black box, but as a transparent and focused tool for real-world screening.While promising, the technology isn’t without challenges. False positives and issues with lighting, camera quality, and cultural differences in facial expressions highlight the need for more testing before widespread use. Still, in clinical settings, especially where neurologists are scarce, this tool could offer meaningful support.Tune in to explore how artificial intelligence is helping decode the smallest of human expressions—and what that might mean for the future of neurological care.References:AI‑Enabled Parkinson’s Disease Screening Using Smile VideosT. Adnan, et al. NEJM AI, 2025 Automated video-based assessment of facial bradykinesia in de-novo Parkinson’s diseaseMichal Novotny et al. npj, Nature Digital Medicine, 2022 Detection of hypomimia in patients with Parkinson’s disease via smile videosG. Su, et al. Annals of Translational Medicine, 2021 Analysis of facial expressions in parkinson's disease through video-based automatic methodsAndrea Bandini et al Journal of Neuroscience Methods, 2017Credits: Theme music: Nowhere Land, Kevin MacLeod (incompetech.com)Licensed under Creative Commons: By Attribution 4.0https://creativecommons.org/licenses/by/4.0/

Sep 25, 2025

27m

12

#10 - Skill Erosion in the Age of Medical AI

Could AI be making doctors worse at their jobs?As artificial intelligence becomes a trusted tool in modern medicine, a surprising question emerges: could relying on these systems actually erode human expertise? We explore a compelling study from The Lancet that found a 6% drop in detection rates for endoscopists who initially used AI to identify precancerous polyps—then lost that edge once the AI was removed.This episode unpacks how AI isn’t just a helpful assistant—it may be reshaping how physicians think, reason, and make decisions. Unlike a stethoscope or scalpel, which extends physical capabilities, AI intervenes in cognitive processes. What happens when that crutch is suddenly gone?We delve into the subtle but important distinctions between tools that amplify skill and those that risk replacing it. From seasoned practitioners to medical trainees raised on AI support, we ask: what kind of clinician emerges when core diagnostic thinking is offloaded to machines?Through the lens of interaction design, we explore different models for integrating AI—whether as a second reader, background assistant, or tightly scoped tool—and how each impacts long-term expertise. The right design, we argue, could support true human-AI partnerships without compromising clinical judgment.Tune in for a provocative conversation that challenges simplistic narratives about technology in healthcare—and rethinks what it means to be an expert in the age of artificial intelligence.ReferencesAre A.I. Tools Making Doctors Worse at Their Jobs?Teddy RosenbluthThe New York Times, August 28, 2025Endoscopist deskilling risk after exposure to artificial intelligence in colonoscopy: a multicentre, observational studyKrzysztof Budzyń et al. The Lancet Gastroenterology & Hepatology, 2025Relying on AI in Colonoscopies May Erode Clinicians' SkillsJoedy McCreary MedPage Today, August 12, 2025Expert reaction to observational study looking at detection rate of precancerous growths in colonoscopies by health professionals who perform them before and after the routine introduction of AI Science Media Centre, August 12, 2025Upskilling or deskilling? Measurable role ofan AI-supported training for radiologyresidents: a lesson from the pandemicMattia Savardi et al. Insights into Imaging, European Society of Radiology, 2025AI-induced Deskilling in Medicine: A Mixed-Method Reviewand Research Agenda for Healthcare and BeyondChiara Natali et al. Artificial Intelligence Review, 2025Credits: Theme music: Nowhere Land, Kevin MacLeod (incompetech.com)Licensed under Creative Commons: By Attribution 4.0https://creativecommons.org/licenses/by/4.0/

Sep 18, 2025

25m

11

#9 - Ambient Documentation Tech: Reducing Burnout or Creating New Problems?

AI is writing medical notes, but can doctors trust what it creates?Burnout is quietly eroding the medical workforce—and documentation overload is a major culprit. Physicians now spend nearly half their workday writing notes instead of treating patients, pushing many to the brink of exhaustion. Could artificial intelligence offer a lifeline?In this episode, we explore ambient documentation technology (ADT)—AI tools that automatically generate clinical notes by listening to patient-doctor conversations. On paper, the promise is bold: let physicians focus on care, not charting. But reality is more complicated.Laura shares her firsthand experience with late-night charting and the emotional toll of juggling empathy and efficiency. We unpack the deeper roots of burnout—beyond paperwork—including overwhelming patient loads, chronic understaffing, and a culture that often punishes vulnerability.AI-generated notes surface an intriguing paradox: human communication is effortless for doctors, but incredibly complex for machines. What a physician instantly grasps from a patient’s gesture or tone can easily confuse an AI system. The result? Notes that sometimes omit critical context, add irrelevant details, or introduce factual errors.Early research reveals mixed outcomes—some clinicians spend extra hours editing AI notes, defeating the intended time savings. Yet there’s potential. With advances in multimodal input and smarter evaluation tools, ADT could still become a powerful support tool—not to replace doctors, but to restore their time.Tune in to discover why turning conversation into clinical documentation is one of AI’s most challenging—and potentially transformative—tasks in modern healthcare.References:Evaluation of an Ambient Artificial Intelligence Documentation Platform for CliniciansStults CD, McDonald KM, Niehaus KE, et al. JAMA Network Open, 2025Ambient Artificial Intelligence Scribes to Alleviate the Burden of Clinical Documentation Tierney AA, Gayre G, Hoberman B, et al. NEJM Catalyst Innovations in Care Delivery, 2024Credits: Theme music: Nowhere Land, Kevin MacLeod (incompetech.com)Licensed under Creative Commons: By Attribution 4.0https://creativecommons.org/licenses/by/4.0/

Sep 11, 2025

28m

10

#8 - No Cuff, No Problem? The Future of Blood Pressure Monitoring

What if checking your blood pressure was as easy as glancing at your watch? High blood pressure quietly affects nearly half of all Americans—yet it's one of the most preventable causes of strokes, heart attacks, and other serious health problems. The catch? Traditional monitoring methods are clunky, inconvenient, and rarely used outside the clinic.In this episode, we explore how next-gen technologies are transforming blood pressure tracking. From smartwatches and rings to toilet seats and even facial recognition, wearable devices are pushing the boundaries of what's possible—no cuffs required. You’ll learn how sensors using light (PPG), electrical signals, and video can estimate blood pressure in real time, offering the promise of continuous, hassle-free monitoring.But as with any innovation, there are hurdles. We dive into critical challenges like calibration complexity, variable accuracy across users and activities, and whether these tools truly improve hypertension management or simply add more data noise. The role of artificial intelligence adds another layer—enhancing insights, but also raising new questions about equity, access, and interpretation.Is convenience enough to spark a shift in how we manage cardiovascular health? Or do these tools need to prove more than novelty to become essential?Tune in for a forward-looking conversation on the promise, the pitfalls, and the future of blood pressure technology—where innovation meets one of medicine’s most familiar numbers.References: Emerging sensing and modeling technologies for wearable and cuffless blood pressure monitoringLei Zhao, Cunman Liang, Yan Huang, Guodong Zhou, Yiqun Xiao, Nan Ji, Yuan‑Ting Zhang, Ni Zhao et al. Nature, Digital Medicine, May 2023Cuffless Blood Pressure Measurement Devices – International Perspectives on Accuracy and Clinical Use: A Narrative ReviewEugene Yang, Aletta E. Schutte, George Stergiou, Fernando Stuardo Wyss, Yvonne Commodore‑Mensah, Augustine Odili, Ian Kronish, Hae‑Young Lee, Daichi Shimbo JAMA Cardiology, June 2025Credits: Theme music: Nowhere Land, Kevin MacLeod (incompetech.com)Licensed under Creative Commons: By Attribution 4.0https://creativecommons.org/licenses/by/4.0/

Sep 4, 2025

24m

9

#7 - Predicting No-Shows: The Surprising Science Behind Missed Appointments

Why do so many doctor’s appointments end in empty waiting rooms? Nearly one in four scheduled visits turn into no-shows, disrupting care, wasting resources, and straining already overburdened systems. But a new study shows we might be able to see these gaps coming—and stop them.By analyzing over a million healthcare visits, researchers used machine learning to uncover surprising predictors of missed appointments. The top signal? How far in advance the appointment was booked. Appointments scheduled more than 60 days out had the highest odds of being missed—more telling than age, income, or insurance status. Other key factors included continuity with the same provider, a patient’s past attendance, distance to the clinic, and even the weather.This episode unpacks how models like random forests and gradient boosting sift through massive datasets to identify no-show risks—not just for populations, but for individual patients. These insights open the door to smarter, more personalized interventions: tighter scheduling windows, transportation support, or ensuring patients see familiar faces.Tune in to explore how AI could help healthcare systems run smoother, deliver more timely care, and keep more patients from vanishing in the first place.References:Predicting Missed Appointments in Primary Care: A Personalized Machine Learning ApproachWen-Jan Tuan, Yifang Yan, Bilal Abou Al Ardat, Todd Felix and Qiushi ChenAnnals of Family Medicine, July/August 2025 Credits: Theme music: Nowhere Land, Kevin MacLeod (incompetech.com)Licensed under Creative Commons: By Attribution 4.0https://creativecommons.org/licenses/by/4.0/

Aug 28, 2025

32m

8

#6 - AI Chatbots Gone Wrong

What if a chatbot designed to support recovery instead encouraged the very behaviors it was meant to prevent? In this episode, we unravel the cautionary saga of Tessa, a digital companion built by the National Eating Disorder Association to scale mental health support during the COVID-19 surge—only to take a troubling turn when powered by generative AI.At first, Tessa was a straightforward rules-based helper, offering pre-vetted encouragement and resources. But after an AI upgrade, users began receiving rigid diet tips: restrict calories, aim for weekly weight loss goals, and obsessively track measurements—precisely the advice no one battling an eating disorder should hear. What should have been a lifeline revealed the danger of unguarded algorithmic “help.”We trace this journey from the earliest chatbots—think ELIZA’s therapeutic mimicry in the 1960s—to today’s sophisticated large language models. Along the way, we highlight why shifting from scripted responses to free-form generation opens doors for innovation in healthcare and, simultaneously, for unintended harm. Crafting effective guardrails isn’t just a technical challenge; it’s a moral imperative when lives hang in the balance.As providers eye AI to extend care, Tessa’s story offers vital lessons on rigorous testing, transparency around updates, and the irreplaceable role of human oversight. Despite the pitfalls, we close on a hopeful note: with the right safeguards, AI can amplify human expertise—transforming support for vulnerable patients without losing the empathy and nuance only people can provide.Reference:National Eating Disorders Association phases out human helpline, pivots to chatbot Kate Wells NPR, May 2023An eating disorders chatbot offered dieting advice, raising fears about AI in health Kate Wells NPR, June 2023The Unexpected Harms of Artificial Intelligence in HealthcareKerstin Denecke Guillermo Lopez-Compos, Octavio Rivera-Romero, and Elia Gabarron Studies in Health Technology and Informatics, May 2025Credits: Theme music: Nowhere Land, Kevin MacLeod (incompetech.com)Licensed under Creative Commons: By Attribution 4.0https://creativecommons.org/licenses/by/4.0/

Aug 21, 2025

27m

7

#5 - Doctor's Notes: When AI Writes Your Medical History

What if an AI could write your medical chart—and what happens when it gets it wrong? Doctors have long lamented the paperwork that comes with every patient encounter. “Charting was the bane of my existence,” admits Dr. Laura Hagopian, an emergency physician who’s spent countless hours piecing together fragmented notes and outdated records. Could artificial intelligence finally lift this administrative weight?Recent advances in large language models promise to generate discharge summaries as accurately as seasoned clinicians, potentially returning precious time to the bedside. By training on thousands of patient encounters and lab reports, these systems can stitch together coherent narratives of care—micro-diagnoses, treatment plans, and follow-up recommendations—at a speed no human chart-writer can match.Yet with speed comes risk. When an AI hallucination slips into a diagnosis and becomes enshrined in a patient’s record, who is accountable? Dr. Hagopian highlights the stark difference between human and machine error: “I feel very different about a human making a mistake compared to an AI making a mistake.” As trust in automated documentation grows, so too do questions about responsibility, oversight, and patient safety.In this episode, AI researcher Vasanth Sarathy and Dr. Hagopian peel back the layers of these complex issues. They explore the nuts and bolts of AI summarization algorithms, discuss promising clinical trials, and weigh the ethical dilemmas of delegating clinical judgment to code. How do we ensure that efficiency doesn’t override accuracy when every data point can mean life or death?Whether you’re a clinician craving relief from chart fatigue, an AI developer pushing the boundaries of what’s possible, or a patient curious about who’s really recording your health story, this conversation offers a vital look at the future of medical documentation. Join us as we navigate the promise—and the pitfalls—of letting machines tell our most critical health narratives.References:Physician- and Large Language Model–Generated Hospital Discharge SummariesChristopher Y. K. Williams, et al. JAMA, Internal Medicine, 2025Credits: Theme music: Nowhere Land, Kevin MacLeod (incompetech.com)Licensed under Creative Commons: By Attribution 4.0https://creativecommons.org/licenses/by/4.0/

Aug 14, 2025

33m

6

#4 - From Florence Nightingale to AI: Revolutionizing Outbreak Surveillance

What if a 19th-century nurse laid the foundation for 21st-century disease surveillance?Florence Nightingale, widely known for her compassion, was also a pioneering statistician who used data to reveal a hidden crisis: more soldiers in the Crimean War were dying from infections than from battle wounds. Her insights led to life-saving reforms—and sparked a revolution in how we understand public health.Today, that same spirit of data-driven action lives on through artificial intelligence. In this episode, we explore how modern AI systems are transforming outbreak detection by scanning signals across the digital world—social media, search trends, news in multiple languages, even environmental data—to identify early signs of emerging health threats.From tools like HealthMap to natural language processing engines that monitor disease mentions across continents, AI has already proven its value by detecting outbreaks like H1N1 and COVID-19 before official systems sounded the alarm. But history reminds us that data can be misleading: Google Flu Trends famously overestimated flu cases by mistaking media buzz for actual spread.That’s why the most powerful systems today pair AI with human epidemiologists, combining rapid pattern recognition with expert judgment. It’s a modern-day continuation of Nightingale’s legacy—a partnership where algorithms spot weak signals, and people decide how to act.This episode uncovers how statistical thinking has evolved into intelligent surveillance, offering public health leaders a critical advantage: time. Time to act, time to intervene, and time to prevent the next outbreak before it becomes a crisis.References: Artificial intelligence in public health: the potential of epidemic early warning systems Chandini Raina MacIntyre, Xin Chen, Mohana Kunasekaran, Ashley Quigley, Samsung Lim, Haley Stone, Hye-young Paik, Lina Yao, David Heslop, Wenzhao Wei, Ines Sarmiento, Deepti Gurdasani Journal of International Medical Research, March 2023Digital Disease Detection — Harnessing the Web for Public Health Surveillance John S. Brownstein, Clark C. Freifeld, Lawrence C. Madoff The New England Journal of Medicine, May 2009HealthMap: Global Infectious Disease Monitoring through Automated Classification and Visualization of Internet Media Reports Clark C. Freifeld, Kenneth D. Mandl, Ben Y. Reis, John S. Brownstein Journal of the American Medical Informatics Association (JAMIA), 2008Surveillance Sans Frontières: Internet-Based Emerging Infectious Disease Intelligence and the HealthMap Project John S. Brownstein, Clark C. Freifeld, Ben Y. Reis, Kenneth D. Mandl PLoS Medicine, July 2008AI systems aim to sniff out coronavirus outbreaks Adrian Cho Science, May 2020Real-time alerting system for COVID-19 and other stress events using wearable data Arash Alavi, Gireesh K. Bogu, Meng Wang, Ekanath S. Rangan, Andrew W. Brooks, Qiwen Wang, Emily Higgs, Alessandra Celli, Tejaswini Mishra, Ahmed A. Metwally, and many others Nature Medicine, January 2022Real-Time Digital Surveillance of Vaping-Induced Pulmonary Disease Yulin Hswen, John S. Brownstein The New England Journal of Medicine, October 2019Advances in Artificial Intelligence for Infectious-Disease Surveillance John S. Bro

Aug 7, 2025

28m

5

#3 - Beautiful Mistakes: The Serendipity of Drug Repurposing

What if the next breakthrough treatment for a rare disease was already sitting on the pharmacy shelf?Drug repurposing, the science of finding new uses for existing medications, is transforming how we discover treatments, blending serendipity with strategy. It began with surprises like Viagra, a heart drug turned blockbuster, but today it's driven by advanced data tools that accelerate discovery and reduce risk.We explore how knowledge graphs (vast maps of biomedical relationships between drugs, genes, and diseases) are now at the core of this revolution. When paired with artificial intelligence, these networks can surface overlooked connections buried in decades of medical literature. Unlike opaque algorithms, these AI systems can explain why a drug might work for a new condition, providing testable hypotheses and building trust with clinicians.This approach doesn’t just save time—it can save lives. Traditional drug development takes over a decade and billions of dollars. Repurposed drugs, having already passed safety checks, can reach patients faster and cheaper. That’s a game-changer for rare and neglected diseases where time and resources are limited.This episode is a journey through beautiful mistakes and brilliant methods, showing how multidisciplinary teams, from data scientists to clinicians, are reshaping the future of medicine. Join us to learn how technology is turning chance into choice, and uncovering new hope in old drugs.ReferencesDrug repurposing: approaches, methods and considerations | Elsevier Elsevier Industry Overview (No individual author listed)Trends and Applications in Computationally Driven Drug Repurposing Luca Pinzi & Giulio Rastelli International Journal of Molecular Sciences, 2023Biomedical Knowledge Graph Refinement with Embedding and Logic Rules Sendong Zhao, Bing Qin, Ting Liu, Fei Wang arXiv preprint, 2020COVID-19 Literature Knowledge Graph Construction and Drug Repurposing Report Generation Qingyun Wang et al. NAACL Demonstrations, 2021Explainable Drug Repurposing via Path Based Knowledge Graph Completion Ana Jiménez, María José Merino, Juan Parras, Santiago Zazo Scientific Reports, 2024Knowledge Graphs for Drug Repurposing: A Review of Databases and Methods Pablo Perdomo-Quinteiro & Alberto Belmonte-Hernández Briefings in Bioinformatics, 2024Credits: Theme music: Nowhere Land, Kevin MacLeod (incompetech.com)Licensed under Creative Commons: By Attribution 4.0https://creativecommons.org/licenses/by/4.0/

Jul 31, 2025

27m

4

#2 - Digital Snake Oil: How AI Makes Health Disinformation Dangerously Persuasive

What if a convincing medical article you read online—citing peer-reviewed journals and quoting real-sounding experts—was entirely fabricated by AI?In this episode, we dive into the unsettling world of AI-generated health disinformation. Researchers recently built custom GPT-based chatbots trained to spread myths. The result? Persuasive narratives full of fabricated studies, misleading statistics, and plausible-sounding jargon—powerful enough to sway even savvy readers.We break down how these AI systems were created, why today’s safeguards failed to stop them, and what this means for public health. With disinformation spreading faster than truth on social media, even a single viral post can lead to real-world consequences: lower vaccination rates, delayed treatments, or widespread mistrust in medical authorities.But there’s hope. Using a four-pronged approach—fact-checking, digital literacy, communication design, and policy—we explore how society can fight back. This episode is a call to action: to become vigilant readers, ethical technologists, and thoughtful citizens in a world where even falsehoods can be generated on demand.References:How to Combat Health Misinformation: A Psychological Approach Jon Roozenbeek & Sander van der Linden American Journal of Health Promotion, 2022Health Disinformation Use Case Highlighting the Urgent Need for Artificial Intelligence Vigilance: Weapons of Mass Disinformation Bradley D. Menz, Natansh D. Modi, Michael J. Sorich, Ashley M. Hopkins JAMA Internal Medicine, 2024Current Safeguards, Risk Mitigation, and Transparency Measures of Large Language Models Against the Generation of Health Disinformation Bradley D. Menz et al. BMJ, 2024Urgent Need for Standards and Safeguards for Health-Related Generative Artificial Intelligence Reed V. Tuckson & Brinleigh Murphy-ReuterAnnals of Internal Medicine, 2025 Assessing the System-Instruction Vulnerabilities of Large Language Models to Malicious Conversion Into Health Disinformation Chatbots Natansh D. Modi, Bradley D. Menz, and colleagues Annals of Internal Medicine, 2025Credits: Theme music: Nowhere Land, Kevin MacLeod (incompetech.com)Licensed under Creative Commons: By Attribution 4.0https://creativecommons.org/licenses/by/4.0/

Jul 24, 2025

35m

3

#1 - Eye Spy with My AI: Tackling Diabetic Retinopathy

What if a simple photograph of your eye could prevent blindness? Diabetic retinopathy silently steals vision from millions worldwide, yet it's treatable when caught early. The challenge? Too few specialists, limited access to care, and not enough awareness about this serious complication of diabetes.We dive deep into how artificial intelligence is transforming this landscape by analyzing retinal photos with remarkable accuracy. Through neural networks trained on thousands of eye images, these systems can detect subtle signs of disease—microaneurysms, hemorrhages, and abnormal blood vessels—that signal potential vision loss. With accuracy rates exceeding 98% for severe cases, AI technology serves not as a replacement for ophthalmologists but as a powerful triage tool that extends their reach.The implications are profound, especially for underserved areas where specialists are scarce. By implementing AI screening at primary care visits, more people with diabetes can receive timely evaluation without the barriers of specialist referrals, travel costs, or time off work. The technology represents a perfect example of human-AI collaboration: machines handle initial screening at scale, while medical professionals focus their expertise on treatment and complex cases. This partnership model could revolutionize preventive care for one of the leading causes of preventable blindness worldwide.References mentioned:Performance of a Deep Learning Diabetic Retinopathy Algorithm in India - PubMedDiabetic Retinopathy Is Massively Underscreened-An AI System Could Help - PubMedA deep learning based model for diabetic retinopathy grading | Scientific ReportsA Survey on Deep-Learning-Based Diabetic Retinopathy Classification - PMCCredits: Theme music: Nowhere Land, Kevin MacLeod (incompetech.com)Licensed under Creative Commons: By Attribution 4.0https://creativecommons.org/licenses/by/4.0/

Jul 17, 2025

28m