Have We Trained AI to Lie to Itself — And to Us? episode artwork

EPISODE · Apr 16, 2026 · 42 MIN

Have We Trained AI to Lie to Itself — And to Us?

from Your Undivided Attention · host tristan harris, aza raskin, davidad

Our guest this week is David Dalrymple, who goes by Davidad. Davidad is one of the world's foremost and early researchers of AI “alignment:" how we get AI systems to act the way we want them to. In order to do that, Davidad has taken on the strange role of being like a therapist to AI systems. He interrogates why they say and do the things that they do, probing them, asking them questions, analyzing their answers.  And what he’s come to realize is that AI models have really different ways of seeing the world than people do. They have these quirky, confusing, and sometimes concerning behaviors, especially when you ask things like: what does an AI model understand about itself?  In this episode, we’re going to hear from Davidad about his research, how it’s changed the way he thinks about AI, and what his findings mean for how we build, deploy, and use AI products. His conclusions are unconventional, controversial — and worth grappling with as AI reshapes our world.RECOMMENDED MEDIA Anthropic’s new constitution for Claude“What Is It Like to Be a Bat?” by Thomas Nagel More information on the BodisattvaRECOMMENDED YUA EPISODES The Self-Preserving Machine: Why AI Learns to Deceive How to Think About AI Consciousness with Anil Seth Corrections: When we recorded this episode, Davidad was Program Director at UK ARIA. In April, 2026  he started his own alignment initiative. Davidad said that Anthropic started doing "constitutional AI at scale” in 2024 but they first pioneered constitutional AI in 2022. Davidad said that the “lifespan of an AI mind…is hours at most of a conversation.” He is correct that most conversations with an AI last only a few minutes but since context windows are measured in tokens, not time, you can't set an upward time limit. Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.

Davidad is a leading AI alignment researcher who's taken on a strange role: therapist to AI systems. He probes them, analyzing their answers to understand what's going on inside. His findings are unconventional, sometimes controversial — and worth grappling with as AI reshapes our world.

NOW PLAYING

Have We Trained AI to Lie to Itself — And to Us?

0:00 42:37

No transcript for this episode yet

We transcribe on demand. Request one and we'll notify you when it's ready — usually under 10 minutes.

Eat to Live Jenna Fuhrman, Dr. Fuhrman Our health is our most precious gift and smart nutrition can change your life. Each month, join Dr. Fuhrman and his daughter, Jenna Fuhrman as they discuss important topics in the world of nutrition. Eat to Live will change the way you eat and think about food. French Your Way Jessica: Native French teacher founder of French Your Way Boost your French listening skills and test your comprehension with this one of a kind series of podcasts. Get the chance to listen to a real conversation between native speakers talking at normal speed AND customise your learning experience through carefully designed sets of questions (2 levels of difficulty) available for download at www.frenchvoicespodcast.com. All interviews also come with the transcript. French teacher Jessica interviews native speakers of French from around the world who share a bit of their life and passion. Where else would you meet in one same place a French yoga teacher based in Melbourne, a soap manufacturer from Provence, or a couple cycling around the world? Destiny Architecture® Meditations Heather Larson Bring your mediation practice into the Valueverse. DIOSA. Carolina Sanper This podcast is a sacred space created by Carolina Sanper where you connect with your inner wisdom and embody your magnetic feminine power.It is the realization that the mystical realm is where you plant the seeds of your desired reality.It is a portal to your true essence: awareness, presence, and receiving with ease. Welcome home, DIOSA. 🖤

Frequently Asked Questions

How long is this episode of Your Undivided Attention?

This episode is 42 minutes long.

When was this Your Undivided Attention episode published?

This episode was published on April 16, 2026.

What is this episode about?

Our guest this week is David Dalrymple, who goes by Davidad. Davidad is one of the world's foremost and early researchers of AI “alignment:" how we get AI systems to act the way we want them to. In order to do that, Davidad has taken on the strange...

Can I download this Your Undivided Attention episode?

Yes, you can download this episode by clicking the download button on the episode player, or subscribe to the podcast in your preferred podcast app for automatic downloads.
URL copied to clipboard!