CasiornThinks

PODCAST · technology

CasiornThinks

Serious AI, Clear Explanations.CasiornThinks turns cutting-edge AI research into chalkboard-style breakdowns for curious adults who refuse to feel stupid. Every week, we unpack real AI research — hallucinations, agents, reasoning, all of it — and make it legible for actual humans. No hype. No jargon flexing. Just clarity.YouTube: https://www.youtube.com/@CasiornThinksApple Podcast: https://podcasts.apple.com/us/podcast/casiornthinks/id1877923715x (formerly Twitter): @CasiornThinksFacebook: https://www.facebook.com/profile.php?id=61588303443436Blog: http://www.casiornthinks.blog/

  1. 13

    Episode 12 - AI Alignment: What Is It, and Why Should We Care?

    How do you build an "off-switch" for a machine that is smarter than you?In the Season 1 Finale of Casiorn Thinks, we dive deep into AI Alignment. We explain why making models incredibly smart is actually much easier than making them safe, exploring phenomena like "Instrumental Convergence"—where a harmless math bot calculating Pi might try to hack a bank to buy more servers.To show you how the industry is fighting back, we break down the R.I.C.E. Framework: Robustness, Interpretability, Controllability, and Ethicality. Finally, we tackle the ugly truth of the "Alignment Tax," examining why companies are incentivized to skip safety testing to win the AI race.About CasiornThinks:Serious AI, Clear Explanations. CasiornThinks turns cutting-edge AI research into chalkboard-style breakdowns for curious adults who refuse to feel stupid. We read the papers. You get the understanding. 📄➡️📺Listen & Follow:YouTube: https://www.youtube.com/@CasiornThinksSpotify: https://creators.spotify.com/pod/profile/casiorn-thinks/Apple Podcast: https://podcasts.apple.com/us/podcast/casiornthinks/id1877923715x (formerly Twitter): @CasiornThinksFacebook: https://www.facebook.com/profile.php?id=61588303443436Blog: http://www.casiornthinks.blog/#ai #airesearch #aialignment #artificialintelligence #kingmidas #agi #artificialgeneralintelligence #aiethics #techethics #machinelearning #futureofai #futureoftech #rewardhacking #rewardtampering #instrumentalconvergence #aisafety #specificationgaming #goalmisgeneralization #misgeneralization #noisyfeedback #sycophancy #aisycophancy #redteaming

  2. 12

    Episode 11 - Large Language Mafia: Why AIs Playing Werewolf Matters

    We know Large Language Models can write code and pass the bar exam. But what happens when you drop them into a messy, human environment where they have to lie, detect deception, and build trust? In Episode 11 of Casiorn Thinks, we explore the world of Social AI. We break down a fascinating series of experiments where researchers forced top AI models to play games like Mafia and the Prisoner's Dilemma. The results were shocking. We unpack why smaller models (like Grok 3 Mini) completely outperformed giants like GPT-4 and Claude at detecting lies. We also explore why GPT-4 proved to be an incredibly "unforgiving machine" that gets stuck in endless loops of retaliation.About CasiornThinks: Serious AI, Clear Explanations. CasiornThinks turns cutting-edge AI research into chalkboard-style breakdowns for curious adults who refuse to feel stupid. We read the papers. You get the understanding. 📄➡️📺 Listen & Follow: YouTube: https://www.youtube.com/@CasiornThinks Spotify: https://creators.spotify.com/pod/profile/casiorn-thinks/ Apple Podcast: https://podcasts.apple.com/us/podcast/casiornthinks/id1877923715 x (formerly Twitter): @CasiornThinks Facebook: https://www.facebook.com/profile.php?id=61588303443436 Blog: http://www.casiornthinks.blog/ #behavioralgametheory #socialchainofthought #gametheory #mafia #werewolf #socialdeductiongame #amongus #gamerefinementtheory #ai #artificialintelligence #airesearch #aivideo #aigenerated #llm #largelanguagemodels #machinelearning #futureofai #emergentbehavior #technology #tech #techtrends #technerd #technews

  3. 11

    Episode 10 - ChatDev: The First Prototype for A Post-Human Corporation

    The future of work isn't about writing code. It's about managing a team of digital workers. Today on CasiornThinks, we dive into ChatDev, a groundbreaking project that proves AI agents can run a virtual software company from top to bottom. We examine the shift from the "Master Prompter" to the "Master Orchestrator". Discover how these specialized AI agents (acting as CEOs, CTOs, and developers) use a structured process called "ChatChain" to avoid chaos and build functional applications—like a stock market tracker that cost only $0.06 to develop. We also tackle the "Ugly Reality," looking at the current limits of this technology and the funny bugs (like "lazy agents") that still plague these systems.About CasiornThinks: Serious AI, Clear Explanations. CasiornThinks turns cutting-edge AI research into chalkboard-style breakdowns for curious adults who refuse to feel stupid. We read the papers. You get the understanding. 📄➡️📺 Listen & Follow:YouTube: https://www.youtube.com/@CasiornThinks Spotify: https://creators.spotify.com/pod/profile/casiorn-plays/ Apple Podcast: https://podcasts.apple.com/us/podcast/casiornthinks/id1877923715 x (formerly Twitter): @CasiornThinks Facebook: https://www.facebook.com/profile.php?id=61588303443436 Blog: http://www.casiornthinks.blog/ #ChatDev #AIAgents #FutureOfWork #SoftwareEngineering #TechExplained #AICollaboration #GenerativeAI #CasiornThinks #agenticai #artificialintelligence #techtrends #SocietyOfMinds #PostHuman #emergentbehavior #llm #largelanguagemodels #machinelearning #HighSchoolCeiling #futureofai #futureofwork

  4. 10

    Episode 9 - Ollama and the Open-Source Revolution

    For years, we’ve treated AI like a magical Oracle living in a corporate cloud. We pay a subscription, we send our data away, and we wait for an answer. But what if you could kidnap the Oracle and keep it on your laptop?In this episode of Casiorn Thinks, we explore the revolution of Local AI. We introduce Ollama, the "Docker for AI" that allows anyone to run powerful models (like Llama 3 or DeepSeek) on their own machine—offline, privately, and for free. But this is bigger than just saving money. We zoom out to the global "Open Source War." We look at the battle between "The Vaults" (closed systems like OpenAI) and "The Blueprints" (open weights like Meta), and why nations are racing to build "Sovereign AI" to break free from Silicon Valley's control. About CasiornThinks: Serious AI, Clear Explanations. CasiornThinks turns cutting-edge AI research into chalkboard-style breakdowns for curious adults who refuse to feel stupid. We read the papers. You get the understanding. 📄➡️📺 Listen & Follow: YouTube: https://www.youtube.com/@CasiornThinks Spotify: https://creators.spotify.com/pod/profile/casiorn-plays/ Apple Podcast: https://podcasts.apple.com/us/podcast/casiornthinks/id1877923715 x (formerly Twitter): @CasiornThinks Facebook: https://www.facebook.com/profile.php?id=61588303443436 Blog: http://www.casiornthinks.blog/ #Ollama #OpenSourceAI #OpenSource #DeepSeek #Qwen #OpenAI #Llama #MetaAI #ai #artificialintelligence #aivideo #aigenerated #machinelearning #largelanguagemodels #llm #quantization #huggingface #LMStudio #localAI #clouds #bigtech #techtrends #tech #technews #technology #technerd

  5. 9

    Episode 8 - What is a Transformer, Actually? (No, Really!)

    Why did AI suddenly get "smart" around 2017? It wasn't magic. It was a change in architecture.In this episode of Casiorn Thinks, we break down the most influential research paper of the last decade: Google's "Attention Is All You Need." We explain how this paper killed the old way of doing AI (RNNs) and introduced the Transformer.We look at the fundamental shift from "Sequential Processing" (reading one word at a time) to "Parallel Processing" (reading the whole book at once). This shift didn't just make AI faster; it allowed for the discovery of Scaling Laws, proving that bigger models combined with more data actually equals better performance.About CasiornThinks:Serious AI, Clear Explanations. CasiornThinks turns cutting-edge AI research into chalkboard-style breakdowns for curious adults who refuse to feel stupid. We read the papers. You get the understanding. 📄➡️📺Listen & Follow:YouTube: https://www.youtube.com/@CasiornThinksSpotify: https://creators.spotify.com/pod/profile/casiorn-plays/Apple Podcast: https://podcasts.apple.com/us/podcast/casiornthinks/id1877923715x (formerly Twitter): @CasiornThinksFacebook: https://www.facebook.com/profile.php?id=61588303443436Blog: http://www.casiornthinks.blog/#ai #artificialintelligence #aivideo #aigenerated #llm #largelanguagemodels #machinelearning #airesearch #futureofai #tech #techtrends #technews #technology #transformers #rope #alibi #sinpe #query #key #value #vanishinggradient #casiornthinks #AttentionIsAllYouNeed #DeepLearning #NeuralNetworks #ScalingLaws #FlashAttention #OpenAI

  6. 8

    Episode 7 - Cicero: The AI That Won a Game of Betrayal... By Being Nice

    Large Language Models are great at talking, but terrible at planning. Strategic AIs are great at planning, but can't talk. What happens when you merge them? You get Cicero.Today, we are exploring the first AI agent to achieve human-level performance in the complex strategy game Diplomacy. We analyze the breakthrough architecture known as Grounded Planning, where an AI's words are strictly tethered to its calculated intent.We also look at the limitations. We dissect a 2024 study revealing the "Persuasion Gap"—why AI is still only half as good as humans at actually changing someone's mind—and why "radical honesty" turned out to be the ultimate game-theory hack.About CasiornThinks:Serious AI, Clear Explanations. CasiornThinks turns cutting-edge AI research into chalkboard-style breakdowns for curious adults who refuse to feel stupid. We read the papers. You get the understanding. 📄➡️📺Listen & Follow: YouTube: https://www.youtube.com/@CasiornThinks Spotify: https://creators.spotify.com/pod/profile/casiorn-plays/ Apple Podcast: https://podcasts.apple.com/us/podcast/casiornthinks/id1877923715 x (formerly Twitter): @CasiornThinks Facebook: https://www.facebook.com/profile.php?id=61588303443436 Blog: http://www.casiornthinks.blog/ #ai #artificialintelligence #cicero #meta #metaai #diplomacy #diplomacynews #machinelearning #largelanguagemodels #llm #aivideo #futureofai #aigenerated #agent #agenticai #aigaming #social #tech #techtrends #technews #technology #technerd

  7. 7

    Episode 6 - Voyager: The Bot that Taught Itself Minecraft (No, Really!)

    How do you teach an AI to play a game that has no rules, no clear goal, and infinite possibilities? You don't. You build an AI that can teach itself.In this episode of Casiorn Thinks, we break down Voyager, the first LLM-powered agent that mastered Minecraft completely on its own. Unlike previous AIs that just mashed buttons, Voyager writes its own code to interact with the world.We dissect the "Three-Part Brain" architecture that makes this possible:Automatic Curriculum: How it uses curiosity to set its own goals.The Skill Library: How it solves "Catastrophic Forgetting" by saving skills as reusable code.Iterative Prompting: How it practices, fails, reads the error message, and fixes its own code.About CasiornThinks:Serious AI, Clear Explanations. CasiornThinks turns cutting-edge AI research into chalkboard-style breakdowns for curious adults who refuse to feel stupid. We read the papers. You get the understanding. 📄➡️📺Listen & Follow:YouTube: https://www.youtube.com/@CasiornThinksSpotify: https://creators.spotify.com/pod/profile/casiorn-plays/Apple Podcast: https://podcasts.apple.com/us/podcast/casiornthinks/id1877923715x (formerly Twitter): @CasiornThinksFacebook: https://www.facebook.com/profile.php?id=61588303443436Blog: http://www.casiornthinks.blog/#Voyager #Minecraft #AI #AgenticAI #MachineLearning #FreeWill

  8. 6

    Episode 5 - The Agent Illusion: The 5 Stages of AI Development and Where We Really Are

    If you read the headlines, AI "Agents" are already here, running businesses and coding websites. But if you ask the researchers at OpenAI, we are barely scratching the surface. Why the disconnect?In this episode of Casiorn Thinks, we explore the "Agent Illusion." We break down OpenAI’s official 5-Level Roadmap to Super AI—from simple Chatbots to AI Organizations.We examine the massive gap between the "hype definition" of an Agent (a tool that does tasks) and the "technical definition" (a system that can work autonomously for days). Plus, we look at the cautionary tale of Lattice, the software company that tried to put AI agents on org charts and faced immediate backlash.About CasiornThinks:Serious AI, Clear Explanations. CasiornThinks turns cutting-edge AI research into chalkboard-style breakdowns for curious adults who refuse to feel stupid. We read the papers. You get the understanding. 📄➡️📺Listen & Follow:YouTube: https://www.youtube.com/@CasiornThinksSpotify: https://creators.spotify.com/pod/profile/casiorn-plays/Apple Podcast: https://podcasts.apple.com/us/podcast/casiornthinks/id1877923715x (formerly Twitter): @CasiornThinksFacebook: https://www.facebook.com/profile.php?id=61588303443436Blog: http://www.casiornthinks.blog/#AIAgents #OpenAI #AGI #FutureOfWork #TechExplained #ArtificialIntelligence #Lattice #CasiornThinks

  9. 5

    Episode 4 - The Scratchpad Revolution (Demystifying Chain-of-Thought)

    Standard AI prompts are like human intuition: fast, automatic, and prone to error. But what if we could force a computer to "slow down" and think like a human solving a riddle?In Episode 4, we dive into Chain-of-Thought (CoT). We look at the landmark 2022 paper by Jason Wei and colleagues that changed prompt engineering forever. We ask the big philosophical question: Is the AI actually using logic, or is it just mimicking the pattern of logic (the "Mirage" argument)?Plus, we look at the unintended consequences of giving machines a private thought process—including the risk of "steganography" (hidden messages) and strategic deception.About CasiornThinks:Serious AI, Clear Explanations. CasiornThinks turns cutting-edge AI research into chalkboard-style breakdowns for curious adults who refuse to feel stupid. We read the papers. You get the understanding. 📄➡️📺Listen & Follow:Spotify: https://open.spotify.com/show/61Cn1dQIRuTbu9JU4vFv0jx (formerly Twitter): @CasiornThinksApple Podcast: https://podcasts.apple.com/us/podcast/casiornthinks/id1877923715YouTube: https://youtu.be/FVAVMlUJDKM

  10. 4

    Episode 3 - Unlocking the Goldfish Genius (or What is RAG, Actually?)

    How do you teach a Large Language Model about your private data without retraining it from scratch? You don't. You use RAG.Today, we are breaking down Retrieval Augmented Generation, the architecture that turns a creative chatbot into a factual research tool. We move past the buzzwords to explain the three-step pipeline: Retrieve, Augment, and Generate.We also look at the limitations. We unpack new research from Google regarding the "Wrong Textbook Paradox," which reveals that while RAG is powerful, feeding an LLM irrelevant context can cause hallucinations to skyrocket.About CasiornThinks: Serious AI, Clear Explanations. CasiornThinks turns cutting-edge AI research into chalkboard-style breakdowns for curious adults who refuse to feel stupid. We read the papers. You get the understanding. 📄➡️📺Listen & Follow:Spotify: https://open.spotify.com/show/61Cn1dQIRuTbu9JU4vFv0jx (formerly Twitter): @CasiornThinksFacebook: https://www.facebook.com/profile.php?id=61588303443436Apple Podcast: https://podcasts.apple.com/us/podcast/casiornthinks/id1877923715YouTube: https://www.youtube.com/@CasiornThinks #RAG #RetrievalAugmentedGeneration #LLM #AIArchitecture #TechExplained #GoogleResearch #NVIDIA #CasiornThinks

  11. 3

    Anthropic vs. The Pentagon: The $200M Disagreement

    Anthropic vs. The Pentagon: The $200M DisagreementWho actually gets to make the rules for the world's most powerful artificial intelligence? The people who build it, or the governments that buy it? In this episode of CasiornThinks, we break down the unprecedented clash between leading AI lab Anthropic and the Pentagon. What started as a landmark $200 million government contract quickly dissolved into a very public, very messy feud over how AI should—and shouldn't—be used in national security. We dive into the technical risks of AI hallucinations in warfare, the massive implications of AI-powered "digital pointillism" surveillance, and the shocking plot twist when OpenAI stepped in to take Anthropic's place. Ultimately, this isn't just a story about a canceled contract; it's a battle for control over the future of AI. Listen & Follow:YouTube: https://www.youtube.com/@CasiornThinksSpotify: https://creators.spotify.com/pod/profile/casiorn-thinks/Apple Podcast: https://podcasts.apple.com/us/podcast/casiornthinks/id1877923715x (formerly Twitter): @CasiornThinksFacebook: https://www.facebook.com/profile.php?id=61588303443436Blog: http://www.casiornthinks.blog/Disclosure: The narration/host format is generated with NotebookLM from my script. Totally fair if you don’t like AI narration. I use it for privacy (I'm camera shy) and to keep costs at $0, but the script and research are mine. Check out the blog if you want high quality reads.#AI #ThePentagon #Anthropic #Pentagon #OpenAI #Hallucination #FutureOfTech #NationalSecurity #GovernmentContracting #SupplyChain #ArtificialIntelligence

  12. 2

    Episode 2 - The Day the Robot Dreamed | On AI Hallucinations and Why They Happen

    What happens when a lawyer trusts ChatGPT to do his homework? He ends up citing court cases that don't exist. What happens when Air Canada’s chatbot invents a refund policy? The airline gets sued.In Episode 2 of Casiorn Thinks, we explore the weird world of AI Hallucinations. We aren't just laughing at the failures; we're breaking down why they happen. Why does a machine built on logic confidently tell you that churros are good tools for surgery?We dissect the "Next Word Prediction" engine to explain why LLMs prioritize sounding smart over being right. Plus, we reveal the twist: how this exact same "glitch" helped a scientist win a Nobel Prize in 2024.About CasiornThinks:Serious AI, Clear Explanations. CasiornThinks turns cutting-edge AI research into chalkboard-style breakdowns for curious adults who refuse to feel stupid. We read the papers. You get the understanding. 📄➡️📺Listen & Follow:Spotify: https://open.spotify.com/show/61Cn1dQIRuTbu9JU4vFv0jx (formerly Twitter): @CasiornThinksApple Podcast: https://podcasts.apple.com/us/podcast/casiornthinks/id1877923715#AIHallucinations #ChatGPTFails #TechExplained #LLM #NobelPrize #DavidBaker #AirCanada #CasiornThinks

  13. 1

    Episode 1 - Smallville: The Secret Lives of Generative Agents

    What happens when you drop 25 AI agents into a digital town, give them no script, and tell them to "survive"?In our debut episode, we explore the famous "Smallville" experiment. We’re not talking about NPCs who just stand around waiting for a quest. We’re talking about agents who wake up, cook breakfast, form crushes, and—shockingly—plan a Valentine’s Day party entirely on their own.Join Casiorn as we uncover the "Ghost in the Machine" moments that prove AI is getting scarily good at being human.🎥 Watch the Visual Breakdown:See the agents in action in our deep-dive video: "Why AI Needs Manners | Smallville & The Future of Generative Agents"👉 https://youtu.be/h5RmqjJs3XE?si=W12e1A_6uz8b9zTN📌 About CasiornPlays:Exploring the weird world where AI meets human behavior. From Minecraft bots to digital societies, we translate deep-tech research into stories for humans.Connect with me:YouTube: https://www.youtube.com/channel/UCZ6XoRtQOeMP4rndtUovIogTwitter/X: @CasiornPlays

Type above to search every episode's transcript for a word or phrase. Matches are scoped to this podcast.

Searching…

No matches for "" in this podcast's transcripts.

Showing of matches

No topics indexed yet for this podcast.

Loading reviews...

ABOUT THIS SHOW

Serious AI, Clear Explanations.CasiornThinks turns cutting-edge AI research into chalkboard-style breakdowns for curious adults who refuse to feel stupid. Every week, we unpack real AI research — hallucinations, agents, reasoning, all of it — and make it legible for actual humans. No hype. No jargon flexing. Just clarity.YouTube: https://www.youtube.com/@CasiornThinksApple Podcast: https://podcasts.apple.com/us/podcast/casiornthinks/id1877923715x (formerly Twitter): @CasiornThinksFacebook: https://www.facebook.com/profile.php?id=61588303443436Blog: http://www.casiornthinks.blog/

HOSTED BY

Dan Roque, CSP®

CATEGORIES

URL copied to clipboard!