
EPISODE · Mar 29, 2025 · 22 MIN

Decoding AI: How LLMs Truly "Think" | Ep 36

from Allied Angels: Venture Capital Insights · host Allied Venture Partners

In this episode of Allied Angels, we unlock the inner workings of large language models (LLMs), like Claude 3.5 Haiku, by breaking down the latest cutting-edge research paper from Anthropic: Tracing the thoughts of a large language model.

Join us as we delve into mechanistic interpretability, exploring how AI truly "thinks" by revealing its computational graphs and underlying circuits.

Discover the innovative circuit tracing methodology utilizing attribution graphs and cross-layer transcoders (CLTs) to dissect the complex processes within these models.

We uncover interpretable features, the building blocks of AI computation, and map their interactions to understand how models generate text and perform tasks.

We also explore fascinating "AI biology" as we trace the pathways behind diverse behaviors, such as:

• Multilingualism: Uncover evidence of a shared conceptual space and both language-specific and language-independent circuits.
• Planning: Learn how language models plan their outputs, even in creative tasks like poetry generation, by identifying future words and working backward.
• Refusals: Understand the mechanisms behind a model's decision to decline harmful requests and how specific features contribute to this behavior.
• Jailbreaks: Investigate prompting strategies that can bypass safety mechanisms and the underlying weaknesses they exploit.
• Factual Recall: See how models access and utilize factual knowledge to answer questions.
• Addition: Delve into the surprisingly intricate circuits responsible for simple arithmetic.
• Entity Recognition and Hallucinations: Learn how models distinguish between known and unknown entities and the circuit misfires that can lead to fabricated information.
• Chain-of-thought Faithfulness: Examine whether a model's stated reasoning aligns with its actual computational steps.
• Hidden Goals: Uncover how fine-tuning can embed secret objectives within a model's persona.

Gain insights into the limitations of current methods, including missing attention circuits, reconstruction errors, and the challenges of understanding global circuits. We also discuss the crucial role of validation through perturbation experiments.

This podcast provides a unique window into the "thoughts" of large language models, revealing the fascinating interplay of features and circuits that drive their capabilities and limitations. Tune in to explore the cutting-edge of AI interpretability and the quest to build an "AI microscope" to understand the complex world within.

------------

Allied VC is Western Canada's largest angel syndicate, investing in early-stage technology startups across Canada and the USA.

Pitch us, Invest, Scout, and more: https://linktr.ee/alliedvc

Allied Angels is powered by NotebookLM - Google's new AI note-taking & research assistant.



Frequently Asked Questions

How long is this episode of Allied Angels: Venture Capital Insights?

This episode is 22 minutes long.

When was this Allied Angels: Venture Capital Insights episode published?

This episode was published on March 29, 2025.

