Agents of Intelligence podcast artwork

PODCAST · technology

Agents of Intelligence

Exploring AI with the power of AI — Agents of Intelligence is a cutting-edge podcast dedicated to covering a wide range of topics about artificial intelligence. Our process blends human insight with AI-driven research—each episode starts with a curated list of topics, followed by AI agents scouring the web for the best public content. AI-powered hosts then craft an engaging, well-researched discussion, which is reviewed by a subject matter expert before being shared with the world. The result? A seamless fusion of AI efficiency and human expertise, bringing you the most insightful conversations on AI’s latest developments, challenges, and future impact.

  1. 51

    The AI Agent Harness: Engineering Controlled GenAI Systems

    In The AI Agent Harness: Engineering Controlled GenAI Systems, this episode dives into how modern AI systems evolve from standalone models into fully orchestrated agents capable of executing complex tasks. The discussion centers on the idea that a model alone is not sufficient—it must operate within a structured harness that manages decision-making, tool usage, and system state. By separating reasoning from execution, engineers can introduce control layers such as action brokers, validation checkpoints, and policy enforcement mechanisms that ensure outputs are safe, auditable, and aligned with business rules. We explore a reference architecture for agentic systems, highlighting how components like memory, tool interfaces, and multi-agent coordination come together under a governed runtime. The episode also examines the importance of trajectory evaluation—analyzing not just final outputs but the sequence of decisions an agent makes—to improve reliability and transparency. Listeners will gain insight into how security, observability, and cost control are built into these systems from the ground up. Designed for AI/ML engineers, data scientists, and technical leaders, this episode provides a practical, high-level roadmap for implementing controlled autonomy in GenAI applications. It offers a clear perspective on how to bridge the gap between experimental AI and scalable, production-grade agent systems.

  2. 50

    Inside LLM Agents: How Large Language Models Are Becoming Autonomous Problem-Solvers

    From memory-augmented planners that refine their own code to swarms of collaborating bots that debate, learn, and evolve, this episode unpacks the latest survey of Large Language Model agents. We map the three pillars of the field—how agents are built (profiles, memory, planning), how they team up (centralized vs. decentralized vs. hybrid collaboration), and how they self-improve (autonomous optimization, co-evolution, external knowledge). Along the way, we spotlight real-world applications from scientific discovery to gaming, dig into new evaluation benchmarks, and confront the security, privacy, and ethical landmines that accompany truly autonomous AI. If you want a guided tour of where the agent revolution stands—and the hurdles it still faces—this conversation is for you.

  3. 49

    AI Agents vs. Agentic AI: From Solo Bots to Collaborative Minds

    In this episode we chart the evolution from single-purpose AI agents—think AutoGPT scheduling your calendar—to full-blown agentic systems where swarms of specialized bots plan, debate, and execute complex goals together. Drawing on Sapkota et al.’s 2025 taxonomy, we break down the key traits that separate reactive, task-bound agents from orchestrated communities of autonomous specialists; explore real-world examples from MetaGPT to drone fleets; and unpack the thorny challenges of coordination, emergent behavior, and governance that come with this paradigm shift. Along the way, we highlight the toolkits (ReAct loops, memory architectures, function calling, AZR self-play) that promise to tame multi-agent chaos and point the way toward trustworthy, scalable agentic AI.

  4. 48

    Cognitive Debt: What Happens to Your Brain When AI Writes With You

    In this episode we unpack a four-month neuroscientific study that pitted three essay-writing strategies against one another: pure brain-power, search-engine support, and large-language-model (LLM) assistance. Using EEG-based Dynamic Directed Transfer Function (dDTF) analysis, natural-language processing of the essays, and participant interviews, the researchers traced how each approach shapes neural connectivity, cognitive load, and even the sense of authorship. We explore why brain-only writers showed richer delta-band networks and deeper engagement, how AI tools can create linguistic echo chambers while saving mental effort, and what “cognitive debt” really means for learning, critical thinking, and the energy footprint of our words. By the end, you’ll have a fresh lens on the promise—and hidden costs—of hybrid cognition in the age of generative AI.

  5. 47

    Engineering That Works: Inside GitHub’s System Success Playbook

    In this episode of Code at Scale, we unpack the GitHub Engineering System Success Playbook (ESSP)—a practical, metrics-driven framework for building high-performing engineering organizations. GitHub’s ESSP reframes engineering success around the dynamic interplay of quality, velocity, and developer happiness, emphasizing that sustainable improvement comes not from isolated metrics but from system-level thinking. We explore GitHub’s three-step improvement process—identify, evaluate, implement—and dig into the 12 core metrics across four zones (including Copilot satisfaction and AI leverage). We also highlight why leading vs. lagging indicators matter, how to avoid toxic gamification, and how to turn common engineering antipatterns into learning opportunities. Whether you're scaling a dev team or transforming engineering culture, this episode gives you the blueprint to do it with intention, impact, and empathy.

  6. 46

    The AI Marketer: How Generative Models Are Rewriting Enterprise Strategy

    In this episode, we unpack how generative AI is transforming the foundations of enterprise marketing. Drawing from the white paper Generative AI in Marketing: A New Era for Enterprise Marketing Strategies, we explore the rise of large language models (LLMs), diffusion models, and multimodal tools that are now driving content creation, hyper-personalization, lead scoring, dynamic pricing, and more. From Coca-Cola’s AI-generated campaigns to JPMorgan Chase’s automated ad copy, the episode showcases real-world use cases while examining the deeper shifts in how marketing teams operate. We also confront the critical risks—data privacy, brand integrity, model bias, hallucinations—and offer strategic advice for leaders aiming to implement generative AI responsibly and at scale. If your brand is serious about leveraging AI to boost creativity, performance, and customer engagement, this is the conversation you need to hear.

  7. 45

    Agents at Work: Unlocking Autonomy with the Model Context Protocol

    In this episode, we explore the next frontier of enterprise AI: intelligent agents empowered by the Model Context Protocol (MCP). Based on a strategic briefing from Boston Consulting Group, we trace the evolution of AI agents from simple chatbots to autonomous systems capable of planning, tool use, memory, and complex collaboration. We dive deep into MCP, the open-source standard that's fast becoming the connective tissue of enterprise AI—enabling agents to securely access tools, query databases, and coordinate actions across environments. From real-world examples in coding and compliance to emerging security challenges and orchestration strategies, this episode lays out how companies can build secure, scalable agent systems. Whether you're deploying your first AI agent or managing an ecosystem of them, this episode maps the architecture, risks, and best practices you need to know.

  8. 44

    RAG Meets Reasoning: Architectures for Intelligent Retrieval and AI Agents

    In this episode, we decode three of the most compelling architectures in the modern AI stack: Retrieval-Augmented Generation (RAG), AI Agent-Based Systems, and the cutting-edge Agentic RAG. Based on the in-depth technical briefing Retrieval, Agents, and Agentic RAG, we break down how each system works, what problems they solve, and where they shine—or struggle. We explore how RAG grounds LLM responses with real-world data, how AI agents bring autonomy, memory, and planning into play, and how Agentic RAG fuses the two to tackle highly complex, multi-step tasks. From simple document Q&A to dynamic, multi-agent marketing strategies, this episode maps out the design tradeoffs, implementation challenges, and best practices for deploying each of these architectures. Whether you're building smart assistants, knowledge workers, or campaign bots, this is your blueprint for intelligent, scalable AI systems.

  9. 43

    Code, Meet Copilot: How LLMs Are Reshaping Full-Stack Development

    In this episode, we explore how Large Language Models (LLMs) like GPT-4 and GitHub Copilot are revolutionizing full-stack web development—from speeding up boilerplate generation and test writing to simplifying infrastructure-as-code and DevOps workflows. Based on the white paper Enhancing Full-Stack Web Development with LLMs, we break down the tools, use cases, architectural patterns, and best practices that define modern AI-assisted development. We cover real-world applications, including LLM-driven documentation, code refactoring, test generation, and cloud config writing. We also dive into the risks—like hallucinated code, security gaps, and over-reliance—and how to mitigate them with a human-in-the-loop approach. Whether you're a solo developer or leading a team, this episode offers a comprehensive look at the evolving toolkit for building smarter and faster with AI.  

  10. 42

    From Model to Market: The MLOps Playbook for Scalable AI

    In this episode, we dive into the nuts and bolts of MLOps—the crucial discipline that bridges the gap between machine learning development and real-world deployment. Drawing insights from Introducing MLOps by Mark Treveil and the Dataiku team, we explore what it really takes to operationalize machine learning in enterprise environments. From building reproducible models and setting up robust CI/CD pipelines to managing data drift and enforcing responsible AI practices, we walk through the entire lifecycle of a model in production. You'll learn about the diverse roles that make MLOps successful, how to align governance with risk, and why monitoring and feedback loops are essential to long-term model health. With practical case studies in credit risk and marketing, this episode delivers a comprehensive roadmap for deploying ML systems that scale—safely, ethically, and efficiently.

  11. 41

    The State of AI 2025: Power, Progress, and the Price of Intelligence

    In this special episode, we unpack the major insights from the Artificial Intelligence Index Report 2025, the definitive annual report tracking AI’s global trajectory. From breakthrough advances in training efficiency and multilingual model capabilities to serious concerns about carbon emissions, bias, and ethical risks in medicine, this report gives us a sweeping view of where AI is—and where it’s heading. We’ll dive into how AI is reshaping science, education, and the economy, discuss the exponential rise in AI patents, and explore geopolitical trends in research, talent migration, and public policy. Whether it’s massive compute powering GPT-4o, the booming generative AI investment scene, or the growing calls for responsible AI governance, this episode brings you the numbers, narratives, and nuance behind today’s AI evolution. Expect data-backed insights, expert commentary, and a big-picture look at what it means to live in the AI age.

  12. 40

    Prompt Perfect: Crafting Conversations with Large Language Models

    In this episode, we unravel the art and science of prompt engineering—the subtle, powerful craft behind guiding large language models (LLMs) to produce meaningful, accurate, and contextually aware outputs. Drawing from the detailed guide by Lee Boonstra and her team at Google, we explore the foundational concepts of prompting, from zero-shot and few-shot techniques to advanced strategies like Chain of Thought (CoT), ReAct, and Tree of Thoughts. We also dive into real-world applications like code generation, debugging, and translation, and explore how multimodal inputs and model configurations (temperature, top-K, top-P) affect output quality. Wrapping up with a deep dive into best practices—such as prompt documentation, structured output formats like JSON, and collaborative experimentation—you’ll leave this episode equipped to write prompts that actually work. Whether you’re an LLM pro or just starting out, this one’s packed with tips, examples, and aha moments.

  13. 39

    Brains and Bridges: Decoding Agent2Agent and Model Context Protocols

    In this episode of Agents of Intelligence, we dive deep into two groundbreaking protocols shaping the future of multi-agent Large Language Model (LLM) orchestration: the Agent2Agent (A2A) Protocol and the Model Context Protocol (MCP). A2A acts as the social glue between autonomous AI agents, allowing them to communicate, delegate tasks, and negotiate how best to serve the user—almost like microservices that can think. On the other side, MCP is the information highway, standardizing how these agents access and interact with external data and tools—making sure they’re never working in isolation. We’ll unpack the core design philosophies, key features, real-world use cases, and the powerful synergy between A2A and MCP when combined. Whether it’s onboarding a new employee or compiling a complex research report, these protocols are making it possible for intelligent agents to collaborate and operate with unprecedented depth and flexibility. Tune in to learn how the future of AI is being built—not just with smarter models, but with smarter ways for those models to talk, think, and act together.

  14. 38

    Beyond Benchmarks: How Long Can AI Work?

    In this episode, we unpack a groundbreaking new way of measuring AI capability—not by test scores, but by time. Drawing from the recent METR paper "Measuring AI Ability to Complete Long Tasks," we explore the concept of the 50% task-completion time horizon—a novel metric that asks: How long could a human work on a task before today's AI can match them with 50% success? We’ll explore how this time-based approach offers a more intuitive and unified scale for tracking AI progress across domains like software engineering and machine learning research. The findings are eye-opening: the time horizon has been doubling roughly every seven months, suggesting we could see "one-month AI"—systems capable of reliably completing tasks that take humans 160+ hours—by 2029. We also delve into how reliability gaps, planning failures, and context sensitivity reveal AI’s current limits, even as capabilities continue to grow exponentially. Plus, what does this mean for the future of work, safety risks, and our understanding of AGI? If you're tired of benchmark buzzwords and want to get real about how far AI has come—and how far it might go—this one's for you.

  15. 37

    AI at the Crossroads: Strategic Shifts & Surging Adoption in 2025

    In this episode, we dive deep into McKinsey’s March 2025 report on “The State of AI,” drawn from its global survey conducted in mid-2024. The findings reveal a world where AI—especially generative AI—is no longer in the experimental phase but is becoming embedded into the core operations of organizations across industries. We explore the rapid rise in adoption rates, the growing trend of redesigning workflows, and how larger companies are pulling ahead by centralizing governance and mitigating risk. We also break down the role of leadership—particularly CEO involvement—in AI strategy and outcomes, discuss the challenges and opportunities in workforce reskilling, and look at the practices that separate high-impact AI implementations from the rest. Although tangible enterprise-wide EBIT impact remains elusive for many, the strategic focus on adoption, scaling, and transformation suggests that AI's full potential is just beginning to unfold. Whether you're in tech, business leadership, or just AI-curious, this episode offers an essential snapshot of where AI is today—and where it's headed next.

  16. 36

    Decoding Generative AI: The Math Behind Machines That Create

    In this episode, we take a deep dive into the mathematical foundations of generative AI, unraveling the complex theories and equations that power models like VAEs, GANs, normalizing flows, and diffusion models. From linear algebra and probability to optimization and game theory, we explore the intricate math that enables AI to generate realistic images, text, and more. Whether you're an AI researcher, machine learning engineer, or just curious about how machines can dream up new realities, this episode will provide a rigorous yet engaging exploration of the formulas and concepts shaping the future of generative AI.

  17. 35

    Architecting the Future of AI: The Evolution of Intelligent Agents

    Join us as we explore the cutting-edge evolution of AI agent architectures, from foundational language models to multi-modal intelligence, tool-using agents, and autonomous decision-makers. This deep technical episode breaks down the building blocks of next-generation AI systems, covering retrieval-augmented generation (RAG), memory-augmented reasoning, reinforcement learning, and multi-agent collaboration—offering AI architects, engineers, and data scientists a roadmap to designing scalable and intelligent enterprise AI.

  18. 34

    AI Security Deep Dive: Safeguarding LLMs in the Cloud

    In this episode, we explore the hidden risks of deploying large language models (LLMs) like DeepSeek in enterprise cloud environments and the best security practices to mitigate them. Hosted by AI security experts and cloud engineers, each episode breaks down critical topics such as preventing sensitive data exposure, securing API endpoints, enforcing RBAC with Azure AD and AWS IAM, and meeting compliance standards like China’s MLPS 2.0 and PIPL. We’ll also tackle real-world AI threats like prompt injection, model evasion, and API abuse, with actionable guidance for technical teams working with Azure, AWS, and hybrid infrastructures. Whether you're an AI/ML engineer, platform architect, or security leader, this podcast will equip you with the strategies and technical insights needed to securely deploy generative AI models in the cloud.  

  19. 33

    Building AI at Scale: The OpenAI Response API Deep Dive

    Welcome to Building AI at Scale, the podcast where we break down the intricacies of deploying enterprise-grade AI applications. In this series, we take a deep dive into the OpenAI Response API and explore its technical implementation, performance optimization, concurrency management, and enterprise deployment strategies. Designed for software engineers, AI architects, and data engineers, we discuss key considerations when integrating the OpenAI Python SDK with agentic frameworks like LangChain and GraphChain, as well as cloud platforms like Azure and AWS. Learn how to optimize latency, handle rate limits, implement security best practices, and scale AI solutions efficiently. Whether you’re an AI veteran or leading a new generative AI initiative in your organization, this podcast provides the technical depth and real-world insights you need to build robust AI-powered systems.

  20. 32

    AI at Work: How Claude is Reshaping the Economy

    How is AI actually being used in the workplace today? In this episode, we dive into groundbreaking research from Handa et al. (Anthropic), which analyzed over four million conversations on Claude.ai to map AI’s role in different economic tasks. The study reveals that AI is most commonly applied in software development and writing, spanning about 36% of occupations for at least a quarter of their tasks. We explore the nuances of augmentation versus automation, AI’s impact on wages and job accessibility, and what this means for the future of work. Join us for an in-depth discussion on how AI is reshaping jobs—not replacing them outright—and what the data tells us about where we’re headed next.

  21. 31

    Enterprise AI Agents: Building Scalable Intelligence in the Cloud

    As AI agents take center stage in enterprise automation, decision-making, and knowledge management, organizations must navigate a complex landscape of cloud technologies, modular architectures, and security considerations. In this episode, we dive into the insights from AI Agents in the Enterprise: Cloud-Based Solutions for Scalable Intelligence by Sam Zamany of Boston Scientific. We explore how enterprises can design and deploy intelligent, autonomous AI agents using cloud-native architectures, reusable AI components, and cutting-edge frameworks like LangChain, ReAct, and Retrieval-Augmented Generation (RAG). Through real-world case studies from companies like Morgan Stanley, Bank of America, and Moderna, we highlight the transformative power of AI agents and best practices for large-scale adoption. Whether you're an IT architect, AI practitioner, or business leader, this episode will equip you with the strategies to integrate AI agents into your enterprise ecosystem successfully.

  22. 30

    AI-Driven Propaganda: How Artificial Intelligence is Shaping Political Misinformation

    In this episode, we dive deep into the world of AI-driven propaganda and the alarming rise of artificial intelligence in political misinformation. From deepfake videos that put false words in politicians’ mouths to AI-powered bots that manipulate social media discourse, the spread of disinformation has never been more sophisticated. We explore real-world case studies of AI-driven political interference, including state-sponsored campaigns by Russia and China, deepfake election scandals, and the role of algorithmic manipulation in shaping public opinion. What does this mean for democracy? How can we combat AI-generated propaganda while preserving free speech? Join us as we unpack the evolving tactics of AI-powered misinformation and discuss the future of truth in the digital age.

  23. 29

    AGI Unveiled: The Future of Thinking Machines

    In this special deep-dive episode, we explore one of the most profound questions in artificial intelligence: When will AI truly think like us? Based on the research-driven book, The Future of AGI: When Will AI Think Like Us?, this episode unpacks the technical architectures, key challenges, ethical concerns, and governance frameworks shaping the development of Artificial General Intelligence. We discuss the breakthroughs needed to create AGI, the risks of misalignment, and expert predictions on when (or if) machines will achieve human-like cognition. Join us for an in-depth narrative that goes beyond the hype to examine the real science behind AGI and what it means for the future of intelligence.

  24. 28

    Smart Cities of the Future: The Architecture of End-to-End Futuristic Urban Living

    Imagine a city that thinks, learns, and adapts in real-time—where artificial intelligence manages traffic with near-zero congestion, IoT sensors monitor air quality street by street, and blockchain secures digital governance. Smart cities are no longer a futuristic vision; they are becoming a reality. But what does it take to build these sentient cities that seamlessly integrate technology, governance, and human experience? In this podcast, we explore the end-to-end architecture of futuristic urban living—from AI-powered infrastructure to digital twins for urban planning and the cybersecurity challenges of hyper-connected spaces. We’ll break down real-world examples like Singapore, Dubai, Amsterdam, and Songdo, analyzing what works, what doesn’t, and what the future holds. Expect deep dives into AI-driven policymaking, sustainable smart grids, autonomous mobility, and the ethical dilemmas of mass urban surveillance. Whether you’re an engineer, AI enthusiast, urban planner, or simply curious about the future, this podcast will uncover the technologies shaping the cities of tomorrow.

  25. 27

    AI Agents Unveiled: The Future of Intelligent Systems and Voice Integration

    Join us on AI Agents Unveiled, a deep-dive podcast exploring the cutting-edge world of artificial intelligence agents and their seamless integration with voice technology. In this series, we break down the technical architectures of modern AI agents, discuss how voice technology enhances human-AI interaction, and tackle the challenges of real-time processing, ethical considerations, and AI-driven decision-making. From large language models and constitutional AI to multimodal interaction and industry-specific applications, we bring you expert insights on how AI is shaping the future of automation, healthcare, and beyond. Whether you’re an AI enthusiast, a developer, or a tech strategist, this podcast delivers thought-provoking discussions on the evolution, impact, and future directions of intelligent AI systems.

  26. 26

    Cyber Wars: AI in Attack and Defense

    In the ever-evolving battlefield of cybersecurity, artificial intelligence is both the weapon and the shield. Cyber Wars: AI in Attack and Defense delves into the high-stakes arms race between machine learning-powered defenses and adversarial attackers who exploit AI vulnerabilities. From adversarial machine learning (AML) and AI-driven offensive security to cutting-edge deep learning and metaheuristic models for cyberattack detection, this podcast explores how AI is shaping the future of cybersecurity. Join us as we break down real-world applications, ethical dilemmas, and the latest research in the fight to secure digital landscapes.

  27. 25

    The AI Renaissance: Creativity in Art, Music, and Writing

    How is AI reshaping the creative world? In this episode, we explore the intersection of artificial intelligence and human creativity, from AI-assisted art and music composition to automated writing tools. We discuss the ethical challenges, the evolving definition of originality, and the ways AI is enhancing—not replacing—human expression. Join us as we uncover the future of creativity in the age of AI.

  28. 24

    AI-Powered Document Intelligence: Unlocking the Future of Search & Retrieval

    Join us as we explore the cutting-edge integration of Azure AI Document Intelligence and Azure AI Search, transforming how businesses process and retrieve information. We’ll break down how AI-driven document parsing, structured chunking, and advanced indexing techniques enhance data ingestion and retrieval, making Retrieval-Augmented Generation (RAG) applications more efficient and secure.

  29. 23

    Mastering CrewAI: Building Collaborative AI Agents for Complex Workflows

    Join us as we explore CrewAI, a groundbreaking framework designed to build AI-powered teams that collaborate seamlessly to achieve complex goals. In this episode, we’ll break down how CrewAI structures intelligent agents, tasks, and processes, enabling developers to create efficient and scalable AI workflows. We'll discuss real-world applications, from automating business processes to enhancing data analysis, and share best practices for optimizing AI performance, managing memory, and ensuring security. Whether you're an AI developer, a business leader, or just curious about the future of AI collaboration, this episode will give you insights into how CrewAI is shaping the next generation of intelligent automation.

  30. 22

    Beyond the Algorithm: Fairness and Machine Learning

    Join us for a deep dive into the insights of Fairness and Machine Learning by Solon Barocas, Moritz Hardt, and Arvind Narayanan. We’ll explore how fairness is defined and measured, the legal and societal context that shapes it, and the power of causality and counterfactual reasoning in identifying discrimination. From testing bias in practice to understanding “merit and desert,” this episode unpacks the limitations and opportunities of automated decision-making—and why machine learning should never be viewed as a simple replacement for human judgment.

  31. 21

    AI & Data Engineering: The Future Unleashed

    AI is revolutionizing data engineering, automating tedious tasks, enhancing data quality, and enabling scalable, efficient workflows. In AI & Data Engineering: The Future Unleashed, we explore how Generative AI is transforming the field, discuss best practices for implementation, and tackle key challenges like governance, security, and collaboration between data engineers and data scientists. Join us as we dive into cutting-edge tools, trends, and expert insights shaping the future of AI-driven data engineering.  

  32. 20

    Neural Networks Unveiled: From Perceptrons to Transformers

    Welcome to Neural Networks Unveiled, the podcast where we break down the intricate world of artificial intelligence, one layer at a time. In this episode, we take you on a journey through the evolution of neural networks, from the fundamental concept of perceptrons to the cutting-edge transformer architectures that power today’s AI marvels. We explore how neural networks mimic the structure of the human brain to tackle complex problems, diving into key components such as neurons, layers, weights, biases, and activation functions like ReLU and sigmoid. We also unravel the mechanics of training a network, from cost functions and gradient descent to the powerful process of backpropagation. But that’s not all—we’ll also shed light on the transformer architecture, the backbone of models like ChatGPT, explaining how attention mechanisms, multi-headed attention, and token embeddings make AI smarter than ever. Whether you’re a curious beginner or an AI enthusiast, this episode will equip you with a deep understanding of how machines learn, adapt, and transform the way we interact with technology.

  33. 19

    Navigating the LangChain Ecosystem: Building Smarter AI Applications

    In this episode, we dive deep into the LangChain ecosystem, a powerful suite of tools designed to enhance the development, orchestration, and evaluation of AI applications. LangChain provides the core building blocks for LLM-powered applications, while LangGraph enables stateful, multi-agent workflows with a graph-based approach. LangSmith serves as the essential monitoring and debugging tool, ensuring applications are efficient, optimized, and scalable. We break down key concepts such as Retrieval-Augmented Generation (RAG), agent orchestration, time travel in AI workflows, and real-time debugging. Whether you’re a developer building conversational AI, a data scientist optimizing retrieval pipelines, or a business leader exploring AI-driven automation, this episode will equip you with the insights needed to harness LangChain, LangGraph, and LangSmith to their fullest potential.

  34. 18

    AI Action Summit 2025: Shaping the Future of Artificial Intelligence

    Artificial Intelligence is no longer a futuristic concept—it is reshaping our world in real time. In this episode, we explore the AI Action Summit 2025, a landmark global event taking place in Paris, co-chaired by France and India. With nearly 100 countries and over a thousand representatives from governments, academia, and the private sector, the summit aims to address the most pressing questions surrounding AI: How do we ensure global AI adoption without leaving anyone behind? How do we balance innovation with ethical responsibility? And how do we safeguard fundamental human values in an era of rapid technological transformation? Join us as we dive into the key discussions from this high-stakes international gathering, examining Europe’s leadership in AI regulation, the role of global collaboration, and the groundbreaking initiatives designed to foster trust, inclusivity, and innovation in artificial intelligence.

  35. 17

    The Great Acceleration: How CIOs Are Adopting Generative AI

    Generative AI is transforming enterprise strategy at an unprecedented pace. In this episode, we explore MIT Technology Review’s report, "The Great Acceleration: CIO Perspectives on Generative AI," which details how technology leaders are integrating AI into their organizations. From data infrastructure and model ownership to workforce transformation and governance, we break down how CIOs are making critical decisions about AI adoption—and what it means for businesses worldwide.

  36. 16

    AI Agents: The Future of Autonomous Digital Workforces

    AI Agents are more than just chatbots—they think, plan, and act autonomously. In this episode, we break down the evolution of AI Agents, exploring how they differ from traditional AI models, the critical role of memory, and emerging design patterns like ReAct, Agentic RAG, and Meta-Agents. We’ll also discuss real-world applications, industry adoption, risks, and best practices for deploying AI Agents responsibly. As companies embrace AI Agents to drive automation, productivity, and decision-making, we ask: Are we ready for a world where AI works alongside us?

  37. 15

    The Future of Martech: How AI is Reshaping Marketing in 2025

    The marketing landscape is changing faster than ever, and AI is leading the charge. In this episode, we break down Martech in 2025, a report that explores how Generative AI, APIs, AI agents, and composable martech stacks are transforming the industry. From the hype cycle of AI to the rise of custom-built marketing software, we dive into how businesses must adapt their data strategies, experimentation methods, and technology adoption to stay ahead.

  38. 14

    AI in Healthcare: The Promise and Risks of Generative AI in Medicine

    Generative AI is revolutionizing medical devices, diagnostics, and drug discovery, but can it be trusted with patient care? In this episode, we explore how AI and GenAI are transforming healthcare, from FDA regulations to privacy risks, security concerns, and legal ambiguity. With over 600 AI/ML-enabled medical devices already approved, the landscape is evolving rapidly. Can regulatory frameworks, privacy protections, and trust metrics keep up? Tune in to discover the real-world impact, challenges, and future of AI in medicine.

  39. 13

    Measuring AI: How to Evaluate and Monitor Generative Models

    How do we measure quality, safety, and reliability in generative AI? In this episode, we break down Evaluation and Monitoring Metrics for Generative AI, a detailed framework that helps developers ensure their AI models produce safe, accurate, and aligned content. From risk and safety assessments to custom evaluators, synthetic data, and A/B testing, we explore the best practices for monitoring AI systems using the Azure AI Foundry. If you're building or deploying AI, this episode is a must-listen to understand how to evaluate AI effectively.

  40. 12

    The Coming Wave: Can We Contain the Future?

    AI, biotech, quantum computing, and robotics—unstoppable forces shaping our future. In this episode, we explore The Coming Wave by Mustafa Suleyman and Michael Bhaskar, a book that warns of an impending technological tsunami that could redefine economies, governments, and global security. Can we contain these breakthroughs before they spiral out of control? Or are we headed toward an era of fragmentation and unchecked corporate power? Join us as we break down the key ideas behind the next great transformation.

  41. 11

    AI Meets Software Testing: How Meta’s TestGen-LLM is Changing the Game

    Can AI make software testing more efficient without introducing new risks? In this episode, we explore TestGen-LLM, Meta’s automated unit test improvement tool powered by Large Language Models (LLMs). Unlike conventional AI-driven code generation, TestGen-LLM follows an Assured LLM-Based Software Engineering approach, ensuring that every AI-generated test is verifiably useful, reliable, and regression-free. With successful deployments on Instagram and Facebook, this tool is revolutionizing the way engineers enhance test coverage. Tune in to discover how AI can assist—not replace—developers in writing better tests.

  42. 10

    Fine-Tuning vs. Retrieval: What’s the Best Way to Teach AI?

    When it comes to less popular knowledge, how should we train AI? Should we fine-tune it or let it retrieve information on the fly? In this episode, we break down a groundbreaking study that compares these two approaches—Fine-Tuning (FT) vs. Retrieval-Augmented Generation (RAG)—to see which one better equips AI models for niche factual knowledge. We also explore a novel approach called Stimulus RAG, which boosts retrieval accuracy without expensive fine-tuning. Tune in to find out which method wins and what it means for AI customization!

  43. 9

    GraphRAG: Revolutionizing AI Data Retrieval with Knowledge Graphs

    Data retrieval is getting a major upgrade! In this episode, we dive into Structured-GraphRAG, a new framework that enhances AI-powered retrieval by integrating knowledge graphs (KGs) with large language models. Using a case study on soccer data, this approach drastically reduces hallucinations, improves accuracy, and speeds up response times by over 98%. Join us as we explore how Structured-GraphRAG is setting a new standard for AI-driven information retrieval.

  44. 8

    FRAMES: The Next-Level Test for AI’s Fact-Checking and Reasoning Skills

    How well do AI models really think? In this episode, we explore FRAMES, a groundbreaking evaluation benchmark designed to push Retrieval-Augmented Generation (RAG) systems to their limits. Unlike traditional benchmarks, FRAMES assesses factual retrieval, reasoning, and synthesis together, exposing key weaknesses in today’s most advanced AI models. Tune in to discover why even state-of-the-art systems struggle with multi-hop reasoning—and what it means for the future of AI reliability.

  45. 7

    Phi-1: Smarter AI, Smaller Model—The Power of Textbook Training

    Bigger isn’t always better. In this episode, we break down Microsoft Research’s latest AI breakthrough—Phi-1, a 1.3 billion parameter coding model that outperforms much larger models by focusing on high-quality, textbook-style data. Discover how this approach challenges traditional scaling laws, slashes computational costs, and paves the way for more efficient AI development. Tune in as we explore the future of coding AI and why “textbooks are all you need."

  46. 6

    Beyond Automation Hype: The Economics of AI Adoption

    AI isn't taking over jobs as fast as we think—it's all about the economics. In this episode, we dive into MIT’s latest research on computer vision automation and unpack why cost, scale, and deployment matter more than just technical feasibility. From small businesses to AI-as-a-service models, we explore what actually makes automation worth the investment and what that means for the future of work.

  47. 5

    Beyond Models: The Rise of Generative AI Agents

    Generative AI is evolving beyond standalone models into fully functional agents capable of reasoning, planning, and interacting with the world through tools. In this episode, we explore the architecture of AI agents, how they differ from traditional models, and the role of tools like LangChain and Vertex AI in their development. Discover the future of AI autonomy and what it means for industries and everyday applications.

  48. 4

    Hallucination Mitigation: The Future of Multi-Agent AI Systems

    Discover how multi-agent AI frameworks are redefining the fight against hallucinations in large language models. This episode explores how layered agents, guided by the OVON framework, reduce speculative content through iterative refinement and structured data exchange. Learn about novel KPIs, empirical results, and the potential of agentic AI to enhance trust and transparency in generative AI systems.

  49. 3

    MedAgentBench: Redefining AI as Medical Agents

    Explore how MedAgentBench benchmarks large language models (LLMs) as medical agents, moving beyond chatbots to tackle real-world clinical tasks. This episode unpacks the dataset's 100 clinically derived tasks, its FHIR-compliant interactive environment, and insights into the current state of LLM performance. Learn how AI can reduce administrative burdens and improve healthcare delivery.

  50. 2

    Surgical Precision Redefined: Tackling Remote Surgery Challenges with Informer AI

    Explore how cutting-edge AI transforms remote robotic surgery. Using the Informer model, researchers tackle network-induced issues like jitter and packet loss, ensuring real-time precision for Patient Side Manipulators. Discover the role of predictive AI in overcoming latency challenges, enhancing accuracy, and reshaping surgical possibilities with the Tactile Internet.

Type above to search every episode's transcript for a word or phrase. Matches are scoped to this podcast.

Searching…

We're indexing this podcast's transcripts for the first time — this can take a minute or two. We'll show results as soon as they're ready.

No matches for "" in this podcast's transcripts.

Showing of matches

No topics indexed yet for this podcast.

Loading reviews...

ABOUT THIS SHOW

Exploring AI with the power of AI — Agents of Intelligence is a cutting-edge podcast dedicated to covering a wide range of topics about artificial intelligence. Our process blends human insight with AI-driven research—each episode starts with a curated list of topics, followed by AI agents scouring the web for the best public content. AI-powered hosts then craft an engaging, well-researched discussion, which is reviewed by a subject matter expert before being shared with the world. The result? A seamless fusion of AI efficiency and human expertise, bringing you the most insightful conversations on AI’s latest developments, challenges, and future impact.

HOSTED BY

Sam Zamany

CATEGORIES

Frequently Asked Questions

How many episodes does Agents of Intelligence have?

Agents of Intelligence currently has 50 episodes available on PodParley. New episodes are automatically indexed when they're published to the podcast feed.

What is Agents of Intelligence about?

Exploring AI with the power of AI — Agents of Intelligence is a cutting-edge podcast dedicated to covering a wide range of topics about artificial intelligence. Our process blends human insight with AI-driven research—each episode starts with a curated list of topics, followed by AI agents scouring...

How often does Agents of Intelligence release new episodes?

Agents of Intelligence has 50 episodes. Check the episode list to see recent publication dates and frequency.

Where can I listen to Agents of Intelligence?

You can listen to Agents of Intelligence on PodParley by clicking any episode. We provide an embedded audio player for direct listening, and you can also subscribe via your preferred podcast app using the RSS feed.

Who hosts Agents of Intelligence?

Agents of Intelligence is created and hosted by Sam Zamany.
URL copied to clipboard!