The Practical AI Digest

PODCAST · technology

The Practical AI Digest

Distilling AI/ML theory into practical insights. One concept at a time. No jargon.

  1. 19

    AI Hardware: GPUs, TPUs and Beyond

    This episode is all about the specialized hardware that makes modern AI possible. We explain how GPUs became the workhorses of deep learning by offering massive parallelism for matrix math, and how companies like Google went further to build TPUs (Tensor Processing Units) optimized for neural network workloads. You’ll hear about the latest AI chips, from NVIDIA’s powerful GPUs driving large model training, to emerging AI accelerators like Graphcore’s IPU, Cerebras’s wafer-scale engine, and even AI on the edge (Apple’s neural engines, etc.). We discuss what each brings in terms of speed, memory, efficiency, and how they’re deployed, giving a peek into the data centers (and devices) where AI calculations run.

  2. 18

    Synthetic Data: Artificial Data for Real Insights

    In this episode, we explore how synthetic data is created and used to improve AI models. Synthetic data refers to artificial datasets generated by models (like GANs or language models) that mimic real data. We discuss how this can help in situations with little real data or strict privacy requirements for example, generating realistic medical records to train an AI without exposing any patient’s information. You’ll learn about techniques for producing synthetic images, text, and tabular data, and how they are validated to ensure they reflect real-world patterns. We also cover the benefits and challenges of synthetic data, from reducing bias and augmenting rare cases, to ensuring the synthetic data doesn’t inadvertently leak sensitive info.

  3. 17

    Explainable AI: Opening the Black Box

    In this episode, we look at how researchers are making AI models more transparent and interpretable. We discuss techniques like SHAP values and LIME that explain model predictions by attributing importance to features! So an AI system isn’t just a black box, you can understand why it made a decision. You’ll hear about example use cases (like explaining a medical AI’s diagnosis to a doctor or a loan model’s decision to a loan officer) and recent research into interpreting the internals of neural networks (from visualizing what vision models detect to “probing” language models’ knowledge). By the end, you’ll appreciate the growing toolkit for Explainable AI (XAI) and why it’s crucial for building trust in AI systems.

  4. 16

    Aligning AI with Human Intent: RLHF in Action

     In this episode, we demystify how researchers teach AI models to behave helpfully and safely using Reinforcement Learning from Human Feedback (RLHF). We discuss why even very large models can generate undesired outputs and how RLHF addresses this by incorporating human preferences. You’ll learn how methods like InstructGPT were trained: first by gathering human-written demonstration responses, then by having humans rank model outputs to train a reward model, and finally using reinforcement learning (e.g. with PPO) to fine-tune the model so that it better aligns with what users want. We also talk about improvements like Constitutional AI and why aligning AI with human values is an ongoing challenge.

  5. 15

    AI for Code: How Models Write Software

    This episode explores the rise of AI coding assistants. We discuss how models like OpenAI’s Codex (which powers GitHub Copilot) are trained on millions of code repositories to generate software from natural language prompts. You’ll hear how these models can autocomplete functions or even draft whole programs, and what they’re capable of today, as well as their limits (like generating errors or insecure code if not carefully guided). We also talk about their impact on developer productivity and the future of programming, where AI becomes a pair programmer that can handle the boilerplate, letting developers focus on the creative parts of coding.

  6. 14

    Multimodal Models: Combining Vision, Language, and More

    This episode explores multimodal AI : models that can see, read, and even hear. We explain how models like OpenAI’s CLIP learn joint representations of images and text (by matching pictures with their captions), enabling capabilities like image captioning and visual search. You’ll learn why multimodal systems represent the next leap toward more human-like AI, processing text, images, and audio together for richer understanding. We also discuss recent multimodal breakthroughs (from GPT-4’s vision features to Google’s Gemini) and how they allow AI to analyze content the way we do with multiple senses.

  7. 13

    Efficient Fine-Tuning: Adapting Large Models on a Budget

    This episode dives into strategies for fine-tuning gigantic AI models without needing massive compute. We explain parameter-efficient fine-tuning methods like LoRA (Low-Rank Adaptation), which freezes the original model and trains only small adapter weights, and QLoRA, which goes a step further by quantizing model parameters to 4-bit precision. You’ll learn why techniques like these have become essential for customizing large language models on modest hardware, how they preserve full performance, and what recent results (like fine-tuning a 65B model on a single GPU) mean for practitioners.

  8. 12

    Diffusion Models: AI Image Generation Through Noise

    In this episode, we break down what diffusion models are and why they’ve become the go-to method for AI image generation. You’ll learn how these models gradually add and remove noise to transform random pixels into coherent images, enabling use cases from art creation to image restoration. We also explore recent advances like latent diffusion, which compresses the generation process for efficiency, and discuss how diffusion techniques have achieved state-of-the-art results in text-to-image tasks while remaining flexible for control and guidance.

  9. 11

    Graph Neural Networks: Learning from Connections, Not Just Data

    This episode breaks down what graph neural networks (GNNs) are and why they matter. You’ll learn how GNNs use nodes and edges to represent relationships and how message passing lets models make sense of social, biological, and networked data. We’ll also cover recent advancements like PGNN for multi-modal graphs and common pitfalls like scalability and over-smoothing.

  10. 10

    Neuro-Symbolic AI: Combining Learning With Logic

    In this episode, we explain what neuro-symbolic AI is and why it matters. You’ll learn how neural networks handle patterns, how symbolic systems handle rules, and how combining the two can help models reason more reliably. We also cover real examples where this approach is already being applied in assistants and robotics, showing how it could make AI systems more trustworthy and useful.

  11. 9

    LLMs in Chip Design: How AI Is Entering the Hardware Workflow

    In this episode, we look at how large language models are being used in chip and hardware design. We break down what LLM-aided design actually means, how models can generate HDL code, assist with testbench creation, and even support formal verification. You'll also hear about real-world tools like ChatEDA and how companies are starting to use AI in their silicon development workflows.

  12. 8

    How Embeddings and Vector Databases Power Generative AI

    This episode explains how embedding models turn language into numerical vectors and how vector databases like Pinecone, FAISS, or Weaviate store and search those vectors efficiently. You'll learn how this system enables GenAI models to retrieve relevant information in real-time, power RAG pipelines, and scale up tool-augmented LLM workflows.

  13. 7

    Agentic AI: What Happens When Models Start Acting

    In this episode, we explore agentic AI systems built to not just predict or classify, but to plan, reason, and act autonomously. We break down what makes these models different, how they use tools, memory, and feedback to complete tasks, and why they represent the next step beyond traditional LLMs. You’ll hear how concepts like action loops, world modeling, and autonomous decision-making are shaping everything from research tools to enterprise automation.

  14. 6

    Understanding Attention: Why Transformers Actually Work

    This episode unpacks the attention mechanism at the heart of Transformer models. We explain how self-attention helps models weigh different parts of the input, how it scales in multi-head form, and what makes it different from older architectures like RNNs or CNNs. You’ll walk away with an intuitive grasp of key terms like query, key, value, and how attention layers help with context handling in language, vision, and beyond.

  15. 5

    Markov Chains, Monte Carlo, and HMC: A Deep Dive

    In this episode, we break down the essentials of Markov Chains, Monte Carlo simulations, and Markov Chain Monte Carlo methods. We explain key ideas like memoryless processes, stationary distributions, and how random sampling helps model uncertainty. We also explore gradient-based techniques like Hamiltonian Monte Carlo, highlighting their role in modern statistical modeling. Ideal for anyone curious about the mechanics behind simulations and complex probabilistic models.

  16. 4

    The Model Context Protocol (MCP): Making LLMs Actually Useful

    In this episode, we dive into the Model Context Protocol, or MCP. It’s a new standard that helps large language models connect with real-world tools, data, and APIs in a more structured way. We’ll break down how MCP works, why it matters for building smarter AI agents, and what it means for developers working on enterprise-grade AI systems.

  17. 3

    Generative Adversarial Networks (GANs) Explained: From DL Basics to Real-World Training Tips

    This episode breaks down how GANs work by starting with deep learning basics like CNNs, gradient descent, and regularization. We then get into what actually goes wrong when training these models and how to deal with it. It’s practical, straightforward, and meant for anyone trying to make sense of GANs in the real world.

  18. 2

    Bayesian vs. Frequentist Thinking in Marketing Mix Modeling

    In this episode, we unpack how Bayesian and Frequentist statistical approaches tackle marketing performance analysis, focusing on Marketing Mix Modeling (MMM). You’ll learn the key differences in interpretation, how Bayesian methods enable sequential updates and uncertainty modeling, and why they’re gaining traction in modern marketing analytics. Ideal for marketers, data scientists, and anyone curious about the “why” behind the math.

Type above to search every episode's transcript for a word or phrase. Matches are scoped to this podcast.

Searching…

We're indexing this podcast's transcripts for the first time — this can take a minute or two. We'll show results as soon as they're ready.

No matches for "" in this podcast's transcripts.

Showing of matches

No topics indexed yet for this podcast.

Loading reviews...

ABOUT THIS SHOW

Distilling AI/ML theory into practical insights. One concept at a time. No jargon.

HOSTED BY

Mo Bhuiyan via NotebookLM

CATEGORIES

URL copied to clipboard!