EPISODE · May 22, 2026 · 18 MIN
GPUs, Kubernetes & AI Infrastructure Realities
from Virtually Speaking Podcast · host Virtually Speaking Podcast
At KubeCon 2026, Pete Flecha and John Nicholson sit down with VMware by Broadcom’s Frank Denneman to explore one of the biggest infrastructure conversations happening in AI today: should Kubernetes workloads run on bare metal or virtualized infrastructure? The discussion dives deep into how AI workloads are changing infrastructure design, why Kubernetes and virtualization are becoming increasingly connected, and how technologies like DRS and Dynamic Resource Allocation (DRA) are evolving to support modern GPU-intensive environments. Frank explains the operational, security, and resource management challenges organizations face as AI adoption accelerates — especially when dealing with expensive GPU clusters, multi-tenant AI workloads, and the rise of AI agents. Topics include: Why virtualization still matters for Kubernetes and AI GPU scheduling, topology awareness, and resource isolation DRA (Dynamic Resource Allocation) in Kubernetes AI infrastructure efficiency and GPU utilization Security and isolation for AI agents and workloads Token governance and AI operational guardrails Lessons learned from decades of virtualization applied to AI infrastructure If you’re trying to understand where Kubernetes, virtualization, and AI infrastructure are headed next, this is a conversation you won’t want to miss.
What this episode covers
At KubeCon 2026, Pete Flecha and John Nicholson sit down with VMware by Broadcom’s Frank Denneman to explore one of the biggest infrastructure conversations happening in AI today: should Kubernetes workloads run on bare metal or virtualized infrastructure? The discussion dives deep into how AI workloads are changing infrastructure design, why Kubernetes and virtualization are becoming increasingly connected, and how technologies like DRS and Dynamic Resource Allocation (DRA) are evolving to support modern GPU-intensive environments. Frank explains the operational, security, and resource management challenges organizations face as AI adoption accelerates — especially when dealing with expensive GPU clusters, multi-tenant AI workloads, and the rise of AI agents. Topics include: Why virtualization still matters for Kubernetes and AI GPU scheduling, topology awareness, and resource isolation DRA (Dynamic Resource Allocation) in Kubernetes AI infrastructure efficiency and GPU utilization Security and isolation for AI agents and workloads Token governance and AI operational guardrails Lessons learned from decades of virtualization applied to AI infrastructure If you’re trying to understand where Kubernetes, virtualization, and AI infrastructure are headed next, this is a conversation you won’t want to miss.
NOW PLAYING
GPUs, Kubernetes & AI Infrastructure Realities
No transcript for this episode yet
Similar Episodes
Mar 26, 2026 ·1m
Jan 2, 2026 ·47m
Dec 21, 2025 ·46m