EPISODE · Jun 1, 2026 · 8 MIN
Why Kubernetes Pod Resource Limits Cause Latency Spikes
from DevOps Daily with Fexingo: CI/CD, Kubernetes, and Modern Software Operations · host Fexingo
Episode 24 of DevOps Daily with Fexingo digs into a subtle but painful Kubernetes performance trap: the relationship between pod resource limits (CPU throttling) and application latency. Lucas and Luna examine a real-world case from a fintech company where setting CPU limits too low caused tail-latency spikes during normal traffic. They explain the difference between compressible and incompressible resources, how the Completely Fair Scheduler enforces CPU limits, and why setting requests without limits — or using the 'burst' capability wisely — often delivers better performance. The hosts also cover monitoring for throttling, the role of cgroup v2, and the trade-offs of over-provisioning. This episode is essential for any team running latency-sensitive workloads on Kubernetes. #Kubernetes #DevOps #Performance #CPULimits #Latency #Throttling #CFS #CgroupV2 #PodResources #Fintech #SiteReliabilityEngineering #ContainerOrchestration #CloudNative #Infrastructure #Monitoring #FexingoBusiness #BusinessPodcast #Technology Keep every episode free: buymeacoffee.com/fexingo
What this episode covers
Episode 24 of DevOps Daily with Fexingo digs into a subtle but painful Kubernetes performance trap: the relationship between pod resource limits (CPU throttling) and application latency. Lucas and Luna examine a real-world case from a fintech company where setting CPU limits too low caused tail-latency spikes during normal traffic. They explain the difference between compressible and incompressible resources, how the Completely Fair Scheduler enforces CPU limits, and why setting requests without limits — or using the 'burst' capability wisely — often delivers better performance. The hosts also cover monitoring for throttling, the role of cgroup v2, and the trade-offs of over-provisioning. This episode is essential for any team running latency-sensitive workloads on Kubernetes. #Kubernetes #DevOps #Performance #CPULimits #Latency #Throttling #CFS #CgroupV2 #PodResources #Fintech #SiteReliabilityEngineering #ContainerOrchestration #CloudNative #Infrastructure #Monitoring #FexingoBusiness #BusinessPodcast #Technology Keep every episode free: buymeacoffee.com/fexingo
NOW PLAYING
Why Kubernetes Pod Resource Limits Cause Latency Spikes
No transcript for this episode yet
Similar Episodes
Mar 26, 2026 ·1m
Mar 19, 2026 ·34m
Feb 18, 2026 ·11m
Feb 11, 2026 ·45m