EPISODE · May 30, 2026 · 10 MIN
How SRE Teams Use Canary Deployments to Reduce Blast Radius
from The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering · host Fexingo
In this episode of The Site Reliability Podcast, Lucas and Luna dive into the practice of canary deployments, a key strategy for reducing blast radius in production. They break down how teams like Etsy and Netflix use phased rollouts to catch issues early, citing specific numbers: Etsy's Deployinator halved deployment failures after adopting canaries, and Netflix's Spinnaker pipeline automatically rolls back if error rates spike by just 1 percent. Lucas explains the optimal canary size (5 to 10 percent of traffic), the metrics to watch (latency, error rate, CPU usage), and why automating the rollout is critical. Luna questions whether canaries slow down velocity, and the two discuss the trade-off between speed and safety. The episode also covers how to design a canary pipeline for microservices, including the use of feature flags and observability tools like Prometheus and Grafana. Recorded on May 30, 2026, this conversation gives SREs a practical guide to deploying with confidence and avoiding the all-at-once rollbacks that cause chaos.
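The automated rollback check mentioned in the summary can be pictured with a minimal sketch like the one below. It is not Spinnaker's or Deployinator's actual logic. The `Metrics` type, the `canary_verdict` function, the latency ratio, and the minimum request count are all hypothetical names and values chosen for illustration, and the "1 percent" threshold is assumed here to mean one percentage point of error rate above the baseline.

```python
from dataclasses import dataclass


@dataclass
class Metrics:
    """Aggregated metrics for one fleet (baseline or canary) over a window."""
    requests: int
    errors: int
    p99_latency_ms: float

    @property
    def error_rate(self) -> float:
        # Guard against division by zero when no traffic has arrived yet.
        return self.errors / self.requests if self.requests else 0.0


def canary_verdict(
    baseline: Metrics,
    canary: Metrics,
    max_error_delta: float = 0.01,   # assumption: 1 percentage point
    max_latency_ratio: float = 1.2,  # hypothetical: 20% slower p99 is too slow
    min_requests: int = 500,         # hypothetical: minimum sample size
) -> str:
    """Return "wait", "rollback", or "promote" for the canary."""
    if canary.requests < min_requests:
        return "wait"  # not enough traffic to judge
    if canary.error_rate - baseline.error_rate > max_error_delta:
        return "rollback"
    if canary.p99_latency_ms > baseline.p99_latency_ms * max_latency_ratio:
        return "rollback"
    return "promote"


if __name__ == "__main__":
    baseline = Metrics(requests=10_000, errors=50, p99_latency_ms=200.0)
    # Canary with a 2% error rate against a 0.5% baseline.
    print(canary_verdict(baseline, Metrics(1_000, 20, 210.0)))  # rollback
    # Canary with error rate and latency close to the baseline.
    print(canary_verdict(baseline, Metrics(1_000, 6, 210.0)))   # promote
```

In practice the metrics would come from an observability stack such as Prometheus, and the verdict would drive the deployment pipeline. Returning "wait" for small samples keeps the check from rolling back on the first handful of requests.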