EPISODE · May 25, 2026 · 7 MIN
How Cloudflare Handles 46 Million Requests Per Second With SRE
from The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering · host Fexingo
In this episode of The Site Reliability Podcast, Lucas and Luna examine how Cloudflare's SRE team processes over 46 million HTTP requests per second across its global edge network. They cover 'edge of network' infrastructure, the role of anycast routing in distributing load, and the automated canary deployments the team uses to catch failures before they impact customers. Lucas breaks down the specific alerting thresholds that trigger human intervention versus automated rollback, and Luna challenges him on the limits of automation in incident response. The episode also covers how Cloudflare's post-incident review process differs from traditional postmortems, focusing on blameless analysis and systemic fixes. This case study offers a behind-the-scenes look at how one of the internet's largest traffic intermediaries keeps its infrastructure running. Keep every episode free: buymeacoffee.com/fexingo