How SRE Teams Use Load Shedding to Survive Traffic Spikes episode artwork

EPISODE · Jun 4, 2026 · 9 MIN

How SRE Teams Use Load Shedding to Survive Traffic Spikes

from The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering · host Fexingo

When a massive traffic spike hits, every millisecond of latency can cost thousands of dollars. In this episode, Lucas and Luna explore load shedding — the SRE technique of intentionally dropping non-critical requests to keep core systems running. They walk through how Google SREs used load shedding during the 2020 YouTube outage, how Stripe applies graceful degradation during payment surges, and why Netflix deliberately kills low-priority traffic during peak hours. They also break down the mental shift required: treating load shedding as a feature, not a failure. If you're an SRE, platform engineer, or just someone who wonders why services fail gracefully sometimes and fall over completely other times, this one's for you. #SiteReliabilityEngineering #LoadShedding #TrafficSpikes #GoogleSRE #Stripe #Netflix #GracefulDegradation #CapacityPlanning #IncidentResponse #SREBestPractices #Observability #PriorityBasedShedding #FexingoBusiness #BusinessPodcast #Technology #Podcast #SRE #Uptime Keep every episode free: buymeacoffee.com/fexingo

When a massive traffic spike hits, every millisecond of latency can cost thousands of dollars. In this episode, Lucas and Luna explore load shedding — the SRE technique of intentionally dropping non-critical requests to keep core systems running. They walk through how Google SREs used load shedding during the 2020 YouTube outage, how Stripe applies graceful degradation during payment surges, and why Netflix deliberately kills low-priority traffic during peak hours. They also break down the mental shift required: treating load shedding as a feature, not a failure. If you're an SRE, platform engineer, or just someone who wonders why services fail gracefully sometimes and fall over completely other times, this one's for you. #SiteReliabilityEngineering #LoadShedding #TrafficSpikes #GoogleSRE #Stripe #Netflix #GracefulDegradation #CapacityPlanning #IncidentResponse #SREBestPractices #Observability #PriorityBasedShedding #FexingoBusiness #BusinessPodcast #Technology #Podcast #SRE #Uptime Keep every episode free: buymeacoffee.com/fexingo

NOW PLAYING

How SRE Teams Use Load Shedding to Survive Traffic Spikes

0:00 9:51

No transcript for this episode yet

We transcribe on demand. Request one and we'll notify you when it's ready — usually under 10 minutes.

Frequently Asked Questions

How long is this episode of The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering?

This episode is 9 minutes long.

When was this The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering episode published?

This episode was published on June 4, 2026.

What is this episode about?

When a massive traffic spike hits, every millisecond of latency can cost thousands of dollars. In this episode, Lucas and Luna explore load shedding — the SRE technique of intentionally dropping non-critical requests to keep core systems running....

Can I download this The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering episode?

Yes, you can download this episode by clicking the download button on the episode player, or subscribe to the podcast in your preferred podcast app for automatic downloads.
URL copied to clipboard!