EPISODE · Jun 14, 2026 · 11 MIN
How SRE Teams Use Stress Testing to Simulate Real Workloads
from The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering · host Fexingo
Lucas and Luna explore how production stress testing goes beyond standard load testing to simulate realistic user behavior, with a deep dive into how a major streaming platform used session replay and gradual ramp-up to validate infrastructure before a global event. They unpack why stress testing must replicate authentication flows, API call patterns, and edge case traffic shapes — not just raw requests per second. The episode explains how SRE teams combine production shadowing, canary analysis, and real-user monitoring data to build stress tests that catch issues traditional benchmarks miss. A practical look at a specific technique that prevents cascading failures during peak traffic. #SiteReliabilityEngineering #StressTesting #ProductionTesting #LoadTesting #Observability #ChaosEngineering #Infrastructure #SRE #Reliability #Performance #Scalability #FaultTolerance #CapacityPlanning #RealUserMonitoring #CanaryDeployments #Technology #FexingoBusiness #BusinessPodcast Keep every episode free: buymeacoffee.com/fexingo
What this episode covers
Lucas and Luna explore how production stress testing goes beyond standard load testing to simulate realistic user behavior, with a deep dive into how a major streaming platform used session replay and gradual ramp-up to validate infrastructure before a global event. They unpack why stress testing must replicate authentication flows, API call patterns, and edge case traffic shapes — not just raw requests per second. The episode explains how SRE teams combine production shadowing, canary analysis, and real-user monitoring data to build stress tests that catch issues traditional benchmarks miss. A practical look at a specific technique that prevents cascading failures during peak traffic. #SiteReliabilityEngineering #StressTesting #ProductionTesting #LoadTesting #Observability #ChaosEngineering #Infrastructure #SRE #Reliability #Performance #Scalability #FaultTolerance #CapacityPlanning #RealUserMonitoring #CanaryDeployments #Technology #FexingoBusiness #BusinessPodcast Keep every episode free: buymeacoffee.com/fexingo
NOW PLAYING
How SRE Teams Use Stress Testing to Simulate Real Workloads
No transcript for this episode yet
Similar Episodes
Mar 26, 2026 ·1m
Mar 19, 2026 ·34m
Feb 18, 2026 ·11m
Feb 11, 2026 ·45m