How Stripe Migrated Payment Routing to 99.999% Uptime episode artwork

EPISODE · Jun 16, 2026 · 8 MIN

How Stripe Migrated Payment Routing to 99.999% Uptime

from The CTO Podcast with Fexingo: Technical Leadership, Architecture, and Engineering Org · host Fexingo

Episode 55 of The CTO Podcast dives into how Stripe rebuilt its payment routing engine to achieve 99.999% uptime. Lucas and Luna break down the architectural shift from a monolithic routing layer to a distributed, deterministic system that handles millions of transactions per second. They explore the team's decision to move away from traditional load balancers, the role of formal verification in routing logic, and how Stripe's engineers stress-tested the system with simulated global outages. Along the way, they discuss the trade-offs between latency and consistency, and why a gradual canary deployment was critical. This episode offers concrete lessons for engineering leaders designing fault-tolerant systems at scale. #Stripe #PaymentRouting #99.999PercentUptime #DistributedSystems #Architecture #FaultTolerance #FormalVerification #CanaryDeployment #LatencyConsistencyTradeoff #PaymentProcessing #EngineeringLeadership #SystemDesign #HighAvailability #BusinessAndTechnology #FexingoBusiness #BusinessPodcast #CTOPodcast #TechLeadership Keep every episode free: buymeacoffee.com/fexingo

Episode 55 of The CTO Podcast dives into how Stripe rebuilt its payment routing engine to achieve 99.999% uptime. Lucas and Luna break down the architectural shift from a monolithic routing layer to a distributed, deterministic system that handles millions of transactions per second. They explore the team's decision to move away from traditional load balancers, the role of formal verification in routing logic, and how Stripe's engineers stress-tested the system with simulated global outages. Along the way, they discuss the trade-offs between latency and consistency, and why a gradual canary deployment was critical. This episode offers concrete lessons for engineering leaders designing fault-tolerant systems at scale. #Stripe #PaymentRouting #99.999PercentUptime #DistributedSystems #Architecture #FaultTolerance #FormalVerification #CanaryDeployment #LatencyConsistencyTradeoff #PaymentProcessing #EngineeringLeadership #SystemDesign #HighAvailability #BusinessAndTechnology #FexingoBusiness #BusinessPodcast #CTOPodcast #TechLeadership Keep every episode free: buymeacoffee.com/fexingo

NOW PLAYING

How Stripe Migrated Payment Routing to 99.999% Uptime

0:00 8:49

No transcript for this episode yet

We transcribe on demand. Request one and we'll notify you when it's ready — usually under 10 minutes.

Frequently Asked Questions

How long is this episode of The CTO Podcast with Fexingo: Technical Leadership, Architecture, and Engineering Org?

This episode is 8 minutes long.

When was this The CTO Podcast with Fexingo: Technical Leadership, Architecture, and Engineering Org episode published?

This episode was published on June 16, 2026.

What is this episode about?

Episode 55 of The CTO Podcast dives into how Stripe rebuilt its payment routing engine to achieve 99.999% uptime. Lucas and Luna break down the architectural shift from a monolithic routing layer to a distributed, deterministic system that handles...

Can I download this The CTO Podcast with Fexingo: Technical Leadership, Architecture, and Engineering Org episode?

Yes, you can download this episode by clicking the download button on the episode player, or subscribe to the podcast in your preferred podcast app for automatic downloads.
URL copied to clipboard!