How Datadog Monitors Its Own Monolith at Scale episode artwork

EPISODE · May 30, 2026 · 8 MIN

How Datadog Monitors Its Own Monolith at Scale

from The CTO Podcast with Fexingo: Technical Leadership, Architecture, and Engineering Org · host Fexingo

Episode 20 of The CTO Podcast dives into a paradox: how does Datadog, the company that sells observability software, actually monitor its own massive monolith? Lucas and Luna walk through the architecture behind Datadog's internal dogfooding strategy — a single codebase that handles millions of metrics per second. They explore the tradeoffs of keeping a monolith versus microservices, how the engineering team built an internal tool called 'Watchtower' to catch regressions before they hit customers, and why Datadog's CTO decided against splitting the core observability pipeline into separate services. Along the way, they reveal a specific threshold: 1.2 million events per second per host, and how the team tracks it. A concrete look at how one company eats its own dog food at planetary scale. #Datadog #Observability #Monolith #EngineeringArchitecture #Dogfooding #Watchtower #Scalability #MetricsPipeline #CTO #TechnicalLeadership #BusinessAndTechnology #Fexingo #FexingoBusiness #BusinessPodcast #Podcast #SoftwareEngineering #Infrastructure #SRE Keep every episode free: buymeacoffee.com/fexingo

Episode 20 of The CTO Podcast dives into a paradox: how does Datadog, the company that sells observability software, actually monitor its own massive monolith? Lucas and Luna walk through the architecture behind Datadog's internal dogfooding strategy — a single codebase that handles millions of metrics per second. They explore the tradeoffs of keeping a monolith versus microservices, how the engineering team built an internal tool called 'Watchtower' to catch regressions before they hit customers, and why Datadog's CTO decided against splitting the core observability pipeline into separate services. Along the way, they reveal a specific threshold: 1.2 million events per second per host, and how the team tracks it. A concrete look at how one company eats its own dog food at planetary scale. #Datadog #Observability #Monolith #EngineeringArchitecture #Dogfooding #Watchtower #Scalability #MetricsPipeline #CTO #TechnicalLeadership #BusinessAndTechnology #Fexingo #FexingoBusiness #BusinessPodcast #Podcast #SoftwareEngineering #Infrastructure #SRE Keep every episode free: buymeacoffee.com/fexingo

NOW PLAYING

How Datadog Monitors Its Own Monolith at Scale

0:00 8:26

No transcript for this episode yet

We transcribe on demand. Request one and we'll notify you when it's ready — usually under 10 minutes.

Frequently Asked Questions

How long is this episode of The CTO Podcast with Fexingo: Technical Leadership, Architecture, and Engineering Org?

This episode is 8 minutes long.

When was this The CTO Podcast with Fexingo: Technical Leadership, Architecture, and Engineering Org episode published?

This episode was published on May 30, 2026.

What is this episode about?

Episode 20 of The CTO Podcast dives into a paradox: how does Datadog, the company that sells observability software, actually monitor its own massive monolith? Lucas and Luna walk through the architecture behind Datadog's internal dogfooding...

Can I download this The CTO Podcast with Fexingo: Technical Leadership, Architecture, and Engineering Org episode?

Yes, you can download this episode by clicking the download button on the episode player, or subscribe to the podcast in your preferred podcast app for automatic downloads.
URL copied to clipboard!