All Episodes

Reliability Enablers — 70 episodes

#
Title
1

You (and AI) can't automate reliability away

2

#67 Why the SRE Book Fails Most Orgs — Lessons from a Google Veteran

3

#66 - Unpacking 2025 SRE Report’s Damning Findings

4

#65 - In Critical Systems, 99.9% Isn’t Reliable — It’s a Liability

5

#64 - Using AI to Reduce Observability Costs

6

#63 - Does "Big Observability" Neglect Mobile?

7

#62 - Early YouTube SRE shares Modern Reliability Strategy

8

#61 Scott Moore on SRE, Performance Engineering, and More

9

#60 How to NOT fail in Platform Engineering

10

#59 Who handles monitoring in your team and how?

11

#58 Fixing Monitoring's Bad Signal-to-Noise Ratio

12

#57 How Technical Leads Support Software Reliability

13

#56 Resolving DORA Metrics Mistakes

14

#55 3 Uses for Monitoring Data Other Than Alerts and Dashboards

15

#54 Becoming a Valuable Engineer Without Sacrificing Your Sanity

16

#53 What's Missing in Incident Response Processes?

17

Can ITIL Benefit from Site Reliability Engineering?

18

#52 Navigating Complexity within Incidents

19

#51 Whitebox vs Blackbox Monitoring

20

#50 Making Better Sense of Observability Data

21

#49 Alert Fatigue is Still an Issue - Here's How We Fix it

22

#48 Cutting Down "Toil" aka Manual Work in Software

23

#47 How to Grow Team Impact Through Learning Culture

24

#46 Platform Team Design According to Team Topologies

25

#45 How Team Topologies Can Guide Enabling Teams

26

#44 - Making SLOs Matter to Stakeholders

27

#43 - SLOs: a Deeper Dive into its Mechanics

28

#42 - Hitting Software SLA Targets through SLOs and SLIs

29

#41 Curbing High Observability Costs

30

#40 How to Enable Observability for Success

31

#39 How Chaos Engineering Helps Reduce Incident Risk

32

#38 The Real Cost of Software Reliability & Downtime

33

#37 An SRE Approach to Managing Technology Risk

34

#36 Avoiding Critical Platform Engineering Mistakes

35

#35 Boosting Your Observability Data's Usability

36

#34 From Cloud to Concrete: Should You Return to On-Prem?

37

#33 Inside Google's Data Center Design

38

#32 Clarifying Platform Engineering's Role (with Ajay Chankramath) BONUS EP

39

#31 Introduction to FinOps (with Ajay Chankramath)

40

#30 Clearing Delusions in Observability (with David Caudill)

41

#29 - Reacting to Google's SRE Book 2016 (Chapter 1 Part 2)

42

#28 - Reacting to Google's SRE Book 2016 (Chapter 1 Part 1)

43

#27 - Growing as a Site Reliability Engineer (Part 3)

44

#26 - Growing as a Site Reliability Engineer (Part 2)

45

#25 - DORA and the Pursuit of Engineering Excellence (with Tim Wheeler)

46

#24 - Growing as a Site Reliability Engineer (Part 1)

47

#23 - The Danger of Unreliable Platforms (with Jade Rubick)

48

#22 - How Google does SRE Consulting (with Yury Niño Roa)

49

#21 - Better SRE in 2024 is all we can hope for

50

#20 Holiday Special with Stephen Townshend

51

#19 How to Develop Early Career Engineers (with John Hyland)

52

#18 Winning at SRE in Banking and Telecom (with Troy Koss)

53

#17 Lessons from SRE's Wild West Days (with Rick Boone)

54

#16 Acing Cloud Infra in Digital Media Giant (with Sreejith Chelanchery)

55

#15 Growing Reliability Engineering Across 5+ Companies (with Nash Seshan)

56

#14 Faster Incident Resolution through Data-Driven Notebooks (with Ivan Merrill)

57

#13 Making Sense of OpenTelemetry and Observability (with Adriana Villela)

58

#12 From Incident Firefighting to Reliability First (with Robert Ross)

59

#11 Rising to Staff Engineer in DevOps and SRE (with Rajesh Reddy N)

60

#10 Using AI for Kubernetes troubleshooting self-service (with Kyle Forster)

61

#9 Inside Booking.com's Site Reliability Engineering practice (with Samuele Tonon and Yoann Fouquet)

62

#8 Software Reliability Ninja Who is NOT an SRE (with Pablo Bouzada)

63

What happened to the podcast?

64

#7 Bringing HR onboard with SRE hiring and onboarding

65

#6 Building a successful SRE practice through capabilities

66

#5 Where does SRE fit into your organization's structure?

67

#4 Should organizations care about SRE?

68

#3 SRE vs DevOps vs Platform Engineering

69

#2 What is Site Reliability Engineering (SRE) and what is not SRE?

70

#1 Introducing the SREpath podcast