PodParley - Discover, Search, and Explore Podcasts

1

What the Agentic AI is happening to SRE?

Jun 12, 2026

23:45

2

You (and AI) can't automate reliability away

Dec 2, 2025

28:20

3

#67 Why the SRE Book Fails Most Orgs — Lessons from a Google Veteran

Jul 15, 2025

30:47

4

#66 - Unpacking 2025 SRE Report’s Damning Findings

Jul 1, 2025

30:16

5

#65 - In Critical Systems, 99.9% Isn’t Reliable — It’s a Liability

Jun 17, 2025

28:28

6

#64 - Using AI to Reduce Observability Costs

Jan 28, 2025

20:42

7

#63 - Does "Big Observability" Neglect Mobile?

Nov 12, 2024

29:11

8

#62 - Early Youtube SRE shares Modern Reliability Strategy

Nov 5, 2024

35:33

9

#61 Scott Moore on SRE, Performance Engineering, and More

Oct 22, 2024

38:13

10

#60 How to NOT fail in Platform Engineering

Oct 1, 2024

30:34

11

#59 Who handles monitoring in your team and how?

Sep 24, 2024

8:17

12

#58 Fixing Monitoring's Bad Signal-to-Noise Ratio

Sep 17, 2024

8:27

13

#57 How Technical Leads Support Software Reliability

Sep 10, 2024

31:34

14

#56 Resolving DORA Metrics Mistakes

Sep 4, 2024

26:47

15

#55 3 Uses for Monitoring Data Other Than Alerts and Dashboards

Aug 27, 2024

11:02

16

#54 Becoming a Valuable Engineer Without Sacrificing Your Sanity

Aug 20, 2024

37:23

17

#53 What's Missing in Incident Response Processes?

Aug 15, 2024

9:43

18

Can ITIL Benefit from Site Reliability Engineering?

Aug 13, 2024

29:23

19

#52 Navigating Complexity within Incidents

Aug 6, 2024

36:52

20

#51 Whitebox vs Blackbox Monitoring

Jul 30, 2024

9:56

21

#50 Making Better Sense of Observability Data

Jul 9, 2024

24:38

22

#49 Alert Fatigue is Still an Issue - Here's How We Fix it

Jul 2, 2024

30:13

23

#48 Cutting Down "Toil" aka Manual Work in Software

Jun 25, 2024

44:03

24

#47 How to Grow Team Impact Through Learning Culture

Jun 18, 2024

28:38

25

#46 Platform Team Design According to Team Team Topologies

Jun 11, 2024

24:07

26

#45 How Team Topologies Can Guide Enabling Teams

Jun 4, 2024

25:09

27

#44 - Making SLOs Matter to Stakeholders

May 30, 2024

20:22

28

#43 - SLOs: a Deeper Dive into its Mechanics

May 28, 2024

31:44

29

#42 - Hitting Software SLA Targets through SLOs and SLIs

May 21, 2024

29:18

30

#41 Curbing High Observability Costs

May 14, 2024

24:34

31

#40 How to Enable Observability for Success

May 7, 2024

27:59

32

#39 How Chaos Engineering Helps Reduce Incident Risk

Apr 30, 2024

24:47

33

#38 The Real Cost of Software Reliability & Downtime

Apr 23, 2024

23:51

34

#37 An SRE Approach to Managing Technology Risk

Apr 16, 2024

30:08

35

#36 Avoiding Critical Platform Engineering Mistakes

Apr 9, 2024

26:56

36

#35 Boosting Your Observability Data's Usability

Apr 2, 2024

35:04

37

#34 From Cloud to Concrete: Should You Return to On-Prem?

Mar 26, 2024

22:59

38

#33 Inside Google's Data Center Design

Mar 19, 2024

23:11

39

#32 Clarifying Platform Engineering's Role (with Ajay Chankramath) BONUS EP

Mar 14, 2024

16:58

40

#31 Introduction to FinOps (with Ajay Chankramath)

Mar 12, 2024

26:37

41

#30 Clearing Delusions in Observability (with David Caudill)

Mar 7, 2024

37:16

42

#29 - Reacting to Google's SRE book 2016 (Chapter 1 Part 2)

Feb 27, 2024

31:25

43

#28 - Reacting to Google's SRE Book 2016 (Chapter 1 Part 1)

Feb 20, 2024

25:45

44

#27 - Growing as a Site Reliability Engineer (Part 3)

Feb 13, 2024

16:29

45

#26 - Growing as a Site Reliability Engineer (Part 2)

Feb 8, 2024

19:06

46

#25 - DORA and the Pursuit of Engineering Excellence (with Tim Wheeler)

Jan 30, 2024

37:48

47

#24 - Growing as a Site Reliability Engineer (Part 1)

Jan 23, 2024

8:42

48

#23 - The Danger of Unreliable Platforms (with Jade Rubick)

Jan 16, 2024

29:05

49

#22 - How Google does SRE Consulting (with Yury Niño Roa)

Jan 9, 2024

35:51

50

#21 - Better SRE in 2024 is all we can hope for

Jan 2, 2024

32:25

51

#20 Holiday Special with Stephen Townshend

Dec 19, 2023

29:32

52

#19 How to Develop Early Career Engineers (with John Hyland)

Dec 12, 2023

40:35

53

#18 Winning at SRE in Banking and Telecom (with Troy Koss)

Dec 5, 2023

35:06

54

#17 Lessons from SRE's Wild West Days (with Rick Boone)

Nov 27, 2023

46:23

55

#16 Acing Cloud Infra in Digital Media Giant (with Sreejith Chelanchery)

Nov 21, 2023

39:24

56

#15 Growing Reliability Engineering Across 5+ Companies (with Nash Seshan)

Nov 14, 2023

42:44

57

#14 Faster Incident Resolution through Data-Driven Notebooks (with Ivan Merrill)

Nov 7, 2023

41:33

58

#13 Making Sense of OpenTelemetry and Observability (with Adriana Villela)

Oct 31, 2023

32:52

59

#12 From Incident Firefighting to Reliability First (with Robert Ross)

Oct 24, 2023

29:07

60

#11 Rising to Staff Engineer in DevOps and SRE (with Rajesh Reddy N)

Oct 17, 2023

26:38

61

#10 Using AI for Kubernetes troubleshooting self-service (with Kyle Forster)

Oct 10, 2023

24:18

62

#9 Inside Booking.com's Site Reliability Engineering practice (with Samuele Tonon and Yoann Fouquet)

Oct 2, 2023

28:54

63

#8 Software Reliability Ninja Who is NOT an SRE (with Pablo Bouzada)

Sep 11, 2023

22:39

64

What happened to the podcast?

Sep 5, 2023

3:31

65

#7 Bringing HR onboard with SRE hiring and onboarding

Jul 13, 2023

25:26

66

#6 Building a successful SRE practice through capabilities

Jun 29, 2023

15:32

67

#5 Where does SRE fit into your organization's structure?

Jun 15, 2023

17:02

68

#4 Should organizations care about SRE?

Jun 1, 2023

18:44

69

#3 SRE vs DevOps vs Platform Engineering

May 17, 2023

22:53

70

#2 What is Site Reliability Engineering (SRE) and what is not SRE?

May 4, 2023

23:55

71

#1 Introducing the SREpath podcast

Apr 20, 2023

21:06

All Episodes