Ops teams are pets, not cattle (ep. 556) episode artwork

EPISODE · Apr 19, 2023 · 23 MIN

Ops teams are pets, not cattle (ep. 556)

from The Stack Overflow Podcast

A common refrain you’ll hear these days is that servers should be scaled out, easy to replace, and interchangeable—cattle, not pets. But for the ops folks who run those servers the opposite is true. You can’t just throw any of them into an incident where they may not know the stack or system and expect everything to work out. Every operator has a set of skills that they’ve built up through research or experience, and teams should value them as such. They’re people, not pets, and certainly not cattle—you can’t just get a new one when you burn out your existing ones. On this episode of the podcast—sponsored by Chronosphere—we talk with Paige Cruz, Senior Developer Advocate at Chronosphere, about how teams can reduce the cognitive load on ops, the best ways to prepare for inevitable failures, and where the worst place to page Paige is. Episode notes:Chronosphere provides an observability platform for ops people, so naturally, the company has an interest in the happiness of those people. If you’re interested in the history of the pets vs. cattle concept , this covers it pretty well. Previously, we spoke with the CEO of Chronosphere about making incidents easier to manage. We’ve covered this topic on the blog before, and two articles came up during our conversation with Paige. You can connect with Paige on Twitter, where she has a pretty apropos handle. Congrats to Stellar Question badge winner Bruno Rocha for asking How can I read large text files line by line, without loading them into memory?, which at least 100 users liked enough to bookmark.  See Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.

A common refrain you’ll hear these days is that servers should be scaled out, easy to replace, and interchangeable—cattle, not pets. But for the ops folks who run those servers the opposite is true. You can’t just throw any of them into an incident where they may not know the stack or system and expect everything to work out. Every operator has a set of skills that they’ve built up through research or experience, and teams should value them as such. They’re people, not pets, and certainly not cattle—you can’t just get a new one when you burn out your existing ones. On this episode of the podcast—sponsored by Chronosphere—we talk with Paige Cruz, Senior Developer Advocate at Chronosphere, about how teams can reduce the cognitive load on ops, the best ways to prepare for inevitable failures, and where the worst place to page Paige is. Episode notes:Chronosphere provides an observability platform for ops people, so naturally, the company has an interest in the happiness of those people. If you’re interested in the history of the pets vs. cattle concept , this covers it pretty well. Previously, we spoke with the CEO of Chronosphere about making incidents easier to manage. We’ve covered this topic on the blog before, and two articles came up during our conversation with Paige. You can connect with Paige on Twitter, where she has a pretty apropos handle. Congrats to Stellar Question badge winner Bruno Rocha for asking How can I read large text files line by line, without loading them into memory?, which at least 100 users liked enough to bookmark.   See Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.

NOW PLAYING

Ops teams are pets, not cattle (ep. 556)

0:00 23:25

No transcript for this episode yet

We transcribe on demand. Request one and we'll notify you when it's ready — usually under 10 minutes.

Frequently Asked Questions

How long is this episode of The Stack Overflow Podcast?

This episode is 23 minutes long.

When was this The Stack Overflow Podcast episode published?

This episode was published on April 19, 2023.

What is this episode about?

A common refrain you’ll hear these days is that servers should be scaled out, easy to replace, and interchangeable—cattle, not pets. But for the ops folks who run those servers the opposite is true. You can’t just throw any of them into an incident...

Can I download this The Stack Overflow Podcast episode?

Yes, you can download this episode by clicking the download button on the episode player, or subscribe to the podcast in your preferred podcast app for automatic downloads.
URL copied to clipboard!