PODCAST · technology
Page it to the Limit
by PagerDuty Developer Relations
Page It to the Limit is a podcast that focuses on what it means to operate software in production. Hosted by the PagerDuty Developer Relations Team, we cover the leading practices used in the software industry to improve both system reliability and the lives of the people responsible for supporting it.
-
141
Harnessing AI With Lev Andelman
Additional Resources TeraSky Mitchell Hashimoto’s post, My AI Adoption Journey OpenAI’s follow up, Harness engineering: leveraging Codex in an agent-first world Scott Rosenberg, Backstage As The Ultimate MCP Server Learn more about TeraSky’s Backstage plugins Our prior episodes on Platform Engineering feature Abby Bangser on what is Platform Engineering, Dave Bresci on IDPs, and Avantik Iyer on Backstage Join the PagerDuty Commons! PagerDuty Home Page Episode edited by Mandy Moore Transcripts by Rev
-
140
Built for Devs With Tessa Kriesel Part 2
Additional Resources Developer Relations: Trust is Built in the Trenches, Not Through Marketing. The Apple story Tessa refers to. Jobs to be Done. Built for Devs Built for Devs on Product Hunt Check out Tessa’s Developer Adoption Playbook See the whole episode recording on our YouTube channel Join the PagerDuty Commons! PagerDuty Home Page Episode edited by Mandy Moore Transcripts by Rev
-
139
The Interest Is Compounding: All About Tech Debt
Today’s episode is about technical debt, not as a cautionary tale, but as a lens. We take a closer look at where debt comes from, how it quietly rewires teams, and why paying it off is rarely just a matter of “cleaning up code.” Along the way, we’ll examine two real-world stories: one where unaddressed debt led to a $440 million disaster, and another where a company used an infrastructure overhaul to rebuild architectural trust. This is about more than code. It’s about momentum, memory, and the systems we inherit.
-
138
Autonomy in Action: Agentic AI
AI used to wait for instructions. Now, it doesn’t always ask. In this opening episode, we explore the rise of agentic AI systems that don’t just respond to input, but take initiative, set goals, and act on their own. We break down what agentic really means, why it’s different from traditional automation, and what kinds of design and trust challenges this shift introduces. Along the way, we look at how this plays out in tools that summarize, schedule, and trigger real-world workflows — and why autonomy sounds good in theory but gets messy fast. Whether you're building LLM-powered copilots, evaluating autonomous workflows, or just trying to keep your incident response human-aware, this is the groundwork you’ll need for what’s coming next.
-
137
Welcome Sid!
This week we chat with our newest Developer Advocate, Sid Verma.
-
136
Wing's Take on Cloud Development With Elad Ben-Israel
Cloud development can be tricky. It needs many different skills related to both infrastructure and software. Here's where Wing comes in. It combines infrastructure and runtime code in one language. This helps developers stay focused and creative. The result? Faster, safer, and better software. Join us to see what makes Wing's way of doing things special.
-
135
Building Trust in Security Reporting With Breanne Boland
Spotting a security issue, or even thinking you may have caused one, can be nerve-wracking, and the last thing anyone wants is to accidentally create noise for another team. Getting to know your security team can help make it all a little less scary. Breanne Boland, Product Security Engineer at Gusto, joins us to talk all things security alerting and the steps to create a culture where others feel at ease reporting security concerns.
-
134
SRE Journey at Adidas With Andreia Otto
Successful Site Reliability Engineering (SRE) teams are skilled in both software and systems engineering, allowing them to manage reliable, scalable systems. They proactively identify and address potential issues, use failures as learning opportunities, and automate processes to reduce toil. They also prioritize communication and collaboration with other teams to ensure service reliability and performance. Join us as we discuss the journey of SRE teams at Adidas.
-
133
February Book Club: After the Gold Rush by Steve McConnell
What is a software *engineer*? Software drives so much of our everyday lives, yet software development as a field has not adopted the kind of licensure other engineering disciplines have long been subject to. Hannele and Mandi discuss this classic set of essays by Steve McConnell, covering many of the same questions we still have today.
-
132
Internal Developer Platforms With Dave Bresci
Internal Developer Portals have become crucial for organizations seeking to enhance developer experience, reduce cognitive load, and adhere to company standards. In this episode, we welcome Dave Bresci, who explains why and how PagerDuty uses Backstage internally. Additionally, our Developer Advocate, Tiago Barbosa, will provide some relevant points on PagerDuty's plugin for Backstage.
-
131
Scaling Support Teams and People With John O'Donnell
In this episode, Kat chats with PagerDuty's own EMEA support team lead, John O'Donnell, about the challenges of scaling a CS Team across global offices, mentoring others, and building your own career, all while remembering to stop and take a breath once in a while.
-
130
Open Source and Communities With Heitor Lessa
On this episode, Heitor Lessa, Chief Architect at AWS, shares some insights on the journey of Powertools for AWS Lambda and the practices involved in growing and maintaining an open source community.
-
129
Python in Space
Use cases for the Python programming language are everywhere. This week we talk to Mike Fiedler, Security and Safety Engineer for PyPI, about keeping those use cases secure, working in Open Source, and sending Python to space.
-
128
Continuous Learning With Matt Davis
This week we talk to Matt Davis about how leaders can build a culture of learning in modern organizations. Leaders foster continuous learning opportunities for teams and help employees cope with environments where change is constant.
-
127
Pagey's Nostalgia Hour
For our 100th episode, we reached out to the folks who have been at PagerDuty the longest, and asked them to share some of their stories with us and with you! These folks are from all over PagerDuty and had some amazing one-of-a-kind experiences.
-
126
Team Topologies With Manuel Pais
This week we welcome to the show Manuel Pais who is the co-author of "Team Topologies: Organizing business and technology teams for fast flow". Manuel will walk us through some of the concepts that enable companies to deliver value more frequently and effectively to their customers by organizing their teams in an optimized way.
-
125
Threat Modeling With Gene Gotimer
Threat modeling is one of those things that teams say they should be doing, but many never quite do it. Putting together a formal threat model with input from the whole organization is daunting. Where do you start? Where do you draw the line? In this episode we talk to Gene Gotimer about how to approach threat modeling without losing focus and getting too off track in what-ifs.
-
124
Adventures in Infrastructure With Mark Hatch
Additional Resources PagerDuty Home Page Episode edited by Mandy Moore Transcripts by Rev
-
123
Welcome Tiago!
Additional Resources PagerDuty Home Page Episode edited by Mandy Moore Transcripts by Rev
-
122
Value Stream Management With Helen Beal
Additional Resources Visit the Value Stream Management Consortium for the State of VSM Reports and 20% discount on annual Influencer membership using code PAGERDUTY20 Find Flowtopia sessions and other videos on the VSM YouTube Channel The DevOps Institute PagerDuty Home Page Episode edited by Mandy Moore Transcripts by Rev
-
121
Sustainable On-Call Culture With Paige Cruz
Additional Resources PagerDuty Home Page Guide to assembling a year in review Building Greater Operational Efficiency with PagerDuty PagerDuty’s Service Ownership operations guide The Sustainable Web Manifesto Sustainable Web Design by Tom Greenwood Ecograder’s “How Green is Your Website?” tool Hubspot’s “State of Burnout in Tech” 2022 report PagerDuty’s Status Page and Status Update Notification Templates features Episode edited by Mandy Moore Transcripts by Rev
-
120
Bridging the Gap Between Customer Support and Engineering With Rachel Stephens
Additional Resources PagerDuty Home Page Past Episode: Support Career Stories PagerDuty’s Incident Response Ops Guide PagerDuty’s Customer Support Ops Guide Episode edited by Mandy Moore Transcripts by Rev
-
119
Reliability of Cloud Dependencies With Jeff Martens
Additional Resources PagerDuty Home Page Metrist Watch Jeff’s PagerDuty Summit 2022 session on YouTube Episode edited by Mandy Moore Transcripts by Rev
-
118
The Ops.IO Community With Ella Ang De Jonge and Brad Johnson
Additional Resources Join Ops.IO and follow PagerDuty! Follow LearnAboutOps on Twitter. Blink Check out our other community focused episodes: Developer Communities with Mary Thengvall and Julie Gunderson and Open Source Communities with benny Vasquez. PagerDuty Home Page. Episode edited by Mandy Moore. Transcripts by Rev.
-
117
Developer Communities With Mary and Julie
The communities that grow up around software products can have many different characteristics. This week Mary and Julie chat with Mandi about developer communities and the people parts of developer 'marketing'.
-
116
Vote for PagerDuty in the DevOps Dozen Awards 2022
PagerDuty has been nominated in the 'Best End-to-End DevOps Tool/Service' category in the DevOps Dozen Awards 2022. Head to https://devopsdozen.com to cast your vote for PagerDuty.
-
115
API Security With Rob Dickinson
API security is more than just putting up a firewall and watching the perimeter of your application. You also need to observe what is going on inside. Join us as we talk with Rob Dickinson, founder and CTO of Resurface, about API security.
-
114
It's Always BGP: Networking and Other Disasters With Stuart Clark
Your network layer is the foundation of your service reliability. When things go wrong, they go very wrong. Stuart Clark joins us to talk about the stress the network can cause.
-
113
Conscientious Engineering Management With Scott Hain
Sometimes you plan to become a manager, sometimes it just happens. When leading a team with different types of tasks and responsibilities, Scott took some time to find folks who weren't just like everyone else.
-
112
Security Careers With Megg and Patrick
October is Cybersecurity Awareness Month! Our security team at PagerDuty helps our engineers keep our platforms safe, helps our employees with security training, and much more. Megg and Patrick joined us for this episode to tell us about what they do and how they got to where they are.
-
111
Service Mesh With Jason Morgan
The Linkerd service mesh for Kubernetes helps your application access resources and secures network connections. We talked with Jason Morgan of Buoyant about service meshes in general and Linkerd in particular to learn more about it.
-
110
Runbook Automation With Jake Cohen
Embracing automation in distributed systems is key for reaching scale and efficiency. This week we talk to Jake Cohen about runbook automation, what it means for teams, and how it creates opportunities for automated diagnostics.
-
109
Software Bill of Materials With Barak Brudo
The Software Bill of Materials, or SBOM, is a list of any and all components included in a software artifact. In the United States, SBOMs are a requirement for software used by the federal government. This week we talk to Barak Brudo about the mechanisms used to create and use SBOMs.
-
108
Support Career Stories
Your company's customer support team makes sure your customers feel taken care of, but they can also become valuable adds to your engineering teams. Listen to this episode for insights and advice on making the switch from Andra Burck and Isabella Applen of PagerDuty, and Pablo Gonzalez of Salto.
-
107
Mental Health With Fred Harper
In this episode, Fred Harper joins us to talk about mental health and his experiences with neurodiversity.
-
106
Best Practices With Ivan Merrill
Observability, monitoring, and other operational features of your services can't be bolted on at the end of the development process. Setting teams up for success with best practices helps organizations meet their goals.
-
105
Performance Management With Ted Neward
Managing a team's performance is more than just firing those who don't behave well and promoting those that do. It's nurturing and growing your team to help them perform at their best. Ted Neward, co-founder of Solidify/US, talks about his experience managing the performance of software teams.
-
104
PagerDuty Summit 2022 Bonus Episode!
It's time for PagerDuty Summit 2022. In this bonus episode, some of the PagerDuty team joins us to talk about the events, the content, the swag, and all the things they're excited to share with PagerDuty's users and community.
-
103
Planning for Service End of Life With Sean Steacy
Technical services and applications don't have to live forever. Knowing when to shut down a service or feature that isn't working, and doing it in a way that keeps your users happy, is its own practice. PagerDuty's Sean Steacy joins us to talk about PagerDuty's EOL process.
-
102
Great Open Source Communities With Benny Vasquez
Open Source software projects rely on strong communities for support, feature development, bug fixing, and any number of other technical tasks. But communities also provide users with a place to share experiences and find like-minded folks. benny Vasquez, Chair of the AlmaLinux Board of Directors, joins us to talk about what's great about community.
-
101
Not Just Documentation With Mary and Kimberly
Sharing information in technical communities is key to product and feature adoption. Sharing information within technical teams is crucial to creating shared knowledge about services and their environment. Mary and Kimberly join us to talk about how their backgrounds in Library Science help them create and manage the documentation their organizations need.
-
100
Working With SLOs With Alex Hidalgo
Service Level Objectives (SLOs) are a method for focusing work on reliability. As a tool for your team, SLOs provide insight into service performance and can act as a framework for prioritizing tasks and features.
-
99
Break It 2 the Limit
Kolton and Alex reflect on how they identified the space where they could build their respective companies and the shift from larger entities to startups. Part 2 of a 2-part crossover with the Break Things on Purpose podcast.
-
98
Security Champions With Simon Maple
The one where we talk about security champions: how it's different from DevOps, why recognition is so critical for success, and how you can get started with championing security.
-
97
Easing Into Incident Command With Iris Carrera
Iris Carrera is a Senior Site Reliability Engineer at Dutchie focused on observability and incident response. Prior to Dutchie she worked at HashiCorp building the infrastructure that supports HashiCorp Cloud Platform. Iris has worked on infrastructure and site reliability in aerospace, cannabis, and cloud PaaS environments. Iris lives in Seattle, WA, with her partner and pup. She joined us recently to talk about the experience of being new to incident command and how checking in with your peers and yourself can smooth out the process.
-
96
Communication Breakdowns With Michael Callaghan
Communication is something we can always get better at. Michael Callaghan joins us to talk about communication mistakes we commonly make and how to learn from them.
-
95
Making Work Visible With Dominica DeGrandis
Dominica DeGrandis is the author of *Making Work Visible*, and Principal Flow Officer at Tasktop. She joined us to talk about how important it is for teams to ensure that all of their work is accounted for in planning.
-
94
The VOID With Courtney Nash
The Verica Open Incident Database is a community-contributed collection of software-related incident reports. Courtney Nash tells us all about it.
-
93
An Exegesis on HA and DR With Rich Lafferty
Rich Lafferty is a Staff SRE at PagerDuty. In this episode he shares with us what it is a Staff SRE does as well as why high availability and disaster recovery matter and why it's good to reevaluate your plans periodically.
-
92
Emergency Response With Greg Albrecht
Greg Albrecht is a tech company CTO and has been disaster deployed for hurricanes, earthquakes, wildfires, and other events requiring emergency response. He brings his experience in real-world emergency response to the technical world.