If you are one of the more than 10,000 companies across the globe that uses PagerDuty to manage real-time digital operations, there are a number of capabilities, features, and products that can support your efforts to run regular operational reviews, including making it easy to collect metrics and improve the richness of those metrics.
PagerDuty Analytics offers a feature called Operational Review Analytics. This feature was designed to be a companion to this Operational Review Documentation and align directly to the three most common operational reviews run at digital businesses. Three reports are accessible in the PagerDuty web application, which can be customized by adjusting the timeframe of the reports, as well as the Team, Service, and/or Organization scope. Additionally, these reports can be scheduled for delivery to your email inbox.
PagerDuty Platform Reporting#
If you are not yet using the rich and prescriptive analytics capabilities that come with the PagerDuty Analytics product, you can still find a wealth of information and metrics within the PagerDuty Platform product. PagerDuty Platform offers up to five distinct reports, surfacing every incident and alert, along with basic metrics across the system, teams, users, and escalation policies. Report availability depends on your PagerDuty Platform license type.
Team & Service Configuration#
In order to run focused Operational Review meetings with metrics, it is critical to have your on-call and incident management platform set up in a way that leads to proper visibility into the data. The two most important configurations are around your Teams and PagerDuty Services.
For Teams, make sure that all users in your PagerDuty account are part of at least one team. Set up your PagerDuty teams to mirror your software development/delivery teams. If a group of people in your company wholly owns a technical service, that group of people should be a team in PagerDuty. Make sure the team’s manager is set up as a member of the team, and use the team roles feature to signal that they are the manager. You can learn more about setting up teams from our Teams Knowledge Base page.
For Services, there are many ways to set up your PagerDuty instance, but the method that provides the best visibility into meaningful metrics is to set up one PagerDuty service for every distinct piece of software you manage. All monitoring for that software should be directed to the same service in PagerDuty, and you’ll want to ensure that there is clear team ownership of that service assigned in PagerDuty. See this blog post for best practices around setting up Services, and our Services and Integrations Knowledge Base page for how to configure services.
Modern Incident Response#
Finally, using PagerDuty to manage major incident response is not only a great way to reduce customer-impacting downtime, it can also increase the richness of the data and metrics that PagerDuty offers to help you run your Operational Reviews. PagerDuty’s Modern Incident Response product offers features such as incident priority, collaboration for multiple responders, automation, stakeholder updates, and more. Using these features will enrich the details about a Service and its health, as well as add to the available data that can surface insights into your process.
Don't forget to ensure your major incident response process is up-to-date using our open source incident response documentation.