Mean Time To Resolve (MTTR) as a Service Desk Metric

The service desk is a valuable ITSM function that ensures efficient and effective IT service delivery. A variety of metrics are available to help you better manage and achieve these goals. These metrics often identify business constraints and quantify the impact of IT incidents. Of course, the vast, complex nature of IT infrastructure and assets generate a deluge of information that describe system performance and issues at every network node. The challenge for service desk? Identifying the metrics that best describe the true system performance and guide toward optimal issue resolution.

We’ve talked before about service desk metrics, such as the cost per ticket. Another service desk metric is mean time to resolve (MTTR), which quantifies the time needed for a system to regain normal operation performance after a failure occurrence. In this article, we’ll explore MTTR, including defining and calculating MTTR and showing how MTTR supports a DevOps environment.

What is MTTR?

Beyond the service desk, MTTR is a popular and easy-to-understand metric:

In each case, the popular discussion topic is the time spent between failure and issue resolution. So, let’s define MTTR.

‘Mean time to recovery’ is the average time duration to fix a failed component and return to an operational state. This metric includes the time spent during the alert and diagnostic processes, before repair activities are initiated. (The average time solely spent on the repair process is called ‘mean time to repair’, also shortened to MTTR.) MTTR can be mathematically defined in terms of maintenance or the downtime duration:

In other words, MTTR describes both the reliability and availability of a system:

The shorter the MTTR, the higher the reliability and availability of the system. From a practical service desk perspective, this concept makes MTTR valuable: users of IT services expect services to perform optimally for significant durations as well as at specific instances. For example, Amazon Prime customers expect the website to remain fast and responsive for the entire duration of their purchase cycle, especially during the holiday season. If the website is down several times per day but only for a millisecond, a regular user may not experience the impact.

MTTR encourages DevOps

MTTR is a valuable metric for service desks on its own, but it also encourages DevOps culture and practices in a variety of ways:

By following the DevOps philosophy, service desk can achieve the wider ITSM objectives of efficiently and effectively delivering IT services. MTTR is one among many other service desk metrics that companies can use to evaluate for deeper insights into IT service management and operations activities. With any technology or metrics, however, remember that there is no ‘one size fits all’: you’ll want to determine which metrics are useful for your organization’s unique needs, and build your ITSM practice to achieve real-world business goals.