Mean Time to Recovery

Mean Time to Recovery (MTTR) measures the average time it takes to recover from an incident, calculated across all incidents in a given time period.
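
As a rough illustration of the arithmetic behind the definition above, the sketch below averages recovery durations over a period. The function name, the (started, resolved) pair format, and the choice to assign an incident to a period by its start time are assumptions made for this example, not a description of how the product computes the metric.

```python
from datetime import datetime, timedelta


def mean_time_to_recovery(incidents, period_start, period_end):
    """Average recovery time for incidents that started within the period.

    incidents: iterable of (started_at, resolved_at) datetime pairs
    (hypothetical format). Returns a timedelta, or None when no incident
    started in the period.
    """
    durations = [
        resolved_at - started_at
        for started_at, resolved_at in incidents
        # Assumption: an incident belongs to the period in which it started.
        if period_start <= started_at < period_end
    ]
    if not durations:
        return None
    # Sum the durations (starting from a zero timedelta) and divide by the count.
    return sum(durations, timedelta()) / len(durations)


# Made-up sample data: three incidents lasting 10, 50, and 60 minutes.
incidents = [
    (datetime(2024, 5, 2, 9, 0), datetime(2024, 5, 2, 9, 10)),
    (datetime(2024, 5, 9, 14, 0), datetime(2024, 5, 9, 14, 50)),
    (datetime(2024, 5, 20, 23, 30), datetime(2024, 5, 21, 0, 30)),
]

mttr = mean_time_to_recovery(incidents, datetime(2024, 5, 1), datetime(2024, 6, 1))
print(mttr)  # 0:40:00
```

Here the three recovery times add up to 120 minutes, so the MTTR for the period is 40 minutes.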

Where to find it

  • Delivery > DORA Metrics
  • Health > Team Insights > Add new metric

Interpretation

  • High MTTR (>1 hour) indicates slow incident response and resolution, which leads to extended system downtime, increased customer impact, higher operational costs, and reduced system reliability. Consider improving your incident response procedures and team training.
  • Medium MTTR (15-60 minutes) is acceptable for most organizations and indicates a balanced approach to incident response.
  • Low MTTR (<15 minutes) indicates excellent incident response capabilities, minimal system downtime, high system reliability, a better customer experience, and reduced operational impact.
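
The bands above can be expressed as a small lookup, for example when labelling exported values. This is only a sketch: the function name and the labels are hypothetical, and treating exactly 15 and exactly 60 minutes as "medium" is an assumption based on the "15-60 minutes" range.

```python
def classify_mttr(mttr_minutes):
    """Map an MTTR value in minutes to the band described above."""
    if mttr_minutes < 0:
        raise ValueError("MTTR cannot be negative")
    if mttr_minutes < 15:
        return "low"
    # Assumption: the boundaries 15 and 60 both fall in the medium band.
    if mttr_minutes <= 60:
        return "medium"
    return "high"


print(classify_mttr(40))  # medium
```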

Custom Dashboards

You can add any metric to your Custom Dashboards at any time.