MTBF and MTTR Explained: Key Reliability Metrics for Facility Managers

By John Polus on March 27, 2026

mtbf-mttr-reliability-metrics-facility-managers

MTBF and MTTR are the two reliability metrics that tell facility managers the most important things about their equipment: how long assets typically run before failing, and how quickly the team can restore service when they do. A facility with high MTBF and low MTTR is one where failures are infrequent and recovery is fast. A facility with declining MTBF and rising MTTR is one heading toward a reliability crisis, where each failure takes longer to fix and happens more often. The gap between these two scenarios is not luck or asset age; it is the quality of the maintenance programme and the data available to manage it. This guide explains both metrics in plain terms, shows you how to calculate them for your own assets, gives you industry benchmarks to compare against, and shows you how structured CMMS deployment improves both figures within the first 12 months. Sign up free on Oxmaint to start tracking MTBF and MTTR across your portfolio today, or book a demo for a live reliability metrics walkthrough.

3,200h
Average MTBF for commercial HVAC systems in reactive-maintenance facilities in 2026. AI predictive maintenance programmes push this to 6,000 to 6,800 hours within 18 months
8.4h
Average MTTR for unplanned equipment failures in commercial buildings without CMMS-managed parts inventory and pre-staged work orders. Target with structured CMMS: under 3 hours
89%
PM compliance rate achieved by Oxmaint-deployed facility portfolios within 12 months, directly driving MTBF improvement as planned maintenance prevents the failure events that reduce the metric
4.8x
Cost ratio of reactive failure repair versus planned maintenance intervention. Every MTBF improvement that converts an unplanned event to a planned one eliminates this premium from the maintenance budget

Track MTBF and MTTR in Real Time Across Your Entire Portfolio

Oxmaint calculates MTBF and MTTR per asset automatically from work order data. No manual calculation, no spreadsheets. Reliability trend dashboards live in 14 days. Book a demo to see the reliability dashboard configured for your equipment.

Quick Definitions: MTBF and MTTR

MTBF (Mean Time Between Failures) is the average operating time between unplanned failure events for a given asset. Calculated as: Total Operating Time divided by Number of Failures in that period. A chiller that ran 12,000 hours and failed 4 times has an MTBF of 3,000 hours. MTTR (Mean Time to Repair) is the average time from the moment a failure is detected to the moment the asset is fully restored to operational status. Calculated as: Total Repair Time divided by Number of Repairs. Both metrics are calculated per asset class, per building, and across the full portfolio. Tracking both together tells the complete reliability story: how often things break and how quickly the team responds.

MTBF: How to Calculate It and What It Tells You

MTBF is a measure of reliability, not durability. It does not measure how long an asset will last before it is replaced; it measures how long it typically runs between unplanned failure events during its service life. A declining MTBF trend is an early warning that asset condition is deteriorating and maintenance intervals need attention before a breakdown occurs.

01
Identify Your Measurement Period
Choose a consistent period for MTBF calculation, typically the last 12 months of operational data per asset. Shorter periods produce less reliable MTBF figures; longer periods may blend different maintenance regimes. For benchmarking, use the same 12-month rolling window across all assets to ensure comparability.
Recommended period: 12-month rolling window per asset
02
Count Total Operating Hours
Sum all hours the asset was available and running during the measurement period. Subtract both planned downtime (scheduled PM, planned shutdowns) and unplanned downtime (failure events) from calendar time to get total operating hours. Planned downtime does not count as failure time in MTBF calculation.
Operating hours = calendar hours minus planned downtime minus unplanned downtime
03
Count Failure Events in the Period
Count all unplanned failure events during the measurement period that required reactive repair. Do not include planned PM visits or scheduled replacements. Each event where the asset stopped unexpectedly and required unplanned intervention counts as one failure for MTBF purposes, regardless of repair duration.
Count unplanned failures only; exclude scheduled PM events
04
Divide to Get MTBF
MTBF = Total Operating Hours divided by Number of Failure Events. Example: A chiller with 8,400 operating hours and 3 failure events in the measurement period has MTBF = 8,400 divided by 3 = 2,800 hours. Compare this against the industry benchmark for that equipment class to assess whether your reliability programme is working.
MTBF = Operating Hours divided by Failure Count

MTTR: How to Calculate It and Why It Matters

MTTR measures maintenance team responsiveness and repair efficiency. High MTTR indicates structural problems: parts not stocked, technicians not adequately trained on the asset, work order information incomplete, or contractor response times unacceptably long. Reducing MTTR has immediate financial impact because every hour of extended downtime during a failure event carries direct operational cost.

Detection Time
Time from Failure to Detection

The window between when a failure occurs and when the maintenance team is notified. IoT sensors and BAS monitoring reduce this to under 5 minutes. Without monitoring, detection depends on tenant complaints or scheduled walkthroughs, averaging 2 to 8 hours.

Target: under 5 minutes with IoT monitoring
Diagnosis Time
Time to Diagnose Root Cause

Time spent identifying what failed and why. Pre-built failure mode libraries in the CMMS, AI predictive context from sensor data, and complete asset maintenance history reduce diagnosis from an average of 90 minutes to under 20 minutes for common failure modes.

Target: under 20 minutes with CMMS history access
Parts Access
Parts Retrieval and Staging Time

Time to locate, retrieve, or procure the required parts and tools. Facilities with CMMS-managed parts inventory linked to asset records cut parts retrieval time by 60 to 70%. Facilities without inventory management average 3 to 8 hours on parts delays alone.

Target: under 30 minutes with stocked MRO inventory
Repair and Test
Active Repair and Return-to-Service Testing

Hands-on repair time plus system testing and verification before work order is closed. This component of MTTR is most directly improved by technician training, complete repair procedures in the CMMS, and post-repair performance verification protocols that prevent repeat callbacks on the same failure.

Varies by failure type. Document repair procedures per failure mode

MTBF and MTTR Benchmarks by Equipment Class

Use these benchmarks to assess where your facility stands relative to industry performance. Facilities achieving above-benchmark MTBF and below-benchmark MTTR have effective reliability programmes; those below MTBF benchmark or above MTTR benchmark have structural maintenance gaps that CMMS deployment and PM programme restructuring can address.

Equipment ClassReactive Facility MTBFStructured PM MTBFAI Predictive MTBFMTTR Target
Water-Cooled Chiller 2,800 to 3,400 hrs 4,200 to 5,200 hrs 5,800 to 7,200 hrs 4 to 8 hrs planned, 8 to 18 hrs emergency
Air Handling Unit (large AHU) 3,400 to 4,200 hrs 5,000 to 6,400 hrs 6,200 to 7,800 hrs 2 to 4 hrs planned, 4 to 8 hrs emergency
Centrifugal Pump 4,200 to 5,800 hrs 6,800 to 8,200 hrs 8,400 to 10,200 hrs 1 to 3 hrs planned, 3 to 6 hrs emergency
Cooling Tower Fan 2,600 to 3,800 hrs 4,400 to 5,600 hrs 5,200 to 6,800 hrs 2 to 4 hrs planned, 4 to 10 hrs emergency
Electrical Motor (above 15kW) 5,200 to 7,400 hrs 8,200 to 10,400 hrs 10,800 to 14,000 hrs 2 to 6 hrs planned, 6 to 24 hrs emergency
Elevator (hydraulic or traction) 6,400 to 8,200 hrs 9,200 to 11,400 hrs 11,800 to 14,600 hrs 1 to 2 hrs planned, 2 to 4 hrs emergency
Boiler (hot water or steam) 4,800 to 6,400 hrs 7,200 to 9,400 hrs 9,800 to 12,200 hrs 4 to 8 hrs planned, 8 to 36 hrs emergency
Generator (emergency or standby) Tested monthly, MTBF for actual run failures 3,200 to 4,800 hrs 5,400 to 7,200 hrs 7,600 to 9,400 hrs 1 to 4 hrs planned, 2 to 8 hrs emergency

How to Improve MTBF and MTTR: Four Proven Approaches

01
Extend MTBF: Deploy AI Predictive Maintenance to Prevent Failures Before They Occur
MTBF improves when failure events become less frequent. The most direct route is converting unplanned failures to planned interventions before the failure event occurs. AI predictive maintenance detects 68% of commercial building equipment failures 7 to 42 days in advance from sensor data anomalies, enabling scheduled replacement of the degrading component before the asset trips. Every prevented failure adds operating hours to the numerator of the MTBF calculation without adding to the failure count denominator, directly improving the metric. Facilities deploying AI predictive maintenance see MTBF improvement of 60 to 85% within 18 months across HVAC and mechanical systems.
02
Extend MTBF: Enforce PM Compliance With Automated Scheduling and Escalating Alerts
Skipped or delayed preventive maintenance is the primary driver of premature failure events and declining MTBF across commercial facilities. Every missed lubrication interval, delayed filter replacement, or skipped belt inspection advances asset degradation toward the next failure event. Automated PM scheduling with 30-7-1 day escalating alerts, enforced across all equipment classes in the CMMS, removes the human decision point that allows PM compliance to drift below 60% on paper-based systems. Facilities on Oxmaint achieve 89% PM compliance within 12 months, directly improving MTBF by eliminating maintenance-gap-induced failures.
03
Reduce MTTR: Attach Asset History and Repair Procedures to Every Work Order
The largest preventable component of MTTR in most facilities is diagnosis time. When a technician arrives at a failed asset with no repair history, no previous failure context, and no documented repair procedure, diagnosis takes 60 to 120 minutes. When the CMMS generates the work order pre-populated with the last 5 failure events, probable failure modes based on sensor data, recommended repair procedure, and required parts list, diagnosis drops to under 15 minutes for common failure modes. CMMS-attached asset history is the single highest-impact MTTR intervention for most facility teams.
04
Reduce MTTR: Manage Critical Parts Inventory Linked to Asset Records
Parts unavailability is the most common cause of MTTR exceeding 8 hours for emergency failures in commercial buildings. A bearing replacement that takes 45 minutes of active repair time turns into a 14-hour downtime event when the correct part is not in stock and requires emergency procurement. CMMS-managed MRO inventory linked to asset records enables minimum stock levels for critical spare parts, consumption tracking, and automatic reorder triggers, ensuring that the most common failure parts for each asset class are available without manual oversight or periodic physical audits.

Start Tracking MTBF and MTTR Automatically Across Your Portfolio

Oxmaint calculates MTBF and MTTR per asset from work order completion data automatically. No manual calculation. Trend dashboards, benchmark comparisons, and PM compliance tracking all in one platform.

How Oxmaint Tracks and Improves MTBF and MTTR

Oxmaint calculates both metrics automatically from work order data, making MTBF and MTTR live, asset-level reporting metrics rather than monthly manual calculations. Here is how each platform capability contributes to both figures.

Auto Calculation
Automatic MTBF and MTTR Calculation per Asset

Work order closures feed MTBF and MTTR calculations automatically. No manual data export or spreadsheet calculation required. Each asset's reliability metrics update in real time as work orders are opened, progressed, and closed in the CMMS. Historical trend charts available per asset and per equipment class across the full portfolio.

Updates automatically on every work order closure
PM Automation
Automated PM Scheduling to Protect MTBF

PM work orders generated automatically per equipment class and regulatory interval. 30-7-1 day escalating alerts prevent skipped maintenance. Every completed PM visit directly contributes operating hours between failure events, protecting and extending MTBF across all asset classes in the portfolio.

89% PM compliance rate within 12 months
Failure Context
Asset History on Every Reactive Work Order

Reactive work orders auto-populated with full asset repair history, previous failure modes, recommended diagnostic steps, and parts commonly required. Technicians arrive with context that reduces diagnosis time from 60 to 120 minutes to under 20 minutes for common failure modes, directly reducing MTTR on every callout.

Diagnosis time reduction: 60 to 80% versus no CMMS history
Trend Dashboard
MTBF and MTTR Trend Dashboard for Portfolio Management

Portfolio-level MTBF and MTTR trends updated in real time. Equipment ranked by reliability score, highlighting highest-priority improvement targets. Month-over-month trend showing whether the reliability programme is delivering measurable results, exportable for director and board reporting.

Ranked by reliability score across all equipment classes

Frequently Asked Questions: MTBF and MTTR for Facility Managers

QWhat is the difference between MTBF and asset lifespan?
MTBF measures how often an asset fails during its service life, not how long it will last before replacement. A chiller with a 20-year service life might have a 3,000-hour MTBF (failures every 3 months) or a 9,000-hour MTBF (failures roughly annually). Higher MTBF means fewer failures during the same service life, not a longer service life. Sign up free to track both metrics for your assets, or book a demo for a reliability dashboard walkthrough.
QHow quickly can MTBF improve after deploying a structured PM programme?
Most facilities see measurable MTBF improvement within 6 to 9 months of deploying structured preventive maintenance. The improvement curve accelerates as AI predictive maintenance converts more failure events to planned interventions. Most Oxmaint portfolios achieve 50 to 85% MTBF improvement within 18 months. Book a demo to model the expected MTBF improvement trajectory for your specific asset base.
QWhat is causing high MTTR in most commercial facilities?
The top three MTTR drivers are: slow failure detection (2 to 8 hours without monitoring versus under 5 minutes with IoT sensors), long diagnosis time (60 to 120 minutes without CMMS asset history), and parts unavailability (3 to 8 hours waiting for parts without managed inventory). Addressing all three brings MTTR from a typical 8 to 14 hours to under 3 hours for common failure modes. Sign up free to see MTTR tracking configured for your portfolio.
QCan I report MTBF and MTTR to building owners or investors as reliability evidence?
Yes. MTBF and MTTR trends are increasingly requested by institutional investors, CRE asset managers, and REIT operators as operational reliability evidence. Oxmaint exports portfolio-level MTBF and MTTR reports in formats suitable for investor reporting and board presentations. Book a demo to see the investor reporting format and portfolio reliability export.

Improve MTBF and MTTR Across Every Asset in Your Portfolio

Automatic MTBF and MTTR calculation, AI predictive maintenance to prevent failure events, PM automation to protect reliability gains, and asset history attached to every reactive work order to cut diagnosis time. Live across your full portfolio in 14 days.

Auto MTBF CalculationAuto MTTR TrackingPM Compliance AutomationAsset History on Work Orders

Continue Reading: Reliability and Smart Building Resources


Share This Story, Choose Your Platform!