How to Calculate and Improve MTTR and MTBF in Manufacturing Plants

Two numbers tell you almost everything about how reliable your manufacturing plant really is. MTTR — how long it takes to fix a failure. MTBF — how long an asset runs before the next one. Plants that obsess over these two metrics consistently outperform peers on uptime, maintenance budget, and OEE. Plants that ignore them tend to discover the cost the hard way, in the form of a $50K hour of unplanned downtime or a parts crib that's always missing the one thing the technician needs. The math is simple. The discipline of capturing failure data clean enough to trust it is what separates a reliability program from a wishlist. Start free with OxMaint to see MTTR and MTBF calculated automatically across every asset.

$50K-$250K

Per Downtime Hour

Typical cost range of unplanned production stoppage in heavy manufacturing

40-60%

MTBF Lift

Reliability gain plants achieve when 85% of work is planned proactive maintenance

10x

Cost Multiplier

Run-to-failure operations cost roughly 10x more than proactive PM programs

2 hrs

MTTR Reduction

Cutting two hours off MTTR per event recovers millions in annual production

The Two Metrics, Decoded

MTTR and MTBF are siblings, not synonyms. One measures how fast you fix things; the other measures how long things stay fixed. Together they form the complete reliability picture every plant manager and maintenance director needs on a single dashboard.

MTTR

Mean Time To Repair

How quickly your team restores a failed asset to running condition. A maintainability metric.

Formula

Total Repair Time ÷ Number of Repairs

What good looks like

Manufacturing target: 1 to 6 hours depending on asset criticality. Critical line equipment should aim under 2 hours.

MTBF

Mean Time Between Failures

How long an asset operates between one failure and the next. A reliability metric.

Formula

Total Operating Hours ÷ Number of Failures

What good looks like

Mature plants target MTBF of 700+ hours on rotating equipment. Best-in-class hits 2,000+ on critical assets.

A Worked Example — One Conveyor, One Week

Numbers click faster than definitions. Here's how MTTR and MTBF actually get calculated on a real production conveyor that runs 21 hours a day, five days a week — the same kind of asset every plant manager knows by heart.

Conveyor Line — Week 14 Failure Data

Total scheduled run time

105 hours

Number of failure events

3 failures

Total downtime from failures

12 hours

Total operating time

93 hours

MTTR Calculation

12 hrs ÷ 3 repairs = 4 hours

On average, this conveyor takes 4 hours to fix once it goes down.

MTBF Calculation

93 hrs ÷ 3 failures = 31 hours

On average, this conveyor runs 31 hours before the next failure.

Availability

31 ÷ (31 + 4) = 88.6%

The conveyor is available 88.6% of scheduled production time.

The week-by-week trend is the real story. A single number is just a snapshot — what matters is whether MTBF is climbing and MTTR is falling over time.

The Availability Equation — Why Both Metrics Matter

Tracking only one of these numbers tells you half the story. A plant with high MTBF but high MTTR is reliable but slow to recover. A plant with low MTBF and low MTTR fails often but bounces back fast. Availability is the bridge between them — and it's the number your operations director will actually report to the executive team.

Availability = MTBF ÷ (MTBF + MTTR)

High MTBF, High MTTR

MTBF 200h, MTTR 8h

96.2% available

Reliable but painful when it does fail. Recovery process needs work.

Low MTBF, Low MTTR

MTBF 40h, MTTR 1h

97.6% available

Fails constantly but recovers fast. Reliability engineering is the gap.

High MTBF, Low MTTR

MTBF 700h, MTTR 2h

99.7% available

The target state. Fewer failures, faster recovery, predictable production.

What MTTR Is Actually Made Of

The phrase "mean time to repair" hides four very different stages, and each one has its own bottleneck. Plants that improve MTTR meaningfully don't try to repair faster — they attack the four components separately, because that's where the time actually leaks.

Detect

Failure to discovery

Sensors, alarms, operator rounds, condition monitoring — speed up the moment you know something broke.

Respond

Discovery to crew on-site

Mobile work order dispatch, on-call rotations, clear escalation paths — close the gap from alert to action.

Repair

Diagnosis through fix

SOPs, asset history, parts availability, technician skill match — the actual hands-on-tools time.

Verify

Fix to production restart

Test runs, QC checks, signoff workflows — confirming the asset is truly ready before releasing it.

Industry Benchmarks — How Do You Stack Up?

Targets vary wildly by industry and asset criticality, and a "good" number for a packaging line is a disaster for a critical safety system. Use these as directional benchmarks — and always benchmark within your own asset class first.

Industry / Asset Type

Target MTTR

Target MTBF

Target Availability

Critical process equipment (continuous)

Under 2 hours

2,000+ hours

99.5%+

Manufacturing line — discrete assembly

2 to 4 hours

700 to 1,500 hours

98% to 99%

Packaging and material handling

2 to 6 hours

300 to 800 hours

95% to 98%

Power generation — turbine and boiler

Under 4 hours

5,000+ hours

99% to 99.9%

Healthcare — life-support systems

Under 15 minutes

10,000+ hours

99.99%+

IT & networked control systems

15 to 60 minutes

5,000+ hours

99.9%+

The Improvement Playbook — Two Sides of the Same Coin

Lowering MTTR and raising MTBF require different actions. MTTR is a process problem; MTBF is an engineering and PM problem. Plants that make real gains attack both with parallel programs — not by hoping one improves the other.

Lower MTTR

Recover faster from every failure

Standardize SOPs. Every critical asset should have a step-by-step repair procedure with photos, attached to the work order on a phone.

Stock the right spares. Use failure history to set min-max levels — the part the technician needs at 2am is the only metric that matters.

Invest in mobile dispatch. Cut detect-to-respond time with push notifications and offline mobile work orders.

Cross-train technicians. Skill gaps are silent MTTR killers — one specialist on vacation extends every repair until they return.

Pre-stage tools and PPE. A repair crew arriving without the right tooling is a 90-minute MTTR penalty before any wrench turns.

Run failure mode reviews. Every long repair gets a 10-minute postmortem. Patterns emerge fast and become permanent fixes.

Raise MTBF

Stretch the time between failures

Hit 85% planned maintenance. When 85% of hours are scheduled and proactive, MTBF lifts 40 to 60 percent over reactive programs.

Move to condition-based PM. Calendar PMs miss early-stage failures. Sensor-driven PM catches degradation before it cascades.

Run RCA on every major failure. Root cause analysis prevents the same failure from repeating — the highest-leverage MTBF intervention.

Use OEM-quality parts. Cheaper spares often look identical and fail at half the MTBF. The bill of materials matters.

Train operators on damage prevention. A large share of "equipment failures" trace back to operating practices, not engineering.

Audit lubrication and alignment. The two most under-tracked PM tasks deliver outsized MTBF returns when done right.

What This Looks Like Inside OxMaint

Calculating MTTR and MTBF by hand is a one-time exercise. Trending them across hundreds of assets, weekly, with failure code breakdowns, takes a system. OxMaint does it automatically the moment a work order closes — no spreadsheet, no analyst, no delay.

Auto-calculation

Every closed work order updates MTTR and MTBF for the affected asset, equipment class, and site — no manual data entry.

Failure code analytics

Breakdowns segmented by failure mode show exactly which causes are eating your reliability — and where to focus PM redesign.

Asset-level trending

Weekly and monthly trends per asset surface degrading equipment before it tips into a forced outage.

Multi-site benchmarking

Compare MTBF across plants, lines, and shifts on the same equipment class — find the best performer and copy the playbook.

PM frequency tuning

Reliability engineering teams use MTBF trends to retune PM intervals — neither too frequent nor too lax.

Executive dashboards

Plant managers and VPs see availability, MTTR, and MTBF rolled up in one view — the meeting starts on the same page.

Frequently Asked Questions

What is the difference between MTTR and MTBF?

MTTR measures maintainability — how fast you fix a failure. MTBF measures reliability — how long an asset runs between failures. Together they determine availability via Availability = MTBF / (MTBF + MTTR). Sign up free to see both calculated automatically.

Does MTBF include planned downtime?

No. MTBF only counts unplanned failures and the operating hours between them. Scheduled PM downtime is excluded so the metric reflects true asset reliability rather than maintenance schedule choices.

What MTTR target should a manufacturing plant aim for?

Most manufacturing plants target 1 to 6 hours depending on asset criticality. Critical line equipment should sit under 2 hours; non-critical assets can run 4 to 6. Book a demo to see benchmarks for your specific asset classes.

Can a CMMS calculate MTTR and MTBF automatically?

Yes. A modern CMMS like OxMaint pulls operating hours, failure events, and repair durations from work order data and computes MTTR, MTBF, and availability per asset, equipment class, and site without manual analysis.

How quickly can a plant improve these metrics?

Most plants see measurable MTTR improvement within 60 to 90 days from mobile work orders and SOP standardization. MTBF gains take longer — usually one to two quarters once condition-based PM and RCA become routine. Start free and capture your first 30-day baseline.

Turn MTTR and MTBF From Report Card to Steering Wheel.

OxMaint calculates reliability metrics automatically, trends them across every asset, and surfaces the failure modes draining your uptime — so your maintenance team stops chasing data and starts improving it.

Start Free Trial Book a Demo