Identifying Bad-Actor Equipment in Power Plants with CMMS

By Johnson on April 17, 2026

power-plant-failure-analysis-bad-actor-equipment

In most power plants, the numbers tell a brutal story: roughly 10% of the equipment is responsible for 60%–80% of all maintenance spend, and the same tags keep appearing on breakdown reports year after year. These chronic offenders are called bad actors — the pumps, valves, motors, fans, and feeders that silently drain budgets, sabotage availability, and burn out reliability teams. Yet most plants cannot name their top ten because the data lives scattered across paper logs, operator memory, and outdated spreadsheets. A modern CMMS with structured failure history changes that entirely — ranking every asset by failure frequency, repair cost, and downtime hours so your reliability engineers stop firefighting and start eliminating the real profit killers.

The Bad-Actor Pareto Reality
10% of Your Assets Cost You 60%–80% of Your Maintenance Budget
Asset Count
10% of assets
Maintenance Spend
~70% of budget
Downtime Hours
~65% of lost hours

What Exactly Qualifies as a Bad Actor?

Top 10
Top 10 Equipment
The single highest-cost or highest-impact assets in the plant — the big gas turbine bearing, the HRSG economizer, the main BFW pump. Your reliability team's single most important target list.
Criterion: Top 10 by total failure cost or lost-MWh impact
Bad Actor
Bad Actors
Any piece of equipment that has experienced three or more unplanned failures in a rolling two-year window. These are the chronic cases that point to a systemic issue, not a one-off event.
Criterion: 3+ failures in 24 months
Repeat Offender
Repeat Offenders
Assets that failed more than once in the past six months — the fresh signals that something is actively wrong. Catch these early and they never graduate to bad actor status.
Criterion: 2+ failures in 6 months

The 5 Metrics That Rank Bad Actors in Any Power Plant

01
Failure Frequency
Total failure events / Rolling 24 months
The headline count. Simple, direct, and impossible to argue with. High-frequency failures point to wear-out patterns, design flaws, or misapplied equipment.
02
Total Repair Cost
Labor + parts + contractor + rental (rolling 12 months)
The financial truth. An asset with 3 moderate failures and a $180K spend beats an asset with 6 failures and a $20K spend on any reliability priority list.
03
Downtime Hours
Cumulative unit derate or outage hours per asset
Translates directly into lost generation revenue. Critical for unspared equipment where a single failure takes the unit down.
04
MTBF Decline
Mean Time Between Failures — year-over-year trend
The leading indicator. An asset whose MTBF has dropped 40% in the last 18 months is on the path to becoming a top-10 offender — catch it now.
05
Cost of Unreliability
Repair cost + lost revenue + collateral damage + penalty exposure
The complete financial picture. CoUR is what moves bad-actor conversations from the shop floor to the executive suite — and unlocks real investment in permanent fixes.

Example: A Typical Power Plant's Top 10 Bad Actor Ranking

Here is how a thermal generation plant's bad-actor scorecard typically looks after 24 months of structured CMMS failure data. Note how cost, not just count, drives the ranking — and how the top three assets alone absorb nearly half of total corrective spend.

Bad-Actor Scorecard — Rolling 24 Months
Rank Asset Failures Repair Cost Downtime (hrs) Status
1 BFW Pump P-201A 7 $284,000 96 Critical
2 Coal Mill 3B Gearbox 5 $198,000 72 Critical
3 Condenser Vacuum Pump 9 $156,000 54 Critical
4 Soot Blower 14 11 $92,000 28 High
5 Cooling Tower Fan CT-4 4 $88,000 36 High
6 HP Feedwater Valve V-118 6 $74,000 22 High
7 Air Preheater Drive 3 $58,000 18 Watch
8 Ash Handling Conveyor 2 5 $46,000 14 Watch
9 Service Water Pump P-401 4 $38,000 12 Watch
10 ID Fan Bearing Assembly 3 $32,000 16 Watch
Stop the firefighting cycle
Find your plant's real bad actors in 30 days — not 30 months
Oxmaint automatically structures every work order, repair cost, and downtime hour into a live bad-actor scorecard. No spreadsheets. No manual Pareto. Just the ranked list your reliability team needs to stop repeat failures for good.

Why Most Power Plants Cannot Identify Their Bad Actors

Paper and Spreadsheet Records
Failure history lives in operator notebooks, maintenance log binders, and departmental spreadsheets. No one can query it across 24 months of events.
Inconsistent Failure Coding
Same bearing failure coded three different ways by three different technicians. Pattern recognition is impossible without standardized failure modes.
No Cost Roll-Up by Asset
Labor hours, spare parts consumption, and contractor invoices live in different systems and never aggregate against the asset tag number.
Maintenance Owns PdM
Vibration and oil analysis data exists but sits unused because the maintenance team is too busy firefighting to analyze trends proactively.
No Repeat-Failure Alerts
Nothing automatically flags when the same asset comes back for the third work order in six months. Repeat offenders silently become bad actors.
Turnover Wipes Tribal Knowledge
Experienced techs who knew every problem asset retire. New techs start from zero. Institutional memory evaporates without a digital failure history.

The 6-Step Bad Actor Elimination Workflow

Step 1
Capture
Log every failure with a structured work order: date, asset, failure mode code, hours to repair, parts cost, labor cost, downtime impact. This is the raw material for everything downstream.
Step 2
Rank
Apply Pareto analysis across the rolling 24-month window on failure count, repair cost, and downtime hours. The top 10 list emerges from the data — not from memory.
Step 3
Investigate
For every bad actor, run a formal Root Cause Failure Analysis. Preserve failed parts. Interview operators. Pull PdM data. Document the true underlying cause — not just the symptom.
Step 4
Engineer
Convert the RCA findings into a solution: new PM task, revised operating procedure, component redesign, spec upgrade, or installation change. Cost-justify it against CoUR.
Step 5
Execute
Fund and field-install the fix. Most bad-actor programs die here. Without field execution, every step before this was a sunk cost with zero reliability return.
Step 6
Verify
Track failure rate, MTBF, and cost on the treated asset for 6–12 months post-fix. Confirm the bad actor is eliminated. Capture the lesson learned into the library for sister plants.

Common Power Plant Bad Actors and Their Root Causes

Asset Type Typical Failure Mode Common Root Cause Permanent Fix
Boiler feed pumps Mechanical seal failure Inadequate flush flow, water contamination Seal flush upgrade, filtration add
Coal mill gearboxes Bearing and gear wear Inadequate lubrication, load cycling Oil analysis program, load smoothing
Cooling tower fans Gearbox and blade failure Vibration from blade fouling, icing Balance program, anti-icing controls
Feedwater valves Seat erosion, stem binding Throttling at wrong characteristic Trim redesign, control tuning
Soot blowers Lance warp, drive failure Thermal cycling, poor alignment Material upgrade, alignment procedure
ID and FD fans Bearing and blade failure Erosion, imbalance, misalignment Coating upgrade, precision alignment

Before and After: What a Bad Actor Program Delivers

Without a Bad Actor Program
Same 3–5 assets fail repeatedly each year
Emergency maintenance dominates schedule
Reliability team constantly firefighting
MTBF declining year over year
Unplanned outages trigger capacity penalties
Tribal knowledge lost with each retirement
With a Structured Bad Actor Program
Top 10 list refreshed quarterly from live data
Planned work ratio climbs above 80%
Reliability team drives RCAs and upgrades
MTBF on treated assets doubles or better
Forced outage rate drops 25%–40% in 18 months
Lessons captured into plant-wide library

ROI of Bad Actor Elimination — The Math That Moves the Boardroom

Cost of Unreliability per Bad Actor
Repair Cost + Lost Revenue + Collateral Damage + Penalty Exposure
Example: Boiler feed pump with $284K repair + 96 downtime hrs × $18K/hr lost generation = $2.01M annual CoUR
$2.0M
Avg CoUR of a top-3 bad actor
$150K
Typical permanent fix investment
13x
First-year return on the fix
25–40%
Forced outage rate reduction in 18 mo

Frequently Asked Questions About Bad Actor Programs

How many failures make an asset a bad actor?
The common industry definition is three or more unplanned failures in a rolling two-year window, or two failures in six months for the repeat-offender watchlist. Every plant should tune thresholds to its asset count and data maturity. Book a consultation to set your thresholds.
Why should a CMMS own the bad-actor list instead of a spreadsheet?
A CMMS captures every work order, cost, and failure code automatically against the asset tag — making ranked Pareto analysis a one-click report instead of a monthly data-cleanup project. Spreadsheets lose data, drift, and age out. Explore how Oxmaint ranks bad actors from live data.
Who should own the bad-actor program — maintenance or reliability?
Reliability engineering should own it. Maintenance is focused on "what needs fixing today"; reliability has the time and tools to analyze trends, run RCAs, and engineer permanent solutions. The CMMS is the shared data backbone between them. Book a demo to see how it works.
How long before bad-actor elimination shows measurable results?
Most plants see measurable MTBF improvement within 6 months on treated assets and 20%–40% forced outage rate reduction within 18 months. New bad actors always emerge — the program is continuous, not one-and-done. Start your first scorecard with Oxmaint's free tier.
What if the same bad actor keeps returning after a fix?
That means the RCA missed the true root cause — usually stopping at a symptom like "bearing failure" instead of "contaminated lube oil source." Rerun the analysis with broader team input and preserved failure evidence. Schedule a reliability review to audit your process.
Can small plants with limited staff run a bad-actor program?
Yes — the program scales down cleanly. Even a top-3 list run quarterly by one reliability engineer delivers outsized returns. The barrier is always data capture, not analysis complexity. A modern mobile-first CMMS makes structured capture realistic for small teams.
Your bad actors are costing you now
Turn your maintenance history into a ranked bad-actor kill list
Oxmaint structures failure data, ranks bad actors automatically, routes RCA work orders to your reliability engineers, and tracks permanent fixes to verified closure. Stop paying the repeat-failure tax. Start eliminating the top 10 that are eating your budget.

Share This Story, Choose Your Platform!