Академический Документы
Профессиональный Документы
Культура Документы
www.lifetime-reliability.com 1
PEW/PWW Course Content
www.lifetime-reliability.com 2
Why do Machines and Equipment
Continue to Fail in Companies?
• “We get reliability by creating and building a thing that can do the
duty, and preventing its failure during use.” (LRS uses Plant Wellness Way to do that.)
www.lifetime-reliability.com 3
PEW/PWW Course Content
Day 1
Foundations
• Physics of Failure
• Reliability
• Risk
• Cost of Failure
• Series Arrangements
• Human Error
• Life Cycle
• Reliability Improvement
www.lifetime-reliability.com 4
Machines Fail because their Parts First Fail
www.lifetime-reliability.com 5
Understand How Machines are Designed
TIP: THE SECRET TO GREAT EQUIPMENT LIFE IS TO …
KEEP PARTS WITHIN THEIR DESIGN STRESS ENVELOPE!
L3
Size of a L4 L2
human
L1
hair
25 - 0.025
+ 0.025
- 0.01
25 + 0.01
Ted, when they design machines, like this shaft rotating in two bearings, they keep the parts in place by
making the gaps between them very small. The hair on your head is about 0.1 mm (0.004”) thick. On this
25 mm (1”) shaft, the gap between the metal surfaces can be as small as 0.01 mm (less then 0.0005”).
That is 10 times thinner than the thickness of your hair. That is very little space for things to move in. If
the parts get twisted and distorted then that clearance disappears and you have parts hitting each other.
Any machine in that situation will quickly fail.
www.lifetime-reliability.com 6
The Unforgiving Nature of Machine Design
How far off-center did the designer allow the shaft to move?
How much movement/angle did the bearing designer allow?
How much distortion before the parts overload and fail?
The parts’ engineering clearances mean that everything has to be exactly as the designer
planned it to be. The whole machine needs to run precisely as it should. If parts are deformed
outside of their tolerance, like in this sketch, then the bearings will fail in a matter of hours, and
not the years that they should last in a machine that is working as it was designed to operate.
Remember: The Limit of Machine Distortion is set by Design Tolerances – don’t let a
machine or its parts get twisted out of shape!
www.lifetime-reliability.com 7
Stress from Distortion
Failure
Failure
We must know what our equipment parts are made of and prevent high stress in those
with infinite life but replace those of finite life before they fail.
www.lifetime-reliability.com 10
The Equipment Designer Wanted a Long,
Trouble-Free Service Life
Strength
Failure
Factor of
Safety
Failure
Load
www.lifetime-reliability.com 11
Stress at Atomic and Microstructure Levels Destroy
Uncoil a paper clip and 1) bend it and 2) twist it as instructed by the
Presenter. Carefully count the number of cycles until it breaks.
Failure
For long operating life,
parts need to stay
Wrought Steel
Have you ever bent a metal wire back and forth until it breaks from being worked? If you have then you were performing
a stress life-cycle test. The wire does not last long when severely bent one way and then back the other way. Each bend is
an overstress, and eventually the overstressing accumulates as damage to the microstructure. The wire fatigues and fails.
The very same thing happens with the parts in yourwww.lifetime-reliability.com
machines. If you want your parts to NEVER FAIL FROM OVERSTRESS 12 –
KEEP OPERATING STRESSES BELOW INFINITE LIFE LEVELS.
Activity – Stressing Steel Parts
Paper Clip Failure Distribution Curve
Uncoil a paper clip and 1) bend it and 2) twist it as instructed by the
Presenter. Carefully count the number of cycles until it breaks.
Develop a distribution of the count of the number of
1) bending cycles to fatigue, and
2) torsion cycles to failure.
14
13
12
No of Failures
11
10
9
8
7
6
5
4
3
2
1
0
1 2 3 4 5 6 7 8 9 1 1 1 1 1 1 1 1 1 1 2 2 2 2 2 2 2 2 2 2 3 3 3 3 3 3 3 3 3 3 4 4 4 4 4 4 4 4 4 4 5 5 5 5 5 5 5 5 5 5 6
0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0
No of Cycles
www.lifetime-reliability.com 13
Impacts of Out-of-Roundness
Spalling
1500RPM = 25
impacts/second
3000RPM = 50
impacts/second
www.lifetime-reliability.com 14
Failing Roller Bearing Degradation Curve and
the Worsening Defect Severity
Stage 1.
Approx 10% to 20% Stage 2.
remaining life 5% to 10%
Failure Ultrasonic remaining life Stage 3.
Induced Energy Vibration Analysis Oil Analysis 1% to 5%
Detected Fault Detection Detected of remaining life
Audible Noise
Stage 4.
To Hot to Touch Remaining life
one hour to 1%
Mechanically
Loose
Ancillary
Damage
PRECISION
OPERATION AND
OPERATOR CARE MAINTENANCE
PREDICTIVE
PREVENTIVE (need to consider the
RUN TO FAILURE
probability of detection)
Catastrophic
Failure
Time
www.lifetime-reliability.com Source: Ricky Smith, Allied Reliability, 2009 15
Machinery Lubrication Article (5/2007), with additions
Operating Stresses Cause Failure
Source: Extract from ‘Mobile Plant Maintenance and the Duty Meter
Concept’, Hal Gurgenci, Zhihqiang Guan, Journal of Quality in Maintenance
Engineering, Vol 7, No4, 2001.
Walking Dragline
Tip: Because each operator handles the dragline differently, at their own work rate, there are varying stresses placed
on it. The cumulative wear on the machine is not consistent hour after hour, so using an hour-based preventive
maintenance period is inappropriate; you may be maintaining too early, or too late. The right way is to also count the
stress peaks and estimate how much life each one destroys and add that to the usage meter.
www.lifetime-reliability.com 16
The Operating Overload Cycle
Operating Performance
show no evidence of An Overload
Smooth Running
www.lifetime-reliability.com 17
Cause of Aging Failures
Time Dependent Load and Strength Variation The strength
distribution widens
Strength and falls over time.
An Another
The ‘Death’
Overload Overload
Overload
Likelihood of
failure is higher in
Load this region
Failure
Mode
Environment
Effects
and
Criticality
Operating
Analysis
Stresses
Life Cycle
Management
Strength Reliability
Of the Engineering
Material
Source: Pecht, Michael., ‘Why the traditional reliability prediction models do not work - is there an alternative?’,
CALCE Electronic Product and Systems Center of the University of Maryland, College Park, MD, 20742, USA.
www.lifetime-reliability.com 19
PEW SOLUTION: Physics of Failure Causes of
Atomic and Microstructure Stress
www.lifetime-reliability.com 20
PEW/PWW Course Content
Day 1
Foundations
• Physics of Failure
• Reliability
• Risk
• Cost of Failure
• Series Arrangements
• Human Error
• Life Cycle
• Reliability Improvement
www.lifetime-reliability.com 21
What is Reliability?
• “Reliability is the probability that an item of plant will perform its
duty without failure over a designated time.” (Formal Definition)
• “Reliability is the chance of completing the mission.” (Military Definition)
• “Reliability is the chance of success.” (LRS Definition)
• “We get reliability by creating and building a thing that can do the
duty, and preventing its failure during use.” (LRS uses Plant Wellness Way to do that.)
www.lifetime-reliability.com 22
Machine Reliability = Sum of Parts’ Reliability
Failure from Error Failure from Induced Stress Failure from Usage
•Defective parts •Operating overload •Too many aging parts
•Poor quality assembly •Rapid aging of some parts •Many parts degraded
•Manufacture error •Local environment degradation
System •Operator error Mean of Many
The ‘failure Rate of 50-70% •Poor operating practices 10-30%
curve’ for a Systems
Failing
•Poor maintenance practices (many machines)
machine has
•Poor design choice 20-30%
a special
name
– ROCOF –
Rate of A Single System
Occurrence (a machine)
of Failure.
Time or Usage Age of System
Component
Rates of
Failing
Parts put together into machines form a system of parts. When a working part fails the machine fails.
Hence the reliability of a machine is less than the reliability of its worst part. The ROCOF curve for a machine
reflects what happens to its parts, and moves up and down as parts fail. But when we take many identical
machines and collect their parts’ failure history together, we get a ‘steady average’ ROCOF, which is
representative of the reliability of the www.lifetime-reliability.com
machine design, and its use and care over its lifetime. 23
What is the Chance of this Drinking Glass Breaking?
Its Reliability is, ‘The chance it will hold water next time you use it’
www.lifetime-reliability.com 24
Chance of Failure for a Drinking Glass
1,000,000 glasses sold in packs of 12
What can cause this glass to break?
1 83,333 households buy a pack of 12
Say average household breaks 2 glasses a year • It can be dropped, for example -
1. slip from your hand
That is 166,667 glasses broken each year which are then replaced 2. fall off a tray
Failure Rate per Year
0 12 24 Time (months)
‘Opportunity’ for
breakage arises
regularly
www.lifetime-reliability.com 25
Stop Breakage = Remove Failure Causes =
Improved Reliability
Design Change
What can cause this glass to break?
1
• It can be dropped, for example -
1. slip from your hand
2. fall off a tray
Failure Rate per Year
×
Instructions • It can be crushed,
1. jammed hard between two objects
& Training 2. stepped-on
0.167 3. squashed under a too heavy object
• It can be temperature shocked,
1. in the dish washer
0.045
$ $ $ $ 2. during washing-up
• Mistreated,
1.
2.
It can be thrown in anger
It can be smashed intentionally
+ Mistreated - smashed • Latent damage
+ Knocked - hit 1. scratched and weakened to later fail more
easily
Dropped - hand 2. chipped and weakened to later fail more
0 easily
0 12 24 Time (months)
‘Opportunity’ for
breakage arises
regularly
www.lifetime-reliability.com 26
Reliability = Remove the Chance of Failure
Dropped
Hit/Impact
Total System
10 Yrs
Wear
Puncture
Total System
60,000 km
Misaligned
Insufficient Lube
Wrong Lube
Particulate/Dirt
Moisture
Poor Fit
Overload
Total System
www.lifetime-reliability.com
5 Yrs 27
The Unreliability of Systems of Parts (i.e. a
Machine) is the Sum of Its Parts Failure Rates
The shape and position of the ‘system’ curve is adjustable by varying
the policies controlling quality and maintenance!
The reliability of a
machine is always Quality Replace
Control, Equipment,
less than its parts. System
Training, Add more
Rate of
When one part Failing Precision components to
fails the whole Assembly PM, PdM (Condition Monitoring), Precision Operation PM
machine fails.
With many parts A Single System (machine) Mean of Many
in a machine, Systems (machines)
Component
there are many Rate of
Failing
chances of failure.
‘Precise’
‘Smooth’
‘Tight’
‘Dry’
‘Clean’
‘Cool’
‘Repeatable’
www.lifetime-reliability.com 30
Where does Failure Start?
Problems start with ‘chance’ variation in our decisions… for example
±1 Std Devn Accuracy Relative
for ‘Feel’ Cost
Number of Events
Turn-of-the-Nut ± 15% 3
Fastener Elongation ± 3 - 5% 15
Strain Gauges ± 1% 20
Torque Wrench
Number of Events
± 25% 1.5
Turn-of-the-Nut ± 15% 3
Fastener Elongation ± 3 - 5% 15
Strain Gauges ± 1% 20
www.lifetime-reliability.com 33
Identify Equipment Assemblies and
Parts at Risk of Failure
*^ * Wear-out (age/usage related failure) > PM inspection
^ + From Usage (contaminate with use) > PM renewal
^
^
• Induced Stress (random failure) > PdM condition
^ > PrM/PrO precision
^
^
^*
^ ^ Installation Error (early life failure)> PrM/PrO precision
^* > ACE 3T procedures
^
^
^*•
^*
^
^+
^
** •
* •^ www.lifetime-reliability.com 34
^+
PEW/PWW Course Content
Day 1
Foundations
• Physics of Failure
• Reliability
• Risk
• Cost of Failure
• Series Arrangements
• Human Error
• Life Cycle
• Reliability Improvement
www.lifetime-reliability.com 35
Activity – What is the Chance of Failure?
Chance of Failure = 1 – Chance of Success
Chance of Failure = 1 – Reliability
Roll Number No. of Times 1 Appears in 5 Dice
1 0
2 2
3 1
4 2
5 1
6 1
7 0
8 1
9 0
10 0
50 Dice 8
In a large number of random historic events the average chance does not accurately represent
the individual event chance. YOU CANNOT PREDICT THE OUTCOME OF A FUTURE RANDOM
EVENT BY USING PAST RESULTS (EXCEPT BY LUCK).
Estimated Life
Probable Life Uncertainty Wear-out
Zone
Rate that
parts fail
www.lifetime-reliability.com 36
Time
PEW SOLUTION: Reduce the Chance of Failure
Chance of Failure = 1 – Chance of Success = 1 – Reliability
• What failures don’t you bother repairing, but immediately replace with new?
(The risks of using rebuilt equipment are too much.)
• Which production equipment will you let fail? (The cost of failure is insignificant.)
• Which production equipment will you never allow to fail? (The cost of failure is too expensive.)
• When will you be willing to replace equipment that you will not allow fail?
(How much remaining life are you willing to give up to reduce the risk of failure?)
• What size safety and environmental failures will you allow? (Their cost is insignificant.)
www.lifetime-reliability.com 38
Acceptable Equipment Failure Domain
Risk = Consequence x [Frequency of Opportunity x Chance of Failure at Each Opportunity]
Repair Business Total
Cost per Cost per What is your tolerance for problems
Failure Event Failure Event on a piece of equipment?
$1,000K $10,000K
$100K Limit of
$10K
$10,000/Yr
$1K $10K
$0.1K $1K
Inside this Volume Accept Failure
10 10% 50% 100%
2 Chance of Failure
1
0.5
0.1
www.lifetime-reliability.com 39
Risk can be Calculated and Plotted
The ‘A’ curve is the same risk throughout
Frequency No/yr
Consequence $
www.lifetime-reliability.com 40
Risk using Log10 Chance and Consequence
Log Risk =
10 Log Consequence + Log Frequency
0.1
0.01
0.001
What is What is
All
the the
threat
likely chance
barriers
cause of the
in place
the ‘holes’
can
‘holes’ line-up
have
in the at the
‘holes’
barriers same
in them.
? time? Log Consequence $
Consequences
Hazard
www.lifetime-reliability.com 42
Want ALARP – As Low As Reasonably Practicable
ALARP Triangle
Intolerable
Risk Rising
Maximum Tolerable Risk
ALARP
Broadly Acceptable Risk
10-1 1 in 10 yr
C
10-2 1 in 100 yr
D
10-3 1 in 1,000 yr
E
10-4 1 in 10,000 yr
Low $100 $1,000 $10,000 $100,000 $1,000,000
Lowwww.lifetime-reliability.com
COST High 43
PEW SOLUTION: Asset Engineering, Operations and
Maintenance that Reduces Life Cycle Operating Risk
Engineering, Ops and
Maintenance Required
ACTUALLY PERFORMED
Inadequate Effort and Focus
Equipment Failure Rate
ACTUALLY PERFORMED
20-30% Correctly Matched
Focus with Least Effort
Time or use
www.lifetime-reliability.com 44
PEW SOLUTION: Use a Process to Create Reliability by
Reducing the Chance of Machine Component Failure
Reliability Growth
www.lifetime-reliability.com 45
Identifying Risks on a Standard Risk Matrix
This layout is the basic approach. There is full mathematical modelling as well, but this basic
table is fine to start with. The layout is universal. You calibrate it by changing consequence
descriptions to what you are willing to accept, and loss costs to what you are willing to pay.
www.lifetime-reliability.com 46
PEW SOLUTION: Uses a Tracking Risk Matrix to
Prove Asset Operating Risk Reduction
DAFT Cost
$1,000,000,000
per Event
$100,000,000
$300,000,000
$10,000,000
$30,000,000
$1,000,000
$3,000,000
$100,000
$300,000
$10,000
$30,000
$1,000
$3,000
Likelihood of Equipment
$100
$300
$30
Failure Event per Year
Event
Descriptor
Count / Time Scale Historic Description 1.5 2 2.5 3 3.5 4 4.5 5 5.5 6 6.5 7 7.5 8 8.5 9
Scale
Year
Twice per
100 2 3.5 4 4.5 5 5.5 6 6.5 7 7.5 8 8.5 9 9.5 10 10.5 11
week
Once per
30 1.5 3 3.5 4 4.5 5 5.5 6 6.5 7 7.5 8 8.5 9 9.5 10 10.5
fortnight
Once per
10 Certain 1 2.5 3 3.5 4 4.5 5 5.5 6 6.5 7 7.5 8 8.5 9 9.5 10
month
Once per
3 0.5 3.5 4 4.5 5 5.5 6 6.5 7 7.5 8 8.5 9 9.5
quarter
Almost Event will occur on an
1 Once per year 0 3.5 4 4.5 5 5.5 6 6.5 7 7.5 8 8.5 9
Certain annual basis
Event has occurred
Once every 3
0.3 several times or more in -0.5 3.5 4 4.5 5 5.5 6 6.5 7 7.5 8 8.5
years
Likely a lifetime career
0.1
Once per 10
years
Possible
Event might occur once
in a lifetime career
-1 3.5 4 4.5 5 5.5 6 CM 6.5 7 7.5 8
PM
0.03 Unlikely somewhere from time to -1.5 3.5 4 4.5 5 5.5 6 6.5 7 7.5
years
time CM cable thermographs
Once per 100 Heard of something like
0.01 Rare -2 3.5 4 4.5 5 5.5 6 6.5 7
years it occurring elsewhere
Once every PM oil filtration
0.003 -2.5 3.5 4 4.5 5 5.5 6 6.5
300 years PM oil change
Once every Never heard of this
0.001
1,000 years
Very Rare
happening
-3
PM oil leaks from TX 3.5 4 4.5 5 5.5 6
Once every
0.0003
3,000 years
-3.5 PM water ingress paths 3.5 4 4.5 5 5.5
Day 1
Foundations
• Physics of Failure
• Reliability
• Risk
• Cost of Failure
• Series Arrangements
• Human Error
• Life Cycle
• Reliability Improvement
www.lifetime-reliability.com 48
The Purpose of Business
I want to
$ show you the
financial
Revenue disaster that
EBITDA Profit plant and
Total Cost equipment
failures cause
a business.
Fixed Cost
Variable Cost
Output / Time
Normal Business Operations
www.lifetime-reliability.com 49
Impact of Defects and Failures on Profits
Once the equipment fails, new costs and losses start appearing.
Profits
forever lost
Added Cost Impact of a Failure Incident
$
Total Cost
Fixed Cost
Wasted Fixed Costs
Variable Cost
www.lifetime-reliability.com 50
And clearly, repeated plant and equipment failures and
stoppages totally destroy the profitability of an operation.
Total Cost
Fixed Cost
Wasted Fixed Costs
Variable Cost
t1 t2 t3 t4 t5 t6 Output / Time
Effects on Profitability of Repeated Failure Incidents
If there are lots of failures, you end up running around like headless chooks, losing
money faster and faster. It makes me laugh when I see this happening in a company.
Everyone is busy, but there little profit, … it is all lost in the ‘failure cost surges’.
www.lifetime-reliability.com 51
Benefits of Reducing Operating Risk
$ Accumulated Wasted Variable Fewer profits
Revenue lost, but ‘fire-
and Failure Costs
fighting’ is
high
Risk ($/yr) =
Total Cost
Frequency (/yr) x
Fixed Cost
Wasted Fixed Costs
Variable Cost
Consequence ($)
t1 t2 t3 t4 t5 t6 Output / Time
Effects on Profitability of Reducing Consequence Only
Fewer
$ Profits
Fortunately, we can do something Revenue
Lost
about it. There are two choices – 1)
get very good at fixing failures fast, or Total Cost
2) don’t have failures in the first place
- ZERO DEFECTS is the way to go. Fixed Cost
Wasted Fixed Costs
Variable Cost
t1 t2 Output / Time
Effects on Profit of Reducing Chance Only
www.lifetime-reliability.com 52
PEW SOLUTION: Build a Life Cycle System that Creates
Plant and Equipment Reliability Improvement
Fewer profits
$ Accumulated Wasted Variable lost, but ‘fire-
Revenue
and Failure Costs fighting’ is
high
Total Cost
Fixed Cost
Wasted Fixed Costs
Variable Cost
t1 t2 t3 t4 t5 t6 Output / Time
Effects on Profitability of Reducing Consequence Only
Fewer
$ Profits
Revenue
Lost
Total Cost
Full Highly Lifetime
High Quality
Production Reliable Health and
Maintenance
Capacity Machinery Fitness Fixed Cost
Wasted Fixed Costs
Variable Cost
t1 t2 Output / Time
Effects on Profit of Reducing Chance Only
www.lifetime-reliability.com 53
PEW/PWW Course Content
Day 1
Foundations
• Why Machines Fail
• Reliability
• Risk
• Cost of Failure
• Series Arrangements
• Human Error
• Life Cycle
• Reliability Improvement
www.lifetime-reliability.com 54
Reliability Properties for Series Systems
• Series Systems 1 1 1 n
Rsystem= R1 x R2 x R3 …..Rn
www.lifetime-reliability.com 55
Reliability Properties for Series Systems
Rsystem= R1 x R2 x R3 …Rn
• Implications for Series Systems
1 1 1 n 1 System-wide improvements lift reliability higher
than local improvements. This is why SOP’s, training
and up-skilling pay-off.
Properties of Series Systems 2 Improve the least reliable parts of the least reliable
1. The reliability of a series system can be no
higher than the least reliable component. equipment first.
3 Carry spares for series systems and keep the
2. If ‘k’ more items are added into a series system reliability of the spares high.
of items (say 1 added to a system of 2, each
with R = 0.9) the reliability of all items must 4 Standardise components so fewer spares are
rise an equal proportion (3.5%), to maintain needed.
the original system reliability. 5 Removing failure modes lifts system reliability.
(0.9 x 0.9 = 0.932 x 0.932 x 0.932 = 0.81)
This is why Root Cause Failure Analysis (RCFA) and
3. A small rise in reliability of all items (say R of Failure Mode and Effects Analysis (FMEA) pay off.
the three items rises 0.93 to 0.95, 2.2% 6 Provide pseudo-parallel equipment by providing
improvement) causes a much larger rise in tie-in locations for emergency equipment .
system reliability (from 0.81 to 0.86, 6.5%).
7 Simplify, simplify, simplify – fewer components
means higher reliability.
www.lifetime-reliability.com 56
Reliability Properties for Parallel Systems
• Parallel Systems 1
www.lifetime-reliability.com 57
Reliability Properties for Parallel Systems
Rsystem= 1-[(1- R1)x(1- R2)x …(1-Rn)]
• Implications of Parallel
1 1 1 n
Systems for Equipment
1 Use parallel arrangements when the risk of
failure has high DAFT Cost consequences.
Properties of Parallel Systems 2 Consider providing various paths for product
1. The more number of components in parallel the to take in production plants with in-series
higher the system reliability. equipment.
2. The reliability of the parallel arrangement is 3 Build redundancy into your systems so there
higher than the reliability of the most reliable is more than one way to do a thing.
component.
m m m m
m m m m
R4 R6 R8 R1 R14 R2 R14 R3 R5
12 NOTE:
• Rn = Component reliability = chance of success the part will work right.
11 • To maximize Rn specify the conditions that deliver maximum ‘chance of success’
10
9
13
7 This is a reliability block diagram. Think of the string of
3
4
2
1
8
5 6
parts as the machine’s DNA. If a ‘gene’ is faulty in the
DNA then cancer starts and grows to kill the whole body.
The Job
Rjob=
Rbusiness=
NOTE:
• Rprocessn = process reliability = chance of success the process will be done right
• To maximize Rprocessn specify the conditions that deliver maximum ‘chance of success’
Proces
Proces
ProcesProcess 13 Proces
Process 19
Proces
Proces
Proces
Proces
s 25
s 13 Process 16 s 14s 6
s 16
s 17
s1
s6
s9
Process 2 Process 10 Proces s 23Process 23
Proces
Proces Proces
Process 25
Process 17 Process 19
s 24
Process 1
Process 6
Process 9
s5 Processs 1224
Ra Pre Ma Ass Pa Pr
Process 5 wR 12 parR
Process nufR
s2 em R cka R
s 10 od
Proces
Proces
Proces
Proces
Proces
Proces
Ma ati act Process 11
bly gin uct
Process 19
Proces
s 12
s 15s 9
s3
s7
s 8s 1
Proces
Proces
Proces
Proces
Proces
1
ter Proces
on
2
ureProces
3 Proces
4
g Proces
5 s 22 Proces
s 25
s 13
s 16
s 17
Proces
Proces
Proces
ials s 4 s 26 s 18
Proces s 23
s 20
s 14
Proces
Proces Proces s 24
Raw Preparation Manufacture Assembly
Proces s 5 Packaging s 12 Product
Materials s 21 Ra Pre Ma Ass Pa Pr
wR par
R nuf
R emR ckaR od
Proces
Proces Process
Proces
Proces
Proces
Proces
s 11
R1 R2 R3 R4 Ma ati R5 act bly gin uct
s 14
s 12
s 15
s3
s 7 Proces
s8
1
ter Proces
on
2
ureProces
3 Proces
s2
4
g Proces
5 s 22
s 10
Proces
s4 s 26 s 18
Proces
Process 14
ials
s 20
s 14
Process 12
Process 3
Process 7
Process 8
Proces Proces
Proces
Proces
Proces
Process 19
Proces
Proces
Proces
Process 4 Process 26 Proces Process 18 Proces
s 25
s 13
s 16
s 17
s 1 15
s6
s9
Proces s 23
s 21 Proces
Proces
Process 11 Proces s 24
s5 s 12
Ra Pre Ma Ass Pa Pr
Process 20
Process
wR parR nuf
R em R cka
R od
Proces
Proces
Proces
Process 22
Proces
Proces
Proces
s 11
Ma ati act bly gin uct
s 14
s 12
s 15
s3
s 7 Proces
s8
1 Proces 2
ureProces
3 Proces 4 s 22
g Proces
5
14
Process 21 ter on
Proces
ials s4 s 26 Proces s 18
s 20
s 14
Process 21
NOTE:
• To maximize the ‘chance of success’ each process will be done right, specify those conditions
that deliver maximum ‘chance of success’
www.lifetime-reliability.com 62
Series System Reliability Property #1
1. The reliability of a series system can be no higher than the least reliable component.
Rsystem= R1 x R2 x R3 …Rn
R1 R2 Rsystem
0.9 0.8 0.72 Reliability with ‘Bad Actor’
12.5% 12.5% Percentage Improved
www.lifetime-reliability.com 63
Series System Reliability Property #2
2. If ‘k’ more items are added into a series system of items the reliability of all items
must rise an equal proportion to maintain the original system reliability.
Rsystem= R1 x R2 x R3 …Rn
R1 R2 R3 Rsystem
0.9 0.9 0.81 Before adding one more item
-10%
www.lifetime-reliability.com 64
Property 2 advises ‘Simplify, Simplify, Simplify’
13
11 12 14 5
10
Shaft 9
1 1
3 4 2
5 6
2 7
8 3
4
14 things to go wrong… 5 things to go wrong…
www.lifetime-reliability.com 65
PEW SOLUTION: Apply Series System Reliability
Property #3 (best practise throughout the operation)
3. A small rise in reliability of all items causes a much larger rise in system reliability
Rsystem= R1 x R2 x R3 …Rn
R1 R2 R3 Rsystem
0.93 0.93 0.93 0.805
2.1% 2.1% 2.1% 6.5%
www.lifetime-reliability.com 66
PEW/PWW Course Content
Day 1
Foundations
• Physics of Failure
• Reliability
• Risk
• Cost of Failure
• Series Arrangements
• Human Error
• Life Cycle
• Reliability Improvement
www.lifetime-reliability.com 67
Common Wrongs Humans Do To Machines
• We twist them
• We squeeze them
• We hit them
• We poison them
• We burn them
• We shake them
• We break them
• We choke them
• We overload them
• We boil them
www.lifetime-reliability.com
www.lifetime-reliability.com 68
The Odds are Against Doing it Right!
40,000+ ways to
incorrectly reassemble!
(Factorial 9 possible errors)
www.lifetime-reliability.com 69
80% of Failure Events are Caused by
Human Factors and Human Error
Nuclear Plants: 88 –72%
Barringer, H. Paul, ‘Use Crow-AMSAA Reliability
Growth Plots To Forecast Future System Failures’
Dams: 90%
(Henry, P., ‘Design Paradigms: Case Histories of
Error and Judgment in Engineering’)
www.lifetime-reliability.com 70
Human Factors – limitations of Flesh and Bone
Physical Psychological
• Size • Experience
• Gender • Knowledge
• Age • Training
• Strength • Attitude
• The five senses • Emotional state
Physiological Psychosocial
• Health • Interpersonal relations
• Nutrition • Ability to communicate
• Lifestyle • Empathy
• Alertness/fatigue • Leadership
• Chemical dependency
Source: ‘Maintenance Human Factors Presentation,’ Federal Aviation Administration, 2008
www.lifetime-reliability.com 71
The 12 Most Common Causes of Human Error
www.lifetime-reliability.com 72
PEW SOLUTION: Apply the Answers in the Human
Error Rate Table to Reduce Human Error 10,000%
~5 sigma
~4 sigma
1
3
2
~4.5 sigma
1
2
~2 - 3 sigma
3
The Table confirms that ‘human element’ error is real and unavoidable. We do not perform well
when tasks are structured in ways that require care and we perform especially badly under
complicated non-routine conditions. Add stress into that that mix and you get disaster.
www.lifetime-reliability.com 73
Machines Suffer High Risk from Human Contact
Job 1 Job 2 Job 3 Job 4 Job 5 Job 6 Job 7 Job 8
Rjob= Rjob= Rjob= Rjob= Rjob= Rjob= Rjob= Rjob=
Shaft
Journal
Shaft
Seal
Lock
Nut
Inner
Race
X Roller
bearing
X Outer
race
Housing
Bore
“One fails; all fails”
“One poor; all poor”
www.lifetime-reliability.com 74
Reliability of Series Work Process
Series Tasks
1 2 3 4 5 6 7 8 9 10 11 12
R1 R2 R3 R4 R5 R6 R7 R8 R9 R10 R11 R12
1 1 1 1 1 1 .9 1 1 .9 1 1
.9 .9 .9 .9 .9 .9 .9 .9 .9 .9 .9 .9
.99 .99 .99 .99 .99 .99 .99 .99 .99 .99 .99 .99
Top System Mid System Bottom System
R system= R1 x R2 x R3 … R system= R1 x R2 x R3 …
R system= R1 x R2 x R3 …
0.9 x 0.9 x … 0.9 0.99 x 0.99 x … 0.99
0.9 x 0.9 x 110
(0.9)12 = 0.2824 (0.99)12 = 0.8864
0.81
If 50 tasks (0.9)50 = 0.0052 If 50 tasks (0.99)50 = 0.6
www.lifetime-reliability.com 75
Carpenter’s creed: ‘measure twice, cut once’
Get wood Measure 1 Mark wood Cut wood 1 error every 200
opportunities
R= 0.995 ~ 1 / wk
0.995
Get wood Mark wood Cut wood 1 error every 5000
0.995 opportunities
~1 / 20 wk
??? ??? ???
www.lifetime-reliability.com 76
PEW SOLUTION: Use the power of parallel
proof-tests on every task activtity
Original task
reliability 0.9 0.9 0.9 0.9 0.9 0.59
AND
Proof-test 0.99 0.99 0.99 0.99 0.99 0.95
reliability
R system= 1- [(1- R1)x(1- R2)x(1-R3) …]
1- [(1- 0.9)x(1- 0.99)]
1- [0.1 x 0.01]
1- [0.001] = 0.999
Equivalent
series 0.999 0.999 0.999 0.999 0.999 0.995
reliability
www.lifetime-reliability.com 77
Can we get 10,000% fewer errors?
(Two sigma to five sigma)
Work Environment
1 2 3 4 5 6 7 8 9 10 11 12
R1 R2 R3 R4 R5 R6 R7 R8 R9 R10 R11 R12
Work Environment
1 2 3 4 5 6 7 8 9 10 11 12
1 2 3 4 5 6 7 8 9 10 11 12
www.lifetime-reliability.com 78
Parallel Proof Test Activity Reliability
Parallel Tasks
.9 .9 .9 .9 .9 .9 .9 .9 .9 .9 .9 .9
.9 .9 .9 .9 .9 .9 .9 .9 .9 .9 .9 .9
Clearly, adding a proof test into each task to check accuracy makes tremendous improvement in
workmanship quality. This process went from a 28% chance of it being done right, to a 89% chance!
www.lifetime-reliability.com 79
How Much Must We Control Chance?
Add in more Parallel Check-Test Tasks
.9 .9 .9 .9 .9 .9 .9 .9 .9 .9 .9 .9
.9 .9 .9 .9 .9 .9 .9 .9 .9 .9 .9 .9
.9 .9 .9 .9 .9 .9 .9 .9 .9 .9 .9 .9
Task With Second Parallel Proof Test Series with Second Parallel Prrof Test How many 9’s is enough?
1 1 1 1 1 1 1 1 1 1 1 1
R system= R1 x R2 x R3 ….
1x1x…1
(1)12 = 1
If 50 tasks (1)50 = 1
Day 1
Foundations
• Physics of Failure
• Reliability
• Risk
• Cost of Failure
• Series Arrangements
• Human Error
• Life Cycle
• Reliability Improvement
www.lifetime-reliability.com 82
What Makes a Productive Equipment Life?
MAINTENANCE KPI: Unit Cost
When you make plant
more reliable you
Maintenance proportion = Cost
High Capacity
work on the ‘capacity’
of the Unit Cost
Return part of the Unit Cost
On Investment equation. As a result
you drive down the
High Productivity, cost of your product
Low Operating Cost because the plant is
available to work at
High Availability, full capacity for longer.
You make more
High Capacity product in the same
time for less cost.
High Reliability
www.lifetime-reliability.com 83
The Life Cycle of Plant and Equipment
Equipment Life Cycle
Decommissioning
Disposal
Feasibility
Approval
Commissioning
Preliminary Design
Idea Creation
Detail Design
Procurement
Operation
Construction
www.lifetime-reliability.com 84
Effect of Process Failures Across Life Cycle
Process failures during this phase will cause plant and equipment failures in operation.
Equipment Life Cycle (say 20 years)
Decommissioning
Feasibility
Disposal
Approval
Commissioning
Detail Design
Procurement
Preliminary Design
Operation
Idea Creation
Construction
System
Chance
of Failing
Component
Chance of
Failing
www.lifetime-reliability.com 85
When Operating Costs are Committed
Once a plant is
designed and built
there is very little that
can be done to reduce
operating costs
because they are
substantially fixed by
the plant’s design. If
you want low
operating costs, this
chart makes it clear
that they are designed
into the plant and
equipment during
feasibility, design and
construction.
www.lifetime-reliability.com 86
Low Operating Costs are Designed-in for Life
Organisations separate projects from operations and unwittingly create a divide between groups.
You must create continuity of information back and forth along the life cycle so the total life cycle
costs are minimised. Low capital costs do not necessarily mean most profit in future.
Decommissioning
Disposal
Feasibility
Commissioning
Approval
Detail Design
Procurement
Preliminary Design
Operation
Idea Creation
Construction
Specify the
Production Needs …
Capital Project“To
~ 10be- 20%
… of Life Cycle Cost Operation ~ 80 - 90% of Life Cycle Cost
On Time
A proportionately small capital expenditure in the project phase on better equipment and strategies
On Quality
On Cost
(like intentionally buying bigger equipment and de-rating it to lessen the internal stresses) lowers
On Profit
operating costs, and …consequently delivers higher operating profits all the years of the plant’s life.
“… we need 98% 87
www.lifetime-reliability.com
Plant-wide Availability!”
PEW SOLUTION: Stop Variability and Defects Across
the Business and Plant and Equipment Life Cycles
Every process throughout the life cycle will create many defects.
Management Engineering Supply Contractor Operations Maintenance
Install Operate
Maintain
Specify Design Store
Buy Start-up
Introduced
Defect and
Failure Cost
Surge
Operating Plant Uptime defects
and Throughput
Variability in
each process
causes defects
1 Serious Failure
which at times
10 Losses Product progress to
6,500 Repairs
Higher Unit Cost, ‘failure’.
20,000 Defect Modes
Poor Quality and
The Failure Pyramid
Source: Ledet, Winston, The Manufacturing Game
Delayed Delivery Source: Thanks to Ron Moore from Ron Moore
Group in the USA for this concept.
www.lifetime-reliability.com 88
Manage and Continually Reduce the Risks to
Your Plant and Equipment Life Cycle
Equipment Life Cycle
Decommissioning
Feasibility
Commissioning
Approval
Detail Design
Procurement
Operation
Idea Creation
Construction
Disposal
To be sure we consider the long-term well-being of our business, we view the
health of its plant and equipment from the end of their life cycle. We ask what do
we do to get a long, healthy, problem-free operating life?
www.lifetime-reliability.com 90
PEW/PWW Course Content
Day 1
Foundations
• Physics of Failure
• Reliability
• Risk
• Series Arrangements
• Human Error
• Life Cycle
• Reliability Improvement
www.lifetime-reliability.com 91
How do we to Stop Problems Happening?
Maintenance sees … Production sees …
What We See
What Caused It
To getTothe
getright
the end
rightresults
end results
we first
youneed
firstto
need
get to
theget
basics
the
right.
foundation
Then the basics
right
right.
results
Thencanthe
follow.
right outcomes can follow.
www.lifetime-reliability.com 92
Journey to 6 Sigma: Minimize Variability
Accuracy Controlled
Enterprise (ACE 3T)
Total Quality Control (SPC)
Trained People with Management
Perfect processes
Systematize processes;
Standardize procedures
1σ 2σ 3σ 4σ 5σ ‘6 Sigma’ Rating
69 31 7 0.6 0.023 Errors per 100
Sources: David Burns, SIRF Roundtables Ltd, Melbourne, Australia
George. Mike et al, ‘What is Lean Six Sigma’, McGraw-Hill, 2004
www.lifetime-reliability.com 93
Chance of Trouble at Each Sigma Level
Probability of Process Six Sigma
Ranking Failure Rate Error Rate
Failure Capability Rating
High 7 1 in 50 1 69 in 100
8 1 in 25
Very High 9 1 in 10
10 1 in 5
* = process not capable at minimal levels
www.lifetime-reliability.com 94
Most Businesses make their Machines Break
MAINTENANCE KPI
Breakdown Hours
Hours Control Chart
± 3 sigma
Week No
X
vibration!
X
X
Get rid of the cause of oil
contamination!
First trial
Range of Possible Outcomes for Time,
Quality Cost, Quality
Second draft
Second trial
Standardised Procedure
means… developing one best way Third draft
Third trial
±10% Range of Possible Outcomes for
Time, Cost, Quality
New
New Ideas Procedure Technology
www.lifetime-reliability.com 97
Remove the Variability in a Job by Using 3T
(Target-Tolerance-Test) Error Proof Techniques
Only accept this range of outcomes
because they give very low risk
Number of Events
Acceptable
Outcome
By setting quality controls into a job you ensure the actions that create reliability are
done thereby greatly reducing the chance that a mistake will be made.
www.lifetime-reliability.com 100
Case Study 1 : Shaft Quality Control for Bearing
Reliability
See the example in the workbook on how to check the
quality of a shaft is suitable for a roller bearing
www.lifetime-reliability.com 101
In Each Work Task Provide a Target to Hit
29.8 to 30.2 is
acceptable
(Max 3 – (Include all tables, diagrams and pictures here) Describe the test and below specify the
4 words) target as ‘BEST’ and range of acceptable
results that are ‘Good enough’.
PASS / FAIL /
ACCEPT REJECT
www.lifetime-reliability.com 106
PEW SOLUTION: Develop and Use Reliability
Creating ACE 3T Error Proof Procedures
Build ACE 3T Mistake Proofing into SOPs
•Set a target for each task.
•Specify the acceptable tolerance.
•Do a test to prove accuracy.
Bands of Lesser
Frequency of Outcome
Quality
Quality
(Decreasing Accuracy) improvement
occurs when
‘Good, Better, Best’ variation is
‘Bronze, Silver, Gold’ reduced
Output
Specification
www.lifetime-reliability.com 107
Use Two-Sided Error Proof Quality Control
• Put Mistake Proof Controls into SOPs with clear 3T task standards
•Use the As-Designed Standards… then improve them
• Work to Internationally Recognised Standards Accuracy
– Set a Target for each task Precision
Measure
Good
World-Class BEST
Standard
BETTER
Stretch Standard
Specification
Equipment GOOD Range of Outcomes
Manufacturer’s “By stating the exact requirements to meet you are causing
Standard people to take all necessary actions to achieve them.”
www.lifetime-reliability.com 108
Write SOPs that include Failure Prevention
Sign-off
Task Task Task Record Action if
Tools & Full Description Test for After
Step Step Step
Condition of Task
Tolerance Range Actual Out of Complet
No. Owner Name Correctness Result Tolerance e
When procedures are written with the 3Ts you can guide people right to the outcome they need
to deliver. We build into 3T procedures the necessary actions, that when performed, will deliver
the maintenance strategy. We give people a way to check that their work is exactly what it
needs to be. They self-improve and gain the self-satisfaction of having done a great job.
www.lifetime-reliability.com 109
Continuous Improvement with Accuracy
Controlled Procedures
Now you know what ‘good enough’ is!
Drives Improvement
Continuous improvement: Make ‘better’ ‘good’, make ‘best’ ‘better’ and set a new
standard for ‘best’. In this way, you will drive quality improvement and innovation in
your company. ‘Good enough never is!’
www.lifetime-reliability.com 110
Activity – Put 3T Quality Control in Procedures
NOTE:
None of these task will
prevent the pump and
piping from failing. These
tasks find future repairs…
and you want a healthy,
reliable pump set...
…prevention is easier than
cure
www.lifetime-reliability.com 111
Precision Maintenance of Machinery is …
1. Accurate Fits and Tolerance at Operating Temperature
2. Impeccably Clean, Contaminant-Free Lubricant Life-long
3. Distortion-Free Equipment for its Entire Lifetime
4. Shafts, Couplings and Bearings Running True to Centre
5. Forces and Loads into Rigid Mounts and Supports
6. Laser Accurate Alignment of Shafts at Operating Temperature
7. High Quality Balancing of Rotating Parts
8. Low Total Machine Vibration
9. Correct Torques and Tensions in all Components
10. Correct Tools in the Condition to do the Task Precisely
Number 14 is the one that the vast
majority of companies miss. They
11. Only In-specification Parts don’t systemize and standardize
the delivery of precision to their
12. Failure Cause Removal during Maintenance machinery.
NOTE: These parameters are indicative and may not apply to a particular machine. Confirm actual requirements with the manufacturer.
www.lifetime-reliability.com 113
PEW SOLUTION: World Class Standards produce
World Class Equipment Reliability
Source: Wayne Bissett, OneSteel
Reliability Manager, Planning and
Condition Management Presentation,
Sydney, Australia, 2008
‘Precise’
‘Smooth’
‘Tight’
‘Dry’
‘Clean’
‘Cool’
‘Repeatable’
www.lifetime-reliability.com 114
Asset Management in a Nutshell
Business RISK performance of an asset is influenced by two things:
– Its inherent reliability established at the design stage.
– Its achieved availability, which depends on how well the asset is
operated and maintained through its life.
MAINTENANCE:
•Contributes to the success of the organisation
•It’s purpose is to eliminate the need for maintenance
•It’s goal is to do everything in its power to
– Increase the quantity and quality of production, and
– Reduce costs BY REDUCING THE CHANCE OF RISK
Sources: Jim Wardhaugh, UK Consultant (30 years with Shell
and Centre of Excellence Leader)
www.lifetime-reliability.com 115
PEW SOLUTION: A View of the Plant Wellness
Way (PWW) Climb Up to Reliability Excellence
100%
Maximum
Availability Plant and
Equipment
Wellness
Business
Equipment Availability
Preventive
Maintenance Business System
Upkeep
Limiting
Availability and
Maintenance Cost
Breakdown
Maintenance
After Failure
< 80%
Destructively Exceptionally
High Low
Maintenance Cost
www.lifetime-reliability.com 116
Integrated System to Reliability & Plant Wellness
Unit Cost of Maintenance
Performance
MTBF WO per
Measures
Operating Costs
Organizational
Asset Strategy
Action Plan
and Training
Plan
CEO
Shopfloor Maintenance
Supply
www.lifetime-reliability.com 118
World Class Practices Required at Every Step
Use A Quality Management System incorporating Proof Tests to drive
Accuracy and Continuous Improvement
3T
Plan
Do
3Ts are used here to
3T guarantee improved performance Check
(Measure)
reliability in every process
Act
(Improve)
3T
www.lifetime-reliability.com 119
Follow the Processes of Plant Wellness Way
We can
summarise the
key
requirements in
a simple chart
Mindset to show people
Barrier the necessary
philosophies
that need to be
introduced at
Skills/Knowledge the different
Barrier parts of ‘The
Journey’.
Companies
often hit
‘barriers’ and
they struggle to
get past due to
old paradigms.
www.lifetime-reliability.com 120
Develop the People of Plant Wellness
People want to
feel valuable, and
to be valued. You
can do both
when you
increase their
worth to a
business. This
diagram shows
where you need
to take your
people, and as a
consequence
they will bring
your business
with them.
Overcoming the ‘Skill and Knowledge Barrier’
www.lifetime-reliability.com 121
Observe the Financial Steps to Plant Wellness
Don’t forget that
your financial
recording and
analysis systems
will need to keep
pace with the
informational
requirements the
Plant Wellness
philosophies.
www.lifetime-reliability.com 122
Teach the Leadership Culture of Plant Wellness
Culture is ‘the
way we do things
around here’.
Overcoming the ‘Mindset Barrier’
You want a
culture of
continual
improvement,
where people
naturally come
up with new
ideas and are
helped to trial
them. You don’t
want the culture
of ‘that’s the way
we’ve always
done things
around here’.
www.lifetime-reliability.com 123
PEW SOLUTION: The ‘Wholeness’ of Plant Wellness
People, money,
machines and
processes all
working together in
an integrated
Processes People learning system,
where business life
cycle risk is
controlled to
maximize machine
and equipment
health and
Capital Culture wellbeing, so the
business and its
people profit – that
is ‘Plant Wellness’.
www.lifetime-reliability.com 124
PEW SOLUTION: The Plant and Equipment
Wellness Way to Operational Excellence
Your Ideal Operational
. Excellence Your Ideal Operational Excellence
Asset Management System Asset Management System
Engineering, Maintenance, Precision Quality Operation
and Operational Life Cycle Standards Management Standards
Quality Engineering
Quality Processes Standards Standards
Construction, Precision Engineering
Operations and Skills Materials Design Operator
of
Maintenance Practices Construction Skills
Defect Environment
Elimination Stress
Precision ReductionPrecision
Strategy Installation Operation
Operating Distortion Degradation
Risk Control Control
Reduction ATOMIC STRESS TO
Machine BUSINESS PROCESS MODEL Component
START HERE Parts Distress
for world class results Health (Atomic Structure
www.lifetime-reliability.com Failure)
www.lifetime-reliability.com 125
PEW SOLUTION: Do the Six IONICS Processes to
Plant and Equipment Wellness
6-STEP PLANT WELLNESS PLANT WELLNESS
METHODOLOGY IONICS PROCESSES
Process
A LIFETIME OF HIGHLY
RELIABLE PRODUCTION PLANT
AND EQUIPMENT 126
www.lifetime-reliability.com
The Plant Wellness Way: A Business Wide and
Life Cycle Long System of Asset, Operations
and Maintenance Management
Reliability Human Document & Learning Capital Project Business Risk
Engineering Capability Configuration Organisation Management Management
Management Management Business
Planning and Integrated Knowledge Process Life Cycle
Scheduling Business Systems Management Improvement Costing
ACE Precision ACE Quality ACE Quality ACE Engineering ACE Operation
Standards Standards Management Standards Standards
Supply Chain Lean Workplace Stores & Materials Operating Risk
Management Practices Management Reduction
Precision Robust Materials of Precision
Maintenance Engineering Construction Operator
Skills Design Selection Total Skills
Preventive Predictive Productive
Maintenance Maintenance Maintenance
ACE 3T Precision Proactive ACE 3T Precision
Installation Maintenance Operation
Parts’ Environment Operating
Management Stress Reduction
Distortion Degradation
Management Management
Failure Cause
Prevention
Equipment
Parts’ Health
START HERE (Low Atomic
for world class system Stress)
www.lifetime-reliability.com 127
The PEW/PWW Strategy….
• Maximises life cycle profit, by
• Markedly reducing human error, and
• Proactively minimizing component stress, thru
• Greatly increasing work quality results, to
• Produce the world class reliability, that
• Delivers the least maintenance commitment, and
• Brings Operational Excellence performance, by
• Creating a single holistic business system, to
• Address life cycle cost issues and reduce variation, that
• Produces the least operating equipment risk profile, and
• Results in the least operating costs forevermore.
www.lifetime-reliability.com 128