Вы находитесь на странице: 1из 12

Reliability Engineering and System Safety 144 (2015) 83–94

Contents lists available at ScienceDirect

Reliability Engineering and System Safety


journal homepage: www.elsevier.com/locate/ress

Multi-level predictive maintenance for multi-component systems


Kim-Anh Nguyen a, Phuc Do b,c,n, Antoine Grall a
a
ICD, ROSAS, LM2S, Université de Technologie de Troyes, UMR 6281, CNRS, Troyes, France
b
Université de Lorraine, CRAN, UMR 7039, Campus Sciences, BP 70239, Vandoeuvre-les-Nancy 54506, France
c
CNRS, CRAN, UMR 7039, France

art ic l e i nf o a b s t r a c t

Article history: In this paper, a novel predictive maintenance policy with multi-level decision-making is proposed for
Received 11 April 2015 multi-component system with complex structure. The main idea is to propose a decision-making process
Received in revised form considered on two levels: system level and component one. The goal of the decision rules at the system
16 July 2015
level is to address if preventive maintenance actions are needed regarding the predictive reliability of the
Accepted 17 July 2015
system. At component level the decision rules aim at identifying optimally a group of several
Available online 28 July 2015
components to be preventively maintained when preventive maintenance is trigged due to the system
Keywords: level decision. Selecting optimal components is based on a cost-based group improvement factor taking
Multi-component system into account the predictive reliability of the components, the economic dependencies as well as the
Condition-based maintenance
location of the components in the system. Moreover, a cost model is developed to find the optimal
Opportunistic
maintenance decision variables. A 14-component system is finally introduced to illustrate the use and
Economic dependencies
Prognostic the performance of the proposed predictive maintenance policy. Different sensitivity analysis are also
Importance measure investigated and discussed. Indeed, the proposed policy provides more flexibility in maintenance
decision-making for complex structure systems, hence leading to significant profits in terms of
maintenance cost when compared with existing policies.
& 2015 Elsevier Ltd. All rights reserved.

1. Introduction decision-making process relies on the diagnostic/prognostic of the


system condition over time. It has been recently introduced and
The maintenance process includes preventive and corrective becomes nowadays an interesting approach for maintenance optimi-
actions carried out to retain a system or restore it to an operating zation [2,3,7,8,12].
condition. It is a complex process of the utmost importance especially A large number of CBM policies has been investigated, developed
for the manufacturing firms. Maintenance costs may take from 15 to and successfully applied to mono-component system, e.g., [7,12,30]
70% of total production costs [1]. Optimal maintenance policies aim to using the current equipment condition (deterioration level) and
provide optimal system reliability/availability and safety performance [15,31,8] taking the future equipment health state for making main-
at the lowest possible maintenance cost [21]. In the literature, there tenance decision. However, these policies cannot be adapted directly
are mainly two types of maintenance techniques including time-based to multi-component systems [4], in which interdependencies such as
maintenance (TBM) and condition-based maintenance (CBM). For stochastic, structural and economic dependencies may exist between
TBM maintenance, preventive maintenance decision is based on the components [25]. In fact, the stochastic dependence implies that the
system age and on the knowledge of the statistical information on the deterioration process or failure of a component may affect the lifetime
system lifetime [6,9]. As a consequence, the realistic operating distribution of other ones. The structural dependence exists if com-
conditions of the system over time are not taken into account. Unlike ponents structurally form a part, so that maintenance of a failed
TBM maintenance, CBM consists in advanced maintenance technique. component implies maintenance of working components, or at least
It is popularly reported in the literature and the maintenance dismantling them. Finally, the economic dependence involves
(i) positive economic dependence (PED) implying that joint main-
tenance of a group of components is cheaper that performing main-
Abbreviation: CBM, condition-based maintenance; CM & OM & PM, corrective and tenance on components separately, and (ii) negative economic depen-
opportunistic and preventive maintenance; MCS, minimal cut set; NED & PED, dence (NED) which exist if combining maintenance activities is more
negative and positive economic dependence; RBD, reliability block diagram; RUL,
remaining useful life
expensive than performing maintenance on components separately
n
Corresponding author. [6]. It is important to note that taking into account all these
E-mail address: phuc.do@univ-lorraine.fr (P. Do). dependencies in maintenance optimization is necessary, but makes

http://dx.doi.org/10.1016/j.ress.2015.07.017
0951-8320/& 2015 Elsevier Ltd. All rights reserved.
84 K.-A. Nguyen et al. / Reliability Engineering and System Safety 144 (2015) 83–94

Nomenclature RðT k þ 1 j x1:n;T k Þ predictive reliability of system at the next


inspection time T k þ T given the degradation level of
Notations
n components, x1:n;T k , at time Tk
RGl ðT k þ 1 j x1:n;T k Þ system predictive reliability at the next inspec-
n total number of components of considered system tion time T k þT when all components of group Gl are
i index for components of the system, with i ¼ 1; 2; …; n replaced preventively at time Tk
s(t) system state at time t C GIF ðtÞ cost-based group improvement factor computed
Gl
si(t) state of component i at time t when all components of group Gl are replaced at time
X i;t ¼ xi;t degradation level of component i measured at time t t
X 1:n;t ¼ x1:n;t degradation level of whole system measured at R0 ; Rn0 PM threshold and optimal value of R0 defined for the
time t policy (T; R0 )
ðX i;t Þt Z 0 stochastic process describing the degradation of com- Rp ; Rnp PM threshold and optimal value of Rp defined for the
ponent i over time t policy (T; Rp ; Rop )
αi ; β i shape and scale parameters of Gamma distribution for Rop ; Rnop OM threshold and optimal value of Rop defined for the
component i policy (T; Rp ; Rop )
Zi fixed failure threshold of component i Nb total number of inspection times within ½0; T rep 
cp;i ; cc;i specific PM cost and specific CM cost of component i, Ck consists of the inspection cost, the running cost and
with cc;i Z cp;i the maintenance costs of the k-th inspection cycle (i.e.
cinsp inspection cost for a component ðT k  1 ; T k , with k Z1)
cs maintenance set-up cost C 1 ðT; R0 Þ long run maintenance cost per time unit of the policy
cpsd planned shutdown cost (T; R0 )
cusd unplanned shutdown cost C 1 ðT; Rp ; Rop Þ long run maintenance cost per time unit of the
cd unplanned downtime cost rate policy (T; Rp ; Rop )
T; T n inter-inspection interval (or inspection cycle) and Trep time at which the system is completely replaced
optimal value of T I indicator function: I fg ¼ 1 if the condition fg is true
Tk k-th inspection time, with T k ¼ k T and k A N and 0 otherwise
T k ; T kþ time just before/after the inspection date Tk Ω set of all components of the system
Gl l-th group of components Policy (T; R0 ) Multi-level predictive maintenance policy
j Gl j cardinal number of the group Gl, 1 r j Gl j r n Policy (T; Rp ; Rop ) opportunistic predictive maintenance policy
Ri ðT k þ 1 j xi;T k Þ predictive reliability of component i at the next
inspection time T k þ T given its degradation level, xi;T k ,
at time Tk

the maintenance models too complicated to solve or to analyze. This is threshold. Preventive maintenance is needed if the predictive
specially true in CBM framework. In the literature most existing CBM reliability of the system is lower than the prefixed threshold.
maintenance policies for multi-component systems take advantage of  At component level: the main idea is to find an optimal
PED to minimize the average maintenance cost, see for instance grouping of several components to be preventively maintained
[2,3,13,26,16]. Nevertheless, these existing CBM policies addressing when maintenance decision at system level is triggered. Deci-
multi-component systems can be applied on several specific struc- sion making is made on the basis of a “cost-based group
tures (i.e., series, parallel, series-parallel, parallel-series, k-out-of-n). improvement factor” taking into account the predictive relia-
However, actual system structures are more complex with a large bility of components, the economic dependencies as well as the
number of components and very complex inter-connections between location of components in the system. The later is the first
components which can be described as mixtures of basic connections original contribution of the paper.
[11,28,33]. It is shown in [29] that taking into account the complex
structure in a maintenance model may lead to consider both PED and
NED dependencies. However, the maintenance policy proposed in [29] The proposed maintenance policy has two decision variables
is based on TBM technique. Recently, a CBM policy has been including inter-inspection interval and PM threshold at system level.
introduced for complex structure systems using Birnbaum importance These decision variables need to be optimized. The second contribu-
measure which can help to take into account effectively the system tion of the paper is to develop a cost model in order to find the opti-
structure in maintenance decision-making [19]. Each component is mal maintenance decision variables. In addition, Monte Carlo simula-
individually selected to be preventively maintained at discrete times tion technique is used to evaluate the maintenance cost rate.
but the maintenance grouping impacts (PED and/or NED between The rest of this paper is organized as follows. Section 2 is devoted
components) are not really highlighted. to the description of the general model of degradation and failure of a
To face this issue, in the present paper, a predictive maintenance multi-component system. The complex structure of the system is also
policy with multi-level decision-making process is proposed for investigated. Section 3 focuses on the predictive reliability/RUL at both
complex structure systems. The main idea is to jointly consider component and system level for a complex structure system. Addi-
prognostic information (i.e., predictive reliability or RUL) of compo- tionally, the cost-based group improvement factor is also introduced.
nents and system, PED and NED dependencies as well as system Section 4 describes the maintenance costs structure and the proposed
structure in the maintenance decision-making process. More precisely, multi-level predictive maintenance policy. The global cost model is
the proposed maintenance policy is divided into two levels: also developed here. To illustrate the use and advantage of the
proposed policy, a 14-component system with complex structure is
 At system level and at each inspection time, the reliability of introduced. A comparison on the performance of the proposed policy
components is firstly predicted. Based on the predictive relia- and an existing policy is investigated through different sensitivity
bility of components, the predictive reliability of the system is analyses. Finally, the present paper ends with some conclusions and
secondly determined and compared with a prefixed reliability outlines for possible future works.
K.-A. Nguyen et al. / Reliability Engineering and System Safety 144 (2015) 83–94 85

2. System degradation modeling and assumptions leads the system to failure or shutdown. Oppositely, a compo-
nent is“non-critical” if the system can be still operating when it
2.1. General degradation process description fails or stops. It is important to note that in a complex structure a
“non-critical” component may become critical in a period of
In this work, we consider a complex multi-component engineering time when other components are failed [19]. In such a case it is
system comprised of n deteriorating components. The components called a “temporary critical” component.
can be identical or not. They are inter-connected according to a  Cut set: a cut set is a set of components for which the system
complex structure/configuration with components interdependencies fails if all components in cut set fails.
described, e.g., in term of reliability block diagram (RBD) [24]. The  Minimal cut set (MCS): a cut set is said to be minimal if it cannot
configuration is denominated as “complex” because it can be a be reduced without loosing its status as a cut set.
mixture of several commonly basic connections (e.g. series, parallel,  Critical group: a group is “critical” if the system fails when all
k-out-of-n, bridge). components of the group fail. According to the concept of MCS,
Each component in the system is assumed to be subject to an a critical group contains at least one MCS. Oppositely, a group is
underlying degradation process which can cause random failures. defined as “non-critical” if the system can still operate when all
Such a degradation processes may be a physical degradation processes the components of the group have failed.
such as accumulative wear, corrosion, fatigue and crack growth which  Path set: a path set is a set of components whose operation
are major causes of wear-out failures [5]; or it may be an artificial ensures that the system is functioning.
process describing the phenomenon of the health status or the  Minimal path set: a path set is said to be minimal if it cannot be
performance of a component worsening with usage or age [32]. reduced without loosing its status as a path set.
To model the degradation process of the considered system, in this  Structure function is a binary function, denoted ϕðsðtÞÞ, which
paper, a time-dependent stochastic process is considered. It has more allows to determine the state of the system from the given state
advantageous in describing intermediate degradation states than of all components in the system. We have
rough lifetime models. To this end, the following general assumptions (
1 if the system is functioning at time t
related to degradation of component i (i ¼ 1; …; n) are given: ϕðsðtÞ ¼ ϕðs1 ; s2 ; …; sn Þ ¼
0 if the system is in a failed state at time t

 The degradation evolution of each component is gradual, with where sðtÞ ¼ ðs1 ðtÞ; s2 ðtÞ; …; sn ðtÞÞ is a state vector; si ðtÞ
stochastically independent increments over time. The degradation ði ¼ 1; …; nÞ is binary variable representing the state of the
increments, Δxi , are supposed to be stationary and non-negative. component i
 (
The components in the system are structurally dependent 1 if component i is functioning at time t
considering the viewpoint of the RBD model and economically si ðtÞ ¼
0 if component i is in a failed state at time t
dependent from the maintenance viewpoint.
 The degradation process of each component is stochastically It is important to note that si ðtÞði ¼ 1; …; nÞ are independent
independent from the others. since all components are stochastically independent.
 The condition (degradation level) of component i at time t can  Structure function of the complex structures can be obtained by
be measured and described as a scalar random aging variable using the minimal cut sets concept or the minimal path sets
X i;t . The degradation trajectory is a path of the time-dependent one
stochastic process ðX i;t Þt Z 0 . ○ Using the minimal path sets
 The initial degradation level of component i is X i;0 ¼ 0. The np
!
component i is considered initially as new. The higher X i;t , the ϕðsðtÞÞ ¼ 1  ∏ 1  ∏ si ðtÞ ð1Þ
closer component i is to failure. j¼1 i A Pj
 If X i;t ¼ xi;t exceeds a predefined critical threshold Zi, compo-
where Pj is j-th minimal path set, and np is the total number
nent i is considered as failed, even if it is still operating. Zi can
of minimal path sets of the system;
be seen as a limit degradation level defined according to
○ Using the minimal cut sets
economical, technical or security reasons. The failure of an " #
nc
item is assumed to be self-announcing and a failed component
ϕðsðtÞÞ ¼ ∏ 1  ∏ ð1  si ðtÞÞ ð2Þ
is stopped as soon as possible. j¼1 i A Cj
 The stoppage of a component does not affect the production
performance if the system is still functioning. where Cj is j-th minimal cut set, and nc is the total number of
minimal cut sets of the system.
For a complex structure system, failure of one or several compo-
nents does not necessarily lead to a failure of the system, but if the As an example, let us consider a complex structure system whose
failed components are not immediately replaced, they may con- reliability block diagram (RBD) is represented in Fig. 1. It should be
duct some other components to idle state (see [19] for a complete noted that this system is a part of a natural gas plant as described
description). During the idle state, components may deteriorate in in [11, p. 178]. The system is composed by 8 minimal path sets
a different way than in operating state. In this paper the deteriora- including P 1 ¼ f1; 3; 4; 5; 6; 7; 8; 9; 10; 11g, P 2 ¼ f2; 3; 4; 5; 6; 7; 8; 9;
tion of a component is supposed to stop as far as the component 10; 11g, P 3 ¼ f1; 3; 4; 5; 6; 14g, P 4 ¼ f2; 3; 4; 5; 6; 14g, P 5 ¼ f1; 3; 4; 13g,
remains in idle state. This phenomenon introduces dependencies P 6 ¼ f2; 3; 4; 13g, P 7 ¼ f1; 3; 12g, P 8 ¼ f2; 3; 12g and 10 minimal cut
between the components deterioration processes. sets including C 1 ¼ f1; 2g, C 2 ¼ f3g, C 3 ¼ f4; 12g, C 4 ¼ f5; 13; 12g,
C 5 ¼ f6; 13; 12g, C 6 ¼ f7; 14; 13; 12g, C 7 ¼ f8; 14; 13; 12g, C 8 ¼ f9; 14;
2.2. Complex structure: definitions and analysis 13; 12g, C 9 ¼ f10; 14; 13; 12g, C 10 ¼ f11; 14; 13; 12g.
The structure function of the system in this case is represented by
To tackle with the complexity of the system configuration the
following concepts are recalled, [22]:
 using the minimal path sets
" #
8
 Critical component: a component is called “critical” if its failure or ϕðsðtÞÞ ¼ 1  ∏ 1  ∏ si ðtÞ
stoppage, while the other components are in operating state, j¼1 i A Pj
86 K.-A. Nguyen et al. / Reliability Engineering and System Safety 144 (2015) 83–94

4 5 6 7 8 9 10 11
1

3 14
Start End
13
2
12

Fig. 1. Reliability block diagram of the 14-component system.

¼ 1  ½1 s1 ðtÞs3 ðtÞs4 ðtÞs5 ðtÞs6


[t 1 ; t 2 ] (with t 1 o t 2 ) is written as
ðtÞs7 ðtÞs8 ðtÞs9 ðtÞs10 ðtÞs11 ðtÞ
! !
½1  s2 ðtÞs3 ðtÞs4 ðtÞs5 ðtÞs6 ðtÞs7 ðtÞs8 np np

ðtÞs9 ðtÞs10 ðtÞs11 ðtÞ½1 s1 ðtÞs3 ðtÞs4 ðtÞs5 Rðt 2 j t 1 Þ ¼ 1  ∏ 1  ∏ Ri ðt 2 j t 1 Þ ¼ 1 ∏ 1  ∏ Ri ðt 2 j t 1 Þ ;


j¼1 i A Pj j¼1 i A Pj
ðtÞs6 ðtÞs14 ðtÞ½1  s2 ðtÞ
ð7Þ
s3 ðtÞs4 ðtÞs5 ðtÞs6 ðtÞs14 ðtÞ½1  s1 ðtÞs3 ðtÞs4 ðtÞs13
  ðtÞ½1  s2 ðtÞs3 ðtÞs4 ðtÞs13 ðtÞ½1  s1 ðtÞs3 ðtÞs12 ðtÞ where Ri ðt 2 j t 1 Þ is the conditional reliability of component i given it
½1  s2 ðtÞs3 ðtÞs12 ðtÞ; ð3Þ is functioning at time t1. If the deterioration level of the compo-
nent is measured at time t1 such that X i;t1 ¼ xi;t 1 , Ri ðt 2 j t 1 Þ can be
 using the minimal cut sets stated as follows:
" #
10 Ri ðt 2 j xi;t 1 Þ ¼ PðX i;t2 o Z i j X i;t 1 ¼ xi;t1 Þ: ð8Þ
ϕðsðtÞÞ ¼ ∏ 1  ∏ ð1 si ðtÞÞ
j¼1 i A Cj
If the deterioration process of component i is non decreasing,
¼ ½1  ð1  s1 ðtÞÞð1  s2 ðtÞÞ½1 ð1 s3 ðtÞÞ Ri ðt 2 j xi;t 1 Þ can then be estimated by
½1  ð1 q4 ðtÞÞð1  s12 ðtÞÞ
Z Z i  xi;t1
½1  ð1 s5 ðtÞÞð1  s12 ðtÞÞð1  s13 ðtÞÞ
Ri ðt 2 j xi;t 1 Þ ¼ PðX i;ðt2  t 1 Þ o Z i  xi;t1 Þ ¼ f i ðuÞ du: ð9Þ
½1  ð1 s6 ðtÞÞð1  s12 ðtÞÞð1  s13 ðtÞÞ 0
½1  ð1 s7 ðtÞÞð1  s12 ðtÞÞð1  s13 ðtÞÞ
Finally, the degradation-based conditional reliability of the system
ð1  s14 ðtÞÞ½1  ð1  s8 ðtÞÞð1  s12 ðtÞÞ
comprised of n components, denoted Rðt 2 j x1:n;t1 Þ, can be obtained
ð1  s13 ðtÞÞð1  s14 ðtÞÞ½1  ð1  s9 ðtÞÞ
by introducing Eq. (9) into Eq. (7).
ð1  s12 ðtÞÞð1  s13 ðtÞÞð1  s14 ðtÞÞ
At each inspection time the predictive system reliability will be
½1  ð1 s10 ðtÞÞð1  s12 ðtÞÞð1  s13 ðtÞÞ
used as an essential indicator for PM decision making for the
ð1  s14 ðtÞÞ½1  ð1  s11 ðtÞÞð1  s12 ðtÞÞ proposed predictive maintenance policy. This point will be dis-
ð1  s13 ðtÞÞð1  s14 ðtÞÞ: ð4Þ cussed in Section 4.

3.2. Cost-based group improvement factor


3. Reliability prediction and cost-based group improvement
factor In order to improve the reliability of the system, the reliability
improvement importance measure, defined as the reliability
3.1. Reliability prediction difference of system when a component is replaced by a new
one [22], seems to be an interesting indicator. The latter can help
Mathematically, for non repairable components the reliability to identify the importance ranking of components relating to the
of the system, denoted R(t), is expressed as improvement ability on the system reliability. It must however be
noted on one hand that this importance measure does not take
RðtÞ ¼ E½ϕðsðtÞÞ; into account the maintenance cost. From a practical point of view,
a high improvement ability with very high maintenance cost may
where E½ indicates the mathematical expectation. As mentioned above, not be a proper choice for maintenance optimization. A “reason-
ϕðsðtÞÞ can be determined by using the minimal path set concept or the able” improvement ability with lower maintenance cost may be
minimal cut set one. Without loss of generality, we assume that ϕðsðtÞÞ is more appropriate. On the other hand, due to economical and/or
given by Eq. (1), the reliability of the system is expressed then technical reasons, it is preferable that several components are
preventively maintained at each maintenance time [9,10]. Never-
! !
np np theless, the existing reliability improvement measure does not
RðtÞ ¼ 1  ∏ 1  ∏ Eðsi ðtÞÞ ¼ 1 ∏ 1  ∏ Ri ðtÞ ; ð5Þ take into account the simultaneous replacement impacts of several
j¼1 i A Pj j¼1 i A Pj
components together, e.g. economic dependence between compo-
nents or joint improvement impacts. Therefore, in this paper an
where Ri ðtÞ ¼ Eðsi ðtÞÞ is the reliability of component i. Since component i extension of this importance measure, namely “cost-based group
gradually deteriorates, its reliability can be expressed as follows: improvement factor” is proposed for group-based maintenance
Z Zi
decision-making.
Ri ðtÞ ¼ 1  PðX i;t Z Z i Þ ¼ PðX i;t oZ i Þ ¼ f i ðuÞ du; ð6Þ The cost-based group improvement factor for a group Gl is
0 denoted C GIF
Gl . It is defined as follows:

where f i ðÞ is the probability density function describing the deterioration RGl ðt 2 j t 1 Þ  Rðt 2 j t 1 Þ
evolution of component i. C GIF
Gl ðt 1 Þ ¼ ; ð10Þ
C p;Gl
Let us now assume that the system is known to be functioning
at time t1, the reliability of the system within the interval horizon where:
K.-A. Nguyen et al. / Reliability Engineering and System Safety 144 (2015) 83–94 87

Predict R(Tk 1 | x1:n,Tk ) with assuming that unplanned shutdown cost cusd, is incurred (generally, cusd Z cpsd ).
Moreover, when the system remains in its failure state, it is

System level
all (n - m) failed components are already replaced
necessary to pay an additional cost (unavailability cost rate) cd
for each time unit. In that way, the cost structures which applied in
No PM No
R(Tk this paper are given hereafter.
action is 1 | x1:n,Tk ) R0 ?
needed Within the paper I fg denotes the indicator function with I fg ¼ 1
Yes if fg is true; otherwise I fg ¼ 0.
Gl
Assess R (Tk 1 | x1:n,Tk ) with l 1,...,2 m-1
 When replacing running component i at time t, a PM cost C p;i , is
incurred such that
Gl is not No Gl
R (Tk 1 | x1:n,Tk ) R0 ? C p;i ¼ cp;i þ cs þ cpsd I fComponent i is critical or temporary criticalg ð11Þ
considered |fflfflfflffl{zfflfflfflffl} |fflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflffl{zfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflffl}
unitary PM cost planned shutdown cost

Component level
Yes
where cp;i is the specific PM cost; cs the maintenance set-
Gl is admissible group
up cost.
 When replacing failed component i at time t given that the
Calculate CGGIF (Tk ) for all admissible groups component has failed at time tiF (with 0 ot iF r t), a corrective
l
cost C c;i , is incurred such that

Define the optimal g roup Gk* such that C c;i ¼ cc;i þ cs þ ½ðt  T F Þcd þ cusd I fti ¼ T F g ð12Þ
|fflfflfflffl{zfflfflfflffl} |fflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflffl{zfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflffl}
F

CGGIF
* (Tk ) = max CGGIF
l
(Tk ) unitary CM cost unavailability cost
k l 1,...,|S adm |
where cc;i is the unitary specific CM cost for component i; The
unavailability cost is incurred if the system failure is due to the
Remplace preventively the optimal group Gk* failure of component i i.e. if t iF ¼ T F with TF indicating the failure
Fig. 2. Procedure of multi-level preventive maintenance decision-making. time of the system.
 Assume the system has not failed and all the components in Gl
are working but have to be preventively replaced. By replacing
preventively group Gl of elements j Gl j (1 r j Gl j r n) at the same
 RG ðt 2 j t 1 Þ indicates that the system predictive reliability calcu-
l
time t, a PM group cost C p;Gl , is incurred
lated at time t2 when all the components of the group Gl are X
C p;Gl ¼ C p;i  cs ðj Gl j  1Þ
preventively replaced at time t1. i A Gl
|fflfflfflfflfflfflfflffl{zfflfflfflfflfflfflfflffl}
 C p;Gl is the cost related to PM operation performed on the group 0
saved setupcost:PED
1
Gl and described in detail in Section 4. X
 cpsd @ I fcomponent i is critical or temporary criticalg  I fGl is a critical groupg
A
i A Gl
|fflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflffl{zfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflffl}
By definition, C GIF
Gl ðt 1 Þ
takes jointly into account various informa- savedplannedshutdowncost:PEDorNED

tion related to: the predictive reliability behavior of all components ð13Þ
of the system, the costs (e.g. PM cost of maintained components,
The second part on the right-hand side of Eq. (13) represents the
positive/negative economic dependencies), the system structure
saved set-up cost due to positive economic dependence (PED)
(degradation dependencies between components). The use of the
between components. The third part could be positive or negative.
proposed C GIF
Gl factor for selecting an optimal group of components
In fact, it is positive (i.e., PED) if the group Gl contains more than
to be preventively maintained will be described in the next section.
one critical and/or temporary critical component. If Gl is a critical
group but its components are non-critical one, the third part is
negative and represents negative economic dependence (NED). It is
4. Multi-level predictive maintenance policy
important to note that the third part does not exist in multi-
component series systems since all components of a series system
4.1. Assumptions associated with inspection, maintenance and costs
are critical one.
Inspection operation: We assume that the degradation level of
functioning components can only be known by periodic inspection 4.2. Description of multi-level predictive maintenance policy
operations at times T k ¼ kT, where T is the inter-inspection
interval and k A N. The inspection operations are assumed to be The surviving components of the system are regularly inspected at
instantaneous, perfect and non-destructive. For each functioning times T k ¼ k T (k A N). The inspection operation performed on com-
component, an inspection action incurs a cost cinsp. ponent i at time Tk returns its current condition X i;T k ¼ xi;T k . The
Maintenance operations: CM activity is applied to failed com- corrective replacement of failed components and the preventive
ponents and PM activity is carried out on running components. In replacement of running components are only carried out at inspection
this work, both maintenance activities can completely restore a times Tk. T is the first decision variable.
component to “as good as new” state and the duration related to The main intention of the proposed predictive maintenance
these activities is assumed to be negligible. We assume also that policy is to jointly take into account in the decision process the
necessary maintenance resources (such as spare parts and crews, predictive reliability of the components and the system, the
and maintenance tools) to execute the maintenance actions are system structure as well as the economic dependencies between
always available at planned inspection dates. components. For this purpose a two-level PM decision process is
A PM intervention on a specific component/group of compo- proposed at both system and component level as shown in Fig. 2.
nents may lead the system to a planned shutdown. Although the
maintenance durations are assumed to be negligible, a planned  At the system level, the reliability of the system before the next
shutdown usually incurs a planned shutdown cost cpsd. If the inspection time, RðT k þ 1 j x1:n;T k Þ, is firstly predicted considering
system has stopped due to the failure of a component, an that all failed components during ðT k  1 ; T k  have been already
88 K.-A. Nguyen et al. / Reliability Engineering and System Safety 144 (2015) 83–94

replaced. It is then compared with a reliability threshold R0 The cost-rate can be generally defined as
(0 o R0 r1) to trigger PM actions, where R0 is a decision Cðut Þ
variable. The decision rule at system level is shown in Fig. 2. C 1 ¼ lim : ð16Þ
t-1 ut
If PM action is necessary, the remaining issue is to find an
“optimal” group of components to be preventively maintained. On the basis of the renewal theory [23,27], the cost-rate in Eq.
The latter can be done at component level. (16) can be revaluated by
 At the component level, the PM decision rules are based on the E½CðH 1 Þ
cost-based group improvement factor which allows to select C1 ¼ ; ð17Þ
E½H 1 
the group of components to be maintained. If at time Tk, m
components are working, the number of possible groups is where H1 is the usage length of the first renewal cycle, i.e. of the
then 2m  1. A group is said to be admissible if the replacement first life cycle of the system. To illustrate how to compute the
of all the components of the group can upgrade the system mean maintenance cost rate, one supposes that the system is
predictive reliability above R0. Based on the set of all the completely replaced at H 1 ¼ T rep , which means that all compo-
admissible groups, denoted Sadm, an optimal group is deter- nents of the system are replaced at the same time Trep. Since, the
mined as the one having the highest value of the cost-based maintenance cost of the whole system till time Trep, CðT rep Þ, is the
group improvement factor C GIF n sum of all partial costs of N b inspection cycles within time interval
 ðT k Þ. Gk denotes the optimal P b
group that is obtained at inspection date Tk. It is then such that ½0; T rep , i.e. CðT rep Þ ¼ N C . It is noted that the time considered
k¼1 k
n o in the denominator of Eq. (17) is total operating time of the system
C GIF
Gn ðT k Þ ¼ maxm
C GIF Gl
Gl ðT k Þj R ðT k þ 1 j x1:n;T k Þ 4 R0 : ð14Þ [20]. So, the cost-rate can be rewritten as
k l ¼ 1;‥2  1
hP i
Nb
Gl
R ðT k þ 1 j x1:n;T k Þ represents the predictive reliability of the E k ¼ 1
Ck
C1 ¼ ; ð18Þ
system after PM on group Gl. Due to the existence of PED E½T rep  DðT rep Þ
between components in the system, the set-up cost cs is
assigned only once for all maintenance activities (including where DðT rep Þ is total downtime of the system during its mission
PM and CM) performed at the same time Tk. Therefore, cs is not (i.e. total unavailability time) and T rep  DðT rep Þ is the usage dura-
added to C p;Gl if there is at least one failed component j that tion until time Trep.
must be correctively replaced at Tk. T F;k is a random variable Ck is the inspection cost, the running cost and the maintenance
describing the failure time of the system in the k-th inspection costs of the k-th inspection cycle (i.e. in time interval ðT k  1 ; T k )
cycle. If the failure of the system occurs at time T F;k r T k and gathered at inspection time Tk
after inspection time T k  1 , i.e. T F;k 4 T k  1 , then cpsd is not also C k ¼ C insp þ C corr þ C prev ð19Þ
k k k
added to C p;Gl . As a result, the PM cost related to the group Gl
can be expanded from Eq. (13) as follows: where
0 1
X
@ P A
 Cinsp
k is the cost related to inspection operations on the surviv-
C p;Gl ¼ C p;i  cs j Gl j  I f I fx Z Z g ¼ 0g
i A Gl j A Ω⧹Gl
j;T k j ing components at time Tk.
|fflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflffl{zfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflffl}  Ccorr
k is the cost related to the replacement of failed components
saved setup cost: PED
2 3 at time Tk (i.e. it includes the set-up costs and the system's
X unplanned shutdown cost).
 cpsd 4 Iipsd  I fGl is a critical groupg ∏ I ft j a T F;k g
5; ð15Þ  Cprev is the cost related to the preventive replacement of the
k
i A Gl j A Ω⧹Gl F

|fflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflffl{zfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflffl} optimal group of running components Gnk at time Tk (i.e. it includes


saved planned shutdown cost: PED or NED the set-up costs and the system's planned shutdown cost).
where Ω⧹Gl is the set of all the components outside the group
Gl and Since inspection operation is only performed on the surviving
○ I fP I fx Z Z g ¼ 0g
¼ 0 if there is not any failed component components up to time Tk, thus, the inspection cost at Tk is
j A Ω⧹Gl j;T k j
calculated as
0 1
in Ω⧹Gl , 0 otherwise
X
○ ∏j A Ω⧹Gl I ft j a T g ¼ 1 if the system does not fail in the k-th insp @
C k ¼ cinsp n  I fxj;T Z Z j g A: ð20Þ
F F;k k
j A Ω⧹Gnk
inspection cycle is due to the failure of a component from
For the corrective replacement cost Ccorr k , if there is no failed
Ω⧹Gl , 0 otherwise.
component between (T k  1 ; T k ], then C corr
k ¼ 0. Otherwise, on the
basis of Eq. (12), Ccorr is calculated by
Finally, the factor C GIF
Gl ðT k Þ can be obtained by including Eq. (15) X
k

into Eq. (10). C corr


k ¼ ðcc;j þcs ÞI fxj;T Z Z j g
k
The maintenance policy described previously will be hereafter j A Ω⧹Gnk
referred as “policy ðT; R0 Þ”. It is important to note that the inter- 0 1
X  
inspection interval T and the PM threshold R0 are two decision  cs @ I fxj;T A þ ðT k  T F;k Þcd þcusd I j
Z Zj g  1 ft F ¼ T F;k g
;
k
variables supporting the model. They have to be sought in order to j A Ω⧹Gk n |fflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflffl{zfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflffl}
reach the optimum value of the objective function which is repre- |fflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflffl{zfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflffl} unavailability cost in ðT k  1 ;T k 
saved setup cost at T k : PED
sented by the long run maintenance cost per unit time. Therefore the
ð21Þ
next subsection is devoted to the cost model development.
In the same manner, the PM cost Cprev k is determined as follows:
4.3. Optimization for the proposed maintenance policy if no PM action is required at time Tk, then C prev k
¼ 0. Otherwise, on
the basis of Eq. (11), Cprev k is computed as
We now develop a mathematical expression of long run X  
C prev ¼ C p;i  cs j Gnk j I fNo failed component at T k g
maintenance cost per unit time (or cost-rate). Let Cðut Þ be the k
|fflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflffl{zfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflffl}
i A Gn k
cumulative cost function of the whole system till usage ut at time t. saved setup cost at T k : PED
K.-A. Nguyen et al. / Reliability Engineering and System Safety 144 (2015) 83–94 89

2 3
X   5. Numerical example
cpsd 4 Iipsd  1  I ft j ¼ T F;k g
I fGnk is a critical groupg
5: ð22Þ
F
i A Gnk In this section, we show how the proposed predictive main-
|fflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflffl{zfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflffl}
saved planned shutdown cost at T k : PED or NED tenance policy can be used in PM optimization of complex systems
through a 14-component system whose structure is sketched in
Finally, from Eqs. (18), (20)–(22), the cost-rate of the system Fig. 1. This structure could have been any combination of series,
can be calculated as follows: parallel, series-parallel and parallel-series configuration, or even
1 more generally any structure for which the system reliability can
C 1 ðT; R0 Þ ¼ h P b i
E T rep  N ðT  T F;k Þ be computed from the components reliability.
k¼1 k
8 0 0 1 In order to highlight the effectiveness of the proposed multi-
<X Nb X X level maintenance policy (i.e. policy ðT; R0 Þ), a comparison is
E @cinsp @n  I fxj;T Z Z j g A þ C p;i
: k proposed with another opportunistic maintenance policy. The
k¼1 j A Ω⧹Gnk i A Gnk
0 1 policy (T; Rp ; Rop ), which has been recently introduced for series
X X structure systems in [26] and for k-out-of-n systems in [14] is
þ ðcc;j þ cs ÞI fxj;T Z Z j g  cs @ I fxj;T Z Z j g þ j Gk j  1A
n

j A Ω⧹Gnk
k
j A Ω⧹Gnk
k
considered. The performance for the two maintenance policies is
|fflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflffl{zfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflffl} numerically compared for the 14-component system. The policy
saved setup cost at T k : PED
(T; Rp ; Rop ) has three decision variables: the inter-inspection inter-
2 3
X val T, a PM threshold Rp fixed for all components of the system,
 cpsd 4 Iipsd  ð1  I ft j ¼ T g ÞI fGnk is a critical groupg 5 and an opportunistic maintenance (OM) threshold Rop (0 r Rp o
F F;k
i A Gnk
|fflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflffl{zfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflffl}
Rop r 1). The description of this maintenance policy is reported in
saved planned shutdown cost at T k : PED or NED Appendices A and B. It is herein implemented and applied to the
1 14-component system.
  C To compare the performance of the two policies, a relative cost-
þ ðT k  T F;k Þcd þ cusd I ftj ¼ T g C g: ð23Þ
F;k A gain in the optimal cost-rate of the proposed policy (T; R0 )
|fflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflffl{zfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflffl}
F

unavailability cost in ðT k  1 ;T k  compared to the policy (T; Rp ; Rop ) is used as a criterion. It is


denoted CG and defined as follows:
Stochastic Monte Carlo simulation is used to evaluate this cost
criterion. For each value of the decision variables fT; R0 g, the C 1 ðT n ; Rnp ; Rnop Þ  C 1 ðT nTR ; Rn0 Þ
CG ¼  100 ð25Þ
corresponding cost-rate is calculated. To ensure that the conver- C 1 ðT n ; Rnp ; Rnop Þ
gence of cost-rate is reached, a large number of simulation
realizations must be done. where Tn, Rnp and Rnop (respectively T nTR and Rn0 ) are the optimal
By considering different values of fT; R0 g on a mesh grid, the decision variables of the policy (T; Rp ; Rop ) (respectively (T; R0 )).
minimum cost-rate can be obtained. The latter addresses the According to the definition, CG 4 0 means that the policy (T; R0 ) is
optimal value of the decision variables fT; R0 g, i.e., more effective than the policy (T; Rp ; Rop ) and less effective else.
1
In this section the degradation behavior of each component i is
C 1 ðT n ; Rn0 Þ ¼ min C ðT; R0 Þ : ð24Þ
T 4 0;R0 A ð0;1 assumed to be described by homogeneous gamma stochastic
processes ðX i;t Þt Z 0 (i ¼ 1; …; 14). The degradation increment
Table 1 between two times Tk and T k þ 1 follows a gamma probability
Data of a 14-component system. density function (pdf) with a shape parameter αi ðT k þ 1  T k Þ A R þ n
and a scale parameter β i A R þ n as shown in Appendix C. In this
cinsp ¼ 5, cs ¼ 30, cd ¼325, cusd ¼ cpsd ¼ 160
way, the conditional reliability of component i is predicted by
Component αi βi Zi cp;i cc;i Component αi βi Zi cp;i cc;i using Eq. (8)

1 0.5 1 30 130 195 8 1.1 1.5 35 140 210


Ri ðT k þ 1 j xi;T k Þ ¼ 1  PðX i;T k þ 1  T k ZZ i  xi;T k Þ
2 0.5 1 30 130 195 9 1.4 1.7 42 130 195 Z þ1
3 0.4 1.5 32 250 375 10 0.85 1.2 38 100 150 ¼ 1 f αi ðT k þ 1  T k Þ;βi ðxÞ dx
4 0.55 0.9 34 180 270 11 1 1 40 100 150 Z i  xi;T
k
5 0.6 1.1 27 150 275 12 0.65 1.4 28 200 300  
6 0.7 1.1 27 150 275 13 0.8 1.5 31 210 315 Γ αi ðT k þ 1 T k Þ; βi ðZ i  xi;T k Þ
¼ 1   ð26Þ
7 0.3 1.2 20 120 180 14 0.7 1.25 30 180 270 Γ αi ðT k þ 1  T k Þ

Fig. 3. Maintenance cost rate as a function of iterations numbers. (a) Policy (T; R0 ). (b) Policy (T; Rp ; Rop ).
90 K.-A. Nguyen et al. / Reliability Engineering and System Safety 144 (2015) 83–94

and the predictive reliability of the system RðT k þ 1 j x1:14;T k Þ can be The convergence of the mean maintenance cost with respect to
obtained from Eqs. (7) and (26). Table 1 reports the data of the 14- the number of iterations is illustrated in Fig. 3.
component system. Where, Fig. 3(a) shows that the convergence of the optimal cost-rate
(i.e. C 1 ðT n ; Rn0 Þ) of the policy (T; R0 ) is stable from Nb Z 18; 000
5.1. Optimization of the proposed policy and the policy (T; Rp ; Rop ) interactions, while that of the policy (T; Rp ; Rop ), in Fig. 3(b), is stable
from Nb Z 26; 000 interactions. This is because the policy (T; R0 ) is quite
cost-rate of the proposed policy (T; R0 ) and the policy simple to be implemented since only two decision variables T and R0
(T; Rp ; Rop ) are evaluated with different values of fT; R0 g and have to be considered while the policy ðT; Rp ; Rop Þ requires the optimiza-
fT; Rp ; Rop g using Eqs. (23) and (27) respectively. The obtained tion of three decision variables. Thus, the convergence of the policy
results are the following: ðT; Rp ; Rop Þ is slower than that of the proposed maintenance policy.
According to the obtained results, the cost-gain CG is about
 For the policy (T; R0 ): the minimum cost-rate C 1 ðT n ; Rn0 Þ ¼ 29.34%. This means that the proposed maintenance policy (T; R0 ) is
35:7216 has been obtained with the optimal inter-inspection significantly more profitable than the policy (T; Rp ; Rop ). This can be
interval T n ¼ 43, the optimal PM threshold Rn0 ¼ 0:77. explained by the fact that the proposed maintenance policy takes
 For the policy (T; Rp ; Rop ): the minimum cost-rate C 1 ðT n ; Rnp ; into account the system structure and grouping effects, which can
Rnop Þ ¼ 50:5536 corresponds to the optimal inter-inspection lead to a reduction on maintenance cost.
interval T n ¼ 53, the optimal PM threshold Rnp ¼ 0:43 and the For the policy ðT; R0 Þ, the cost-rate with different values of T and
optimal OM threshold Rnop ¼ 0:83. R0 is sketched in Fig. 4. It is clear that the cost-rate surface is
convexly shaped and the global optimal solution exists. Addition-
ally, the maintenance impacts on the system reliability is illustrated
in Fig. 5 in the optimal case T n ¼ 43 and Rn0 ¼ 0:77. The results show
that whenever the system reliability drops below the threshold R0
(at inspection time T k ), the system improvement is triggered. Then
an optimal group of components Gnk ) is selected to be replaced
preventively. As a result, the system predictive reliability after
maintenance is raised to values higher than the threshold R0.
A short comparison of the two maintenance policies (T; Rp ; Rop )
and (T; R0 ) will be proposed for a series structure in Section 5.3.
The cost-gain between two policies may depend on the system
parameters and mainly on unitary costs such as inspection, set-up,
planned/unplanned shutdown costs as well as unplanned down-
time cost rate. The next Subsection is devoted to different
sensitivity analyses for the two maintenance policies.

Fig. 4. Cost-rate of the proposed maintenance policy as a function of T and R0.


5.2. Sensitivity analyses of two maintenance policies

In order to observe the trend of the cost-gain between the two


maintenance policies, sensitivity analysis with respect to the cost
parameters is herein investigated. The performance of the proposed
policy are compared to that of the policy (T; Rp ; Rop ). The sensitivity
analysis is performed separately on each considered cost parameter
by varying it when the others remain fixed. The obtained numerical
results are shown in Fig. 6 for the set-up cost and the planned
shutdown cost and Fig. 7 for the inspection cost and the downtime
cost rate. For these figures, each point on the cost-gain curve is
Fig. 5. System predictive reliability relation before and after improvement
obtained from the respective optimal maintenance cost rates of the
operations. two policies. The sensitivity analysis is summarized as follows:

Fig. 6. Sensitivity analysis to: (a) set-up cost cs and (b) planned shutdown cost cpsd.
K.-A. Nguyen et al. / Reliability Engineering and System Safety 144 (2015) 83–94 91

 Fig. 6(a) is obtained by varying the set-up cost cs from 0 to 90 5.2.2. Sensitivity analysis to the inspection cost cinsp and the
with increments of 15 cost units and assigning the other downtime cost rate cd
parameters of the system as in Table 1. Fig. 7 shows the impact of the inspection cost and the down-
 Fig. 6(b) is obtained by varying the planned shutdown cost cpsd from time cost rate that has an opposite effect to that of the set-up cost
0 to 640 with a step of 80 cost units (noted herein that cusd ¼ cpsd ) and the planned shutdown cost. The cost-gain is a decreasing
and assigning the other parameters of the system as in Table 1. function of cinsp and of cd, respectively.
 Fig. 7(a) is obtained by varying the inspection cost cinsp from Sensitivity analysis to the inspection cost cinsp: The results shown
1 to 19 in increment of 2 cost units and assigning the other in Fig. 7(a) can be explained by the inspection cost. As it increases,
parameters of the system as in Table 1. the optimal inter-inspection interval of both policies is set to
 Fig. 7(b) is obtained by varying the downtime cost rate cd from higher values to avoid frequent inspections. The results obtained
0 to 1000 with a step of 100 cost units and assigning the other after optimization of the two policies show that the optimal inter-
parameters of the system as in Table 1. inspection interval of the policy ðT; Rp ; Rop Þ (e.g. T n ¼ 53) is always
much longer that of the policy ðT; R0 Þ (e.g. T n ¼ 43). The number of
inspection times of the policy ðT; R0 Þ is greater than that of the
policy ðT; Rp ; Rop Þ during the system mission. As a consequence the
5.2.1. Sensitivity analysis to the set-up cost cs and the planned maintenance cost of the policy ðT; R0 Þ increases faster and
shutdown cost cpsd as a result the cost-gain function is a decreasing one as shown
As shown in Fig. 6, both cost-gains increase as the set-up cost in Fig. 7(a).
and the planned shutdown cost increase, respectively. Sensitivity analysis to the downtime cost rate cd: The results
Sensitivity analysis to the set-up cost cs: Fig. 6(a) shows the influence obtained about the sensitivity analysis of the downtime cost rate
of the set-up cost on the maintenance cost of both policies. When the cd show that the two maintenance costs increase sharply with cd.
set-up cost increases, the cost-rate of the two policies increases. When cd increases, the PM thresholds (i.e. Rp, Rop, R0) rise and the
However, the maintenance cost of the policy ðT; Rp ; Rop Þ increases inter-inspection intervals decrease for both policies. This means
faster than that of the policy ðT; R0 Þ. In other words, the higher cs, the that the components of the system are replaced earlier to prevent
higher cost-gain is. Indeed the policy ðT; R0 Þ can avoid unnecessary penalty costs due to the system failure. The cost-gain in this case is
operations on the system thanks to the decision at system level. What a decreasing function of cd as shown in Fig. 7(b). As mentioned
is more it has more opportunities than to replace preventively some previously, this can be explained by the fact that the policy ðT; R0 Þ
groups of components in order to save set-up-costs (due to PED). This tends to keep failed components (i.e. less critical components) till
proves the effectiveness of the grouping policy of the proposed ðT; R0 Þ inspection time is greater. The system sustainability is not ensured
maintenance decision rule. as with the policy ðT; Rp ; Rop Þ. Consequently, the increasing of
Sensitivity analysis to the planned shutdown cost cpsd: Fig. 6 C 1 ðT; R0 Þ is faster than that of C 1 ðT; Rp ; Rop Þ as cd increases.
(b) shows the influence of the planned shutdown cost on the
maintenance cost of both policies. The impact of cpsd is similar to
that of cs. Because cpsd is taken into account to select the optimal
group of components for PM through the cost-based improvement 5.3. Performance evaluation of the two policies performed on a
factor. This allows the policy ðT; R0 Þ to avoid actively unnecessary subsystem consisting of multi-component in series
shutdowns (or reducing the NED) due to the PM intervention on
system MCSs. As a result, the policy ðT; R0 Þ is more profitable than In order to study the influence of the system structure on the
the policy ðT; Rp ; Rop Þ when cpsd increases. Indeed, one can see both policies, a series subsystem of 5 components (i.e. components
again on Fig. 6(b) that the cost-gain increases rapidly with cpsd. 7, 8, 9, 10, 11) is extracted from the given system in Fig. 1. Its
This is an outstanding advantage of the proposed policy ðT; R0 Þ production loss is lower than the whole complex system. In this
when the planned shutdown cost cpsd is expensive. subsection, the following considered cost parameters are lower
Through the two previous cases (i.e. the influence of the set-up than previously: cd ¼90, cusd ¼ cpsd ¼ 45. The other parameters are
cost and of the planned shutdown cost on both maintenance unchanged as given in Table 1. The optimization process and the
policies) the proposed policy ðT; R0 Þ is shown to better drive its sensitivity analysis are similar as in Sections 5.1 and 5.2.
decision variables in order to exploit the interactions among Optimization of the two policies: The cost-rate of the policy
components (by enhancing the PED and reducing the NED among (T; R0 ) and (T; Rp ; Rop ) are respectively evaluated with different
components) better than other policies for multi-component values of fT; R0 g and fT; Rp ; Rop g using Eqs. (23) and (27). The
systems with the complex structure. obtained results are the following:

Fig. 7. Sensitivity analysis to: (a) inspection cost cinsp and (b) downtime cost rate cd.
92 K.-A. Nguyen et al. / Reliability Engineering and System Safety 144 (2015) 83–94

 For the policy (T; R0 ): the minimum cost-rate is (T; Rp ; Rop ) in the case of series system. This can be explained
C 1 ðT n ; Rn0 Þ ¼ 20:551. The optimal inter-inspection interval is because a series multi-component system is never stronger than
T n ¼ 34, the optimal PM threshold is Rn0 ¼ 0:116. the weakest of its link/node. Thus, one should not invest too much
 For the policy (T; Rp ; Rop ): the minimum cost-rate is maintenance resource in order to improve the reliability of most of
C 1 ðT n ; Rnp ; Rnop Þ ¼ 20:458. The optimal inter-inspection interval components in the system if only one component remains at a very
is T n ¼ 34, the optimal PM threshold and the optimal OM low reliability level. The policy (T; Rp ; Rop ) makes maintenance
threshold are respectively Rnp ¼ 0:55 and Rnop ¼ 0:85. decisions at component level in combination with opportunistic
maintenance. All the components of system are considered at the
The obtained cost-gain in this case is about  0.454% which shows same priority level for the reliability improvement. It is more
the policy (T; R0 ) seems to be less profitable than the policy appropriate for series structures than policy (T; R0 ). Finally, for series
(T; Rp ; Rop ). The influences of cost parameters of the system (i.e. multi-component systems, using the cost-based group improvement
inspection cost, set-up cost, planned shutdown cost as well as factor for maintenance decision-making at system level is useless.
unplanned downtime cost rate) on the cost-gain are investigated The maintenance policy to consider should decide maintenance
hereafter through the different sensitivity analysis. actions at component level and combine with the opportunistic
Sensitivity analysis to the maintenance costs: The sensitivity maintenance. This conclusion was also drawn by Li [17].
analysis for each considered cost parameter is performed by varying Let us summarize the comparison of the two policies from the
it while the other remain unchanged. The obtained numerical numerical results that have been obtained from various parameter
results of the four parameters are shown in Fig. 8 in which: sets. In the case of series systems the cost-gain CG is slightly
negative (around  0.5%) for all the tested cases. In contrast, in the
 Fig. 8(a) is obtained by varying the set-up cost cs from 0 to 90 in general case of complex structures the cost-gain is highly positive
increment of 15 cost units and assigning cinsp ¼ 5, cd ¼90, (more than 20%) in the vast majority of tested cases. The policy
cusd ¼ cpsd ¼ 45; ðT; R0 Þ, based on PM decision-making at system level has proved
 Fig. 8(b) is obtained by varying the planned shutdown cost cpsd its effectiveness in incorporating information about the system
from 0 to 180 with a step of 22.5 cost units (noted herein that structure into the decision process. The grouping policy is efficient
cusd ¼ cpsd ); from the PED viewpoint and avoids actively unnecessary system
 Fig. 8(c) is obtained by varying the inspection cost cinsp from shutdowns from the NED viewpoint. The higher cs and cpsd is, the
1 to 19 in increment of 2 cost units and assigning cs ¼30, more the policy ðT; R0 Þ is effective. The components selection for
cd ¼90, cusd ¼ cpsd ¼ 45; PM activities is not only performed on the basis of their reliability,
 Fig. 8(d) is obtained by varying the downtime cost rate cd from but also on their incurred costs with respect to the ability to
0 to 300 with a step of 30 cost units assigning cinsp ¼ 5, cs ¼30, improve the system reliability. This allows the policy ðT; R0 Þ to
cusd ¼ cpsd ¼ 45. preventively replace more important components while retaining
less important components (i.e. for example components with low
All the other parameters are assigned as in Table 1. reliability improvement ability but hight replacement cost). So, the
Fig. 8 shows almost all the cost-gains are negative. This confirms policy ðT; R0 Þ allows more savings in maintenance cost compared
again that the policy (T; R0 ) is slightly less profitable than the policy to the policy ðT; Rp ; Rop Þ, mainly if cinsp and cd are low.

Fig. 8. Sensitivity analysis to (a) inspection cost cinsp, (b) set-up cost cs, (c) downtime cost rate cd, (d) planned shutdown cost cpsd.
K.-A. Nguyen et al. / Reliability Engineering and System Safety 144 (2015) 83–94 93

It is important to note that the proposed maintenance policy is represented in the policy by a PM threshold Rp, 0 r Rp r 1. In
quite simple to be implemented since only two decision variables addition, in order to increase the effectiveness of grouping main-
T and R0 have to be considered while the policy ðT; Rp ; Rop Þ requires tenance operation, the opportunistic maintenance (OM) activities
the optimization of three decision variables. are handled through an OM threshold Rop. The PM actions have to
be implemented on more deteriorated components than the OM
ones, thus it is reasonable to assume that 0 r Rp o Rop r1. The NED
6. Conclusions is considered that if the grouping of PM activities lead the system
to a shutdown, a penalty cost cpsd is incurred. To do this end, at
In this work, a novel predictive maintenance policy with multi- inspection time Tk, the current state of surviving component i
level decision approach is proposed for multi-component systems (i ¼ 1; …; n) is firstly considered. The decision rules are placed for
with complex structure. Predictive reliability, positive and nega- component i is as follows:
tive dependencies as well as the structure of the system are taken
into account in the decision-making process. This process is  If component i has been failed, it has to be correctively
composed of two levels: a system level and a component one. replaced.
The first level aims at triggering maintenance interventions, while  If component i is surviving, but Ri ðT k þ 1 j xi;T Þ r Rp , component i k
the second level addresses the optimal selection of a group of is then preventively replaced.
several components to be preventively maintained using the cost-  If the system has been failed or if the system is functioning,
based group improvement factor. Thanks to the two level decision but the PM on the selected components lead the system to a
process, the proposed maintenance policy is more flexible and shutdown. This is considered as an opportunity to main-
efficient for complex structure system when compared with an tain other functioning ones. More precisely, component j
existing effective maintenance policy. In the cases of series (j a i) will also be opportunistically maintained if
structure systems, some numerical results show that the perfor- Rp o Rj ðT k þ 1 j xj;T k Þ r Rop .
mance of the proposed maintenance policy is slightly lower than  If all above decision rules above are not met, no maintenance
the existing one. Nevertheless the number of decision variables of action is then carried out on the component i at time Tk.
the proposed policy is lower. This point can be considered as an
advantage for the optimization process. Monte-Carlo simulation
technique is used to evaluate the maintenance cost-rate. This
method is robust but the computing time can intensively increase Appendix B. Cost model of the policy ðT; Rp ; Rop Þ
when the number of components is high.
This paper is the development of our research in the framework Let us denote by Gk the group of selected running components
of predictive maintenance approaches for multi-component sys- for the preventive replacement at time Tk. The same manner as
tems presented partially in [18]. Our future works will focus on the presented in Section 4.3, the long-run maintenance cost rate of the
development of the proposed maintenance policy with considera- system under the policy ðT; Rp ; Rop Þ can be formulated as follows:
tion of uncertainty on the deterioration parameters of components.

8 0 0 1
1 <X Nb X
1
C ðT; Rp ; Rop Þ ¼ h iE @cinsp @n  I fxi;T Z Z i g A
P b
E T rep  N ðT  T F;k Þ :k ¼ 1
k
k¼1 k i A Ω⧹Gk
X X X
þ ðcc;i þ cs ÞI fxi;T Z Z i g : þ C p;i I fRi ðT k þ 1 j xi;T Þ r Rp g þ ðcp;j þ cs ÞI fRp o Rj ðT k þ 1 j xj;T Þ r Rop g I fsystem is shutdowng
k k k
i A Ω⧹Gk i A Gk j A Gk ;j a i
0 1
X X X
 cs @ I fxi;T Z Zi g þ I fRi ðT k þ 1 j xi;T Þ r Rp g þ I fRp o Ri ðT k þ 1 j xi;T Þ r Rop g I fsystem is shutdowng 1
A
k k k
i A Ω⧹Gk i A Gk j A Gk ;j a i
|fflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflffl{zfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflffl}
saved setup cost at T k : PED
1
2 3 C
X   C
C
 cpsd 4 Iipsd  ð1  I fti ¼ T F;k g ÞI fG is a critical groupg
5 þ ðT k  T F;k Þcd þ cusd I i
ft F ¼ T F;k g Cg; ð27Þ
|fflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflffl{zfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflffl} C
F k
i A Gk A
|fflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflffl{zfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflffl} unplanned shutdown cost in ðT k  1 ;T k 
saved planned shutdown cost at T k : PED or NED

Actually these parameters are frequently estimated with little data


available and it may be difficult to ensure the estimators consis- The optimal decision variables fT n ; Rnp ; Rnop g of the policy
tency. Furthermore, we will focus also on mathematical develop- ðT; Rp ; Rop Þ can be obtained by minimizing the mean maintenance
ment for an analytical calculation of the maintenance cost-rate in cost rate function C 1 ðT; Rp ; Rop Þ, i.e.,
order to reduce the computing time. 1

C 1 ðT n ; Rnp ; Rnop Þ ¼ min C ðT; Rp ; Rop Þ : ð28Þ


T 4 0;0 r Rp o Rop r 1

Appendix A. Maintenance decision rules of the policy


ðT; Rp ; Rop Þ Appendix C. The pdf function is expressed as
1
The policy ðT; Rp ; Rop Þ in this study is extended from the f αi ðT k þ 1  T k Þ;βi ðxÞ ¼ βαi ðT k þ 1  T k Þ xαi ðT k þ 1  T k Þ  1 e  βi x I fx Z 0g ; ð29Þ
Γ ½αi ðT k þ 1  T k Þ i
ðΔT; Rp ; Ru Þ policy reported in [14] in which the NED and structural
R þ1
dependency between components of the system are not taken into where Γ ðtÞ ¼ 0 ut  1 expð  uÞ du denotes the Euler gamma
account in maintenance decision-making. The PM activities are function. The couple of parameters (αi ; βi ) is supposed to be given
94 K.-A. Nguyen et al. / Reliability Engineering and System Safety 144 (2015) 83–94

but can be estimated from monitoring data. The mean degradation information levels on the components state. In: Prognostic and system health
rate and the variance are given by αi =βi and αi =βi , respectively.
2 management conference, Milan, Italy; 2013.
[17] Li L, Ambani S, Ni J. Plant-level maintenance decision support system for
throughput improvement. Int J Prod Res 2009;47(24):7047–61.
References [18] Nguyen KA, Do P, Grall A. Predictive grouping maintenance strategy for
complex structure systems using importance measure. In: International
conference on quality, reliability, risk, maintenance, and safety engineering
[1] Bevilacqua M, Braglia M. The analytic hierarchy process applied to main-
(QR2MSE) 2013, international conference on quality, reliability, risk, main-
tenance strategy selection. Reliab Eng Syst Saf 2000;70:71–83.
[2] Bouvard K, Artus S, Berenguer C, Cocquempot V. Condition-based dynamic tenance, and safety engineering (QR2MSE) 2013, Emeishan, Sichuan, China,
maintenance operations planning & grouping. application to commercial IEEE, Piscataway, New Jersey; 2013. p. 582–88.
heavy vehicles. Reliab Eng Syst Saf 2011;96(6):601–10. [19] Nguyen K-A, Do P, Grall A. Condition-based maintenance for multi-component
[3] Castanier B, Grall A, Bérenguer C. A condition-based maintenance policy with systems using importance measure and predictive information. Int J Syst Sci:
non-periodic inspections for a two-unit series system. Reliab Eng Syst Saf Oper Logist 2014;1(4):228–45.
2005;87(1):109–20. [20] Nguyen K-A, Do P, Grall A. A joint predictive maintenance and spare parts
[4] Cho DI, Parlar M. A survey of maintenance models for multi-unit systems. Eur J provisioning policy for multi-component systems using RUL prediction and
Oper Res 1991;51(1):1–23. importance measure. In: 2nd european conference of the prognostics and
[5] Dasgupta A, Pecht M. Material failure mechanisms and damage models. IEEE health management society, PHME 2014, vol. 5, no. 14; 2014. p. 500–11.
Trans Reliab 1991;40(5):531–6. [21] Pham H, Wang H. Imperfect maintenance. Eur J Oper Res 1996;94(3):425–38.
[6] Dekker R, Wildeman RE, van der Duyn Schouten FA. A review of multi- [22] Rausand M, Høyland A. System reliability theory: models, statistical methods
component maintenance models with economic dependence. Math Methods and applications. 2nd ed.John Wiley & Sons, Hoboken, New Jersey; 2004.
Oper Res 1997;45(3):411–35. [23] Ross S. Stochastic processes. John Wiley & Sons, Inc., Chichester, New York;
[7] Dieulle L, Bérenguer C, Grall A, Roussignol M. Sequential condition-based main- 1996.
tenance scheduling for a deteriorating system. Eur J Oper Res 2003;150(2):451–61. [24] Shaikh A, Mettas A. Application of reliability, availability, and maintainability
[8] Do P, Voisin A, Levrat E, Iung B. A proactive condition-based maintenance simulation to process industries: a case study. In: Simulation methods for
strategy with both perfect and imperfect maintenance actions. Reliab Eng Syst reliability and availability of complex systems. Springer, London; 2010.
Saf 2015;133:22–32. [25] Thomas L. A survey of maintenance and replacement models for maintain-
[9] Do Van P, Barros A, Bérenguer C, Bouvard K, Brissaud F. Dynamic grouping ability and reliability of multi-item systems. Reliab Eng 1986;16(4):297–309.
maintenance strategy with time limited opportunities. Reliab Eng Syst Saf [26] Tian Z, Liao H. Condition based maintenance optimization for multi-
2013;120:51–9. component systems using proportional hazards model. Reliab Eng Syst Saf
[10] Do Van P, Vu H, Barros A, Berenguer C. Grouping maintenance strategy with
2011;96(5):581–9.
availability constraint under limited repairmen. In: The 8th IFAC international
[27] Tijms HC. A first course in stochastic models. New York: John Wiley & Sons;
symposium on fault detection, supervision and safety for technical processes,
2003.
SAFEPROCESS-2012, August 2012, Mexico city, Mexico; 2012. p. 486–91.
[28] VU HC, DO P, Barros A. Bérenguer C. Maintenance grouping strategy for multi-
[11] Faulin J, Juan AA, Martorell S, Ramírez-Márquez J-E. Simulation methods for
component systems with dynamic contexts. Reliab Eng Sys Saf 2014;132:
reliability and availability of complex systems. Springer-Verlag, London; 2010.
[12] Grall A, Dieulle L, Bérenguer C, Roussignol M. Continuous-time predictive-main- 233–49.
tenance scheduling for a deteriorating system. IEEE Trans Reliab 2002;51(2):141–50. [29] VU HC, DO P, Barros A, Bérenguer C. Maintenance planning and dynamic
[13] Horenbeek AV, Pintelon L. A dynamic predictive maintenance policy for grouping for multi-component systems with positive and negative depen-
complex multi-component systems. Reliab Eng Syst Saf 2013;120:39–50. dencies. IMA j Manag Math 2015;23(2):145–70.
[14] Huynh KT, Barros A, Berénguer C. Multi-level decision-making for the [30] Wang H. A survey of maintenance policies of deteriorating systems. Eur J Oper
predictive maintenance of k-out-of-n: F deteriorating systems. IEEE Trans Res 2002;139(3):469–89.
Reliab, 64(1), 2015, 94–117. [31] Wang W, Zhang W. A model to predict the residual life of aircraft engines
[15] Huynh KT, Castro IT, Barros A, Bérenguer C. On the use of mean residual life as based upon oil analysis data. Naval Res Logist (NRL) 2005;52(3):276–84.
a condition index for condition-based maintenance decision-making. IEEE [32] Xu Z, Ji Y, Zhou D. A new real-time reliability prediction method for dynamic
Trans Syst Man Cybern: Syst 2014;44(7):877–93. systems based on-line fault prediction. IEEE Trans Reliab 2009;58(3):523–38.
[16] Lesobre R, Bouvard K, Berenguer C, Barros A, Cocquempot V. A maintenance [33] Yun WY, Moon I, Kim G. Simulation-based maintenance support system for
free operating period policy for a multi-component system with different multi-functional complex systems. Prod Plan Control 2008;19(4):365–78.

Вам также может понравиться