Chapter - 3 System Reliability Models and Redundancy Techniques in System Design
Chapter - 3 System Reliability Models and Redundancy Techniques in System Design
Chapter - 3 System Reliability Models and Redundancy Techniques in System Design
CHAPTER – 3
3.10 Conclusions 92
48
CHAPTER - 3
which they are set inside the system have direct effect on system
breakdown of the whole system in few cases and may not in others,
others do not. For example, electric bulbs and switches have well
output voltage of a stabilizer may lie inside the limits of V - and V +.
When the output voltage crosses these limits only, the device is
has revealed that based upon the nature of failure, the failures can be
parts, poor fit, poor insulation, bad assembly etc. This period is
“The failures which occur after initial failures, due to the sharp
With the passage of time, the units get worn out and begin to
the performance of the product and when the parameters exceed the
limits of tolerance, the product fails. This area is called the wear-out
region where the failure rate increases and the prediction of wear-out
period t2 – t1. After time ‘t2’, the product is changed with a pre-tested
product.
Infant
Mortality Wear out
Failure Rate
Useful life
T1 Time T2
A curve is plotted with the failure rate on the y-axis and with
the product life on the x-axis. The life can be in cycles, actuations,
years, hours, minutes, or any other units of time or use, which are
failure rate. From this plot, it can be shown that many products begin
their lives with a higher failure rate due to poor workmanship, poor
stabilizes to more or less constant rate where the failures observed are
a chance failure which is known as useful life region. With usage, the
51
products experience more wear and tear, the failure rate begins to
grow and the products failure begins to occur related to wear-out. The
mortality rate is more during the first year or so of life, then drops to
a low constant level during teenage and early adult life and then rises
rather the result of a few components with gross defects and the lack
system can be many. Some are known and others are unknown due
7. Human errors
available.
stress models.
fail. Let us assume that Ns(t) denote the number of surviving products
53
and Nf(t ) indicate the products that have failed after time t . Each
Then
N S (t ) N (t )
Rt 1 f ….. (3.2)
N N
failure is defined as
dR(t ) 1 dN S (t )
dt N dt
dR (t ) N (t ) z (t )
S R(t ) z (t )
dt N
Then,
1 dR (t )
z (t ) …(3.4)
R (t ) dt
t
z ( x)dx logR t
0
or
t
R(t ) exp z ( x)dx ….. (3.5)
0
then,
54
t
F (t ) 1 exp z ( x) dx
0
dF (t )
f (t )
dt
t
= z (t ) exp z ( x)dx
0
z (t ) R (t )
Therefore,
f (t ) f (t )
z (t ) …. (3.6)
R(t ) 1 F (t )
MTTF assumes that the product cannot resume any of its normal
failure rate”. MTTF is properly used only for components that can be
important rather than the complete failure details. MTTF for all the
55
on the population of N items with failure times t1, t2,……..tn then the
1 n
MTTF ti
N i 1
……. (3.7)
defined as
MTTF E[T ] tf (t )dt .......(3.8)
0
dF (t ) dR (t )
f (t )
dt dt
Hence, MTTF tdT (t )
0
tR (t ) 0 R ( x)dt
0
R(t )dt .........(3.9)
0
The MTTF can also be evaluated as the Laplace - transform of R(t) i.e.,
t
MTTF t ( R)dt lim R( x) dx
0 t 0 0
t
However, lim R( x) dx lim R( s )
t 0 s 0
N S (t t )
z (t ) ….. (3.11)
N S (t )t
Fig.3.2. It is assumed that the time intervals are identical. The time
()
z(t)
0 20 40 60 80 100
TIME (→)
precisely suitable.
distribution;
Z (t)= …….(3.15)
1
MTTF e t dt ... (3.19)
0
3.5.3 Linear – Hazard Model
where b is a constant
R(t ) exp bt dt ….. (3.21)
0
exp(bt 2 / 2)
useful period in the bath-tub curve, might follow this model for few
2
MTTF e bt /2
dt
0
T (1/ 2)
….. (3.23)
2 b/ 2 2b
This gives us
R(t ) exp at b 1 /(b 1) ….. (3.25)
f (t ) at b exp at b 1 / b 1) .….. (3.26)
and a affects the amplitude and so they are called shape and scale
parameters respectively.
1
MTTF b 1 ….. (3.27)
1 /( b 1)
b 1 a
b 1
dT (d )
(ad ) a
1
Where d
b 1
good fit to a few sets of failure data. However, it is not widely used as
t
a 1 e
z (t )
(a 1)!
t
R(t )
…………. (3.28)
R (t )
n 1
t j
e t …....... (3.29)
j 0 j!
a 1
f (t )
a 1!
t e t …..…….. (3.30)
MTTF = a /
describe the failure curves which do not fit in with the earlier
This is used when the initial hazard rate is almost zero for some time.
The following model also can serve this purpose to a limited extent.
For this,
R(t ) exp a / c e a 1 ….. (3.33)
component and thus its reliability. For almost all components the
Operating Voltage
1
rated voltage
Operating current
2
rated current
upon the variety of component. The other factors like quality factors,
application factor, complexity factor, etc will also be used for the
MODEL
the hazard rate of the part is known. The part is assumed to exist in
1-z(t) (t) 1
z(t) (t)
State 0 State 1
component failure and reach state 1 in the time interval t is z (t) t,
If P0 (t) = the reliability that the component will not fail in time t;
Then, the probability that the component will stay in state 0 at time t
+ t is given by
P0 t t P0 t
z t P0 t ….. (3.36)
t
P1 t t P1 t
z t P0 t ….. (3.37)
t
As t 0, We have
dP0 t
z t P0 t
dt
t
log P0 t z x dx c1
0
t
P0 t exp z x dx c1
0
t
c2 exp z x dx ….. (3.38)
0
64
P0 0 1 c2
t
Rt P0 t exp z x dx ….. (3.39)
0
the subsystems are evaluated and then combined to find the reliability
the system and the kind of its functions to evaluatefind the behavior
are known.
systems) are developed in this section. For all models the assumption
any component does not alter the failure of rest of the components.
3.8.1Series Systems
system, the accurate operation of all the ‘n' components, then we call
estimation. The information from the IN end will reach the OUT end
IN 1 2 n OUT
Let ‘Ei’ represent the event that component ‘i’ is working state
the system.
…..i,…..n respectively.
Rs Pr(E1 E 2 E3 E n )
reliabilities.
in parallel and the system does not fail, even one component is in
good working condition i.e the system fails only when all components
have failed”.
Let, the event that component ‘i’ is in good working state be ‘ Ei’.
The event that component ‘i’ is in failure state be ‘ Ei’ the number of
67
2
IN OUT
= Pr(E1).Pr(E2 /E1).Pr(E3/E2.E1)..Pr(Em)
1 e 1t . 1 e 2t . 1 e 3t ..... 1 e n t ….. (3.43)
System reliability,
unreliability.
1 p (t ) [1 p (t )] m 1
m
reliability
R1 t P t
R t 1 1 e m
MTTF 1 1 e
0
dt
m
Putting 1 e x, we get
1 1 1 x m
MTTF dx
0 1 x
1 1
1 x x
.......x m 1 dx
2
0
1 1 1 1
1 .............
2 3 m
1 m 1
i 1 i
….. (3.47)
1
MTTF [ In( m) 0.577 1 / 2m] ….. (3.48)
[exp(1t ) exp(2t ) ........exp(mt )
0
69
aircraft where two engines are required for successful operation and
two are kept as standby units. Many such systems are present in
given by
Where
m
B(m, x )
x
70
required for operation of the system without any failure is k , then the
m
R B(m, i ) p i (1 p ) m i ….. (3.50)
i k
When k=m the system becomes a series-system and when k=1 the
called basic units whose functioning is essential for the success of the
units required will increase and hence the redundant units decrease.
F1 F3
G1 G2
F2 F4
MW. If the capacity is higher than the demand, the system can meet
demand for less than 130 MW, the system will function as a 2-out-of-
demand of, say 195 M.W would mean that the system works as a 3-
R 6 p 2 (1 p)2
1
system. The reduction in case of constant failure rates is T
2
m
m
Q ( k , m) (1 p) i p m i ….. (3.51)
i m k 1 i
If the values of m and k are large, the approximate formula for mean
life is
1
m
1 2
MTTF ln ….. (3.54)
1
k
2
72
3.7. S1, S2 …… are the subsystems and direction of arrows show the
S3
S1
S5
IN OUT
S2 S4
S1 S3
S1 S5 S4
IN OUT
S2 S4
some subsystems find place in more than one path. The use of the
reliability be Ri.
reliability be Pi and.
R1 = p1p3
R2 = p2p5p4
R3 = p2p4
On simplification, we get
diagram showing the paths for these systems is illustrated in fig. 3.10.
S1 S2 S1 S2
IN OUT
S5
IN S5 OUT
S3 S4 S3 S4
S1 S2
S1 S5 S4
INPUT OUTPUT
S3 S5 S2
S3 S4
p5p1p2p3+2p1p2p3p4p5 …....(3.58)
For a value p=0.8 Eqs. (3.57) yields R=0.890 and Eq. (3.59) yields
R=0.91
It may be noted that Eqs. (3.36) and (3.37) are identical. Similarly, if
Section 1 Section 2
S1 S2
IN OUT
S3 S4
=p1+p3 -p1p3
Section 1 Section 2
S1 S2
IN OUT
S3 S4
p1p2p3p4 ....(3.63)
there are some exceptions. For example, an electric appliance will fail
estimation of system-reliability.
by
p=1-(q0+qs) -----(3.64)
fail. If there is a short-circuit across any one of them, it will cause the
C
C C
C2
1960’s. Since then they have been adopted for a broad variety of
within a system. Basic events at the base of the fault tree are
the fault tree usually represent human and component faults for
Pump failure
systems is based on the events that will cause the system to fail. “A
failure”. The faults which are at the lowest level of the system are
usually represented at the base of the tree and the system fault at the
top. The events which are at the lowest level are known as “basic
which may or may not be of interest and such events are represented
the three components. The fault tree diagram of this system is shown
in Fig. 3.15. For combining two or more events the logical OR and
C2
C1
C3
output signal only when all its inputs function. If Y represents the
Y = X1 X2 X3 …….. Xn
Y = X1 X2 X3 ……. Xn
System fails
OR
gate
E1 E1
AND
gate
E2 E3
paths in a system for accomplishing a given task such that all means
factors, for example the weight, size, initial cost and operating
can be applied in two ways as shown in Fig. 3.15. Assuming that the
Ru=1-(1-p1p2)(1- p1p2)
C1 C2 C1 C2
C2 C1 C2
C1
C
Rc [1 (1 p1 ) 2 ] [1 (1 p2 ) 2 ]
Ru 2 p 2 p 4
83
Rc p 2 ( 2 p ) 2 ……..(3.67)
Then,
Rc Ru p 2 [( 2 p ) 2 ( 2 p 2 )]
= p2(2-4p+2p2)
It is obvious from Equation (6.1) that Rc - Ru > 0 for 0 < p < 1 and Rc
level is more enhanced than redundancy at the unit level to the level
reliability would be
Rc [1 (1 p ) m ]n …… (3.70)
Ru 1 [1 p n ]m …….. (3.71)
is
R=p2(3-2p)
Ru 1 (1 0.648) 2 0.876
6
6
Rc p i (1 p) 6i
i 2 i
15 p 2 (1 p) 4 20 p 3 (1 p) 3 15 p 4 (1 p) 2 6 p 5 (1 p) p 6
are 0.8 and 0.4 respectively. This system will have a reliability of 0.32
which is much less than 0.4, the reliability of the weakest component.
methods:-
85
redundancy).
are presented in table 3.1. Types (b) and (c) shows the use of one
reliability. So the pay off will not be much when investment is done
B are same, then the reliability-cost ratio of type (b) is lower than that
of type (c).
A B 2
B Rb 1 1 pa pb 0.384
A
A B 2
C Rc pa 1 1 pb 0.512
B
B
A
Rd 2 pa pb pa2 pb2 0.5376
D
A B
A B
Re pa pb 2 pa 2 pb 0.6282
E
A B
86
A B 3
R f pa 1 1 pb 0.6272
B
F
B
A B 4
Rg pa 1 1 pb 0.6963
B
G B
A B
Rh 1 1 pa pb 2 pb 1 pa pb
H
B 0.6674
A B
A B 2 3
Ri 1 1 pa 1 1 pb
B
I A 0.7526
B
A B 3
R j 1 1 pa pb 0.6855
A B
J
A B
A B 2
Rk 1 1 pa pb 2 pb
B
0.7618
K A B
A B 3 3
Rl 1 1 pa 1 1 pb
A B
L 0.7777
A B
Now the configurations (c), (f) and (g) are considered in which
further as shown in (f). It may be noted that the reliability of type (e) is
87
higher than the reliability of type (f) even though both of them employ
should be done in A. Compare the reliabilities of types (g) and (i) both
type (g).
cost ratios. A comparison of types (j), (k) and (l) will demonstrate this.
then the unit redundancy is applied. Its reliability is more than the
sections.
All the components and equipments are not suitable for "active
problems when they are placed actively in parallel. If one of two such
such systems.
88
by
R p1 p2 pc (1 p1 ) p2 pc ps p1 (1 p2 ) p1 pc ……… (3.72)
Where
p1 = Unit-1 reliability
p2 = Unit-2 reliability
pc = switch reliability
redundant system. For simple parallel systems, the cost and reliability
of can be expressed as
Where ‘p’ and ‘c’ are the reliability and cost and of each element and
(ii) The cost, volume/weight of any stage raise linearly with the
unreliability ‘qi’ are used at stage ‘i’ then the reliability of the entire
R= =
R= …….(3.73)
Q= ……(3.74)
Then
and Q= …….(3.76)
j=1,2,……..K .…………….(3.77)
Where ‘Cij’ is the jth resource used at ith stage. The constraints
might represent total cost (C1), total volume (C2), etc. Equation (3.75)
and (3.76) represent the objective functions and the equation (3.77)
Maximize R=
cij mi < Cj
equality to zero.
log R=
= (- qimilog(qi))/ (1 - qimi ) *
(3.78)
qimi * log (qi) = q2mi *log (qi) = qn mn* log (qi) .….(3.80)
Where E = q1mi*
H = log (q1)
G = log (qi)
The detailed procedure for computing the vector ‘M’ is given below:
92
Step 3: Check cost and volume constraints. If they are not violated
go to otherwise go to step 5.
Step 5: Round of mi* such that the constraints are not violated.
systems.
3.10 Conclusions