A Maintenance Service Improvement Approach Based on Queue Networks: Application ()
1. Introduction
Queuing theory was developed to provide models for predicting the behavior of systems responding to random requests. The first problems studied concerned the congestion of telephone traffic. Erlang will find that a telephone system can be modeled by arrivals of fish customers and exponential uptime [1] . A queuing system could be represented by Figure 1 and Figure 2 [2] .
The two queues schematized above are not the only types to be found in this area, but there are several. In a queuing process, if the client arrives and finds the server busy, it joins the queue associated with that server. At a certain point, the client is selected for service according to a rule known as a policy or service discipline. The expected service is then provided and the client exits the system.
In queue systems, a distinction is made between open systems where no restriction on the size of the queue is imposed; and closed systems where only a limited number of clients are allowed to stay in the queue.
In this paper, we are particularly interested in queuing networks, as they allow us to model and predict the behavior of the system in all its states. Thus, in any state of the system, reliability and availability can be assessed.
2. Markovian Queues
Markovian queues are those for which inter-arrivals and service times are exponential. Their Kendall notation will be of the form M/M/··· (M for Markovian...) [3] - [8] .
2.1. The M/M/1 Queue
This queue is characterized by a Poissonian arrival with birth rate
and an exponential service life with death rate
[3] [4] .
Let
, the queue can be considered as a process of birth and death, for which we have the relation (1).
(1)
The state probability (taking into account that
for there to be a steady state) is given by relation (2).
(2)
So
(3)
All performance parameters are calculated for the case where the queue is stable
and for the steady state of the queue.
The illustrative diagram is given by Figure 3.
2.1.1. The Rate d
Here
because for all
. Another way to look at it is to notice that the service is performed at a rate
in every state where the system has at least one client [3] [4] .
(4)
2.1.2. Server Utilization Rate
(5)
2.1.3. Average Number of Clients L
The average number of clients is calculated from the stationary probabilities of the following way:
From where
(6)
2.1.4. Average Residence Time in System T
This parameter is obtained using Little’s law, given by the relation (7).
(7)
This formula can be decomposed according to relation (8).
(8)
From the previous relationship, we deduce respectively the average time spent in the queue and the average number of customers in the queue by relationships (9) and (10).
(9)
(10)
2.2. The M/M/1/K Queue
Consider a single server system identical to the M/M/1 queue except that the capacity of the queue is finite. We therefore always have the following assumptions: the process of arriving customers in the queue is a Poisson process with rate
and a customer’s service time is an exponential random variable with rate
. Let K be the capacity of the queue: this is the maximum number of clients that can be present in the system, either on standby or in service. When a customer arrives when there are already K customers present in the system, it is lost [3] [4] .
Figure 4 shows an example of an M/M/1/K queue.
The process of birth and death modeling this type of queue is then defined by relation (11).
(11)
The integration of the recurring equation for performing calculations is given as follows:
(12)
(13)
2.2.1. The Rate d
The system flow can be calculated in two equivalent ways: either by measuring the departure rate of clients leaving the server
, or by measuring the actual arrival rate of clients accepted into the
system. We would of course expect to obtain equality of these two rates.
The output rate of the server is equal to
as soon as the queue is not empty:
(14)
The effective input rate to the file is equal to
from the moment a client arrives when the file is not full:
(15)
Since
, we have
, where d is the average throughput of the queue (entry or exit):
(16)
Note that, when K tends to infinity, we find the results of M/M/1, that is to say, provided that
, which corresponds to the condition of stability of M/M/1.
2.2.2. U(K) Server Utilization Rate
(17)
Thus, in the case of a queue with limited capacity, the utilization rate is no longer equal to
. Indeed, the rate of use is always equal to the ratio of the average input rate to the average service rate (Little’s law):
.
Note that, as
,
if
, and towards one if
[10] .
2.2.3. Average Number of Clients L
The average number of customers in the system is given by relation (18).
(18)
2.2.4. Average Residence Time in System T
We calculate T by Little’s formula, but
not being constant, we must calculate
, that is:
with
Hence:
(
) (19)
Table 1 illustrates the different symbols we will use in queues and their meaning.
Table 1. Symbols and meanings [2] .
3. Markov Chains
Markov chains—or State Space Method (MEE)—were developed in the 1950s for the reliability analysis of repairable systems [11] [12] [13] [14] [15] .
This method consists of representing the operation of a system by a set of components that can be in a finite number of operating and fault states.
A graphic support (the state graph) makes it possible to visualize the different states of a system which are represented by circles and interconnected by oriented arcs which correspond to the transitions (breakdowns and repairs) between states. For an n-component system, if each component has two states (working and failing), the maximum number of states is 2n. Figure 5 shows an example of a Markovian model of a system.
To perform this analysis, you must first identify all the states of the system, classify them into operating states or failure states. Then, it is necessary to find out how to switch from one state to another during a malfunction or repair. With each transition from state
to state
, a transition rate
is associated which is defined in such a way that
is equal to the probability of passing from
to
between two very close instants t and
knowing that we are at time t in
. Finally, the last step consists in calculating the probabilities of being in the different states during a system life period as well as of calculating the characteristics of dependability [12] .
Using Markov graph modeling, time and stochastic dependencies are more widely taken into account than with classical methods.
In addition, they are easy to use [13] .
4. Modeling of Availability and Reliability Using the Theory of Queues and Markov Chains
4.1. Definition of a Homogeneous Markov Chain over Time
We will assume that the system transits from state i to state j with a probability which depends only on states i and j. Such a system is said to be memoryless or even Markovian, which means that the probability does not depend on the states prior to i through which the system has passed during its history. Thus, the future state of the studied system depends only on its present state and not on its past states.
These numbers are arranged in the matrix
. This matrix is stochastic because the (stochastic) vector in row i contains the probabilities of all
possible transitions starting from state i and therefore their sum is equal to one. The sequence of stochastic vectors
for
satisfies the matrix recurrence formula given by Relation (20).
(20)
The more compact form is given by Equation (21).
(21)
4.2. Definition of a Homogeneous Markov Chain over Time
The recurrence formula
applied at
, gives:
At any instant t (integer), the stochastic vector is therefore given by Equation (22).
(22)
4.3. Stationary Distribution
If the eigenvalue 1 of the stochastic matrix P of a homogeneous Markov chain is simple and dominant (any other eigenvalue has a modulus strictly less than 1) then the sequence
converges to a strictly positive matrix
of the form given by expression (23), with
[16] .
(23)
Moreover, any sequence
defined by:
and
,
,
converges to the vector
as defined by relation (24).
(24)
is the unique distribution of probabilities verifying:
.
4.4. Graphical Representation of Markov Chains
A homogeneous Markov chain with a set of states E can be represented by the directed graph G such that:
· its vertices are the states E,
· the arc from vertex i to vertex j exists if,
,
· the evaluation of the arc
is given by the probability of transition
.
A Markov chain can then be seen as a random walk on Graph G: we draw randomly
realization
according to the
law, then of
we draw
(realization of
) according to the transition probabilities of the arcs coming from
, etc [17] .
5. Application
Either a production system, so the history of operating hours or failures during a year of operation reveals the times between failures (TBF) and the times to repair (TTR), given in Table 2.
We suppose that our production system can be found in three states: the state (E0) where there is no failure with a probability of
, the state (E1) where there is a partial failure with a probability of
and the state (E2) where we observe a total failure of the system with a probability of
. The probabilities of transitions between the different states are defined in the matrix of transitions given in relation (25).
(25)
We make the following working hypotheses:
· service disciplines first come, first served (FIFO),
· arrivals according to a Poisson process,
Table 2. History of operating hours (h).
· exponential service time,
· either the M/M/1 queue.
5.1. Graphical Representations of the Markov Chains of the Production System
From the previous matrix, we deduce the graph of the availability states, by Figure 6.
It is assumed that the production system is reliable if and only if the probabilities of repairs resulting from transitions from state (E2) are zero (as long as we opt instead for palliative maintenance in this state).
Hence the reliability graph in Figure 7.
5.2. Parameters of Reliability and Maintainability of the Production System
The processing of data from the history of operating hours makes it possible to calculate the mean time between failure (MTBF), the mean time to repair (MTTR), the failure rate (λ) and the repair rate (μ).
Figure 6. Production system availability status graph.
Figure 7. Production system reliability status graph.
5.3. Analysis of Production Chain Queue Networks
5.3.1. U Server Utilization Rate
By definition, the utilization rate is the probability that the server in the queue is busy.
From relation (5),
.
5.3.2. Average Number of Clients L
From relation (6),
.
From relation 10, we deduce the average number of customers in the queue, i.e.
.
5.3.3. Average Residence Time T
This parameter is obtained using Little’s law of relation (7).
From relation (9), we deduce the average time spent in the queue, namely:
5.3.4. System State Equation
In steady state, we have:
We find the same general term of probability as that found in the literature [9] .
Now,
, so
and
.
We can see that
, so the production system can admit 0, 1 or 2 customers.
5.4. Production Availability Calculation
We have:
5.4.1. After Two Years of Operation
We set the following initial conditions:
.
The probability of the system in its states
after two years of operation is given as follows:
5.4.2. After Four Years of Operation
From the recurrence relation given by Equation (21), we deduce by conjecture that:
(26)
So, after four years, the probability of the system in its states is given as follows:
5.4.3. After Ten Years of Operation
5.4.4. When the Production System Returns to Steady State
When
, Equation (26) becomes:
However, according to the expression (23),
We deduce that the production system reaches the operating limit (maximum number of years of operation) with an availability of 91.82%, when it is in state (E0); with an availability of 7.51%, when it is in state (E1) and with an availability of 0%, when it is in state (E2).
On the other hand, we also note that during the life cycle of the production system, availability decreases in state (E0). On the other hand, when it is in state (E2), availability is zero, which I translate by the fact of its inactivity in this state.
5.5. Calculation of the Reliability of the Production System
According to the equations of state of the system, in a steady state, we have:
;
and
.
In this case, the new matrix of system state transitions, in agreement with Figure 7 is given by:
5.5.1. After Two Years of Operation
We set the following initial conditions:
The probability of the system in its states after two years of operation is given as follows:
5.5.2. After Four Years of Operation
From relation 26, we deduce after four years that the probability of the system in its states is given as follows:
5.5.3. After Ten Years of Operation
5.5.4. When the Production System Returns to Steady State
We see that the last row of the matrix P is zero, therefore the matrix P is no longer stochastic, therefore the relation (23) is no longer applicable in a steady state.
On the other hand, in a steady state, we have:
, because
, with
and
.
So:
.
The production system reaches the operating limit with a reliability of 93.49%, when it is in state (E0); with 6.49% reliability, when in the (E1) state and with 0% reliability, when in the (E2) state.
5.6. Proposals for Improving the Reliability and Availability of the Production System
To have better availability and adequate reliability, the following operations must be carried out:
· Set up a group of executives and technicians specializing in the field of production chain operation;
· Develop a preventive maintenance program corresponding to the needs of the production line;
· Work in close collaboration with all other departments of the company;
· Ensure rigorous control of maintenance operations.
6. Conclusion
Having reached the end of the writing of this paper, whose general objective was to evaluate the availability and the reliability of the networks of Markov queues, we showed how to take advantage of the knowledge on the theory of the chains of Markov, in order to help in the analysis of waiting phenomena. We have presented an application on a production line where the arrival process is Poissonian. This approach allowed us to show the influence of availability on reliability. Based on the results obtained, proposals for improving maintenance were made.
Acknowledgements
The authors would like to thank the anonymous reviewers for their time and effort. Their constructive comments and helpful suggestions helped us to clarify the main paper’s research contributions and improve its quality.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.