An Infinite Series Expression for the Joint Probability Density Function of OccupationTimes in a Three-State Markov Chain ()
1. Introduction
The occupation time probability density function was the subject of several research papers in the 60s and 70s as seen in Darroch [1], Pedler [2], Good [3], and Hsia [4]. More recently, there has been renewed interest in the subject as seen in Sericola [5] as well as applications in Fang [6] and Guo [7]. In this paper, we extend Darroch’s identity to any finite state Markov chain and then derive an explicit three-state joint probability density function in a manner similar to Pedler’s original two-state derivation.
2. Occupation Time for a Three-State Markov Chain
Consider an irreducible continuous time Three-State Markov chain
with rate matrix
if
ranging over states
with initial distribution
where
denotes the probability of starting in
at time 0. Then for
, the occupation times of
by time
are
where
and
.
Theorem 1 Given
, the joint probability density function of the occupation time for a Three-State Markov chain up to time t is given by
where
(1)
and
are Dirac delta functionals satisfying the property that
if
and 0 otherwise.
3. Derivation of Three-State Occupation Time Probability
Density Function
The proof of Theorem 1 is based on the following Lemmas:
Lemma 2
(2)
where
is the dot product of
and
,
,
the column vector of 1s,
the occupation time of state
by time
, and
Proof of Lemma 2.
From Darroch [1], one may write the moment generating function of the weighted sum of states as
with
,
the matrix with elements
,
where
are arbitrary weights,
the Kronecker delta, and
Then setting
, with
arbitrary, the moment generating function is extended to a multivariate moment generating function and we obtain (2) as desired.
Lemma 3
The inverse Laplace transform of the expression below reads
(3)
Proof of Lemma 3.
Recall that
if
and
if
is a positive integer where the latter identity may be obtained by repeated integration by parts. Splitting
(4)
into cases where the exponent(s) of
are zero or non-zero, we may ascertain that (4) is equivalent to
(5)
Taking the inverse Laplace transform of (5) gives (3) as desired.
Proof of Theorem 1.
Let the joint probability density function of
be
and denote the bivariate moment generating function of
by
. Then taking the Laplace transform of
and applying Lemma 2 for
and
, we obtain
(6)
where
Denoting the numerator of (6) as
and factoring
, (6) may be written as
(7)
Then for sufficiently large
we may expand (7) as a geometric series and rewrite it as
(8)
From the equality of (8) and (6), we have
Finally, by taking the inverse Laplace transform of (8) in a manner akin to the proof of Lemma 3 for all terms gives (1) as desired.
4. Illustrative Example
We present a table with values provided by a Wolfram Mathematica implementation of (1) as well as a simulation of 109 paths in C++. The Mathematica implementation of (1) involves only a summation of index
up to 40 terms instead of the infinite summation of (1), and for the path simulation the proportion of paths satisfying
,
was used to calculate the simulated probabilities. The relevant parameters are as follows.
|
|
|
|
|
|
|
|
0.036051100 |
0.055836600 |
0.002708140 |
0.000005337 |
0.000000001 |
0 |
|
0.290716000 |
0.384080000 |
0.009010030 |
0.000003378 |
0 |
0 |
|
0.123652000 |
0.093462900 |
0.000392612 |
0 |
0 |
0 |
|
0.003429840 |
0.000652630 |
0 |
0 |
0 |
0 |
|
0.000003147 |
0 |
0 |
0 |
0 |
0 |
|
0 |
0 |
0 |
0 |
0 |
0 |
Path Simulation |
|
|
|
|
|
|
|
0.036047441 |
0.055842521 |
0.002709707 |
0.000005406 |
0 |
0 |
|
0.290723689 |
0.384056728 |
0.009008985 |
0.000003493 |
0 |
0 |
|
0.123652673 |
0.093470497 |
0.000393724 |
0 |
0 |
0 |
|
0.003433051 |
0.000649946 |
0 |
0 |
0 |
0 |
|
0.000002139 |
0 |
0 |
0 |
0 |
0 |
|
0 |
0 |
0 |
0 |
0 |
0 |
Comparison of the simulated values with the exact values in the table above shows good agreement, whereby offering Monte Carlo method as an alternative to exact pdf formula which are expected to be very lengthy for
larger than 3.
Without a closed form expression, considerations must be made when determining an upper bound for index
in (1). An appropriate upper bound for
often depends on
and
where sufficiently large values of
can result in the truncated version of (1) performing poorly. An informal heuristic for gauging the accuracy of the truncated pdf is by confirming that its integral over its domain is sufficiently close to 1. The authors decision to pick the rate matrix and initial distribution above is based off of this observation.
5. Discussions
We would like to note that in principle, one may derive the occupation time probability density function for any finite state Markov chain by utilizing the relevant extension of Darroch’s identity as given by Lemma 2. However, as seen in (1), even for a Three-State Markov chain, the formula is rather lengthy and its derivation is more suited to a computer algebra system. To the best of the authors’ knowledge, (1) is the first explicit example of the occupation time probability density function for a Three-State Markov chain, and is also the first demonstration that Darroch’s identity and Pedler’s derivation may be extended beyond two states. As far as we are aware, Sericola’s approach appears to be the only other method that may be used to derive the occupation time probability density function in a finite state Markov chain; however, we did not find any explicit results in the literature. Concerning Hsia’ result, the probability density function found in Hsia [4] is expressed in terms of random variables with unknown distributions and cannot be directly integrated to calculate numerical values.
On a final note, we would like the reader to be aware that there are several published results for the Two-State occupation time probability density function that contain typographical errors. Among incorrectly typed formulas, the omission of the Dirac delta functional is particularly common and the resulting probability density function fails to integrate to 1 over its domain. Regarding the Two-State occupation time probability density function, Pedler’s formula as written was verified to be correct. As for future work, it would be desirable to have a closed form for various parts of (1) if it exists. Alternatively, new methods for deriving the probability density function may yield more tractable representations of (1).
Acknowledgements
The authors would like to thank the reviewers for their valuable suggestions that considerably improved the presentation of our findings in the revised version of the manuscript.