Probability Distributions Arising in Connection with the Inspection Paradox for the Poisson Process

Abstract

The Inspection Paradox refers to the fact that in a Renewal Process, the length of the interarrival period which contains a fixed time is stochastically larger than the length of a typical interarrival period. To provide a more complete understanding of this phenomenon, conditioning arguments are used to obtain the distributions and moments of the lengths of the interarrival periods other than the one containing this fixed time for the case of the time-homogeneous Poisson Process. Distributions of the waiting times for events that occur both before and after this fixed time are derived. This provides a fairly complete probabilistic analysis of the Inspection Paradox.


1. Introduction

Consider a Renewal Process with interarrival times $X_1, X_2, \ldots$ and waiting times $S_n = \sum_{k=1}^{n} X_k$ for $n = 1, 2, \ldots$. Suppose that a piece of equipment, such as a battery, is installed at time zero and is immediately replaced upon failure by a similar battery. When this second battery fails, it is replaced by a similar battery, and this process is continued indefinitely. We can model this situation with a Renewal Process by regarding $X_k$ as being the lifetime of the $k$-th battery, $S_n$ as the time at which the $n$-th battery fails, and $N(t)$ as the number of failures (renewals) by time $t$.

Now, fix $t > 0$ and consider the length $X_{N(t)+1}$ of the interarrival period which contains the instant $t$. Even though $X_1, X_2, \ldots$ are independent and identically distributed, $X_{N(t)+1}$ is stochastically larger than $X_k$ for every positive integer $k$. This is the Inspection Paradox. With regard to the batteries, it says that the battery in use at time $t$ tends to have a longer lifetime than an ordinary battery. Its explanation lies in the fact that a longer interarrival period, rather than a shorter one, is more likely to cover the instant $t$.

The Inspection Paradox can arise in almost any sampling application. For example, there is a well-known result which says that in sampling by groups, one is less likely to encounter people who are in smaller groups [1] . This paradox appears in the sciences [2] and medicine [3] . In some circumstances, it can be remediated by using statistical procedures [4] . Its variations remain an area of study [5] . Instead of being regarded as a threat to the validity of an experiment, it can be used as a tool for gaining efficiency [6] . Our goal is to provide a more complete analysis of the paradox in the time-homogeneous Poisson Process case. We do this by deriving probability distributions and moments for the lengths of time periods that arise in the context of the paradox, and we examine some of their properties.

We write

$$X_{N(t)+1} = A_t + Y_t, \qquad (1)$$

where

$$A_t = t - S_{N(t)} \quad \text{and} \quad Y_t = S_{N(t)+1} - t.$$

The random variable $X_{N(t)+1}$ may be regarded as the lifetime of the battery in operation at time $t$, $A_t$ is the age at time $t$ of the battery operating at that instant, and $Y_t$ is the remaining life of this battery.

In the following, take the Renewal Process to be a time-homogeneous Poisson Process with rate $\lambda$, interarrival times $X_1, X_2, \ldots$, and waiting times $S_n = \sum_{k=1}^{n} X_k$ for $n = 1, 2, \ldots$. Such a process has independent increments, and the number of events in any interval of length $c$ has the Poisson distribution with mean $\lambda c$. The interarrival times are independent and identically distributed with the exponential distribution with mean $1/\lambda$. The waiting time $S_n$ has the Gamma$(n, \lambda)$ distribution with probability density function

$$f(s) = \begin{cases} 0 & \text{if } s < 0 \\ \dfrac{\lambda e^{-\lambda s}(\lambda s)^{n-1}}{(n-1)!} & \text{if } 0 \le s \end{cases}$$

and distribution function

$$F(s) = \begin{cases} 0 & \text{if } s < 0 \\ 1 - \displaystyle\sum_{k=0}^{n-1} \frac{e^{-\lambda s}(\lambda s)^k}{k!} & \text{if } 0 \le s. \end{cases}$$

$S_n$ is called the “waiting time” because, starting at time zero, it is the amount of time that an observer of the process waits before the $n$-th event occurs.

Starting at time $t$, the time $Y_t$ until the next renewal is exponentially distributed with mean $1/\lambda$ and is independent of $A_t$. This follows from the memoryless property of the exponential distribution. In fact, the independence of $A_t$ and $Y_t$ is known to characterize the Poisson Process. The distribution of $A_t$ is given by

$$P(A_t \le x) = \begin{cases} 0 & \text{if } x < 0 \\ 1 - e^{-\lambda x} & \text{if } 0 \le x < t \\ 1 & \text{if } t \le x \end{cases}$$

with expectation

$$E[A_t] = \frac{1}{\lambda}\left(1 - e^{-\lambda t}\right).$$

Thus, $A_t$ has the same distribution as $\min\{X_1, t\}$, and $Y_t$ has the same distribution as $X_1$. Each of these assertions separately is also known to characterize the Poisson Process.

From (1), the distribution of $X_{N(t)+1}$ is the convolution of the distributions of $A_t$ and $Y_t$. It follows that

$$P(X_{N(t)+1} \le x) = \begin{cases} 0 & \text{if } x < 0 \\ 1 - e^{-\lambda x}(1 + \lambda x) & \text{if } 0 \le x < t \\ 1 - e^{-\lambda x}(1 + \lambda t) & \text{if } t \le x \end{cases}$$

with expectation

$$E\left[X_{N(t)+1}\right] = \frac{2}{\lambda} - \frac{1}{\lambda} e^{-\lambda t}.$$
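To see where the middle case comes from, here is a brief sketch of the convolution computation (included for the reader's convenience; it is not reproduced from the source): for $0 \le x < t$, the age $A_t$ has density $\lambda e^{-\lambda a}$ on $[0, x]$ and its atom at $t$ does not contribute, so, by the independence of $A_t$ and $Y_t$,

$$P(X_{N(t)+1} \le x) = \int_0^x P(Y_t \le x - a)\, \lambda e^{-\lambda a}\, da = \int_0^x \left(1 - e^{-\lambda(x-a)}\right)\lambda e^{-\lambda a}\, da = 1 - e^{-\lambda x}(1 + \lambda x).$$

The case $t \le x$ is handled in the same way, with the atom $P(A_t = t) = e^{-\lambda t}$ contributing the additional term $e^{-\lambda t}\left(1 - e^{-\lambda(x-t)}\right)$.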

As $t \to \infty$,

$$A_t \xrightarrow{D} \text{exp}(\lambda),$$

$$X_{N(t)+1} \xrightarrow{D} \text{Gamma}(2, \lambda),$$

and

$$E\left[X_{N(t)+1}\right] \to \frac{2}{\lambda}, \qquad (2)$$

where $\xrightarrow{D}$ indicates convergence in distribution. From (2) it follows that for large $t$, the expected life of the battery in operation at time $t$ is approximately twice the expected life of a typical battery, whose lifetime has an exponential distribution with mean $1/\lambda$. This background material is presented in [7] [8].
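As an illustrative numerical check (a minimal simulation sketch, not part of the original development; the rate $\lambda$, inspection time $t$, and number of sample paths are arbitrary choices), the following Python snippet generates Poisson-process sample paths and compares the empirical means of $A_t$, $Y_t$, and $X_{N(t)+1}$ with the formulas above.

```python
import numpy as np

rng = np.random.default_rng(0)
lam, t, n_paths = 1.0, 5.0, 100_000   # arbitrary rate, inspection time, and sample size

ages, residuals, covering = [], [], []
for _ in range(n_paths):
    # Generate waiting times S_0 = 0, S_1, S_2, ... until the process passes time t.
    s, arrivals = 0.0, [0.0]
    while s <= t:
        s += rng.exponential(1.0 / lam)
        arrivals.append(s)
    s_nt, s_nt1 = arrivals[-2], arrivals[-1]   # S_{N(t)} and S_{N(t)+1}
    ages.append(t - s_nt)                      # A_t
    residuals.append(s_nt1 - t)                # Y_t
    covering.append(s_nt1 - s_nt)              # X_{N(t)+1}

print("E[A_t]:        sim", np.mean(ages),      "theory", (1 - np.exp(-lam * t)) / lam)
print("E[Y_t]:        sim", np.mean(residuals), "theory", 1 / lam)
print("E[X_{N(t)+1}]: sim", np.mean(covering),  "theory", 2 / lam - np.exp(-lam * t) / lam)
```

With $\lambda = 1$ and $t = 5$, the simulated mean of $X_{N(t)+1}$ comes out near $2 - e^{-5} \approx 1.993$, roughly twice the mean lifetime of a typical battery, which is the paradox in numerical form.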

We give explicit formulas for the distributions and moments of the lengths of the interarrival periods other than the one containing a fixed instant $t$ when the underlying Renewal Process is a time-homogeneous Poisson Process with rate $\lambda$. We show that the infinite sequence of lengths of the interarrival periods preceding the one containing $t$ is strictly stochastically decreasing. Also, we demonstrate that the lengths of the interarrival periods succeeding the one containing $t$ are independent exponential random variables with mean $1/\lambda$, and we derive the distributions and moments of $S_{N(t)+k}$ for every integer $k$.

Section 2 presents the probability distribution for the length of each interarrival period (e.g., lifetime) that ends before a fixed inspection time t and each one that begins after this time. It is noted that, proceeding backwards in time from the instant t, the lifetimes are strictly stochastically decreasing, and all moments for each of these lifetimes are presented. In light of the memoryless property of a Poisson process, it isn’t surprising that the lifetimes that begin after time t are independent and identically distributed exponential random variables having a distribution that doesn’t depend on t. Section 3 presents the probability distribution for the waiting time (i.e., partial sum) corresponding to each of the interarrival periods mentioned above. Section 4 summarizes these results and discusses the limitations of this study.

2. Interarrival Times

We begin with a definition.

Definition 1. A random variable $U$ is strictly stochastically smaller than a random variable $V$ if for all real $x$,

$$P(U \le x) \ge P(V \le x),$$

and if the inequality between these two probabilities is strict for some real number $x$. A sequence $\{U_n\}_{n=1}^{\infty}$ of random variables is strictly stochastically decreasing if $U_{n+1}$ is strictly stochastically smaller than $U_n$ for $n \ge 1$.

It is well-known that, given that $N(t) = n$, the conditional joint distribution of the waiting times that terminate prior to time $t$ is the same as the joint distribution of the order statistics for $n$ independent random variables that are each uniformly distributed on the interval $[0, t]$ [8]. It follows that each of the lengths of the corresponding interarrival times has the same beta distribution with pdf $f_t(x) = (n/t^n)(t-x)^{n-1}$ for $0 < x < t$ [9]. The following theorem gives the unconditional probability distribution and moments for the length of each of the interarrival periods that end before the fixed time $t$.
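The following short simulation is a sketch of a numerical check of this conditional property (the values of $n$, $t$, and the sample size are arbitrary): it samples the waiting times directly from the order-statistic representation and compares the empirical distribution of one interarrival length, given $N(t) = n$, with the stated beta distribution, whose distribution function is $1 - (1 - x/t)^n$ on $(0, t)$.

```python
import numpy as np

rng = np.random.default_rng(1)
t, n, n_reps = 5.0, 4, 200_000   # arbitrary horizon, conditioning value N(t) = n, sample size

# Given N(t) = n, the waiting times S_1 < ... < S_n are distributed as the order
# statistics of n i.i.d. Uniform(0, t) variables, so we sample them directly.
u = np.sort(rng.uniform(0.0, t, size=(n_reps, n)), axis=1)
spacings = np.diff(u, axis=1, prepend=0.0)     # the interarrival times X_1, ..., X_n

x = 1.0                                        # an arbitrary point at which to compare
empirical = np.mean(spacings[:, 2] <= x)       # third interarrival time, say
theoretical = 1.0 - (1.0 - x / t) ** n         # CDF of the beta density (n/t^n)(t-x)^(n-1)
print(empirical, theoretical)                  # the two values should nearly agree
```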

Theorem 1. If $j \in \{0, 1, \ldots\}$ and $t > 0$, then

$$P(X_{N(t)-j} \le x) = \begin{cases} 0 & \text{if } x < 0 \\ 1 - e^{-\lambda x} \displaystyle\sum_{k=j+1}^{\infty} \frac{e^{-\lambda(t-x)} \lambda^k (t-x)^k}{k!} & \text{if } 0 \le x < t \\ 1 & \text{if } t \le x \end{cases} \qquad (3)$$

and for $m = 1, 2, \ldots$, the $m$-th moment of $X_{N(t)-j}$ is

$$E\left[X_{N(t)-j}^m\right] = \frac{m!}{\lambda^m} \sum_{k=m+j+1}^{\infty} \frac{e^{-\lambda t}(\lambda t)^k}{k!}.$$

Proof. We define $X_{N(t)-j} = 0$ if $j \ge N(t)$, and we note that $0 \le X_{N(t)-j} \le t$. By conditioning on the value of $N(t)$, we have for $x \in [0, t]$ that

$$P(X_{N(t)-j} \le x) = \sum_{n=0}^{\infty} P(X_{N(t)-j} \le x,\, N(t) = n).$$

For $n = 0, 1, \ldots, j$,

$$P(X_{N(t)-j} \le x,\, N(t) = n) = P(N(t) = n) = \frac{e^{-\lambda t}(\lambda t)^n}{n!}, \qquad (4)$$

where we have defined $X_k = 0$ for $k \le 0$.

For $n \ge j + 1$,

$$P(X_{N(t)-j} \le x,\, N(t) = n) = P(X_{n-j} \le x,\, S_n \le t,\, S_{n+1} > t). \qquad (5)$$

This probability does not depend on $j \in \{0, 1, \ldots, n-1\}$. Setting $j = 0$ in the right-hand side of (5) and conditioning on the value of $S_{n-1}$, we have

$$\begin{aligned} P(X_{N(t)-j} \le x,\, N(t) = n) &= P(X_n \le x,\, S_{n-1} + X_n \le t,\, S_{n-1} + X_n + X_{n+1} > t) \\ &= \int_0^{t} P(X_n \le \min\{x, t-s\},\, X_n + X_{n+1} > t-s)\, dF_{n-1}(s) \\ &= \int_0^{t-x} P(X_n \le x,\, X_n + X_{n+1} > t-s)\, dF_{n-1}(s) \\ &\quad + \int_{t-x}^{t} P(X_n \le t-s,\, X_n + X_{n+1} > t-s)\, dF_{n-1}(s), \end{aligned} \qquad (6)$$

where $F_{n-1}(s)$ is the distribution function of $S_{n-1}$.

For $s \in [0, t-x]$,

$$\begin{aligned} P(X_n \le x,\, X_n + X_{n+1} > t-s) &= P(X_1 \le x,\, X_1 + X_2 > t-s) \\ &= P(N(x) = 1,\, N(t-s) - N(x) = 0) \\ &= P(N(x) = 1)\, P(N(t-s) - N(x) = 0) \\ &= e^{-\lambda x}(\lambda x)\, e^{-\lambda(t-s-x)} = \lambda x\, e^{-\lambda(t-s)}, \end{aligned} \qquad (7)$$

where we have used the fact that a Poisson Process has independent increments. For $s \in [t-x, t]$,

$$P(X_n \le t-s,\, X_n + X_{n+1} > t-s) = P(X_1 \le t-s,\, X_1 + X_2 > t-s) = P(N(t-s) = 1) = \lambda(t-s)\, e^{-\lambda(t-s)}. \qquad (8)$$

We may use the fact that $S_{n-1}$ has the Gamma$(n-1, \lambda)$ distribution to obtain from (6), (7), and (8) that for $n \ge j+1$,

$$P(X_{N(t)-j} \le x,\, N(t) = n) = \int_0^{t-x} \lambda x\, e^{-\lambda(t-s)}\, dF_{n-1}(s) + \int_{t-x}^{t} \lambda(t-s)\, e^{-\lambda(t-s)}\, dF_{n-1}(s) = e^{-\lambda t}\left\{\frac{(\lambda t)^n}{n!} - \frac{\lambda^n(t-x)^n}{n!}\right\}. \qquad (9)$$

Using (4) and (9), we have after summing on $n$ that for $x \in [0, t]$,

$$P(X_{N(t)-j} \le x) = e^{-\lambda t}\left\{\sum_{n=0}^{j} \frac{(\lambda t)^n}{n!} + \sum_{n=j+1}^{\infty} \frac{(\lambda t)^n}{n!} - \sum_{n=j+1}^{\infty} \frac{\lambda^n(t-x)^n}{n!}\right\} = 1 - e^{-\lambda x}\sum_{k=j+1}^{\infty} \frac{e^{-\lambda(t-x)}\lambda^k(t-x)^k}{k!},$$

which establishes (3). Therefore, we may write

$$P(X_{N(t)-j} > x) = \begin{cases} 1 & \text{if } x < 0 \\ e^{-\lambda x}\displaystyle\sum_{k=j+1}^{\infty} \frac{e^{-\lambda(t-x)}\lambda^k(t-x)^k}{k!} & \text{if } 0 \le x < t \\ 0 & \text{if } t \le x. \end{cases}$$

Then,

$$\begin{aligned} E\left[X_{N(t)-j}^m\right] &= \int_0^{\infty} m x^{m-1} P(X_{N(t)-j} > x)\, dx = \int_0^{t} m x^{m-1} e^{-\lambda x}\sum_{k=j+1}^{\infty} \frac{e^{-\lambda(t-x)}\lambda^k(t-x)^k}{k!}\, dx \\ &= \int_0^{t} m x^{m-1} e^{-\lambda x}\left(1 - \sum_{k=0}^{j} \frac{e^{-\lambda(t-x)}\lambda^k(t-x)^k}{k!}\right) dx \\ &= \int_0^{t} m x^{m-1} e^{-\lambda x}\, dx - e^{-\lambda t}\sum_{k=0}^{j} \frac{\lambda^k}{k!}\int_0^{t} m x^{m-1}(t-x)^k\, dx. \end{aligned}$$

Now,

$$\int_0^{t} m x^{m-1} e^{-\lambda x}\, dx = \frac{m!}{\lambda^m}\int_0^{t} \frac{(\lambda x)^{m-1}}{(m-1)!}\,\lambda e^{-\lambda x}\, dx = \frac{m!}{\lambda^m}\left(1 - \sum_{k=0}^{m-1} \frac{e^{-\lambda t}(\lambda t)^k}{k!}\right),$$

where the last equality follows from the fact that the last integral is the value at $t$ of the distribution function for the Gamma$(m, \lambda)$ distribution. Also,

$$\int_0^{t} m x^{m-1}(t-x)^k\, dx = t^{m+k}\int_0^{1} m y^{m-1}(1-y)^k\, dy = t^{m+k}\,\frac{m!\,k!}{(m+k)!}.$$

Therefore,

$$E\left[X_{N(t)-j}^m\right] = \frac{m!}{\lambda^m}\left(1 - \sum_{k=0}^{m-1} \frac{e^{-\lambda t}(\lambda t)^k}{k!}\right) - e^{-\lambda t}\sum_{k=0}^{j} \frac{\lambda^k}{k!}\, t^{m+k}\,\frac{m!\,k!}{(m+k)!} = \frac{m!}{\lambda^m}\sum_{k=m+j+1}^{\infty} \frac{e^{-\lambda t}(\lambda t)^k}{k!}.$$
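Before turning to the properties of this distribution, here is a Monte Carlo sanity check of Theorem 1 (a sketch only; the rate, inspection time, lag $j$, and sample size are arbitrary choices). It compares the empirical distribution function and mean of $X_{N(t)-j}$ with (3) and with the moment formula for $m = 1$.

```python
import numpy as np
from math import exp, factorial

rng = np.random.default_rng(2)
lam, t, j, n_paths = 1.0, 5.0, 1, 100_000   # arbitrary parameters for the check

def poisson_tail(mu, k0):
    # P(Poisson(mu) >= k0)
    return 1.0 - sum(exp(-mu) * mu ** k / factorial(k) for k in range(k0))

samples = np.empty(n_paths)
for i in range(n_paths):
    s, arrivals = 0.0, [0.0]
    while s <= t:
        s += rng.exponential(1.0 / lam)
        arrivals.append(s)
    n_t = len(arrivals) - 2                  # N(t); arrivals = [S_0, S_1, ..., S_{N(t)+1}]
    if n_t - j >= 1:
        samples[i] = arrivals[n_t - j] - arrivals[n_t - j - 1]   # X_{N(t)-j}
    else:
        samples[i] = 0.0                     # convention X_{N(t)-j} = 0 when j >= N(t)

x = 1.0                                      # arbitrary point with 0 <= x < t
theory_cdf = 1.0 - exp(-lam * x) * poisson_tail(lam * (t - x), j + 1)   # formula (3)
theory_mean = (1.0 / lam) * poisson_tail(lam * t, j + 2)                # moment formula, m = 1
print("P(X <= x): sim", np.mean(samples <= x), "theory", theory_cdf)
print("E[X]:      sim", np.mean(samples),      "theory", theory_mean)
```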

Remark 1. It follows from Theorem 1 that as $t \to \infty$,

$$X_{N(t)-j} \xrightarrow{D} X_1 \quad \text{and} \quad E\left[X_{N(t)-j}^m\right] \to \frac{m!}{\lambda^m} = E\left[X_1^m\right] \quad \text{for } m = 1, 2, \ldots.$$

Since exponential growth dominates polynomial growth, one minus the sum that appears in the statement of Theorem 1 approaches zero as $t$ approaches infinity for fixed $j$. So $X_{N(t)-j}$ converges in distribution to $X_1$ as $t$ approaches infinity. Moreover,

$$P(X_{N(t)-j} \le x) > 1 - e^{-\lambda x} \quad \text{for } x > 0,$$

and so $X_{N(t)-j}$ is strictly stochastically smaller than $X_k$ for every positive integer $k$. Since $N(t)$ is a finite random variable, $P(X_{N(t)-j} = 0 \text{ for all sufficiently large } j) = 1$. The sequence $\{X_{N(t)-j}\}_{j=0}^{\infty}$ is strictly stochastically decreasing, and the distribution function of $X_{N(t)-j}$ has a single jump of size

$$P(N(t) \le j) = \sum_{k=0}^{j} \frac{e^{-\lambda t}(\lambda t)^k}{k!}$$

at zero. Of course, the size of this jump tends to zero as $t \to \infty$. Routine calculus shows that the probability density function associated with the absolutely continuous part of the distribution function of $X_{N(t)-j}$ is strictly decreasing and concave up on the interval $(0, t)$. If $j \ge 1$, this density is continuous everywhere except at zero, where it has a jump of size

$$\lambda\left(\sum_{k=j}^{\infty} \frac{e^{-\lambda t}(\lambda t)^k}{k!}\right)\bigg/\left(\sum_{k=j+1}^{\infty} \frac{e^{-\lambda t}(\lambda t)^k}{k!}\right).$$

If $j = 0$, the density of $X_{N(t)-j}$ is continuous everywhere except at zero, where it has a jump of size $\lambda/(1 - e^{-\lambda t})$, and at $t$, where it has a jump of size $\lambda e^{-\lambda t}/(1 - e^{-\lambda t})$.

The next theorem gives the probability distribution for the length of each of the interarrival periods that begin after the fixed time t.

Theorem 2. If $j \in \{2, 3, \ldots\}$ and $t > 0$, then $X_{N(t)+j}$ is exponentially distributed with mean $1/\lambda$.

Proof. We condition on the value of $N(t)$ to obtain for $x > 0$ that

$$\begin{aligned} P(X_{N(t)+j} \le x) &= \sum_{n=0}^{\infty} P(X_{N(t)+j} \le x,\, N(t) = n) = \sum_{n=0}^{\infty} P(X_{n+j} \le x,\, S_n \le t,\, S_{n+1} > t) \\ &= \sum_{n=0}^{\infty} P(X_{n+j} \le x)\, P(S_n \le t,\, S_{n+1} > t) = \sum_{n=0}^{\infty} \left(1 - e^{-\lambda x}\right) P(N(t) = n) = 1 - e^{-\lambda x}. \end{aligned}$$

Remark 2. This proof can be shortened by observing that, since $N(t)+1$ is a stopping time for $X_1, X_2, \ldots$, the event $\{N(t) = n\}$ is independent of $X_{n+j}$ for $j \ge 2$.
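A small simulation makes the contrast between Theorem 2 and the Inspection Paradox concrete (a sketch with arbitrary parameter choices): when interarrival periods are picked out of simulated paths by their position relative to $t$, the one covering $t$ has mean close to $2/\lambda - e^{-\lambda t}/\lambda$, while the one two places after it has mean $1/\lambda$, as Theorem 2 asserts.

```python
import numpy as np

rng = np.random.default_rng(3)
lam, t, n_paths = 1.0, 5.0, 100_000            # arbitrary rate, inspection time, sample size
covering, succeeding = np.empty(n_paths), np.empty(n_paths)

for i in range(n_paths):
    # Draw interarrival times until the waiting times pass t with two to spare.
    xs = list(rng.exponential(1.0 / lam, size=20))
    while sum(xs[:-2]) <= t:
        xs.extend(rng.exponential(1.0 / lam, size=20))
    s = np.cumsum(xs)                                # waiting times S_1, S_2, ...
    n_t = int(np.searchsorted(s, t, side="right"))   # N(t) = number of waiting times <= t
    covering[i] = xs[n_t]                            # X_{N(t)+1}  (xs[0] is X_1)
    succeeding[i] = xs[n_t + 1]                      # X_{N(t)+2}

print("mean X_{N(t)+1}:", covering.mean(), "vs", 2 / lam - np.exp(-lam * t) / lam)
print("mean X_{N(t)+2}:", succeeding.mean(), "vs", 1 / lam)
```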

3. Waiting Times

The next theorem gives the distribution of the waiting time for each event which occurs before the fixed time t.

Theorem 3. If $j \in \{0, 1, 2, \ldots\}$ and $t > 0$, then

$$P(t - S_{N(t)-j} \le s) = \begin{cases} 0 & \text{if } s < 0 \\ P(Y \le s) & \text{if } 0 \le s < t \\ 1 & \text{if } t \le s, \end{cases}$$

where $Y$ has the Gamma$(j+1, \lambda)$ distribution.

Proof. Observe that $0 \le S_{N(t)-j} \le t$. If $0 \le s < t$, then

$$P(t - S_{N(t)-j} \le s) = P(N(t) - N(t-s) \ge j+1) = 1 - \sum_{k=0}^{j} \frac{e^{-\lambda s}(\lambda s)^k}{k!} = P(Y \le s).$$

Remark 3. From Theorem 3, as $t \to \infty$,

$$t - S_{N(t)-j} \xrightarrow{D} \text{Gamma}(j+1, \lambda).$$

Indeed, Theorem 3 states that for $s < t$, $P(t - S_{N(t)-j} \le s) = P(Y \le s)$, where $Y$ has the Gamma$(j+1, \lambda)$ distribution, so $t - S_{N(t)-j}$ converges in distribution to $Y$ as $t$ approaches infinity. Since $N(t)$ is a finite random variable, $P(S_{N(t)-j} = 0 \text{ for all sufficiently large } j) = 1$. The distribution function of $t - S_{N(t)-j}$ has a single jump of size

$$P(N(t) \le j) = e^{-\lambda t}\sum_{k=0}^{j} \frac{(\lambda t)^k}{k!}$$

at t. A straightforward, but tedious, calculation yields that

$$E\left[(t - S_{N(t)-j})^m\right] = \frac{m!}{\lambda^m}\left\{\binom{m+j}{m} - \sum_{k=0}^{j}\binom{m-1+k}{m-1}\sum_{l=0}^{m-1+k} \frac{e^{-\lambda t}(\lambda t)^l}{l!}\right\}.$$

It follows that as $t \to \infty$,

$$E\left[(t - S_{N(t)-j})^m\right] \to \frac{m!}{\lambda^m}\binom{m+j}{m} = E\left[Y^m\right],$$

where $Y$ has the Gamma$(j+1, \lambda)$ distribution.
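The moment formula above lends itself to a numerical consistency check that needs no simulation (a sketch with arbitrary parameter values). Writing $W = t - S_{N(t)-j}$, Theorem 3 gives $P(W > s) = \sum_{k=0}^{j} e^{-\lambda s}(\lambda s)^k/k!$ for $0 \le s < t$ and $P(W > s) = 0$ for $s \ge t$, so $E[W^m] = \int_0^t m s^{m-1} P(W > s)\, ds$, which can be compared with the closed form by quadrature.

```python
from math import comb, exp, factorial
from scipy.integrate import quad

lam, t, j, m = 1.0, 5.0, 2, 3          # arbitrary rate, inspection time, lag, and moment order

def survival(s):
    # P(W > s) for 0 <= s < t, i.e., the Gamma(j+1, lam) survival function
    return sum(exp(-lam * s) * (lam * s) ** k / factorial(k) for k in range(j + 1))

numeric, _ = quad(lambda s: m * s ** (m - 1) * survival(s), 0.0, t)

closed = (factorial(m) / lam ** m) * (
    comb(m + j, m)
    - sum(comb(m - 1 + k, m - 1)
          * sum(exp(-lam * t) * (lam * t) ** l / factorial(l) for l in range(m + k))
          for k in range(j + 1))
)
print(numeric, closed)                 # the two values agree up to quadrature error
```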

Our last theorem gives the distribution of the waiting time for each event that occurs after the fixed time t. Although these distributions are of primarily theoretical interest, knowing them provides a more complete understanding of the Poisson Process.

Theorem 4. If $j \in \{1, 2, \ldots\}$ and $t > 0$, then $S_{N(t)+j} - t$ has the Gamma$(j, \lambda)$ distribution.

Proof. Note that $S_{N(t)+j} - t > 0$. For $x > 0$,

$$P(S_{N(t)+j} - t \le x) = P(N(t+x) - N(t) \ge j) = 1 - \sum_{k=0}^{j-1} \frac{e^{-\lambda x}(\lambda x)^k}{k!}.$$

Remark 4. The following is another proof of Theorem 4. Recall that $Y_t$ is exponentially distributed with mean $1/\lambda$, and, from Theorem 2, observe that for $k = 2, \ldots, j$, $X_{N(t)+k}$ also has that distribution. Because $N(t)+1$ is a stopping time for the sequence $X_1, X_2, \ldots$, the random variables $Y_t, X_{N(t)+2}, \ldots, X_{N(t)+j}$ are independent. Since

$$S_{N(t)+j} - t = Y_t + \sum_{k=2}^{j} X_{N(t)+k},$$

Theorem 4 follows.
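The first proof of Theorem 4 rests on the standard identity $P(\text{Gamma}(j, \lambda) \le x) = P(\text{Poisson}(\lambda x) \ge j)$, which connects the displayed sum to the Gamma$(j, \lambda)$ distribution function. A quick numerical confirmation of this identity (a sketch with arbitrary values of $\lambda$, $x$, and $j$) is:

```python
from scipy import stats

lam, x, j = 1.0, 2.5, 3                                     # arbitrary values
gamma_cdf = stats.gamma.cdf(x, a=j, scale=1.0 / lam)        # P(Gamma(j, lam) <= x)
poisson_tail = 1.0 - stats.poisson.cdf(j - 1, mu=lam * x)   # P(Poisson(lam * x) >= j)
print(gamma_cdf, poisson_tail)                              # the two values coincide
```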

4. Conclusion

In summary, we have effectively completed the mathematical study of the Inspection Paradox for the case of the Poisson Process. We have accomplished this by deriving the probability distributions and moments for all of the interarrival periods other than the one containing the fixed inspection time under consideration, and also by finding the distributions and moments of the associated waiting times. We note that our results apply only in the case of a Poisson Process and not for an arbitrary Renewal Process. Since an arbitrary Renewal Process does not necessarily have memoryless interarrival times, the theoretical challenges associated with accomplishing this generalization seem formidable.

Conflicts of Interest

The authors declare no conflicts of interest.

References

[1] Feld, S. (1991) Why Your Friends Have More Friends Than You Do. American Journal of Sociology, 96, 1464-1477.
https://doi.org/10.1086/229693
[2] Morozov, E. and Potakhina, L. (2014) An Application of the Inspection Paradox in Stability Analysis of Optical Systems. 2014 6th International Congress on Ultra Modern Telecommunications and Control Systems and Workshops (ICUMT), St. Petersburg, 6-8 October 2014, 522-525.
https://doi.org/10.1109/ICUMT.2014.7002156
[3] Zelen, M. and Feinleib, M. (1969) On the Theory of Screening for Chronic Diseases. Biometrika, 56, 601-614.
https://doi.org/10.1093/biomet/56.3.601
[4] Lann, A. and Falk, R. (2006) Tell Me the Method, I’ll Give You the Mean. The American Statistician, 60, 322-327.
https://doi.org/10.1198/000313006X151460
[5] Ross, S.M. (2003) The Inspection Paradox. Probability in the Engineering and Informational Sciences, 17, 47-51.
https://doi.org/10.1017/S0269964803171033
[6] Pal, A., Kostinski, S. and Reuveni, S. (2022) The Inspection Paradox in Stochastic Resetting. Journal of Physics A: Mathematical and Theoretical, 55, 1-25.
https://doi.org/10.1088/1751-8121/ac3cdf
[7] Karlin, S. and Taylor, H.M. (1975) A First Course in Stochastic Processes. 2nd Edition, Academic Press, San Diego.
https://doi.org/10.1016/B978-0-08-057041-9.50005-2
[8] Ross, S.M. (1993) An Introduction to Probability Models. 5th Edition, Academic Press, San Diego.
https://doi.org/10.1016/B978-0-12-598455-3.50004-0
[9] Hogg, R.V., McKean, J.W. and Craig, A.T. (2018) Introduction to Mathematical Statistics. 8th Edition, Pearson, Boston.
