Generalized Kumaraswamy Generalized Power Gompertz Distribution: Statistical Properties, Application, and Validation Using a Modified Chi-Squared Goodness of Fit Test

Abstract

A new six-parameter continuous distribution called the Generalized Kumaraswamy Generalized Power Gompertz (GKGPG) distribution is proposed in this study, a graphical illustration of the probability density function and cumulative distribution function is presented. The statistical features of the Generalized Kumaraswamy Generalized Power Gompertz distribution are systematically derived and adequately studied. The estimation of the model parameters in the absence of censoring and under-right censoring is performed using the method of maximum likelihood. The test statistic for right-censored data, criteria test for GKGPG distribution, estimated matrix Ŵ, Ĉ, and Ĝ, criteria test Y2n, alongside the quadratic form of the test statistic is derived. Mean simulated values of maximum likelihood estimates and their corresponding square mean errors are presented and confirmed to agree closely with the true parameter values. Simulated levels of significance for Y2n (γ) test for the GKGPG model against their theoretical values were recorded. We conclude that the null hypothesis for which simulated samples are fitted by GKGPG distribution is widely validated for the different levels of significance considered. From the summary of the results of the strength of a specific type of braided cord dataset on the GKGPG model, it is observed that the proposed GKGPG model fits the data set for a significance level ε = 0.05.

Share and Cite:

Maxwell, O. , Onyedikachi, I. , Aidi, K. , Akpa, C. and Seddik-Ameur, N. (2022) Generalized Kumaraswamy Generalized Power Gompertz Distribution: Statistical Properties, Application, and Validation Using a Modified Chi-Squared Goodness of Fit Test. Applied Mathematics, 13, 243-262. doi: 10.4236/am.2022.133019.

1. Introduction

The Gompertz distribution is a continuous probability distribution often applied in lifetime data analysis to describe the distribution of the science such as biology [1], gerontology [2], adult lifespans by demographers [3], actuaries [4], marketing [5], network theory [6] and computer science [7]. The Gompertz distribution has a convex hazard function. It is a flexible distribution, skewed to the right and the left, and a generalization of the exponential distribution.

To produce a more flexible distribution for a highly skewed dataset, new families of distributions are proposed daily. Some of these families of distributions include the Generalized Kumaraswamy generalized family by Nofal et al. [8], the Marshall-Olkin generalized family by Yousof et al. [9], the odd Dagum generalized family by Afify and Alizadeh [10], a new generalized Weibull-G family by Cordeiro et al. [11], a new Weibull-G family by Tahir et al. [12], the Gompertz generalized family by Alizadeh et al. [13], the Type II Power Topp-Leone generated family by Bantan et al. [14], the generated odd burr III family BY Hag et al. [15], Exponentiated-G (EG) by Cordeiro et al. [16], Weibull-X by Alzaatreh et al. [17], Weibull-G by Bourguignon et al. [18], Logistic-G by Torabi and Montazari [19], Gamma-X by Alzaatreh et al. [20], a Lomax-G family by Cordeiro et al. [21], Exponentiated T-X by Alzaghal et al. [22], a Beta Marshall-Olkin family of distributions by Alizadeh et al. [23], Logistic-X by Tahir et al. [24], the beta generalized family (Beta-G) by Eugene et al. [25], a Lindley G family by Cakmakyapan and Ozel [26], Odd Lindley-G family by Gomes-Silva et al. [27], Transmuted family of distributions by Shaw and Buckley [28], Gamma-G (type 1) by Zografos and Balakrishnan [29], the Kumaraswamy-G by Cordeiro and de Castro [30], McDonald-G by Alexander et al. [31], Gamma-G (type 2) by Ristic et al. [32], Gamma-G (type 3) by Torabi and Montazari [33], Log-gammaG by Amini et al. [34], and so on.

Statistics show that a powerful transformation is a series of functions used to create a monotonous data transformation using power functions. Applied to the random variable, the technique is useful in stabilizing variance, making the data more normal distribution-like, improving the validity of association measures like the Pearson correlation between variables, and providing a more flexible model by adding a new parameter named power parameter. The works of Ieren et al. [35], Ghitany et al. [36], and Rady et al. [37] prove this fact. Ieren et al. [35] proposed the power Gompertz distribution, and derived certain properties of the new distribution. Estimated parameters by Maximum Probability Estimate (MLE) were provided. The application of the proposed model with other existing distributions to a data set of remission times for a random sample of 128 patients with bladder cancer was done with the power Gompertz model providing better performance than the Gompertz model, Ghitany et al. [36] introduced the power Lindley distribution. This model provides more flexibility than Lindley distribution when applied to lifetime data, Rady et al. [37] proposed the Power Lomax distribution, when applied to bladder cancer data, the proposed Power Lomax distribution exhibited a much more flexible model than the Lomax distribution. To produce a more flexible distribution for a highly skewed dataset, our focus in this paper is to present an extension of the power Gompertz distribution using the generalized Kumaraswamy generalized family of distribution [8], the resulting distribution is a six-parameter continuous distribution called the generalized Kumaraswamy generalized power Gompertz distribution, various statistical properties will be looked at. The method of maximum likelihood is discussed for estimating the model parameter. We also construct and analyze the generalized Nikulin Rao-Robson goodness-of-fit statistic test Y n 2 (Bagdonavicius and Nikulin [38], Bagdonavicius and Nikulin [39] ) for the generalized Kumaraswamy generalized power Gompertz distribution based on censored data.

The remaining parts of this article are presented in sections as follows: formation of the new distribution is provided in Section 2. In Section 3, we analyzed the plots of the probability density and cumulative distribution function. Derivation of some properties of the new distribution such as asymptotic behavior, quantile function for median, Skewness and Kurtosis, and reliability analysis was discussed in Section 4. The distribution of order statistics in Section 5, estimation of parameters based on censored and uncensored random samples using Maximum Likelihood Estimation (MLE) is provided in Section 6. We evaluate the new goodness-of-fit statistic test Y n 2 , and investigate some criteria test for the generalized Kumaraswamy generalized power Gompertz distribution in Section 7, a simulation study was carried out in Section 8, and an application of the new model to the dataset is illustrated in Section 9.

2. Formation of the Generalized Kumaraswamy Generalized Power Gompertz Distribution (GKGPG)

The Power Gompertz (PG) distribution [35] with positive parameter α , β and θ has pdf and cdf given by:

g ( x ) = α θ x θ 1 e β x θ e α β ( e β x θ 1 ) (1)

and:

G ( x ) = 1 e α β ( e β x θ 1 ) (2)

where x > 0 , α > 0 , β > 0 , θ > 0

The cdf of the Generalized Kumaraswamy Generalized (GK-G) family is defined (for x > 0 ) by:

F ( x ) = 1 [ 1 c G ( x ) a ] b 1 ( 1 c ) b (3)

The corresponding pdf of the GK-G family is given by:

f ( x ) = a b c g ( x ) 1 ( 1 c ) b [ G ( x ) ] a 1 [ 1 c G ( x ) a ] b 1 (4)

where 0 < c 1 , a > 0 and b > 0 are shape parameters.

The hazard rate function (hrf) of the GK-G family is given by:

h ( x ) = a b c g ( x ) [ G ( x ) ] a 1 [ 1 c G ( x ) a ] b 1 [ 1 c G ( x ) a ] b ( 1 c ) b (5)

Hence the pdf and cdf of the newly proposed Generalized Kumaraswamy Generalized Power Gompertz (GKGPG) distribution is given by:

f ( x ) = a b c α θ x θ 1 e β x θ e α β ( e β x θ 1 ) 1 ( 1 c ) b [ 1 e α β ( e β x θ 1 ) ] a 1 [ 1 c ( 1 e α β ( e β x θ 1 ) ) a ] b 1 (6)

And:

F ( x ) = 1 [ 1 c ( 1 e α β ( e β x θ 1 ) ) a ] b 1 ( 1 c ) b (7)

where x > 0 , 0 < c 1 , a > 0 , b > 0 , α > 0 , β > 0 , θ > 0 .

3. Graphical Description of the Generalized Kumaraswamy Generalized Power Gompertz Distribution (GKGPG)

Here, we graphically illustrate the probability density function, and cumulative distribution function of the generalized kumaraswamy generalized power Gompertz distribution at different parameter values.

Remarks: Figure 1 represents the behavior of the density plot the effect of the different parameter values. The probability density function of the generalized kumaraswamy generalized power Gompertzdistribution is unimodal; it is also decreasing, and right skewed, depending on the indicated parameter values.

Remarks: Figure 2 represents the cdf plot, clearly, the cdf approaches one (1) as X tends to infinity and equals zero when X tends to zero.

4. Statistical Properties of the Generalized Kumaraswamy Generalized Power Gompertz Distribution (GKGPG)

4.1. Asymptotic Behavior

This section examines the limiting behavior of the GKGPG distribution as X and as X 0 .

Figure 1. PDF plot of the Generalized Kumaraswamy Generalized Power Gompertz Distribution (GKGPG) at different parameter values.

Figure 2. CDF plot of the Generalized Kumaraswamygeneralized Power Gompertz Distribution (GKGPG) at different parameter values.

For the pdf,

lim x f ( x ) = lim x [ a b c α θ x θ 1 e β x θ e α β ( e β x θ 1 ) 1 ( 1 c ) b [ 1 e α β ( e β x θ 1 ) ] a 1 [ 1 c ( 1 e α β ( e β x θ 1 ) ) a ] b 1 ]

lim x f ( x ) = [ a b c α θ θ 1 e β θ e α β ( e β θ 1 ) 1 ( 1 c ) b [ 1 e α β ( e β θ 1 ) ] a 1 [ 1 c ( 1 e α β ( e β θ 1 ) ) a ] b 1 ] = 0

(8)

lim x 0 f ( x ) = lim x 0 [ a b c α θ x θ 1 e β x θ e α β ( e β x θ 1 ) 1 ( 1 c ) b [ 1 e α β ( e β x θ 1 ) ] a 1 [ 1 c ( 1 e α β ( e β x θ 1 ) ) a ] b 1 ]

lim x 0 f ( x ) = [ a b c α θ 0 θ 1 e β 0 θ e α β ( e β 0 θ 1 ) 1 ( 1 c ) b [ 1 e α β ( e β 0 θ 1 ) ] a 1 [ 1 c ( 1 e α β ( e β 0 θ 1 ) ) a ] b 1 ] = 0

(9)

For the cdf,

lim x F ( x ) = lim x [ 1 [ 1 c ( 1 e α β ( e β x θ 1 ) ) a ] b 1 ( 1 c ) b ] = 1 [ 1 c ( 1 e α β ( e β θ 1 ) ) a ] b 1 ( 1 c ) b = 1 (10)

lim x 0 F ( x ) = lim x 0 [ 1 [ 1 c ( 1 e α β ( e β x θ 1 ) ) a ] b 1 ( 1 c ) b ] = 1 [ 1 c ( 1 e α β ( e β 0 θ 1 ) ) a ] b 1 ( 1 c ) b = 0 (11)

4.2. Quantile Function

The quantile function (qf) of X, say Q ( u ) = F 1 ( u ) can be obtained by inverting Equation (3) numerically, and it is given by:

Q ( u ) = G 1 { c 1 [ 1 ( 1 u d ) 1 b ] } 1 a (12)

where d = 1 ( 1 c ) b .

Ieren et al. (2019) defined the quantile function of the power Gompertz distribution as:

G 1 ( u ) = X q = [ 1 β log [ 1 β α log ( 1 u ) ] ] 1 / θ (13)

By substituting Equations (12) in (13), we obtain the quantile function of the GKGPG distribution as:

Q ( u ) = [ 1 β log [ 1 β α log ( 1 { c 1 [ 1 ( 1 u d ) 1 b ] } 1 a ) ] ] 1 / θ (14)

This above derived function is used to obtain certain moments, such as Skewness and Kurtosis, as well as the median of the distribution and generation of random variables from the distribution concerned.

4.3. Skewness and Kurtosis

The analysis of the Skewness and Kurtosis variability on the shape parameters can be examined on the basis of quantile action. The weaknesses of the conventional measure of Kurtosis are well known. Kenney and Keeping [40] gives the Bowely Skewness based on quantiles as:

S k = Q ( 3 4 ) 2 Q ( 1 2 ) + Q ( 1 4 ) Q ( 3 4 ) Q ( 1 4 ) (15)

Moors et al. [41] gave the Moors quantile based Kurtosis as:

K u = Q ( 7 8 ) Q ( 5 8 ) Q ( 3 8 ) + Q ( 1 8 ) Q ( 6 8 ) Q ( 1 8 ) (16)

With Q ( . ) is obtainable using the equation of the quantile function as given in Equation (14).

4.4. Reliability Analysis of the GKGPG Distribution

The Survival function of the generalized kumaraswamy generalized power Gompertz distribution is given as (Figure 3).

S ( x ) = 1 [ 1 [ 1 c ( 1 e α β ( e β x θ 1 ) ) a ] b 1 ( 1 c ) b ] (17)

where x > 0 , 0 < c 1 , a > 0 , b > 0 , α > 0 , β > 0 , θ > 0 .

The Hazard failure of the generalized kumaraswamy generalized power Gompertz distribution is given as (Figure 4).

Figure 3. Survival plot of the Generalized Kumaraswamy generalized Power Gompertz Distribution (GKGPG) at different parameter values.

Figure 4. Hazard function plot of the Generalized Kumaraswamy generalized Power Gompertz Distribution (GKGPG) at different parameter values.

h ( x ) = a b c [ α θ x θ 1 e β x θ e α β ( e β x θ 1 ) ] [ 1 e α β ( e β x θ 1 ) ] a 1 [ 1 c [ 1 e α β ( e β x θ 1 ) ] a ] b 1 [ 1 c [ 1 e α β ( e β x θ 1 ) ] a ] b ( 1 c ) b (18)

where x > 0 , 0 < c 1 , a > 0 , b > 0 , α > 0 , β > 0 , θ > 0 .

5. Order Statistics

For i = 1 , , n from an independent and identically distributed random variables, let X 1 , , X n denote a random sample from the Generalized Kumaraswamy generalized Power Gompertz Distribution with cdf F ( x ) , and pdf given by Equations (3) and (4) respectively. Then the probability density function f i : n ( x ) of the ith order statistics of the GKGPG distribution is given by:

f i : n ( x ) = n ! ( i 1 ) ! ( n i ) ! k = 0 n i ( 1 ) k ( n i k ) f ( x ) F ( x ) K + i + 1 (19)

By substituting Equations (6) and (7) into the ith order statistics of the GKGPG distribution, we have that:

f i : n ( x ) = n ! ( i 1 ) ! ( n i ) ! k = 0 n i ( 1 ) k ( n i k ) a b c α θ x θ 1 e β x θ e α β ( e β x θ 1 ) 1 ( 1 c ) b [ 1 e α β ( e β x θ 1 ) ] a 1 [ 1 c ( 1 e α β ( e β x θ 1 ) ) a ] b 1 [ 1 [ 1 c ( 1 e α β ( e β x θ 1 ) ) a ] b 1 ( 1 c ) b ] k + i 1

(20)

Hence the minimum order statistics X ( 1 ) for the GKGPG distribution is given by:

f 1 : n ( x ) = n k = 0 n 1 ( 1 ) k ( n 1 k ) a b c α θ x θ 1 e β x θ e α β ( e β x θ 1 ) 1 ( 1 c ) b [ 1 e α β ( e β x θ 1 ) ] a 1 [ 1 c ( 1 e α β ( e β x θ 1 ) ) a ] b 1 [ 1 [ 1 c ( 1 e α β ( e β x θ 1 ) ) a ] b 1 ( 1 c ) b ] k (21)

Similarly, the maximum order statistics X ( n ) for the GKGPG distribution is given by:

f n : n ( x ) = n a b c α θ x θ 1 e β x θ e α β ( e β x θ 1 ) 1 ( 1 c ) b [ 1 e α β ( e β x θ 1 ) ] a 1 [ 1 c ( 1 e α β ( e β x θ 1 ) ) a ] b 1 [ 1 [ 1 c ( 1 e α β ( e β x θ 1 ) ) a ] b 1 ( 1 c ) b ] n 1 (22)

6. Parameter Estimation

6.1. Maximum Likelihood Estimation

Here, the parameters of the GKGPG distribution are estimated using the method of maximum likelihood. Let X 1 , X 2 , , X n be random samples distributed according to the GKGPG distribution, the likelihood function is obtained by the relationship:

l n ( γ ) = i = 1 n ln f ( X , γ ) (23)

l n ( γ ) = n ln ( a b c α θ ) + ( θ 1 ) i = 1 n ln ( x i ) + β i = 1 n x i θ α β i = 1 n ( e β x i θ 1 ) i = 1 n ln ( s i ) + ( a 1 ) i = 1 n ln ( 1 φ i ) + ( b 1 ) i = 1 n ln ( ϖ i ) (24)

With s i = 1 ( 1 c ) b , ϖ i = 1 c ( 1 e α β ( e β x θ 1 ) ) a , φ i = e α β ( e β x θ 1 ) , v i = e β x θ 1 .

The maximum likelihood estimators a ^ , b ^ , c ^ , α ^ , β ^ and θ ^ of the unknown parameters a , b , c , α , β and θ are derived from the nonlinear following score equations:

L a = n a + i = 1 n ln ( 1 φ i ) i = 1 n c ( b 1 ) ( 1 φ i ) a ln ( 1 φ i ) ϖ i (25)

L b = n b + i = 1 n ( 1 c ) b ln ( 1 c ) s i + i = 1 n ln ( ϖ i ) (26)

L c = n c i = 1 n b ( 1 c ) b 1 s i i = 1 n ( b 1 ) ( 1 φ i ) a ϖ i (27)

L α = n α i = 1 n v i β + i = 1 n ( a 1 ) φ i v i β ( 1 φ i ) i = 1 n a c ( b 1 ) φ i v i ( 1 φ i ) a 1 ϖ i (28)

L β = i = 1 n x i θ + i = 1 n α v i β 2 α β i = 1 n x i θ e β x i θ + α ( a 1 ) β 2 i = 1 n ( 1 e β x i θ + β x i θ e β x i θ ) φ i 1 φ i + a c ( b 1 ) β 2 i = 1 n ( 1 e β x i θ + β x i θ e β x i θ ) φ i ( 1 φ i ) a 1 ϖ i (29)

L θ = n θ + i = 1 n ln ( x i ) ( 1 + β x i θ α x i θ e β x i θ ) + α ( a 1 ) i = 1 n x i θ ln ( x i ) e β x i θ φ i 1 φ i a α c ( b 1 ) i = 1 n x i θ ln ( x i ) e β x i θ φ i ( 1 φ i ) a 1 ϖ i (30)

6.2. Estimation under Right-Censored Data

The hypothesizing test will be discussed under complete and censored data, however, the MPS is only defined for complete data, since the MLE is usually considered for right-censored data, Let us consider X 1 , X 2 , , X n a random right censored sample obtained from the GKGPG distribution with the parameter vector γ = ( a , b , c , α , β , θ ) T . The censoring time τ is fixed. So, the observation X i is equal to X i = ( x i , δ i ) where:

δ i = { 0 if x i isacensoringtime 1 if x i isafailuretime (31)

In this case, the log-likelihood is obtained as follow:

L n ( γ ) = i = 1 n δ i ln f ( x i , γ ) + i = 1 n ( 1 δ i ) ln S ( x i , γ ) (32)

L n ( γ ) = i = 1 n δ i [ n ln ( a b c α θ ) + ( θ 1 ) ln ( x i ) + β x i θ α v i β ln ( s i ) + ( a 1 ) ln ( 1 φ i ) + ( b 1 ) ln ( ϖ i ) ] + i = 1 n ( 1 δ i ) ln ( 1 δ i ) ln ( 1 1 ϖ i b s i ) (33)

The maximum likelihood estimators a ^ , b ^ , c ^ , α ^ , β ^ and θ ^ of the unknown parameters a , b , c , α , β and θ are derived from the nonlinear following score equations:

L a = i = 1 n δ i [ 1 a + ln ( 1 φ i ) c ( b 1 ) ( 1 φ i ) a ln ( 1 φ i ) ϖ i ] b c i = 1 n ( 1 δ i ) ( 1 φ i ) a ln ( 1 φ i ) ϖ i b 1 s i ( 1 ϖ i b ) (34)

L b = i = 1 n δ i [ 1 b + ( 1 c ) b ln ( 1 c ) s i + ln ( ϖ i ) ] + i = 1 n ( 1 δ i ) [ s i ϖ i b ln ( ϖ i ) ( 1 c ) b ln ( 1 c ) ( 1 ϖ i b ) s i ( s i ( 1 ϖ i b ) ) ] (35)

L c = i = 1 n δ i [ 1 c b ( 1 c ) b 1 s i ( b 1 ) ( 1 φ i ) a ϖ i ] i = 1 n ( 1 δ i ) [ s i b ( 1 φ i ) a ϖ i b 1 b ( 1 c ) b 1 ( 1 ϖ i b ) s i ( s i ( 1 ϖ i b ) ) ] (36)

L α = i = 1 n δ i [ 1 α v i β + ( a 1 ) φ i v i β ( 1 φ i ) a c ( b 1 ) φ i v i ( 1 φ i ) a 1 ϖ i ] a c b i = 1 n ( 1 δ i ) φ i v i ( 1 φ i ) a 1 ϖ i b 1 s i ( 1 ϖ i b ) (37)

L β = i = 1 n δ i [ x i θ + α v i β 2 α β x i θ e β x i θ + α ( a 1 ) β 2 ( 1 e β x i θ + β x i θ e β x i θ ) φ i 1 φ i + a c ( b 1 ) β 2 ( 1 e β x i θ + β x i θ e β x i θ ) φ i ( 1 φ i ) a 1 ϖ i ] + a c b β 2 i = 1 n ( 1 δ i ) ( 1 e β x i θ + β x i θ e β x i θ ) φ i ( 1 φ i ) a 1 ϖ i b 1 s i ( 1 ϖ i b ) (38)

L θ = i = 1 n δ i [ 1 θ + ln ( x i ) ( 1 + β x i θ α x i θ e β x i θ ) + α ( a 1 ) x i θ ln ( x i ) e β x i θ φ i 1 φ i a α c ( b 1 ) x i θ ln ( x i ) e β x i θ φ i ( 1 φ i ) a 1 ϖ i ] a α c b i = 1 n ( 1 δ i ) x i θ ln ( x i ) e β x i θ φ i ( 1 φ i ) a 1 ϖ i b 1 s i ( 1 ϖ i b ) (39)

Monte Carlo technique or other iterative methods can be used to determine the values of a ^ , b ^ , c ^ , α ^ , β ^ and θ ^ .

7. Test Statistic for Right Censored Data

Let X 1 , , X n be n i.i.d. random variables grouped into r classes I i . To assess the adequacy of a parametric model F₀:

H 0 : P ( X i x H 0 ) = F 0 ( x ; γ ) , x 0 , γ = ( γ 1 , , γ s ) T Θ R s (40)

When data are right censored and the parameter vector β is unknown, Bagdonavicius and Nikulin [38] proposed a statistic test Y2 based on the vector:

Z j = 1 n ( U j e j ) , j = 1 , 2 , , r with r s . (41)

This one represents the differences between observed and expected numbers of failures ( U j and e j ) to fall into these grouping intervals I j = ( p j 1 , p j ] with p 0 = 0 , p r = τ , where τ is a finite time. The authors considered p j as random data functions such as ther intervals chosen have equal expected numbers of failures e j .

The statistic test Y2 is defined by:

Y 2 = Z T Σ ^ Z = i = 1 r ( U j e j ) 2 U j + Q (42)

where Z = ( Z 1 , , Z r ) T and Σ ^ is a generalized inverse of the covariance matrix Σ ^ and:

Q = W T G ^ W , A ^ j = U j n , U j = i : X i I j δ i

W = ( W 1 , , W s ) T , G ^ = [ g ^ l l ] s × s , g ^ l l = i ^ l l j = 1 r C ^ l J G ^ l J A ^ J 1

C ^ l j = 1 n i : X i I j δ i ln h ( x i , γ ^ ) γ , i ^ l l = 1 n i = 1 n δ i ln h ( x i , γ ^ ) γ l ln h ( x i , γ ^ ) γ l

W ^ l = j = 1 r C ^ l J A ^ J 1 Z j , l , l = 1 , , s

γ ^ is the maximum likelihood estimator of γ on initial non-grouped data.

Under the null hypothesis H₀, the limit distribution of the statistic Y2 is a chi-square with r = r a n k ( Σ ) degrees of freedom. The description and applications of modified chi-square tests are discussed in Voinov et al. [42].

The interval limits p j for grouping data into j classes I j are considered as data functions and defined by:

p ^ j = H 1 ( E j l = 1 i 1 H ( x l , γ ^ ) n i + 1 , γ ^ ) , p ^ j = max ( X ( n ) , τ ) (43)

Such as the expected failure times e j to fall into these intervals are e j = E r r

for any j, with E r = i = 1 n H ( x i , γ ) . The distribution of this statistic test Y n 2 is chi-square (see Voinov et al., 2013).

7.1. Criteria Test for GKGPG Distribution

For testing the null hypothesis H₀ that data belong to the GKGPG model, we construct a modified chi-squared type goodness-of-fit test based on the statistic Y2. Suppose that τ is a finite time, and observed data are grouped into r > s sub-intervals I j = ( p j 1 , p j ] of [ 0 , τ ] . The limit intervals p j are considered as random variables such that the expected numbers of failures in each interval I j are the same, so the expected numbers of failures e j are obtained as:

E j = j r 1 i = 1 n ln ( 1 1 ϖ i b s i ) , j = 1 , , r 1 (44)

7.2. Estimated Matrix W ^ and C ^

The components of the estimated matrix W ^ are derived from the estimated matrix C ^ which is given by:

C ^ 1 j = 1 n i : x i I j n δ i [ 1 a + ln ( 1 φ i ) c ( b 1 ) ( 1 φ i ) a ln ( 1 φ i ) ϖ i + b c ( 1 φ i ) a ln ( 1 φ i ) ϖ i b 1 s i ( 1 ϖ i b ) ] (45)

C ^ 2 j = 1 n i : x i I j n δ i [ 1 b + ( 1 c ) b ln ( 1 c ) s i + ln ( ϖ i ) s i ϖ i b ln ( ϖ i ) ( 1 c ) b ln ( 1 c ) ( 1 ϖ i b ) s i ( s i ( 1 ϖ i b ) ) ] (46)

C ^ 3 j = 1 n i : x i I j n δ i [ 1 c b ( 1 c ) b 1 s i ( b 1 ) ( 1 φ i ) a ϖ i + s i b ( 1 φ i ) a ϖ i b 1 b ( 1 c ) b 1 ( 1 ϖ i b ) s i ( s i ( 1 ϖ i b ) ) ] (47)

C ^ 4 j = 1 n i : x i I j n δ i [ 1 α v i β + ( a 1 ) φ i v i β ( 1 φ i ) a c ( b 1 ) φ i v i ( 1 φ i ) a 1 ϖ i + a b c φ i v i ( 1 φ i ) a 1 ϖ i b 1 s i ( 1 ϖ i b ) ] (48)

C ^ 5 j = 1 n i : x i I j n δ i [ x i θ + α ( a 1 ) β 2 ( 1 e β x i θ + β x i θ e β x i θ ) φ i 1 φ i + a c ( b 1 ) β 2 ( 1 e β x i θ + β x i θ e β x i θ ) φ i ( 1 φ i ) a 1 ϖ i + α v i β 2 α β x i θ e β x i θ a c b β 2 ( 1 e β x i θ + β x i θ e β x i θ ) φ i ( 1 φ i ) a 1 ϖ i b 1 s i ( 1 ϖ i b ) ] (49)

C ^ 6 j = 1 n i : x i I j n δ i [ 1 θ + α ( a 1 ) x i θ ln ( x i ) e β x i θ φ i 1 φ i a α c ( b 1 ) x i θ ln ( x i ) e β x i θ φ i ( 1 φ i ) a 1 ϖ i + ln ( x i ) ( 1 + β x i θ α x i θ e β x i θ ) + a α c b x i θ ln ( x i ) e β x i θ φ i ( 1 φ i ) a 1 ϖ i b 1 s i ( 1 ϖ i b ) ] (50)

And:

W ^ l = j = 1 r C ^ l J A ^ J 1 Z j , l , l = 1 , 2 , 3 , 4 , 5 , 6 ; j = 1 , , r

7.3. Estimated Matrix G ^

The estimated matrix G ^ = [ g ^ l l ] 6 × 6 is defined by:

g ^ l l = i ^ l l j = 1 r C ^ l J G ^ l J A ^ J 1

where:

i ^ l l = 1 n i = 1 n δ i ln h ( x i , γ ^ ) γ l ln h ( x i , γ ^ ) γ l , l , l = 1 , 2 , 3 , 4 , 5 , 6

Therefore the quadratic form of the test statistic can be obtained easily:

Y n 2 ( γ ^ ) = j = 1 r ( U j e j ) 2 U j + W ^ T [ i ^ l l j = 1 r C ^ l J G ^ l J A ^ J 1 ] 1 W ^ (51)

8. Simulations

8.1. Maximum Likelihood Estimation

We generated N = 10000 right censored samples with different sizes ( n = 25 , 50 , 130 , 350 , 500 ) from the GKGPG model with parameters a = 2 , b = 1 , c = 0.9 , α = 0.2 , β = 0.7 and θ = 1.5 . Using R statistical software and the Barzilai-Borwein (BB) algorithm (Ravi, [43] ), we calculate the maximum likelihood estimators of the unknown parameters and their Mean Squared Errors (MSE). The results are given in Table 1.

The maximum likelihood estimated parameter values, presented in Table 1, agree closely with the true parameter values.

Table 1. Mean simulated values of MLEs γ ^ their corresponding square mean errors.

8.2. Criteria Test Y n 2

For testing the null hypothesis H₀ that right censored data become from GKGPG model, we compute the criteria statistic Y n 2 ( γ ) as defined above for 10,000 simulated samples from the hypothezised distribution with different sizes (30, 50, 150, 350, 500). Then, we calculate empirical levels of significance, when Y 2 > χ ε 2 ( r ) , corresponding to theoretical levels of significance ( ε = 0.10 , ε = 0.05 , ε = 0.01 ), We choose r = 7 . The results are reported in Table 2.

The null hypothesis H₀ for which simulated samples are fitted by GKGPG distribution is widely validated for the different levels of significance. Therefore, the test proposed in this work, can be used to fit data from this new distribution.

9. Application

In this section, we apply the results obtained through this study to real data set from reliability (Crowder et al. [44] ), previously used by [45] [46] [47]. In an experiment to gain information on the strength of a certain type of braided cord after weathering, the strengths of 48 pieces of cord that had been weathered for a specified length of time were investigated. The observed right-censored strength-values are given below:

26.8*, 29.6*, 33.4*, 35*, 36.3, 40*, 41.7, 41.9*, 42.5*, 43.9, 49.9, 50.1, 50.8, 51.9, 52.1, 52.3, 52.3, 52.4, 52.6, 52.7, 53.1, 53.6, 53.6, 53.9, 53.9, 54.1, 54.6, 54.8, 54.8, 55.1, 55.4, 55.9, 56, 56.1, 56.5, 56.9, 57.1, 57.1, 57.3, 57.7, 57.8, 58.1, 58.9, 59, 59.1, 59.6, 60.4, 60.7

We use the statistic test provided above to verify if these data are modelled by GKGPG distribution, and at that end, we first calculate the maximum likelihood estimators of the unknown parameters:

γ = ( a , b , c , α , β , θ ) T = ( 2.5134 , 1.6384 , 0.9467 , 0.3796 , 0.5931 , 1.7649 ) T (52)

Table 2. Simulated levels of significance for Y n 2 ( γ ) test for GKGPG model against their theoretical values ( ε = 0.01 , 0.05 , 0.10 ).

Table 3. Values of p j , e j , U j , C ^ 1 j , C ^ 2 j , C ^ 3 j , C ^ 4 j , C ^ 5 j , C ^ 6 j .

Data are grouped into r = 7 intervals I j . We give the necessary calculus in Table 3.

Then we obtain the value of the statistic test Y n 2 :

Y n 2 = X 2 + Q = 5.6317 + 4.1237 = 10.7554 (53)

For significance level ε = 0.05 , the critical value χ 7 2 = 14.0671 is superior than the value of Y n 2 = 10.7554 , so we can say that the proposed model GKGPGfit these data.

10. Conclusion

This research has successfully introduced and studied a six-parameter continuous distribution called the generalized Kumaraswamy generalized power Gompertz distribution. The plots of the probability density and cumulative distribution function have been analyzed. We have also derived some properties of the new distribution such as asymptotic behavior, quantile function for median, Skewness, and Kurtosis, and reliability analysis. The distribution of order statistics estimation of parameters based on censored and uncensored random samples using Maximum Likelihood Estimation (MLE) has been provided. We evaluated the new goodness-of-fit statistic test Y n 2 and investigated some criteria tests for the generalized Kumaraswamy generalized power Gompertz distribution. A simulation study was carried out in applying the new model to datasets. The newly proposed model GKGPG adequately fits the data.

Formation of the Generalized Kumaraswamy Generalized

Defined in this paper has three shape parameters which control its Skewness, Kurtosis and tails. It can therefore be applied in more real-life situations. Maximum likelihood estimates are discussed, and modified chi-square goodness-of-fit tests for right censoring are constructed. The statistical test provided in this article can be used to fit unknown parameters and censorship into this model and its sub-models. The results and efficacy of the proposed test are shown in an important simulation study.

Conflicts of Interest

The authors declare no conflict of interest.

References

[1] Economos, A.C. (1982) Rate of Aging, Rate of Dying and the Mechanism of Mortality. Archives of Gerontology and Geriatrics, 1, 46-51.
https://doi.org/10.1016/0167-4943(82)90003-6
[2] Brown, K. and Forbes, W. (1974) A Mathematical Model of Aging Processes. Journal of Gerontology, 29, 46-51.
https://doi.org/10.1093/geronj/29.1.46
[3] Vaupel, J.W. (1986) How Change in Age-Specific Mortality Affects Life Expectancy (PDF). Population Studies, 40, 147-157.
https://doi.org/10.1080/0032472031000141896
[4] Willemse, W. and Koppelaar, H. (2000) Knowledge Elicitation of Gompertz’ Law of Mortality. Scandinavian Actuarial Journal, 2, 168-179.
https://doi.org/10.1080/034612300750066845
[5] Bemmaor, A.C. and Glady, N. (2012) Modeling Purchasing Behavior with Sudden Death: A Flexible Customer Lifetime Model. Management Science, 58, 1012-1021.
https://doi.org/10.1287/mnsc.1110.1461
[6] Chukwudike, C.N., Ugoala, Chukwuma, B., Maxwell, O., Okezie, O.U.-I., Bright, C.O. and Henry, I.U. (2020). Forecasting Monthly Prices of Gold Using Artificial Neural Network. Journal of Statistical and Econometric Method, 9, 19-28.
[7] Ohishi, K., Okamura, H. and Dohi, T. (2009) Gompertz Software Reliability Model: Estimation Algorithm and Empirical Validation. Journal of Systems and Software, 82, 535-543.
https://doi.org/10.1016/j.jss.2008.11.840
[8] Nofal, Z.M., Altun, E., Afify, A.Z. and Ahsanullah, M. (2019) The Generalized Kumaraswamy-G Family of Distribution. Journal of Statistical Theory and Application, 18, 329-342.
https://doi.org/10.2991/jsta.d.191030.001
[9] Marshall, A.M. and Olkin, I. (1997) A New Method for Adding a Parameter to a Family of Distributions with Application to the Exponential and Weibull Families. Biometrika, 84, 641-652.
https://doi.org/10.1093/biomet/84.3.641
[10] Afify, A.Z. and Alizadeh, M. (2020) The Odd Dagum Family of Distributions: Properties and Applications. Journal of Applied Probability and Statistics, 15, 45-72.
[11] Cordeiro, G.M., Ortega, E.M.M. and Ramires, T.G. (2015) A New Generalized Weibull Family of Distributions: Mathematical Properties and Applications. Journal of Statistical Distributions and Applications, 2, 13.
https://doi.org/10.1186/s40488-015-0036-6
[12] Tahir, M.H., Zubair, M., Mansoor, M., Cordeiro, G.M. and Alizadeh, M. (2016) A New Weibull-G Family of Distributions. Hacettepe Journal of Mathematics and Statistics, 45, 629-647.
https://doi.org/10.1186/s40488-014-0024-2
[13] Alizadeh, M., Cordeiro, G.M., Pinho, B.L.G. and Ghosh, I. (2017) The Gompertz-G Family of Distributions. The Journal of Statistical Theory and Practice, 11, 179-207.
https://doi.org/10.1080/15598608.2016.1267668
[14] Bantan, R.A.R., Jamal, F., Chesneau, C. and Elgarhy, M. (2020) Type II Power Topp-Leone Generated Family of Distributions with Statistical Inference and Applications. Symmetry, 12, 1-24.
https://doi.org/10.3390/sym12010075
[15] ul Hag, M.A., Elgarhy, M. and Hashmi, S. (2019) The Generalized Odd Burr III Family of Distributions: Properties, Applications and Characterizations. Journal of Taibah University for Science, 13, 961-971.
https://doi.org/10.1080/16583655.2019.1666785
[16] Cordeiro, G.M., Ortega, E.M.M. and da Cunha, D.C.C. (2013) The Exponentiated Generalized Class of Distribution. Journal of Data Science, 11, 1-27.
https://doi.org/10.6339/JDS.201301_11(1).0001
[17] Alzaatreh, A., Famoye, F. and Lee, C. (2013) Weibull-Pareto Distribution and Its Applications. Communication in Statistics—Theory and Methods, 42, 1673-1691.
https://doi.org/10.1080/03610926.2011.599002
[18] Bourguignon, M., Silva, R.B. and Cordeiro, G.M. (2014) The Weibull-G Family of Probability Distributions. Journal of Data Science, 12, 53-68.
https://doi.org/10.6339/JDS.201401_12(1).0004
[19] Torabi, H. and Montazari, N.H. (2014) The Logistic-Uniform Distribution and Its Application. Communications in Statistics—Simulation and Computation, 43, 2551-2569.
https://doi.org/10.1080/03610918.2012.737491
[20] Alzaatreh, A., Famoye, F. and Lee, C. (2014) The Gamma-Normal Distribution: Properties and Applications. Computational Statistics & Data Analysis, 69, 67-80.
https://doi.org/10.1016/j.csda.2013.07.035
[21] Cordeiro, G.M., Ortega, E.M.M., Popovic, B.V. and Pescim, R.R. (2014) The Lomax Generator of Distributions: Properties, Minification Process and Regression Model. Applied Mathematics and Computation, 247, 465-486.
https://doi.org/10.1016/j.amc.2014.09.004
[22] Alzaghal, A., Lee, C. and Famoye, F. (2013) Exponentiated T-X Family of Distributions with Some Applications. International Journal of Probability and Statistics, 2, 31-49.
https://doi.org/10.5539/ijsp.v2n3p31
[23] Alizadeh, M., Cordeiro, G.M., de Brito, E. and Demetrio, C.G.B. (2015) The Beta Marshall-Olkin Family of Distributions. Journal of Statistical Distributions and Applications, 2, 1-18.
https://doi.org/10.1186/s40488-015-0027-7
[24] Tahir, M.H., Cordeiro, G.M., Alzaatreh, A., Mansoor, M. and Zubair, M. (2016) The Logistic-X Family of Distributions and Its Applications. Communications in Statistics: Theory and Methods, 45, 7326-7349.
https://doi.org/10.1080/03610926.2014.980516
[25] Eugene, N., Lee, C. and Famoye, F. (2002) Beta-Normal Distribution and It Applications. Communications in Statistics: Theory and Methods, 31, 497-512.
https://doi.org/10.1081/STA-120003130
[26] Cakmakyapan, S. and Ozel, G. (2016) The Lindley Family of Distributions: Properties and Applications. Hacettepe Journal of Mathematics and Statistics, 46, 1-27.
https://doi.org/10.15672/HJMS.201611615850
[27] Gomes-Silva, F., Percontini, A., De Brito, E., Ramos, M.W., Venancio, R. and Cordeiro, G.M. (2017) The Odd LindleyG Family of Distributions. Austrian Journal of Statistics, 46, 65-87.
https://doi.org/10.17713/ajs.v46i1.222
[28] Shaw, W.T. and Buckley, I.R. (2007) The Alchemy of Probability Distributions: Beyond Gram-Charlier Expansions and a Skew-Kurtotic-Normal Distribution from a Rank Transmutation Map. Research Report.
[29] Zografos, K. and Balakrishnan, N. (2009) On Families of Beta- and Generalized Gamma Generated Distributions and Associated Inference. Statistical Methodology, 63, 344-362.
https://doi.org/10.1016/j.stamet.2008.12.003
[30] Cordeiro, G.M. and De Castro, M. (2011) A New Family of Generalized Distributions. Journal of Statistical Computation and Simulation, 81, 883-898.
https://doi.org/10.1080/00949650903530745
[31] Alexander, C., Cordeiro, G.M., Ortega, E.M.M. and Sarabia, J.M. (2012) Generalized Beta-Generated Distributions. Computational Statistics & Data Analysis, 56, 1880-1897.
https://doi.org/10.1016/j.csda.2011.11.015
[32] Ristic, M.M. and Balakrishnan, N. (2012) The Gamma-Exponentiated Exponential Distribution. Journal of Statistical Computation and Simulation, 82, 1191-1206.
https://doi.org/10.1080/00949655.2011.574633
[33] Torabi, H. and Montazari, N.H. (2012) The Gamma-Uniform Distribution and Its Application. Kybernetika, 48, 16-30.
[34] Amini, M., Mirmostafaee, S.M.T.K. and Ahmadi, J. (2012) Log-Gamma-Generated Families of Distributions. Stat, 12, 174-187.
[35] Ieren, T.G., Kromtit, F.M., Agbor, B.U., Eraikhuemen, I.B. and Koleoso, P.O. (2019) A Power Gompertz Distribution: Model, Properties and Application to Bladder Cancer Data. Asian Research Journal of Mathematics, 15, 1-14.
https://doi.org/10.9734/arjom/2019/v15i230146
[36] Ghitany, M.E., Al-Mutairi, D.K., Balakrishnan, N. and Al-Enezi, L.J. (2013) Power Lindley Distribution and Associated Inference. Computational Statistics & Data Analysis, 64, 20-33.
https://doi.org/10.1016/j.csda.2013.02.026
[37] Rady, E.A., Hassanein, W.A. and Elhaddad, T.A. (2016) The Power Lomax Distribution with an Application to Bladder Cancer Data. Springer Plus, 5, 1838.
https://doi.org/10.1186/s40064-016-3464-y
[38] Bagdonavicius, V. and Nikulin, M. (2011) Chi-Squared Goodness-of-Fit Test for Right Censored Data. International Journal of Applied Mathematics and Statistics, 24, 30-50.
[39] Bagdonavicius, V., Levuliene, R.J. and Nikulin, M. (2013) Chi-Squared Goodness-of-Fit Tests for Parametric Accelerated Failure Time Models. Communications in Statistics— Theory and Methods, 42, 2768-2785.
https://doi.org/10.1080/03610926.2011.617483
[40] Kenney, J.F. and Keeping, E.S. (1962) Mathematics of Statistics. 3th Edition, Chapman & Hall Ltd., London.
[41] Moors, J.J. (1988) A Quantile Alternative for Kurtosis. Journal of the Royal Statistical Society, Series D, 37, 25-32.
https://doi.org/10.2307/2348376
[42] Voinov, V., Nikulin, M. and Balakrishnan, N. (2013) Chi-Squared Goodness of Fit Tests with Applications. Academic Press, Cambridge.
[43] Ravi, V. and Gilbert, P.D. (2019) BB: An R Package for Solving a Large System of Nonlinear Equations and for Optimizing a High-Dimensional Nonlinear Objective Function. Journal of Statistical Software, 32, 7-30.
[44] Crowder, M.J., Kimber, A.C., Smith, R.L. and Sweeting, T.J. (1991) Statistical Analysis of Reliability Data. Chapman & Hall/CRC, London.
https://doi.org/10.1007/978-1-4899-2953-2
[45] Maxwell, O., Chukwudike, N.C. and Bright, O.C. (2019) Modeling Lifetime Data with the Odd Generalized Exponentiated Inverse Lomax Distribution. Biometrics & Biostatistics International Journal, 8, 39-42.
https://doi.org/10.15406/bbij.2019.08.00268
[46] Maxwell, O., Chukwu, A.U., Oyamakin, O.S. and Khaleel, M.A. (2019) The Marshall-Olkin Inverse Lomax Distribution (MO-ILD) with Application on Cancer Stem Cell. JAMCS, 33, 1-12.
https://doi.org/10.9734/jamcs/2019/v33i430186
[47] Maxwell, O., Oyamakin, S.O. and Joseph T.E. (2019) The Gompertz Length Biased Exponential Distribution and Its Application to Uncensored Data. Current Trends on Biostatistics & Biometrics, 1, 52-57.
https://doi.org/10.32474/CTBB.2019.01.000111

Copyright © 2024 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.