1. Introduction
The Hamza distribution was introduced by [1] with the cumulative distribution function (cdf) and the corresponding probability density function (pdf) respectively given by
(1)
and
(2)
for
,
and
.
It may be observed that the Hamza distribution was obtained by compounding the exponential distribution having scale parameter
and the gamma distribution having shape parameter 7 and scale parameter
, with mixing proportions
and
such that
.
[1] studied the properties and applications of this distribution in the context of lifetime analysis, showing that the distribution is superior to Lindley distribution due to [2], Ishita distribution by [3] and Pranav distribution by [4], respectively.
The aim of this paper is to introduce a new distribution, called the power Hamza distribution, which is a direct generalization of the Hamza distribution. Some of the distributions proposed using the power transformation include the power Lindley due to [5], power Akash and Shanker proposed by [6] [7], power Ishita and power Aradhana due to [8] [9], power Rama and power Garima due to Abebe et al. [10] [11], power Pranav due to [12], power Sujatha by [13], power Prakaamy by [14]. From the literature reviewed in this paper, all the power transformed distributions were shown to be more flexible than their corresponding baseline distributions and more useful for analyzing complex data structures in various fields of life.
The rest of the paper is organized as follows. The pdf, cdf and hazard rate function of the new distribution is given in Section 2. Section 3 provides a comprehensive account of the properties of the distribution including the moment generating function, moments, skewness, kurtosis, mean residual lifetime, mean deviations, Bonferroni and Lorenz curves, stochastic ordering, entropy measure, stress-strength reliability, distributions and moments of order statistics. In Section 4, the maximum likelihood estimates of the parameters of the distribution are given. Also, Section 5 gives the asymptotic confidence intervals. Section 6 illustrates the proposed model in two real-datasets. The paper is concluded in Section 7.
2. The Power Hamza (PH) Distribution
The probability density function, cumulative distribution function and hazard function of the power Hamza distribution having parameters
,
and
are provided in as Propositions 1, 2 and 3.
Proposition 1. A random variable X is said to have a PH distribution if its pdf is of the form
;
,
,
,
(3)
Proof. Given the distribution of the Hamza random variable Y defined in (2). Assume that another random variable X is related to Y by the power function
. Then the distribution of X is the power Hamza distribution. To derive the distribution of X, we notice that X is a one-to-one function of Y and so,
when
and
when
, which implies that the support of the distribution of X is
.
Letting
, and
in (2), gives
,
(4)
According to [15], the probability density function of a continuous random variable
is gotten by
(5)
Substituting (4) and
into (5), one obtains
;
(6)
Further simplification of (6) yields the pdf of the power Hamza random variable X defined in (3), and the proof of Proposition 1 is complete.
Corollary 1. Let X be a power Hamza random variable, then the function defined by Equation (3) is a pdf.
Proof. We show that
and
are satisfied.
1)
2)
By setting
in the above integral and noting that
one obtains
Henceforth, a random variable X that follows the distribution in (3) is symbolized by
. The power Hamza distribution reduces to the Hamza distribution when
.
Proposition 2. For
,
,
and
, the cdf of
is given by
(7)
Proof. [15] defines the cdf of a continuous random variable X as
(8)
Substituting (3) into (8) leads to
(9)
Letting
,
,
, the integral (9) becomes
(10)
Applying direct integration to the first part of the square bracket in (10) and integration by parts to the second part the square bracket in (10) gives
(11)
and
(12)
respectively. Substituting (11) and (12) into (10), we obtain
(13)
Further simplification of (13) gives (7), which completes the proof of Proposition 2.
Proposition 3. Let
, then the hazard rate function of X is given by
(14)
Proof. The proof of Proposition 3 follows from using (4) and (7) in the relation
(15)
Figure 1 and Figure 2 demonstrate the graphs of the pdf and hazard function of the PH distribution for different values of
,
and
.
(a) (b)
Figure 1. (a) pdf plot of power hamza distribution; (b) pdf plot of power hamza distribution.
(a) (b)
Figure 2. (a) hazard plot of power hamza distribution; (b) hazard plot of power hamza distribution.
3. Properties of the Power Hamza (PH) Distribution
3.1. Moment Generating Function
Proposition 4. Let
, then the moment generating function of X is given by
(16)
Proof. The moment generating function of
is obtained as follows
(17)
Letting
,
and
, (17) reduces to
(18)
Further simplification of (18) gives (16), which completes the proof of Proposition 4.
3.2. Non-Central Moment
Proposition 5. Let
, then the rth non-moment of X is given by
(19)
Proof. The rth moment of
is obtained as follows
(20)
Putting
,
and
into (20), yields
(21)
Simplifying (21) a little bit, we get (19), hence the proof of Proposition 5.
Corollary 2. The first four non-central moments of
are
(22)
(23)
(24)
(25)
Proof. The proof of (22)-(25) follows directly from Proposition 5 by substituting
and 4 into (19).
3.3. Variance
Apart from the non-central moments, variance of a distribution is always relevant for measuring the spread from the mean. So, we provide the variance of the PH distribution in Proposition 6.
Proposition 6. Let
, then the variance of X is given by
(26)
Proof. The variance of a random variable X can be computed using the relation
(27)
The proof of (26) follows directly from substituting (22) and (23) into (27)
(28)
3.4. Coefficient of Variation and Index of Dispersion
The coefficient of variation is given by
(29)
The index of dispersion is given by
(30)
3.5. Central Moment
Proposition 7. Let
, then the central moment of X is given by
(31)
Proof. The rth moment of the
can be obtained from the relation
(32)
Using (22) and (19) in (32) gives (31) and the proof of Proposition 7 is complete.
3.6. Conditional Moment
A function that is useful in deriving the mean residual life function of a component as well as the mean deviations is the conditional moment. Given that X follows a power Hamza distribution with parameters
,
and
, then
(33)
where
(34)
Letting
in (34) leads to
(35)
where
is the complementary incomplete gamma function.
Also from (7), we get
(36)
Plugging (35) and (36) into (33), we have the conditional moment as
(37)
3.7. Mean Residual Life Function
In many life testing experiments, it is always of interest to know the additional lifetime given that a component has survived until a certain amount of time. To achieve this purpose, the mean residual life function (MRL), which refers to the expected remaining life,
, given that the item has survived up to time x, is required. It may be observed from (37), that the MRL function is derived from the conditional as follows
(38)
Putting
in (37) and substituting the result into (38), we obtain MRL function as
(39)
3.8. Mean Deviations
In statistical modelling, it is often an interest to measure the spread in a population from either the mean or the median. To achieve this, two indices, namely mean deviation about the mean
and mean deviation about the median
are used. Let
denote the mean and M, the median of a power Hamza distributed random variable X. The values of
and
can be calculated using the relationships
(40)
and
(41)
respectively. By replacing x with
and M in (7) and (35), yields the following
(42)
(43)
3.9. Bonferroni and Lorenz Curves
It has been found that the Bonferroni curve proposed by [16] and Lorenz curve proposed by [17] have applications in the fields of economics, reliability, demography, insurance, medicine, among others. So, if
, the Bonferroni and Lorenz curves are respectively given by
(44)
and
(45)
where
and
,
. Hence, for the power Hamza pdf (3), one gets
(46)
Letting
,
,
and noting that
implies
, (1) becomes
(47)
Substituting (46) and (47) into (44) and (45), we obtain the Bonferroni and Lorenz curves, respectively, for the power Hamza distribution as
(48)
(49)
3.10. Stochastic Ordering
In this section, we discuss the comparative behaviour of the power Hamza random variable using the stochastic ordering. In line with [18], a power Hamza random variable X is said to be smaller than another random variable Y in the 1) stochastic order
if
, 2) hazard rate order
if
, 3) mean residual life order
if
and 4) likelihood ratio order
if
a decreasing function of x. To show the flexibility of the power Hamza distribution, we present the following Proposition.
Proposition 8. Let
and
be two independent random variables. If 1)
,
and
; 2)
,
,
; 3)
,
,
and 4)
,
,
, then
,
,
and
.
Proof. The likelihood ratio is
(50)
The log-likelihood ratio is
(51)
Differentiating the log-likelihood with respect to x, we get
(52)
Since
for conditions 1), 2), 3) and 4), then
and hence,
,
and
, which completes the proof of Proposition 8.
3.11. Rényi Entropy
To quantify the amount of information (such as the diversity, uncertainty, or randomness) contained in a random sample drawn from a population, the entropy is utilized. It may be noted that a large value of entropy indicates that the data contains greater uncertainty. Several studies have applied entropy in the fields of physics, probability and statistics, communication theory, economics, among others. Therefore, we derive one of the commonly used entropies, namely the R
nyi entropy. For
and
, the Rényi entropy due to [19] is defined for a continuous random variable as
(53)
Using the pdf (3) in (53), we obtain
(54)
3.12. Distribution of Order Statistics for the PH Distribution
Suppose
constitutes the order statistics for a random sample
drawn from the power Hamza distribution with pdf (3) and cdf (7). Then the pdf of the rth order statistic
can be written as
(55)
Putting
into (55) gives the pdf of the first order statistic
as
(56)
Putting
into (55) gives the pdf of the nth order statistic
as
(57)
4. Maximum Likelihood Estimators of the Power Hamza Distribution
Let
denote a random sample of size n from the PH distribution having parameters
,
and
. To estimate the parameters
,
and
using the maximum likelihood method, we define the likelihood function of the random sample form the PH distribution as
(58)
Taking the natural log of (58), we obtain the log-likelihood function as
(59)
Differentiating (59) with respect to
,
and
respectively and equating the resulting derivatives to zero, one obtains
(60)
(61)
(62)
The above non-linear systems of equations are solved by numerical iteration technique and maximum likelihood estimates are obtained. Since the maximum likelihood estimates for
,
and
are not in closed form we use the large sample behaviour of maximum likelihood estimators to obtain the confidence intervals for model parameters.
5. Asymptotic Confidence Intervals of the Power Hamza Distribution
In this section, we present the asymptotic confidence intervals for the parameters of the PH distribution. Let
be the maximum likelihood estimate of
. Under the conditions that the parameters are in the interior of the parameter space, but not on the boundary, the asymptotic distribution of
is
, where
is the expected Fisher information matrix. The asymptotic behaviour of the expected information matrix can be approximated by the observed information matrix, denoted by
. The observed information matrix of the power Hamza distribution is given by
(63)
Thus,
(64)
Taking the second order derivatives of (59) with respect to
,
and
are, respectively, we obtain the entries of (63) as follows
(65)
(66)
(67)
(68)
(69)
(70)
The expectations in the Fisher information matrix can be obtained numerically. The multivariate normal distribution with mean vector
and covariance matrix
can be used to construct confidence intervals for the model parameters. The approximate
two-sided confidence intervals for
,
and
are determined by
,
and
(71)
respectively, where
is the upper
percentile of a standard normal distribution.
6. Applications
In this section, we provide an application to real data set to demonstrate the importance and flexibility of the PH distribution.
The data set is on the breaking strength of carbon fibres of 50 mm length (GPa). The data has been previously used by [20] and [21]. The data is as follows:
0.39, 0.85, 1.08, 1.25, 1.47, 1.57, 1.61, 1.61, 1.69, 1.80, 1.84, 1.87, 1.89, 2.03, 2.03, 2.05, 2.12, 2.35, 2.41, 2.43, 2.48, 2.50, 2.53, 2.55, 2.55, 2.56, 2.59, 2.67, 2.73, 2.74, 2.79, 2.81, 2.82, 2.85, 2.87, 2.88, 2.93, 2.95, 2.96, 2.97, 3.09, 3.11, 3.11, 3.15, 3.15, 3.19, 3.22, 3.22, 3.27, 3.28, 3.31, 3.31, 3.33, 3.39, 3.39, 3.56, 3.60, 3.65, 3.68, 3.70, 3.75, 4.20, 4.38, 4.42, 4.70, 4.90.
We fitted the PH distribution to the data set by using the method of maximum likelihood and the results are compared with five other competitive lifetime distributions namely,
1) Hamza distribution (HD) defined in Equation (2),
2) Weighted Weibull distribution (WWD):
,
3) Two-Parameter Weibull distribution (TPWD):
,
4) Pareto distribution (PD):
,
5) Exponential distribution (ED):
.
We used the goodness-of-fit test based on the Kolmogorov-Smirnov test due to ( [22] [23] [24] [25] ) with its corresponding p-value to verify that the data set under consideration actually follow the proposed distribution. The computational formula for this goodness-of-fit test is given by
(72)
where
is the estimated distribution function under the ordered data. Since there is more than one distribution to be compared, the distribution with the largest KS p-value will be more appropriate to fit the given sample.
We shall also determine the appropriate model from among all models compared for the real data set by considering three discrimination criteria, based on the log-likelihood function evaluated at the maximum likelihood estimates, the Akaike information criterion (AIC) due to [26] and the Bayesian information criterion (BIC) due to [27], respectively. To compute the AIC and BIC, the following formulae are used
Table 1. Parameter estimates, standard errors, log-likelihood values and goodness-of-fit measures.
(73)
(74)
where l denotes the log-likelihood function evaluated at the maximum likelihood estimates, k is the number of parameters in the statistical model and n is the sample size of the fitted data respectively. All the computations for (72)-(74) were performed using R software. Generally, for the given data-sets, we consider a distribution to be best among all competing distributions if it has smallest AIC value, the smallest BIC value, the smallest log-likelihood value and the largest p-value.
As shown in Table 1, the PH distribution has the largest KS p-value and smallest AIC, BIC and log-likelihood values as compared to other fitted distributions. Hence, the PH distribution is better than the other distributions in Table 1 for fitting the data under consideration.
7. Conclusion
This study introduced a new distribution, called the power Hamza distribution using power transformation method. The contribution of this paper has to do with addition of skewness to the Hamza distribution, which depends only on one parameter. The density function of the power Hamza distribution can take various forms depending on its shape parameter. The hazard rate function of the power Hamza distribution exhibits heavy-tailed shape, upside-down bathtub shape and J-shape, which implies that the distribution can be used for analyzing lifetime and survival time datasets. A detailed discussion of the properties of the proposed distribution has been given. Estimates of the three unknown parameters of the PH distribution are obtained using the maximum likelihood estimation method. The PH distribution was fitted to a real dataset and compared to five distributions and the results showed that the proposed distribution outperformed all of them in modelling the data under consideration.
Acknowledgements
The authors are thankful to the editor and reviewers of this article for providing very useful comments which led to the improvement of this paper.
Authors’ Contribution
All authors contributed equally in developing the article.
Funding Statement
This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors.