Flexible Reduced Logarithmic-Inverse Lomax Distribution with Application for Bladder Cancer
1. Introduction
Survival analysis is the branch of statistics that treats the time until an event of interest occurs, such as death in biological organisms or failure of mechanical structures or facilities, as a random variable. The subject is known as reliability theory or reliability analysis in engineering, duration analysis or duration modeling in economics, and event history analysis in sociology. Accordingly, a variety of distributions have been proposed as models for a wide range of real-life data. Lomax [1] proposed a model for lifetime analysis and actuarial science, known as the Lomax or Pareto Type II distribution, which is a special case of the generalized beta distribution of the second kind. This distribution has been applied widely, for example to business failure lifetime data, income and wealth inequality, city size, actuarial science, medical and biological sciences, engineering, and lifetime and reliability modeling. Because of these numerous uses, it has attracted considerable attention from theoreticians and statisticians. Hassan and Al-Ghamdi [2] developed step-stress accelerated life testing for the Lomax distribution, and Corbelini et al. [3] used it to model firm size and queuing problems. Some authors, such as Bryson [4], proposed this distribution as an alternative to the exponential distribution when the data are heavy-tailed.
The Lomax distribution with two parameters,
and
is the distribution of a random variable X whose cumulative distribution function (CDF) is given by
(1)
where,
and
are the shape and scale parameters respectively. The probability density function (PDF) is
(2)
The inverse Lomax distribution is one of the important lifetime models and is used in economics, geography, econometrics and clinical fields. Kleiber [5] used the inverse Lomax distribution to obtain Lorenz ordering relationships among order statistics. Yadav et al. [6] used this distribution for reliability estimation based on Type II censored observations. Rahman et al. [7] discussed estimation and prediction using a Bayesian approach under various loss functions. Singh and Singh [8] studied reliability estimators of the inverse Lomax distribution under Type II censoring. Reyad and Othman [9] discussed Bayesian estimation of a two-component mixture of inverse Lomax distributions based on a Type I censoring scheme.
The cumulative distribution function (CDF) of the inverse Lomax distribution (ILD) with parameters
and
is given by
(3)
and the corresponding probability density function (PDF) is
(4)
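As a rough illustration of Equations (1)-(4), the sketch below codes the Lomax and inverse Lomax distributions in Python. The formulas of this paper did not survive in this copy, so the parameterization used here is an assumption: the common forms F(y) = 1 - (1 + y/lam)^(-alpha) for the Lomax CDF and G(x) = (1 + beta/x)^(-alpha) for the inverse Lomax CDF. The function and argument names are hypothetical.

```python
import math

def lomax_cdf(y, alpha, lam):
    """Lomax CDF in the assumed form F(y) = 1 - (1 + y/lam)^(-alpha), y > 0."""
    return 1.0 - (1.0 + y / lam) ** (-alpha) if y > 0 else 0.0

def inv_lomax_cdf(x, alpha, beta):
    """Inverse Lomax CDF in the assumed form G(x) = (1 + beta/x)^(-alpha), x > 0."""
    return (1.0 + beta / x) ** (-alpha) if x > 0 else 0.0

def inv_lomax_pdf(x, alpha, beta):
    """Derivative of the assumed CDF: (alpha*beta/x^2) * (1 + beta/x)^(-alpha-1)."""
    if x <= 0:
        return 0.0
    return alpha * beta / x ** 2 * (1.0 + beta / x) ** (-alpha - 1.0)

# If Y is Lomax(alpha, lam), then X = 1/Y is inverse Lomax(alpha, beta = 1/lam),
# because P(X <= x) = P(Y >= 1/x) = 1 - F(1/x).
alpha, lam, x = 2.5, 0.5, 1.7
lhs = inv_lomax_cdf(x, alpha, 1.0 / lam)
rhs = 1.0 - lomax_cdf(1.0 / x, alpha, lam)
```

Under these assumed forms, `lhs` and `rhs` agree, which is the sense in which the second distribution is the "inverse" of the first.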
Recently, Shanker and Shukla [10] introduced a generalization of the generalized gamma distribution. The gamma-Weibull-G family of distributions was introduced by Oluyede et al. [11] with applications to real-life data. Maiti and Pramanik [12] proposed a new class of distributions called the odds XGamma-G family for modeling lifetime data. Korkmaz et al. [13] proposed a new class of lifetime distributions called the generalized odd Weibull generated family. Aslam et al. [14] proposed a new family of distributions, namely a modified T-X family, with three attractive features: flexibility, efficiency and parsimony. Applications of generalized distributions have been discussed by many researchers; the reader can refer to Gallardo et al. [15], Al-Saiary et al. [16], Bantan et al. [17], Al-Babtain et al. [18], and Al-Babtain et al. [19].
Yinglin Liu et al. [20] proposed a new family of distributions, illustrated with medical data sets, which may be called the flexible reduced logarithmic-X (FRL-X) family. The family can be obtained by reparameterizing the exponentiated Kumaraswamy G-logarithmic family and the alpha logarithmic family of distributions. The resulting distributions are quite flexible for modeling complex types of data. The motivation for the FRL-X family is to reduce the number of parameters and to relax the boundary conditions on the parameter values, so that the hazard rate function is more flexible than the classical monotone shapes. It also gives more information about the behavior of the hazard rate function in the tail, improving a description that would otherwise require adding parameters to the class of distributions. A random variable X is said to have the FRL-X distribution if its CDF is given by
(5)
where depending on the parameter
,
is the CDF of the baseline random variable and
is an additional parameter. The term in Equation (5) is also true for
. The probability density function (PDF) corresponding to Equation (5) is given by
(6)
The reliability or survival function
and failure rate or hazard rate function
, of the flexible reduced logarithmic-X (FRL-X) distribution, are given by
(7)
and
(8)
In addition to the above, the main reasons for using the FRL-X family in practice are:
1) To add parameters in a simple way in order to modify existing distributions.
2) To improve the features and flexibility of existing distributions.
3) To obtain extended versions of the baseline distribution that have closed forms for the cumulative distribution function, hazard rate function, and survival function.
4) To provide better fits than the corresponding modified models.
5) To obtain new distributions with nonmonotonic hazard rate functions.
6) To provide a better fit to unimodal medical data sets.
This paper is structured as follows. Section 2 introduces a member of the FRL-X family, called the flexible reduced logarithmic-inverse Lomax (FRL-IL) distribution, together with its probability density function, reliability or survival function, hazard rate function, reversed hazard rate function and cumulative hazard rate function. Section 3 gives statistical properties of the distribution, including the quantile function, median, mode, skewness and kurtosis, moments and moment generating function. Section 4 provides the order statistics and the distributions of the extreme values. The maximum likelihood estimation of the unknown parameters is discussed in Section 5. Finally, in Section 6, an application to real lifetime data shows the potential of the FRL-IL distribution relative to other distributions.
2. The Flexible Reduced Logarithmic-Inverse Lomax (FRL-IL) Distribution
A random variable X is said to have the flexible reduced logarithmic-inverse Lomax (FRL-IL) distribution, denoted by FRL-IL
. Let
and
be cumulative distribution function (CDF) and probability density function (PDF) of the two-parameter inverse Lomax distribution. Using
and
from Equations (3) and (4), respectively, in Equations (5) and (6), the CDF of the FRL-IL distribution is obtained as
(9)
The probability density function (PDF) corresponding to Equation (9) is given by
(10)
The survival function or reliability
, failure rate or hazard rate function
, reversed-hazard rate function
, and cumulative hazard rate function
of the flexible reduced logarithmic-inverse Lomax (FRL-IL) distribution are given by
(11)
(12)
(13)
and
(14)
respectively,
and
.
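The four functions in Equations (11)-(14) follow from any CDF F and PDF f through the standard relations S(x) = 1 - F(x), h(x) = f(x)/S(x), r(x) = f(x)/F(x) and H(x) = -log S(x). The sketch below builds them generically in Python. It is not the FRL-IL formulas themselves, which are not reproduced in this copy; the helper name `reliability_functions` is hypothetical, and a unit-rate exponential distribution is used only because its hazard rate is known to be constant.

```python
import math

def reliability_functions(cdf, pdf):
    """Return (S, h, r, H) built from a CDF and a PDF, both callables of x."""
    def survival(x):
        return 1.0 - cdf(x)

    def hazard(x):
        return pdf(x) / survival(x)

    def reversed_hazard(x):
        return pdf(x) / cdf(x)

    def cumulative_hazard(x):
        return -math.log(survival(x))

    return survival, hazard, reversed_hazard, cumulative_hazard

# Illustration with a unit-rate exponential: h(x) = 1 and H(x) = x.
exp_cdf = lambda x: 1.0 - math.exp(-x)
exp_pdf = lambda x: math.exp(-x)
S, h, r, H = reliability_functions(exp_cdf, exp_pdf)
```

Passing the CDF and PDF of Equations (9) and (10) instead of the exponential pair would give numerical versions of the curves plotted in Figures 3-6.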
Figures 1-6 show the PDF, CDF, survival function
, hazard rate function
, reversed hazard rate function
and cumulative hazard rate function
of the FRL-IL
distribution for some parameter values.
Figure 1. The PDF of the FRL-IL for different values of parameters.
Figure 2. The CDF of the FRL-IL for different values of parameters.
Figure 3. The S(x) of the FRL-IL for different values of parameters.
Figure 4. The h(x) of the FRL-IL for different values of parameters.
Figure 5. The r(x) of the FRL-IL for different values of parameters.
Figure 6. The H(x) of the FRL-IL for different values of parameters.
3. Some Statistical Properties
In this section, we give some statistical properties of the FRL-IL distribution.
3.1. Quantile Function and Median
The quantile function has a number of important applications: for example, it can be used to obtain the median, skewness and kurtosis, and to generate random variates. Suppose X is a random variable from the FRL-IL distribution with the CDF in Equation (9); the quantile function of X is given by
(15)
where
Random numbers from the FRL-IL distribution can easily be generated using
(16)
The median of the FRL-IL distribution is obtained by evaluating the quantile function in Equation (16) at probability 1/2, which gives
(17)
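The inverse-transform idea behind Equations (15)-(17) can be sketched as follows. Because the FRL-IL quantile function is not reproduced in this copy, the example uses the inverse Lomax baseline in the assumed form G(x) = (1 + beta/x)^(-alpha), whose quantile function is Q(u) = beta / (u^(-1/alpha) - 1). The function names and the parameter values are hypothetical.

```python
import random

def inv_lomax_quantile(u, alpha, beta):
    """Solve (1 + beta/x)^(-alpha) = u for x, with 0 < u < 1 (assumed baseline form)."""
    return beta / (u ** (-1.0 / alpha) - 1.0)

def sample(quantile, n, rng):
    """Inverse-transform sampling: X = Q(U) with U uniform on (0, 1)."""
    # The small offsets keep U strictly inside (0, 1), where Q is finite.
    return [quantile(rng.uniform(1e-12, 1.0 - 1e-12)) for _ in range(n)]

alpha, beta = 2.0, 3.0
Q = lambda u: inv_lomax_quantile(u, alpha, beta)

median = Q(0.5)                       # the median is the quantile at 1/2
rng = random.Random(1)                # fixed seed so the run is repeatable
draws = sorted(sample(Q, 20001, rng))
sample_median = draws[10000]          # middle value of the sorted sample
```

The same two steps (draw a uniform number, apply the quantile function) apply unchanged once the quantile function of Equation (15) is substituted for the baseline one.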
3.2. Mode of the FRL-IL Distribution
The mode of the flexible reduced logarithmic-inverse Lomax (FRL-IL) distribution is obtained by differentiating the probability density function in Equation (10) with respect to x and setting the derivative equal to zero.
The mode is therefore the solution of the following equation
(18)
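Equations of this kind rarely have a closed-form root, so in practice the mode is usually located numerically by maximizing the log-density. The sketch below does this with a golden-section search. It is illustrated on the inverse Lomax baseline in the assumed form g(x) = (alpha*beta/x^2)(1 + beta/x)^(-alpha-1), for which setting the derivative of log g to zero gives the mode beta*(alpha - 1)/2 when alpha > 1, so the numerical answer can be checked. The function names are hypothetical, and the search assumes the density is unimodal on the bracket.

```python
import math

def golden_section_max(f, lo, hi, tol=1e-10):
    """Maximise a unimodal function f on [lo, hi] by golden-section search."""
    inv_phi = (math.sqrt(5.0) - 1.0) / 2.0
    a, b = lo, hi
    c = b - inv_phi * (b - a)
    d = a + inv_phi * (b - a)
    fc, fd = f(c), f(d)
    while b - a > tol:
        if fc > fd:                 # the maximum lies in [a, d]
            b, d, fd = d, c, fc
            c = b - inv_phi * (b - a)
            fc = f(c)
        else:                       # the maximum lies in [c, b]
            a, c, fc = c, d, fd
            d = a + inv_phi * (b - a)
            fd = f(d)
    return 0.5 * (a + b)

def inv_lomax_logpdf(x, alpha, beta):
    """Log of the assumed inverse Lomax density."""
    return math.log(alpha * beta) - 2.0 * math.log(x) - (alpha + 1.0) * math.log1p(beta / x)

alpha, beta = 3.0, 2.0
mode = golden_section_max(lambda x: inv_lomax_logpdf(x, alpha, beta), 1e-6, 100.0)
closed_form = beta * (alpha - 1.0) / 2.0
```

For the FRL-IL density, the log of Equation (10) would replace `inv_lomax_logpdf`, with the bracket chosen from a plot such as Figure 1.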
3.3. Skewness and Kurtosis
One of the most common ways to measure the skewness and kurtosis of a distribution is through measures defined in terms of moments. However, moments do not always exist; this is the case for heavy-tailed distributions such as the Lomax or inverse Lomax distribution. For this reason, measures based on the quantile function offer useful alternatives, and the shortcomings of the conventional moment-based measure of kurtosis are well known. Kenney and Keeping [21] give Bowley's skewness, based on quartiles, as
$$S = \frac{Q(3/4) - 2Q(1/2) + Q(1/4)}{Q(3/4) - Q(1/4)} \qquad (19)$$
Moors [22] gave the quantile-based kurtosis as
$$K = \frac{Q(7/8) - Q(5/8) + Q(3/8) - Q(1/8)}{Q(6/8) - Q(2/8)} \qquad (20)$$
with $Q(\cdot)$ representing the quantile function.
The sign of S indicates the direction of the skewness of the distribution: $S > 0$ for right-skewed, $S = 0$ for symmetric, and $S < 0$ for left-skewed. The value of K measures the tail-heaviness of the distribution; in general, the larger the value of K, the heavier the tail of the distribution.
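Both measures need only a quantile function, so they are easy to compute. The sketch below implements Bowley's skewness and Moors' kurtosis for any quantile function Q and evaluates them for three illustrative cases: the uniform and unit exponential distributions, whose values can be checked by hand, and the inverse Lomax baseline in the assumed form Q(u) = beta / (u^(-1/alpha) - 1) with hypothetical parameter values alpha = 2 and beta = 3.

```python
import math

def bowley_skewness(Q):
    """Bowley's quartile-based skewness of a quantile function Q."""
    return (Q(0.75) - 2.0 * Q(0.5) + Q(0.25)) / (Q(0.75) - Q(0.25))

def moors_kurtosis(Q):
    """Moors' octile-based kurtosis of a quantile function Q."""
    return (Q(7 / 8) - Q(5 / 8) + Q(3 / 8) - Q(1 / 8)) / (Q(6 / 8) - Q(2 / 8))

uniform_Q = lambda u: u
exponential_Q = lambda u: -math.log(1.0 - u)
inv_lomax_Q = lambda u: 3.0 / (u ** (-1.0 / 2.0) - 1.0)   # assumed baseline form

s_uniform = bowley_skewness(uniform_Q)          # symmetric case
k_uniform = moors_kurtosis(uniform_Q)
s_exponential = bowley_skewness(exponential_Q)  # equals log(4/3) / log(3)
s_inv_lomax = bowley_skewness(inv_lomax_Q)
k_inv_lomax = moors_kurtosis(inv_lomax_Q)
```

Replacing `inv_lomax_Q` with the quantile function of Equation (15) gives the corresponding values for the FRL-IL distribution.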
3.4. Moments
Moments are important in any statistical analysis, since they can be used to study the most important features and characteristics of a distribution (such as central tendency, dispersion, skewness and kurtosis). In this subsection we derive the rth moment of the FRL-IL distribution, which is defined by
$$\mu'_r = E(X^r) = \int_0^\infty x^r f(x)\,dx \qquad (21)$$
Substituting Equation (10) into Equation (21), we get
where
where
, by using the binomial expansion, which is given by
Let
and
then
(22)
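A series expression such as Equation (22) can be checked numerically. One simple route, sketched below, uses the identity E(X^r) = integral from 0 to 1 of Q(u)^r du, where Q is the quantile function, and approximates the integral with a midpoint rule. The helper name `raw_moment` is hypothetical, and the unit exponential distribution is used for illustration only because its moments E(X) = 1 and E(X^2) = 2 are known. For heavy-tailed models the integral may diverge: for the inverse Lomax baseline in the assumed form G(x) = (1 + beta/x)^(-alpha), for example, E(X^r) is finite only for r < 1.

```python
import math

def raw_moment(Q, r, n=20000):
    """Midpoint-rule approximation of E[X^r] = integral over (0, 1) of Q(u)^r du."""
    h = 1.0 / n
    # Midpoints (i + 0.5) * h stay strictly inside (0, 1), away from the endpoints.
    return h * sum(Q((i + 0.5) * h) ** r for i in range(n))

exp_Q = lambda u: -math.log(1.0 - u)   # quantile function of the unit exponential

m1 = raw_moment(exp_Q, 1)   # close to 1
m2 = raw_moment(exp_Q, 2)   # close to 2
```

The midpoint rule converges slowly when Q(u)^r is unbounded near an endpoint, so agreement with a series formula should be judged with a tolerance rather than digit by digit.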
3.5. The Moment Generating Function
The moment generating function (MGF) of the flexible reduced logarithmic-inverse Lomax (FRL-IL) distribution is as follows
(23)
Hence, expanding $e^{tx}$ in its Taylor series yields
(24)
(25)
4. The Order Statistics
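Let $X_{(1)} \le X_{(2)} \le \dots \le X_{(n)}$ be the order statistics of a random sample of size n from a continuous distribution with cumulative distribution function (CDF) $F(x)$ and probability density function (PDF) $f(x)$. Then the PDF of the k-th order statistic $X_{(k)}$ is given by
$$f_{k:n}(x) = \frac{n!}{(k-1)!\,(n-k)!}\, f(x)\, [F(x)]^{k-1}\, [1 - F(x)]^{n-k} \qquad (26)$$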
Let X be a random variable following the FRL-IL distribution; then the density function of the k-th order statistic of the FRL-IL distribution is
(27)
If
, the PDF of the order statistic is
If
, the PDF of the order statistic is
Distribution of Maximum, Minimum and Median
Let
be independent, identically distributed random variables from FRL-IL
(28)
(29)
and
(30)
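The densities of the minimum, maximum and median are all special cases of the general order-statistic density, with k = 1, k = n and k = (n + 1)/2 for odd n. The sketch below codes the general formula for an arbitrary PDF and CDF. The helper name `order_stat_pdf` is hypothetical, and the uniform distribution on (0, 1) is used for illustration only, because there the k-th order statistic has a Beta(k, n - k + 1) density that is easy to verify by hand.

```python
import math

def order_stat_pdf(k, n, pdf, cdf):
    """Density of the k-th order statistic in a sample of size n."""
    const = math.factorial(n) / (math.factorial(k - 1) * math.factorial(n - k))

    def density(x):
        F = cdf(x)
        return const * pdf(x) * F ** (k - 1) * (1.0 - F) ** (n - k)

    return density

unif_pdf = lambda x: 1.0 if 0.0 <= x <= 1.0 else 0.0
unif_cdf = lambda x: min(max(x, 0.0), 1.0)

n = 5
f_min = order_stat_pdf(1, n, unif_pdf, unif_cdf)   # minimum, k = 1
f_med = order_stat_pdf(3, n, unif_pdf, unif_cdf)   # median, k = (n + 1) / 2
f_max = order_stat_pdf(n, n, unif_pdf, unif_cdf)   # maximum, k = n
```

Supplying the PDF and CDF of Equations (10) and (9) in place of the uniform pair gives numerical versions of the FRL-IL order-statistic densities.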
5. Parameter Estimation
There are many methods for estimating the unknown parameters of a probability distribution, but the most commonly used is the maximum likelihood method. Maximum likelihood estimators (MLEs) have desirable properties and can be used to construct confidence intervals. The asymptotic normality of these estimators, which follows from large-sample theory, can be handled either numerically or analytically. In this section, point and interval estimation of the unknown parameters of the FRL-IL distribution is derived using the maximum likelihood method based on a complete sample.
Let $x_1, x_2, \dots, x_n$ denote a random sample of complete data from the FRL-IL distribution.
The likelihood function is given by
$$L = \prod_{i=1}^{n} f(x_i) \qquad (31)$$
Substituting Equation (10) into Equation (31), we have
The corresponding log-likelihood function for the parameters
and
is
(32)
Calculating the first partial derivatives of
with respect to
and
and equating each to zero, we get the likelihood equations as
(33)
(34)
(35)
By solving the nonlinear Equations (33)-(35), MLEs can be obtained numerically for
and
.
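The numerical step can be sketched on a simpler model. The example below fits the two-parameter inverse Lomax baseline, in the assumed form g(x) = (alpha*beta/x^2)(1 + beta/x)^(-alpha-1), rather than the FRL-IL model, whose likelihood equations are not reproduced in this copy. For fixed beta, the score equation in alpha has the closed-form root n / sum(log(1 + beta/x_i)), so the problem reduces to a one-dimensional search over log(beta), done here by golden-section search on the profile log-likelihood. All names, the seed, the simulated data and the search bracket are assumptions made for illustration.

```python
import math
import random

def loglik(alpha, beta, xs):
    """Log-likelihood of the assumed inverse Lomax density."""
    s = sum(math.log1p(beta / x) for x in xs)
    return (len(xs) * math.log(alpha * beta)
            - 2.0 * sum(math.log(x) for x in xs) - (alpha + 1.0) * s)

def alpha_hat(beta, xs):
    """Closed-form root of the score equation in alpha for a fixed beta."""
    return len(xs) / sum(math.log1p(beta / x) for x in xs)

def fit(xs, lo=-10.0, hi=10.0, tol=1e-9):
    """Maximise the profile log-likelihood over t = log(beta) by golden-section search."""
    g = lambda t: loglik(alpha_hat(math.exp(t), xs), math.exp(t), xs)
    inv_phi = (math.sqrt(5.0) - 1.0) / 2.0
    a, b = lo, hi
    c, d = b - inv_phi * (b - a), a + inv_phi * (b - a)
    gc, gd = g(c), g(d)
    while b - a > tol:
        if gc > gd:                 # the maximum lies in [a, d]
            b, d, gd = d, c, gc
            c = b - inv_phi * (b - a)
            gc = g(c)
        else:                       # the maximum lies in [c, b]
            a, c, gc = c, d, gd
            d = a + inv_phi * (b - a)
            gd = g(d)
    beta = math.exp(0.5 * (a + b))
    return alpha_hat(beta, xs), beta

# Simulated data by inverse transform from the assumed baseline (alpha = 2, beta = 3).
rng = random.Random(7)
true_alpha, true_beta = 2.0, 3.0
xs = [true_beta / (rng.uniform(1e-12, 1 - 1e-12) ** (-1.0 / true_alpha) - 1.0)
      for _ in range(2000)]
a_hat, b_hat = fit(xs)
```

For the three-parameter FRL-IL model a general-purpose optimizer applied to the negative of Equation (32) would play the same role; the profile trick shown here is specific to the baseline model.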
Asymptotic Confidence Bounds
We obtain the asymptotic variances and covariances of the MLEs of
and
, by using variance-covariance matrix
(Lawless [23] ), which is defined as follows
(36)
where
(37)
(38)
(39)
(40)
(41)
and
(42)
The
intervals of
and
, can be obtained from the variance-covariance matrix in the following forms
where
is the percentile of the standard normal distribution with right-tail probability
.
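Once an estimate and its asymptotic variance (a diagonal entry of the variance-covariance matrix) are available, the interval is a one-line computation. The sketch below shows it in Python; the function name `wald_interval` and the numbers passed to it are hypothetical and are not results from this paper.

```python
import math
from statistics import NormalDist

def wald_interval(estimate, variance, level=0.95):
    """Normal-approximation interval: estimate +/- z * sqrt(variance)."""
    # z is the standard normal quantile with right-tail probability (1 - level) / 2.
    z = NormalDist().inv_cdf(1.0 - (1.0 - level) / 2.0)
    half_width = z * math.sqrt(variance)
    return estimate - half_width, estimate + half_width

# Hypothetical estimate 2.0 with asymptotic variance 0.04 (standard error 0.2).
lower, upper = wald_interval(2.0, 0.04, level=0.95)
```

For a parameter restricted to positive values, the lower limit of such an interval can fall below zero in small samples; a common remedy is to truncate it at zero or to build the interval on the log scale.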
6. Application
This section illustrates the usefulness of the FRL-IL distribution using a real data set. The data set gives the remission times (in months) of a random sample of 128 bladder cancer patients. Bladder cancer is a disease in which abnormal tissue grows in the bladder. As more cancerous cells develop, they can form a tumor and, if left undetected for some time, spread to other parts of the body (see Lee and Wang [24]). The data are:
The data has been used by Kumar et al. [25], El-Gohary et al. [26], Chandra [27], De Andrade and Zea [28] and Selim [29].
We fitted the above data set by maximum likelihood to the flexible reduced logarithmic-inverse Lomax (FRL-IL), inverse Nadarajah-Haghighi (INH), inverse Weibull (IW), inverse exponential (IE) and inverse generalized power Weibull (IGPW) distributions. The MLEs for the IGPW, INH, IW, IE and FRL-IL distributions are displayed in Table 1. The Kolmogorov-Smirnov (K-S) statistic, the negative log-likelihood (-L), the Akaike Information Criterion (AIC), the Consistent Akaike Information Criterion (CAIC), the Bayesian Information Criterion (BIC) and the Hannan-Quinn Information Criterion (HQIC) were used to compare the fitted models. The best model is the one with the lowest values of these information criteria and goodness-of-fit statistics. The numerical results in Table 2 show that the FRL-IL model provides a better fit than the other models compared. Figure 7 displays the empirical and fitted cumulative distribution functions for the FRL-IL.
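The comparison criteria can be computed from the maximized log-likelihood, the number of parameters k and the sample size n, as in the sketch below. The CAIC is coded in Bozdogan's form, k(log n + 1) - 2 log L; this is an assumption, since the paper does not state its formula and some authors use a small-sample corrected AIC under the same label. The log-likelihood value and the three data points are hypothetical and are not results from Table 2.

```python
import math

def information_criteria(loglik, k, n):
    """Model-selection criteria from a maximised log-likelihood."""
    return {
        "-L": -loglik,
        "AIC": 2 * k - 2 * loglik,
        "CAIC": k * (math.log(n) + 1) - 2 * loglik,   # Bozdogan's form (assumed)
        "BIC": k * math.log(n) - 2 * loglik,
        "HQIC": 2 * k * math.log(math.log(n)) - 2 * loglik,
    }

def ks_statistic(data, cdf):
    """Kolmogorov-Smirnov distance between the empirical CDF and a fitted CDF."""
    xs = sorted(data)
    n = len(xs)
    # The empirical CDF jumps from i/n to (i + 1)/n at the i-th sorted point.
    return max(max((i + 1) / n - cdf(x), cdf(x) - i / n) for i, x in enumerate(xs))

# Hypothetical inputs: log-likelihood -100 for a 3-parameter model and n = 128.
crit = information_criteria(loglik=-100.0, k=3, n=128)
# Toy K-S check against the uniform CDF on (0, 1).
d = ks_statistic([0.1, 0.4, 0.7], lambda x: x)
```

Smaller values indicate a better fit for every quantity returned, which is the rule applied to the fitted models.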
Figure 7 also illustrates graphically that the FRL-IL distribution provides the best fit to the data set among the considered models. Therefore, the FRL-IL model can be used as a possible alternative to well-known models such as the inverse exponential and inverse Weibull models.
Figure 7. The empirical and fitted cumulative distribution functions for the FRL-IL.
Table 1. The estimates
and
.
Table 2. The estimates of the goodness-of-fit for data
7. Conclusion
This paper presents a new three-parameter distribution, called the flexible reduced logarithmic-inverse Lomax (FRL-IL) distribution. Several statistical properties of the FRL-IL distribution are derived, including the moments, hazard rate function, quantile function and order statistics. The model parameters are estimated by the maximum likelihood method. The application to real data shows that the proposed distribution is useful for modeling lifetime data and fits better than the competing models. Figure 7 also illustrates graphically that the FRL-IL distribution provides the best fit to the data set among the considered models. Therefore, the FRL-IL model can be used as a possible alternative to well-known models such as the inverse exponential and inverse Weibull models. In future work, the generalized FRL-IL distribution will be developed and studied under progressive Type II and hybrid Type II censoring.