New Probability Distributions in Astrophysics: V. The Truncated Weibull Distribution ()
1. Introduction
The Weibull distribution was originally introduced to model the fracture strength of brittle and quasi-brittle materials, see [1] [2]. The Weibull distribution was successively applied to analysing the voltage breakdown of electric circuits [3] [4], the life table data of plants such as the Kolla paulula [5], the distribution of tree diameters [6], plant vegetative tissue [7] and the fatigue failure studies of human extensor digitorum longus [8]. The analysis of the truncated Weibull distribution has been explored in many papers, we list some in what follows. The upper truncated Weibull distribution has been analysed by [9] and applied to modeling component or system failure, and by [10] to modeling wind speed data and estimating wind power density. The lower truncated Weibull distribution has been analysed by [11]. The lower and upper truncated Weibull distribution and the evaluation of its moments have been analysed in [12] [13] [14]. A careful analysis of the above approaches allows concluding that the truncated Weibull distribution has not yet been applied to astrophysics. This paper reviews the existing Weibull distribution in Section 2, introduces the truncated Weibull distribution in Section 3, derives the Weibull luminosity function (LF) for galaxies and the connected modification due to the truncation in Section 4, and discusses, in Section 5, the application of the Weibull LF to the SDSS galaxies, to QSOs, to the photometric maximum visible in the 2MRS catalog of galaxies, and to the cosmological evolution of the average absolute magnitude with the redshift.
2. The Weibull Distribution
Let X be a random variable defined in
; the two/parameter Weibull distribution function (DF),
, is
(1)
where b and c, both positive, are the scale and the shape parameters, see [15]. The probability density function (PDF),
, is
(2)
We now introduce the function
(3)
the average value or mean,
, is
(4)
the variance,
, is
(5)
the skewness is
(6)
and the kurtosis
(7)
The rth moment about the origin for the Weibull distribution,
, is
(8)
where r is an integer and
(9)
is the gamma function, see [16]. The median is at
(10)
and the mode is at
(11)
Random generation of the Weibull variate X is given by
(12)
where R is the unit rectangular variate. The two parameters b and c can be derived by the numerical solution of the two following equations which arise from the maximum likelihood estimator (MLE)
(13a)
(13b)
where
are the elements of the experimental sample with i varying between 1 and n.
3. The Truncated Weibull Distribution
Let X be a random variable defined in
; the truncated two-parameter Weibull DF,
, is
(14)
and the PDF,
, is
(15)
see Section 2.1 in [14].
The inequality which fixes the range of existence is
. We report the indefinite integral which characterizes the average value or mean,
,
(16)
which is
(17)
where
(18)
and
(19)
where
is the Whittaker M function, see [17] [16]. The average value is therefore
(20)
for a comparison, see Equation (5) in [14]. The indefinite integral which characterizes the rth moment about the origin for the truncated Weibull distribution,
, is
(21)
which is
(22)
where
(23)
The rth moment about the origin for the truncated Weibull distribution is therefore
(24)
The variance,
, of the truncated Weibull distribution is given by
(25)
The
in the case
is at
(26)
and the
in the case
is at
(27)
which is the same value as that for the Weibull pdf. Random generation of the truncated Weibull variate X is given by
(28)
where R is the unit rectangular variate. The four parameters xl, xu, b and c can be obtained in the following way. Consider a sample
and let
denote their order statistics, so that
,
. The first two parameters
and
are
(29)
The MLE is obtained by maximizing
(30)
The two derivatives
and
generate two non-linear equations in b and c which are
(31a)
(31b)
where
(32)
and
(33)
4. The luminosity Function
This section reports the luminosity functions (LFs) for the Weibull distribution and the truncated Weibull distribution.
4.1. The Weibull LF
The Schechter function, introduced by [18], provides a useful reference for the LF of galaxies
(34)
here
sets the slope for low values of L,
is the characteristic luminosity and
is the normalization. The equivalent distribution in absolute magnitude is
(35)
where
is the characteristic magnitude as derived from the data. We now introduce the parameter h, which is H0/100, where H0 is the Hubble constant. The scaling with h is
and
. In order to derive the Weibull LF we start from the PDF as given by Equation (2),
(36)
where L is the luminosity,
is the characteristic luminosity and
is the normalization and the version in absolute magnitude is
(37)
4.2. The Truncated Weibull LF
We start with the truncated Weibull PDF with scaling as given by Equation (15)
(38)
where L is the luminosity,
is the characteristic luminosity,
is the lower boundary in luminosity,
is the upper boundary in luminosity, and
is the normalization. The magnitude version is
(39)
where M is the absolute magnitude,
the characteristic magnitude,
the lower boundary in magnitude,
the upper boundary in magnitude and
is the normalization. The mean theoretical absolute magnitude,
, can be evaluated as
(40)
5. Astrophysical Applications
This section reviews the adopted statistics, applies the truncated Weibull distribution to the initial mass function (IMF) for stars, models the LF for galaxies and QSOs, explains the photometric maximum in the number of galaxies of the 2MRS, and traces the cosmological evolution of the average absolute magnitude.
5.1. Statistics
The merit function
is computed according to the formula
(41)
where n is the number of bins,
is the theoretical value, and
is the experimental value represented by the frequencies. The theoretical frequency distribution is given by
(42)
where N is the number of elements of the sample,
is the magnitude of the size interval, and
is the PDF under examination.
A reduced merit function
is given by
(43)
where
is the number of degrees of freedom, n is the number of bins, and k is the number of parameters. The goodness of the fit can be expressed by the probability Q, see equation 15.2.12 in [19], which involves the number of degrees of freedom and
. According to [19] p. 658, the fit may be acceptable’ if
.
The Akaike information criterion (AIC), see [20], is defined by
(44)
where L is the likelihood function and k the number of free parameters in the model. We assume a Gaussian distribution for the errors. Then the likelihood function can be derived from the
statistic
where
has been computed by Equation (41), see [21], [22]. Now the AIC becomes
(45)
The Kolmogorov-Smirnov test (K-S), see [23] [24] [25], does not require binning the data. The K-S test, as implemented by the FORTRAN subroutine KSONE in [19], finds the maximum distance, D, between the theoretical and the astronomical CDF as well the significance level PKS, see formulas 14.3.5 and 14.3.9 in [19]; if
, the goodness of the fit is believable.
5.2. The IMF for Stars
We tested the truncated Weibull distribution on four samples of stars: NGC 2362 (271 stars), the young cluster NGC 6611 (207 stars), the γ Velorum cluster (237 stars), and the young cluster Berkeley 59 (420 stars), for more details, see Section 5.2 of [26]. The results are presented in Table 1 for the truncated Weibull distribution with two parameters, where the last column reports whether the results are better compared to the lognormal distribution (Y) or worse (N). Results on the lognormal distribution were reported in Table 1 in [26].
Graphical displays of the empirical PDF visualized through histograms as well as the theoretical PDF for NGC 6611 are reported in Figure 1 and those for the γ Velorum sample are reported in Figure 2.
5.3. The LF for Galaxies
A test has been performed on the
band of SDSS as in [27] with data available at https://cosmo.nyu.edu/blanton/lf.html. The Schechter function, the new Weibull LF represented by formula (37) and the data are reported in Figure 3, parameters as in Table 2.
A careful examination of Table 2 reveals that the Weibull LF has a lower
compared to the Schechter LF.
Another case is the LF for QSO in the case
, see [28] for more details. Figure 4 displays the observed LF for QSO as well the theoretical fit with the Weibull LF. The parameters and the statistical results for the Schechter LF are reported in Table 3 and those for the Weibull LF in Table 4; the Weibull LF
![]()
Table 1. Numerical values of
, AIC, probability Q, D, the maximum distance between theoretical and observed DF, and PKS, significance level, in the K-S test of the truncated Weibull distribution with two parameters for different mass distributions. The last column (LN) indicates an AIC lower (Y) or bigger (N) in respect to the lognormal distribution. The number of linear bins, n, is 20.
![]()
Figure 1. Empirical PDF of the mass distribution for NGC 6611 cluster data (blue histogram) with a superposition of the truncated Weibull PDF (red line). Theoretical parameters as in Table 1.
![]()
Figure 2. Empirical PDF of the mass distribution for γ Velorum data (blue histogram) with a superposition of the truncated Weibull PDF (red line). Theoretical parameters as in Table 1.
![]()
Table 2. Numerical values and
of the LFs applied to SDSS Galaxies in the
band.
![]()
Figure 3. The LF data of SDSS (
) are represented with error bars. The continuous line fit represents the Weibull LF (37) and the dotted line represents the Schechter function.
![]()
Figure 4. The observed LF for QSOs, empty stars with error bar, and the fit by the Weibull LF for z in
and M in
. Parameters as in Table 4.
![]()
Table 3. Parameters of the Schechter LF in the range of redshift
when
and
.
![]()
Table 4. Parameters of the Weibull LF for QSOs in the range of redshift
when
and
.
has smaller
compared to the Schechter LF.
5.4. The Photometric Maximum
In the pseudo-Euclidean universe, the correlation between the expansion velocity and distance is
(46)
where
is the Hubble constant,
, with
when h is not specified, D is the distance in Mpc,
is the speed of light and z is the redshift. In the pseudo-Euclidean universe, the flux of radiation, f, expressed in units of
, where
represents the luminosity of the sun, is
(47)
where D represents the distance of the galaxy expressed in Mpc, and
(48)
The joint distribution in z and f for a generic LF,
is
(49)
where
,
and
represent the differentials of the solid angle, the redshift, and the flux, respectively, and
(50)
where
is the characteristic luminosity, for more details, see [29]. The LF is chosen to be the Schechter function, but different LFs can be tested, for example, the Weibull LF. The joint distribution in z, f and
for galaxies for the Weibull LF, see Equation (36), is
(51)
The above number of galaxies in z and f has a maximum at
which is the solution of the following non-linear equation
(52)
A first numerical evaluation of the position in z of the above equation is reported in units of
, see the blue dashed line in Figure 5. A second analytical result can be obtained inserting for the number of galaxies a numerical value for
![]()
Figure 5. Position in units of
of the photometric maximum as a function of the shape parameter c: numerical solution (blue dashed line) and Taylor solution (red full line).
c. As an example when
, the nonlinear equation for the photometric maximum is
(53)
which has a physical solution at
(54)
A third approximate result is obtained using a Taylor expansion of Equation (52) around
of order 3, which gives
(55)
where
(56)
A graphical display of the Taylor solution is reported in Figure 5 as the red full line. Figure 6 reports the number of observed galaxies for the 2MASS Redshift Survey (2MRS) catalog at a given apparent magnitude and both the Schechter and the Weibull models for the number of galaxies as functions of the redshift. The influence on the above curve of varying
is reported in Figure 7.
The mean redshift for galaxies
is
(57)
![]()
Figure 6. The galaxies of the 2MRS with
or
are organized in frequencies versus heliocentric redshift, (empty circles); the error bar is given by the square root of the frequency. The maximum frequency of observed galaxies is at
. The full line is the theoretical curve generated by
as given by the application of the Schechter LF which is Equation (43) in [29] and the dashed line represents the Weibull LF which is Equation (51). The Weibull LF parameters are
and
,
for the Schechter LF and
for the Weibull LF.
![]()
Figure 7. The theoretical number of galaxies for the Weibull LF as function of the redshift and
; parameters as in Figure 6.
The mean redshift for the Weibull LF as a function of
when
is
(58)
or as a function of the flux
(59)
where
is the reference magnitude of the sun at the considered bandpass, or as a function of the apparent magnitude
(60)
5.5. Mean Absolute Magnitude
The absolute magnitude which can be observed as a function of the limiting apparent magnitude,
, is
(61)
where
for the 2MRS catalog.
The theoretical average absolute magnitude of the truncated Weibull LF, see Equation (40), can be compared with the observed average absolute magnitude of the 2MRS as a function of the redshift. To fit the data, we assumed the following empirical dependence on the redshift for the characteristic magnitude of the truncated Weibull LF.
![]()
Figure 8. Average absolute magnitude of the galaxies belonging to the 2MRS (green-dashed line), theoretical average absolute magnitude for the truncated Weibull LF (blue dash-dot-dash-dot line) as given by equation (42), lower theoretical curve as represented by Equation (63) (red line) and minimum absolute magnitude observed (cyan dotted line).
(62)
This relationship models the decrease of the characteristic absolute magnitude as a function of the redshift and allows us to match the observational and theoretical data. The lower bound in absolute magnitude is given by the minimum magnitude of the selected bin, the upper bound is given by Equation (61), the characteristic magnitude varies according to Equation (62) and Figure 8 reports a comparison between the theoretical and the observed absolute magnitude for the 2MRS catalog.
6. Conclusions
Truncated Weibull distribution
We derived the PDF, the DF, the average value, the rth moment, the variance, the median, the mode, an expression to generate random numbers and the way to obtain the two parameters, b and c, by the MLE for the truncated Weibull distribution.
Weibull luminosity function
We derived the Weibull LF in the standard and the truncated case: the application to both the SDSS Galaxies and to the QSOs in the range of redshift
yields a lower reduced merit function compared to Schechter LF, see Table 2 and Table 4.
Cosmological applications
The number of galaxies as functions of the redshift, the flux and the solid angle for the Weibull LF in the pseudo-Euclidean universe presents a maximum which can be compared with the observed one for the 2MRS, see Figure 6. The truncated Weibull LF produces a good fit to the average absolute magnitude of the 2MRS galaxies as a function of the redshift, see Figure 8.