The Burr 12 Distribution Family and the Maximum Entropy Principle: Power-Law Phenomena are not necessarily Nonextensive

In this paper we recall for physicists how it is possible, using the principle of maximization of the Boltzmann-Shannon entropy, to derive the Burr-Bingh-Maddala (burr12) double power law probability distribution function and its approximations (Pareto, loglogistic ..) and extension first used in econometrics. this is possible using a deformation of the power function, as this has been done in complex systems for the exponential function. We give to that distribution a deep stochastic interpretation using the theory of Weron et al. applied to thermodynamics the entropy nonextensivity can be accounted for by assuming that the asymptotic exponents are scale dependent. Therefore functions which describe phenomena presenting power-law asymptotic behaviour can be obtained without introducing exotic forms of the entropy.


Introduction
In this paper, we want to show how the BurrrXII-Singh-Maddala (BSM) [1] [2] distribution function, known also as the q-Weibull distribution can be naturally derived from the maximum entropy principle using the Boltzmann-Shannon entropy with well-defined constraints including a generalization of the definition of the moment [3]- [5] similar to the deformation of the exponential.This is what has been done in Section 5.In Sec-tion 2 and 3, we recall the properties of the BSM distribution and its applications in physics and chemistry.Section 4 is devoted to summarize the Weron stochastic interpretation of the BSM probability distribution which has allowed us in Section 7 and 8 to give a novel interpretation of the concept of nonextensivity.
This paper has been written for several reasons.The BSM distribution has been used in a variety apparently independent fields: econometrics [6] [7], actuary sciences [8] [9], hydrology [5], forestry [10], sorption theories [11] [12], fractal kinetics [13]- [20], pharmacokinetics [21], relaxation and reaction phenomena [22] [23].Two of its approximations, the Weibull and the Hill equations are widely used in materials sciences [24]- [26], medical sciences in particular cancer remission and pharmacokinetic [27]- [29], physico-chemistry in particular complex and enzymatic reactions [30]- [32], meteorology [33] adsorption [34] [35], economy [36] etc.We have quoted here some of the most recent papers using the BSM, the Weibull and Hill distributions.The BSM distribution and some of the distribution derived from it give rise asymptotically to power-laws at high or (and) small values of the variable.For some values of the parameters, they belong to the family of Lévy heavy tail functions [37].
There is a widespread belief in physics that the phenomena characterized by power laws behaviour, ubiquitous in nature, should require extensions of the Boltzmann-Shannon (BS) entropy and that the BS entropy should be restricted to the family of Gaussian and exponential laws i.e. the distributions which obey the classical central limit theorem.Among the more than 20 proposed generalized entropies, the most famous are the Reyni [38] and the Tsallis [39] [40] entropies.To avoid divergences of the moments, Reyni introduces in the expression of the entropy powers of the probability function.Tsallis has used the same procedure introducing so-called "escort probabilities".He proposes a form of entropy which has the property to be nonextensive ab-initio and derives the so-called Tsallis distribution to extend the exponential distributions in thermodynamics.
We show in details, as this has been noticed by previous authors before us, that this is not necessarily true.It is possible quite naturally to obtain power law distributions by generalizing the definition of moments or using constraints determined by the data observation and keeping the BS entropy as a starting point for the maximization procedure [4] [5] [41]- [47].M. Visser [48] claims that power laws can be obtained by applying maximum entropy ideas directly to the Shannon entropy subject only to one constraint that the logarithm of the observable quantity is specified.
The BSM cumulative distribution functions are characterized by three parameters, two form factors a and c and one scale parameter b.The power law exponents corresponding to, respectively the asymptotic behaviour of small and large values of its argument are a and μ = a/c.The Tsallis distribution, derived from the maximization of Tsallis entropy is the survival part of the cumulative BSM distribution when a is equal to one.This density function has been used in the literature to fit many experimental data.These data can be fitted with the same degree of precision using both density distributions.This has created confusion on the value of the so-called entropic index q which is different in both cases.
It must be stressed that the BSM density function includes an exponent a which is absent from the Tsallis density function.The experimental observations, which dates back to the beginning of the last century [49]- [51] show that in natural phenomena, the exponent a is rarely equal to one and that the relation of μ with the two parameters a and c is an important feature which is ignored in Tsallis formalism.The far future is not independent on the early beginning.We know it from cosmology and from all organism evolutions.
However, we will show in Section 8 that nonextensivity of the entropy may arise from the scale dependence of the characteristic exponents.
As a consequence of the previous points we will show that phenomena described by function with one or two tails asymptotic power laws have not necessarily to be obtained by the maximization of an extension of the BS entropy.This is has been known for a long time in fields outside of physics such as hydrology and econometrics.

The Burr-Singh-Madalla Distribution
The BSM cumulative density function is written as: where a and c are form factors and b is a scaling factor.
Its density function is easily obtained by differentiation ( ) It is solution of a differential equation The function ( ) ( )  where ( ) The function ( ) . One has: ( ) The differential equation describes a birth and death process modulated by a quasi hyperbolic function which can be modified to accommodate more complex situations.
The cumulative distribution function x exhibits asymptotically two power laws: one for 0 and one for x → ∞ , ( ) . It has a limited number of finite moment depending on the value of µ .
The survival part of the cumulative function is given by which has the same form as the Tsallis density function if 1 a = and 1 c q = − where q is the Tsallis entropy index.By contrast the power of the BSM density function is 1 1 . 1 q c q − − = − and the BSM density function is the so called "escort probability" in Tsallis formalism.The Lévy power law exponent µ are accordingly dif- ferent 2 1 1 and this has to be considered in interpreting experimental results.
As the mathematician Vladimir Arnold once said "Differential equations are the source of the development of modern sciences", one can consider Equation (3) as the natural starting point to study complex systems in the same way the exponential is solution of a differential equation and is the natural starting point for the description of simple systems.For instance, it has been used in epidemiology [52], the influence of the sanitary authorities to its propagation being accounted for by modifying the function g(x) accordingly.It must be reminded that the approximation g(x) = 1 is the famous Verhulst [53] equation whose discrete version is one of the paradigms in the theory of chaos [54].

Application of the BSM Distribution Function in Physics
The BSM function has been used to establish a three parameters fractal kinetic equation which has been em-ployed with some success to characterize macroscopically the sorption (ad-, chemi-, bio-) in gaseous and aqueous phase [11] [12] as well as in the theory of relaxation [22] [23] to justify the two asymptotic behaviours of the Havriliak-Negami formula [55] in the frequency range.It has been used also to show that some of the empirical isotherms (Langmuir [56], Sips-Hill [57] and Brouers-Sotolongo [34] isotherms) are well defined approximations of the BSM function and therefore enjoy the properties of statistical distributions.
The work of K. Weron and its collaborators has given a deep physical understanding of this distribution.Indeed they have derived the BSM survival function in the theory of relaxation by means of stochastic arguments linking the observed macroscopic properties to the mesooscopic and microscopic energetic and geometric organisation (fractal scaling, clustering, self-organiszation) of complex heterogeneous systems.We will deal with this interpretation in the next paragraph.

Stochastic Interpretation of the BSM Distribution
The analytical form of the BSM distribution function can be justified and the parameters ( ) , , a b c can be given physical and statistical interpretations following the stochastic analysis given by K. Weron and collaborators [22] [58] [59] in a series of papers on relaxation and reaction in complex systems The arguments are the following: to relate macroscopic data to a macroscopic model representing the system as a whole a number of averaging formal operation at the micro-and meso-scopic level have to be done.Two cases can occur 1) the system is not strongly disordered and usual averages obeying the central limit theorem can be performed.As a consequence the probability functions belong to the basin of attraction of Gaussian functions or 2) the disorder, due to geometric and energy frustrations is giving rise to self-similar and clustering structures.As a consequence the summations of local physical quantities such as relaxation or chemical rates are dominated by their extreme values a situation well known in processes like earthquakes, water river level, atmospheric catastrophe and insurance claims to mention the most common in the literature.The corresponding distributions obey generalized limit theorems and belong to the basin of attraction of stable Lévy distributions popularized by Mandelbroot in his work on fractal structures in economics.As far as we are concerned, in that case expected values cannot be defined and specific formal methods have to be devised to tame what is called "wild disorder".One essential characteristic of these distribution functions is that they exhibit scaling properties which reveals a universal behaviour independent on the microscopic details of the system.According to Weron et al., in the time domain, the relaxation or survival function can be written as: The quantities ϑ  and β  are the waiting time and the relaxation or chemical rate of a virtual macroscopic state representing the system as a whole.One has to average over random macroscopic objects which are themselves average on the mesoscopic (self-similar geometrical and dynamical clusters) and microscopic (individual reacting pairs molecules or atoms).The quantity β  is defined as the sum of individual relaxation rates accord- ing to equation where N A is a N-dependent normalizing constant.Two situations have to be considered: the expecting value of β  is finite and we are in the situation of a well-behaved disorder system or it does not exist and the sum β  is also a random variable obeying the same probability distribution as the individual i β under appropriate normalization (Lévy stable distribution).If β  does not have a mean, the function ( ) where ( ) a g λ is the one-sided Lévy stable density probability distribution and A is a normalization constant ( ) ( ) ( ) It is the Weibull survival function quite frequent in physical, chemical and biological phenomena.The pa-rameter a arises from the stable scaling properties at the micro and mesoscopic levels.When 1 a = , one has a simple exponential function and the rate is constant The BSM distribution function can be obtained by considering a more complex situation where due to complex frustrations, the number of reacting element is not fixed and is itself a random variable.In that case Equation (8) should be replaced by * 1 As argued by Weron et al. the fluctuations of N ν can be view as a birth and death process.In that case the most natural probability distribution is the negative binomial probability distribution which in the limit N → ∞ tends to the gamma distribution: Then the survival probability function of the entire system is The solution of which has been obtained by Rodriguez [60] more than forty years ago ( This is the BSM survival distribution function in the time domain.The corresponding density function is In this derivation of the BSM function, the parameter c appears to be a measure of aggregation with one or several characteristic lengths.

The BSM Density Function Derived from the Maximization of the BS Entropy
Starting from the Boltzmann-Shannon entropy We determine ( ) ; , , f x a b c by imposing three constraints ( ) And generalizing the definition of the power of a variable in the same spirit as the definition of the deformed exponential ( ) The third constraint can then be expressed as: The constraint conditions can be understood as known prior information which can be used to achieve a least biased distribution.
Using the method of Lagrange optimization method and defining ; , , d The maximization of S(x) is obtained by solving the equation: ( ) which taking account of the normalization condition 1 1 C = yields the functional form: With the normalization constant determined by the partition function The values of the j λ parameters are determined by the set of equations ( ) log , one gets finally, given the constraints ( ) γ is the Euler constant and ( ) the Bigamma function.We have therefore ( ) with This finally gives the BSM density function (Equation ( 2)).
The expression for the k-th moment is given by The Tsallis density function (a = 1) is the survival function of ( ) with the constraints ( ) These results have already been obtained with other notations in the field of econometrics and meteorology [3]- [5] [45].The same procedure has been used also for 4-parameters generalization of the BSM (GB2 foe example) in the econometrics literature.One recovers our results when the extra parameter is put to one or zero according to the type of extension.
Another tail distribution, the Cauchy distribution ( ) with the same method with the constraint ( ) ( ) The Weibull and the Pareto and Cauchy tail distribution entropies are well known and have been derived in the classical books on entropy maximization method [62].
For c = 0, one recovers the well-known results for the Weibull distribution ( ) , ; , ,0 exp For c = 1, the log-logistic-Hill-Fisk constraints ( ) ( ) ; , ; , ,1 1 Finally it must be noted that if x b  (Zipf law), the two constraints ( 18) and ( 20) reduced to only one on the logarithm and we reach the same conclusion as Visser [48].

Expressions of the Entropy
Making use of the results of the previous section, one can now write the expressions of the entropy corresponding to the various approximations.
From the general definition We get for the BSM distribution for the Weibull distribution For the log-logistic-Hill-Fisk distribution ( ) The general form of the entropy is therefore K(a, c) is a constant that we can consider as the origin of the entropy for a couple of values a and c.The previous results are therefore compatible with the extensivity of the BS entropy.If we have two subsystems 1 and 2 added to form a larger system with have: ( ) ( ) This is only true if ( ) ( ) ( ) . Otherwise the system is nonextensive.We will come back to this situation in the two next sections.

Implications in Thermodynamics
The aim of this paper is to discuss the probabilistic and stochastic foundation of the use of the BSM distribution to describe physical and chemical complex systems characterized by power-law, Levy and extreme value distributions.We will nevertheless in the last section touch the problem of the application of this formalism to thermodynamics.
In the case (a = 1, c = 0, q = 1), we have ln 1 S b = + , in the canonical version of thermodynamics the x variable can be replaced by the individual particle energy and we can obtain in the continuous limit the set of relations The corresponding canonical entropy is In the case a = 1, which is the case used in extensions of the classical thermodynamics, one has ( ) ( ) Therefore if  is constant, entropy decreases when c increases.
The usual canonical thermodynamics expression for the entropy (Equation ( 47)) can be recovered if one introduces a c-temperature assuming that c T is proportional to c  : ( ) which gives a c-depending temperature depending on the evolution of c Z (and therefore on c S ) with c.

Nonextensivity
Nonextensivity observed in systems with long range interactions can be accounted for if the parameter c (and q) are scale dependent.One can use an argument used by C. Beck [63] to obtain a quasi nonexpensive entropy from a superstatistic version of the Tsallis entropy.I quote "in other words q(r) is a strictly monotonously decreasing function of the scale, just as observed in experiments".This means that, in the BSM formalism, if the parameter c decreases with an increase of the volume; the additive property of the entropy is no longer respected since ( ) ( ) ( ) Moreover, the system is no longer in an equilibrium state since c  and the temperature c T are also c-dependent.
This interpretation of nonextensivity is in agreement with the signification of the parameter c which is related to the cluster organization (possibly multifractal) of the heterogeneous system as discussed in Section 4.

Conclusion
In conclusion, we think that physicists have to learn a lot from the progress made the last decades by mathematicians and ex-physicists in the field of statistics in econometrics.We think that many phenomena exhibiting power-law behavior are not necessarily the consequence of a nonextensivity of the entropy as this has been assumed by a number of authors, myself [64]- [66] included, evoking exotic form of the entropy.The same results can be obtained using the BSM density function and taking account of the difference in the definition of the exponents (Equation ( 6)).In our presentation nonextensivity, when it occurs, it is the consequence of the scale dependence of the characteristic exponents a and c of the distribution and is a property of systems with long range interactions and complex systems, where thermodynamic equilibrium is not achieved.
+ , one of the Levy distributions has been derived