Fourth-Order Predictive Modelling: I. General-Purpose Closed-Form Fourth-Order Moments-Constrained MaxEnt Distribution

Abstract

This work (in two parts) will present a novel predictive modeling methodology aimed at obtaining “best-estimate results with reduced uncertainties” for the first four moments (mean values, covariance, skewness and kurtosis) of the optimally predicted distribution of model results and calibrated model parameters, by combining fourth-order experimental and computational information, including fourth (and higher) order sensitivities of computed model responses to model parameters. Underlying the construction of this fourth-order predictive modeling methodology is the “maximum entropy principle” which is initially used to obtain a novel closed-form expression of the (moments-constrained) fourth-order Maximum Entropy (MaxEnt) probability distribution constructed from the first four moments (means, covariances, skewness, kurtosis), which are assumed to be known, of an otherwise unknown distribution of a high-dimensional multivariate uncertain quantity of interest. This fourth-order MaxEnt distribution provides optimal compatibility of the available information while simultaneously ensuring minimal spurious information content, yielding an estimate of a probability density with the highest uncertainty among all densities satisfying the known moment constraints. Since this novel generic fourth-order MaxEnt distribution is of interest in its own right for applications in addition to predictive modeling, its construction is presented separately, in this first part of a two-part work. The fourth-order predictive modeling methodology that will be constructed by particularizing this generic fourth-order MaxEnt distribution will be presented in the accompanying work (Part-2).

Share and Cite:

Cacuci, D. (2023) Fourth-Order Predictive Modelling: I. General-Purpose Closed-Form Fourth-Order Moments-Constrained MaxEnt Distribution. American Journal of Computational Mathematics, 13, 413-438. doi: 10.4236/ajcm.2023.134024.

1. Introduction

Scientific progress stems from the judicious combination of experimental information with results produced by computational models, neither of which are perfectly accurate. On the one hand, computations are afflicted by errors stemming from numerical procedures, uncertain model parameters, boundary/initial conditions and/or imperfectly known physical processes. On the other hand, results of measurements are afflicted by experimental errors, which means that around any reported experimental value there always exists a range of values that may also be plausibly representative of the true but unknown value of the measured quantity. Extracting “best estimate” values for model parameters and predicted results, together with “best estimate” uncertainties for these parameters and results requires the combination of experimental and computational data and their uncertainties.

The earliest systematic activities aimed at extracting best-estimate values for model parameters were initiated in the mid-1960s [1] [2] , in the course of evaluating neutron cross sections by using time-independent reactor physics models and experiments for evaluating, computationally and experimentally, so-called “integral quantities” (also called “system responses”) such as reaction rates and multiplication factors. A decade later, these activities had reached conceptual maturity under the name of “cross section adjustment” methodology [3] [4] . This methodology employs a user-defined least-square functional for combining uncertainties in the model parameters with uncertainties in the experimental data, subject to the hard constraint represented by the reactor physics model linearized with respect to the model parameters. The resulting “adjusted” neutron cross sections (model parameters) and their “adjusted” uncertainties were subsequently employed in the respective reactor physics model to predict improved “model responses,” such as improved reaction rates and reaction rate ratios, reactor multiplication factors, Doppler coefficients. By the late 1970s, adjoint neutron fluxes were introduced and used [5] [6] to compute efficiently the first-order response sensitivities, which were used as weighting functions in the least squares adjustment procedure. It is important to note that all of these works dealt with the time-independent linear neutron transport or diffusion equation, as encountered in reactor physics and shielding, for which the corresponding adjoint equations were already known and readily available.

In the late 1980s and during the 1990s, the fundamental concepts underlying the above-mentioned “data adjustment” methodology were “rediscovered” in the geophysical sciences while developing the so-called “data assimilation” procedure, in that the concepts underlying “data assimilation” are the same as those underlying the previously developed “data adjustment” procedure. Since then, numerous works have been published on “data assimilation;” the most representative can be found cited in the books by Lewis et al. [7] , Lahoz et al. [8] , and Cacuci et al. [9] . The data assimilation procedures also minimize, in a least-squares procedure, a “user-defined quadratic functional” which is meant to represent the discrepancies between computed and measured responses. The main differences between the data adjustment and data assimilation methods are as follows: 1) the data adjustment method are aimed at time-dependent models/systems, whereas the data assimilation methods are aimed at time-dependent models/systems; 2) the data adjustment methods can adjust/calibrate simultaneously model parameters and model responses, whereas the data assimilation methods can adjust/calibrate only model responses and initial conditions.

In the late 1990s, Cacuci and Ionescu-Bujor [10] [11] have developed a predictive modeling methodology which significantly generalizes and extends the “data adjustment and/or assimilation” methods by dispensing with the need to minimize the a priori user-defined “cost functional”, and by replacing the respective least-squares minimization procedure with the “principle of maximum entropy” (MaxEnt), originally formulated by Jaynes [12] , to attain optimal compatibility of the available information, while simultaneously ensuring minimal spurious information content. This methodology has been extended by Cacuci [13] to include the predictive modeling of coupled multi-physics systems, leading to the development of the “BERRU-PM” mathematical framework [14] for obtaining best-estimate optimal results with reduced predicted uncertainties. The “BERRU-PM” (Best-Estimate Results with Reduced Uncertainties Predictive Modeling) methodology provides a probabilistic description of possible future outcomes based on all recognized errors and uncertainties, along with a quantitative indicator, constructed from sensitivity and covariance matrices, for determining the consistency (agreement or disagreement) among the a priori computational and experimental data for parameters and responses. This consistency indicator measures, in the corresponding metric, the deviations between the experimental and nominally computed responses, and can be evaluated directly from the originally given data (i.e., originally available parameters and responses, together with their original uncertainties), once the response sensitivities have become available.

The data adjustment and assimilation methodologies, as well as the BERRU-PM methodology, incorporate only 1st-order sensitivities of the model responses with respect to the model’s parameters. In generalizing the BERRU-PM methodology, this limitation has been removed by Cacuci [15] [16] by having recently conceived the “2nd-BERRU-PM” methodology, which can incorporate arbitrarily-high order sensitivities of model responses with respect to model parameters. The essential contributions of the second- and higher-order sensitivities for reducing predicted uncertainties in the model response have been illustrated in [17] [18] [19] by applying the 2nd-BERRU-PM methodology [15] [16] to the polyethylene-reflected plutonium (PERP) OECD/NEA reactor physics benchmark [20] . The response of interest for this benchmark was the leakage of neutrons through the benchmark’s outer surface, which was computed [21] using the neutron transport Boltzmann equation, involving 21,976 imprecisely known parameters, the solution of which is representative of “large-scale computations.” As detailed in [21] , the first- through fourth-order sensitivities of this benchmark’s leakage response to the benchmark’s parameters were most efficiently computed using the adjoint sensitivity analysis methodology conceived by Cacuci [22] [23] and subsequently generalized by Cacuci [24] [25] to enable the computation of arbitrarily-high order sensitivities of model responses to model parameters.

Although the 2nd-BERRU-PM methodology can incorporate arbitrarily high sensitivities, this methodology is limited to considering just second-order moments (hence the designation “2nd-”) of the experimentally measured and computed model responses. The third-order moments (skewness) and fourth-order moments (kurtosis) of the distribution of measured and computed responses cannot be considered within the framework of the 2nd-BERRU-PM methodology. Furthermore, the “output” produced by the 2nd-BERRU-PM methodology is limited to yielding optimal best-estimate values for the means and covariances (i.e., the first- and second- moments) of the best-estimate predicted distribution of responses and parameters.

On the other hand, skewness and kurtosis play an essential role in determining the asymmetries of distributions. It is therefore paramount to generalize the 2nd-BERRU-PM methodology to enable the incorporation of third- and fourth-order moments of measured and computed responses, as well as to enable the computation of skewness and kurtosis of the best-estimate predicted posterior distribution of calibrated model parameters and responses. However, such a generalization can be achieved only if the underlying moment-constrained MaxEnt distribution could be correspondingly generalized to include third- and fourth-order moments of the distributions of measured and computed model responses.

Various variants of the “moment constrained maximum entropy problem” arise in areas as diverse as solid-state physics [26] , econometrics [27] , statistical description of gas flows [28] , weather and climate prediction [29] [30] [31] . It appears that the most efficient computational algorithm currently available for solving the “multidimensional moment constrained maximum entropy problem” is the algorithm devised by Abramov [32] , which is practically capable of solving two-dimensional problems with moments of order up to 8, three-dimensional problems with moments of order up to 6, and four-dimensional problems of order up to 4. But being able to solve “four-dimensional problems of order up to 4” is woefully insufficient for realistic problems, which are characterized by multivariate quantities of interest having dimensions much larger than just “four-dimensional”. On the other hand, the unparalleled efficiency of 2nd-BERRU-PM methodology for handling large-scale systems was demonstrated by means of the OECD/NEA reactor physics benchmark [20] involving 21,976 imprecisely known parameters. This computational efficiency was enabled by the analytical expressions for the first-and second-order moments produced by the underlying posterior second-order MaxEnt distribution within the 2nd-BERRU-PM methodology. It is therefore logical to attempt to extend to fourth-order the mathematical concepts and algorithms underlying the moment-constrained MaxEnt distribution within the 2nd-BERRU-PM methodology. This extension has been accomplished as will be presented in this work.

This work is structured as follows: Section 2 presents the novel, closed-form, fourth-order moment-constrained MaxEnt probabilistic representation of the distribution of an imprecisely measured or computed multivariate quantity, the components of which will be called “responses.” Such responses are produced by computations, measurements or both. The first four moments of the unknown multivariate distribution of these response, namely their mean values, variances/ covariances, skewness, and kurtosis, are assumed to be known. Section 3 concludes this work by outlining the crucial role which the analytical expression of fourth-order moment-constrained MaxEnt distribution presented in this work will play in constructing the mathematical framework of the 4th-BERRU-PM methodology, which will be presented in the accompanying Part 2 [33] .

2. Construction of the Fourth-Order Moment-Constrained Maximum Entropy (MaxEnt) Representation of Uncertain Multivariate Quantities

The components of the imprecisely known multivariate quantity of interest will be called “system responses.” Such system responses usually stem from measurements. Also, a computed response is equivalent to a “measurement” when the details (equations, parameters, etc.) underlying the computational model are unavailable. The case when the details of the computational model are available will be analyzed in the accompanying Part 2 [33] . In order to introduce the mathematical definitions of the known information (i.e., the moments of the unknown distribution of system responses), the unknown distribution of measured model system responses will be denoted as p e ( r ) , r ( r 1 , , r T R ) , where r i denotes the “ith-system response”, i = 1 , , T R , and where TR denotes the total number of system responses of interest. The unknown distribution p e ( r ) is formally defined on a TR-dimensional real-valued domain D e T R . The letter “e” will be used either as a superscript or a superscript to indicate “experimentally obtained” (e.g., measured or computed with an inaccessible model) quantities. Matrices will be denoted using capital bold letters while vectors will be denoted using either capital or lower-case bold letters. The symbol “ ” will be used to denote “is defined as” or “is by definition equal to.” Transposition will be indicated by a dagger ( ) superscript.

In this work, it is considered that the first four moments, namely: mean values, variances/covariances, triple correlations (skewness) and quadruple correlations (kurtosis), of the unknown distribution of system responses are available. The formal mathematical definitions of the first four moments, considered to be known, of the unknown distribution p e ( r ) , are as follows:

(i) Known mean/expectation values, denoted as r i e , for the system responses r i , where i = 1 , , T R :

r i e D e r i p e ( r ) d r ; r e ( r 1 e , , r i e , , r T R e ) ; i = 1 , , T R . (1)

(ii) Known covariances, denoted as c i j e , for two system responses r i and r j , where i , j = 1 , , T R :

c i j e cov ( r i , r j ) e D e ( r i r i e ) ( r j r j e ) p e ( r ) d r ; i , j = 1 , , T R ; (2)

The covariances cov ( r i , r j ) e , i , j = 1 , , T R , of the system responses are considered to be components of the T R × T R -dimensional covariance matrix of

system responses, which will be denoted as C r r e [ cov ( r i , r j ) e ] T R × T R [ c i j e ] T R × T R .

(iii) Known triple correlations, denoted as t i j k e , for three system responses r i , r j , r k , where i , j , k = 1 , , T R :

t i j k e D e ( r i r i e ) ( r j r j e ) ( r k r k e ) p e ( r ) d r ; i , j , k = 1 , , T R ; (3)

(iv) Known quadruple correlations, denoted as q i j k l e , for four system responses r i , r j , r k , r l , where i , j , k , l = 1 , , T R :

q i j k l e D e ( r i r i e ) ( r j r j e ) ( r k r k e ) ( r l r l e ) p e ( r ) d r ; i , j , k , l = 1 , , T R . (4)

When an unknown distribution, such as p e ( r ) , needs to be reconstructed from a finite number of its known moments, the principle of maximum entropy (MaxEnt) originally formulated by Jaynes [12] provides the optimal compatibility with the available information, while simultaneously ensuring minimal spurious information content, yielding an estimate of a probability density with the highest uncertainty among all densities satisfying the known moment constraints. According to the MaxEnt principle, such a MaxEnt probability density would satisfy the “available information” provided in Equations (1)-(4), without implying any spurious information or hidden assumptions, if the following conditions are satisfied:

(a) p e ( r ) maximizes the Shannon [34] information entropy, S, which is defined below:

S = D e p e ( r ) ln [ p e ( r ) ] d r , (5)

(b) p e ( r ) satisfies the four “moments constraints” defined by Equations (1) through (4);

(c) p e ( r ) satisfies the following normalization condition:

D e p e ( r ) d r = 1. (6)

The MaxEnt distribution p e ( r ) is obtained as the solution of the variational problem H ( p e ) / p e = 0 , where the entropy (Lagrangian functional) H ( p e ) is defined as follows:

H ( p e ) = D e p e ( r ) ln [ p e ( r ) ] d r c 0 [ D r p e ( r ) d r 1 ] i = 1 T R β i D e ( r i r i e ) p e ( r ) d r 1 2 i , j = 1 T R λ i j [ D e ( r i r i e ) ( r j r j e ) p e ( r ) d r c i j e ] i , j , k = 1 T R θ i j k [ D e ( r i r i e ) ( r j r j e ) ( r k r k e ) p e ( r ) d r t i j k e ] i , j , k , l = 1 T R ω i j k l [ D e ( r i r i e ) ( r j r j e ) ( r k r k e ) ( r l r l e ) p e ( r ) d r q i j k l e ] . (7)

In Equation (7), the quantities c 0 , β i , λ i j , θ i j k , and ω i j k l denote the respective Lagrange multipliers, and the factor 1/2 has been introduced for subsequent computational convenience. Introducing the variable:

z r r e ( z 1 , , z T R ) ; r e ( r 1 e , , r T R e ) ; z i r i r i e ; (8)

and solving the equation H ( p e ) / p e = 0 yields the following expression for the resulting MaxEnt distribution p e ( z ) :

p e ( z ) = h ( z ) D r h ( z ) d z ; (9)

where:

h ( z ) exp { i = 1 T R β i z i 1 2 i , j = 1 T R λ i j z i z j i , j , k = 1 T R θ i j k z i z j z k i , j , k , l = 1 T R ω i j k l z i z j z k z l } ; (10)

Re-writing Equations (1)-(4) in terms of the variable z r r e yields the following formal expressions for the known first four moments of the MaxEnt distribution p e ( z ) :

0 = D e z i p e ( r ) d r , i = 1 , , T R ; (11)

c i j e D e z i z j p e ( r ) d r ; i , j = 1 , , T R ; (12)

t i j k e D e z i z j z k p e ( r ) d r ; i , j , k = 1 , , T R ; (13)

q i j k l e D e z i z j z k z l p e ( r ) d r ; i , j , k , l = 1 , , T R . (14)

The expression of p e ( z ) can be evaluated in closed-form up to fifth-order in the variables z i by expanding in Taylor-series the third- and fourth-order terms in the expression of h ( z ) , to obtain the following approximate expression:

p e ( z ) f ( z ) exp [ g ( z ) ] Z ( β ) , (15)

where the following definitions have been used:

g ( z ) i = 1 T R β i z i 1 2 i , j = 1 T R λ i j z i z j = β z 1 2 z Λ z ; β ( β 1 , , β T R ) ; Λ [ λ 11 · λ 1 , T R · · · λ T R , 1 · λ T R , T R ] ; (16)

f ( z ) 1 i , j , k = 1 T R θ i j k z i z j z k i , j , k , l = 1 T R ω i j k l z i z j z k z l ; (17)

Z ( β ) D r f ( z ) exp [ g ( z ) ] d z = F 0 ( β ) i , j , k = 1 T R θ i j k F 3 i j k ( β ) i , j , k , l = 1 T R ω i j k l F 4 i j k l ( β ) ; (18)

where:

F 0 ( β ) D r exp { β z 1 2 z Λ z } d z ; (19)

F 3 i j k ( β ) D r z i z j z k exp { i = 1 T R β i z i 1 2 i , j = 1 T R λ i j z i z j } d z ; (20)

F 4 i j k l ( β ) D r z i z j z k z l exp { i = 1 T R β i z i 1 2 i , j = 1 T R λ i j z i z j } d z . (21)

The closed form expression of the integral shown in Equation (19) is well known:

F 0 ( β ) D r exp { β z 1 2 z Λ z } d z = K exp ( 1 2 β Λ 1 β ) , (22)

where:

K ( 2 π ) T R / 2 D e t ( Λ ) ; (23)

It follows from the definitions provided in Equations (19)-(21) that:

F 3 i j k ( β ) = 3 F 0 ( β ) β i β j β k ; F 4 i j k l ( β ) = 4 F 0 ( β ) β i β j β k β l . (24)

Inserting the results obtained in Equations (19)-(24) into Equation (18) yields the following expression:

Z ( β ) = F 0 ( β ) + i , j , k = 1 T R θ i j k 3 F 0 ( β ) β i β j β k i , j , k , l = 1 T R ω i j k l 4 F 0 ( β ) β i β j β k β l . (25)

The explicit expressions for the quantities 3 F 0 ( β ) / β i β j β k and 4 F 0 ( β ) / β i β j β k β l are derived in Appendix A. Inserting into Equation (25) the expression obtained in Equation (19) for F 0 ( β ) together with the expressions obtained in Equations (64) and (66), in Appendix A, for the quantities 3 F 0 ( β ) / β i β j β k and 4 F 0 ( β ) / β i β j β k β l , respectively, yields the following expression for the normalization integral Z ( β ) :

Z ( β ) = K { 1 + i , j , k = 1 T R θ i j k G 3 i j k ( β ) i , j , k , l = 1 T R ω i j k l G 4 i j k l ( β ) } exp ( 1 2 β Λ 1 β ) , (26)

where the quantities G 3 i j k ( β ) and G 4 i j k l ( β ) are defined in Appendix A.

In terms of the approximate expression shown in Equation (15) for the distribution p e ( z ) , the moments defined by Equations (11)-(14) take on the following expressions:

0 = 1 Z ( β ) D e z i f ( z ) exp [ g ( z ) ] d z = 1 Z ( β ) Z ( β ) β i , i = 1 , , T R ; (27)

c i j e = 1 Z ( β ) D e z i z j f ( z ) exp [ g ( z ) ] d z = 1 Z ( β ) 2 Z ( β ) β i β j ; i , j = 1 , , T R ; (28)

t i j k e = 1 Z ( β ) D e z i z j z k f ( z ) exp [ g ( z ) ] d z = 1 Z ( β ) 3 Z ( β ) β i β j β k ; i , j , k = 1 , , T R ; (29)

q i j k l e = 1 Z ( β ) D e z i z j z k z l f ( z ) exp [ g ( z ) ] d z = 1 Z ( β ) 4 Z ( β ) β i β j β k β l ; i , j , k , l = 1 , , T R . (30)

The relation shown in Equation (27) can be expressed in terms of the normalization integral Z ( β ) as follows:

0 = 1 Z ( β ) Z ( β ) β i = β i ln [ Z ( β ) ] . (31)

It follows from Equation (26) that:

ln [ Z ( β ) ] = 1 2 β Λ 1 β + ln K + ln { 1 + i , j , k = 1 T R θ i j k G 3 i j k ( β ) i , j , k , l = 1 T R ω i j k l G 4 i j k l ( β ) } , (32)

It follows from Equations (31) and (32) that:

0 = ln [ Z ( β ) ] β m = b = 1 T R ( Λ 1 ) m b β b [ i , j , k = 1 T R θ i j k G 3 i j k ( β ) β m ] [ i , j , k , l = 1 T R ω i j k l G 4 i j k l ( β ) β m ] 1 + i , j , k = 1 T R θ i j k G 3 i j k ( β ) i , j , k , l = 1 T R ω i j k l G 4 i j k l ( β ) b = 1 T R ( Λ 1 ) m b β b [ i , j , k = 1 T R θ i j k G 3 i j k ( β ) β m ] + [ i , j , k , l = 1 T R ω i j k l G 4 i j k l ( β ) β m ] + O ( θ i j k 2 ) . (33)

The last approximate equality on the right-side of Equation (33) was obtained by expanding the respective denominator in a power series and neglecting the sixth- and higher-order terms.

It follows from the expressions obtained in Equations (68) and (71) of Appendix A that:

G 3 i j k ( β ) β m = O ( Λ 2 ) ; G 4 i j k l ( β ) β m = O ( Λ 3 β ) . (34)

It follows from Equations (33) and (34) that if the third-order correlations among the measured system responses are neglected by setting θ i j k = 0 , i , j , k = 1 , , T R , then the solution of these equations becomes:

β = 0 , if θ i j k = 0 , for i , j , k = 1 , , T R . (35)

The solution of Equation (33) cannot be obtained analytically in closed form if the third-order correlations among the measured system responses are not neglected.

Taking the second derivative of β i ln [ Z ( β ) ] with respect to

Differentiating the relation obtained in Equation (31) with respect to another Lagrange multiplier, β n , n = 1 , , T R , yields the following relation:

2 ln [ Z ( β ) ] β n β m = β n [ 1 Z ( β ) Z ( β ) β m ] = 1 Z 2 ( β ) Z ( β ) β n Z ( β ) β m 1 Z ( β ) 2 Z ( β ) β n β m = 1 Z ( β ) D e z n z m f ( z ) exp [ g ( z ) ] d z = c m n e ; m , n = 1 , , T R . (36)

The last equality on the right-side of Equation (36) has been obtained by using Equations (31) and (35).

On the other hand, differentiating the relation shown in Equation (33) with respect to β n , n = 1 , , T R , yields the following result:

2 ln [ Z ( β ) ] β n β m ( Λ 1 ) m n i , j , k = 1 T R θ i j k 2 G 3 i j k ( β ) β n β m + i , j , k , l = 1 T R ω i j k l 2 G 4 i j k l ( β ) β n β m ( Λ 1 ) m n . (37)

The result obtained on the rightmost-side of Equation (37) follows by evaluating all terms at β = 0 (which implies that third-order correlations were neglected) and by also neglecting the fourth-order correlations by setting ω i j k l = 0 . The results obtained in Equations (37) and (36) indicate that

( Λ 1 ) m n = c m n e ; Λ 1 = C e ; C e ( c m n e ) T R × T R ; m , n = 1 , , T R , (38)

where C e ( c m n e ) T R × T R denotes the known T R × T R -dimensional covariance matrix of the system responses. It follows from Equation (38) that:

K = ( 2 π ) T R / 2 D e t ( C e ) . (39)

In view of Equations (34) and (38), neglecting the third- and fourth-order system response correlations while obtaining the expressions shown in Equations (35) and (38) for the Lagrange multipliers (i.e., β = 0 and Λ 1 = C e , respectively) is equivalent to neglecting double- and triple products of measured-system response correlations.

Differentiating the relation shown in Equation (37) with respect to another Lagrange multiplier, β μ , μ = 1 , , T R , yields the following result:

3 ln [ Z ( β ) ] β μ β n β m = i , j , k = 1 T R θ i j k 3 G 3 i j k ( β ) β μ β n β m + i , j , k , l = 1 T R ω i j k l 3 G 4 i j k l ( β ) β μ β n β m . (40)

On the other hand, differentiating the relation shown in Equation (36) with respect to β μ and evaluating the resulting expression by using the results obtained in Equations (35) and (38) for the Lagrange multipliers β = 0 and Λ 1 = C e yields the following result:

{ 3 ln [ Z ( β ) ] β μ β n β m } β = 0 = { β μ [ 1 Z 2 ( β ) Z ( β ) β n Z ( β ) β m 1 Z ( β ) 2 Z ( β ) β n β m ] } β = 0 = { 2 Z 3 ( β ) Z ( β ) β μ Z ( β ) β n Z ( β ) β m + 1 Z 2 ( β ) [ 2 Z ( β ) β μ β n Z ( β ) β m + Z ( β ) β n 2 Z ( β ) β μ β m ] } β = 0 + { 1 Z 2 ( β ) Z ( β ) β μ 2 Z ( β ) β n β m 1 Z ( β ) 3 Z ( β ) β μ β n β m } β = 0 = { 1 Z ( β ) 3 Z ( β ) β μ β n β m } β = 0 = { 1 Z ( β ) D e z μ z n z m f ( z ) exp [ g ( z ) ] d z } β = 0 = t μ n m e ; μ , n , m = 1 , , T R . (41)

The last equality shown on the right-side of Equation (41) follows from the relation provided in Equation (29).

Evaluating the expression obtained in Equation (40) by using Equations (70) and (79) from Appendix B, in conjunction with the results obtained in Equations (35) and (38) for the Lagrange multipliers β = 0 and Λ 1 = C e yields the following relation for determining the Lagrange multipliers θ i j k :

i , j , k = 1 T R θ i j k { c i m e [ c j n e c k μ e + c j μ e c k n e ] + c j m e [ c i n e c k μ e + c i μ e c k n e ] + c k m e [ c i n e c j μ e + c i μ e c j n e ] } = t μ n m e ; μ , n , m = 1 , , T R . (42)

Evidently, Equation (42) could be solved numerically, but cannot be solved analytically to obtain a closed-form solution in the general case. Furthermore, the triple cross-correlations t μ n m e , μ , n , m = 1 , , T R , are seldom available in practice. Usually, only the third-order self-correlations t a e t a a a e , for each system response r a e , a = 1 , , T R , are available in practice for the measured system responses. In this case, Equation (42) will reduce to the following form:

b = 1 T R τ a b θ b = t a e / 6 ; τ a b ( c a b e ) 3 = ( c b a e ) 3 ; θ b θ b b b ; a , b = 1 , , T R . (43)

Equation (43) can be solved explicitly by inverting the matrix T ( τ a b ) to obtain:

θ = T 1 t e / 6 ; θ ( θ 1 , , θ T R ) ; t e ( t 1 e , , t T R e ) ; T ( τ a b ) T R × T R . (44)

In particular, if only the system response variances are available but the second-order correlations among the system responses are negligible or unavailable, then the result obtained in Equation (44) reduces to the following simple expression for determining the Lagrange multipliers θ k , k = 1 , , T R :

θ k = t k e 6 ( c k k e ) 3 ; k = 1 , , T R . (45)

Differentiating the relation shown in Equation (40) with respect to another Lagrange multiplier, β ν , ν = 1 , , T R , yields the following result:

4 ln [ Z ( β ) ] β ν β μ β n β m = i , j , k , l = 1 T R ω i j k l 4 G 4 i j k l ( β ) β ν β μ β n β m ; ν , μ , n , m = 1 , , T R . (46)

The expression of 4 G 4 i j k l ( β ) / β ν β μ β n β m is known, as provided in Equation (84) in Appendix A.

On the other hand, differentiating the relation shown in Equation (41) with respect to β ν and evaluating the resulting expression by using the results obtained in Equations (35) and (38) for the Lagrange multipliers β = 0 and Λ 1 = C e yields the following expression:

4 ln [ Z ( β ) ] β ν β μ β n β m = β ν { 2 Z 3 ( β ) Z ( β ) β μ Z ( β ) β n Z ( β ) β m } + β ν { 1 Z 2 ( β ) [ 2 Z ( β ) β μ β n Z ( β ) β m + Z ( β ) β n 2 Z ( β ) β μ β m ] } + β ν { 1 Z 2 ( β ) Z ( β ) β μ 2 Z ( β ) β n β m 1 Z ( β ) 3 Z ( β ) β μ β n β m } . (47)

Performing the above differentiations and evaluating the resulting expression by using the results obtained in Equations (35) and (38) for the Lagrange multipliers β = 0 and Λ 1 = C e yields the following relation:

{ 4 ln [ Z ( β ) ] β ν β μ β n β m } β = 0 = { 1 Z ( β ) 4 Z ( β ) β ν β μ β n β m } β = 0 + { 1 Z 2 ( β ) [ 2 Z ( β ) β μ β n 2 Z ( β ) β ν β m + 2 Z ( β ) β ν β n 2 Z ( β ) β μ β m + 2 Z ( β ) β ν β μ 2 Z ( β ) β n β m ] } β = 0 . (48)

Using the expressions provided in Equations (28) and (30) to replace the expressions on the right-side of Equation (48) yields the following result:

{ 4 ln [ Z ( β ) ] β ν β μ β n β m } β = 0 = q ν μ n m e + c μ n e c ν m e + c ν n e c μ m e + c ν μ e c n m e . (49)

It follows from the results obtained in Equations (49) and (46) that the following relation holds for ν , μ , n , m = 1 , , T R :

q ν μ n m e + c μ n e c ν m e + c ν n e c μ m e + c ν μ e c n m e = i , j , k , l = 1 T R ω i j k l { 4 G 4 i j k l ( β ) β ν β μ β n β m } β = 0 , Λ 1 = C e . (50)

The Lagrange multipliers ω i j k l for the fourth-order correlations of the measured system responses are to be obtained as the solution of Equation (50). This equation could be solved numerically but cannot be solved analytically to obtain closed-form solutions for these Lagrange multipliers. In addition, the general fourth-order correlations q ν μ n m e are not expected to be available in practice; at most, the fourth-order self-correlations q a e q a a a e , for each system response r a e , a = 1 , , T R , might be available in practice for the measured system responses. In this case, Equation (50) will reduce to the following form for the quantities ω b ω b b b b and G 4 b G 4 b b b b :

q a e + 3 ( c a a e ) 2 = b = 1 T R ω b { 4 G 4 b ( β ) ( β a ) 4 } β = 0 , Λ 1 = C e . (51)

Using the result presented in Equation (84) in Appendix B yields the following expression for the quantity 4 G 4 b ( β ) / ( β a ) 4 , for β = 0 and Λ 1 = C e :

{ 4 G 4 b ( β ) ( β a ) 4 } β = 0 , Λ 1 = C e = { c l m e c i n e ( c j μ e c k ν e + c k μ e c j ν e ) + c l m e c j n e ( c i μ e c k ν e + c k μ e c i ν e ) + c l m e c k n e ( c i μ e c j ν e + c j μ e c i ν e ) + c i m e c l n e ( c j μ e c k ν e + c k μ e c j ν e ) + c i m e c j n e ( c l μ e c k ν e + c k μ e c l ν e ) + c i m e c k n e ( c l μ e c j ν e + c j μ e c l ν e ) + c j m e c l n e ( c i μ e c k ν e + c k μ e c i ν e ) + c j m e c i n e ( c l μ e c k ν e + c k μ e c l ν e ) + c j m e c k n e ( c l μ e c i ν e + c i μ e c l ν e ) + c k m e c l n e ( c i μ e c j ν e + c j μ e c i ν e ) + c k m e c i n e ( c l μ e c j ν e + c j μ e c l ν e ) + c k m e c j n e ( c l μ e c i ν e + c i μ e c l ν e ) } m = n = μ = ν = a i = j = k = l = b = 24 ( c b a e ) 4 . (52)

Inserting the result obtained in Equation (52) into the right-side of Equation (51) yields the following vector-matrix equation for determining the Lagrange multipliers ω b ω b b b b , b = 1 , , T R :

b = 1 T R p a b ω b = [ q a e + 3 ( c a a e ) 2 ] / 24 ; p a b ( c b a e ) 4 = ( c a b e ) 4 ; a = 1 , , T R . (53)

The solution ω ( ω 1 , , ω T R ) of Equation (53) is obtained in the following form:

ω = P 1 s / 24 ; P ( p a b ) T R × T R ; s ( s 1 , , s T R ) ; s a 3 ( c a a e ) 2 q a e ; a = 1 , , T R . (54)

In particular, if the (second-order) correlations among the measured system responses are negligible or unavailable, then the result obtained in Equation (54) reduces to the following simple expression for determining the Lagrange multipliers ω k , k = 1 , , T R :

ω k = 3 ( c k k e ) 2 q k e 24 ( c k k e ) 4 ; k = 1 , , T R . (55)

The expressions of the Lagrange multipliers obtained in Equations (35), (38), (44) and (54) are not exact, but have been obtained in closed forms, so they can be used in Equations (9) and (10) to obtain the correspondingly approximate MaxEnt distribution, which will be denoted as p e M E ( r ) and which has the following expression:

p e M E ( z ) = h e M E ( z ) D r h e M E ( z ) d z ; h e M E ( z ) exp { 1 2 z Λ z i = 1 T R θ i z i 3 i = 1 T R ω i z i 4 } . (56)

where the Lagrange multipliers Λ , θ i and ω i , i = 1 , , T R , have the expressions obtained in Equations (38), (44) and (54), respectively. The approximations incurred during the course of obtaining the expression shown in Equation (56) for the fourth-order MaxEnt distribution p e M E ( r ) of experimentally measured system responses are summarized below:

1) When obtaining the result β = 0 in Equation (35) for the Lagrange multipliers for the expected/mean values of the system responses, the third-order correlations among the system responses were neglected. This approximation is confirmed, as shown in Appendix B, namely:

r k M E D e p e M E ( r ) d r = r k e , if t i j k e = 0 ; i , j , k = 1 , , T R . (57)

2) When obtaining the result Λ 1 = C e in Equation (38) for the Lagrange multipliers for the covariances of the system responses, the third- and fourth-order correlations among the system responses were neglected. This approximation is confirmed by using the expression of the fourth-order MaxEnt distribution p e M E ( r ) given in Equation (56), the result shown in Equation (57) and the definition of the system responses’ covariances, as shown in Appendix C, to obtain the following expression:

c i j M E D e ( r i r i e ) ( r j r j e ) p e e ( r ) d r = c i j e ; i , j = 1 , , T R ; if t i j k e = 0 ; q i j k e = 0 ; i , j , k , l = 1 , , T R . (58)

3) When obtaining the result θ = T 1 t e / ( 6 K ) in Equation (44) for the Lagrange multipliers for the triple-correlations of the system responses, only the self-triple-correlations among the system responses were considered. This approximation is confirmed, as shown in Appendix B, by using:

i) the expression of the approximate fourth-order MaxEnt distribution p e M E ( r ) given in Equation (56);

ii) the result shown in Equation (57); and

iii) the definition of the self-correlations of third-order for the system responses.

Thus, as shown in Appendix B, the following expression is obtained for the triple-correlations t i j k e ; i , j , k = 1 , , T R :

t i j k M E D e ( r i r i e ) ( r j r j e ) ( r k r k e ) p e M E ( r ) d r = t i j k e ; i , j , k = 1 , , T R . (59)

4) When obtaining the result ω = P 1 s / ( 24 K ) in Equation (54) for the Lagrange multipliers for the quadruple-correlations of the measured system responses, only the self-correlations of fourth order among the measured system responses were considered. This approximation is confirmed by using the expression of the approximate fourth-order MaxEnt distribution p e M E ( r ) given in Equation (56), the result shown in Equation (57), and the definition of the self-correlations of third-order for the measured system responses to obtain, as shown in Appendix B, the following expression for the quadruple-correlations q i j k l e , i , j , k , l = 1 , , T R :

q i j k l M E D e ( r i r i e ) ( r j r j e ) ( r k r k e ) ( r l r l e ) p e M E ( r ) d r = q i j k l e . (60)

In summary, the results obtained in Equations (57)-(60) indicate that the known/given means, covariances, triple and quadruple correlations for the system responses can all be used as values for the corresponding moments of the approximate fourth-order MaxEnt distribution p e M E ( r ) given in Equation (56), since the approximations incurred when using this correspondence are at least of 2-orders higher than for the moment in question, namely:

1) The first-order moments of p e M E ( r ) have third-order errors by comparison to the measured mean values.

2) The second-order moments of p e M E ( r ) have fourth-order errors by comparison to the measured mean values.

3) The third-order moments of p e M E ( r ) have fifth-order errors by comparison to the measured mean values.

4) The fourth-order moments of p e M E ( r ) have sixth-order errors by comparison to the measured mean values.

5) Notably, if the triple- and the quadruple correlations are negligeable (or unavailable) then the MaxEnt distribution p e M E ( r ) reduces to a multivariate Gaussian with mean r e and covariance matrix C e .

3. Discussion and Conclusions

This work has presented a novel closed-form expression for the fourth-order moments-constrained Maximum Entropy (MaxEnt) probability distribution, which was constructed from the known first four moments (means, covariances, skewness, kurtosis) of an otherwise unknown distribution of a high-dimensional multivariate uncertain quantity of interest, which was called a “system response”. This fourth-order MaxEnt distribution provides optimal compatibility of the available information while simultaneously ensuring minimal spurious information content, yielding an estimate of a probability density with the highest uncertainty among all densities satisfying the known moment constraints. This novel generic fourth-order MaxEnt distribution is of interest in its own right for applications in many areas, including solid-state physics, econometrics, statistical description of gas flows, weather and climate prediction. In the accompanying work [33] , this novel generic fourth-order MaxEnt distribution will be used to construct a novel fourth-order predictive modeling methodology aimed at obtaining “best-estimate results with reduced uncertainties” for the first four moments (mean values, covariance, skewness and kurtosis) of the optimally predicted distribution of model results and calibrated model parameters, by combining fourth-order experimental and computational information, including fourth (and higher) order sensitivities of computed model system responses to model parameters.

Appendix A: Auxiliary Computations for Constructing the Moment-Constrained Fourth-Order MaxEnt Distribution

Recall that the following closed-form expression of F 0 ( β ) was provided in Equation (19): F 0 ( β ) = K exp ( β Λ 1 β / 2 ) . Differentiating this expression with respect to a Lagrange multiplier β i yields the following relation:

F 0 ( β ) β i = K [ a = 1 T R ( Λ 1 ) i a β a ] exp ( 1 2 β Λ 1 β ) . (61)

Differentiating the expressions in Equation (61) with respect to a Lagrange multiplier β j yields the following relations:

2 F 0 ( β ) β i β j = K β j { [ a = 1 T R ( Λ 1 ) i a β a ] [ exp ( 1 2 β Λ 1 β ) ] } = K G 2 i j ( β ) exp ( 1 2 β Λ 1 β ) ; (62)

where:

G 2 i j ( β ) ( Λ 1 ) i j + [ a = 1 T R ( Λ 1 ) i a β a ] [ b = 1 T R ( Λ 1 ) j b β b ] ; (63)

Differentiating the expressions in Equation (62) with respect to a Lagrange multiplier β k yields the following relations:

3 F 0 ( β ) β i β j β k = K exp ( 1 2 β Λ 1 β ) β k { ( Λ 1 ) i j + [ a = 1 T R ( Λ 1 ) i a β a ] [ b = 1 T R ( Λ 1 ) j b β b ] } + K { ( Λ 1 ) i j + [ a = 1 T R ( Λ 1 ) i a β a ] [ b = 1 T R ( Λ 1 ) j b β b ] } β k [ exp ( 1 2 β Λ 1 β ) ] = K { ( Λ 1 ) i k [ b = 1 T R ( Λ 1 ) j b β b ] + ( Λ 1 ) j k [ a = 1 T R ( Λ 1 ) i a β a ] } exp ( 1 2 β Λ 1 β ) + K { ( Λ 1 ) i j + [ a = 1 T R ( Λ 1 ) i a β a ] [ b = 1 T R ( Λ 1 ) j b β b ] } × [ c = 1 T R ( Λ 1 ) k c β c ] exp ( 1 2 β Λ 1 β ) = K G 3 i j k ( β ) exp ( 1 2 β Λ 1 β ) ; (64)

where:

G 3 i j k ( β ) ( Λ 1 ) i k [ b = 1 T R ( Λ 1 ) j b β b ] + ( Λ 1 ) j k [ a = 1 T R ( Λ 1 ) i a β a ] + ( Λ 1 ) i j [ c = 1 T R ( Λ 1 ) k c β c ] + [ a = 1 T R ( Λ 1 ) i a β a ] [ b = 1 T R ( Λ 1 ) j b β b ] [ c = 1 T R ( Λ 1 ) k c β c ] ; (65)

Differentiating the expressions in Equation (64) with respect to a Lagrange multiplier β l yields the following relations:

4 F 0 ( β ) β i β j β k β l = K { ( Λ 1 ) i j [ c = 1 T R ( Λ 1 ) k c β c ] + ( Λ 1 ) i k [ b = 1 T R ( Λ 1 ) j b β b ] + ( Λ 1 ) j k [ a = 1 T R ( Λ 1 ) i a β a ] + [ a = 1 T R ( Λ 1 ) i a β a ] [ b = 1 T R ( Λ 1 ) j b β b ]

× [ c = 1 T R ( Λ 1 ) k c β c ] } β l [ exp ( 1 2 β Λ 1 β ) ] + K exp ( 1 2 β Λ 1 β ) × β l { ( Λ 1 ) i j [ c = 1 T R ( Λ 1 ) k c β c ] + ( Λ 1 ) i k [ b = 1 T R ( Λ 1 ) j b β b ] + ( Λ 1 ) j k [ a = 1 T R ( Λ 1 ) i a β a ] + [ a = 1 T R ( Λ 1 ) i a β a ] [ b = 1 T R ( Λ 1 ) j b β b ] [ c = 1 T R ( Λ 1 ) k c β c ] }

= K { ( Λ 1 ) i j [ c = 1 T R ( Λ 1 ) k c β c ] + ( Λ 1 ) i k [ b = 1 T R ( Λ 1 ) j b β b ] + ( Λ 1 ) j k [ a = 1 T R ( Λ 1 ) i a β a ] + [ a = 1 T R ( Λ 1 ) i a β a ] [ b = 1 T R ( Λ 1 ) j b β b ] [ c = 1 T R ( Λ 1 ) k c β c ] } [ d = 1 T R ( Λ 1 ) l d β d ] exp ( 1 2 β Λ 1 β ) + K exp ( 1 2 β Λ 1 β ) { ( Λ 1 ) i j ( Λ 1 ) k l + ( Λ 1 ) i k ( Λ 1 ) j l + ( Λ 1 ) i l ( Λ 1 ) j k + ( Λ 1 ) i l [ b = 1 T R ( Λ 1 ) j b β b ] [ c = 1 T R ( Λ 1 ) k c β c ] + ( Λ 1 ) j l [ a = 1 T R ( Λ 1 ) i a β a ] [ c = 1 T R ( Λ 1 ) k c β c ] + ( Λ 1 ) k l [ a = 1 T R ( Λ 1 ) i a β a ] [ b = 1 T R ( Λ 1 ) j b β b ] } = K G 4 i j k l ( β ) exp ( 1 2 β Λ 1 β ) , (66)

where:

G 4 i j k l ( β ) [ d = 1 T R ( Λ 1 ) l d β d ] { ( Λ 1 ) i j [ c = 1 T R ( Λ 1 ) k c β c ] + ( Λ 1 ) i k [ b = 1 T R ( Λ 1 ) j b β b ] + ( Λ 1 ) j k [ a = 1 T R ( Λ 1 ) i a β a ] } + [ d = 1 T R ( Λ 1 ) l d β d ] [ a = 1 T R ( Λ 1 ) i a β a ] [ b = 1 T R ( Λ 1 ) j b β b ] [ c = 1 T R ( Λ 1 ) k c β c ] + ( Λ 1 ) i j ( Λ 1 ) k l + ( Λ 1 ) i k ( Λ 1 ) j l + ( Λ 1 ) i l ( Λ 1 ) j k + ( Λ 1 ) i l [ b = 1 T R ( Λ 1 ) j b β b ] [ c = 1 T R ( Λ 1 ) k c β c ] + ( Λ 1 ) j l [ a = 1 T R ( Λ 1 ) i a β a ] × [ c = 1 T R ( Λ 1 ) k c β c ] + ( Λ 1 ) k l [ a = 1 T R ( Λ 1 ) i a β a ] [ b = 1 T R ( Λ 1 ) j b β b ] . (67)

The various derivatives of the functions G 3 i j k ( β ) and G 4 i j k l ( β ) with respect to the Lagrange multipliers β m , m = 1 , , T R , will also be used when evaluating the various derivatives of the normalization integral Z ( β ) with respect to these Lagrange multipliers. The derivatives of the function G 3 i j k ( β ) with respect to the Lagrange multipliers β m , m = 1 , , T R , are obtained by using Equation (65), which yields the following expressions:

G 3 i j k ( β ) β m = ( Λ 1 ) i k ( Λ 1 ) j m + ( Λ 1 ) j k ( Λ 1 ) i m + ( Λ 1 ) i j ( Λ 1 ) k m + ( Λ 1 ) i m [ b = 1 T R ( Λ 1 ) j b β b ] [ c = 1 T R ( Λ 1 ) k c β c ] + [ a = 1 T R ( Λ 1 ) i a β a ] ( Λ 1 ) j m [ c = 1 T R ( Λ 1 ) k c β c ] + [ a = 1 T R ( Λ 1 ) i a β a ] [ b = 1 T R ( Λ 1 ) j b β b ] ( Λ 1 ) k m . (68)

Differentiating the expression in Equation (68) with respect to a Lagrange multiplier β n yields the following relation:

2 G 3 i j k ( β ) β n β m = ( Λ 1 ) i m { ( Λ 1 ) j n [ c = 1 T R ( Λ 1 ) k c β c ] + [ b = 1 T R ( Λ 1 ) j b β b ] ( Λ 1 ) k n } + ( Λ 1 ) j m { ( Λ 1 ) i n [ c = 1 T R ( Λ 1 ) k c β c ] + [ a = 1 T R ( Λ 1 ) i a β a ] ( Λ 1 ) k n } + ( Λ 1 ) k m { ( Λ 1 ) i n [ b = 1 T R ( Λ 1 ) j b β b ] + [ a = 1 T R ( Λ 1 ) i a β a ] ( Λ 1 ) j n } . (69)

Differentiating the expression in Equation (69) with respect to a Lagrange multiplier β μ yields the following relation:

3 G 3 i j k ( β ) β μ β n β m = ( Λ 1 ) i m { ( Λ 1 ) j n ( Λ 1 ) k μ + ( Λ 1 ) j μ ( Λ 1 ) k n } + ( Λ 1 ) j m { ( Λ 1 ) i n ( Λ 1 ) k μ + ( Λ 1 ) i μ ( Λ 1 ) k n } + ( Λ 1 ) k m { ( Λ 1 ) i n ( Λ 1 ) j μ + ( Λ 1 ) i μ ( Λ 1 ) j n } . (70)

The derivatives of the function G 4 i j k l ( β ) with respect to a Lagrange multipliers β m , m = 1 , , T R , are obtained by using Equation (67). Thus, the first derivative of the function G 4 i j k l ( β ) with respect to a Lagrange multipliers β m , m = 1 , , T R , has the following expression:

G 4 i j k l ( β ) β m = ( Λ 1 ) l m { ( Λ 1 ) i j [ c = 1 T R ( Λ 1 ) k c β c ] + ( Λ 1 ) i k [ b = 1 T R ( Λ 1 ) j b β b ] + ( Λ 1 ) j k [ a = 1 T R ( Λ 1 ) i a β a ] } + [ d = 1 T R ( Λ 1 ) l d β d ] { ( Λ 1 ) i j ( Λ 1 ) k m + ( Λ 1 ) i k ( Λ 1 ) j m + ( Λ 1 ) j k ( Λ 1 ) i m } + ( Λ 1 ) l m [ a = 1 T R ( Λ 1 ) i a β a ] [ b = 1 T R ( Λ 1 ) j b β b ] [ c = 1 T R ( Λ 1 ) k c β c ] + [ d = 1 T R ( Λ 1 ) l d β d ] ( Λ 1 ) i m [ b = 1 T R ( Λ 1 ) j b β b ] [ c = 1 T R ( Λ 1 ) k c β c ]

+ [ d = 1 T R ( Λ 1 ) l d β d ] [ a = 1 T R ( Λ 1 ) i a β a ] ( Λ 1 ) j m [ c = 1 T R ( Λ 1 ) k c β c ] + [ d = 1 T R ( Λ 1 ) l d β d ] [ a = 1 T R ( Λ 1 ) i a β a ] [ b = 1 T R ( Λ 1 ) j b β b ] ( Λ 1 ) k m + ( Λ 1 ) i l { ( Λ 1 ) j m [ c = 1 T R ( Λ 1 ) k c β c ] + [ b = 1 T R ( Λ 1 ) j b β b ] ( Λ 1 ) k m } + ( Λ 1 ) j l { ( Λ 1 ) i m [ c = 1 T R ( Λ 1 ) k c β c ] + [ a = 1 T R ( Λ 1 ) i a β a ] ( Λ 1 ) k m } + ( Λ 1 ) k l { ( Λ 1 ) i m [ b = 1 T R ( Λ 1 ) j b β b ] + [ a = 1 T R ( Λ 1 ) i a β a ] ( Λ 1 ) j m } . (71)

Differentiating Equation (71) with respect to a Lagrange multiplier β n n = 1 , , T R , yields the following relation:

2 G 4 i j k l ( β ) β n β m = X 1 + X 2 + X 3 + X 4 + X 5 + X 6 , (72)

where:

X 1 ( Λ 1 ) l m { ( Λ 1 ) i j ( Λ 1 ) k n + ( Λ 1 ) i k ( Λ 1 ) j n + ( Λ 1 ) j k ( Λ 1 ) i n } + ( Λ 1 ) l n { ( Λ 1 ) i j ( Λ 1 ) k m + ( Λ 1 ) i k ( Λ 1 ) j m + ( Λ 1 ) j k ( Λ 1 ) i m } , (73)

X 2 ( Λ 1 ) l m { ( Λ 1 ) i n [ b = 1 T R ( Λ 1 ) j b β b ] [ c = 1 T R ( Λ 1 ) k c β c ] + [ a = 1 T R ( Λ 1 ) i a β a ] ( Λ 1 ) j n [ c = 1 T R ( Λ 1 ) k c β c ] + [ a = 1 T R ( Λ 1 ) i a β a ] [ b = 1 T R ( Λ 1 ) j b β b ] ( Λ 1 ) k n } , (74)

X 3 ( Λ 1 ) i m { ( Λ 1 ) l n [ b = 1 T R ( Λ 1 ) j b β b ] [ c = 1 T R ( Λ 1 ) k c β c ] + [ d = 1 T R ( Λ 1 ) l d β d ] ( Λ 1 ) j n [ c = 1 T R ( Λ 1 ) k c β c ] + [ d = 1 T R ( Λ 1 ) l d β d ] [ b = 1 T R ( Λ 1 ) j b β b ] ( Λ 1 ) k n } , (75)

X 4 ( Λ 1 ) j m { ( Λ 1 ) l n [ a = 1 T R ( Λ 1 ) i a β a ] [ c = 1 T R ( Λ 1 ) k c β c ] + [ d = 1 T R ( Λ 1 ) l d β d ] ( Λ 1 ) i n [ c = 1 T R ( Λ 1 ) k c β c ] + [ d = 1 T R ( Λ 1 ) l d β d ] [ a = 1 T R ( Λ 1 ) i a β a ] ( Λ 1 ) k n } , (76)

X 5 ( Λ 1 ) k m { ( Λ 1 ) l n [ a = 1 T R ( Λ 1 ) i a β a ] [ b = 1 T R ( Λ 1 ) j b β b ] + [ d = 1 T R ( Λ 1 ) l d β d ] ( Λ 1 ) i n [ b = 1 T R ( Λ 1 ) j b β b ] + [ d = 1 T R ( Λ 1 ) l d β d ] [ a = 1 T R ( Λ 1 ) i a β a ] ( Λ 1 ) j n } , (77)

X 6 ( Λ 1 ) i l [ ( Λ 1 ) j m ( Λ 1 ) k n + ( Λ 1 ) j n ( Λ 1 ) k m ] + ( Λ 1 ) j l [ ( Λ 1 ) i m ( Λ 1 ) k n + ( Λ 1 ) i m ( Λ 1 ) k m ] + ( Λ 1 ) k l [ ( Λ 1 ) i m ( Λ 1 ) j n + ( Λ 1 ) i n ( Λ 1 ) j m ] . (78)

Differentiating Equation (72) with respect to a Lagrange multiplier β μ μ = 1 , , T R , yields the following relation:

3 G 4 i j k l ( β ) β μ β n β m = Y 1 + Y 2 + Y 3 + Y 4 , (79)

where:

Y 1 ( Λ 1 ) l m ( Λ 1 ) i n [ ( Λ 1 ) j μ c = 1 T R ( Λ 1 ) k c β c + ( Λ 1 ) k μ b = 1 T R ( Λ 1 ) j b β b ] + ( Λ 1 ) l m ( Λ 1 ) j n [ ( Λ 1 ) i μ c = 1 T R ( Λ 1 ) k c β c + ( Λ 1 ) k μ a = 1 T R ( Λ 1 ) i a β a ] + ( Λ 1 ) l m ( Λ 1 ) k n [ ( Λ 1 ) i μ b = 1 T R ( Λ 1 ) j b β b + ( Λ 1 ) j μ a = 1 T R ( Λ 1 ) i a β a ] , (80)

Y 2 ( Λ 1 ) i m ( Λ 1 ) l n [ ( Λ 1 ) j μ c = 1 T R ( Λ 1 ) k c β c + ( Λ 1 ) k μ b = 1 T R ( Λ 1 ) j b β b ] + ( Λ 1 ) i m ( Λ 1 ) j n [ ( Λ 1 ) l μ c = 1 T R ( Λ 1 ) k c β c + ( Λ 1 ) k μ d = 1 T R ( Λ 1 ) l d β d ] + ( Λ 1 ) i m ( Λ 1 ) k n [ ( Λ 1 ) l μ b = 1 T R ( Λ 1 ) j b β b + ( Λ 1 ) j μ d = 1 T R ( Λ 1 ) l d β d ] , (81)

Y 3 ( Λ 1 ) j m ( Λ 1 ) l n [ ( Λ 1 ) i μ c = 1 T R ( Λ 1 ) k c β c + ( Λ 1 ) k μ a = 1 T R ( Λ 1 ) i a β a ] + ( Λ 1 ) j m ( Λ 1 ) i n [ ( Λ 1 ) l μ c = 1 T R ( Λ 1 ) k c β c + ( Λ 1 ) k μ d = 1 T R ( Λ 1 ) l d β d ] + ( Λ 1 ) j m ( Λ 1 ) k n [ ( Λ 1 ) l μ a = 1 T R ( Λ 1 ) i a β a + ( Λ 1 ) i μ d = 1 T R ( Λ 1 ) l d β d ] , (82)

Y 4 ( Λ 1 ) k m ( Λ 1 ) l n [ ( Λ 1 ) i μ b = 1 T R ( Λ 1 ) j b β b + ( Λ 1 ) j μ a = 1 T R ( Λ 1 ) i a β a ] + ( Λ 1 ) k m ( Λ 1 ) i n [ ( Λ 1 ) l μ b = 1 T R ( Λ 1 ) j b β b + ( Λ 1 ) j μ d = 1 T R ( Λ 1 ) l d β d ] + ( Λ 1 ) k m ( Λ 1 ) j n [ ( Λ 1 ) l μ a = 1 T R ( Λ 1 ) i a β a + ( Λ 1 ) i μ d = 1 T R ( Λ 1 ) l d β d ] . (83)

Differentiating Equation (79) with respect to a Lagrange multiplier β ν , ν = 1 , , T R , yields the following relation:

4 G 4 i j k l ( β ) β ν β μ β n β m = W 1 + W 2 + W 3 + W 4 , (84)

where:

W 1 ( Λ 1 ) l m ( Λ 1 ) i n [ ( Λ 1 ) j μ ( Λ 1 ) k ν + ( Λ 1 ) k μ ( Λ 1 ) j ν ] + ( Λ 1 ) l m ( Λ 1 ) j n [ ( Λ 1 ) i μ ( Λ 1 ) k ν + ( Λ 1 ) k μ ( Λ 1 ) i ν ] + ( Λ 1 ) l m ( Λ 1 ) k n [ ( Λ 1 ) i μ ( Λ 1 ) j ν + ( Λ 1 ) j μ ( Λ 1 ) i ν ] , (85)

W 2 ( Λ 1 ) i m ( Λ 1 ) l n [ ( Λ 1 ) j μ ( Λ 1 ) k ν + ( Λ 1 ) k μ ( Λ 1 ) j ν ] + ( Λ 1 ) i m ( Λ 1 ) j n [ ( Λ 1 ) l μ ( Λ 1 ) k ν + ( Λ 1 ) k μ ( Λ 1 ) l ν ] + ( Λ 1 ) i m ( Λ 1 ) k n [ ( Λ 1 ) l μ ( Λ 1 ) j ν + ( Λ 1 ) j μ ( Λ 1 ) l ν ] , (86)

W 3 ( Λ 1 ) j m ( Λ 1 ) l n [ ( Λ 1 ) i μ ( Λ 1 ) k ν + ( Λ 1 ) k μ ( Λ 1 ) i ν ] + ( Λ 1 ) j m ( Λ 1 ) i n [ ( Λ 1 ) l μ ( Λ 1 ) k ν + ( Λ 1 ) k μ ( Λ 1 ) l ν ] + ( Λ 1 ) j m ( Λ 1 ) k n [ ( Λ 1 ) l μ ( Λ 1 ) i ν + ( Λ 1 ) i μ ( Λ 1 ) l ν ] , (87)

W 4 ( Λ 1 ) k m ( Λ 1 ) l n [ ( Λ 1 ) i μ ( Λ 1 ) j ν + ( Λ 1 ) j μ ( Λ 1 ) i ν ] + ( Λ 1 ) k m ( Λ 1 ) i n [ ( Λ 1 ) l μ ( Λ 1 ) j ν + ( Λ 1 ) j μ ( Λ 1 ) l ν ] + ( Λ 1 ) k m ( Λ 1 ) j n [ ( Λ 1 ) l μ ( Λ 1 ) i ν + ( Λ 1 ) i μ ( Λ 1 ) l ν ] . (88)

Appendix B: Approximations Inherent to the Fourth-Order Maximum Entropy Distribution

Recalling Equation (56), the expression of the approximate fourth-order MaxEnt probability distribution function p e M E ( r ) for the responses has the following form:

p e M E ( z ) = h e M E ( z ) D r h e M E ( z ) d z 1 Z e M E f e M E ( r ) exp [ g e M E ( r ) ] , (89)

where:

g e M E ( r ) exp [ 1 2 ( r r e ) ( C e ) 1 ( r r e ) ] ; (90)

f e M E ( r ) 1 i = 1 T R θ i ( r i r i e ) 3 i = 1 T R ω i ( r i r i e ) 4 ; (91)

Z e M E = D r f e M E ( r ) exp [ g e M E ( r ) ] d r = K ( 1 ω g 4 ) = K [ 1 3 i = 1 T R ω i ( c i i e ) 2 ] . (92)

The MaxEnt probability distribution function p e M E ( r ) is properly normalized, as shown below:

D r p e M E ( r ) d r = K 1 [ 1 3 i = 1 T R ω i ( c i i e ) 2 ] 1 { D r exp [ 1 2 ( r r e ) ( C e ) 1 ( r r e ) ] d r i = 1 T R θ i D r ( r i r i e ) 3 exp [ 1 2 ( r r e ) ( C e ) 1 ( r r e ) ] d r i = 1 T R ω i D r ( r i r i e ) 4 exp [ 1 2 ( r r e ) ( C e ) 1 ( r r e ) ] d r } = K 1 [ 1 3 i = 1 T R ω i ( c i i e ) 2 ] 1 [ K 0 3 K i = 1 T R ω i ( c i i e ) 2 ] = 1 . (93)

The first-order moments r i M E , i = 1 , , T R , of p e M E ( r ) are given by the following expression:

r i M E D r r i p e M E ( r ) d r

= K 1 [ 1 3 a = 1 T R ω a ( c a a e ) 2 ] 1 { D r r k exp [ 1 2 ( r r e ) ( C e ) 1 ( r r e ) ] d r a = 1 T R θ a D r r i ( r a r a e ) 3 exp [ 1 2 ( r r e ) ( C e ) 1 ( r r e ) ] d r a = 1 T R ω a D r r i ( r a r a e ) 4 exp [ 1 2 ( r r e ) ( C e ) 1 ( r r e ) ] d r }

= K 1 [ 1 3 a = 1 T R ω a ( c a a e ) 2 ] 1 { r i e D r exp [ 1 2 z ( C e ) 1 z ] d z D r z i exp [ 1 2 z ( C e ) 1 z ] d z a = 1 T R θ a r i e D r z a 3 exp [ 1 2 z ( C e ) 1 z ] d z a = 1 T R θ a D r z i z a 3 exp [ 1 2 z ( C e ) 1 z ] d z a = 1 T R ω a r i e D r z a 4 exp [ 1 2 z ( C e ) 1 z ] d z a = 1 T R ω a D r z i z a 4 exp [ 1 2 z ( C e ) 1 z ] d z }

= K 1 [ 1 3 a = 1 T R ω a ( c a a e ) 2 ] 1 × { K r i e 0 0 O [ K a = 1 T R θ a ( c a a e ) 2 ] 3 K a = 1 T R ω a r i e ( c a a e ) 2 0 } = K 1 [ 1 3 a = 1 T R ω a ( c a a e ) 2 ] 1 K [ 1 3 a = 1 T R ω a ( c a a e ) 2 ] r i e K 1 [ 1 3 a = 1 T R ω a ( c a a e ) 2 ] 1 × K × O [ a = 1 T R t a e ( c a a e ) 3 ( c a a e ) 2 ] = r i e + O ( t i e / c i i e ) . (94)

As indicated by the result obtained in Equation (94), if the third-order correlations t i j k e among the measured responses are all neglected, then the MaxEnt moments r k M E coincide with the measured moments, r k e (i.e., r k M E = r k e if t i j k e = 0 ).

Within the approximation r k M E = r k e , the second-order moments c i j M E , i , j = 1 , , T R , of p e M E ( r ) are given by the following expression:

c i j M E D e ( r i r i e ) ( r j r j e ) p e M E ( r ) d r = K 1 [ 1 3 a = 1 T R ω a ( c a a e ) 2 ] 1 { D r z i z j exp [ 1 2 z ( C e ) 1 z ] d z a = 1 T R θ a D r z i z j z a 3 exp [ 1 2 z ( C e ) 1 z ] d z a = 1 T R ω a D r z i z j z a 4 exp [ 1 2 z ( C e ) 1 z ] d z } = K 1 [ 1 3 a = 1 T R ω a ( c a a e ) 2 ] 1 K [ 1 3 a = 1 T R ω a ( c a a e ) 2 ] { c i j e 0 O ( ω a ) } = c i j e ; i , j = 1 , , T R . (95)

Within the approximation r k M E = r k e , the third-order moments t i j k M E , i , j , k = 1 , , T R , of p e M E ( r ) are given by the following expression:

t i j k M E D e ( r i r i e ) ( r j r j e ) ( r k r k e ) p e M E ( r ) d r = K 1 [ 1 3 a = 1 T R ω a ( c a a e ) 2 ] 1 { D r z i z j z k exp [ 1 2 z ( C e ) 1 z ] d z a = 1 T R θ a D r z i z j z k z a 3 exp [ 1 2 z ( C e ) 1 z ] d z a = 1 T R ω a D r z i z j z k z a 4 exp [ 1 2 z ( C e ) 1 z ] d z } = t i j k e + H O T , (96)

where the acronym “HOT” denotes “higher order terms.”

Within the approximation r k M E = r k e , the fourth-order moments q i j k e , i , j , k , l = 1 , , T R , of p e M E ( r ) are given by the following expression:

q i j k l M E D e ( r i r i e ) ( r j r j e ) ( r k r k e ) ( r l r l e ) p e M E ( r ) d r = K 1 [ 1 3 a = 1 T R ω a ( c a a e ) 2 ] 1 { D r z i z j z k z l exp [ 1 2 z ( C e ) 1 z ] d z a = 1 T R θ a D r z i z j z k z l z a 3 exp [ 1 2 z ( C e ) 1 z ] d z a = 1 T R ω a D r z i z j z k z l z a 4 exp [ 1 2 z ( C e ) 1 z ] d z } = q i j k l e + H O T , (97)

where the acronym “HOT” denotes “higher order terms”.

Conflicts of Interest

The author declares no conflicts of interest regarding the publication of this paper.

References

[1] Humi, M., Wagschal, J.J. and Yeivin, Y. (1964) Multi-Group Constants from Integral Data. Proceedings of the 3rd International Conference on the Peaceful Uses of Atomic Energy, Geneva, 31 August-9 September 1964, 398 p.
[2] Usachhev, L.N. (1964) Perturbation Theory for the Breeding Ratio and for Other Number Ratios Pertaining to Various Reactor Processes. Journal of Nuclear Energy. Parts A/B. Reactor Science and Technology, 18, 571-583.
https://doi.org/10.1016/0368-3230(64)90142-9
[3] Rowlands, J., et al. (1973) The Production and Performance of the Adjusted Cross-Section Set FGL5. Proceedings of the International Symposium on Physics of Fast Reactors, Tokyo, 16 October 1973.
[4] Gandini, A. and Petilli, M. (1973) AMARA: A Code Using the Lagrange’s Multipliers Method for Nuclear Data Adjustment. CNEN-RI/FI(73)39, Comitato Nazionale Energia Nucleare, Rome.
[5] Kuroi, H. and Mitani, H. (1975) Adjustment to Cross Section Data to fit Integral Experiments by Least Squares Method. Journal of Nuclear Science and Technology, 12, 663-680.
https://doi.org/10.1080/18811248.1975.9733172
[6] Dragt, J.B., Dekker, J.W.M., Gruppelaar, H. and Janssen, A.J. (1977) Methods of Adjustment and Error Evaluation of Neutron Capture Cross Sections; Application to Fission Product Nuclides. Nuclear Science and Engineering, 62, 117-129.
https://doi.org/10.13182/NSE77-3
[7] Lewis, J.M., Lakshmivarahan, S. and Dhall, S.K. (2006) Dynamic Data Assimilation: A Least Square Approach. Cambridge University Press, Cambridge.
https://doi.org/10.1017/CBO9780511526480
[8] Lahoz, W., Khattatov, B. and Ménard, R. (2010) Data Assimilation: Making Sense of Observations. Springer, Berlin.
[9] Cacuci, D.G., Navon, M.I. and Ionescu-Bujor, M. (2014) Computational Methods for Data Evaluation and Assimilation. Chapman & Hall/CRC, Boca Raton.
[10] Cacuci, D.G. and Ionescu-Bujor, M. (2010) Best-Estimate Model Calibration and Prediction through Experimental Data Assimilation—I: Mathematical Framework. Nuclear Science and Engineering, 165, 18-44.
https://doi.org/10.13182/NSE09-37B
[11] Cacuci, D.G. and Ionescu-Bujor, M. (2010) Sensitivity and Uncertainty Analysis, Data Assimilation and Predictive Best-Estimate Model Calibration. In: Cacuci, D.G., Ed., Handbook of Nuclear Engineering, Springer, Boston, 1913-2051.
https://doi.org/10.1007/978-0-387-98149-9_17
[12] Jaynes, E.T. (1957) Information Theory and Statistical Mechanics. Physical Review Journals Archive, 106, 620-630.
https://doi.org/10.1103/PhysRev.106.620
[13] Cacuci, D.G. (2014) Predictive Modeling of Coupled Multi-Physics Systems: I. Theory. Annals of Nuclear Energy, 70, 266-278.
https://doi.org/10.1016/j.anucene.2013.11.027
[14] Cacuci, D.G. (2019) BERRU Predictive Modeling: Best-Estimate Results with Reduced Uncertainties. Springer, Berlin.
https://doi.org/10.1007/978-3-662-58395-1
[15] Cacuci, D.G. (2023) Second-Order MaxEnt Predictive Modelling Methodology. I: Deterministically Incorporated Computational Model (2nd-BERRU-PMD). American Journal of Computational Mathematics, 13, 236-266.
https://doi.org/10.4236/ajcm.2023.132013
[16] Cacuci, D.G. (2023) Second-Order MaxEnt Predictive Modelling Methodology. II: Probabilistically Incorporated Computational Model (2nd-BERRU-PMP). American Journal of Computational Mathematics, 13, 267-294.
https://doi.org/10.4236/ajcm.2023.132014
[17] Fang, R. and Cacuci, D.G. (2023) Second-Order MaxEnt Predictive Modelling Methodology. III: Illustrative Application to a Reactor Physics Benchmark. American Journal of Computational Mathematics, 13, 295-322.
https://doi.org/10.4236/ajcm.2023.132015
[18] Cacuci, D.G. and Fang, R. (2023) Demonstrative Application to an OECD/NEA Reactor Physics Benchmark of the 2nd-BERRU-PM Method—I: Nominal Computations Consistent with Measurements. Energies, 16, Article No. 5552.
https://doi.org/10.3390/en16145552
[19] Fang, R. and Cacuci, D.G. (2023) Demonstrative Application to an OECD/NEA Reactor Physics Benchmark of the 2nd-BERRU-PM Method—II: Nominal Computations Apparently Inconsistent with Measurements. Energies, 16, Article No. 5614.
https://doi.org/10.3390/en16155614
[20] Valentine, T.E. (2006) Polyethylene-Reflected Plutonium Metal Sphere Subcritical Noise Measurements. SUB-PU-METMIXED-001. In: International Handbook of Evaluated Criticality Safety Benchmark Experiments; NEA/NSC/DOC(95)03/I-IX; Organization for Economic Co-Operation and Development; Nuclear Energy Agency, Paris.
[21] Cacuci, D.G. and Fang, R. (2023) The nth-Order Comprehensive Adjoint Sensitivity Analysis Methodology, Volume II: Overcoming the Curse of Dimensionality: Large-Scale Application. Springer, Cham.
https://doi.org/10.1007/978-3-031-19635-5
[22] Cacuci, D.G. (1981) Sensitivity Theory for Nonlinear Systems: I. Nonlinear Functional Analysis Approach. Journal of Mathematical Physics, 22, 2794-2802.
https://doi.org/10.1063/1.525186
[23] Cacuci, D.G. (1981) Sensitivity Theory for Nonlinear Systems: II. Extensions to Additional Classes of Responses. Journal of Mathematical Physics, 22, 2803-2812.
https://doi.org/10.1063/1.524870
[24] Cacuci, D.G. (2022) The nth-Order Comprehensive Adjoint Sensitivity Analysis Methodology, Volume I: Overcoming the Curse of Dimensionality: Linear Systems. Springer, Cham, 362 p.
https://doi.org/10.1007/978-3-030-96364-4
[25] Cacuci, D.G. (2023) The nth-Order Comprehensive Adjoint Sensitivity Analysis Methodology, Volume III: Overcoming the Curse of Dimensionality: Nonlinear Systems. Springer, Cham, 369 p.
https://doi.org/10.1007/978-3-031-22757-8
[26] Bandyopadhyay, K., Bhattacharya, A., Biswas, P. and Drabold, D.A. (2005) Maximum Entropy and the Problem of Moments: A Stable Algorithm. Physical Review E, 71, Article ID: 057701.
https://doi.org/10.1103/PhysRevE.71.057701
[27] Wu, X. (2003) Calculation of Maximum Entropy Densities with Application to Income Distribution. Journal of Econometrics, 115, 347-354.
https://doi.org/10.1016/S0304-4076(03)00114-3
[28] Müller, I. and Ruggeri, T. (1993) Extended Thermodynamics. In: Truesdell, C., Ed., Springer Tracts in Natural Philosophy, Vol. 37, Springer, New York.
https://doi.org/10.1007/978-1-4684-0447-0
[29] Roulston, M.S. and Smith, L.A. (2002) Evaluating Probabilistic Forecasts Using Information Theory. Monthly Weather Review, 130, 1653-1660.
https://doi.org/10.1175/1520-0493(2002)130<1653:EPFUIT>2.0.CO;2
[30] Abramov, R., Majda, A. and Kleeman, R. (2005) Information Theory and Predictability for Low-Frequency Variability. Journal of the Atmospheric Sciences, 62, 65-87.
https://doi.org/10.1175/JAS-3373.1
[31] Haven, K., Majda, A. and Abramov R. (2005) Quantifying Predictability through Information Theory: Small Sample Estimation in a Non-Gaussian Framework. Journal of Computational Physics, 206, 334-362.
https://doi.org/10.1016/j.jcp.2004.12.008
[32] Abramov, R. (2009) The Multidimensional Moment-Constrained Maximum Entropy Problem: A BFGS Algorithm with Constraint Scaling. Journal of Computational Physics, 228, 96-108.
https://doi.org/10.1016/j.jcp.2008.08.020
[33] Cacuci, D.G. (2023) Fourth-Order Predictive Modelling: II. 4th-BERRU-PM Methodology for Combining Measurements with Computations to Obtain Best-Estimate Results with Reduced Uncertainties.
[34] Shannon, C.E. (1948) A Mathematical Theory of Communication. The Bell System Technical Journal, 27, 379-423.
https://doi.org/10.1002/j.1538-7305.1948.tb01338.x

Copyright © 2024 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.