Experimental Design of Measuring Soil-Water Characteristic Curve of Unsaturated Soil Using Bayesian Approach ()
1. Introduction
Soil-water characteristic curve (SWCC) represents the variation of volumetric water content (or effective saturation) with the matrix suction, which is significant to estimate unsaturated soil parameters (e.g., unsaturated shear strength and permeability coefficient) [1]-[3]). Only a limited number of SWCC measuring data can be obtained considering that the direct measurement of SWCC is often costly and time-consuming through in-situ or laboratory tests according to some prescribed experimental schemes (i.e., the number of measuring points and their corresponding values of the control variable). The uncertainty of estimating SWCC based on limited data is inevitable, which depends on the data obtained from prescribed experimental schemes and affects the estimation of unsaturated soil parameters and geotechnical reliability analysis [4]. Determining an optimal experimental scheme is vital for reducing the uncertainty in SWCC estimated from a limited number of data points.
Experimental design optimization (EDO) provides a rational vehicle to determine the optimal experimental scheme for acquisition of measuring data in a cost-effective way [5]. Several EDO methods have been developed in the literature, including conventional experimental design optimization (CEDO) methods based on classical statistics [6] and Bayesian experimental design optimization (BEDO) methods based on Bayesian inference and/or information theory [7] [8]. Compared with CEDO, the BEDO has an advantage of quantifying various uncertainties, which has been recently applied in geotechnical and geological engineering to design in-situ instrumentation [9] and site investigation programs [10]. Despite of these previous studies on in-situ monitoring and sampling design, research on applying BEDO to design geotechnical laboratory tests that can be troublesome and time-consuming, e.g., SWCC test, is rare. Ding et al. (2022) [8] proposed a BEDO approach for SWCC testing, which, however, requires to implement the optimization procedure twice at two stages of the experimental design for determining control and additional measuring points, respectively.
This paper presents a one-stage Bayesian experimental design optimization (OBEDO) method for SWCC testing based on Fredlund and Xing (1994) (FX) model, which determines the optimal experimental scheme by implementing a single run of optimization procedure. The proposed method adopts expected utility to quantify the expected value of information provided by SWCC testing. The ancestral sampling and Bayesian method are used to generate simulated data to evaluate the effect of uncertainty on soil parameters. The optimal scheme with maximal expected utility is searched out with Subset Simulation Optimization (SSO), which improves the efficiency of determining the optimal scheme in the design space. This paper starts with a description of the proposed OBEDO framework based on FX model, followed by quantifying the expected utility of candidate experimental schemes and optimizing the experimental scheme by maximizing the expected utility using SSO [11]. Then, the proposed approach is illustrated using a SWCC experimental design example.
2. One-Stage Bayesian Experimental Design Optimization
(OBEDO) Framework for Measuring SWCC
As shown in Figure 1, the proposed OBEDO framework starts with collecting available prior knowledge (i.e., prevailing SWCC models and typical ranges of its model parameters) before testing on the SWCC of soils concerned and the information of testing apparatus and technique, which are used to determine the design space of candidate experiment schemes. The proposed OBEDO framework is comprised of three steps: determination of the candidate experimental schemes, calculation of the expected utility, U€, of a possible experimental scheme E that is specified by the number, n, of measuring points, and optimization of the experimental scheme performed by SSO to maximize the U€. Details of the three steps of the proposed OBEDO framework are provided in the following three sections.
Figure 1. One-stage Bayesian experiment design optimization (OBEDO) framework for measuring SWCC.
3. Candidate Experimental Schemes Based on FX Model
The trajectory of SWCC can be generally controlled by characteristic matric suction values (such as the air-entry value
, the matric suction at the inflection point
, and the matric suction corresponding to the residual water content
) and their corresponding degrees of saturation. For a given SWCC parametric model, the
,
, and
divide the SWCC into four partitions. There are, at least, four control measuring points selected within the ranges of the matric suction, i.e.,
,
,
,
to capture the general trajectory of the estimated SWCC and a certain number of additional points selected within the ranges of the matric suction, i.e.,
to reduce its associated uncertainty. Let n denote the total number of measuring points. Each candidate experimental scheme, E, of SWCC testing is comprised of four control points (i.e., A1, A2, A3, A4) and (n-4) additional points (i.e., B1-Bn-4), as shown in Figure 2.
Nevertheless, during the experimental design stage, the
,
, and
values corresponding to the prescribed SWCC model are unknown. The expected value (i.e.,
,
, and
) of
,
,
is adopted to constrain the matric suction range of control point (i.e., A1, A2, A3, A4), which can be determined using Monte Carlo simulation based on the prior knowledge of SWCC model parameters. Consider, for example, the FX model given below:
(1)
The values of
,
,
corresponding to the FX model satisfy Eqs. (2)-(4) [12].
(2)
(3)
(4)
where
,
and
are the model fitting parameters of FX model;
is an effective degree of saturation corresponding to
; k1 is the slope at the inflection point;
is the matric suction where the SWCC starts to drop linearly;
is an effective degree of saturation corresponding to
; and k2 is the slope at the point
. These symbols are illustrated in Figure 3. Np estimates of
,
, and
can be obtained with the number, Np, of random samples of
,
and
simulated from their uniform prior distribution through Monte Carlo simulation. Based on the Np estimates of
,
, and
, their respective mean values (i.e.,
,
, and
) are evaluated, with which the matric suction values (i.e.,
, and
) of the four control measuring points A1, A2, A3, and A4 are, respectively, assigned within the matric suction ranges
,
,
, and
. The matric suction values (i.e.,
-
) of n-4 additional measuring points B1-Bn-4 belong to the range,
, but should not be equal to any values of the four control measuring points A1, A2, A3, A4.
Figure 2. Illustration of control measuring points and additional measuring points.
Figure 3. Typical soil-water characteristic curve [12].
Let
denote the feasible discrete matric suction value, and a set of possible value of
can be expressed as
, where
,
,
,
are discrete intervals (e.g., the minimum increment of the matric suction that can be applied by the testing apparatus). The above discretization procedure of the matrix suction results in a total of
possible values of
. Assume that
,
,
, and
values of
in
fall within
,
,
, and
, respectively, which constitute the set
,
,
,
. The matric suction values (i.e.,
, and
) of the control measuring points A1, A2, A3, and A4 satisfied
,
,
, and
, respectively. Let
denote the set of possible values of the matric suction (i.e.,
) of each additional measuring point Bj (
), which can be written as a set
(
;
). Each set of possible values of
and
(
) constitute a candidate experimental scheme E, which can be expressed as Eq.(5)
(5)
The optimal experimental scheme is determined by maximizing the expected data worth (i.e., the expected utility U(E)) of the SWCC test performed according to candidate experimental schemes using SSO. Calculations of the U(E) of each candidate experimental scheme, E, and its optimization through SSO are provided in the following two sections, respectively.
4. Expected Utility of Candidate Experimental Schemes
Consider, for example, a candidate experimental scheme E. The data worth of SWCC test can be quantified by the relative entropy,
, that indicates the statistical difference between the updated distribution,
, of SWCC model parameters,
, given a set of newly-obtained data (e.g., values of effective degree of saturation,
), obtained according to E and the prior distribution,
, of
.
can be written as Eq. (6) [5]
(6)
Without the real measurement data at the experimental design stage, the expected utility, U(E), of SWCC measurement data corresponding to E is adopted to quantify the expected worth of data, which is evaluated as Eq. (7) [13]:
(7)
where
is the probability density function (PDF) of
corresponding to E. Substituting Eq. (6) into Eq. (7) gives
(8)
Using the
and
samples, Eq. (8) is re-written as Eq. (9) [8]:
(9)
where
is the posterior distribution of
evaluated at
given
.
For a given number of measuring points, the optimal experimental scheme
is taken as the scheme with the maximum U(E) among candidate experimental schemes, i.e.,
(10)
The next section makes uses of SSO to identify the
among candidate experimental schemes.
5. Optimizing the Experimental Scheme with
Subset Simulation
As mentioned in Section 3 entitled “Candidate experimental schemes based on FX model”, the number of candidate experimental schemes is equal to
. Identifying the
among candidate experimental schemes can be formulated as an optimization problem below:
(11)
where the feasible domains (i.e.,
and
) of
(
) and
(
) are defined previously in Section 3. In this study, SSO is used to search the E* in the design space. SSO is a global optimization algorithm that was originally developed from Subset simulation [11]. The proposed OBEDO approach makes use of SSO to identify the optimal experimental scheme E* according to the expected utility, where only one-stage optimization is involved, and returns the one with the maximum expected utility as the E*, which contains the optimal control and additional measuring points.
Within the SSO framework, E* can be found among the candidate schemes by solving the following reliability analysis problem [14]:
(12)
where
is an auxiliary failure event.
represents the probability that event F occurs, which becomes zero as scheme E is equal to E*. A number of conditional samples of a series of nested intermediate failure events satisfying
is generated with SSO, with which
is expressed as Eq. (13):
(13)
where
,
.
is equal to
;
are an increasing sequence of Ns intermediate threshold values, which are determined adaptively with simulated samples so that the sample estimates of
and
are always equivalent to a specific value of conditional probability p0 (e.g., 0.1) [15]. The implementation of SSO can refer to Li and Au (2010) [14] and Ding et al. (2022) [8].
6. Illustrative Example
6.1. Determining Candidate Experimental Schemes Based on the
Prior Knowledge
In this example, the prior knowledge of FX model parameters is taken as their respective typical ranges
,
,
and
, which are consistent with those reported in reference [16]. Consider, for example, a SWCC testing apparatus with the measured matric suctions range of (0, 2000kPa), which is divided into the matric suction range of
,
,
, and
by , , and estimated using the prior knowledge of FX model parameters. Then, the feasible values of the matric suction include
= {2, 4, 6, ∙∙∙, 14, 16, 18, ∙∙∙, 24, 26, 28, ∙∙∙, 96, 98, 148, 198, ∙∙∙, 1948, 1998} (in kPa) with
,
,
,
(Figure 4).
Figure 4. Determination of
,
and
based on prior knowledge in the illustrative example.
6.2. Optimal Experimental Scheme for SWCC Testing
For consideration of the effect of n on the data worth of the candidate experimental scheme, a series of n values are considered, including 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 20, and 25. For each of n value, the SSO runs with conditional probability p0 = 0.1, the maximum number of simulated levels Ns = 20, and 2000 samples per level is used to obtain the optimal matric suction values and their corresponding U(E*) values, as shown in Figure 5. Figure 6 shows the variation of U(E*) as a function of n. It is found that the U(E*) increases rapidly as n is less than 17. The improvement of U(E*) becomes marginal by adding more measuring points as the n is greater than 17. As a result, the optimal number of measuring points is taken as n = 17 in this example. Correspondingly, the optimal experimental scheme E* (given n = 17) is {6, 12, 20, 48, 64, 86, 148, 298, 398, 648, 848, 948, 998, 1048, 1198, 1448, 1598} (in kPa), of which the expected utility (i.e., U(E*)) is 5.27.
Figure 5. Evolution of SSO for different numbers of measuring points.
Figure 6. Expected utility with different number of measuring points.
6.3. Further Illustration with Real Data of Loess
The measured SWCC data of loess that is reported in references ([2]; [17]-[20]) is used to verify the effectiveness of proposed method, as shown in Figure 7. The utility (i.e., R(E)) that is calculated using Eq. (6) of measured SWCC data obtained from Punrattanasin et al. (2002) [17], Huang et al. (2009) [18], Chen et al. (2011) [19], Jiao et al. (2016) [20], and Wang et al. (2018) [2] are determined as 1.90, 2.06, 0.51, 1.99, and 0.46, respectively. As discussed in subsections 6.1 entitled “Optimal experimental scheme for SWCC testing”, the optimal experimental scheme, E*, obtained from the OBEDO approach and referred to as one-stage Bayesian optimal scheme (OBOS) is {6, 12, 20, 48, 64, 86, 148, 298, 398, 648, 848, 948, 998, 1048, 1198, 1448, 1598} (in kPa), and its expected utility (i.e., 5.27) is superior to the utility of the measured data of loess reported in literature.
Figure 7. SWCC measured data of loess.
It is worth to point out that the number of measured SWCC data obtained from Punrattanasin et al. (2002) [17], Huang et al. (2009) [18], Chen et al. (2011) [19], Jiao et al. (2016) [20], and Wang et al. (2018) [2] are 7, 10, 6, 9, 4, respectively, which are not consistent with the optimal number (i.e., 17) of SWCC measurements in E* determined by the proposed method. To enable a consistent comparison, 17 data points are randomly selected from the 36 measurement data points of the loess shown in Figure 7 to mimic the experimental scheme with 17 measuring points, which is referred to random experimental schemes (RES) herein. Figure 8 shows the values of the utility of the 10000 RESs by circles, among which the maximum value is around 4.08 and its corresponding RES is referred to as random optimal scheme (ROS) indicated by the dotted line in Figure 7. The utility of ROS is less than the expected utility (i.e., 5.27) of OBOS obtained from the proposed approach, which demonstrates the effectiveness of the proposed OBEDO method.
Figure 8. Utility of random experimental schemes.
7. Summary and Conclusions
This paper developed a one-stage Bayesian experimental design optimization (OBEDO) approach for determining the optimal experimental scheme of SWCC test using the prior knowledge and the information of testing apparatus. The candidate experimental scheme with the maximum expected utility is identified as the optimal experimental scheme using Subset Simulation optimization (SSO).
1) The proposed OBEDO approach was illustrated using a design example. It was shown that the expected utility of the optimal experimental scheme improves by adding more measurements. Such an improvement becomes marginal as the number of measuring points is sufficiently large (e.g., 17 in the illustrative example). Hence, the optimal number of measuring points can be determined as a trade-off between the improvement of data worth and the commitment involved in testing.
2) The proposed approach was also verified using real loess data. Results showed that the arbitrary arrangement of measuring points of SWCC test is hardly to give the optimal experiment scheme in terms of the expected utility (or values of information). The proposed OBEDO approach provides a rational tool to optimize the arrangement of measuring points of SWCC test based on prior knowledge and the information of testing apparatus so as to obtain SWCC measurement data with relatively high value of information for uncertainty reduction.