A Class of Copulas Derived from Residual Implications and Its Applications ()
1. Introduction
Copulas are multivariate distribution functions characterized by uniform marginal distributions. In 1959, Sklar [1] demonstrated that for any multivariate distribution, a copula exists that allows the joint distribution to be expressed as the copula applied to the marginal distributions. Because copulas are independent of the marginal distributions, they provide a convenient framework for addressing various modeling challenges. Copulas have been used in many fields, including finance, engineering, economics, and environmental science [2]-[5].
Fuzzy implications play a critical role in fuzzy logic, enabling logical reasoning in uncertain situations [6]. Among them, residual implications stand out as a key type and are derived from aggregation functions [7]-[10]. In recent years, many researchers have explored fuzzy implications formed by various aggregation functions [11]-[13], while others have focused on characterizing residual implications based on different aggregation functions [14] [15]. Additionally, some studies constructed different fuzzy implications from copulas, including probabilistic implications and probability S-implications [16]-[18].
There is a one-to-one correspondence between left-continuous semicopulas and their derived residual implications [9]. In 1999, a characterization for residual implications derived from left-continuous semicopulas was provided by Demirli et al. [9]. In 2007, Durante et al. [19] extended this work by characterizing residual implications derived from quasi-copulas. More recently, in 2022, Ji et al. [20] further advanced this area by characterizing residual implications derived from copulas through their level curves.
This article aims to study a class of copulas derived from residual implications, where the extensions of the level curves intersect at a point. Specifically, this class represents a comprehensive and asymmetric extension of the Archimedean copula family (4.2.7) as described in [23], providing greater flexibility for modeling real-world phenomena. The proposed copulas include two dependence parameters,
and
, and are able to capture correlation coefficients in the range
, i.e. Kendall’s tau and Spearman’s rho. It is well known that multiparameter copulas are capable of modeling a variety of dependency structures, including non-linear, asymmetric and tail dependent relationships that one-parameter copulas cannot adequately capture. In summary, the new model allows for both positive and negative dependencies without imposing constraints on the correlation structure. These important features motivate a theoretical and practical analysis of the proposed copulas.
The remainder of the paper is structured as follows. Section 2 provides a review of the fundamental concepts and key properties related to copulas and residual implications. In section 3, we introduce radial copulas and examine some dependence concepts. Section 4 focuses on estimating the two dependence parameters of radial copulas and includes a simulation study to assess the performance of these estimators. In section 5, we perform an analysis of two real datasets to examine the applications of the proposed radial copulas. Finally, section 6 presents our conclusions.
2. Preliminaries
Copulas are multivariate functions characterized by uniform marginal distributions. Let us first review some of their definitions and properties.
Definition 1 [21] A binary operation
is defined as a semi-copula if it fulfills the subsequent conditions: [label=(C)]
1.
and
;
2. For every
, it holds that
and
;
3.
is nondecreasing in both variables.
Definition 2 [22] A binary operation
is defined as a quasi-copula if it is a semicopula and also adheres to the 1-Lipschitz condition. Specifically, this implies that the following inequality is valid for all
:
.
Definition 3 [23] A binary operation
is defined as a copula if it is a semicopula and also fulfills the condition of being 2-increasing. Specifically, this indicates that for any
where
and
, the following inequality is valid:
.
Residual implications derived from various classes of left-continuous semicopulas have been examined in [9] [19]. For any left-continuous semicopula
, the derived residual implication, denoted as
, is defined by
(1)
This function
is referred to as the residuum of
or the
-residuum [19].
Proposition 4 [9] If
is a left-continuous semicopula, then the corresponding
residuum
has the properties listed below:
1.
if and only if
;
2.
for all
;
3. If
with
, then it holds that
;
4. If
with
, then it holds that
;
5. The first variable of the
function is left-continuous;
6. The second variable of the
function is right continuous.
On the contrary, if a function
fulfills conditions (R1) through (R6), then there exists a left-continuous semicopula
where
. Additionally, the definition of
is as follows:
(2)
The set of all functions
that fulfill conditions (R1) through (R6) is represented by
. Given any
, the associated deresiduum, referred to as the
-deresiduum, is denoted by
[19]. For simplicity, when the context is clear,
and
are often abbreviated as
and
, respectively.
Similar to the t-norms, the adjointness condition of the
(3)
applies to any left-continuous semicopula
and its residuum
[9].
Proposition 5 [19] [20] Let
be a quasi-copula. The
-residuum
fulfills conditions (R1) through (R6) and two additional properties:
1. For all
such that
and
, it holds that
;
2. For all
with the condition that
, it holds that
.
By contrast, if
fulfills conditions (R1) through (R8), then the
-deresiduum
is a quasi-copula.
The level set of
for a constant
is defined as
. Let
be a function that satisfies conditions (R1) through (R8). For
, if
, then according to condition (R8), we conclude that
. Consequently, the level set of
associated with
is made up of points lying on a curve, known as the level curve of
[20]. For
, the level set of
is defined as
. The boundary curve of this level set,
, is referred to as the level curve of
for the constant 1. We often denote the level curve of
as
, where
.
Proposition 6 [20] Let
be a quasi-copula, and let
represent its residuum. Suppose that
represents a level curve of
for
. For
such that
, we have
.
The following proposition gives a necessary and sufficient condition for the
-deresiduum
to be a copula.
Proposition 7 [20] Let
satisfy conditions (R1) through (R8). Let
denote the set of level curves of
. The
-deresiduum
is a copula if and only if each level curve of
has a domain that is an interval, and the inequality
holds for
, where
,
, and
exists for all
.
The domain of any level curve is either
or
, where
, for the residual implication derived from a copula. Notably, the vertical distance between two level curves is consistently greater on the right side than on the left [20].
3. Proposed Copulas
This section introduces a class of copulas derived from residual implications, which are characterized by extensions of level curves that intersect at a point. This class includes two dependence parameters, providing greater flexibility in modeling dependencies.
For the product copula
, the corresponding residual implication is given by
. For
, the level curves of
can
be expressed by the equation
, where
. Clearly, for a given value of
, the level curves of
are segments of lines with a slope of
, all intersecting at the origin.
According to Proposition 6, if the level curves lie on a line, then the slope of the level curves of residual implications lies within the interval
. Consequently, when the level curves of a residual implication derived from a copula are straight lines intersecting at a common point, this intersection point is represented as
, where
. Given the intersection point
of the extended level curves and noting that they always pass through
, we derive the
equation of the line on which these level curves lie as
. This simplifies to
. Thus,
. It follows from Propositions 5 and 7 that the function
is a residual implication derived from a copula.
For
in
such that
, let
be a function given by
(4)
Then the corresponding copula
is given by
(5)
Figure 1. Graphs of
and
for
and
.
We refer to the copula
as a radial copula. The left side of Figure 1 shows some level curves of
, while the right side displays the graph of the radial copula
.
Remark 1 Here are some special cases of the radial copula: [label = ()]
1. If
, then
, i.e.,
.
2. If
, then
, which is equal to the Archimedean copula family (4.2.7) in [23].
3. If
and
, then
.
4. If
and
, then
Figure 2. Graphical plots of Ca,b for various parameter sets a and b.
The latter part of the formula is equal to the copula
studied in [20, Example 5.14]. In particular, if
, then
. If
, then
.
The class of radial copulas is obviously an asymmetric extension of the Archimedean copula family (4.2.7), as presented in [23]. Notably, the Archimedean copula family (4.2.7) does not include the Frechet-Hoeffding upper bound
. As a result, the class of radial copulas provides a comprehensive extension of the Archimedean copula family (4.2.7). In Figure 2 and Figure 3, we show the radial copula plots and their contour plots for four different parameter value sets
and
.
There are various methods employed to generate observations
for a pair of random variables
characterized by a joint distribution function
. One such method involves utilizing the copula function and the concept of conditional probability, referred to as the conditional distribution method [23]. This approach proposes an algorithm for generating random samples from the copula
using the conditional copula
given
. The algorithm is as follows:
1. Generate two independent uniform random variables
and
from the interval
.
2. Define
, where
represents a quasi-inverse of the function
, and also
. Specifically, the conditional distribution function
is given by
Figure 3. Contour plots of Ca,b for various parameter sets a and b.
and its quasi-inverse
is given by
3. Repeat steps i and ii
times to generate a set of independent and identically distributed pairs
from the copula
.
Figure 4 displays scatter plots derived from 500 simulated data points, which were generated using the previously described algorithm with four distinct sets of parameter values for
and
. Figures 4(c) and 4(d) correspond to the scatter plots of copulas in [25] with
(right) and
(left), respectively.
For any copula
, we have the decomposition
where
(6)
with
representing the absolutely continuous component and
the singular component of the copula
.
If a copula
satisfies
on
, that is, if
can be represented as a joint distribution with density
, then
is absolutely continuous. If
on
, meaning that
almost everywhere, then
is classified as singular [23]. Conversely, if this condition does not hold,
can be decomposed into an absolutely continuous component
and a singular component
.
Figure 4. The scatter plots are derived from 500 simulated observations of Ca,b.
Proposition 8 The radial copula contains an absolutely continuous
and singular components
. Thus we have
and consequently
Proof. The proof is immediate, so it is omitted.
Thus, the
-measure of the singular component of
is expressed as
. In other words, if
and
are uniformly distributed random variables across the interval [0, 1], and their joint distribution is characterized by the copula
, then it follows that
.
In the following proposition, we explore the statistical properties of the copula
. Specifically, we examine the behavior of classical concordance measures, including Kendall’s
and Spearman’s
, under the copula
.
Let
and
be continuous random variables associated with the copula
, where
and
represent their respective marginal distribution functions. Using the probability transformations
and
, Kendall’s
for
and
can be expressed as [23]:
(7)
and Spearman’s
as [23]:
(8)
Proposition 9 Let
and
be continuous random variables of copula
, where
. Then
1. Kendall’s
for the random variables
and
is given by
(9)
which takes values between −1 and 1.
2. Spearman’s
for the random variables
and
is given by
(10)
which also takes values between −1 and 1.
Proof.
1. Given that the copula
contains both an absolutely continuous and a singular component, we apply formula 7 to compute Kendall’s
. Then
and hence
In particular, if
and
, then we have
which can be written as
Similarly, if
, then we obtain
.
2. We use formula 8 and the copula
to derive Spearman’s
. Then we have
In particular, if
and
, then we have
which can be written as
Similarly, if
, then we get
.
The full
range for Kendall’s
and Spearman’s
is theoretically proven in Proposition 9, which establishes that both
and
are constrained within this range for all radial copulas. In practice, the
parameter controls the overall strength of the dependence between the variables, with larger values indicating stronger dependence. The
parameter, on the other hand, primarily affects the tail dependence, with larger values leading to greater correlation at the extreme values. Together,
and
provide flexibility in modeling both general dependence and extreme comovements. In Figure 5, we plot Spearman’s
and Kendall’s
as functions of the dependency parameters
and
, where
is constrained by the condition
. Specifically, it is set as
and further adjusted to 0 if
.
(a) Spearman’s ρ values
(b) Kendall’s τ values
Figure 5. Spearman’s
and Kendall’s
plotted against dependence parameters
and
.
4. Parameter Estimation of Radial Copulas: A Simulation Study
In this section, we estimate the two dependence parameters
of the copula
using the
-inversion method and bivariate L-moments. Furthermore, a simulation study is performed to measure the performance of these two estimation methods.
Tsukahara [24] introduced the rank approximate Z-estimation (RAZ) estimators, which are derived from Kendall’s tau (
) and Spearman’s rho (
). These estimators are referred to as the
-score and
-score RAZ estimators. Let us assume where
, with
and
denoting the sample estimators of Kendall’s tau (
) and Spearman’s rho (
), respectively. The
-inverse estimator
and the
-inverse estimator
of
are defined using a method that is analogous to the method of moments:
When
, the parameters
can be estimated by solving the following system of equations:
We refer to the solution obtained from the preceding system as the
-inversion estimator of
[25]. This method is employed to estimate the parameters
of the radial copula
as shown in the formula 5. The corresponding Kendall’s tau (
) and Spearman’s rho (
) are computed from the formulas 9 and 10, respectively:
The solution of the following system is defined as the
inversion estimator for the parameters
:
(11)
Another key method for estimating bivariate copula models with multiparameter is the application of bivariate L-moments. To begin, we offer a brief overview of univariate L-moments. Hosking [26] proposed L-moments, denoted by
, as an alternative to the traditional central moments
, which are derived from the distribution function (df)
of the random variable
. Using the quantile function
, the
moment
can be represented as follows:
where
denotes the shifted Legendre polynomial [27]. In the following, we will use the first three shifted Legendre polynomials:
The bivariate L-moments (BLM) extend univariate L-moments to the multivariate setting. Next we introduce the fundamental notations and definitions associated with BLM. Let
and
represent two random variables with finite means, the marginal distributions
and
, as well as the respective sequences of
moments denoted by
and
. Serfling et al. [28] introduced the k-th L-comoment of
in relation to
, which corresponds to the covariance between
and
for values of
. This relationship can be expressed as follows:
with asymmetric counterpart
. If
is assumed to be part of a parametric family of distribution functions, the k-th L-comoment
is influenced by both the parameters of the marginal distributionsd and the dependence structure between
and
. Given that our primary goal is to estimate the parameters of the copula, it is more practical to apply the k-th L-comoment of
in relation to
, rather than using
directly. Thus, we define:
This quantity
is referred to as the k-th bivariate copula L-moment of
in relation to
.
Brahimi et al. [25] introduced the k-th bivariate copula L-moment of
in relation to
, which can be expressed, for each
, as
(12)
Using this formula, it is possible to set up a system of equations to estimate the parameters of multivariate copula models. For example, if
and
,
, then based on formula 12, the first two bivariate copula L-moments are provided by [25]
(13)
(14)
Below is the estimation process for the copula
using the BLM method, structured as follows
1. Step 1: Compute the empirical L-moments. Using the observed sample
, the empirical L-moment
is computed as follows:
where for each
,
denotes the rescaled empirical distribution function, given as
, where
is the empirical distribution function.
2. Step 2: Construct the system of equations. Using formula 12, we can derive a system of
equations as follows:
For the radial copula
, according to formulas 13 and 14, the two first bivariate copula L-moments are given by:
3. Step 3: Solve the system
(15)
The solution
obtained is referred to as the BLM estimator for the parameters of the copula
.
A simulation study was conducted to assess the performance of the
-inversion method in comparison with bivariate L-moments for estimating the dependence parameters
of the radial copula. The evaluation of performance is conducted through the assessment of bias and RMSE, which are defined below:
where
represents an estimate of
derived from the
-th sample out of
generated samples obtained from the underlying copula [25]. For this study, we set
. To evaluate the reduction in bias and RMSE as the sample size increases to
, the estimation procedures for both methods were repeated by solving the systems in 11 and 15.
The chosen values for the parameters
must be reasonable, ensuring that each pair reflects a specific level of dependence: weak, moderate, or strong. Table 1 shows the selected values of the real parameters, while Table 2 gives an overview of the bias and RMSE for both the
-inversion estimator and the BLM estimator applied to the radial copula. The results for bias and RMSE show that the BLM estimator consistently outperforms the
-inversion estimator across all sample sizes. The outperformance of the BLM estimator over the
-inversion method is consistent with the findings of Brahimi et al. [25]. This is because the
-inversion method involves solving an optimization problem under constraints, whereas the BLM estimator directly solves a system of equations, which is generally more efficient and stable for parameter estimation. Furthermore, as the sample size increases from
to
, both the bias and RMSE for the parameters
typically decrease, indicating enhanced performance of the estimators with larger samples.
Table 1. Selected true parameters for the radial copula.
|
τ |
a |
b |
Weak |
0.01 |
0.208 |
0.1 |
Moderate |
0.5 |
2.650 |
0.5 |
Strong |
0.8 |
11.647 |
0.9 |
Table 2. Bias and RMSE of radial copula (τ, ρ)-inversion and BLM estimators.
|
τ = 0.01 |
τ = 0.5 |
τ = 0.8 |
|
a = 0.208 |
b = 0.1 |
a = 2.650 |
b = 0.5 |
a = 11.647 |
b = 0.9 |
|
Bias |
RMSE |
Bias |
RMSE |
Bias |
RMSE |
Bias |
RMSE |
Bias |
RMSE |
Bias |
RMSE |
n=50 |
|
|
|
|
|
|
|
|
|
|
|
|
τ − ρ |
0.0282 |
0.309 |
0.0132 |
0.1566 |
−0.053 |
0.1037 |
−0.0176 |
0.1986 |
−0.1477 |
0.2963 |
−0.1277 |
0.2351 |
BLM |
−0.0575 |
0.1059 |
−0.0274 |
0.0664 |
−0.0104 |
0.0261 |
−0.0283 |
0.1195 |
−0.0202 |
0.0313 |
−0.0915 |
0.1455 |
n=100 |
|
|
|
|
|
|
|
|
|
|
|
|
τ − ρ |
0.0227 |
0.3157 |
0.0113 |
0.1596 |
−0.037 |
0.0673 |
−0.0113 |
0.1406 |
−0.138 |
0.264 |
−0.107 |
0.2053 |
BLM |
−0.0314 |
0.0841 |
−0.0159 |
0.0502 |
−0.009 |
0.0185 |
−0.0129 |
0.0782 |
−0.0129 |
0.0245 |
−0.0524 |
0.1007 |
n=200 |
|
|
|
|
|
|
|
|
|
|
|
|
τ − ρ |
0.0239 |
0.3162 |
0.012 |
0.1584 |
−0.0209 |
0.0465 |
−0.0072 |
0.1007 |
−0.1063 |
0.212 |
−0.091 |
0.1749 |
BLM |
−0.0149 |
0.0593 |
−0.0073 |
0.0347 |
−0.009 |
0.0185 |
−0.0129 |
0.0782 |
−0.0076 |
0.0183 |
−0.0288 |
0.0698 |
n=500 |
|
|
|
|
|
|
|
|
|
|
|
|
τ − ρ |
0.0543 |
0.317 |
0.0276 |
0.1587 |
−0.0104 |
0.0302 |
−0.0014 |
0.0666 |
−0.0819 |
0.1478 |
−0.0604 |
0.1217 |
BLM |
−0.0084 |
0.0372 |
−0.0036 |
0.0223 |
−0.0064 |
0.0136 |
−0.0065 |
0.0491 |
−0.002 |
0.0086 |
−0.0071 |
0.0315 |
5. Applications
In this section, we investigate the goodness-of-fit performance of radial copulas under different dependence structures. We assess the fit of radial copulas to two datasets previously analyzed in the literature using different bivariate distributions, including various copulas.
Figure 6. Scatterplot of mean wind speed versus mean ozone.
5.1. New York Air Quality Data
The dataset “air quality” contains daily air quality measurements collected over 153 consecutive days in the New York Metropolitan Area. Two variables to be examined in this study are average wind speed (measured in miles per hour) and average ozone levels (measured in parts per billion). For a comprehensive description of the data, please consult Chambers et al. [29] (Appendix, Dataset 2). This dataset is still accessible in the R package “datasets”.
In our study, we analyze 116 observations after excluding missing data. Figure 6 illustrates a negative correlation between average wind speed and mean ozone levels. This is further confirmed by the negative empirical values of Spearman’s rho (−0.590) and Kendall’s tau (−0.428). To further explore this correlation, we use the bivariate L-moments estimator to model their dependence structure through the radial copula, obtaining the dependence parameters
. We then use formulas (9) and (10) to calculate the concordance measures for the radial copula: Spearman’s rho is
and Kendall’s tau is
.
The goodness-of-fit of the radial copula
is assessed using the Kolmogorov-Smirnov (KS) and Cramer-von Mises (CVM) statistics, applying the bootstrap algorithm presented by Genest et al. [30]. These two tests are related to the empirical process
, and their corresponding statistics are given by
respectively. Furthermore, various well-known copula families commonly used to model negative dependence are applied to the New York air quality data; however, none yield significant results. Therefore, our results are primarily comparable with those previously described by Ghosh et al. [31] and El Ktaibi et al. [32], which are based on different negatively dependent copulas. Specifically, the copulas in these studies have the following definitions:
and
where
and
.
Table 3 shows that our model exhibits an excellent fit to the data, presenting the highest p-values among the models considered. It also outperforms the models presented by Ghosh et al. [31] and El Ktaibi et al. [32]
Table 3. Goodness-of-fit test for air quality data.
|
KS |
CVM |
|
Test statistic |
p-value |
Test statistic |
p-value |
Ca,b |
Tn = 0.077 |
0.585 |
Sn = 0.099 |
0.269 |
Cθ |
Tn = 0.078 |
0.584 |
Sn = 0.120 |
0.219 |
Cα |
Tn = 0.099 |
0.232 |
Sn = 0.224 |
0.039 |
5.2. Twins Data
This dataset, as summarized by Ashenfelter et al. [33], contains hourly wages for 149 pairs of identical twins in the United States, each with varying levels of education. The study focuses on individuals aged 18 and older. The two variables under examination are the hourly wages (measured in log U.S. dollars) of each twin sibling. This dataset is also available in the supplementary material provided by Tang et al. [34]. A plot of these data is shown in Figure 7, where the observations tend to cluster around the major diagonal of the unit square. The calculated values of Spearman’s rho and Kendall’s tau for the hourly wages of each twin pair are 0.558 and 0.421, respectively, indicating a positive correlation between these variables.
Next, we fit the data using the bivariate L-moments estimator to model their dependence structure through the radial copula, obtaining the dependence parameters
. We then apply formulas (9) and (10) to compute the concordance measures for the radial copula. Specifically, Spearman’s rho is
, and Kendall’s tau is
. To assess the goodness-of-fit of the radial copula
, we use the KS and CVM statistics with the bootstrap algorithm from Genest et al. [30]. When fitting the twins dataset, various classical copula families commonly used to model positive dependence are considered, including Plackett, Clayton, and Gumbel copulas [35] [36]. As shown in Table 4, our model demonstrates superior goodness-of-fit to the twins data, as indicated by its relatively high p-value compared to other models. The improved performance of our model can be attributed to the increased flexibility of radial copulas in capturing complex nonlinear dependencies. In contrast, Archimedean copulas, while useful in certain contexts, may face limitations when attempting to capture such complex relationships due to their more structured and simpler forms.
Table 4. Goodness-of-fit test for twins data.
|
KS |
CVM |
|
Test statistic |
p-value |
Test statistic |
p-value |
Ca,b |
Tn = 0.052 |
0.854 |
Sn = 0.102 |
0.638 |
Gumble |
Tn = 0.056 |
0.864 |
Sn = 0.119 |
0.527 |
Plackett |
Tn = 0.062 |
0.777 |
Sn = 0.104 |
0.630 |
Clayton |
Tn = 0.078 |
0.492 |
Sn = 0.112 |
0.620 |
Figure 7. Scatter plot of hourly wages of identical twins.
6. Conclusions
In this paper, we have examined a class of copulas called radial copulas, which is derived from residual implications where the extensions of level curves intersect at a point. The class of radial copulas is a comprehensive and asymmetric extension of the Archimedean copula family (4.2.7) presented in [23]. Notably, radial copulas can capture both positive and negative dependencies. A simulation analysis is performed to evaluate the effectiveness of the
-inversion and bivariate L-moments estimations methods in different modes. Furthermore, the efficacy of radial copulas is demonstrated through simulations and real case studies on two datasets (New York air quality dataset and twins dataset). We suggest that the radial copula based approach is straightforward in simulation and explanation and will be an important addition to copula theory. While applying radial copulas to high-dimensional datasets presensts challenges, such as increased computational complexity and difficulties in parameter estimation, these issues are acknowledged and will be addressed in future research.
Acknowledgement
Sincere thanks to the members of JAMP for their professional performance, and special thanks to managing editor Hellen XU for a rare attitude of high quality.