Higher-Order Expansions of Sample Range from the Skew-t-Normal Distribution ()
1. Introduction
A random variable
is said to have a skew-t-normal distribution [1] with scale
, degrees of freedom
and shape parameter
(written as
) if its probability density function (pdf) is
(1)
where
denotes the standard normal cumulative distribution function (cdf) with
denoting the Gamma function. The skew-t-normal distribution is a probabilistic model capable of capturing both heavy-tailedness and pronounced skewness. It exhibits a substantially wider range of skewness compared to the skew-normal distribution [2] and related families, thereby offering greater flexibility in modeling data that deviates significantly from normality. These characteristics make this distribution deeply studied and widely applicable in practical applications, for example, the study of Vanadium pollution in Shadegan Wetland [3] and automated flow cytometry analysis [4]. Further practical applications of the skew-t-normal distribution can be located in [5]-[7].
Let
be a sequence of independent and identically distributed random variables with common marginal distribution function
following the skew-t-normal distribution, and let
and
denote the partial maximum and minimum. If
is an extreme value distribution, we say a distribution
is in the domain of attraction of
(written
) if there exists
,
,
such that
(2)
weakly as
where
is of the type of one of the following classes:
(3)
for some
(cf. [8]). In extreme value analysis, accurately determining the domain of attraction to which a distribution belongs is a prerequisite for establishing its limiting distribution and even higher-order expansions under appropriate normalization.
Yang and Hu (2025)1 proved that under different values of
, the skew-t-normal distribution belongs to the domain of attraction of different extreme value distribution with different normalizing constants, i.e.:
(4)
with
(5)
and
(6)
with
(7)
where the norming constants
,
,
and
satisfy
(8)
and for fixed
,
and
have a special relationship:
.
For independent and identically distributed (i.i.d.) random sequences, in addition to the asymptotic behavior of the normalized partial maxima and minima and their joint distribution, the asymptotic behavior of the sample range has also been studied. The weak convergence of the range was first discussed by De Haan (1974) [9]. Subsequently, Barakat and Nigm (1990) [10] extended this work by studying the weak convergence of the sample range, as well as the case where the sample size is a positive integer random variable. Later, Tan et al. (2007) [11] proved the almost sure convergence of the sample range, and Zang (2014) [12] derived an almost inevitable local central limit theorem for it under mild assumptions. Furthermore, Matula and Adler (2022) [13] proved the law of large numbers for the sample range of the Pareto distribution. More recently, Zhang and Lu (2023) [14] studied the higher-order expansion of the sample range for the skew normal distribution, while Lu et al. (2023) [15] established the distributional expansions of the normalized sample range from the general error distribution.
Shifting focus to the skew-t-normal distribution, Yang and Hu (2025) focused on the skew-t-normal distribution, discussing the convergence rates of its maximum and minimum distribution functions, as well as their joint distribution and density functions. However, the asymptotic theory for the range of skew-t-normal samples remains unexplored. Therefore, to fill this gap in the literature, this paper will systematically derive the higher-order asymptotic expansion of the range distribution for skew-t-normal samples. The significance of this work resides in its capacity to utilize high-order expansions for improving the precision and convergence of the limiting distribution approximation, furnishing a more dependable theoretical framework for statistical inference concerning extreme events.
The rest of this article is arranged as follows. Section 2 presents the main results. Section 3 compares the accuracy between the actual values and the approximate values. Section 4 gives some auxiliary lemmas and the proofs of the main results are given in Section 5. Section 6 concludes with a summary.
2. Main Results
In this section, we provide the limiting distribution and convergence rates of the distribution of
and its higher-order distributional expansion under different value of
with different normalized constants.
Theorem 1. Let
be a sequence of independent random variables with common distribution
. Denote
and
and
and
given by (5) and (7). Then, we have
(i). for
,
(9)
and
(10)
where
(11)
and
(12)
with
(13)
and
(14)
(ii). for
,
(15)
and
(16)
where
and
are obtained from
and
by replacing
with
.
Remark 1. For normalized sample range with norming sequence
and
, where
.
is given by Yang and Hu (2025), its limit distribution of the sample range
is consistent with the conclusion of Theorem 1, and the corresponding higher-order expansions of sample range can also be obtained, that is,
(i). for
,
(17)
and
(18)
where
(19)
and
(20)
with
(21)
and
(22)
(ii). for
,
(23)
and
(24)
where
and
are obtained by replacing
with
from
and
given in (19) and (20).
Note that
and
. From Theorem 1 and Remark 1, the convergence rate of the distribution of sample range
is
under normalized constants
and
, while it is
under normalized constant
.
3. Numerical Analysis
In this section, numerical studies are presented to illustrate the accuracy of higher-order expansions of the cdf of the normalized
. Let
and
,
, denote the first-order, the second-order and the third-order asymptotics of the cdf of the normalized
. From Theorem 1, we have
Table 1. The absolute errors between actual values and its approximations of the cdf of the normalized
with
for
.
|
|
|
|
|
0.009350061961 |
0.009350061959 |
0.009350061941 |
|
0.037794456846 |
0.037794456846 |
0.037794456481 |
|
0.007291688814 |
0.007291688815 |
0.007291687969 |
|
0.002310197299 |
0.002310197299 |
0.002310196367 |
|
0.006727253763 |
0.006721065231 |
0.006721065216 |
|
0.027098930697 |
0.027098930654 |
0.027098930363 |
|
0.005664154550 |
0.005664154550 |
0.005664153874 |
|
0.001877767574 |
0.001877767574 |
0.001877766829 |
|
0.013532735646 |
0.013532735640 |
0.013532735576 |
|
0.016635733516 |
0.016635733516 |
0.016635733121 |
|
0.003804335347 |
0.003804335347 |
0.003804334512 |
|
0.001415812591 |
0.001415812591 |
0.001415811643 |
|
0.010045832712 |
0.010045832705 |
0.010045832641 |
|
0.012265049126 |
0.012265049085 |
0.012265048684 |
|
0.002886327334 |
0.002886327334 |
0.002886326488 |
|
0.001106975657 |
0.001106975657 |
0.001106974696 |
Table 2. The absolute errors between actual values and its approximations of the cdf of the normalized
with
for
.
|
|
|
|
|
0.005999173742 |
0.005999173741 |
0.005999173739 |
|
0.017938460495 |
0.017938460495 |
0.017938460452 |
|
0.002808046188 |
0.002808046188 |
0.002808046088 |
|
0.001017960224 |
0.001017960224 |
0.001017960113 |
|
0.004541237239 |
0.004539342334 |
0.004539342333 |
|
0.013260117764 |
0.013260117751 |
0.013260117723 |
|
0.002125050279 |
0.002125050279 |
0.002125050216 |
|
0.000849008491 |
0.000849008491 |
0.000849008421 |
|
0.004593492428 |
0.004593492426 |
0.004593492418 |
|
0.004988635638 |
0.004988635638 |
0.004988635593 |
|
0.000721496906 |
0.000721496906 |
0.000721496810 |
|
0.000000628940 |
0.000000628940 |
0.000000628951 |
|
0.003477885996 |
0.003477885994 |
0.003477885989 |
|
0.003877077425 |
0.003877077413 |
0.003877077377 |
|
0.000861162434 |
0.000861162434 |
0.000861162358 |
|
0.000371306232 |
0.000371306233 |
0.000371306146 |
Table 3. The absolute errors between actual values and its approximations of the cdf of the normalized
with
for
.
|
|
|
|
|
0.009350061808 |
0.009350061806 |
0.009350061788 |
|
0.037794449719 |
0.037794449719 |
0.037794449354 |
|
0.007291668033 |
0.007291668033 |
0.007291667187 |
|
0.002310175903 |
0.002310175903 |
0.002310174971 |
|
0.006727254051 |
0.006721065519 |
0.006721065504 |
|
0.027098980537 |
0.027098980494 |
0.027098980202 |
|
0.005664175501 |
0.005664175501 |
0.005664174826 |
|
0.001877789343 |
0.001877789343 |
0.001877788599 |
|
0.013533517090 |
0.013533517084 |
0.013533517020 |
|
0.016642050457 |
0.016642050457 |
0.016642050061 |
|
0.003818218828 |
0.003818218828 |
0.003818217994 |
|
0.001430121982 |
0.001430121982 |
0.001430121034 |
|
0.010045834757 |
0.010045834751 |
0.010045834686 |
|
0.012265089309 |
0.012265089267 |
0.012265088867 |
|
0.002886297768 |
0.002886297768 |
0.002886296922 |
|
0.001106665963 |
0.001106665963 |
0.001106665002 |
Table 4. The absolute errors between actual values and its approximations of the cdf of the normalized
with
for
.
|
|
|
|
|
0.005999364766 |
0.005999364765 |
0.005999364763 |
|
0.017944923928 |
0.017944923929 |
0.017944923885 |
|
0.002824078186 |
0.002824078186 |
0.002824078085 |
|
0.002001450902 |
0.002001450902 |
0.002001450791 |
|
0.004541252964 |
0.004539358060 |
0.004539358059 |
|
0.013292745311 |
0.013292745298 |
0.013292745271 |
|
0.002126531624 |
0.002126531624 |
0.002126531561 |
|
0.000708637712 |
0.000708637712 |
0.000708637643 |
|
0.004614674689 |
0.004614674687 |
0.004614674680 |
|
0.005183691518 |
0.005183691518 |
0.005183691473 |
|
0.001109438827 |
0.001109438827 |
0.001109438732 |
|
0.000390670111 |
0.000390670111 |
0.000390670003 |
|
0.003463618270 |
0.003463618268 |
0.003463618263 |
|
0.003877046225 |
0.003877046212 |
0.003877046176 |
|
0.000832418148 |
0.000832418148 |
0.000832418071 |
|
0.000295375652 |
0.000295375652 |
0.000295375566 |
(25)
for
; and
(26)
for
, where
,
,
and
are given in Theorem1, respectively.
Next, we use
to calculate the absolute errors with sample sizes
and
under normalized constants
,
,
and
. From Tables 1-4, we have the following findings: 1) The accuracy of higher-order asymptotics improves with increasing
; 2) First-order, second-order and third-order asymptotics are all close to the actual values; 3) The accuracy of the three approximations is comparable, as the second-order and third-order corrections do not lead to a significant improvement in the approximation. Therefore, it cannot be concluded that the third-order approximation is markedly superior; it only marginally approaches the true value in numerical terms. To achieve further improvements in accuracy, subsequent studies could employ the same approach as presented in this work to calculate higher-order terms.
4. Auxiliary Lemmas
Before proving the main conclusion, we first give the following lemmas. Lemma 1 and Lemma 2 have been proved by Yang and Hu (2025).
Lemma 1. Let
denote the cdf of
. For large
, we have
(i). for
,
(27)
where
and
.
(ii). for
,
(28)
where
.
Lemma 2. Let
be the density function of the skew-t-normal distribution.
(i). for
, we have
(29)
(ii). for
, we have
(30)
Lemma 3. With
,
,
and
given by (5) and (7), we have
(i). for
,
(31)
(ii). for
,
(32)
where
,
,
and
are given in Theorem 1.
Proof. 1). For
, by lemma 1 and
, we have
(33)
for large
. Hence by (33), we get
(34)
with
and
given in Theorem 1. It can be easily derived that as
(35)
So, combining with (35), we can obtain
(36)
(ii). The proof is based on a similar approach to that used in the proof of (i). For
, by (27) and (28), we have
(37)
for large
. Thus by (37) we have
(38)
Similar to (35), we have
(39)
and then combining with (39) we can get
(40)
□
5. Proofs
Proof of Theorem 1. It follows from Yang and Hu (2025) that the density function of the normalized
is
(41)
(i). For
, let
be the density function of
. By the convolution formula, we can get
. Then we have
(42)
Note that
and
, so we have
(43)
It follows from the dominated convergence theorem, (29) and (31), we have
(44)
as
.
(ii). Similarly, for
, let
be the density function of
. Then
and
satisfy
, so we have
(45)
Since
and
, then we have
(46)
Combining with (30) and (32), we have
(47)
as
. The proof is finished. □
6. Conclusions
This paper systematically investigates the limiting distribution properties of the normalized sample range under the skew-t-normal distribution. We first derive higher-order expansions for its extreme value distribution and normalization constants, thereby determining the optimal convergence rate of the distribution of normalized sample range
. In particular, the convergence rate obtained remains of the same order regardless of whether the skewness parameter
is positive or negative, reflecting an intrinsic symmetry in the distributional morphology with respect to its convergence behavior. More importantly, a comparative analysis in this study reveals that the skew-t-normal, skew-normal (cf. [14]), and general error distributions (cf. [15]) have the same order of optimal convergence rate for the normalized sample range under normalized constants
and
. However, the specific forms and convergence behaviors of the higher-order (second- and third-order) expansions of sample range for the skew-t-normal distribution are further dictated by the interplay between the skewness parameter
and the degrees of freedom
.
In the numerical analysis section, we observe that the second- and third-order asymptotics provide no significant improvement in accuracy, indicating that further incorporation of lower-order terms within the current expansion framework offers limited refinement. To achieve substantial breakthroughs in approximation precision, future work must account for the influence of fourth-order and higher-order terms. Therefore, systematic investigation into the mathematical structure and numerical effects of higher-order expansion terms represents a critical pathway to overcoming current precision bottlenecks. Simultaneously, the limitations inherent in the current expansion order clearly delineate the theoretical boundaries of existing methodologies and highlight principal directions for subsequent research.
NOTES
1The study by Yang, W. and Hu, S. (“Higher-order expansions of sample extremes from the skew-t-normal distribution”) has been submitted to Communications in Statistics-Theory and Methods. The manuscript is currently under reconsideration by the journal, with reviewers evaluating the revised version. It is not listed in the bibliography to prevent circular citation. A copy of the manuscript can be provided by the corresponding author, should it be required.