The Estimation of the Spot Volatility for Diffusion Process

Abstract

In this paper, we propose a Gasser-Müller type spot volatility estimator (abbreviated as GM type estimator) for diffusion process, which is weighted by integrals, it is different from the kernel spot volatility estimator discussed by Kristensen (2010). Under more general conditions, the asymptotic unbiasedness and the asymptotic normality of the GM type estimator are derived. The simulation results show that the GM type spot volatility estimator has good estimation effect, and its mean square error tends to be less than that of the kernel spot volatility estimator discussed by Kristensen (2010), so it provides a selection method for estimating the spot volatility in high frequency data environment.

Share and Cite:

Xu, W. , Yang, X. and Yang, S. (2021) The Estimation of the Spot Volatility for Diffusion Process. Open Journal of Statistics, 11, 303-318. doi: 10.4236/ojs.2021.112017.

1. Introduction

Volatility is the main component of describing the price movement towards financial market, so we need to use the observed actual data to estimate volatility. However, the traditional low-frequency data cannot meet the needs of high-frequency traders in today’s market, and the common models in the traditional low-frequency field are no longer applicable to the high-frequency field. In order to avoid errors caused by setting any model, more and more scholars consider using non-parametric methods to estimate volatility in high-frequency data.

Consider the diffusion process { X t } = { X t : t 0 } , which satisfies that

d X t = μ t d t + σ t d W t , (1.1)

where { W t } is a standard Brownian motion, while { μ t } and { σ t } are adapted stochastic processes. The process { σ t } is usually denoted the spot volatility process, while { μ t } is the drift process. In the diffusion process, the volatility is an important measure to reflect the volatility of financial products’ logarithmic price.

Under the condition that asset prices obey the diffusion process, the quadratic variation of the price process converges in probability to the integrated volatility. So quadratic variation, also known as realized volatility, is a consistent estimator of integrated volatility, and has attracted more and more attention. For example, Barndorff—Nielsen and Shephard [1] (2002) prove the central limit theorem of realized volatility. Christensen and Podolskij [2] (2005) give another estimator of integral volatility, range-based realized volatility, and prove the consistency and central limit theorem of this estimator under diffusion model. A nonparametric estimator is proposed by Li Guo and Bao [3] (2013) for the class of integrated self-weighted cross volatilities. Under assuming that processes X and Y are sampled with microstructure noise and in an asynchronous way, the asymptotic properties, which include consistency and asymptotic normality, are obtained. There is still a lot of further research on the integrated volatility estimator, see also, Zhang, Mykland and At-Sahalia [4] (2005), Hansen, Large and Lunde [5] (2008), Li, et al. [6] (2013), Jing and Liu and Kong [7] (2015) and Li, Xie and Zheng [8] (2016).

Merton [9] (1976) first proposed the jump-diffusion model. Later, Duffie, Pan and Singleton [10] (2000), Pan [11] (2002), Eraker, Johannes and Polson [12] (2003) found that asset price process contains improper mutation behavior, so jump part should be added in diffusion model. Previous studies have shown that integrated volatility is generally estimated by realized multiple power variation and realized threshold volatility when asset prices obey jump-diffusion model. Barndorff-Nielsen and Shephard [13] [14] (2004, 2006) proposed realized power variation and realized bipower variation, and proved that realized bipower variation is a robust consistent estimator of integrated volatility with jump process. By using the properties of Brown’s motion path, when the price obeys the jump-diffusion model with finite jump, another estimator of integrated volatility, the realized threshold volatility, is proposed, and the central limit theorem is obtain by Mancini [15] [16] (2004, 2009). Christensen and Podolskij [17] (2012) put forward the realized multiple power variation of based-range. They also show the robustness of the estimator to jump behavior in probability limit and asymptotic distribution. The further research can be seen in Todorov [18] [19] (2009, 2010), At-Sahalia and Jacod [20] [21] (2009, 2011), Tauchen and Zhou [22] (2011).

The integrated volatility characterizes the total volatility over a certain period of time for the past, but investors pay more attention to the spot volatility at a certain time, so we need to use high-frequency data to estimate the spot volatility. As the price process obeys the diffusion process, the rolling sample estimator for spot volatility was first proposed by Foster and Nelson [23] (1996). This rolling sample method is extended by Andreou and Ghysels [24] (2002) to construct the rolling sample variance to estimate the quadratic variation of asset returns, and a rolling sample estimator with many similar characteristics to the instantaneous volatility estimator is proposed. They further discussed the asymptotically effective bandwidth and weighted form of the quadratic variation estimator, and established the relationship between the instantaneous volatility estimator and the integrated volatility estimator.

Moreover, Fan and Wang [25] (2008), Shigeyoshi and Simona [26] (2011), Zu and Boswijk [27] (2014), Yu, et al. [28] (2014), Mancini, Mattiussi and Renò [29] (2015), Curato, Mancino and Recchioni [30] (2018) and Liu, et al. [31] (2018) et al. have done a lot of research on instantaneous volatility estimation. Fan and Wang [25] (2008) employs a bivariate diffusion to model the price and volatility of an asset and investigates kernel type estimators of spot volatility based on high-frequency return data. And they establish both pointwise and global asymptotic distributions for the estimators. Zu and Peter [27] (2014) constructs a spot volatility estimate for high frequency financial data with market microstructure noise. At the same time, the consistency of the estimator is proved and the asymptotic distribution of the estimator is derived. Curato, Mancino and Recchioni [30] (2018) defines a new nonparametric estimation of spot volatility based on the relationship between Laplace transformation of price process and fluctuation process of Brownian semi-martingale model, and proves the consistency and asymptotic normality of the estimation. The simulation results confirm the theoretical results of the proposed estimator in the presence of noise effects on the microstructure.

According to the idea of Nadaraya-Watson kernel regression estimation, Kristensen [32] (2010) proposes the NW type kernel estimation of spot volatility σ t

σ ^ τ 2 = i = 1 n K h ( t i τ ) ( Δ X t i ) 2 , τ [ 0 , T ] (1.2)

where T is the time interval, t i = i δ , i = 0 , 1 , , n , δ = T / n , Δ X t i = X t i X t i 1 , K h ( s τ ) = K ( s τ h ) / h , K : R R is a kernel, h > 0 is the bandwidth. And the asymptotic property of the kernel estimator is proved in Kristensen [32] (2010).

In non-parameter kernel regression estimation, there are two popular kernel methods, Nadaraya-Watson estimator μ ^ n , N W ( t ) and Gasser-Müller estimator μ ^ n , G M ( t ) . Gasser and Müller [33] (1979) gave

E ( μ ^ n , N W ( t ) ) = h 1 0 1 K ( t s h ) μ ( s ) d s + O ( n γ μ + ( n h ) γ K + n ( δ 0 1 ) ) , (1.3)

and

E ( μ ^ n , G M ( t ) ) = h 1 0 1 K ( t s h ) μ ( s ) d s + O ( n γ μ ) , (1.4)

where γ μ > 0 , γ K > 0 and δ 0 > 1 . The two equations imply that Gasser-Müller estimator μ ^ n , G M ( t ) has a better remainder term for the bias than Nadaraya-Watson estimator μ ^ n , N W ( t ) .

In view of this advantage, this paper will study the Gasser-Müller integrated type spot volatility estimator (or the GM type spot volatility estimator)

σ ^ τ , G M 2 = 1 δ i = 1 n ( Δ X t i ) 2 t i 1 t i K h ( s τ ) d s , τ [ 0 , T ] . (1.5)

We will demonstrate the asymptotic property of the GM type instantaneous volatility estimator under more general conditions, including asymptotic unbias and asymptotic normality.

The remainder of the paper is organized as follows. In Section 2, asymptotic property of the GM type instantaneous volatility estimator are proved. Simulation study is put in Section 3. Finally, our conclusion is given in Section 4.

2. Asymptotic Property of Estimation

Throughout the paper, we will use the following basic assumptions.

1) Let δ = T / n , t i = i δ , i = 0 , 1 , , n .

2) (a) The processes { μ t } and { σ t } are jointly independent of { W t } , absolutely integral and locally bounded; (b) There exist constants λ > 0 and C > 0 such that

E | σ t σ t | C | t s | λ . (2.1)

(c) Let f ( t ) = σ t 2 . Assume that f ( t ) is m times differentiable where m 0 , and there exist constants α > 0 and C > 0 such that

E | f ( m ) ( t ) f ( m ) ( τ ) | C | t τ | α . (2.2)

3) (a) The kernel K ( u ) is bounded in R 1 and satisfies that + K ( u ) d u = 1 , + | K ( u ) | d u < , + u j K ( u ) d u = 0 for j = 1 , 2 , , m , and + | u | r | K ( u ) | d u < for some r m + α and r λ ; (b) K ( u ) satisfies the βth order Lipschitz condition where β > 0 .

4) As n , h n 0 and n h n 1 + 1 / β .

Notice that when f ( t ) is not differentiable, m = 0 in (c) of (2). In this case, the conditions (2.1) and (2.2) are identical.

Bandi and Nguyen [34] (2003) and Bandi and Phillips [35] (2003) assumed that σ t and μ t are at least twice continuously differentiable and satisfy Lipschitz condition. Kristensen [32] (2010) relaxed the restriction of differentiability and allowed the order of differentiability m 0 . Our assumption (2) is to refer to the assumption A.2 - A.4 in Kristensen [32] (2010).

In this section, we first establish the asymptotic unbias of the kernel estimator (1.5), and then study its asymptotic normality.

Theorem 2.1. Assume that the conditions (1)-(4) hold. Then

E ( σ ^ τ , G M 2 | { σ t } ) σ τ 2 = O p ( n 1 + 1 n β h n 1 + β + h n m + α ) , (2.3)

furthermore

E ( σ ^ τ , G M 2 σ τ 2 ) = O ( n 1 + 1 n β h n 1 + β + h n m + α ) . (2.4)

Proof We introduce some denotations. Let Δ X ˜ t i = t i 1 t i σ s d W s and

σ ^ ˜ τ , G M 2 = 1 δ i = 1 n ( Δ X ˜ t i ) 2 t i 1 t i K h ( s τ ) d s . (2.5)

Obviously,

σ ^ τ , G M 2 σ ^ ˜ τ , G M 2 = 1 δ i = 1 n ( t i 1 t i μ s d s + t i 1 t i σ s d W s ) 2 t i 1 t i K h ( s τ ) d s 1 δ i = 1 n ( t i 1 t i σ s d W s ) 2 t i 1 t i K h ( s τ ) d s = 1 δ i = 1 n ( t i 1 t i μ s d s ) 2 t i 1 t i K h ( s τ ) d s + 2 δ i = 1 n t i 1 t i μ s d s t i 1 t i σ s d W s t i 1 t i K h ( s τ ) d s . (2.6)

Due to boundedness of μ t , it follows that

1 δ | i = 1 n ( t i 1 t i μ s d s ) 2 t i 1 t i K h ( s τ ) d s | C δ i = 1 n t i 1 t i | K h ( s τ ) | d s = C δ 0 T | K h ( s τ ) | d s C δ | K ( u ) | d u C n 1 . (2.7)

Note that

t i 1 t i σ s d W s = l a w ( t i 1 t i σ s 2 d s ) 1 / 2 U i , (2.8)

where U 1 , U 2 , , U n are independently and identically standard normal N ( 0,1 ) and independent of the process σ . Let

A n = 2 δ i = 1 n t i 1 t i K h ( s τ ) d s t i 1 t i μ s d s ( t i 1 t i σ s 2 d s ) 1 / 2 U i , (2.9)

Since E ( A n | { σ t } ) = 0 , from (2.6)-(2.8), we find that

E ( σ ^ τ , G M 2 | { σ t } ) E ( σ ^ ˜ τ , G M 2 | { σ t } ) = O ( n 1 ) . (2.10)

Therefore, it is sufficient to show that

E ( σ ^ ˜ τ , G M 2 | { σ t } ) σ τ 2 = O p ( 1 n β h n 1 + β + h n m + α ) . (2.11)

By (2.8), we have that

E [ ( Δ X ˜ t i ) 2 | { σ t } ] = E [ ( t i 1 t i σ s d W s ) 2 | { σ t } ] = t i 1 t i σ s 2 d s , (2.12)

and

E ( σ ^ ˜ τ , G M 2 | { σ t } ) σ τ 2 = 1 δ i = 1 n t i 1 t i K h ( s τ ) d s E ( ( Δ X ˜ t i ) 2 | { σ t } ) σ τ 2 = 1 δ i = 1 n t i 1 t i K h ( s τ ) d s t i 1 t i σ v 2 d v σ τ 2 . (2.13)

By the mean value theorem of integrals, there exist t ˜ i [ t i 1 , t i ] ( i = 1 , 2 , , n ) such that

E ( σ ^ ˜ τ , G M 2 | { σ t } ) σ τ 2 = i = 1 n K h ( t ˜ i τ ) t i 1 t i σ v 2 d v σ τ 2 . (2.14)

We know that K ( u ) satisfies the βth Lipschiztz condition and σ t 2 is bounded in [ 0, T ] . It follows that

| i = 1 n K h ( t ˜ i τ ) t i 1 t i σ v 2 d v 0 T K h ( v τ ) σ v 2 d v | = | i = 1 n t i 1 t i { K h ( t ˜ i τ ) K h ( v τ ) } σ v 2 d v | C i = 1 n t i 1 t i 1 h n | t ˜ i v h n | β σ v 2 d v C δ β h n 1 + β 0 T σ v 2 d v C δ β h n 1 + β , (2.15)

hence

E ( σ ^ ˜ τ , G M 2 | { σ t } ) σ τ 2 = 0 T K h ( v τ ) σ v 2 d v σ τ 2 + O ( δ β h n 1 + β ) = τ h n T τ h n K ( u ) σ τ + u h n 2 d u σ τ 2 + O ( δ β h n 1 + β ) = τ h n T τ h n K ( u ) σ τ + u h n 2 d u σ τ 2 τ h n T τ h n K ( u ) d u + O ( δ β h n 1 + β ) . (2.16)

Recalling f ( t ) = σ t 2 in the assumption (2) and f ( t ) is m times differentiable and its mth derivative f ( m ) ( t ) satisfies the αth order Lipschitz condition in L 1 ( [ 0, T ] ) where m 0 and α > 0 . By Taylor formula, we have that

τ h n T τ h n K ( u ) σ τ + u h n 2 d u σ τ 2 τ h n T τ h n K ( u ) d u = j = 1 m 1 f ( j ) ( τ ) h n j j ! τ h n T τ h n u j K ( u ) d u + h n m m ! τ h n T τ h n f ( m ) ( τ + θ u h n ) u m K ( u ) d u , (2.17)

where | θ | 1 . Note that + u j K ( u ) d u = 0 , j = 1 , 2 , , m and | u | r | K ( u ) | d u < where r m + α , we have

τ h n T τ h n u j K ( u ) d u = τ h n u j K ( u ) d u T τ h n u j K ( u ) d u = O ( h n r j ) , (2.18)

while

τ h n T τ h n f ( m ) ( τ + θ u h n ) u m K ( u ) d u = τ h n T τ h n [ f ( m ) ( τ + θ u h n ) f ( m ) ( τ ) ] u m K ( u ) d u f ( m ) ( τ ) τ h n u m K ( u ) d u f ( m ) ( τ ) T τ h n u m K ( u ) d u = τ h n T τ h n [ f ( m ) ( τ + θ u h n ) f ( m ) ( τ ) ] u m K ( u ) d u + O ( h n r m ) , (2.19)

and

E | τ h n T τ h n [ f ( m ) ( τ + θ u h n ) f ( m ) ( τ ) ] u m K ( u ) d u | τ h n T τ h n E | f ( m ) ( τ + θ u h n ) f ( m ) ( τ ) | | u | m | K ( u ) | d u C τ h n T τ h n | θ u h n | α | u | m | K ( u ) | d u = O ( h n α ) . (2.20)

Combining (2.17)-(2.20), we find that

τ h n T τ h n K ( u ) σ τ + u h n 2 d u σ τ 2 τ h n T τ h n K ( u ) d u = O p ( h n m + α ) . (2.21)

We obtain the desire result (2.11) from (2.16) and (2.21). Complete the proof.

Theorem 2.2. Assume that assumptions (1)-(3) hold, and β > 1 / 2 in (b) of (3). If the bandwidth h n satisfies

n h n ( 1 / 2 + β ) / ( β 1 / 2 ) , (2.22)

n h n 2 ( m + α ) + 1 0, (2.23)

then

δ 1 h n ( σ ^ τ , G M 2 σ τ 2 ) 2 σ τ 4 K 2 ( z ) d z d N ( 0,1 ) . (2.24)

Here we give some explanations for the conditions.

Remark 2.1. Obviously, h n 0 from (2.23). Note that ( 1 / 2 + β ) / ( β 1 / 2 ) > 1 + 1 / β as β > 1 / 2 , so that n h n 1 + 1 / β from (2.23). It implies that the assumption (4) naturally holds under (2.22) and (2.23). Therefore, the assumption (4) doesnt appear in the assumption of Theorem 2.2.

Remark 2.2. Normally, the used kernel function K ( u ) is derivative bounded. Thus, K ( u ) usually satisfies β 1 order Lipschitz condition. In this case, 1 < ( 1 / 2 + β ) / ( β 1 / 2 ) 3 . Thus it can be seen that (2.22) is a reasonable condition.

Remark 2.3. The condition (2.23) is used in Kristensen [32] (2010, Theorem 3 and Theorem 4). It requires that σ t 2 has smoothness. If m = α = 1 , then (2.23) is equivalent to n h n 5 0 , while If m = 2 and α = 1 (refer to Bandi and Nguyen [34] (2003) and Bandi and Phillips [35] (2003)), then (2.23) is equivalent to n h n 7 0 .

To prove the theorem, we introduce the following lemmas.

Lemma 2.1. (Rosenthal [36], 1970) Let { ξ i : i 1 } be a sequence of independent random variables with E ( ξ i ) = 0 and E ( | ξ i | p ) < for some p 2 . Then there exists constant C no depending on n such that

E | i = 1 n ξ i | p C { i = 1 n E | ξ i | p + ( i = 1 n E ξ i 2 ) p / 2 } . (2.25)

Lemma 2.2. Let the kernel K ( u ) satisfies + | K ( u ) | d u < , + | u | r | K ( u ) | d u < where r > 0 , and there exists constants β > 0 and C > 0 such that

| K ( u ) K ( v ) | C | v v | β for any u , v R . (2.26)

If h 0 and n h 1 + 1 / β , then for any given τ ( 0, T ) ,

δ i = 1 n | K h ( s i τ ) | = | K ( u ) | d u + O ( h r + ( n h 1 + 1 / β ) 1 ) . (2.27)

where t i 1 s i t i .

Proof By the mean value theorem, there exist t ^ i [ t i 1 , t i ] such that

i = 1 n t i 1 t i | K h ( s τ ) | d s = δ n i = 1 n | K h ( t ^ i τ ) | . (2.28)

Hence,

δ i = 1 n | K h ( s i τ ) | = δ i = 1 n | K h ( s i τ ) | δ i = 1 n | K h ( t ^ i τ ) | + i = 1 n t i 1 t i | K h ( s τ ) | d s δ i = 1 n | | K h ( s i τ ) | | K h ( t ^ i τ ) | | + 0 T | K h ( s τ ) | d s δ i = 1 n | K h ( s i τ ) K h ( t ^ i τ ) | + | K ( u ) | d u + O ( h r ) = | K ( u ) | d u + O ( h r + ( n h 1 + 1 / β ) β ) . (2.29)

Complete the proof.

Proof of Theorem 2.2. Let

a i ( μ , σ ) = t i 1 t i K h ( s τ ) d s t i 1 t i μ s d s ( t i 1 t i σ s 2 d s ) 1 / 2 . (2.30)

Recalling A n defined in (2.9), we have that A n = 2 δ i = 1 n a i ( μ , σ ) U i . Hence, by Lemma 2.1,

E ( A n 4 | { μ t , σ t } ) C δ 4 { i = 1 n a i 4 ( μ , σ ) + ( i = 1 n a i 2 ( μ , σ ) ) 2 } C δ 4 { δ 9 h n 3 i = 1 n t i 1 t i | K h ( s τ ) | d s + ( δ 4 h n i = 1 n t i 1 t i | K h ( s τ ) | d s ) 2 } C δ 4 h n 2 C n 4 h n 2 . (2.31)

It implies that E ( A n 4 ) C n 4 h n 2 . From this and (2.6)-(2.9), it follows that, for any given ε > 0

P ( δ 1 h n | σ ^ τ , G M 2 σ ^ ˜ τ , G M 2 | > ε ) P ( δ 1 h n | A n | > ε / 2 ) C ( δ 1 h n ) 2 E ( A n 4 ) C n 2 0 , (2.32)

that is

δ 1 h n ( σ ^ τ , G M 2 σ ^ ˜ τ , G M 2 ) = o p ( 1 ) . (2.33)

Therefore, the remaining task is to show that

δ 1 h n ( σ ^ ˜ τ , G M 2 σ τ 2 ) 2 σ τ 4 K 2 ( z ) d z d N ( 0,1 ) . (2.34)

Let

y n , i ( τ ) = 1 δ t i 1 t i σ v 2 d v t i 1 t i K h ( s τ ) d s , (2.35)

then

E ( σ ^ ˜ τ , G M 2 | { σ t } ) = 1 δ i = 1 n t i 1 t i σ v 2 d v t i 1 t i K h ( s τ ) d s = i = 1 n y n , i ( τ ) , (2.36)

V a r ( σ ^ ˜ τ , G M 2 | { σ t } ) = 2 δ 2 i = 1 n ( t i 1 t i σ v 2 d v ) 2 ( t i 1 t i K h ( s τ ) d s ) 2 = 2 i = 1 n y n , i 2 ( τ ) , (2.37)

while

δ 1 h n σ ^ ˜ τ , G M 2 σ τ 2 2 σ τ 4 K 2 ( z ) d z = σ ^ ˜ τ , G M 2 i = 1 n y n , i ( τ ) 2 i = 1 n y n , i 2 ( τ ) δ 1 h n i = 1 n y n , i 2 ( τ ) σ τ 4 K 2 ( z ) d z + δ 1 h n i = 1 n y n , i ( τ ) σ τ 2 2 σ τ 4 K 2 ( z ) d z = : I 1 , n ( τ ) I 2 , n ( τ ) + I 3 , n ( τ ) . (2.38)

First, consider I 2, n . Denote that

J 1 , n ( τ ) = h n δ 3 i = 1 n ( t i 1 t i σ v 2 d v ) 2 ( t i 1 t i K h ( s τ ) d s ) 2 h n 0 T σ s 4 K h 2 ( s τ ) d s , (2.39)

J 2 , n ( τ ) = h n 0 T σ s 4 K h 2 ( s τ ) d s σ τ 4 h n 0 T K h 2 ( s τ ) d s , (2.40)

J 3 , n ( τ ) = σ τ 4 h n T K h 2 ( s τ ) d s + σ τ 4 h n 0 K h 2 ( s τ ) d s . (2.41)

Let z = ( s τ ) / h n , we have

δ 1 h n i = 1 n y n , i 2 ( τ ) σ τ 4 K 2 ( z ) d z = h n δ 3 i = 1 n ( t i 1 t i σ v 2 d v ) 2 ( t i 1 t i K h ( s τ ) d s ) 2 σ τ 4 h n K h 2 ( s τ ) d s = J 1 , n ( τ ) + J 2 , n ( τ ) + J 3 , n ( τ ) . (2.42)

By the mean value theorem, there exists v i , s i , s ˜ i [ t i 1 , t i ] ( i = 1 , 2 , , n ) , such that

J 1 , n ( τ ) = h n δ 3 i = 1 n ( t i 1 t i σ v 2 d v ) 2 ( t i 1 t i K h ( s τ ) d s ) 2 h n i = 1 n t i 1 t i σ s 4 K h 2 ( s τ ) d s = δ h n i = 1 n σ v i 4 K h 2 ( s i τ ) δ h n i = 1 n σ s ˜ i 4 K h 2 ( s ˜ i τ ) = δ h n i = 1 n ( σ v i 4 σ s ˜ i 4 ) K h 2 ( s i τ ) + δ h n i = 1 n σ s ˜ i 4 { K h 2 ( s i τ ) K h 2 ( s ˜ i τ ) } = : Q 1 , n ( τ ) + Q 2 , n ( τ ) . (2.43)

Using Lemma 2.2, we get that

E | Q 1 , n ( τ ) | C δ h n i = 1 n E | σ v i 2 σ s ˜ i 2 | K h 2 ( s i τ ) C δ λ i = 1 n δ | K h ( s i τ ) | C δ λ , (2.44)

and

| Q 2 , n ( τ ) | C δ h n i = 1 n | K h ( s i τ ) K h ( s ˜ i τ ) | | K h ( s i τ ) + K h ( s ˜ i τ ) | C δ 1 + β h n β i = 1 n | K h ( s i τ ) + K h ( s ˜ i τ ) | C ( δ h n ) β , (2.45)

thus

J 1, n ( τ ) = O p ( n λ + ( n h n ) β ) . (2.46)

Using the integral transform u = ( s τ ) / h n for J 2, n ( τ ) , and by (2) and (3), we known that

E | J 2 , n ( τ ) | = E | τ / h n ( T τ ) / h n ( σ u h n + τ 4 σ τ 4 ) K 2 ( u ) d u | C τ / h n ( T τ ) / h n E | σ u h n + τ 2 σ τ 2 | K 2 ( u ) d u C h n λ | u | λ | K ( u ) | d u C h n λ . (2.47)

Note that + | u | α | K ( u ) | d u < , we find that

| J 3 , n ( τ ) | C σ τ 4 ( T τ ) / h n | K ( u ) | d u + C σ τ 4 τ / h n | K ( u ) | d u C σ τ 4 h n α ( T τ ) α ( T τ ) / h n | u | α | K ( u ) | d u + C σ τ 4 h n α τ α τ / h n | u | α | K ( u ) | d u C h n α . (2.48)

Combining (2.42) and (2.46)-(2.48), we have

δ 1 h n i = 1 n y n , i 2 ( τ ) σ τ 4 K 2 ( z ) d z = o p ( 1 ) . (2.49)

It follows that

I 2, n ( τ ) p 1. (2.50)

Next, consider I 1, n . We defined c n , i ( τ ) = y n , i ( τ ) / i = 1 n y n , i 2 ( τ ) , thus

I 1 , n ( τ ) = l a w i = 1 n c n , i ( τ ) U ¯ i 2 , (2.51)

where U ¯ i 2 = ( U i 2 1 ) / 2 are independent and identically distributed random variables with mean zero and variance one. According to (2.49), we obtain

c n , i ( τ ) = y n , i ( τ ) T n h n σ τ 4 K 2 ( z ) d z + o p ( 1 n h n ) = 1 δ h n t i 1 t i σ v 2 d v t i 1 t i K ( s τ h n ) d s T n h n σ τ 4 K 2 ( z ) d z + o p ( 1 n h n ) C n h n T n h n σ τ 4 K 2 ( z ) d z + o p ( 1 n h n ) C ( 1 n h n ) 1 / 2 ,

(2.52)

and i = 1 n c n , i 2 ( τ ) = 1 , by Barndorff-Nielsen and Shephard [14] (2006, Cor. 3.1), we obtain

I 1, n ( τ ) | { σ t } d N ( 0,1 ) . (2.53)

Finally, consider I 3, n . By (2.3), we known

i = 1 n y n , i ( τ ) σ τ 2 = O p ( n 1 + 1 n β h n 1 + β + h n m + α ) , (2.54)

then

I 3 , n ( τ ) = O p ( n 1 + 1 n β h n 1 + β + h n m + α ) ( n h n ) 1 / 2 = O p ( n 1 / 2 h n 1 / 2 + 1 n β 1 / 2 h n 1 / 2 + β + n 1 / 2 h n m + α + 1 / 2 ) = O p ( n 1 / 2 h n 1 / 2 + 1 ( n h n ( 1 / 2 + β ) / ( β 1 / 2 ) ) β 1 / 2 + ( n h n 2 m + 2 α + 1 ) 1 / 2 ) . (2.55)

By (2.22) and (2.23), we obtain

I 3, n ( τ ) = o p ( 1 ) . (2.56)

Therefore, it follows (2.34) from (2.50), (2.53) and (2.56). Complete the proof.

3. Simulation

Here, we will show the performance of the Gasser-Müller integrated type spot volatility estimator σ ^ τ , G M 2 (see 1.5) proposed in this paper and compare it with the kernel estimator σ ^ τ 2 proposed by Kristensen [32] (2010). We consider the following stochastic volatility model as referred in Kanaya and Kristensen [37] (2016),

d X t = σ t d W 1, t ,

d σ t 2 = β ( α σ t 2 ) d t + κ σ t 2 d W 2, t ,

where W 1, t and W 2, t are independent standard Brownian motions, parameters are respectively β = 0.510 , α = 0.476 and κ 2 = 0.0518 (refer to Kanaya and Kristensen [37] (2016), page 888). In order to simulate the above model, we use Euler discretization scheme (see Kloeden and Platten [7], 1999) to obtain the following model.

Δ X i δ = σ ( i 1 ) δ δ ε 1, i , (3.1)

Δ σ i δ 2 = β ( α σ ( i 1 ) δ 2 ) δ + κ σ ( i 1 ) δ 2 δ ε 2, i . (3.2)

where { ε 1, i } and { ε 2, i } are i.i.d N ( 0,1 ) , δ > 0 is the length of the discretization step.

In the actual operation of the simulation, we assume that the drift term μ t = 0 , and the observed time is T = 2 (48 hours). The specific steps are as follows:

1) Get the true value { σ t } . According to model (3.2), the truth sequence { σ t } of σ t is obtained, because the discretized model is approximated to the original continuity model and the simulation interval is as small as possible, so the time interval δ = Δ / 100 is selected in model (3.2), where Δ 1 = 3 × 60 × 24 × 2 .

2) Produce samples. Δ X i δ is sampled at 20 seconds, 1 minute, 5 minutes, 10 minutes, 15 minutes and 30 minutes sampling frequency. Model (3.1) is used to generate simulated sample { Δ X i δ , i = 1 , 2 , , n } . The numbers of samples corresponding to various sampling frequencies are

δ 1 = 3 × 60 × 24 × 2 , 60 × 24 × 2 , 12 × 24 × 2 , 6 × 24 × 2 , 4 × 24 × 2 , 2 × 24 × 2 ,

respectively. At the same time, the sample is repeated 400 times and 400 sets of sample data are obtained for each sampling case. Define σ ˜ t i , j 2 , i = 1 , 2 , , n ; j = 1 , 2 , , 400 as the estimated value at time t i using the group j sample data.

3) Choose kernel function and bandwidth. In the process of calculating estimators σ ^ τ , G M 2 and σ ^ τ , we choose the Gaussian kernel. Since the choice of the kernel function has a great influence on estimates on the boundary (see Silverman [38] (1986)), in order to avoid the boundary effects, we only considered estimated values of the [ 1 / 2 , 3 / 2 ] . Because the selection of bandwidth has a great influence on the internal estimation performance, we apply the cross-validated mentioned in Kristensen [32] (2010) to obtain the optimal bandwidth, namely, the optimum bandwidth is the minimum bandwidth for

I S E ^ ( h n ) = i = 1 n I { T l t i 1 T u } [ ( Δ X t i ) 2 δ σ ^ i , t i 2 ] , (3.3)

where σ ^ i , t i 2 is based on the samples removed from the ith sample to calculate the spot volatility estimator of the time t i = i δ , T l and T u are severally the upper and lower limits of the time span we consider, T l = 1 / 2 , T u = 3 / 2 . Using the simulated sample { Δ X i δ } and (3.3), we acquire the optimal bandwidth of the kernel spot volatility estimator and the GM type spot volatility estimator under the different time intervals.

We want to measure the accuracy of the estimator (i.e. estimation effect) by the bias, the variance and the integrated mean square error of this estimator. The calculation formulas for the bias, the variance and the integrated mean square error of the estimators are respectively

Bias 2 = 1 n i = 1 n ( 1 400 j = 1 400 σ ˜ t i , j 2 σ t i 2 ) 2 ,

Var = 1 n i = 1 n 1 400 j = 1 400 ( σ ˜ t i , j 2 1 400 j = 1 400 σ ˜ t i , j 2 ) 2 ,

and

IMSE = 1 n i = 1 n 1 400 j = 1 400 ( σ ˜ t i , j 2 σ t i 2 ) 2 .

Obviously, IMSE = Var + Bias 2 .

Table 1. Performance in interior.

[Note: Integrated sq.bias (×10−4), variance (×10−4), and IMSE (×10−4) of estimators].

The bias, the variance and the integrated mean square error of the GM instantaneous volatility estimator and the kernel instantaneous volatility estimator under the different time intervals are reported in Table 1.

According to the simulation results shown in Table 1, we find that the higher the sampling frequency, the better the accuracy of both the GM type spot volatility estimator σ ^ τ , G M 2 and the kernel spot volatility estimator σ ^ τ , indicating that both estimators are suitable for high frequency data. At the same time, we see that the IMSE of σ ^ τ , G M 2 is slightly smaller than that of σ ^ τ . This shows that the GM type spot volatility estimator provides a selection method for estimating the spot volatility, especially in high frequency data environment, its estimation performance is better.

4. Conclusion

We propose a Gasser-Müller type spot volatility estimator for diffusion process, and prove its asymptotic unbiasedness and asymptotic normality under mild conditions. By simulation analysis, we find that the GM type spot volatility estimator is a better estimator. Therefore, it provides a selection method for estimating the spot volatility in high frequency data environment.

Funding

This research was supported by the National Natural Science Foundation of China [grant number 11461009], the Innovation Project of Guangxi Graduate Education [grant number XYCSZ2017075] and the Scientific research project of the Guangxi Colleges and Universities [grant number KY2015YB345].

Authors’ Contributions

All authors carried out the mathematical studies, and read and approved the final manuscript.

Conflicts of Interest

The authors declare no conflicts of interest.

References

[1] Barndorff-Nielsen, O.E. and Shephard, N. (2002) Econometric Analysis of Realized Volatility and Its Use in Estimating Stochastic Volatility Models. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 64, 253-280.
https://doi.org/10.1111/1467-9868.00336
[2] Christensen, K. and Podolski, M. (2005) Asymptotic Theory for Range-Based Estimation of Integrated Variance of a Continuous Semi-Martingale. Technical Reports.
[3] Li, C., Guo, E. and Bao, M. (2012) A Nonparametric Estimate of Integrated Cross-Volatility. Acta Scientiarum Naturalium Universitatis Sunyatseni, 52, 55-58.
[4] Zhang, L., Mykland, P.A. and Aït-Sahalia, Y. (2005) A Tale of Two Time Scales: Determining Integrated Volatility with Noisy High-Frequency Data. Journal of the American Statistical Association, 100, 1394-1411.
https://doi.org/10.1198/016214505000000169
[5] Hansen, P., Large, J.H. and Lunde, A. (2008) Moving Average-Based Estimators of Integrated Variance. Econometric Reviews, 27, 79-111.
https://doi.org/10.1080/07474930701853640
[6] Li, C.X., Liang, X.L., Jing, B.Y. and Kong, X.B. (2013) The Asymptotics of the Integrated Self-Weighted Cross Volatility Estimator. Journal of Statistical Planning & Inference, 143, 1708-1718.
https://doi.org/10.1016/j.jspi.2013.05.003
[7] Jing, B., Liu, Z. and Kong, X. (2015) Estimating the Volatility Functionals with Multiple Transactions. Econometric Theory, 33, 331-365.
https://doi.org/10.1017/S0266466615000420
[8] Li, Y., Xie, S. and Zheng, X. (2016) Efficient Estimation of Integrated Volatility Incorporating Trading Information. Journal of Econometrics, 195, 33-50.
https://doi.org/10.1016/j.jeconom.2016.05.017
[9] Merton, R.C. (1976) Option Pricing When Underlying Stock Returns Are Discontinuous. Journal of Financial Economics, 3, 125-144.
https://doi.org/10.1016/0304-405X(76)90022-2
[10] Duffie, D., Pan, J. and Singleton, K. (2000) Transform Analysis and Asset Pricing for Affine Jump-Diffusions. Econometrica, 68, 1343-1376.
https://doi.org/10.1111/1468-0262.00164
[11] Pan, J. (2002) The Jump-Risk Premia Implicit in Options: Evidence from an Integrated Time Series Study. Journal of Financial Economics, 63, 3-50.
https://doi.org/10.1016/S0304-405X(01)00088-5
[12] Eraker, B., Johannes, M. and Polson, N. (2003) The Impact of Jumps in Equity Index Volatility and Returns. Journal of Finance, 58, 1269-1300.
https://doi.org/10.1111/1540-6261.00566
[13] Barndorff-Nielsen, O.E. and Shephard, N. (2004) Power and Bipower Variation with Stochastic Volatility and Jumps. Economics Papers, 2, 1-37.
https://doi.org/10.1093/jjfinec/nbh001
[14] Barndorff-Nielsen, O.E. and Shephard, N. (2006) Econometrics of Testing for Jumps in Financial Economics Using Bipower Variation. Journal of Financial Econometrics, 4, 1-30.
https://doi.org/10.1093/jjfinec/nbi022
[15] Mancini, C. (2004) Estimation of the Characteristics of the Jumps of a General Poisson-Diffusion Model. Scandinavian Actuarial Journal, 2004, 42-52.
https://doi.org/10.1080/034612303100170091
[16] Mancini, C. (2009) Non-Parametric Threshold Estimation for Models with Stochastic Diffusion Coefficient and Jumps. Scandinavian Journal of Statistics, 36, 270-296.
https://doi.org/10.1111/j.1467-9469.2008.00622.x
[17] Christensen, K. and Podolskij, M. (2012) Asymptotic Theory of Range-Based Multipower Variation. Journal of Financial Econometrics, 10, 417-456.
https://doi.org/10.1093/jjfinec/nbr019
[18] Todorov, V. (2009) Estimation of Continuous-Time Stochastic Volatility Models with Jumps Using High-Frequency Data. Journal of Econometrics, 148, 131-148.
https://doi.org/10.1016/j.jeconom.2008.10.005
[19] Todorov, V. (2010) Variance Risk-Premium Dynamics: The Role of Jumps. Review of Financial Studies, 23, 345-383.
https://doi.org/10.1093/rfs/hhp035
[20] At-Sahalia, Y. and Jacod, J. (2009) Estimating the Degree of Activity of Jumps in High Frequency Data. Annals of Statistics, 37, 2202-2244.
https://doi.org/10.1214/08-AOS640
[21] At-Sahalia, Y. and Jacod, J. (2011) Analyzing the Spectrum of Asset Returns: Jump and Volatility Components in High Frequency Data. Journal of Economic Literature, 50, 1007-1050.
https://doi.org/10.1257/jel.50.4.1007
[22] Tauchen, G. and Zhou, H. (2011) Realized Jumps on Financial Markets and Predicting Credit Spreads. Journal of Econometrics, 160, 102-118.
https://doi.org/10.1016/j.jeconom.2010.03.023
[23] Foster, D.P. and Nelson, D.B. (1996) Continuous Record Asymptotics for Rolling Sample Variance Estimators. Modelling Stock Market Volatility, 64, 291-329.
https://doi.org/10.1016/B978-012598275-7.50011-9
[24] Andreou, E. and Ghysels, E. (2002) Rolling-Sample Volatility Estimators: Some New Theoretical, Simulation and Empirical Results. Journal of Business & Economic Statistics, 20, 363-376.
https://doi.org/10.1198/073500102288618504
[25] Fan, J. and Wang, Y. (2008) Spot Volatility Estimation for High-Frequency Data. Statistics and Its Interface, 1, 279-288.
https://doi.org/10.4310/SII.2008.v1.n2.a5
[26] Shigeyoshi, O. and Simona, S. (2011) An Improved Two-Step Regularization Scheme for Spot Volatility Estimation. Economic Notes, 40, 107-134.
https://doi.org/10.1111/j.1468-0300.2011.00233.x
[27] Zu, Y. and Boswijk, H.P. (2014) Estimating Spot Volatility with High-Frequency Financial Data. Journal of Econometrics, 181, 117-135.
https://doi.org/10.1016/j.jeconom.2014.04.001
[28] Yu, C., Fang, Y., Li, Z., Zhang, B. and Zhao, X. (2014) Non Parametric Estimation of High-Frequency Spot Volatility for Brownian Semimartingale with Jumps. Journal of Time Series Analysis, 35, 572-591.
https://doi.org/10.1111/jtsa.12082
[29] Mancini, C., Mattiussi, V. and Renò, R. (2015) Spot Volatility Estimation Using Delta Sequences. Finance and Stochastics, 19, 261-293.
https://doi.org/10.1007/s00780-015-0255-1
[30] Curato, I.V., Mancino, M.E. and Recchioni, M.C. (2018) Spot Volatility Estimation Using the Laplace Transform. Econometrics and Statistics, 6, 22-43.
https://doi.org/10.1016/j.ecosta.2016.07.002
[31] Liu, Q., Liu, Y., Liu, Z. and Wang, L. (2018) Estimation of Spot Volatility with Superposed Noisy Data. The North American Journal of Economics and Finance, 44, 62-79.
https://doi.org/10.1016/j.najef.2017.11.004
[32] Kristensen, D. (2010) Nonparametric Filtering of the Realized Spot Volatility: A Kernel-Based Approach. Econometric Theory, 26, 60-93.
https://doi.org/10.1017/S0266466609090616
[33] Gasser, T. and Müller, H.G. (1979) Kernel Estimation of Regression Functions. In: Gasser, T. and Rosenblatt, M., Eds., Smoothing Techniques for Curve Estimation, Springer-Verlag, Berlin, 23-68.
https://doi.org/10.1007/BFb0098489
[34] Bandi, F.M. and Nguyen, T.H. (2003) On the Functional Estimation of Jump-Diffusion Models. Journal of Econometrics, 116, 293-328.
https://doi.org/10.1016/S0304-4076(03)00110-6
[35] Bandi, F.M. and Phillips, P.C.B. (2003) Fully Nonparametric Estimation of Scalar Diffusion Models. Econometrica, 71, 241-283.
https://doi.org/10.1111/1468-0262.00395
[36] Rosenthal, H.P. (1970) On the Estimations of Sums of Independent Random Variables. Israel Journal of Mathematics, 8, 273-303.
https://doi.org/10.1007/BF02771562
[37] Kanaya, S. and Kristensen, D. (2016) Estimation of Stochastic Volatility Models by Nonparametric Filtering. Econometric Theory, 32, 861-916.
https://doi.org/10.1017/S0266466615000079
[38] Silverman, B.W. (1986) Density Estimation for Statistics and Data Analysis. Chapman and Hall, London.
https://doi.org/10.1007/978-1-4899-3324-9

Copyright © 2024 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.