Measuring the Intraday Jump Tail Risk of Financial Asset Price with Noisy High Frequency Data

This paper proposes a simple two-step nonparametric procedure to estimate the intraday jump tail and measure the jump tail risk in asset price with noisy high frequency data. We first propose the pre-averaging threshold approach to estimate the intraday jumps occurred, and then use the peaks-overthreshold (POT) method and generalized Pareto distribution (GPD) to model the intraday jump tail and further measure the jump tail risk. Finally, an empirical example further demonstrates the power of the proposed method to measure the jump tail risk under the effect of microstructure noise.


Introduction
It's well recognized that the financial asset returns are not normally distributed, but instead exhibit more slowly decaying and asymmetric tails.The earliest influential researches in Mandelbrot [1] and Fama [2] show the empirical evidence for fat-tailed return distributions.And the numerous subsequent studies show that these fatter tails may be attributable to stochastic volatility and/or occasionally large absolute price changes, called "jumps" in the underlying asset price process.With the availability of reliable financial high frequency data over the last two decades, many closer researches on the dynamics of financial asset prices have documented the presence of jumps; see Barndorff-Nielsen and Shephard C. Yu et al. [3] [4], Huangand Tauchen [5], Aït-Sahalia and Jacod [6], Lee and Hannig [7], Lee and Mykland [8], and so on.While both components can account for the extreme tail behavior, they have different mechanisms and further have very different implications on pricing and risk management, as recently explored by Bollerslev and Todorov [9].
In contrast to the numerous studies on tail risk resulting from stochastic volatility, there is fewer work to study the jump tail risk.To the best of our knowledge, recent contributions are mainly from Bollerslev and Todorov [9] [10] [11].
However, the recent financial crisis has further spurred the interest of studying the jump tail events, and the econometric techniques for more accurately estimating and modeling such risks.On the other hand, the existing studies on jump tail risk are performed under the assumption of semimartingale price process in an idealized world.The real application, however, runs into wellknown bias problem caused by market microstructure noise, when the data frequency is very high.The presence of market microstructure noise is widely demonstrated in literature; see O'Hara [12], Hasbrouck [13] and the references therein.Such kind of noises are usually caused by the frictions in actual trades, including tick size, discrete observation, bid-ask spread, and other trading mechanics.Hence, how to estimate the jump tails and measure the jump tail risk under the effect of microstructure noise is of great significance in real application.Although there are some methods proposed to deal with the noise, such as the two time scale and multi-time scale approach (Zhang, et al. [14] [15]), pre-averaging method (Podolskij and Vetter [16], Jacod, Li, Mykland [17]) and realized kernel method (Barndorff-Nielsen and Shephard [18]), most of them are used in the scenarios of estimating the integrated volatility or testing the jump component.In this paper, we consider the problem of estimating the jump tail and measuring the jump tail risk when the observations are contaminated by microstructure noise.
In this paper, we focus on studying the intraday jump tail and measuring the jump tail risk under the market microstructure noise.A simple two-step nonparametric procedure is proposed to implement the analysis.In first step, we use the pre-averaging threshold method to nonparametrically estimate the intraday jump under the effect of microstructure noise.In particular, we first adopt local "pre-averaging" via a kernel function to produce a set of non-overlapping (asymptotically) noise-free observations, and then use the threshold technique to identify the jump series.In second step, we model the intraday jump tail based on the extreme value theory (EVT) and further calculate the jump tail risk measure (Value-at-Risk and Expected Shortfall).Our method is nonparametric, and is easy to implement.Finally, a real data example with actual high frequency data of MSFT is used to show these procedures.
The remainder of this paper is organized as follows.Section 2 presents the methodology to estimate the intraday jump and jump tail risk measurement.
Section 3 provides an empirical example to show the procedure.Section 4 draws conclusions.
C. Yu et al.

Intraday Jump Tail Risk Measurement under Microstructure Noise
In this section, a simple two-step procedure is proposed to measure the intraday jump tail risk with noisy high frequency data.In first step, a pre-averaging threshold method is proposed to nonparametrically identify the intraday jump under the effect of microstructure noise.In second step, the peaks-over-threshold (POT) method based on the generalized Pareto distribution (GPD) is used to model the intraday jump tail and further to calculate the jump tail risk measure, i.e.VaR (Value-at-Risk) and ES (Expected Shortfall).

Pre-Averaging Threshold Estimation of Intraday Jump
Assume that the efficient logarithmic price t p of an asset defined on a filtered probability space , evolves as where ( ) Poisson process with finite activity of jumps.Note that t J can be written as where t ε is the noise term.Assume that the t ε s are i.i.d. and independent of t W and t J processes, and with 0 t Eε = , and Eε < ∞ .Although the noises are not necessary i.i.d, this assumption is only for the simplicity to prove the theoretical properties.See the studies in Yu et al. [19], where we show that the estimation method for intraday jump used in this paper also performs well in the setting of correlated noises.
Our goal is to estimate the intraday jump i X , with these noisy observation data { } , 0,1, , For the simplicity of notation, we denote for any process In this paper, we use the pre-averaging approach to diminish the effect of noise.Let n i Z denote the weighted average of n k observations of . We require that the weighting function ( ) C with a piecewise Lipschitz derivative g′ , and satisfies ( ) ( ) . We further require that the integer sequence n k satisfies ( ) Power functions ( ) , which says that the threshold function ( ) n r ∆ can be used to asymptotically identify the intervals where no jump occurred; also see the literature on the noise-and jumprobust volatility estimation (Jing et al. [20]).In other words, if ( ) ( ) where ( ) We now turn to estimate the jump size by a simple nonparametric method.

Denote by
( ) the size of this first jump, also let . For the simplicity of notation, we denote ( ) ( ) ∆ , while the pre-averaging observation of jump process n i J is greater than X multiplying some constant, which is not negligible.So we propose the following estimator for jump size Yu et al. [19] demonstrated the theoretical properties of estimator (4).The results shows that for each i ,

Intraday Jump Tail Risk Measurement
In this subsection, we present how to model the intraday jump tail and then to measure the jump tail risk, i.e.VaR (Value-at-Risk) and ES (Expected Shortfall) based on extreme value theory (EVT).Extreme value theory provides simple parametric models to capture the extreme tails of distribution and to forecast risk.There are mainly two methods of applying EVT: the first is known as the Block Maxima (Minima) (BMM) method based on the generalized extreme value distribution (GEV), while the second is known as the peaks-over-threshold (POT) approach based on the generalized Pareto distribution (GPD).Since the POT method uses GPD to fit the exceedances over a given threshold and hence it doesn't require a large data set as BMM, it is considered more efficient in modelling limited data (McNeil, Frey and Embrecht [21]).Thus, in the following, we use the POT method to model the tail distribution of the identified intraday jump series.
Suppose that the jump series { } i X are identically distributed random va- riables with unknown underlying distribution function for 0 F y x u < < − , where F x ≤ ∞ is the right endpoint of F , and y x u = − .
In EVT framework, there is a key result that for a large class of underlying distributions F (containing all the common continuous distributions in statistics, such as normal, lognormal, t, gamma, exponential, beta, etc.), as the threshold u progressively increases, the excess distribution u F converges to a generalized Pareto distribution.In the sense of this result, the GPD is the natural model for the excess distribution above sufficiently high thresholds.That is the excess distribution function u F can be approximated by GPD for a certain u : where , G ξ σ is the generalized Pareto distribution (GPD), which is given by ( ) ( ) Here ξ is the shape parameter and σ is the scale parameter for GPD.Hence, for x u ≥ , replacing the u F by GPD, ( ) ( C. Yu et al. This gives a formula for tail probabilities.The inverse of (8) gives the high quantile of the distribution or VaR.Thus, for ( ) For 1 ξ < , the ES is given by ( ) Equations ( 9) and (10) give the theoretical formulae to calculate the jump tail risk measure.In the following, we show that how to estimate the VaR and ES with the identified jump series.
For the identified jump series { } ˆi X , if there are total n observations and u N of observations above u , we get an empirical estimator u N n of ( ) Putting the maximum likelihood estimates of the parameters of the GPD together, we arrive an estimator for tail distribution ( ) Also, we get the estimator of VaR and the estimator of ES The estimation procedure presented above depends heavily on the important parameter u .In this paper, we will use the mean excess plot to choose a reasonable threshold.The idea behind this method is demonstrated as follows.Given a high threshold 0 u , suppose that the excess 0 X u − follows a GPD with parameter ξ and σ .Then the mean excess over the threshold 0 For any 0 u u > , define the mean excess function ( ) Thus, for a fixed ξ , the mean excess function is a linear function of u for 0 u u > .This result leads to simple graphical method to infer the appropriate threshold value 0 u for the GPD.Define the empirical mean excess function as ( ) ( ) The scatter plot of ( ) ê u against u is called the mean excess plot, which should be linear in u for 0 u u > .Hence, we can choose a reasonable threshold according to the mean excess plot.

Empirical Example
In this section, we implement our procedure of measuring the intraday jump tail risk with actual high frequency data.We collect the transaction data for Microsoft Corporation (MSFT) shares carried out on NASDAQ from Jan 3, 2011 to Jul 29, 2011 from Wharton Research Data Services (WRDS).We use every ten seconds data to identify and estimate the intraday jumps in one minute return by implementing pre-averaging step with 7 n k = observations.Over this seven months time period, there were total 336,960 ten-seconds observations corresponding to daily 6.5 trading hours in valid 144 trading days excluding weekends and holidays.The return is calculated by ( ) , where i t P denotes the transaction price at i t .
Firstly, we use the pre-averaging threshold method to estimate the intraday jump.Let ( ) ( ) which is used in Jacod et al. [17].In addition, choose the threshold function following the studies in Christensen et al. [22].In order to study the intraday dynamic pattern of jumps, we summarize their frequencies at one-minute frequency of all trading days.Figure 1 presents the frequency distribution of the identified intraday jumps occurred in 6.5 trading hours.It's obvious that the intraday jumps for MSFT from Jan 3, 2011 to Jul 29, 2011 take on "L"-type dynamics.It says that most jumps occurred around the market opening time.For example, there are over 40 trading days with jumps observed at 9:31 (i.e. one minute after the market opening).However, there are less than 10 trading days with jumps observed at half an hour after opening time.
This "L"-type intraday pattern may be driven by the accumulations of news arrivals overnight.Based on the chosen threshold u , Table 1 presents the estimation results of intraday jump and jump tail.Firstly, we can see that there are 452 positive jumps and 437 negative jumps happened among the total one-minute return observations and the corresponding percentage is 0.81% and 0.78% respectively.The number of exceedances over threshold is 293 and 286 for positive and negative jump respectively.It seems that the number of jumps occurred or the intensity of jumps is symmetric for positive and negative jumps.Secondly, by comparing the results of jump tail distribution, we find that the shape parameter ξ for positive jump is −0.0803 and is not significant at the given 10%, 5%, 1% levels, which means that positive jump tail may follow exponential distribution.However, the shape parameter ξ for negative jump is 0.2176 and is significant at 1% level, which means that negative jump tail follows GPD with heavy tail.
These results show that the positive and negative jump tail is asymmetric.In particular, the negative tail is heavier than the positive tail, which shows that there are more negative extreme events happened than positive events over the periods from Jan. 3, 2011 to Jul. 29, 2011 for MSFT.
We then calculate the VaR and ES for negative and positive jumps based on the above estimation results of jump tail distribution.The results of VaR and ES are presented in Table 2.We find that as the significance level (i.e.tail probability) decreases, the results of VaR and ES for negative jump becomes larger than positive jump as expected, which further demonstrates the asymmetry of negative and positive jump tails.Meanwhile, the values in parenthesis in Table 2

Conclusion
Jump component in asset price process is a very important source of financial  extreme risk.With the availability of high frequency data, it has aroused wide attention of researchers in last two decades.However, with the frequency of data increases, the identification of jump and its relevant studies will run into the bias problem caused by market microstructure noise.In this paper, we propose a simple nonparametric method to identify the intraday jump and measure the intraday jump tail risk with noisy high frequency data.We use a two-step procedure to measure the jump tail risk.In first step, we use a pre-averaging approach to diminish the effects of noises, and then propose the pre-averaging threshold estimator of intraday jump.In second step, we fit the tail distribution of the identified jump series with POT method and GPD, and then to calculate the risk measure (VaR and ES) of jump tail.Finally, we show the power of our procedure by a real data study.The results show that our proposed procedure of measuring the jump tail risk is valid and is easy to implement.Moreover, the nonparametric identification of intraday jump can also be used to analyze the dynamics of intraday jump, which is useful to study the microstructure of the market.Further studies on risk management, such as analyzing the impactors of jump tail risk, dynamic jump tail risk forecasting are the future research directions.
Then we can use the threshold technique to identify the jump with these pre-averaging observations { } n i Z .The threshold function is required to satisfy the following assumption.Assumption 1 The threshold function ( ) n r ∆ is a deterministic function of the step length n ∆ there exists jumps on interval ( t t + −   .Thus, we can use this threshold method to identify the intervals where jump occurs and further give a coarse estimation of the location of jumps.Let T τ denote the location set of jumps oc- curred on [ ] the following.For small n ∆ , we have that a.s. in any time interval ( −   , at most only one jump can occur.Moreover, we can obtain that the pre-averaging observation 0 n i Z of continuous diffusion process without jump satisfies

τ
estimates the product of some constant g and the size of the first jump occurs within (

Figure 2
Figure 2 presents the Q-Q plot of the estimated intraday jumps.The result shows that the intraday jump has fatter tails than normal distribution.This further demonstrates the reasonability of using the EVT to model the jump tails.

Figure 3 .
Figure 3. Mean excess function for negative jump tail.

Figure 4 .
Figure 4. Mean excess function for positive jump tail.
are the p values in testing the validity of VaRs and ESs by Kupiec test.Values smaller than a given significance level indicate that the risk measures are invalid.From the results, we can see that the risk measure are valid except the case of 10% significance level for positive jump, which further in turn shows the success of our measuring method for jump tail risk.

Table 1 .
Estimation results of intraday jump and jump tail.

Table 2 .
Results of VaR and ES for intraday jump.: Values in parenthesis are the p values in testing the validity of VaRs and ESs, *, **, *** mean that the risk measures are invalid at 10%, 5%, 1% level respectively. Note