Value at Risk and Expected Shortfall for Normal Weighted Inverse Gaussian Distributions

Abstract

Value at Risk (VaR) and Expected Shortfall (ES) are commonly used measures of potential risk for losses in financial markets. In the literature, VaR and ES for the Normal Inverse Gaussian (NIG) distribution, a special case of the Generalized Hyperbolic Distribution (GHD), are frequently used. There are, however, Normal Inverse Gaussian related distributions, also special cases of GHD, that can be used. The objective of this paper is to calculate VaR and ES for Normal Weighted Inverse Gaussian (NWIG) distributions. The Expectation-Maximization (EM) algorithm has been used to obtain the Maximum Likelihood (ML) estimates of the proposed models for the Range Resource Corporation (RRC) financial data. We used the Kupiec likelihood ratio (LR) test for backtesting of VaR. The Kolmogorov-Smirnov and Anderson-Darling tests have been used for goodness of fit, and the Akaike Information Criterion (AIC), Bayesian Information Criterion (BIC) and log-likelihood for model selection. The results clearly show that the NWIG distributions are good alternatives to NIG for determining VaR and ES.

Citation: Maina, C., Weke, P., Ogutu, C. and Ottieno, J. (2022) Value at Risk and Expected Shortfall for Normal Weighted Inverse Gaussian Distributions. Journal of Mathematical Finance, 12, 20-45. doi: 10.4236/jmf.2022.121002.

1. Introduction

The most popular measures of financial risk are Value at Risk (VaR) and Expected Shortfall (ES). VaR was proposed by Till Guldimann in the late 1980s while working for J. P. Morgan as the head of global research. It is generally defined as the maximum possible loss over a given holding period within a fixed confidence level. An attractive feature of VaR is the backtestability of the measure. Backtesting is a method that uses historical data to gauge accuracy and effectiveness (Zhang and Nadarajah [1]). Backtesting VaR compares the forecast/predicted losses with the actual losses realised at the end of a fixed time horizon. However, the main shortcoming of VaR is that it ignores any loss beyond the value at risk level; that is, it fails to capture tail risk. It also lacks a mathematical property called subadditivity, as stated by Wimmerstedt [2]: the VaR of a combined portfolio can be larger than the sum of the VaRs of its component portfolios. This implies that diversification could increase risk, a contradiction to standard beliefs in finance. Artzner, Delbaen, Eber and Heath ([3] [4]) proposed the use of Expected Shortfall (ES), also called conditional Value at Risk (CVaR), to circumvent the problems inherent in VaR. Expected Shortfall is the conditional expectation of loss given that the loss is beyond the VaR level. However, the main problem with ES is that it lacks a mathematical property called elicitability (Gneiting [5]), necessary for the backtestability of a risk measure. The Basel Committee [6] proposed to replace Value at Risk with Expected Shortfall but concluded that backtesting would still be done on VaR even though capital would be based on Expected Shortfall. The two measures of risk therefore remain the most popular and useful in financial risk management.

Notably, Nadarajah et al. [7] have given a detailed review of VaR and ES for various distributions. One of the distributions reviewed is the Generalized Hyperbolic Distribution (GHD), introduced by Barndorff-Nielsen [8] as a Normal Variance-Mean Mixture with the Generalized Inverse Gaussian (GIG) distribution as the mixing distribution. The GIG is a three-parameter distribution denoted as $GIG(\lambda, \delta, \gamma)$. It embraces a number of special and limiting cases. The GHD and its subclasses fit high frequency financial data, which are characterised by skewness, excess kurtosis and fat tails, well. The most common special case is the Normal Inverse Gaussian (NIG) distribution, introduced by Barndorff-Nielsen [9] with the Inverse Gaussian (IG) as the mixing distribution. However, there are other special cases of GHD related to the NIG distribution which have not been considered for VaR and ES. These special cases are the Normal Weighted Inverse Gaussian (NWIG) distributions.

The objective of this paper is to determine VaR and ES for Range Resource Corporation (RRC) financial data using Normal Weighted Inverse Gaussian (NWIG) distributions. In particular we consider Normal mixtures with $GIG(\frac{1}{2}, \delta, \gamma)$, $GIG(-\frac{3}{2}, \delta, \gamma)$ and $GIG(\frac{3}{2}, \delta, \gamma)$ as mixing distributions. We study their properties and estimate parameters using the Expectation-Maximization (EM) algorithm introduced by Dempster et al. [10]. The Kolmogorov-Smirnov and Anderson-Darling tests have been used for goodness of fit, and the Akaike Information Criterion (AIC), Bayesian Information Criterion (BIC) and log-likelihood for model selection.

The concept of a weighted distribution was introduced by Fisher [11] and elaborated by Patil and Rao [12]. The Reciprocal Inverse Gaussian distribution and the finite mixture of the Inverse Gaussian and Reciprocal Inverse Gaussian distributions are shown to be Weighted Inverse Gaussian (WIG) distributions by Akman and Gupta [13], Gupta and Akman [14] and Gupta and Kundu [15]. The special cases $GIG(-\frac{3}{2}, \delta, \gamma)$ and $GIG(\frac{3}{2}, \delta, \gamma)$ are also WIG distributions.

For backtesting the Value at Risk of these models we use the Kupiec likelihood ratio (LR) test introduced by Kupiec [16]. The remainder of this paper is structured as follows: Section 2 deals with the concepts of VaR and ES. Section 3 deals with the Generalised Inverse Gaussian distribution and its special cases. Weighted distributions are covered in Section 4. The concept of the Generalised Hyperbolic distribution is illustrated in Section 5, while its special cases of interest are in Section 6. Parameter estimation of the proposed models is performed in Section 7. Application to Range Resource Corporation data is done in Section 8, and Section 9 concludes.

2. Value at Risk and Expected Shortfall: Mathematical Background

The most important risk measures despite their drawbacks are Value at Risk (VaR) and Expected Shortfall (ES). VaR was proposed by Till Guldimann in the late 1980s, and at the time he was the head of global research at J. P. Morgan.

Value at Risk is generally defined as the maximum possible loss over a given holding period within a fixed confidence level. Mathematically, VaR at the $100(1-\alpha)$ percent confidence level is defined as the lower $100\alpha$ percentile of the profit-loss distribution.

In statistical terms, VaR is a quantile of distribution for financial asset returns. More formally, VaR is defined as

$P\{X \le \mathrm{VaR}_\alpha(X)\} = \alpha$ (1)

where X represents the Asset’s returns. In integral form it can be expressed as

$\int_{-\infty}^{\mathrm{VaR}_\alpha(X)} f(x)\,dx = \alpha$ (2)

where f(x) is the profit-loss distribution.

The concept of Expected Shortfall (ES) was first introduced in Rappoport [17]. Artzner et al. ( [3] [4] ) formally developed the concept. ES is the conditional expectation of loss given that the loss is beyond the VaR level and measures how much one can lose on average in the states beyond the VaR level.

From Equation (2),

$\frac{1}{\alpha}\int_{-\infty}^{\mathrm{VaR}_\alpha} f(x)\,dx = 1$ (3)

Therefore $\frac{f(x)}{\alpha}$ is a pdf for $-\infty < x < \mathrm{VaR}_\alpha$, and we refer to it as the "tail loss distribution".

The conditional expectation

$E[X \mid X < \mathrm{VaR}_\alpha] = \int_{-\infty}^{\mathrm{VaR}_\alpha} x\,\frac{f(x)}{\alpha}\,dx$ (4)

is the Expected Shortfall, denoted $ES_\alpha$. This version was used by Yamai and Yoshiba [18] to obtain the ES for a normal distribution. Equation (4) can be expressed in a different form as follows. Defining F(x) as the cdf of the random variable X, let

$u = F(x) \;\Rightarrow\; x = F^{-1}(u)$

$du = f(x)\,dx$

so that $u = 0$ when $x = -\infty$ and $u = \alpha$ when $x = \mathrm{VaR}_\alpha$. Hence

$ES_\alpha = \frac{1}{\alpha}\int_0^\alpha F^{-1}(u)\,du = \frac{1}{\alpha}\int_0^\alpha \mathrm{VaR}_u\,du$ (5)

as presented by Zhang et al. [19].

Remarks: Equation (4) is the mean of the tail loss distribution $\frac{f(x)}{\alpha}$, $-\infty < x < \mathrm{VaR}_\alpha$. Equation (5) represents the average of the VaR levels between 0 and $\alpha$.
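To make Equations (1)-(5) concrete, the following minimal sketch (ours, not from the paper) computes VaR as a quantile and ES as the average of $\mathrm{VaR}_u$ over $u \in (0, \alpha)$ by the midpoint rule; the function name var_es and the grid size are illustrative choices, and the result is checked against the closed-form ES of the standard normal.

```python
import numpy as np
from scipy.stats import norm

def var_es(quantile_fn, alpha, n_grid=100_000):
    """VaR_alpha = F^{-1}(alpha); ES_alpha = (1/alpha) * integral_0^alpha F^{-1}(u) du."""
    var = quantile_fn(alpha)
    u = (np.arange(n_grid) + 0.5) * alpha / n_grid  # midpoint rule on (0, alpha)
    es = quantile_fn(u).mean()                      # Equation (5): average of VaR_u
    return var, es

var_, es_ = var_es(norm.ppf, 0.05)
print(var_, es_)                         # approx -1.6449 and -2.0627
print(-norm.pdf(norm.ppf(0.05)) / 0.05)  # closed-form N(0,1) ES: -phi(z_alpha)/alpha
```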

For the purpose of VaR and ES analysis, a model for the return distribution is important because it describes the potential behaviour of a financial security in the future (Bams and Wilhouwer [20]). The Normal distribution tends to underestimate the tails and hence VaR. Recently, alternative distributions have been proposed that focus more on the tail behaviour of returns. One such candidate is the Normal Inverse Gaussian (NIG) distribution. We consider extensions of the NIG distribution, the Normal Weighted Inverse Gaussian (NWIG) distributions. In the next few sections we give a detailed illustration of their construction, properties and parameter estimation via the EM algorithm.

3. Generalised Inverse Gaussian Distribution

The Generalised Inverse Gaussian (GIG) distribution is based on the modified Bessel function of the third kind. The modified Bessel function of the third kind of order $\lambda$ evaluated at $\omega$, denoted $K_\lambda(\omega)$, is defined as

$K_\lambda(\omega) = \frac{1}{2}\int_0^\infty x^{\lambda-1}\,e^{-\frac{\omega}{2}\left(x + \frac{1}{x}\right)}\,dx$ (6)

with the following properties

a) $K_{\frac{1}{2}}(\omega) = K_{-\frac{1}{2}}(\omega) = \sqrt{\frac{\pi}{2\omega}}\,e^{-\omega}$ (7)

b) $K_{\frac{3}{2}}(\omega) = K_{-\frac{3}{2}}(\omega) = \sqrt{\frac{\pi}{2\omega}}\,e^{-\omega}\left(1 + \frac{1}{\omega}\right)$ (8)

c) $K_{\frac{5}{2}}(\omega) = K_{-\frac{5}{2}}(\omega) = \sqrt{\frac{\pi}{2\omega}}\,e^{-\omega}\left(1 + \frac{3}{\omega} + \frac{3}{\omega^2}\right)$ (9)

d) $K_{\frac{7}{2}}(\omega) = K_{-\frac{7}{2}}(\omega) = \sqrt{\frac{\pi}{2\omega}}\,e^{-\omega}\left(1 + \frac{6}{\omega} + \frac{15}{\omega^2} + \frac{15}{\omega^3}\right)$ (10)

e) $K_{\frac{9}{2}}(\omega) = K_{-\frac{9}{2}}(\omega) = \sqrt{\frac{\pi}{2\omega}}\,e^{-\omega}\left(1 + \frac{10}{\omega} + \frac{45}{\omega^2} + \frac{105}{\omega^3} + \frac{105}{\omega^4}\right)$ (11)

f) $K_{\frac{11}{2}}(\omega) = K_{-\frac{11}{2}}(\omega) = \sqrt{\frac{\pi}{2\omega}}\,e^{-\omega}\left(1 + \frac{15}{\omega} + \frac{105}{\omega^2} + \frac{420}{\omega^3} + \frac{945}{\omega^4} + \frac{945}{\omega^5}\right)$ (12)

which are necessary in deriving the properties and estimates of the proposed models. For further definitions and properties see Abramowitz and Stegun [21].
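Properties (7)-(9) are easy to check numerically. The snippet below (an illustration we add, not part of the paper) verifies them with scipy.special.kv, SciPy's implementation of $K_\lambda(\omega)$, together with the symmetry $K_{-\lambda}(\omega) = K_\lambda(\omega)$.

```python
import numpy as np
from scipy.special import kv  # modified Bessel function of the third kind

w = 2.7  # an arbitrary test point
assert np.isclose(kv(-0.5, w), kv(0.5, w))  # symmetry in the order
assert np.isclose(kv(0.5, w), np.sqrt(np.pi / (2 * w)) * np.exp(-w))                           # (7)
assert np.isclose(kv(1.5, w), np.sqrt(np.pi / (2 * w)) * np.exp(-w) * (1 + 1 / w))             # (8)
assert np.isclose(kv(2.5, w), np.sqrt(np.pi / (2 * w)) * np.exp(-w) * (1 + 3 / w + 3 / w**2))  # (9)
```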

Using the parametrization $\omega = \delta\gamma$ and the transformation $x = \frac{\gamma}{\delta}z$, formula (6) becomes

$K_\lambda(\delta\gamma) = \frac{1}{2}\int_0^\infty \left(\frac{\gamma}{\delta}\right)^\lambda z^{\lambda-1} \exp\left\{-\frac{1}{2}\left(\frac{\delta^2}{z} + \gamma^2 z\right)\right\}\,dz$

Hence

$g(z) = \frac{(\gamma/\delta)^\lambda\,z^{\lambda-1}}{2K_\lambda(\delta\gamma)} \exp\left\{-\frac{1}{2}\left(\frac{\delta^2}{z} + \gamma^2 z\right)\right\}$ (13)

for $z > 0$; $-\infty < \lambda < \infty$, $\delta > 0$, $\gamma > 0$,

which is the Generalized Inverse Gaussian (GIG) distribution with parameters $\lambda, \delta, \gamma$.

Thus

$Z \sim GIG(\lambda, \delta, \gamma)$

with

$E(Z^r) = \left(\frac{\delta}{\gamma}\right)^r \frac{K_{\lambda+r}(\delta\gamma)}{K_\lambda(\delta\gamma)}$

where r can be any positive or negative integer.
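In code, the moment formula translates directly into a ratio of Bessel functions; the helper below is a sketch (gig_moment is our own name) that is reused in later snippets.

```python
from scipy.special import kv  # modified Bessel function of the third kind

def gig_moment(r, lam, delta, gamma):
    """E(Z^r) = (delta/gamma)^r * K_{lam+r}(delta*gamma) / K_lam(delta*gamma)."""
    w = delta * gamma
    return (delta / gamma) ** r * kv(lam + r, w) / kv(lam, w)

# Example: for the IG case (lam = -1/2), E(Z) = delta/gamma since K_{1/2} = K_{-1/2}.
print(gig_moment(1, -0.5, 2.0, 1.5), 2.0 / 1.5)
```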

4. Weighted Inverse Gaussian Distribution

Let X be a random variable with pdf $f(x)$. A function of X, $w(X)$, is also a random variable, with expectation

$E[w(X)] = \int_{-\infty}^\infty w(x)\,f(x)\,dx$

so that

$1 = \int_{-\infty}^\infty \frac{w(x)}{E[w(X)]}\,f(x)\,dx$

Thus

$f_W(x) = \frac{w(x)}{E[w(X)]}\,f(x), \quad -\infty < x < \infty$ (14)

is a weighted distribution. It was introduced by Fisher [11] and elaborated by Patil and Rao [12].

From Equation (13), when $\lambda = -\frac{1}{2}$ we have $GIG(-\frac{1}{2}, \delta, \gamma)$ with pdf

$g_1(z) = \frac{\delta e^{\delta\gamma}}{\sqrt{2\pi}}\,z^{-\frac{3}{2}} \exp\left\{-\frac{1}{2}\left(\frac{\delta^2}{z} + \gamma^2 z\right)\right\}$ (15)

This is the Inverse Gaussian (IG) distribution.

The following special cases of the GIG distribution can be expressed in terms of the IG distribution; they are weighted Inverse Gaussian distributions.

Example 1: $GIG(\frac{1}{2}, \delta, \gamma)$

When $\lambda = \frac{1}{2}$, the pdf of $GIG(\lambda, \delta, \gamma)$ becomes

$g_2(z) = \frac{\gamma e^{\delta\gamma}}{\sqrt{2\pi}}\,z^{-\frac{1}{2}} \exp\left\{-\frac{1}{2}\left(\frac{\delta^2}{z} + \gamma^2 z\right)\right\}$

This is the Reciprocal Inverse Gaussian (RIG) distribution. It can be written as

$g_2(z) = \frac{\gamma}{\delta}\,z\left[\frac{\delta e^{\delta\gamma}}{\sqrt{2\pi}}\,z^{-\frac{3}{2}} \exp\left\{-\frac{1}{2}\left(\frac{\delta^2}{z} + \gamma^2 z\right)\right\}\right] = \left(\frac{\gamma}{\delta}\,z\right) g_1(z)$

i.e.,

$GIG\left(\tfrac{1}{2}, \delta, \gamma\right) = \left(\tfrac{\gamma}{\delta}\,z\right) GIG\left(-\tfrac{1}{2}, \delta, \gamma\right)$ (16)

Thus $GIG(\frac{1}{2}, \delta, \gamma)$ is a weighted Inverse Gaussian distribution with weight

$w(Z) = Z$ (17)

where

$Z \sim GIG\left(-\tfrac{1}{2}, \delta, \gamma\right)$

$E[w(Z)] = E(Z) = \frac{\delta}{\gamma}$

The RIG distribution is also called the Length Biased Inverse Gaussian distribution.

Example 2: $GIG(-\frac{3}{2}, \delta, \gamma)$

The pdf is

$g_3(z) = \frac{\delta^3 e^{\delta\gamma}}{\sqrt{2\pi}\,(1 + \delta\gamma)}\,z^{-\frac{5}{2}}\,e^{-\frac{1}{2}\left(\frac{\delta^2}{z} + \gamma^2 z\right)} = \frac{\delta^2}{1 + \delta\gamma}\,z^{-1}\,g_1(z)$

Therefore $GIG(-\frac{3}{2}, \delta, \gamma)$ is a weighted Inverse Gaussian distribution with weight

$w(Z) = Z^{-1}$ (18)

and

$E[w(Z)] = \frac{1 + \delta\gamma}{\delta^2}$

Example 3: $GIG(\frac{3}{2}, \delta, \gamma)$

The pdf of $GIG(\frac{3}{2}, \delta, \gamma)$ is

$g_4(z) = \frac{\gamma^3 e^{\delta\gamma}}{(1 + \delta\gamma)\sqrt{2\pi}}\,z^{\frac{1}{2}}\,e^{-\frac{1}{2}\left(\frac{\delta^2}{z} + \gamma^2 z\right)} = \frac{\gamma^3}{\delta(1 + \delta\gamma)}\,z^2\left[\frac{\delta e^{\delta\gamma}}{\sqrt{2\pi}}\,z^{-\frac{3}{2}} \exp\left\{-\frac{1}{2}\left(\frac{\delta^2}{z} + \gamma^2 z\right)\right\}\right] = \frac{\gamma^3}{\delta(1 + \delta\gamma)}\,z^2\,g_1(z)$

Thus $GIG(\frac{3}{2}, \delta, \gamma)$ is a weighted Inverse Gaussian distribution with weight

$w(Z) = Z^2$ (19)

where

$Z \sim GIG\left(-\tfrac{1}{2}, \delta, \gamma\right)$

$E[w(Z)] = \frac{\delta(1 + \delta\gamma)}{\gamma^3}$
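The weight expectations in Examples 1-3 can be checked against the GIG moment formula of Section 3; the following assertions (ours, reusing the gig_moment sketch above with illustrative parameter values) confirm $E(Z) = \delta/\gamma$, $E(Z^{-1}) = (1+\delta\gamma)/\delta^2$ and $E(Z^2) = \delta(1+\delta\gamma)/\gamma^3$ under $Z \sim GIG(-\frac{1}{2}, \delta, \gamma)$.

```python
import numpy as np

delta, gamma = 1.8, 0.9  # illustrative values
assert np.isclose(gig_moment(1, -0.5, delta, gamma), delta / gamma)                           # Example 1
assert np.isclose(gig_moment(-1, -0.5, delta, gamma), (1 + delta * gamma) / delta**2)         # Example 2
assert np.isclose(gig_moment(2, -0.5, delta, gamma), delta * (1 + delta * gamma) / gamma**3)  # Example 3
```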

5. Generalized Hyperbolic Distribution

A stochastic representation of a Normal Variance-Mean mixture is obtained by letting

$X = \mu + \beta Z + \sqrt{Z}\,Y$

where

$Y \sim N(0, 1)$

and Z, independent of Y, is a positive random variable.

If F(x) is the cdf of X, then

$F(x) = \Pr\{X \le x\} = \Pr\left\{Y \le \frac{x - \mu - \beta z}{\sqrt{z}},\ 0 < z < \infty\right\} = \int_0^\infty \int_{-\infty}^{\frac{x - \mu - \beta z}{\sqrt{z}}} \phi(y)\,g(z)\,dy\,dz = \int_0^\infty \Phi\left(\frac{x - \mu - \beta z}{\sqrt{z}}\right) g(z)\,dz$

where $\phi(\cdot)$ and $\Phi(\cdot)$ are the pdf and cdf of a standard normal distribution, respectively. Differentiating,

$f(x) = \int_0^\infty \frac{1}{\sqrt{z}}\,\phi\left(\frac{x - \mu - \beta z}{\sqrt{z}}\right) g(z)\,dz = \int_0^\infty \frac{1}{\sqrt{2\pi z}}\,e^{-\frac{[x - (\mu + \beta z)]^2}{2z}}\,g(z)\,dz$ (20)

Thus we have a hierarchical representation

$X \mid Z = z \sim N(\mu + \beta z, z)$ (21)

giving the conditional pdf, with g(z) the mixing distribution.

If

$Z \sim GIG(\lambda, \delta, \gamma)$ (22)

then

$f(x) = \frac{(\gamma/\delta)^\lambda\,e^{\beta(x-\mu)}}{\sqrt{2\pi}\,K_\lambda(\delta\gamma)}\cdot\frac{1}{2}\int_0^\infty z^{\lambda - \frac{3}{2}}\,e^{-\frac{1}{2}\left(\alpha^2 z + \frac{\delta^2\phi(x)}{z}\right)}\,dz$

where

$\alpha^2 = \gamma^2 + \beta^2 \quad\Longleftrightarrow\quad \gamma = \sqrt{\alpha^2 - \beta^2}$

and

$\phi(x) = 1 + \frac{(x - \mu)^2}{\delta^2}$

This is an integral form of the GHD, for $-\infty < x < \infty$; $-\infty < \mu < \infty$, $\delta > 0$, $\alpha > 0$, $|\beta| < \alpha$.

Using the stochastic representation of the NVM mixture, the properties of X are expressed in terms of the mixing distribution, as shown in the following proposition.

Proposition 1

$M_X(t) = e^{\mu t}\,M_Z\left(\beta t + \frac{t^2}{2}\right)$

$E(X) = \mu + \beta E(Z)$

$\mathrm{Var}(X) = E(Z) + \beta^2\,\mathrm{Var}(Z)$

$\mu_3(X) = 3\beta\,\mathrm{Var}(Z) + \beta^3\mu_3(Z)$

$\mu_4(X) = \beta^4\mu_4(Z) + 6\beta^2\mu_3(Z) + 6\beta^2 E[Z]\,\mathrm{Var}(Z) + 3E[Z^2]$
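As a sketch of how Proposition 1 is used in practice, the function below (our illustration) combines GIG moments from the gig_moment helper above to return the mean and variance of X; the higher moments follow the same pattern.

```python
def nvm_mean_var(mu, beta, lam, delta, gamma):
    """Mean and variance of X = mu + beta*Z + sqrt(Z)*Y with Z ~ GIG(lam, delta, gamma)."""
    ez = gig_moment(1, lam, delta, gamma)              # E(Z)
    var_z = gig_moment(2, lam, delta, gamma) - ez**2   # Var(Z)
    return mu + beta * ez, ez + beta**2 * var_z        # Proposition 1

print(nvm_mean_var(0.0, 0.1, -0.5, 2.0, 1.5))  # illustrative parameter values
```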

6. Special Cases of Interest

When $\lambda = -\frac{1}{2}$, the GHD becomes the NIG, whose pdf can be expressed in integral form as

$f_1(x) = \frac{\delta e^{\delta\gamma}\,e^{\beta(x-\mu)}}{2\pi}\int_0^\infty z^{-2}\,e^{-\frac{1}{2}\left\{\frac{\delta^2\phi(x)}{z} + \alpha^2 z\right\}}\,dz$ (23)

Therefore the Normal Weighted Inverse Gaussian formulation can be represented as

$f(x) = \frac{\delta e^{\delta\gamma}\,e^{\beta(x-\mu)}}{2\pi}\int_0^\infty \frac{w(z)}{E[w(Z)]}\,z^{-2}\,e^{-\frac{1}{2}\left\{\frac{\delta^2\phi(x)}{z} + \alpha^2 z\right\}}\,dz$ (24)

Hence, when

$\lambda = -\frac{1}{2}: \quad \frac{w(z)}{E[w(Z)]} = 1$ (25)

$f_1(x) = \frac{\alpha\delta\,e^{\delta\gamma + \beta(x-\mu)}\,K_1\left(\alpha\sqrt{\delta^2 + (x-\mu)^2}\right)}{\pi\sqrt{\delta^2 + (x-\mu)^2}}$

which is the Normal Inverse Gaussian (NIG) distribution, with properties given in Table 1 below.

When

$\lambda = \frac{1}{2}: \quad \frac{w(z)}{E[w(Z)]} = \frac{\gamma}{\delta}\,z$ (26)

$f_2(x) = \frac{\gamma\,e^{\delta\gamma + \beta(x-\mu)}\,K_0\left(\alpha\sqrt{\delta^2 + (x-\mu)^2}\right)}{\pi}$

which is the Normal Reciprocal Inverse Gaussian (NRIG) distribution, with properties given in Table 2 below.

Similarly, when

$\lambda = -\frac{3}{2}: \quad \frac{w(z)}{E[w(Z)]} = \frac{\delta^2}{(1 + \delta\gamma)\,z}$ (27)

$f_3(x) = \frac{\alpha^2\delta^3\,e^{\delta\gamma + \beta(x-\mu)}\,K_2\left(\alpha\sqrt{\delta^2 + (x-\mu)^2}\right)}{\pi(1 + \delta\gamma)\left[\delta^2 + (x-\mu)^2\right]}$

which is the Normal-$GIG(-\frac{3}{2}, \delta, \gamma)$, with properties given in Table 3 below.

Table 1. Properties of NIG.

Table 2. Properties of NRIG.

Table 3. Properties of Normal-$GIG(-\frac{3}{2}, \delta, \gamma)$.

Finally, when

$\lambda = \frac{3}{2}: \quad \frac{w(z)}{E[w(Z)]} = \frac{\gamma^3\,z^2}{\delta(1 + \delta\gamma)}$ (28)

$f_4(x) = \frac{\gamma^3\,e^{\delta\gamma + \beta(x-\mu)}\,\sqrt{\delta^2 + (x-\mu)^2}\,K_1\left(\alpha\sqrt{\delta^2 + (x-\mu)^2}\right)}{\alpha\pi(1 + \delta\gamma)}$

which is the Normal-$GIG(\frac{3}{2}, \delta, \gamma)$, with properties given in Table 4 below.
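The four closed-form densities $f_1$-$f_4$ can be written in a single routine; the sketch below (our code, with illustrative parameter values) uses scipy.special.kv and $s = \sqrt{\delta^2 + (x-\mu)^2}$, and checks numerically that each density integrates to one.

```python
import numpy as np
from scipy.special import kv
from scipy.integrate import quad

def nwig_pdf(x, mu, alpha, beta, delta, model=1):
    gamma = np.sqrt(alpha**2 - beta**2)
    s = np.sqrt(delta**2 + (x - mu)**2)
    e = np.exp(delta * gamma + beta * (x - mu))
    if model == 1:  # NIG: lambda = -1/2
        return alpha * delta * e * kv(1, alpha * s) / (np.pi * s)
    if model == 2:  # NRIG: lambda = 1/2
        return gamma * e * kv(0, alpha * s) / np.pi
    if model == 3:  # Normal-GIG(-3/2)
        return alpha**2 * delta**3 * e * kv(2, alpha * s) / (np.pi * (1 + delta * gamma) * s**2)
    if model == 4:  # Normal-GIG(3/2)
        return gamma**3 * e * s * kv(1, alpha * s) / (alpha * np.pi * (1 + delta * gamma))
    raise ValueError("model must be 1, 2, 3 or 4")

# Each density should integrate to one (illustrative parameters):
print([quad(nwig_pdf, -np.inf, np.inf, args=(0.0, 1.2, 0.3, 1.5, m))[0] for m in (1, 2, 3, 4)])
```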

7. Parameter Estimation via EM Algorithm

7.1. Theory/Concept

EM algorithm is a powerful technique for maximum likelihood estimation for data containing missing values or data that can be considered as containing missing values. It was introduced by Dempster et al. [10].

Table 4. Properties of Normal-$GIG(\frac{3}{2}, \delta, \gamma)$.

Assume that the complete data consist of an observed part X and an unobserved part Z. The log-likelihood of the complete data $(x_i, z_i)$, $i = 1, 2, \ldots, n$, then factorizes into two parts (Kostas [22]).

This implies that the joint density of X and Z is given by

$f(x, z) = f(x \mid z)\,g(z)$

The likelihood function is

$L = \prod_{i=1}^n f(x_i \mid z_i)\,g(z_i) = \prod_{i=1}^n f(x_i \mid z_i)\,\prod_{i=1}^n g(z_i)$

$\log L = \sum_{i=1}^n \log f(x_i \mid z_i) + \sum_{i=1}^n \log g(z_i) = l_1 + l_2$

where

$l_1 = \sum_{i=1}^n \log f(x_i \mid z_i)$

and

$l_2 = \sum_{i=1}^n \log g(z_i)$

Karlis [23] applied the EM algorithm to mixtures, which he considered to consist of two parts: the conditional pdf, based on the observed data, and the mixing distribution, based on the unobserved data (the missing values).

7.2. M-Step for Conditional pdf

In this study

$l_1 = -\frac{n}{2}\log(2\pi) - \frac{1}{2}\sum_{i=1}^n \log z_i - \sum_{i=1}^n \frac{(x_i - \mu - \beta z_i)^2}{2z_i}$

Therefore

$\frac{\partial l_1}{\partial\beta} = 0 \Rightarrow \sum_{i=1}^n \left(x_i - \hat\mu - \hat\beta z_i\right) = 0$

i.e., $\sum_{i=1}^n x_i - n\hat\mu - \hat\beta\sum_{i=1}^n z_i = 0$, so that

$\hat\mu = \bar x - \hat\beta\bar z$

where $\bar x = \frac{1}{n}\sum_{i=1}^n x_i$ and $\bar z = \frac{1}{n}\sum_{i=1}^n z_i$.

Similarly,

$\frac{\partial l_1}{\partial\mu} = 0 \Rightarrow \sum_{i=1}^n \frac{x_i}{z_i} - \hat\mu\sum_{i=1}^n \frac{1}{z_i} - n\hat\beta = 0$

Substituting $\hat\mu = \bar x - \hat\beta\bar z$,

$\sum_{i=1}^n \frac{x_i}{z_i} - \bar x\sum_{i=1}^n \frac{1}{z_i} + \hat\beta\bar z\sum_{i=1}^n \frac{1}{z_i} - n\hat\beta = 0$

$\hat\beta = \frac{\sum_{i=1}^n \frac{x_i}{z_i} - \bar x\sum_{i=1}^n \frac{1}{z_i}}{n - \bar z\sum_{i=1}^n \frac{1}{z_i}}$
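In code, these two normal equations give $\hat\beta$ and $\hat\mu$ in closed form; since in the E-step below the unobserved $z_i$ and $1/z_i$ are replaced by posterior expectations, the sketch (ours) takes those as inputs s and w.

```python
import numpy as np

def m_step_conditional(x, s, w):
    """Closed-form beta and mu, with s and w standing in for z_i and 1/z_i."""
    n = len(x)
    beta = (np.sum(x * w) - x.mean() * np.sum(w)) / (n - s.mean() * np.sum(w))
    mu = x.mean() - beta * s.mean()
    return beta, mu
```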

7.3. E-Step

The values of the random variables $Z_i$ and $\frac{1}{Z_i}$ are not known, so we estimate them by the posterior expectations $E(Z_i \mid X_i)$ and $E\left(\frac{1}{Z_i} \mid X_i\right)$.

The posterior distribution is defined as

$f(z \mid x) = \frac{f(x \mid z)\,g(z)}{\int_0^\infty f(x \mid z)\,g(z)\,dz}$

Therefore

$E(Z \mid X) = \frac{\int_0^\infty z\,f(x \mid z)\,g(z)\,dz}{\int_0^\infty f(x \mid z)\,g(z)\,dz}$

$E\left(\frac{1}{Z} \mid X\right) = \frac{\int_0^\infty \frac{1}{z}\,f(x \mid z)\,g(z)\,dz}{\int_0^\infty f(x \mid z)\,g(z)\,dz}$

7.4. For Inverse Gaussian Mixing Distribution

7.4.1. M-Step

$l_2 = -\frac{n}{2}\log(2\pi) + n\log\delta + n\delta\gamma - \frac{3}{2}\sum_{i=1}^n \log z_i - \frac{\delta^2}{2}\sum_{i=1}^n \frac{1}{z_i} - \frac{\gamma^2}{2}\sum_{i=1}^n z_i$

$\frac{\partial l_2}{\partial\gamma} = n\delta - \gamma\sum_{i=1}^n z_i$

$\frac{\partial l_2}{\partial\gamma} = 0 \Rightarrow \hat\gamma = \frac{\hat\delta}{\bar z}$

and

$\frac{\partial l_2}{\partial\delta} = 0 \Rightarrow \frac{n}{\delta} + n\gamma - \delta\sum_{i=1}^n \frac{1}{z_i} = 0$

$\hat\delta = \sqrt{\frac{n}{\sum_{i=1}^n \frac{1}{z_i} - \frac{n}{\bar z}}}$

7.4.2. Posterior Expectations

For the NIG,

$E(Z \mid X) = \frac{\delta\sqrt{\phi(x)}}{\alpha}\,\frac{K_0\left(\alpha\delta\sqrt{\phi(x)}\right)}{K_1\left(\alpha\delta\sqrt{\phi(x)}\right)}$

and

$E\left(\frac{1}{Z} \mid X\right) = \frac{\alpha}{\delta\sqrt{\phi(x)}}\,\frac{K_2\left(\alpha\delta\sqrt{\phi(x)}\right)}{K_1\left(\alpha\delta\sqrt{\phi(x)}\right)}$
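These two Bessel-ratio formulas implement directly; the sketch below (our code) returns both posterior expectations at once, writing $u = \alpha\delta\sqrt{\phi(x)}$ so that $\delta\sqrt{\phi(x)}/\alpha = u/\alpha^2$.

```python
import numpy as np
from scipy.special import kv

def nig_posterior(x, mu, alpha, delta):
    """s = E(Z | X) and w = E(1/Z | X) for the NIG model."""
    u = alpha * delta * np.sqrt(1 + (x - mu)**2 / delta**2)  # alpha * delta * sqrt(phi(x))
    s = (u / alpha**2) * kv(0, u) / kv(1, u)
    w = (alpha**2 / u) * kv(2, u) / kv(1, u)
    return s, w
```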

7.4.3. Iterations

Let

$s_i = E(Z_i \mid X_i) = \frac{\delta\sqrt{\phi(x_i)}}{\alpha}\,\frac{K_0\left(\alpha\delta\sqrt{\phi(x_i)}\right)}{K_1\left(\alpha\delta\sqrt{\phi(x_i)}\right)}$

where

$\phi(x_i) = 1 + \frac{(x_i - \mu)^2}{\delta^2}$

and

$\alpha = \sqrt{\gamma^2 + \beta^2}$

Therefore $s_i$ is a function of $\alpha, \beta, \delta, \gamma, \mu$.

Next, let

$w_i = E\left(\frac{1}{Z_i} \mid X_i\right) = \frac{\alpha}{\delta\sqrt{\phi(x_i)}}\,\frac{K_2\left(\alpha\delta\sqrt{\phi(x_i)}\right)}{K_1\left(\alpha\delta\sqrt{\phi(x_i)}\right)}$

which is also a function of $\alpha, \beta, \delta, \gamma, \mu$.

For computation, we have the following in the k-th iteration:

$s_i^{(k)} = \frac{\delta^{(k)}\sqrt{\phi^{(k)}(x_i)}}{\alpha^{(k)}}\,\frac{K_0\left(\alpha^{(k)}\delta^{(k)}\sqrt{\phi^{(k)}(x_i)}\right)}{K_1\left(\alpha^{(k)}\delta^{(k)}\sqrt{\phi^{(k)}(x_i)}\right)}$ (29)

$w_i^{(k)} = \frac{\alpha^{(k)}}{\delta^{(k)}\sqrt{\phi^{(k)}(x_i)}}\,\frac{K_2\left(\alpha^{(k)}\delta^{(k)}\sqrt{\phi^{(k)}(x_i)}\right)}{K_1\left(\alpha^{(k)}\delta^{(k)}\sqrt{\phi^{(k)}(x_i)}\right)}$ (30)

$\hat\delta^{(k+1)} = \sqrt{\frac{n}{\sum_{i=1}^n w_i^{(k)} - \frac{n^2}{\sum_{i=1}^n s_i^{(k)}}}}$ (31)

$\hat\gamma^{(k+1)} = \frac{\hat\delta^{(k+1)}}{\frac{1}{n}\sum_{i=1}^n s_i^{(k)}}$ (32)

The log-likelihood function of the NIG distribution is given by

$\log L = \sum_{i=1}^n \log\left\{\frac{\alpha\delta\,e^{\delta\gamma + \beta(x_i-\mu)}\,K_1\left(\alpha\sqrt{\delta^2 + (x_i-\mu)^2}\right)}{\pi\sqrt{\delta^2 + (x_i-\mu)^2}}\right\} = n\log\alpha + n(\delta\gamma - \beta\mu) + \beta\sum_{i=1}^n x_i - n\log\pi - \frac{1}{2}\sum_{i=1}^n \log\phi(x_i) + \sum_{i=1}^n \log K_1\left(\alpha\delta\,\phi(x_i)^{\frac{1}{2}}\right)$ (33)

The k-th iterate of the log-likelihood function of the NIG distribution is

$l^{(k)} = n\log\alpha^{(k)} + n\left(\delta^{(k)}\gamma^{(k)} - \beta^{(k)}\mu^{(k)}\right) + \beta^{(k)}\sum_{i=1}^n x_i - n\log\pi - \frac{1}{2}\sum_{i=1}^n \log\phi^{(k)}(x_i) + \sum_{i=1}^n \log K_1\left(\alpha^{(k)}\delta^{(k)}\,\phi^{(k)}(x_i)^{\frac{1}{2}}\right)$ (34)
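Putting the pieces together, the following compact EM sketch for the NIG model (our code, reusing nig_posterior above) alternates the E-step (29)-(30) with the updates (31)-(32) and the conditional updates from the Remarks at the end of Section 7, monitoring the log-likelihood (33) for convergence.

```python
import numpy as np
from scipy.special import kv

def em_nig(x, mu, alpha, beta, delta, tol=1e-6, max_iter=500):
    x = np.asarray(x)
    n = len(x)
    loglik = -np.inf
    for _ in range(max_iter):
        s, w = nig_posterior(x, mu, alpha, delta)          # E-step, (29)-(30)
        delta = np.sqrt(n / (w.sum() - n**2 / s.sum()))    # (31)
        gamma = delta / s.mean()                           # (32)
        beta = (np.sum(x * w) - x.mean() * w.sum()) / (n - s.mean() * w.sum())
        mu = x.mean() - beta * s.mean()
        alpha = np.sqrt(gamma**2 + beta**2)
        phi = 1 + (x - mu)**2 / delta**2
        new = (n * np.log(alpha) + n * (delta * gamma - beta * mu) + beta * x.sum()
               - n * np.log(np.pi) - 0.5 * np.sum(np.log(phi))
               + np.sum(np.log(kv(1, alpha * delta * np.sqrt(phi)))))  # (33)
        if abs(new - loglik) < tol * abs(new):             # stopping rule of Section 7
            return mu, alpha, beta, delta, new
        loglik = new
    return mu, alpha, beta, delta, loglik
```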

7.5. For Length Biased (Reciprocal) Inverse Gaussian Distribution

7.5.1. M-Step

$l_2(\delta, \gamma) = n\log\gamma - \frac{n}{2}\log(2\pi) + n\delta\gamma - \frac{\delta^2}{2}\sum_{i=1}^n \frac{1}{z_i} - \frac{\gamma^2}{2}\sum_{i=1}^n z_i - \frac{1}{2}\sum_{i=1}^n \log z_i$

$\frac{\partial l_2}{\partial\delta} = 0 \Rightarrow \hat\delta = \frac{n\hat\gamma}{\sum_{i=1}^n \frac{1}{z_i}}$

$\frac{\partial l_2}{\partial\gamma} = 0 \Rightarrow \hat\gamma = \sqrt{\frac{n}{\sum_{i=1}^n z_i - n^2\left(\sum_{i=1}^n \frac{1}{z_i}\right)^{-1}}}$

7.5.2. Posterior Expectation

$E(Z \mid X) = \frac{\delta\sqrt{\phi(x)}}{\alpha}\,\frac{K_1\left(\alpha\delta\sqrt{\phi(x)}\right)}{K_0\left(\alpha\delta\sqrt{\phi(x)}\right)}$ (35)

and

$E\left(\frac{1}{Z} \mid X\right) = \frac{\alpha}{\delta\sqrt{\phi(x)}}\,\frac{K_1\left(\alpha\delta\sqrt{\phi(x)}\right)}{K_0\left(\alpha\delta\sqrt{\phi(x)}\right)}$ (36)

7.5.3. Iterations

Let

$s_i = E(Z_i \mid X_i) = \frac{\delta\sqrt{\phi(x_i)}}{\alpha}\,\frac{K_1\left(\alpha\delta\sqrt{\phi(x_i)}\right)}{K_0\left(\alpha\delta\sqrt{\phi(x_i)}\right)}$

and

$w_i = E\left(\frac{1}{Z_i} \mid X_i\right) = \frac{\alpha}{\delta\sqrt{\phi(x_i)}}\,\frac{K_1\left(\alpha\delta\sqrt{\phi(x_i)}\right)}{K_0\left(\alpha\delta\sqrt{\phi(x_i)}\right)}$

For iterations we have

$s_i^{(k)} = \frac{\delta^{(k)}\sqrt{\phi^{(k)}(x_i)}}{\alpha^{(k)}}\,\frac{K_1\left(\alpha^{(k)}\delta^{(k)}\sqrt{\phi^{(k)}(x_i)}\right)}{K_0\left(\alpha^{(k)}\delta^{(k)}\sqrt{\phi^{(k)}(x_i)}\right)}$ (37)

$w_i^{(k)} = \frac{\alpha^{(k)}}{\delta^{(k)}\sqrt{\phi^{(k)}(x_i)}}\,\frac{K_1\left(\alpha^{(k)}\delta^{(k)}\sqrt{\phi^{(k)}(x_i)}\right)}{K_0\left(\alpha^{(k)}\delta^{(k)}\sqrt{\phi^{(k)}(x_i)}\right)}$ (38)

$\hat\gamma^{(k+1)} = \sqrt{\frac{n}{\sum_{i=1}^n s_i^{(k)} - \frac{n^2}{\sum_{i=1}^n w_i^{(k)}}}}$ (39)

$\hat\delta^{(k+1)} = \frac{\hat\gamma^{(k+1)}}{\frac{1}{n}\sum_{i=1}^n w_i^{(k)}}$ (40)

The log-likelihood function of the NRIG distribution is given by

$\log L = n\log\gamma - n\log\pi + n(\delta\gamma - \beta\mu) + \beta\sum_{i=1}^n x_i + \sum_{i=1}^n \log K_0\left(\alpha\delta\,\phi(x_i)^{\frac{1}{2}}\right)$ (41)

The k-th iterate becomes

$l^{(k)} = n\log\gamma^{(k)} - n\log\pi + n\left(\delta^{(k)}\gamma^{(k)} - \beta^{(k)}\mu^{(k)}\right) + \beta^{(k)}\sum_{i=1}^n x_i + \sum_{i=1}^n \log K_0\left(\alpha^{(k)}\delta^{(k)}\,\phi^{(k)}(x_i)^{\frac{1}{2}}\right)$ (42)

7.6. Special Case When the Index Parameter Is −3/2

$l_2 = 3n\log\delta - \frac{n}{2}\log(2\pi) - \frac{5}{2}\sum_{i=1}^n \log z_i - n\log(1 + \delta\gamma) + n\delta\gamma - \frac{1}{2}\sum_{i=1}^n \left(\frac{\delta^2}{z_i} + \gamma^2 z_i\right)$

$\frac{\partial l_2}{\partial\gamma} = 0 \Rightarrow \hat\gamma = \frac{\delta^2 - \bar z}{\delta\bar z}$

$\frac{\partial l_2}{\partial\delta} = 0 \Rightarrow \frac{3n}{\delta} - \frac{n\gamma}{1 + \delta\gamma} + n\gamma - \delta\sum_{i=1}^n \frac{1}{z_i} = 0$

Substituting for $\gamma$ gives

$\delta^4\left(n^2 - \sum_{i=1}^n z_i\sum_{i=1}^n \frac{1}{z_i}\right) + n\delta^2\sum_{i=1}^n z_i + \left(\sum_{i=1}^n z_i\right)^2 = 0$

$\left(n^2 - \sum_{i=1}^n z_i\sum_{i=1}^n \frac{1}{z_i}\right)t^2 + \left(n\sum_{i=1}^n z_i\right)t + \left(\sum_{i=1}^n z_i\right)^2 = 0$

i.e., $\left[\sum_{i=1}^n \frac{1}{z_i} - \frac{n}{\bar z}\right]t^2 - nt - n\bar z = 0$

where

$t = \delta^2$

Therefore

$\hat\delta = \left[\frac{n\sum_{i=1}^n z_i + \sum_{i=1}^n z_i\left(4\sum_{i=1}^n z_i\sum_{i=1}^n \frac{1}{z_i} - 3n^2\right)^{\frac{1}{2}}}{2\left[\sum_{i=1}^n z_i\sum_{i=1}^n \frac{1}{z_i} - n^2\right]}\right]^{\frac{1}{2}}$

7.6.1. Posterior Expectation

$E(Z \mid X) = \frac{\delta\sqrt{\phi(x)}}{\alpha}\,\frac{K_1\left(\alpha\delta\sqrt{\phi(x)}\right)}{K_2\left(\alpha\delta\sqrt{\phi(x)}\right)}$ (43)

and

$E\left(\frac{1}{Z} \mid X\right) = \frac{\alpha}{\delta\sqrt{\phi(x)}}\,\frac{K_3\left(\alpha\delta\sqrt{\phi(x)}\right)}{K_2\left(\alpha\delta\sqrt{\phi(x)}\right)}$ (44)

7.6.2. Iterations

Let

$s_i = E(Z_i \mid X_i) = \frac{\delta\sqrt{\phi(x_i)}}{\alpha}\,\frac{K_1\left(\alpha\delta\sqrt{\phi(x_i)}\right)}{K_2\left(\alpha\delta\sqrt{\phi(x_i)}\right)}$ (45)

and

$w_i = E\left(\frac{1}{Z_i} \mid X_i\right) = \frac{\alpha}{\delta\sqrt{\phi(x_i)}}\,\frac{K_3\left(\alpha\delta\sqrt{\phi(x_i)}\right)}{K_2\left(\alpha\delta\sqrt{\phi(x_i)}\right)}$ (46)

For iterations we have

$s_i^{(k)} = \frac{\delta^{(k)}\sqrt{\phi^{(k)}(x_i)}}{\alpha^{(k)}}\,\frac{K_1\left(\alpha^{(k)}\delta^{(k)}\sqrt{\phi^{(k)}(x_i)}\right)}{K_2\left(\alpha^{(k)}\delta^{(k)}\sqrt{\phi^{(k)}(x_i)}\right)}$ (47)

$w_i^{(k)} = \frac{\alpha^{(k)}}{\delta^{(k)}\sqrt{\phi^{(k)}(x_i)}}\,\frac{K_3\left(\alpha^{(k)}\delta^{(k)}\sqrt{\phi^{(k)}(x_i)}\right)}{K_2\left(\alpha^{(k)}\delta^{(k)}\sqrt{\phi^{(k)}(x_i)}\right)}$ (48)

$\hat\delta^{(k+1)} = \left[\frac{n\sum_{i=1}^n s_i^{(k)} + \sum_{i=1}^n s_i^{(k)}\left(4\sum_{i=1}^n s_i^{(k)}\sum_{i=1}^n w_i^{(k)} - 3n^2\right)^{\frac{1}{2}}}{2\left[\sum_{i=1}^n s_i^{(k)}\sum_{i=1}^n w_i^{(k)} - n^2\right]}\right]^{\frac{1}{2}}$ (49)

$\hat\gamma^{(k+1)} = \frac{\hat\delta^{(k+1)}}{\frac{1}{n}\sum_{i=1}^n s_i^{(k)}} - \frac{1}{\hat\delta^{(k+1)}}$ (50)

The log-likelihood function of the Normal-$GIG(-\frac{3}{2}, \delta, \gamma)$ is given by

$\log L = -n\log\pi - n\log(1 + \delta\gamma) + 2n\log\alpha + n\log\delta + n(\delta\gamma - \beta\mu) + \beta\sum_{i=1}^n x_i - \sum_{i=1}^n \log\phi(x_i) + \sum_{i=1}^n \log K_2\left(\alpha\delta\,\phi(x_i)^{\frac{1}{2}}\right)$ (51)

The k-th iterate becomes

$l^{(k)} = -n\log\pi - n\log\left(1 + \delta^{(k)}\gamma^{(k)}\right) + 2n\log\alpha^{(k)} + n\log\delta^{(k)} + n\left(\delta^{(k)}\gamma^{(k)} - \beta^{(k)}\mu^{(k)}\right) + \beta^{(k)}\sum_{i=1}^n x_i - \sum_{i=1}^n \log\phi^{(k)}(x_i) + \sum_{i=1}^n \log K_2\left(\alpha^{(k)}\delta^{(k)}\,\phi^{(k)}(x_i)^{\frac{1}{2}}\right)$ (52)

7.7. Special Case When the Index Parameter Is 3/2

7.7.1. M-Step

$l_2 = \sum_{i=1}^n \log g_4(z_i) = -\frac{n}{2}\log(2\pi) - n\log(1 + \delta\gamma) + 3n\log\gamma + n\delta\gamma + \frac{1}{2}\sum_{i=1}^n \log z_i - \frac{\delta^2}{2}\sum_{i=1}^n \frac{1}{z_i} - \frac{\gamma^2}{2}\sum_{i=1}^n z_i$

$\frac{\partial l_2}{\partial\delta} = 0 \Rightarrow \frac{n\gamma^2\delta}{1 + \delta\gamma} - \delta\sum_{i=1}^n \frac{1}{z_i} = 0$

$\hat\delta = \frac{n\hat\gamma^2 - \sum_{i=1}^n \frac{1}{z_i}}{\hat\gamma\sum_{i=1}^n \frac{1}{z_i}}$

Similarly

$\frac{\partial l_2}{\partial\gamma} = 0 \Rightarrow \frac{\delta^2\gamma}{1 + \delta\gamma} + \frac{3}{\gamma} - \gamma\bar z = 0$

$\gamma^2\delta^2 + \left(3\gamma - \gamma^3\bar z\right)\delta - \gamma^2\bar z + 3 = 0$

Substituting for $\delta$ and letting $\gamma^2 = t$, we obtain

$\left(n^2 - \sum_{i=1}^n z_i\sum_{i=1}^n \frac{1}{z_i}\right)t^2 + \left(n\sum_{i=1}^n \frac{1}{z_i}\right)t + \left(\sum_{i=1}^n \frac{1}{z_i}\right)^2 = 0$

Therefore

$t = \frac{-b \pm \sqrt{b^2 - 4ac}}{2a}$

where

$a = n^2 - \sum_{i=1}^n z_i\sum_{i=1}^n \frac{1}{z_i}, \qquad b = n\sum_{i=1}^n \frac{1}{z_i}, \qquad c = \left(\sum_{i=1}^n \frac{1}{z_i}\right)^2$

$\hat\gamma = \left[\frac{-b + \sqrt{b^2 - 4ac}}{2a}\right]^{\frac{1}{2}}$

7.7.2. E-Step

Posterior Expectation

$E(Z \mid X) = \frac{\delta\sqrt{\phi(x)}}{\alpha}\,\frac{K_2\left(\alpha\delta\sqrt{\phi(x)}\right)}{K_1\left(\alpha\delta\sqrt{\phi(x)}\right)}$ (53)

$E\left(\frac{1}{Z} \mid X\right) = \frac{\alpha\,K_0\left(\alpha\delta\sqrt{\phi(x)}\right)}{\delta\sqrt{\phi(x)}\,K_1\left(\alpha\delta\sqrt{\phi(x)}\right)}$ (54)

7.8. Iterations

Let

$s_i = E(Z_i \mid X_i) = \frac{\delta\sqrt{\phi(x_i)}}{\alpha}\,\frac{K_2\left(\alpha\delta\sqrt{\phi(x_i)}\right)}{K_1\left(\alpha\delta\sqrt{\phi(x_i)}\right)}$

and

$w_i = E\left(\frac{1}{Z_i} \mid X_i\right) = \frac{\alpha\,K_0\left(\alpha\delta\sqrt{\phi(x_i)}\right)}{\delta\sqrt{\phi(x_i)}\,K_1\left(\alpha\delta\sqrt{\phi(x_i)}\right)}$

For iterations we have

$s_i^{(k)} = \frac{\delta^{(k)}\sqrt{\phi^{(k)}(x_i)}}{\alpha^{(k)}}\,\frac{K_2\left(\alpha^{(k)}\delta^{(k)}\sqrt{\phi^{(k)}(x_i)}\right)}{K_1\left(\alpha^{(k)}\delta^{(k)}\sqrt{\phi^{(k)}(x_i)}\right)}$ (55)

$w_i^{(k)} = \frac{\alpha^{(k)}\,K_0\left(\alpha^{(k)}\delta^{(k)}\sqrt{\phi^{(k)}(x_i)}\right)}{\delta^{(k)}\sqrt{\phi^{(k)}(x_i)}\,K_1\left(\alpha^{(k)}\delta^{(k)}\sqrt{\phi^{(k)}(x_i)}\right)}$ (56)

$\hat\gamma^{(k+1)} = \sqrt{t^{(k)}}$ (57)

$\hat\delta^{(k+1)} = \frac{n\left(\hat\gamma^{(k+1)}\right)^2 - \sum_{i=1}^n w_i^{(k)}}{\hat\gamma^{(k+1)}\sum_{i=1}^n w_i^{(k)}}$ (58)

The log-likelihood function for the Normal-$GIG(\frac{3}{2}, \delta, \gamma)$ distribution is given by

$l = 3n\log\gamma + n(\delta\gamma - \beta\mu) - n\log\left(\alpha\pi(1 + \delta\gamma)\right) + n\log\delta + \beta\sum_{i=1}^n x_i + \frac{1}{2}\sum_{i=1}^n \log\phi(x_i) + \sum_{i=1}^n \log K_1\left(\alpha\delta\sqrt{\phi(x_i)}\right)$ (59)

For the k-th iterate we have

$l^{(k)} = 3n\log\gamma^{(k)} + n\left(\delta^{(k)}\gamma^{(k)} - \beta^{(k)}\mu^{(k)}\right) - n\log\left(\alpha^{(k)}\pi\left(1 + \delta^{(k)}\gamma^{(k)}\right)\right) + n\log\delta^{(k)} + \beta^{(k)}\sum_{i=1}^n x_i + \frac{1}{2}\sum_{i=1}^n \log\phi^{(k)}(x_i) + \sum_{i=1}^n \log K_1\left(\alpha^{(k)}\delta^{(k)}\sqrt{\phi^{(k)}(x_i)}\right)$ (60)

Remarks:

1) For all the proposed models, the $\beta$, $\mu$ and $\alpha$ parameters of the conditional distribution are updated as follows:

$\hat\beta^{(k+1)} = \frac{\sum_{i=1}^n (x_i - \bar x)\,w_i^{(k)}}{n - \frac{1}{n}\sum_{i=1}^n s_i^{(k)}\sum_{i=1}^n w_i^{(k)}}$

$\hat\mu^{(k+1)} = \bar x - \hat\beta^{(k+1)}\,\frac{1}{n}\sum_{i=1}^n s_i^{(k)}$

$\hat\alpha^{(k+1)} = \left[\left(\hat\gamma^{(k+1)}\right)^2 + \left(\hat\beta^{(k+1)}\right)^2\right]^{\frac{1}{2}}$

2) The stopping criterion is

$\frac{\left|l^{(k)} - l^{(k-1)}\right|}{\left|l^{(k)}\right|} < tol$

where tol is the chosen tolerance level, e.g., $10^{-6}$.

3) The initial values used are the moment estimates of the NIG distribution, as proposed by Karlis [23].

8. Application

8.1. Fitting of the Proposed Models

The data used in this research are the Range Resource Corporation weekly returns for the period 3/01/2000 to 1/07/2013, with 702 observations. The histogram of the weekly log-returns in Figure 1 shows that the data are negatively skewed and exhibit heavy tails. The Q-Q plot shows that the normal distribution is not a good fit for the data, especially at the tails.

Table 5 provides descriptive statistics for the return series under consideration. The excess kurtosis of 2.768252 indicates the leptokurtic behaviour of the returns: the log-returns have a distribution with relatively heavier tails than the normal distribution. The skewness of −0.1886714 indicates that the two tails of the returns behave slightly differently.

The proposed models are now fitted to the RRC weekly log-returns. Using the sample estimates and the NIG estimators on the RRC data, we obtain the following estimates as initial values for the EM algorithm (see Karlis [23]):

$\hat\alpha = 0.3722511$, $\hat\beta = 0.02456226$, $\hat\delta = 2.950864$, $\hat\mu = 0.4284473$.

Figure 1. Histogram and Q-Q plot of RRC weekly log-returns.

Table 5. Summary Statistics for RRC weekly log-returns.

The initial values were used in all the proposed models to obtain the maximum likelihood estimates shown in Table 6 below.

The parameter estimates from Table 6 are now fitted to the RRC weekly log-returns. Figures 2-5 show the histograms and Q-Q plots of the RRC returns fitted with the proposed models; they indicate that the proposed models fit the data well.

Table 7 presents the results of the Kolmogorov-Smirnov and Anderson-Darling tests performed on the models. All models produce high p-values, strong evidence that we cannot reject the null hypothesis that the returns data follow the proposed models.

Table 8 presents values of the Akaike Information Criterion (AIC), Bayesian Information Criterion (BIC) and log-likelihood. The values illustrate that the models are competitive alternatives to one another.

Model 3 has the lowest AIC and BIC and the highest log-likelihood; it is the best fit for the data.
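For reference, AIC and BIC follow from the maximized log-likelihood as $AIC = 2k - 2\log L$ and $BIC = k\log n - 2\log L$; in this paper each proposed model has $k = 4$ free parameters $(\mu, \beta, \delta, \gamma)$ and $n = 702$ observations, as the short sketch below (ours) assumes.

```python
import numpy as np

def aic_bic(loglik, k=4, n=702):
    """AIC = 2k - 2 log L;  BIC = k log(n) - 2 log L."""
    return 2 * k - 2 * loglik, k * np.log(n) - 2 * loglik
```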

Figure 2. Fitting Model 1 to RRC weekly returns.

Figure 3. Fitting Model 2 to RRC weekly returns.

Figure 4. Fitting Model 3 to RRC weekly returns.

Figure 5. Fitting Model 4 to RRC weekly returns.

Table 6. Estimates for the proposed models.

Table 7. Results from Kolmogorov-Smirnov Test and Anderson-Darling Test.

Table 8. AIC, BIC and Log-likelihood Values.

8.2. Risk Estimation and Backtesting

We use the parameter estimates for our proposed models to determine the VaR (Table 9) and ES (Table 10) at levels $\alpha \in \{0.001, 0.01, 0.05, 0.95, 0.99, 0.999\}$. The first three levels are used to measure the risk of a long position, while the last three are used to measure the risk of a short position. We apply the Kupiec likelihood ratio (LR) test given by Kupiec [16], which tests the hypothesis that the expected proportion of violations is equal to $\alpha$. The method consists of calculating $\tau(\alpha)$, the number of times the observed returns $x_t$ fall below (for a long position) or above (for a short position) the $\mathrm{VaR}_\alpha$ estimate at level $\alpha$, i.e., $x_t < \mathrm{VaR}_\alpha$ or $x_t > \mathrm{VaR}_{1-\alpha}$, and comparing the corresponding failure rate to $\alpha$.

The likelihood ratio statistic is given by

$LR = 2\log\left[\left(\frac{\tau(\alpha)}{n}\right)^{\tau(\alpha)}\left(1 - \frac{\tau(\alpha)}{n}\right)^{n - \tau(\alpha)}\right] - 2\log\left[\alpha^{\tau(\alpha)}(1 - \alpha)^{n - \tau(\alpha)}\right]$ (61)

where $\tau(\alpha)$ is the number of violations. Under the null hypothesis this statistic follows a $\chi^2$ distribution with one degree of freedom. Table 11 gives the number of violations for the models, and Table 12 gives the p-values of the Kupiec test.
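A sketch of the Kupiec test (our implementation of Equation (61), with hypothetical argument names): count the violations $\tau(\alpha)$ of the one-sided VaR bound, form the LR statistic as a difference of binomial log-likelihoods, and read off a p-value from $\chi^2_1$.

```python
import numpy as np
from scipy.stats import chi2

def kupiec_test(returns, var_level, alpha, long_position=True):
    """Kupiec LR test: H0 is that the violation probability equals alpha."""
    x = np.asarray(returns)
    n = len(x)
    tau = int(np.sum(x < var_level)) if long_position else int(np.sum(x > var_level))
    p_hat = min(max(tau / n, 1e-12), 1 - 1e-12)  # guard tau = 0 or tau = n

    def ll(p):  # binomial log-likelihood of tau violations in n observations
        return tau * np.log(p) + (n - tau) * np.log(1 - p)

    lr = 2 * (ll(p_hat) - ll(alpha))             # Equation (61)
    return tau, lr, chi2.sf(lr, df=1)            # reject H0 at 5% if p-value < 0.05
```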

Model 3 has the highest VaR and ES values, indicating that it performs better than the other models at the tails.

Table 9. VaR Values of RRC log-returns based on normal and proposed models.

Table 10. ES Values of RRC log-returns based on normal and proposed models.

Table 11. Number of violations of VaR for each distribution at different levels.

Table 12. P-values of the Kupiec test for each distribution at different levels.

Remark: At the 5 percent level of significance, the Normal distribution is rejected at levels 0.001, 0.95 and 0.999; it is also rejected at level 0.99 at the 10 percent level of significance. The Normal Weighted Inverse Gaussian distributions were all effective and well specified at all VaR levels. It can be noted that Model 3 (Normal-$GIG(-\frac{3}{2}, \delta, \gamma)$) outperforms the other models at level 0.99.

9. Conclusions

In this paper we obtained VaR and ES using NWIG distributions. We first constructed Normal Variance-Mean Mixtures with $GIG(-\frac{1}{2}, \delta, \gamma)$, $GIG(\frac{1}{2}, \delta, \gamma)$, $GIG(-\frac{3}{2}, \delta, \gamma)$ and $GIG(\frac{3}{2}, \delta, \gamma)$ as mixing distributions, and we have shown that these mixing distributions are WIG distributions.

The parameters of the mixed models were estimated using the EM algorithm. The iterative schemes used are based on explicit solutions of the normal equations. We used method-of-moments estimates of the NIG as initial values and obtained monotonic convergence.

We used AIC, BIC and log-likelihood for model selection. The Normal-$GIG(-\frac{3}{2}, \delta, \gamma)$ was found to be the best model. The results show that the three NWIG distributions are as good as the NIG for VaR and ES computation.

Further work can be done on Normal mixtures whose mixing distributions are finite mixtures of WIG distributions.

Conflicts of Interest

The authors declare no conflicts of interest.

References

[1] Zhang, Y. and Nadarajah, S. (2017) A Review of Backtesting for Value at Risk. Communications in Statistics—Theory and Methods, 47, 3616-3639.
[2] Wimmerstedt, L. (2015) Backtesting Expected Shortfall: The Design and Implementation of Different Backtest. Master’s Thesis in Mathematical Statistics, Royal Institute of Technology, Stockholm.
[3] Artzner, P., Delbaen, F., Eber, J.-M. and Heath, D. (1997) Thinking Coherently. Risk Journal, 10, 68-71.
[4] Artzner, P., Delbaen, F., Eber, J.-M. and Heath, D. (1997) Coherent Measures of Risk. Mathematical Finance, 9, 203-228. https://doi.org/10.1111/1467-9965.00068
[5] Gneiting, T. (2011) Making and Evaluating Point Forecasts. Journal of the American Statistical Association, 106, 746-762. https://doi.org/10.1198/jasa.2011.r10138
[6] Basel Committee (2013) Fundamental Review of the Trading Book: A Revised Market Risk Framework.
[7] Nadarajah, S., Zhang, B. and Chan, S. (2014) Estimation Methods for Expected Shortfall. Quantitative Finance, 14, 271-291.
https://doi.org/10.1080/14697688.2013.816767
[8] Barndorff-Nielsen, O.E. (1977) Exponentially Decreasing Distributions for the Logarithm of Particle Size. Proceedings of the Royal Society of London. Series A, 353, 409-419. https://doi.org/10.1098/rspa.1977.0041
[9] Barndorff-Nielsen, O.E. (1997) Normal Inverse Gaussian Distribution and Stochastic Volatility Modelling. Scandinavian Journal of Statistics, 24, 1-13.
https://doi.org/10.1111/1467-9469.00045
[10] Dempster, A.P., Laird, N.M. and Rubin, D. (1977) Maximum Likelihood from Incomplete Data Using the EM Algorithm. Journal of the Royal Statistical Society: Series B, 39, 1-38. https://doi.org/10.1111/j.2517-6161.1977.tb01600.x
[11] Fisher, R.A. (1934) The Effect of Methods of Ascertainment upon the Estimation of Frequencies. The Annals of Human Genetics, 6, 13-25.
https://doi.org/10.1111/j.1469-1809.1934.tb02105.x
[12] Patil, G.P. and Rao, C.R. (1978) Weighted Distributions and Size-Biased Sampling with Applications to Wildlife Populations and Human Families. Biometrics, 34, 179-189. https://doi.org/10.2307/2530008
[13] Akman, H.O. and Gupta, R.C. (1992) A Comparison of Various Estimators of the Mean of an Inverse Gaussian Distribution. Journal of Statistical Computation and Simulation, 40, 71-81. https://doi.org/10.1080/00949659208811366
[14] Gupta, R.C. and Akman, O. (1995) On Reliability Studies of a Weighted Inverse Gaussian Distribution Model. Statistical Papers, 38, 445-452.
https://doi.org/10.1007/BF02925999
[15] Gupta, R.C. and Kundu, D. (2011) Weighted Inverse Gaussian—A Versatile Lifetime Model. Journal of Applied Statistics, 38, 2695-2708.
https://doi.org/10.1080/02664763.2011.567251
[16] Kupiec, P. (1995) Techniques for Verifying the Accuracy of Risk Measurement Models. The Journal of Derivatives, 3, 73-84. https://doi.org/10.3905/jod.1995.407942
[17] Rappoport, P. (1993) A New Approach: Average Shortfall. Technical Report, J.P. Morgan.
[18] Yamai, Y. and Yoshiba, T. (2002) On the Validity of Value at Risk Comparative Analyses with Expected Shortfall. Monetary and Economic Studies.
[19] Zhang, Y., Chu, J., Chan, S. and Chan, B. (2019) The Generalised Hyperbolic Distribution and Its Subclass in Analysis of a New Era of Cryptocurrencies: Ethereum and Its Financial Risk. Physica A, 526, 1-12.
https://doi.org/10.1016/j.physa.2019.04.136
[20] Bams, D. and Wilhouwer, J.L. (2001) Empirical Issues in Value at Risk. ASTIN Bulletin, 31, 299-315. https://doi.org/10.2143/AST.31.2.1007
[21] Abramowitz, M. and Stegun, I.A. (1972) Handbook of Mathematical Functions. Dover, New York.
[22] Kostas, F. (2007) Tests of Fit for Symmetric Variance Gamma Distributions. UADPhilEcon, National and Kapodistrian University of Athens, Athens.
[23] Karlis, D. (2002) An EM Type Algorithm for Maximum Likelihood Estimation of the Normal-Inverse Gaussian Distribution. Statistics and Probability Letters, 57, 43-52.
https://doi.org/10.1016/S0167-7152(02)00040-8
