Forecasting Crude Oil Price Volatility by Heston Model

Abstract

Given the significance and complexity of forecasting the crude oil price volatility, this paper introduces the Heston model to predict volatility dynamics of crude oil price. The high-frequency intra-day data of the West Texas Intermediate (WTI) market serves to model the problem. Furthermore, the study used the Euler-Maruyama scheme to simulate Heston model using an error analysis. On the other hand, the study used the mean square error (MSE), the mean average error (MAE), and the root means square error (RMSE) for the forecast accuracy of the GARCH-type models. The results of the error analysis indicated that Heston’s stochastic volatility model is more consistent with oil performance data than traditional GARCH-class models. The study’s findings demonstrated that the Heston model is more economical in terms of setup and is capable of handling stylized facts.

Share and Cite:

Mwanakatwe, P. , Daniel, J. and Nzungu, K. (2023) Forecasting Crude Oil Price Volatility by Heston Model. Journal of Mathematical Finance, 13, 408-420. doi: 10.4236/jmf.2023.133026.

1. Introduction

Globally, crude oil prices, gold prices, currency exchange rates and other macroeconomic indicators play a fundamental role in the global economy. To clarify, the crude oil price is always treated more seriously since oil is the blood in the vessels of the economy. Forecasting crude oil price volatility provides market participants with valuable information regarding market uncertainty. Overall, oil price variability implies enormous losses or gains and therefore lower incomes or higher reserves to achieve the development targets [1] . Therefore, forecasting the price of crude oil and its volatility has attracted a lot of attention from researchers, policy makers and investors.

Moreover, instabilities in the price system affect governments in many sectors, such as, planning and decision-making regarding the revenue system. Oil price uncertainties are also linked to inflation and have implications for the cost of consumer products and goods in industry. As a result, it may have a critical effect on the policy order of petroleum-dependent economies. Furthermore, the association between the price of oil and agricultural commodity prices is important for the world food policy, and the sensitivity of the different economic agents on which the main input in the production process is oil [2] [3] [4] .

Numerous studies consider the modeling and forecasting the volatility of crude oil price by using improved GARCH models. Although various work has been done to find the most suitable model that offers the best performance of out-of-sample forecasts, none of the models has always outperformed the other [2] . For example, the Autoregressive Conditional Heteroscedasticity (ARCH) model, considered a powerful tool to describe conditional, and historical volatilities, was introduced by Engle [5] [6] [7] . This was followed by the introduction of the generalized autoregressive conditional heteroskedasticity (GARCH) model as an extension to the ARCH models [8] . Moreover, different scholars have developed variations of GARCH models, for example integrated GARCH (IGARCH), exponential GARCH (EGARCH), asymmetrical power GARCH (APGARCH), and fractional integrated GARCH (FIGARCH) [5] [6] [9] . These extension models were enhancing the GARCH model to capture the features of the time series data. Therefore, from an investor’s perspective, the forecast nature of future returns and volatility in oil markets is critical in determining asset prices, hedging, pricing of derivatives, and risk control. Given its vital global significance and the minimization of the negative effect fluctuations of oil price, the academic literature over the past few years has focused on modeling and forecasting oil price volatility. The market for crude oil sometimes seems at times relatively quiet, while other highly volatile markets [2] .

The upgraded GARCH-type models are capable of capturing the most important stylized facts [4] [5] [10] . For more information on stylised facts, please refer to the following studies [1] [2] [6] [7] [11] [12] . These models are popular for the modeling of time-variable conditional volatility as a deterministic function of lagged variance, lagged conditional squared residuals, and the past observations with the future volatility [2] [13] . In addition, these models are parametric and usually assess daily, weekly or monthly volatility using sampled data at the same rate. However, these models may not capture the fat-tail property of financial data.

Currently, stochastic volatility (SV) models are alternative existing models to accurately capture stylized facts [7] . The SV models are non-parametric and are based on the time-continuous probability process. In addition, the variance in the SV models is treated as an unobserved quantity that admits a stochastic, and logarithmic first-order autoregressive processes. In general, these models extend the geometric Brownian motion model by incorporating an additional source of uncertainty. Therefore, in describing volatility behavior, the volatility approximation is more realistic [3] . Although the (SV) model is theoretically appealing, it is difficult, because the unobserved volatility process introduces the model into a non-linear model. This process leads to the likelihood function depending on the high-dimension integrals [7] [9] .

The study’s new contribution to the literature is the introduction of the Heston stochastic volatility model to forecast the price volatility of crude oil. The model develops the Black and Scholes model (BSM) by incorporating the process driven by a Cox-Ingersoll-Ross (CIR) process [14] [15] . Furthermore, the model can capture the volatility clustering, the leverage effect, and the heavy-tailed nature of the return distributions. In this model, while the volatility of the variance controls the kurtosis of the underlying asset return distribution, the correlation defines its asymmetry. To conclude, the model is analytical tractable [14] . Despite the advantages of the model, the fundamental challenge is the complexity of the approximation process. Compared to models like BSM, the implementation of the Heston model involves more sophisticated calculations, and a more difficult process for the model calibration. Since the closed-form solutions for non linear SDEs are rarely available or are too hard to obtain, numerical approximations are very important. To overcome this challenge, this study employs the Euler-Maruyama scheme to approximate the price volatility model.

To sum up, the main objective of this study is to introduce the Heston stochastic volatility model in forecasting price volatility of a crude oil. Moreover, the innovations of this study are; introduction of the Heston model to measure the price volatility of crude oil, and applying the Euler-Maruyama method to simulate the Heston model. Moreover, in the accuracy analysis of the model, the study compares the Heston model performance with the improved GARCH-type models by performing error analysis. Normally, the smallest error implies a good approximation measure. For improved GARCH-type models, three error measures to evaluate the forecasting accuracy of models were employed. These are the mean squared error (MSE), the root means squared error (RMSE), and the mean absolute error (MAE).

The remaining sections of this paper are organized as follows. In Section 2, the mathematical model is presented. Moreover, Section 3 presents the Data and preliminary analysis; a numerical example is in Section 4. Finally, Section 5 concludes the study.

1.1. Mathematical Model

In this study, the crude oil market is assumed to be complete, frictionless, and continuously open over a fixed time interval [0, T]. Moreover, the financial market uncertainties are defined and modeled using a complete filtered probability space ( Ω , F , P ) . In the probability space above, Ω is the sample space, F = F ( t ) , t 0 , is the information available at a time t, and P is the historical probability measure.

1.2. The Heston Model

In this study, we consider that crude oil price dynamics assume a stochastic volatility model, particularly the Heston model. The introduction of a stochastic volatility model is because the volatility is not constant or deterministic instead it is a random process. Hence, the asset price at time t follows the system of stochastic differential equations (SDE) given:

d Y ( t ) = Y ( t ) ( μ d t + V ( t ) d W 1 ( t ) ) , (1)

d V ( t ) = β ( θ V ( t ) ) d t + σ V ( t ) d W 2 ( t ) , (2)

Y ( 0 ) = Y 0 and V ( 0 ) = V 0 . (3)

In addition, β is the mean reversion rate, θ is the long-run variance, and the volatility of volatility (variance of volatility) is denoted by σ . Thus, the Heston volatility model is the system of two correlated SDEs namely the price of asset and the volatility process, see (1) and (2). In addition, to ensure that V ( t ) is almost surely non-negative (the Feller condition has to be net), that is, 2 β θ > σ 2 . Moreover, W 1 ( t ) and W 2 ( t ) are standard Brownian motions with a nonzero correlation.

1.3. The Euler-Maruyama Method

The simplest and most useful scheme for the approximation of the numerical solution of the stochastic differential equations is the Euler-Maruyama. This is a generalization of the Euler’s approach method in ordinary differential equations and stochastic differential equations [16] . Consider the SDE below:

d Y ( t ) = μ ( t , Y ( t ) ) d t + σ ( t , Y ( t ) ) d W ( t ) , Y ( 0 ) = Y 0 , (4)

where σ and μ are scalar functions, W ( t ) is the Wiener process, and Y ( 0 ) is the initial condition. The solution to (4) admits the process Y t :

Y t = Y 0 + 0 t μ ( s , Y s ) d s + 0 t σ ( s , Y s ) d W s . (5)

To approximate the solution of the SDE on the interval [ 0, T ] , time is discretized into N equal subintervals with width Δ t = T N . Therefore, the Euler-Maruyama approximation to the actual solution of (4) is the Markov chain X expressed in the form:

Y n + 1 = Y n + μ ( t n , Y n ) Δ t + σ ( t n , Y n ) Δ W n , (6)

with Y n + 1 standing for an approximation, Δ t = t n + 1 t n = T N , Δ W = W n + 1 W n , and n = 0 , 1 , 2 , , N . From the expression, Δ W is normally distributed, i.e., Δ W ~ N ( 0, Δ t ) .

Proposition 1.1 Assume that Y n + 1 , and V n + 1 are discrete approximate of Y and V respectively. The corresponding continuous Euler-Maruyama method is then used to approximate the solution to the Heston model is given by:

Y n + 1 = Y n ( 1 + μ Δ t + V n Δ W n ) ,

V n + 1 = V n + β ( θ V n ) Δ t + σ V n Δ W n .

where Y n + 1 is the approximate price, and V n + 1 is the approximate volatility.

1.4. Numerical Approximation of the Model

Simulation methods are critical for approximating numerical solutions of SDEs. For the model efficiency, the focus is on analyzing errors measured at t = T by quantity.

ε Δ t = E | Y ( T ) Y ¯ ( T ) | .

This process determines whether the method is converging to the exact solution. The Euler-Maruyama scheme exhibits both strongly and weakly convergent properties [17] [18] [19] . The strong (convergent) error measures the error of the approximate Y sampling paths on average, the pathwise error is the random quantity that satisfies:

l i m Δ t 0 E ( | Y ( T ) Y ¯ ( T ) | ) = 0. (7)

From above expression, E is the expected value, while Y ¯ ( T ) is the approximation of Y ( t ) at time t = T calculated with a constant step Δ t .

Conversely, it is weakly convergent (weak error) if the random quantity satisfies:

l i m Δ t 0 | E [ f ( Y ( T ) ) ] E ( f ( Y ¯ ( T ) ) ) | = 0, (8)

for all f polynomials.

1.5. Approximation of GARCH-Type Models

In the present study, the symmetric GARCH and asymmetric GARCH (EGARCH and TGARCH) models compare with the Heston model. These models are discussed extensively in the few studies mentioned here [2] [7] [9] [12] [20] [21] . To assess the forecast performance of the models above, the Mean Square Error (MSE), Root Mean Square Error (RMSE), and Mean Absolute Error (MAE) are used. These error measures are the most well-known for testing model accuracy [2] . The model with the smallest error is referred to be the best one.

1.5.1. The Means Squared Error (MSE)

The MSE is regularly used to compare forecasting performance of models. It is considered the most suitable measure for determining methods that avoid significant errors. Thus, the MSE is computed by averaging the square of the difference between the original and predicted values of the data.

RMSE = t n e t 2 n (9)

where, e = Y T Y ¯ .

1.5.2. The Root Means Squared Error (RMSE)

RMSE is the standard deviation for errors that occur when a forecast is performed. This is similar to mean square error (MSE), but the root of the value is taken into consideration when determining the accuracy of the model. It is expressed by:

RMSE = t n e t 2 n (10)

1.5.3. The Mean Absolute Error (MAE)

It is expressed as:

MAE = t = 1 n | e t Y t | n (11)

where | e t Y t | is the absolute error calculated on the values adjusted for a specific forecasting method.

2. Data Description and Preliminary Analysis

To carry out the numerical illustration, this study employs daily spot prices of West Texas Intermediate (WTI) observations. The data is taken from the U.S. Energy Information Administration (EIA). In brief, the WTI crude oil is a specific grade of crude oil and one of the main three benchmarks in oil pricing, along with Brent and Dubai Crude.

The sampling period covers the period from January 4, 2009, to December 31, 2019, and consists 2761 observations. Since the logarithmic returns are analytically tractable when sub-period returns are related together to form returns over long intervals, then the crude oil price of each series is transformed into logarithmic returns. Study focuses on the logarithmic behavior of the crude oil returns. Furthermore, simulations using the Heston model and the application of the Euler-Maruyama numerical method to approximate the model are performed.

2.1. Preliminary Analysis

The preliminary analysis reveals significant differences in price movements during the period under consideration. Similarly, there are signs of positive asymmetry in the statistical distribution of crude oil prices, implying extreme of the right tail. Also, the returns are non-Normal, which proved significantly asymmetrical positive and excess kurtosis, as is expected from daily returns.

For the kurtosis, the price of crude oil is leptokurtic, which implies the fat tails than the normal distribution, see Table 1. Particularly, Table 1 presents the following; descriptive statistics, the unit root test, the normality test, and the ARCH effect test for crude oil. The results indicate that the mean returns are measured by standard deviation. In addition, returns show leptokurtic features (i.e., fat-tailed) and therefore, the variance of oil prices for crude primarily reflects irregular but extreme differences. Similarly, returns are skewed positively, implying that the series has a longer right tail than the left tail.

Moreover, the unit root test, and the Augmented Dickey-Fuller (ADF) tests examine the presence of a unit root [22] [23] . The method tests the hypotheses H0: A series is non-static versus H1: A series is stationary. The results of unit root test for the data, altogether at levels 0.01 and 0.05 reject the null hypothesis that there is a unit root in the returns.

Furthermore, the ARCH effect test looks at the existence of a heteroscedasticity of crude oil data. The ARCH effect test reveals the presence of strong conditional heteroscedasticity for the crude oil prices, which is common in financial data. Therefore, there is sufficient evidence to reject the null hypothesis that there is no ARCH effect. Likewise, the Jarque-Bera (JB) statistic using kurtosis and skewness test information illustrates non-normality. Therefore, it leads to the rejection of the null hypothesis.

2.2. Volatility and Approximation of Model Parameters

Considering Y t as the price of crude oil on day t, then daily spot price return transformation is:

S t = log ( Y t Y t 1 ) , (12)

where S t is the daily return on crude oil, Y t is the current day crude oil price, and Y t 1 is the crude oil price of the previous day. Also, the daily square returns are taken as an approximation of the real volatilities. Using the transformation results the annual and daily volatilities are determined with their respective variances as presented in Table 2. Therefore, we find the daily and annual volatility of crude oil prices ( σ ) with the respective variances ( σ 2 ) for each year by using the Parkinson’s extreme values method [24] .

3. Numerical Illustrations

The daily observations of crude oil return for the period from 02.01.2019 to 31.12.2019 amount to 250 days are used in this study. From these observations, we compute the model parameters (see Table 3).

Figures 1-3 show the spot prices, returns, and volatility of crude oil respectively. The results reveal an asymmetrical pattern in crude oil price behavior and returns. It implies that the performance patterns suggest evidence of volatility clustering. Thus, periods of relatively low volatility are the response to the periods of high volatility. The unusual peaks in these figures are evidence of major unstable trends in crude oil price returns.

Table 1. Basic statistics for the crude oil data.

Note: ***indicate statistical significance at 0.05 level.

Table 2. Crude oil volatilities, WTI 2009-2019.

Table 3. Computed parameters.

Particularly, Figure 1 shows the change in the spot price for the crude oil. Precisely, it shows that over the 2014-2016, crude oil prices declined sharply, reflecting increased uncertainty in the crude oil market. Moreover, Figure 2 illustrates the dynamics of crude oil returns. It implies that a number of global supply and demand tremors cause significant fluctuations in crude oil markets. Generally speaking, crude oil prices tend to reflect a high degree of uncertainty over time.

3.1. Simulation Results

This section starts by focusing on the logarithmic behavior of crude oil prices and conducting simulations using Heston’s stochastic volatility model. First of all, the January 2009 to January 2019 observations were used for parameters computation. Moreover, to forecast, the data from 02.01.2019 to 31.12.2019 were used. That is, for the parameters estimation the interval t = 1 to N was used while for forecasting N + 1 was used. This study uses Euler-Maruyama’s numerical method to simulate the Heston model.

In addition, the following parameters were also used in the simulations: the initial time “ t = 0 ”, the terminal time “ T = 1 ”, the number of discretization between “ n = 100 ”, Δ t = 0.01 (the uniform mesh size), the number of paths) “ N = 1000 ”. Figure 3 displays the results of the crude oil volatility simulation. The simulated results suggest that crude oil prices tend to be asymmetrical with lower kurtosis.

3.2. Error Analysis

Normally, for the efficacy of the model, the emphasis is on error analysis. We start by defining the error by

E | Y ( T ) Y ¯ ( T ) | .

To test the convergence strength for the Euler-Maruyama, we use N = 2 9 dicretized Brownian paths over [ 0,1 ] , and five different stepsizes: Δ t = 2 p 1 d t for 1 p 5 . Results for this process are shown in Table 5. These results are compared with the results obtained from GARCH-type models as shown in Table 5. To conclude, the model with the lowest error implies a more accurate estimate of the model.

Table 4, and Table 5 present error analysis results for the Euler-Maruyama scheme and GARCH-type models respectively. Particularly, Table 4 shows the error analysis results for the Heston model. Likewise, the error analysis for the GARCH-type models is presented in Table 5 as shown. The results in Table 5 show that the GARCH model offers the best accurate forecast than the two improved GARCH-type models (EGARCH and TGARCH). In general, results in Table 4, and Table 5, reveal the small errors in Heston model approximation than the improved GARCH models. Basing on results, it is evident that the Heston model can estimate better crude oil price volatility than the counterpart improved GARCH-type models.

Figure 1. Crude oil Spot price.

Figure 2. Crude oil logarithmic returns.

4. Conclusion

This study introduces the Heston stochastic volatility model to model the crude oil price volatility. It applied the Euler-Maruyama scheme to approximate the Heston stochastic volatility model. Moreover, the differences in the trend of prices for the covered period are revealed by simulation results. Furthermore, results show that the Heston model fits data well compared to counterpart improved GARCH-type models in the approximation of the price volatility.

Figure 3. Crude oil volatilities.

Table 4. The error analysis, the Euler-Maruyama scheme.

Table 5. The Error analysis, the GARCH-type models.

Because of the strategic role of crude oil price volatility and its effects to all countries globally, it is essential to employ different forecasting methods. In the future study, we will work on jump-diffusion models to investigate the behavior of crude oil price volatility.

Conflicts of Interest

The authors declare no conflicts of interest.

References

[1] Nakajima, J. (2009) Bayesian Analysis of GARCH and Stochastic Volatility: Modeling Leverage, Jumps and Heavy-Tails for Financial Time Series. Technical Report Mimeo, Department of Statistical Science, Duke University, Durham.
[2] Charles, A. and Darné, O. (2017) Forecasting Crude-Oil Market Volatility: Further Evidence with Jumps. Energy Economics, 67, 508-519.
https://doi.org/10.1016/j.eneco.2017.09.002
[3] Crisostomo, R. (2015) An Analysis of the Heston Stochastic Volatility Model: Implementation and Calibration Using Matlab. https://doi.org/10.2139/ssrn.2527818
[4] Wang, Y., Wu, C. and Yang, L. (2016) Forecasting Crude Oil Market Volatility: A Markov Switching Multifractal Volatility Approach. International Journal of Forecasting, 32, 1-9. https://doi.org/10.1016/j.ijforecast.2015.02.006
[5] Cont, R. (2001) Empirical Properties of Asset Returns: Stylized Facts and Statistical Issues. Quantitative Finance, 1, 223-236. https://doi.org/10.1080/713665670
[6] Lee, J. (2000) Change in Volatility in the Won/Us Dollar Daily Exchange Rate: Stochastic Volatility Model. Asia-Pacific Financial Markets, 7, 83-96.
[7] Sadorsky, P. (2005) Stochastic Volatility Forecasting and Risk Management. Applied Financial Economics, 15, 121-135.
https://doi.org/10.1080/0960310042000299926
[8] Bollerslev, T. (1986) General Autoregressive Conditional Heteroscedasticity. Journal of Econometrics, 31, 307-327. https://doi.org/10.1016/0304-4076(86)90063-1
[9] Sadorsky, P. (2006) Modeling and Forecasting Petroleum Futures Volatility. Energy Economics, 28, 467-488. https://doi.org/10.1016/j.eneco.2006.04.005
[10] Laurent, S. and Neely, C.J. (2012) Econometric Modeling of Exchange Rate Volatility and Jumps. In: Bell, A.R., Brooks, C. and Prokopczuk, M., Eds., Handbook of Research Methods and Applications in Empirical Finance, Edward Elgar Publishing, Cheltenham, 373-427.
[11] Kulikova, M. and Taylor, D. (2013) Stochastic Volatility Models for Exchange Rates and Their Estimation Using Quasi-Maximum-Likelihood Methods: An Application to the South African Rand. Journal of Applied Statistics, 40, 495-507.
https://doi.org/10.1080/02664763.2012.740791
[12] Salisu, A.A. and Fasanya, I.O. (2013) Modelling Oil Price Volatility with Structural Breaks. Energy Policy, 52, 554-562. https://doi.org/10.1016/j.enpol.2012.10.003
[13] Vo, M.T. (2009) Regime-Switching Stochastic Volatility: Evidence from the Crude Oil Market. Energy Economics, 31, 779-788.
https://doi.org/10.1016/j.eneco.2009.05.001
[14] Heston, S.L. (1993) A Closed-Form Solution for Options with Stochastic Volatility with Applications to Bond and Currency Options. Review of Financial Studies, 6, 327-343. https://doi.org/10.1093/rfs/6.2.327
[15] Sircar, K. (1999) Stochastic Volatility, Smile and Asymptotics. Applied Mathematical Finance, 6, 107-145. https://doi.org/10.1080/135048699334573
[16] Cao, W., Liu, M. and Fan, Z. (2004) Ms-Stability of the Euler-Maruyama Method for Stochastic Differential Delay Equations. Applied Mathematics and Computation, 159, 127-135. https://doi.org/10.1016/j.amc.2003.10.015
[17] Broadie, M. and Kaya, O. (2006) Exact Simulation of Stochastic Volatility and Other Affine Jump Diffusion Processes. Operations Research, 54, 217-231.
https://doi.org/10.1287/opre.1050.0247
[18] Gerstner, T. and Kloeden, P. (2013) Convergence of Numerical Methods for Stochastic Differential Equations in Mathematical Finance. World Scientific, Singapore, 49-80. https://doi.org/10.1142/9789814436434_0002
[19] Rosa, M.L. (2016) Numerical Methods for Stochastic Differential Equations.
[20] Lux, T., Segnon, M. and Gupta, R. (2016) Forecasting Crude Oil Price Volatility and Value-at-Risk: Evidence from Historical and Recent Data. Energy Economics, 56, 117-133. https://doi.org/10.1016/j.eneco.2016.03.008
[21] Manera, M., Nicolini, M. and Vignati, I. (2016) Modelling Futures Price Volatility in Energy Markets: Is There a Role for Financial Speculation. Energy Economics, 53, 220-229. https://doi.org/10.1016/j.eneco.2014.07.001
[22] Dickey, D.A. and Fuller, W.A. (1979) Distribution of the Estimators for Autoregressive Time Series with a Unit Root. Journal of the American Statistical Society, 75, 427-431. https://doi.org/10.1080/01621459.1979.10482531
[23] Phillips, P.C.B. and Perron, P. (1988) Testing for a Unit Root in Time Series Regressions. Biometrica, 75, 335-346. https://doi.org/10.1093/biomet/75.2.335
[24] Parkinson, M. (1980) The Extreme Value Method for Estimating the Variance of the Rate of Return. Journal of Business, 53, 61-65. https://doi.org/10.1086/296071

Copyright © 2024 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.