Properties of Time-Varying Causality Tests in the Presence of Multivariate Stochastic Volatility

This paper compares the statistical properties of time-varying causality tests when errors of variables have multivariate stochastic volatility (SV). The time-varying causality tests in this paper are based on a logistic smooth transition autoregressive model. The compared time-varying causality tests include asymptotic tests, heteroskedasticity-robust tests, and tests using wild bootstrap. Our simulation results show that asymptotic tests and heteroskedasticity-robust counterparts have size distortions under multivariate SV, whereas tests using wild bootstrap have better size properties regardless of type of error. In particular, the time-varying causality test with first-order Taylor approximation using wild bootstrap has better statistical properties.


Introduction
Granger causality is one of most representative methods to analyze causality between economic variables.It is based on linear vector autoregressive (VAR) models and investigates whether past information is effective for prediction.Although Granger causality is used for various studies, it can be applied to examine only stable linear relationships in the long run.The relationship between economic variables is not necessarily stable in the long run and frequently has time-varying properties.This implies that a causality relationship can also be time-varying, and hence we should take into account the time-varying properties when analyzing a causality relationship.
One method to introduce time-varying properties to Granger causality is through the use of a logistic smooth transition (LST) function.By using an LST function with time as the transition variable, we can test for both smooth and abrupt causalities.When a causality has such nonlinearity, the usual Granger causality tests based on a linear VAR model have low power and tend to give the misleading result of having no causalities in the system.[1] [2] and [3] proposed nonlinear causality tests.Their analyses also showed significant nonlinear causality.While time-varying causality is significant for the precise analysis of variables, heteroskedastic variances influence the tests for causality and nonlinearity such as timevarying properties.For example, [4] provided Monte Carlo evidence that causality tests have size distortions under heteroskedastic variances.In addition, [5] and [6] showed that heteroskedastic variances lead to spurious nonlinearity.Several economic variables investigated using Granger causality have heteroskedastic variances such as stochastic volatility (SV) (e.g., [7] and [8]).Therefore, if we do not deal appropriately with heteroskedastic variances in the tests for causality, we would not be able to obtain reliable results when examining for time-varying causality.However, previous studies have not clarified the influences of heteroskedastic variances on time-varying causality tests.
This paper investigates the statistical properties of time-varying causality tests when the disturbance terms have SV.The investigated tests include asymptotic tests based on first-order and third-order Taylor approximation and their counterparts with the heteroskedasticity-consistent covariance matrix estimators (HCCME) as introduced by [9].As pointed out by [10], the order of Taylor approximation affects the performance of linearity tests.We reveal the impact of the order of Taylor approximation on timevarying causality tests in the presence of SV.We also examine the time-varying causality tests using wild bootstrap.Wild bootstrap was proposed by [11] and replicates a sampling that does not depend on the form of heteroskedastic variances.[12] and [13] examine the properties of tests using wild bootstrap.We show which tests perform well even under SV by analyzing the size and power of the tests.
Our simulation results provide evidence that asymptotic time-varying causality tests and their counterparts with HCCME over-reject the null hypothesis of no causality in the presence of SV.This implies that their tests tend to yield misleading and unreliable results.In particular, their tests based on third-order Taylor approximation have larger distortions than those based on first-order Taylor approximation.In contrast, we find that time-varying causality tests using wild bootstrap have reasonable empirical sizes and sufficient power.The results of this paper would enable appropriate and reliable time-varying causality tests.
The rest of this paper is organized as follows.Section 2 presents time-varying causality tests.Section 3 provides the size and power properties of tests.Finally, Section 4 concludes the paper.

Time-Varying Causality Tests
We consider the following bivariate vector autoregressive system to test for time-varying causality relationship.

(
) ( ) , , , where γ is a parameter determining the function's smoothness, t is a transition variable, and c is the point where a regime changes from one to another.We assume that 0 γ > ,  The null and alternative hypotheses to test for time-varying causality in the system are If 0 γ = , Equation (1) has no causality from t x to t y .However, the test is not simple and easy because the null hypothesis has an identification problem about 0 β and 1 β .They are identified only under the alternative hypothesis with 0 γ > .The identifi- cation problem was considered by [14] and [15].To conduct the test in the presence of the identification problem, [16] proposed a Taylor series approximation.We use first-order and third-order Taylor series approximation around 0 γ = because the performance of the tests depends on the order of Taylor series approximation (e.g., [10]).
The regression models for (1) using the first-order and third-order Taylor series approximation are given by First-order : , where t e is an error term including a remainder term of Taylor series approximation., , , ,  c c c c c c ′  =  c , ( 5) and ( 6) can be rewritten respectively as First-order : , Third-order : , where ( ) . Testing for time-varying causality is expressed as First-order : : Third-order : : 0, : 0.
The Wald statistics to test for time-varying causality are derived as where ( ) , b and ĉ are estimates of b and c , and 2 σ is the estimate of the residual variance in each regression. 1 R and 3 R are matrixes that satisfy Under the null hypothesis of no time-varying causality, ( 11) and ( 12) follow F distributions with degrees of freedom ( ) T − , respectively.When we use HCCME for statistics (11) and (12), they are given by ˆFirst-order : HC1 , where ˆt e represents the residual in each regression.Statistics ( 13) and ( 14) using HCCME asymptotically have the same distributions as ( 11) and ( 12).Wild bootstrap is also used for regression models with heteroskedastic variances to obtain reliable results.The method can simply resample heteroskedastic variances like SV.This paper employs the recursive-design wild bootstrap.The testing procedure is as follows.
Step 2. Estimate the system using the restricted model with 0 = b in ( 7) and 0 = c in (8) and obtain the estimate of a and residuals denoted as ˆrt e .
Step 3. Obtain the estimates 02 α and 12 α and the residual 2 ˆt u , where 2 ˆt u is the residual of (2).
Step 6. Repeat the bootstrap iterations M for steps 4 and 5.We obtain M statistics WB1 and WB3.
Step 7. Compute the bootstrap p-values as follows: ( ) I ⋅ is an indicator function such that ( ) ⋅ is true and 0 otherwise.The null hypothesis is rejected if the p-value is smaller than a significant level.

Size and Power Properties
This section conducts Monte Carlo simulations to compare the size and power properties of causality tests under multivariate SV.The nominal size of the tests is 0.05, and we consider sample sizes 200 T = and 400.Causality tests using wild bootstrap have 1000 bootstrap replications.The number of replications of simulations for all the tests is 10,000.We generate data with 100 T + and use the data with sample size T. The initial 100 samples are discarded to avoid the effect of initial conditions.We denote the tests compared in this section as F1, F3, HC1, HC3, WB1, and WB3; we also denote the linear Granger causality test as F0, its test with HCCME as HC0, and its test using wild bootstrap as WB0.
We first investigate the size properties based on data generating process (DGP) given as where 1t u and 2t u are error terms.We set 1t u and 2t u with normal error to the following.
The correlation parameter ρ between 1t u and 2t u is set to 0 ρ = and 0.5 ρ = .DGP do not have any causality from t x to t y in the system from (19) to (21).indicate that for all the tests, the correlation parameter ρ does not have any influence on size.Linear Granger causality test F0 and its time-varying causality version F1 perform well regardless of the value of α .Their rejection frequencies are near the nominal size 0.05.However, we find that F3 has small over-rejections.For example, the size of F3 for 0 ρ = , .The high persistence of t y affects the empirical size of F3.The property that F3 has additional regression parameters may lead to size distortions.We find that HC0, HC1, and HC3 perform worse than F0, F1, and F3.The rejection frequencies are larger than 0.05.In particular, the rejection frequency of HC3 is more than 0.1.These results imply that causality tests with HCCME are not useful under normal error.In contrast, the empirical sizes of WB0, WB1, and WB3 using wild bootstrap are close to the nominal size 0.05.While WB3 has small under-rejections, small size distortions are acceptable.Causality tests with wild bootstrap have reasonable empirical sizes.We next examine the empirical sizes of tests under multivariate SV.The property of SV is that volatility is influenced by an error and changes stochastically.Multivariate SV allows for the correlation between errors of volatilities.1t u and 2t u in (21) with SV are generated by Here, 1t h and 2t h are given by where The regression parameter α in ( 19) is set to 0.   In addition, we observe that the correlation between errors affects the size performance of F0, F1, and F3.When compared with the empirical sizes of F0, F1, and F3 for  .This is different from the results in Table 1(a).Thus, the correlation between errors increases the over-rejections when the errors have multivariate SV.The results imply that multivariate SV causes size distortions in time-varying causality tests.Possibly, they provide misleading results that indicate a time-varying causality relationship.However, note that WB0, WB1, and WB3 perform better even under SV.The empirical sizes of WB0, WB1, and WB3 for 1 .From Table 1(c), while HC0, HC1, and HC3 have larger rejection frequencies than F0, F1, and F3, their size distortions are smaller than those for symmetric multivariate SV.WB0, WB1, and WB3 show reasonable size performances, as in Table 1(a) and Table 1

(b). A comparison of Table 1(b) and Table 1(c)
shows that asymmetry of SV does not have a large impact on time-varying causality tests.From the results of empirical sizes, time-varying causality tests using HCCME have a negative influence on empirical sizes.Furthermore, time-varying causality tests F3 and HC3 based on third-order Taylor approximation is inferior to tests F1 and HC1 based on first-order Taylor approximation.Although WB3 tends to have slight under-rejection, WB0, WB1, and WB3 are superior to other tests.In particular, WB0 and WB1 perform best irrespective of type of error.
We next investigate the power properties based on DGP, given as ( ) ( ) 1 2 1 0.5 , where c is the point at which a regime changes from one to another.We set c to 2 c T = .We compare the cases of ( ) 0.01, 0. All the tests find it more difficult to detect changes only in a constant 0 β than only in AR parameter 1 β or in both a constant and AR parameter.In addition, the power of most of the tests increases when θ is large, because a large θ provides a sharp change in the smooth transition function.F0, HC0, and WB0 have smaller power compared to other tests regardless of the value of θ , 0 β , and 1 β .This shows that it is difficult for linear Granger causality tests to detect time-varying causality.F1, HC1, HC3, and WB1 apparently outperform other tests.However, HC1 and HC3 have over-rejections, as shown in Table 1(a).The better power performance of HC1 and HC3 is attributed to over-rejection of the null hypothesis; moreover, they tend to lead to spurious time-varying causality.Although F3 has size distortions under the null hypothesis with normal error, F3 has similar or lower power compared to F1. WB3 has a small under-rejection of the null hypothesis and lower power.These results indicate that time-varying causality tests with third-order Taylor approximation are not advantageous.Accordingly, F1 and WB1 obtain reasonable empirical sizes and better power performance when a variable has time-varying causality. .HC0 and WB0 are naturally inferior to other tests.F0 performs well, unlike the results of Table 2(a).This performance is attributed to the size distortions under SV, as in Table 1(b).The same is true of the results of F1, F3, HC1, and HC3.They outperform other tests, but over-reject the null hypothesis.When DGP have stochastic volatility, they are likely to reject the null hypothesis of no time-varying causality and yield misleading results.It is important to have reasonable and acceptable empirical sizes in order to avoid misleading results.Although the power of WB1 and WB3 are lower than that of F1, F3, HC1, and HC3, they have reasonable and acceptable empirical sizes and lead to reliable results.We see that WB1 has higher power than WB3.The simulation results provide clear evidence that WB1 is more reliable from the perspective of controlling the size and obtaining sufficient power to find time-varying causality regardless of the presence of SV.
= = = in(6).Since we cannot test for 0 γ = directly, we instead test for 0 1 0 = = = .Denoting a , b , and c as , 0.8 .All ij κ are set to zero in (24) and (25).Mul- tivariate SV clearly leads to over-rejection.For example, the empirical sizes of F1, F3, HC1, and HC3 for 0.2 α = , 0 ρ = , 1 2 0.8 φ φ = = , and 200 T = are respectively 0.059, 0.090, 0.078, and 0.153 in , and 0.053, respectively.Asymmetric multivariate SV also results in size distortions for causality tests.We set 11 a) reports the power performance of tests under normal errors 1t u and 2t u with 0 ρ = in (21).All the tests have a larger power when ( This paper investigated the statistical properties of time-varying causality tests when the errors of variables have multivariate SV.It is important to clarify the statistical properties of time-varying causality tests under SV, because economic variables often have SV and the relationship between them is time-varying.The tests we compared include the standard linear Granger causality and the time-varying causality tests, their tests with HCCME, and their tests using wild bootstrap.Simulation results indicate that time-varying causality tests and their counterparts with HCCME have size distortions

Table 1 (
a) presents the size properties of tests for normal error.We investigate two cases of 0.2 α = and 0.8 α =.The results in Table1(a)

Table 1 .
(a) Size properties under normal error; (b) Size properties under symmetric multivariate stochastic volatility; (c) Size properties under asymmetric multivariate stochastic volatility.

Table 2 .
(a) Power properties under normal error; (b) Power properties under multivariate stochastic volatility.
under highly persistent SV.Standard linear Granger causality tests perform relatively well under SV but has low power under time-varying causality.In contrast, time-varying causality tests using wild bootstrap have better size properties regardless of type of error.In particular, the time-varying causality test with first-order Taylor approximation and wild bootstrap has better statistical properties.These results indicate that the time-varying causality test with first-order Taylor approximation and wild bootstrap is