Distributed Estimator of Market Beta under Extreme Conditions

Market beta is a measure of the volatility or systematic risk of a security or portfolio compared to the market as a whole. This paper considers the distributed estimation of market beta in the case of massive data, and obtains the consistency and asymptotic normality of the estimator. Further, simulations show the finite sample properties of this estimator.


Introduction
Distributed statistical inference, as a hot topic and an effective method, has been widely discussed in the past ten years, and a lot of research results have been accumulated.Its representative work are: In theory, Chen and Zhou (Chen and Zhou) [1] put forward the distributed Hill estimator and prove its Oracle property; Volgushev et al. (2017) [2] propose distributed inference for quantile regression processes, and propose a method to calculate the efficiency of this inference, which requires almost no additional computational cost.In application, Mohammed et al. (2020)  [3] propose a technique to divide a Deep Neural Networks (DNN) in multiple partitions, which reduces the total latency for DNN inference; Smith and Hollinger (2018) [4] propose a distributed inference-based multi-robot exploration technique that uses the observed map structure to infer unobserved map features, resulting in a reduction in the cumulative exploration path length in the trial; Ye (2017) [5] started to study the stability of the beta coefficient of the Chinese stock market and found the best beta estimation time.Mitra (2019) [6]  Market beta, also known as systematic risk or equity beta, is a measure of a stock's sensitivity to overall market movements.The development of market beta can be traced back to the early 20th century when economists and financial analysts began to understand the importance of systematic risk in determining asset returns.One of the pioneering works in this area is the paper of Markowitz (1952) [7] on portfolio selection, which laid the foundation for modern portfolio theory, and emphasized the importance of diversification.Black and Scholes (1973) [8] introduced the concept of beta to measure the systematic risk of individual securities or stock.Banks with a higher beta are expected to suffer from larger capital losses in the event of an extremely adverse shock in the financial system.
Estimating market beta involves analyzing historical data on a stock's returns and its correlation with the market returns.The usually used econometric model is that the stock's returns are regressed against the market returns over a specific time period, which has been used to evaluate beta of financial returns on commodities, currencies (Atanasov and Nitschka (2014) [9]; Lettau et al. ( 2013)) [10], stocks (Post and Versijp (2004)) [11], and active trading strategies (Mitchell and Pulvino 2001) [12].However, in extreme cases, conditional regression is based on a small number of tail observations, which may produce a relatively large variance of the estimator, and the data of the financial market are mostly heavy tail, which may further increase error.To avoid these situations, Oordt and Zhou (2017) [13] proposed a new method to estimate market β .Let X and Y be continuous random variables with distribution functions X F and Y F , respectively.Assume that 1 X F − and 1 Y F − be heavy-tail with tail index x α and y α , respectively.This means that ( ) ( ) ( ) ( ) where ( ) x l u and ( ) y l u are slowly varying functions as u → ∞ .Let ( ) ( ) with small p , relation of X and Y restricted under extreme X is given by ( ) , for , ε is the error term that is assumed to be independent of the X under the condition ( ) To get estimator under the EVT method, we consider the following tail dependence measure from multivariate EVT (see, e.g, Hult and Lindskog (2002)) [14], where ( ) where ( ) ( ) Suppose (1.4) hold, under the linear model in (1.2), with 2 the following conclusion is given in Oordt and Zhou (2017) [13]: Naturally, consider independent and identically distributed (i.i.d.) observations ( ) ( ) ( ) , , , , , , with the i.i.d.unobserved error terms 1 , , n ε ε  , we mimic the limit procedure 0 p → by considering only the lowest k observations in the tail region, such that ( ) Oordt and Zhou (2017)  [13] gives an estimator of β as And to prove asymptotic normality, the second-order condition for Y is given by: where ( ) ( ) is an eventually positive or negative function, ( ) ρ′ ≤ .Then, Drees and Huang (1998) [16] define For this dependence structure, we assume that, for some positive function ( ) , R x y , with a speed of convergence as follows: there exists a 0 θ > for which, . And we can simply get ( ) Under condition (1.4), (1.7) and (1.8) hold, suppose ( ) Oordt and Zhou (2017) [13] prove the asymptotic normality of β .This estimation method can be used not only for the assessment of investment risks, but also for banking (Oordt and Zhou (2018)) [17], insurance and other fields.However, due to confidentiality, banks may not share their operating losses with each other, and insurance companies cannot share any observation results with the outside world in order to protect the privacy of customers.
Therefore, banks and insurance companies can only make statistics based on their own data and share the results, and cannot re-identify individual data from the shared information.Distributed statistical inference is a good way to deal with these situations, it can analyze data stored in multiple machines, and it usually requires a divide-and-conquer algorithm that estimates the required parameters on each machine, transmits the results to a central machine that combines all the results, usually by simple averaging, to arrive at a computationally feasible estimator.
The objective of this paper is to apply divide-and-conquer idea to estimating market β .Considering independent and identically distributed (i.i.d.) observa- tions ( ) ( ) ( ) , , , , , , are distributed across k different machines, each machine has m observations, n mk = , and we assume as n → ∞ , ( We follow a divide-and-conquer algorithm, first estimating , ˆn j β in each machine, and then taking the average of k machines as the distributed estimator ˆD β for β , , , , m j j j X X X  in the j-th machine, we get the order statistic ( ) ( ) ( ) , and only the first d are selected to estimate β , where ( ) where, the tail index is estimated using the Hill estimator given in Hill (1975) [18]: The estimator of dependence measure is provided by multivariate EVT, see Embrechts et al. (2000) [19], that is where ( ) , we require some additional conditions to ensure the asymptotic normality of the Hill estimator: lim , .log Suppose there would exist a sequence 0 n p → as n → ∞ such that, for suf- ficient large n, we have n p p < , which implies that the linear model in (1.2) ap- plies for sufficiently large n.
The remainder of this paper is organized as follows.Section 2 provides the main results; finite behaviors of ˆD β are considered in Section 3; all proofs are deferred to Section 4.

Main Innovations and Results
The innovations of this paper are: • Under extreme market conditions, with less data and heavy tails, a new beta estimator is proposed by using the distributed idea.• In the numerical simulation, the profile of data pollution is considered, and the expected effect is achieved, and the data is more inclusive.

Simulation
We conduct two sets of simulations to demonstrate the finite sample performance of the distributed beta estimator ˆD β .For each simulation, we consider three linear models, that is, 1.5, 1, 0.5 β = . We generate samples with samples size n = 10,000.Based on r = 1000 repetitions, we obtain the finite sample squared bias, variance and Mean Squared Error (MSE) for our estimator.

Compare for Different Level of d
In the first set of simulations, we vary the level of d in the distributed beta estimator to verify the theoretical results on the oracle property.The oracle sample 1 , , n X X  contains n = 10,000 observations stored in k machines with m observations each.We fix k = 20 and m = 500, compare the finite sample performance of the distributed beta estimator with that of the oracle beta estimator for different values of d.Since the Student's t-distribution is known to be heavy-tailed with the tail index equal to the degrees of freedom, we perform simulations of X and ε based on random draws from a Student's t-distribution with Four degree of freedom.According to Lemma 1.3.1 in Embrechts et al.
(1997) [20], the sum of two heavy-tailed random variables is also a heavy-tailed random variable, and the tail index of the sum is controlled by smaller tail index.
Then, the observations for Y are constructed by aggregating the simulated X and ε, which could guarantees Y is also heavy-tailed and The first column of Figure 1 compares the Mean Square Error of the distributed beta estimator ˆD β and the oracle beta estimator β .Firstly, Mean Square Error gradually decreases with the increase of β.Theoretically, τ increases with the increase of β, while Mean Square Error decreases with the increase of Figure 1.Finite sample performance for the distributed beta estimator and the oracle beta estimator for different levels of d.The blue report the simulation results for distributed EVT approach; the yellow lines report those for the EVT approach.
S. Y. Zhu τ.Therefore, the simulation results are in agreement with the theoretical results.
Secondly, the second and third columns of Figure 1 show decomposition of the MSE into squared bias and variance, we observe a trade off between the bias and varience for the both estimators: as d increase, the bias increase while the variance decreases, and when the number of observations is small, the variance of the oracle beta estimator is smaller than that of the distributed beta estimator, and as the number of observations increases, the variance becomes equal, which is in line with the result of Theorem 2.2.

Data Is Contaminated
In the second set of simulations, we want to know whether distributed estimators have good properties when the data is contaminated.We simulate three cases of X being contaminated, ε being contaminated and both X and ε being contaminated respectively.The total number of observations does not change, that is, n = 10,000 is divided into k = 20 machines with m = 500 observations in each machine.
Figure 2 shows the Mean Square Error, square deviation and variance of the two estimators when X is contaminated.We also model 10,000 observations of ε

S. Y. Zhu
from a Student's t-distribution with 4 degrees of freedom, observations of X are drawn from a standard normal distribution with probability 0.1 and a Student's t-distribution with 4 degrees of freedom with probability 0.9, this means that 1000 out of 10,000 observations are contaminated.We then sort the observations in each machine and use (1.12) to get , ˆn j β .
The third column in Figure 2 shows the variance of the two estimators when d takes different values, which is almost the same as the result when the observations are not contaminated.When the number of observations is small, the variance of the distributed estimator is larger than that of the Oracle estimator, and with the increase of d, the variance is close to zero.Observe the first column, the Mean Square Error is less than 0.05, the estimation effect is good.
Figure 3 shows the Mean Square Error, square deviation and variance of the two estimators when ε is contaminated.We also model 10,000 observations of X from a Student's t-distribution with 4 degrees of freedom, observations of ε are drawn from a standard normal distribution with probability 0.1 and a Student's t-distribution with 4 degrees of freedom with probability 0.9, this means that 1000 out of 10,000 observations are contaminated.We then sort the observations in each machine and use (1.12) to get S. Y. Zhu consistent with Figure 1, indicating that the selection of ε does not affect the properties of the estimator, which is consistent with the theory that the random error can be thin-tailed.
Figure 4 shows the Mean Square Error, square deviation and variance of the two estimators when both ε and X are contaminated.Observations of ε and X are drawn from a standard normal distribution with probability 0.1 and a Student's t-distribution with 4 degrees of freedom with probability 0.9, this means that 1000 out of 10,000 observations are contaminated.We then sort the observations in each machine and use (1.12) to get , ˆn j β .Similar to Figure 1, this is consistent with the theoretical results, indicating that distributed estimators can be treated similarly when the data is contaminated.

Proof
In order to prove the main results, we need the following two lemmas.
, then as n → ∞ , we have where x l is a slowly varying function, since 2 2 1 > to be specified later.It's clear that for any 0 1 Proof.Let ( ) From the heavy-tailed property of the distribution function of Y in (1), we obtain that ( ) ( ) Then we prove (4.1) first: Notice that ( )

and Physics
The penultimate step is based on (4.4).As n → ∞ , since 0 Next, we prove (4.2): for some 0 D > , we write ( ) ( ) The last step uses the condition that ( ) ( ) ( ) . By (4.4), the denominator converges to p , which is positive and finite.Same as (4.1), we have 0.
According to Corollary 2.2.2 in de Haan and Ferrira (2006) [21], as n → ∞ , ( ) , then, as ( ) Hence, for any 0 δ > , as n → ∞ , we have A similar relation for Y holds.Therefore, in order to prove that ,1 1 P j I → , we will prove a more general result that ( ) since the observed values of different machines are independent and identically distributed, we have ( ) , ., , The penultimate step using the convergence of ( ) Hence, what remains to be proved is that , , lim 1 Oordt and Zhou (2017)  [13] with p d m holds uniformly for all ( )  .We further simplify the denominator as follows: the last step uses the second order condition of X, that is (1.4).
From (1.5), as n → ∞ , we get that x holds uniformly for holds uniformly for ( )  , as n → ∞ .Hence, we proved that , together with the consistency of Next, we deal with . Note that the observations of different machines are independently and identically distributed, similar to the proof in Oordt and Zhou (2017) [13], if lim sup 0 , then the consistency of ˆj α leads to ,2 1 , to prove that as n → ∞ , there is 2 1 Theorem 3.2.5 in de Haan and Ferrira (2006) [21] guarantees the asymptotic normality of ˆj α under conditions (1.3) and (1.13): as n → ∞ , ( ) that is, ( ) therefore, it only remains to prove that ( ) .
The last step exploits the properties of slowly varying functions, then we have For ,4 j I , same as ,3 j I , we know ( ) ( ) Finally, according to (1.5), From the above analysis, , , , , and ( ) ( )  the second step uses the Delta method, where d m Next we deal . According to Theorem 3.2.5 in de Haan and Ferrira (2006) [21], we know that Gaussian process can also control the convergence of the tail exponents, i.e., ( ) ( ) ( ) x y is the same zero-mean Gaussian process as above, as n → ∞ , Use Delta method, let ( ) , 0 1, 0 log . 1 , then, ( ) ( ) ( ) . According Theorem 4 in Chen et al. (2021) [23], we know that Finally, we deal with ( ) use Delta method, let ( ) Based on the expression for ( ) , l x y , we get 0, And j S is independent and identical distributed on different machines, use the Central Limit Theorem, ( ) where, ( ) Therefore, what remains to be proved is the following deterministic relation According to (1.5), we know that Next, from (1.8) and (1.9), we have ( )

S. Y. Zhu Journal of Applied Mathematics and Physics
Notice that Lemma 1 in Oordt and Zhou (2017) [13] can be written as , then, for sufficiently large n, . Thus, (4.11) is equal to i.e., as n → ∞ , we have 0.
Let's just prove that the convergence rate is kd .By referring to the set For C 1 , due to X and ε independent, we have that ( ) ( ) the limit relation (4.2) implies that lim 1 0 gether with (4.3), we have ( ) uses a smooth linear transfer function to measure the amplitude and direction of market movement, and the proposed classification can better S. Y. Zhu DOI: 10.4236/jamp.2023.11112323677 Journal of Applied Mathematics and Physics capture the asymmetric behavior of beta.

Figure 2 .
Figure 2. X is contaminated, finite sample performance for the distributed beta estimator and the oracle beta estimator for different levels of p.The blue lines represent the simulation results of the distributed beta estimator, and the red lines represent the corresponding Oracle results.
Figure 3. ε is contaminated, finite sample performance for the distributed beta estimator and the oracle beta estimator for different levels of p.The blue lines represent the simulation results of the distributed beta estimator, and the red lines represent the corresponding oracle results.

Figure 4 .
Figure 4.Both ε and X are contaminated, finite sample performance for the distributed beta estimator and the oracle beta estimator for different levels of p.The blue lines represent the simulation results of the distributed beta estimator, and the red lines represent the corresponding Oracle results.
of κ is feasible.And we have that first prove that as n → ∞ , is a homogeneous function of the first degree.According to Lemma 2 in Oordt and Zhou (2017)[13], for 0x > , we have ( ) ( ) 