Investigation of Relation between Solar Activity and Earthquakes with Deep Learning Method

Solar activity (SA) has been hypothesized to be a trigger of earthquakes, although it is not as intuitively associated as other potential triggers such as tidal stress, rainfall, and the building of artificial water reservoirs. Here, we investigate the relation between SA and global earthquake numbers (GEN) by using a deep learning method to test the hypothesis. We use the daily data of GEN and SA (1996/01/01-2019/12/31) to construct a temporal convolutional network (TCN). From the computational results, we confirm that the TCN captures the relation between SA and earthquakes with magnitudes from 4.0 to 4.9. We also find that the TCN achieves better fitting and prediction performance than our previous work.


Introduction
The 11-year solar cycle contributes to events such as sunspots, coronal mass ejections, and solar wind. The mechanism of the sun-earth magnetosphere connection in relation to earthquakes remains a mystery [1]. Several studies have proposed that solar activity (SA) might be linked to earthquakes [2] [3] [4]. Statistical methods are usually used to test this hypothesis. Reference [5] suggests a correlation between SA and large earthquakes worldwide, and [6] investigates the correlation between long-range clustering of global seismicity and SA. Sunspot number has also been considered as an SA variable for predicting earthquakes [7]. Meanwhile, some mechanisms have been proposed to explain the correlation between SA and earthquakes. For example, induced currents may increase fault stress through piezoelectricity [8], and eddy electric currents in faults may reduce the shear strength [9].
The previous studies mainly focused on investigating the significant correlation between SA and earthquakes using non-parametric statistical methods. However, parametric statistical models and machine learning models are also necessary for earthquake forecasting, although they are still far from applicable to this task. In our previous work [10], we attempted to predict global earthquake numbers (GEN) by using variables associated with SA as inputs. The results in [10] show that the GEN of earthquakes with magnitude 4.0-4.9 is the most predictable.
With the development of sensing technologies, including GPS and InSAR [11], a massive amount of data on SA has been accumulated. Furthermore, the solar-earth coupling can be characterized as a non-linear dynamical system. For these two reasons, we decided to construct deep learning (DL) models to predict GEN with SA as the input for earthquakes of magnitude 4.0-4.9. In particular, we considered daily time series of GEN and SA in sequential format. The recurrent neural network (RNN) and long short-term memory (LSTM) are two benchmark DL models for sequential data. However, feedback in the recurrent architecture can lead to higher computational complexity [12]. Recent studies [13] [14] indicate that certain convolutional neural network (CNN) architectures can reach state-of-the-art accuracy for sequential data. A CNN can preserve the causality of sequential data of any length without feedback connections.
By considering the proven effectiveness of CNNs for sequential data, we took all the observations in time-series format and implemented the temporal convolutional network (TCN) [15]. We constructed the TCN by using GEN data and SA data as input to predict GEN for earthquakes of magnitude 4.0-4.9.

Dataset
Daily data of GEN were downloaded from ComCat (https://earthquake.usgs.gov/earthquakes/search/). The data range from 01/01/1996 to 12/31/2019, covering the 23rd and 24th solar cycles, and are partly shown in Table 1. EQi denotes earthquakes with magnitude i.0 to i.9 for i = 3, 4, 5, 6, 7. Note that earthquakes with M ≥ 8 rarely occurred, so we combined them into one column: EQ89. The data contain the M = 7.2 earthquake (04/05/2010) that occurred in Estado de Baja California, Mexico, and the Touhoku earthquake of M = 9.0 (03/11/2011) that occurred in north-east Japan. Because large earthquakes always cause aftershocks, the GEN itself was also used as an input of the TCN.
(Figure 1) The daily data of SA were downloaded from OMNIWeb (https://omniweb.gsfc.nasa.gov/). The SA variables used in this research are listed in Table 2. Part of the SA data is illustrated in Figure 2. Missing values in the original SA data were filled by linear interpolation. (Table 3)
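The gap-filling step can be illustrated with a minimal sketch in plain Python. It assumes missing values are marked as None or NaN and that gaps at the start or end of a series are filled with the nearest valid value; neither detail is stated in the text, and the function name is hypothetical.

```python
import math


def fill_missing_linear(values):
    """Fill None/NaN entries by linear interpolation between valid neighbours.

    Assumption: leading and trailing gaps copy the nearest valid value,
    because a straight line needs a valid point on both sides.
    """
    def is_missing(v):
        return v is None or (isinstance(v, float) and math.isnan(v))

    filled = list(values)
    valid = [i for i, v in enumerate(filled) if not is_missing(v)]
    if not valid:
        raise ValueError("series has no valid observations")

    # Edge gaps: copy the nearest valid value.
    for i in range(valid[0]):
        filled[i] = filled[valid[0]]
    for i in range(valid[-1] + 1, len(filled)):
        filled[i] = filled[valid[-1]]

    # Interior gaps: straight line between the two bracketing valid points.
    for left, right in zip(valid, valid[1:]):
        span = right - left
        for i in range(left + 1, right):
            weight = (i - left) / span
            filled[i] = filled[left] + weight * (filled[right] - filled[left])
    return filled
```

Each SA variable would be passed through such a routine separately, so that every day in the study period has a value for every input channel.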

TCN Architecture
According to our previous works [10] ... In this way, we construct a non-linear model ... This research uses the TCN as g(·), whose architecture is shown in Figure 3.
The TCN is mainly composed of four convolutional blocks with 16, 32, 32, and 64 channels. In each block, a dilated convolution is performed on a sequence input u ∈ ℝ^n with a filter.
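To make the dilated convolution concrete, the sketch below applies a causal filter to a short sequence in plain Python. It assumes the standard TCN form y[s] = sum_i f[i] * u[s - d*i] with zero padding on the left. The function names, the single convolution per block, and the dilation schedule 1, 2, 4, 8 are illustrative assumptions, not details taken from this work.

```python
def dilated_causal_conv(u, f, dilation):
    """Causal dilated convolution of sequence u with filter f.

    y[s] = sum_i f[i] * u[s - dilation * i]; positions before the start of
    the sequence are treated as zero, so y[s] never sees future values.
    """
    out = []
    for s in range(len(u)):
        acc = 0.0
        for i, weight in enumerate(f):
            idx = s - dilation * i
            if idx >= 0:
                acc += weight * u[idx]
        out.append(acc)
    return out


def receptive_field(kernel_size, dilations):
    """Number of past time steps visible to one output of a layer stack.

    Assumption: one dilated convolution per block (hypothetical layout).
    """
    return 1 + sum((kernel_size - 1) * d for d in dilations)
```

With a two-tap filter and the assumed dilations 1, 2, 4, 8, `receptive_field(2, [1, 2, 4, 8])` gives 16 time steps, which shows how a few stacked blocks can cover a lag window of about two weeks of daily data.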
Because the maximum time lag is relatively short, convolutional kernels of size 1 × 2 are used in each block. To obtain robust estimates, the Huber loss function is used.
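For reference, a minimal sketch of the standard Huber loss is given below. The threshold delta = 1.0 is an assumption, since the value used in this work is not stated, and the function names are hypothetical.

```python
def huber(residual, delta=1.0):
    """Standard Huber loss for a single residual.

    Quadratic for |residual| <= delta, linear beyond it, so large errors
    (for example, days with aftershock bursts) are penalised less than
    under squared error. delta=1.0 is an assumed default.
    """
    a = abs(residual)
    if a <= delta:
        return 0.5 * residual * residual
    return delta * (a - 0.5 * delta)


def mean_huber(y_true, y_pred, delta=1.0):
    """Average Huber loss over paired observations and predictions."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    total = sum(huber(t - p, delta) for t, p in zip(y_true, y_pred))
    return total / len(y_true)
```

The linear tail is what makes the estimate robust: a residual of 3 contributes 2.5 rather than the 4.5 it would contribute under half squared error.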

Prediction Results
The whole dataset was divided into two parts. The SA and GEN data in the 23rd solar cycle (01/01/1996-12/31/2007) were used as the training data. The SA and GEN data in the 24th solar cycle (01/01/2008-12/31/2019) were used as the test data to verify the trained TCN. Pearson's correlation coefficient R was used to evaluate the fitting and prediction performance of the TCN.
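The split and the evaluation metric can be sketched as follows. The boundary date comes from the text; the record layout (a list of (day, value) pairs) and the function names are assumptions made for illustration.

```python
import math
from datetime import date

TEST_START = date(2008, 1, 1)  # first day of the test period given in the text


def split_train_test(records, test_start=TEST_START):
    """Split (day, value) pairs into training and test parts by date."""
    train = [(d, v) for d, v in records if d < test_start]
    test = [(d, v) for d, v in records if d >= test_start]
    return train, test


def pearson_r(x, y):
    """Pearson's correlation coefficient between two equal-length sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return sxy / math.sqrt(sxx * syy)
```

Applying `pearson_r` to the observed and predicted series on the training part gives a fitting score, and on the test part a prediction score. Splitting by date rather than at random keeps the test period strictly after the training period.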

TCN without/with SA Variables
First, we constructed a TCN without SA variables. Figure 4 illustrates the training and test losses versus epoch number. The curves indicate that 100 epochs are enough to ensure convergence of the TCN training.
We also constructed a TCN with all of the SA variables. Figure 5 illustrates the training and test losses plotted against epoch number. The curves again indicate that 100 epochs are enough to ensure convergence of the TCN training. Let R_f and R_p be the correlations between the real observations of EQ4 and the output of the TCN obtained from the training data and test data, respectively. Table 4 lists the fitting and prediction performance of TCNs without SA variables for 1- to 3-day-ahead predictions, and Table 5 lists that of TCNs with SA variables. As is reasonable, R_f is larger than R_p for all days ahead in Table 4 and Table 5. Thus, the "decrease" in Table 4 and Table 5 means the difference between R_f and R_p.
The two tables indicate that the TCNs achieve better fitting and prediction performance than the support vector regression in our previous work [10]. Comparing Table 4 and Table 5 shows that the SA variables improve both the fitting and prediction performance of TCNs. The gap between R_f and R_p is small in the 1-day-ahead prediction, which suggests a balance between the fitting and prediction performance of TCNs with/without SA variables. However, R_p decreases significantly for the 2- and 3-day-ahead predictions.
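One way to set up the 1- to 3-day-ahead tasks is to pair a window of past observations with the value a fixed number of days later. The sketch below shows this pairing for a single series; the window length `lag` and the function name are hypothetical, since the text does not give the input length.

```python
def make_windows(series, lag, horizon):
    """Build (input window, target) pairs for h-day-ahead prediction.

    Each input holds the `lag` most recent values up to day t, and the
    target is the value on day t + horizon. `lag` is an assumed parameter.
    """
    if lag < 1 or horizon < 1:
        raise ValueError("lag and horizon must be positive")
    inputs, targets = [], []
    for t in range(lag - 1, len(series) - horizon):
        inputs.append(list(series[t - lag + 1 : t + 1]))
        targets.append(series[t + horizon])
    return inputs, targets
```

For example, `make_windows([0, 1, 2, 3, 4, 5], lag=3, horizon=2)` pairs `[0, 1, 2]` with 4 and `[1, 2, 3]` with 5. A larger horizon leaves a wider gap between the last observed day and the target, which is consistent with the drop in prediction performance for 2- and 3-day-ahead predictions.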

Impact of SA Variables on Prediction of Earthquakes
International Journal of Geosciences
To evaluate how the SA variables improve the prediction of earthquakes, we adopted the following forward stepwise procedure: 1) ...
2) Add to the set of selected inputs the variable from the set of remaining candidates that gives the biggest improvement in R_a.
3) Repeat (2) until the candidate set becomes empty and a total of 15 TCNs are obtained.
Table 6 shows the sequentially selected variables according to R_a for the 1-day-ahead prediction. We can see that the plasma speed V improves R_a by almost 0.02 based on EQ3 and EQ4 in step 3. The IMF Magnitude improves R_a by almost 0.02 at the last step, jointly with other variables. These results suggest that all the SA variables should be used as the inputs of TCNs.
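The greedy selection loop can be sketched as below. The `score` callback stands in for training a TCN on a given input set and returning R_a; here it is a hypothetical placeholder, and the gains in the toy example are invented numbers for illustration only, not results from this study.

```python
def forward_stepwise(base_inputs, candidates, score):
    """Greedy forward selection of input variables.

    At every step, the candidate whose addition yields the highest score
    is moved into the selected set. Returns the order of selection with
    the score reached after each addition.
    """
    selected = list(base_inputs)
    remaining = list(candidates)
    history = []
    while remaining:
        best = max(remaining, key=lambda name: score(selected + [name]))
        selected.append(best)
        remaining.remove(best)
        history.append((best, score(selected)))
    return history


# Toy stand-in for "train a TCN and compute R_a": additive, made-up gains.
toy_gains = {"V": 0.02, "IMF_magnitude": 0.015, "other": 0.001}


def toy_score(inputs):
    return 0.5 + sum(toy_gains.get(name, 0.0) for name in inputs)


toy_history = forward_stepwise(["EQ3", "EQ4"], list(toy_gains), toy_score)
```

In the real procedure each call to `score` would require training a network, so the cost grows with the square of the number of candidates. With an additive toy score the order simply follows the size of each gain, whereas with a trained model the gain of a variable can depend on which variables are already selected.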

Conclusions
In this research, we investigated the relation between SA and GEN. We constructed a deep learning model, the TCN, to predict EQ4 from 1 to 3 days ahead.
The numerical results show that: 1) Compared with the support vector regression (SVR) in our previous work [10], the TCN significantly enhances the fitting and prediction performance. This result confirms that there exists a strong nonlinear relation between GEN and SA.
2) Because the fitting performance R_f is similar to the prediction performance R_p, we suppose that the TCN has potential for the 1-day-ahead prediction of EQ4.
3) Past EQ4 is the crucial input of the TCN. Thus, the TCN is essentially a nonlinear autoregressive model. However, SA variables can still improve its fitting and prediction performance.
From the aforementioned results, we suppose that SA has the potential to affect GEN.
TCNs in this research are still far from being predictive. Table 6 shows that the TCN continues to improve until all the SA variables are included. This result suggests that the prediction performance can be further improved by considering variables beyond the candidates selected in this research. Over the decades, much novel geophysical and space data has become available, thanks to improvements in sensing and measurement technologies. Although earthquakes remain unpredictable for now, we will continue to investigate relations among earthquakes, the earth's environment, and SA on the basis of various statistical methods and machine/deep learning models.