Estimating Functional Brain Network with Low-Rank Structure via Matrix Factorization for MCI/ASD Identification ()
1. Introduction
Functional brain network (FBN), calculated by resting-state functional magnetic resonance imaging (rs-fMRI), can provide informative pattern about the organization of the brain [1], and mine sensitive biomarkers for neurological or psychological diseases such as Autism spectrum disorder (ASD) [2] [3], Parkinson’s disease [4], Alzheimer’s disease (AD) [5] [6] and its early stage, i.e., mild cognitive impairment (MCI) [7] [8] [9]. All of the disease identification tasks rely on the quality of the FBNs, and therefore it is indispensable to estimate more reasonable FBNs.
To date, researchers have developed many FBN estimation methods to improve the accuracy of disease prediction [10] [11]. Specifically, Pearson’s correlation (PC) is the most popular and simplest method for FBN estimation that measures the full correlation between different brain regions [12]. Despite its simplicity, high computational efficiency and statistical robustness, the full connection FBN tends to include confounding effects from other brain regions. By contrast, the complex interactions between multiple regions-of-interest (ROIs) can be explained by partial correction or its
-regularized version, namely sparse representation (SR).
Compared with the low-order FBN estimation methods, such as PC and SR, some high-order statistics may also offer additional and useful information for FBN analysis [13] [14] [15]. For example, Zhang et al. [16] recently proposed a high-order FBN (HoFBN) estimation method that can effectively find identify biomarkers between different groups of subjects. To put it easy to understand, we consider the relationship network as an analogy, as shown in Figure 1, where the nodes denote persons, the edges are the relationship between the persons, and the weight on each edge is the strength of relationship. In this example, the relationship provides the low-order connection information of the network. However, there may exist some relationships among the friendship network of one person which may provide some high-order connection information for a network system. However, the three traditional methods do not take the
Figure 1. The blue and red nodes are the friends of the i-th node and j-th node respectively, and the yellow nodes represent the common friends of nodes i and j. The blue and red boxes are the friendship networks of i and j. r is the low-order relationship between i and j, and the high-order relationship between i and j is expressed by r' which is measured via the friendship network of i and j.
topological structure of the FBN into account. It has been reported that FBN usually has modularity structure [17] [18] [19], meaning that the network exists some node modules, and the nodes are connected densely in each module, while the connections between these modules are sparse. In consequence, the nodes within a same module tend to have a high probability connect to each other, which may lead to some dependent rows/columns and further resulting in a low-rank edge weight matrix. However, most methods for estimating FBN, such as PC, SR, and HoFC, usually produce full-rank networks, indicating that there are some noises in these networks.
For improving the quality of the FBN estimated by conventional methods, we propose a novel FBN construction method that uses matrix factorization (MF) to characterize the low-rank structure, while preserving the information of traditional FBN as much as possible. Specifically, first, we estimate FBNs for all the subjects using PC, SR and HoFC, and then we perform the MF of the estimeted FBNs to obtain a new connections network with
. Finally, to evaluate the effectiveness of the proposed method, we conduct the experiments based on two public database (Alzheimer’s Disease Neuroimaging Initiative (ADNI) dataset and Autism Brain Imaging Data Exchange (ABIDE) dataset). The experimental results show that our method can effectively improve the identification accuracy compared with the baseline methods.
The rest of this paper is organized as follows. In Section 2, we review three conventional FBN estimation methods, i.e. PC, SR, and HoFC. In Section 3, we first introduce the datasets used in this study and pre-processing pipeline. Then we present our proposed method to estimate FBNs and solving process. In Section 4, we exhibit our experimental setting and the results for classification. In Section 5, we investigate the influence of modeling parameters, number of rank on the final classification performance. Beside, we also discuss the modularity score of FBNs and discriminative functional connections for MCI identify and several limitations of this work as well as the possible research in the future directly. Finally, we briefly summarize this paper in Section 6.
2. Related Work
By reason of the significant role in exploring the inherent organization and neurodegenerative diseases of the brain, researchers have proposed many FBN estimation methods in the past decades. In this section, we review several methods, i.e. PC [20], SR [21], and HoFC [16], which are closely related to our study briefly.
2.1. Pearson’s Correlation
It is well-known that PC is the simplest and most popular method for FBN estimation. Its mathematical expression is given as follows:
(1)
where
is the blood oxygen level-dependent (BOLD) signal associated with the ith ROI, m is the total number of temporal image volumes, and
is the mean of the elements in
. Without loss of generality, we redefine
by centralization and normalization like
. Then, Equation (1) can be simplified as
, which can be easily proved to be the optimal solution of the following optimization problem [22]:
(2)
or its matrix form:
(3)
where W is the adjacency matrix to be estimate by PC,
denotes the data matrix which is the set of BOLD signals, and n is the number of ROIs.
2.2. Sparse Representation
SR is another commonly utilized method to estimate FBNs which regressing out the confounding effect from other ROIs, and the mathematical model of SR is expressed as follows:
(4)
where
is the regularized parameter for controlling the balance of two terms in Equation (4). Mathematically, Equation (4) can be further rewritten by the following matrix form:
(5)
where
is a data fitting term which implies to invert the covariance matrix and capture the information of partial correction,
denotes
-regularized term which encoding the sparsity prior of FBN. Note that, the constraint
is to avoid the trivial solution.
2.3. High-Order Functional Connectivity
As mentioned in Section 1, many HoFBN estimation methods has been proposed in recently years [14] [15] [16]. In this work, we only review the high-order functional connectivity (HOFC) due to it is simplest and direct. The HoFC method includes two steps (as shown in Figure 2): First, computing the low-order correlation coefficient
based on the BOLD signal corresponded by i-th and j-th ROI utilize PC. Then, HOFC between brain regions i and j,
, can be calculated by the pearson’s correlation coefficient between i-th row (or column) and j-th row (or column) in the low-order network.
Figure 2. Pipeline of the estimation of HoFBN use HoFC. (a) PC; (b) Ho-FC.
The model is as follows:
(6)
where
is the i-th row (or column) in low-order network.
3. Materials and Methodology
In this section, we design a framework for brain disease classification, as shown in Figure 3. Including the data preparation, the proposed method, and the MCI/ASD classification pipeline.
3.1. Data Preparation
To verify the availability of the proposed method, we use two benchmark databases, i.e., ADNI dataset and ABIDE dataset in the experiment.
For ADNI dataset, 137 subjects (including 68 MCIs and 69 NCs.) were extracted and preprocessed in [23]. Table 1 shows the demographic information of these 137 subjects. For each subject, the scanning time was 7 min, corresponding to 140 volumes. The head motion, brain size, shape, orientation, gyral anatomy, the differences in image acquisition time between slices, long-term physiological shifts, and instrumental instability have large influence on the fMRI reliability. In order to reduce these influences, a preprocessing pipeline is used in this paper to improve fMRI data quality prior to FBN estimation. At the beginning, the first 3 volumes were removed for signal stabilization, and then processed the remaining 137 volumes via a standard pipeline (the Data Processing Assistant for Resting-State fMRI (DPARSF) toolbox.) [24]. Specifically, the subjects which with more than 2.5 min of FD (alculated based on head motion parameters), larger than 0.5 mm were removed. Then, we reduce the influence of the ventricle and white matter signals used nuisance regression as well as the high-order effect of head motion based on Friston 24-parameters model. After that, we performe the fMRIs onto the Montreal Neurological Institute (MNI) space, followed by spatially smoothing with the full-width-half-maximum of 4 mm. Finally, every brain were divided into 116 ROIs based on the Automated Anatomical Labeling
Figure 3. The main pipeline of MCI/ASD identification used in this study, which contains three major steps: (a) data preparation, (b) FCN construction, and (c) classification with feature selection.
Table 1. Demographic and clinical information of subjects in the ADNI and ABIDE datasets. Values are reported as mean ± standardard deviation. M/F: Male/Female; MMSE: Mini-Mental Examination; GCDR: Global Clinical Dementia Rating; FIQ: Full-Scale Intelligence Quotient; VIQ: Verbal Intelligence Quotient; PIQ: Performance Intelligence Quotient.
(AAL) template [25], and we extracted the mean time series (with band-pass filtered 0.015 - 0.150 Hz) of each ROIs as the input data of the proposed method.
For the ABIDE dataset, we used 184 subjects (including 79 ASDs and 105 NCs) that are from the largest site New York University (NYU) in this work. The demographic information of these subjects are also shown in Table 1. Specifically, the fMRI data were acquired based on a standard echo-planar imaging sequence on a clinical routine 3.0 Tesla Allegra scanner with the following imaging parameters: TR/TE is 2000/15 ms with 180 volumes, the number of slices is 33, and the slice thickness is 4.0 mm. The involved fMRI data are provided by the Preprocessed Connectome Project initiative, and further preprocessed by DPARSF. More specifically, the preprocessing pipeline includes four mainly steps: 1) volume slices and head motion correction, 2) nuisance signals regression, 3) registration to MNI space, and 4) temporal filtering (0.01 - 0.10 Hz). Afterwards, the brain of each subject is partitioned into 116 ROIs used AAL atlas, and extracted mean time series from all these ROIs for constituting the data matrix
.
3.2. Proposed Method for FBN Estimation
To further improve the identifiability of the FBNs, we propose a low-rank FBN estimation method by MF to reduce the rank of the network which is estimated by the conventional methods. There are two motivation for our study: 1) Low-rank structure is beneficial to reduce the noise which disturb the diagnosis of the network, thereby obtaining a more “clear” FBN to improve the accuracy of MCI/ASD identification. 2) As reported in the study Qiao et al. [26], low-rank structure can enhance the modularity of network which is extremely important for promoting stability of our brain. The rank of the network can be reduced by MF effectively which has a elegant probability interpretation and optimal solution, and the model as follow:
(7)
where W is the network estimated by traditionary methods, k is the rank of the FBNs estimated by the proposed method.
is the data fitting term for maintaining as much information of W.
is the matrix-regularized term which is usually necessary in prediction for bias-variance trade-off [27]. Further, make the model concrete:
(8)
note that, we use Frobenius Norm since it has a Guassian noise interpretation, and Equation (8) can be easily transformed into the following matrix form:
(9)
We take the derivative of U and V respectively:
(10)
(11)
And then, Equation (9) can be efficiently solved by alternatively update U and V in each iteration of gradient descent algorithm based on Equation (10) and Equation (11).
is the low-rank FBN constructed by the proposed method.
3.3. FBN-Based Disease Classification
After constructing the FBNs of all subjects, the subsequently work is to classify MCIs (or ASDs) and NCs based on these FBNs. To be specific, the first task is to select the features for identification procedure. In our experiment, 6670 features can be produced, since the adjacency matrix of the estimated FBN is symmetric. Thus, the feature dimension is very high compared with the sample size. It typically causes the curse of dimensionality which generate the classification accuracy with false enhancement [28]. To address this issue, numerous feature selection methods have been proposed such as t-test, least absolute shrinkage and selection operator (LASSO) [29], genetic algorithm (GA) [30] and so on. In this work, we only adopt the simple feature selection method, i.e. t-test, and fixed
empirically. In the second step, we utilize the linear Support Vector Machine (SVM) [31] classifier (
) since the classifier design has an enormous influence on the ultimate accuracy [32]. Specifically, we use the linear kernel function of SVM, i.e.,
. There are two reasons for using the linear kernel function: 1) Since more features (usually more than the number of subjects) are used in our classification task, SVM with linear kernel function can achieve a relatively ideal classification effect. 2) The linear kernel function has fewer parameters and the computational efficiency is higher than the complex kernel function. We choose 5-fold cross validation (5-fold CV) to evaluate the performance of involved methods.
4. Experiment
In this section, we introduce the primary experimental procedure, including the competing methods in this work, the experimental setup and the classification results for different methods.
4.1. Competing Methods
As mentioned in Section 3, after obtaining the pre-processed fMRI data, we extracted mean signal for each ROI as the input data to estimate FBNs based on different methods, including: 1) the conventional methods, i.e. PC, SR and HoFC. 2) LR-PC, LR-SR and LR-HoFC use our proposed method to reduce the rank of the FBN estimated by PC, SR and HoFC.
4.2. Experimental Setup
In our experiments, to verify the effectiveness of the proposed method, we use 5-fold CV to evaluate different methods and repeat 100 times since 5-fold CV usually generate an unstable value. Note that, for a fair comparison, the same linear SVM (with
) is used in all the methods. Since the modeling parameters may significantly affect the structure of the constructed networks and further influence the ultimate classification results, we select optimal parameters through a large-scale grid search. Specifically, we uniformly utilize 11 candidate values [
] for the regularization parameters (
,
and
) in SR, LR-PC, LR-SR and LR-HoFC. In particular, the proposed method has a hidden parameter k which is the rank of the estimated low-rank FBNs. We first assign a value to k empirically, and then we will discuss the value of k in the range of [
] in Section 5. To be consistent with other methods, we use 11 candidate values [
] when building FBN base on PC and LR-HoFC. For example, 100% means all edges are preserved, and 90% means 10% weak edges are removed.
4.3. Estimated Functional Brain Networks
In this section, we visualize the FBNs estimated by six different methods (i.e., PC, SR, HoFC, LR-PC, LR-SR and LR-HoFC) base on a random subject from ADNI/ABIDE dataset. The results are shown in Figure 4, and the parameters are
for SR,
,
for LR-PC, LR-SR and LR-HoFC. Based on Figure 4, it can be observer that the FBN estimated by LR-PC, LR-SR and LR-HoFC are cleaner than the baseline methods, respectively. This means that the proposed method may improve the quality of FBNs by reducing possible noisy edges. Besides, FBNs estimated by our proposed method have higher modularity structure compared with other results, and then we will detailed discuss in Section 5.
4.4. Classification Results
In this section, we report the classification performances of different methods under five evaluation metrics, i.e., accuracy (ACC), specificity (SPE), sensitivity (SEN), F1 and AUC (the area under the receiver operating characteristic (ROC) curve), which mathematical definitions are given as follows:
(12)
(13)
(14)
(15)
where TP, TN, FP and FN represent true positive, true negative, false positive and false negative.
Figure 4. The adjacency matrices of the same subject estimated by six different methods, i.e., (a) PC, (b) SR, (c) HoFC, (d) LR-PC, (e) LR-SR and (f) LR-HoFC.
In Table 2, we report the classification results achieved by six different ways. 1) Results of MCI identification: We found that the performances of LR-PC, LR-SR and LR-HoFC are better than PC, SR and HoFC, respectively. Further, the terms marked by “*” denote that the result of the proposed method is significantly better than that of conventional methods under the assumption of
. Specifically, the LR-PC and LR-SR are significantly better than PC and SR in the terms of all the five evaluation metrics, and LR-HoFC is significantly better than HoFC in the terms of ACC, SEN and F1. 2) Results of ASD identification: It can be seen from Table 2 that the proposed method comprehensively outperforms the competing methods in terms of ACC, SEN, F1 and AUC. Only in SPE, LR-SR is slightly lower than SR. In particular, LR-HoFC achieves the significantly better performance than HoFC in the sense of all the five evaluation metrics. LR-PC and LR-SR obtain significantly better results of four indicator respectively. The results in Table 2 illustrate that the low-rank structure introduced by the proposed method not only can remove the “noise”, but also can enrich the topological structure of FBNs, and thus enhancing the discrimination of the estimated FBNs. In addition, we report the algorithm running time of the four methods in Table 3. It is worth noting that our method achieve the shortest running time, which shows that the proposed method is simple and can work with high efficiency.
Table 2. Classification performance (mean ± standard deviation) of 6 different methods in MCI vs. NC and ASD vs. NC classification. The term marked by “*” denotes that the result of proposed methods is significantly better than that of the baseline methods (with p < 0.05). The best results in each pair of Comparisons are shown in bold.
Table 3. The algorithm running time of different FBN estimation methods.
5. Discussion
Specifically, we study the sensitivity of modeling parameters, the influence of the number of rank in MCI classification, and we show the discriminative features visually. We also compared the modularity score obtained by our method with that obtained by traditional methods and list several limitations of our work.
5.1. Sensitivity to Modeling Parameters
Proverbially, FBN estimation models usually involve the free parameters which significantly affect the FBN structure. In Figure 5, we exhibit the ACC (using leave-one-out (LOO) cross validation) of six methods with different parametric combinations (
,
and
for LR-PC, LR-SR and LR-HoFC, respectively). It can be seen that the classification accuracy of PC, SR and HoFC are exceedingly sensitive to the parametric value [33] [34]. In contrast, the results of LR-PC, LR-SR, and LR-HoFC are relatively stable, which means that the proposed method has better robustness. In addition, we can achieve the best ACC in all three sets of comparisons under the optimal parameter.
5.2. Influence of Number of Rank
As the complexity and unknown of functional brain network, the optimal value of the parameter k is a practical problem. Thus, we report the five performance metrics achieved by LR-PC, LR-SR and LR-HoFC with different values of k in Figure 6 to investigate the influence of the number of k on the classification results. It can be observed in Figure 6 that LR-PC, LR-SR and LR-HoFC achieve the best performance with
,
and
, respectively. Thus, we suspect that FBN achieve the best quality when
. If
, the
Figure 5. The classification accuracy based on FBNs that estimated by six different methods with 121 or 11 parametric combinations.
Figure 6. The ACC, SEN, SPE, F1 and AUC values of our methods (i.e. LR-PC, LR-SR and LR-HoFC) with different numbers of the rank of FBNs (i.e., k) in the task of MCI vs. NC classification. (a) LR-PC; (b) LR-SR; (c) LR-HoFC.
information in the networks may be excessively lost; if
, there may have a certain degree noise in the network.
5.3. Discriminative Functional Connections
For further illustrate the effectiveness of our proposed method, we investigate the most discriminative connections based on the proposed LR-PC, LR-SR and LR-HoFC, respectively. Here, constructing FBNs by LR-PC with
,
, LR-SR with
,
, and LR-HoFC with
,
. And further the most discriminative features (31, 32 and 41 features for LR-PC, LR-SR and LR-HoFC) are selected with
. As visualized in Figure 7, the discrimination of features is represented by the thickness of the corresponding arc.
In Figure 7, we note that several ROIs that marked by red boxes, including middle temporal gyrus, supramarginal gyrus, amygdalae, hippocampus, precuneus and superior-medial frontal gyrus, are selected in our proposed methods. Especially, hippocampus is identified in all three methods, indicating that hippocampus plays a major role in MCI classification. These findings are consistent with several previous studies [35] [36] [37] [38].
5.4. Modularity Scores of FBNs
For quantitatively evaluating the modularity of FBNs, we employ the signed modularity maximization algorithm to calculate the modularity scores of brain networks that constructed by different methods [39] [40]. In addition, to avoid the randomness of the results, we select 24 subjects equidistantly and calculate their FBN (with the optimal combination of parameters) modularity score. It can be observed from Figure 8 that the low-rank structure can improve the
Figure 7. The most discriminative connections in FBNs that estimated by LR-PC, LR-SR and LR-HoFC, respectively. The nodes, indicated by red boxes, are related to the MCI identification according to previous study.
Figure 8. Modularity scores of networks constructed by different methods. And the abscissa indicates the label of the participant. (a) PC vs LR-PC; (b) SR vs LR-SR; (c) HoFC vs LR-HoFC.
modularity of FBN. Note that, the modularity of FBN that estimated by SR and LR-SR are generally higher than that by other methods, which indicates that moderate sparsity also can improve the modular structure of FBN.
5.5. Comparison with State-of-the-Arts
In Table 4, we briefly report the results of several state-of-the-art (SOTA) in the literature on the ADNI dataset compare with our method. As exhibited in Table 4, the results achieved by our method are overall better than the four SOTA methods. Note that, even if the SEN and AUC respectively reported in [11] and [41]
Table 4. Comparison with state-of-the-art methods for FBN-based MCI vs. NC classification with ADNI dataset. The top 2 best results are shown in bold.
are better than our method, their performances was on account of a relatively smaller dataset compared with our study.
5.6. Limitations and Future Work
There are some limitations in our current work. First, we only construct the low-rank FBNs based on the PC, SR and HoFC in this study. In fact, our proposed model can also combined with other FBN estimation methods [44], such as dynamic time warping distance (DTW) [45] and dynamic causal model (DCM) [46]. Second. The low-rank structure only enhance the modularity of FBN, but not make FBNs have an explicit modular structure. In the future, we plan to encode the modular structure with an elegant mathematical model for improving the quality of estimated FBNs, and further improve the classification accuracy.
6. Conclusion
In this paper, we propose a novel approach to estimate FBN by MF. More specifically, we implement this method with two steps. We firstly construct FBNs based on traditional methods, including PC, SR and HoFC. Then, we improve the modular structure of FBNs via MF model. For verifying the effectiveness of this method, we apply it to identify subjects with MCI/ASD from NCs. Compared with the baseline methods, our method has a better performance under five performance metrics. Moreover, we found that the FBNs, estimated by LR-PC, LR-SR and LR-HoFC, have higher modularity than the methods that do not take the low-rank structure of the FBNs into account.
Supported
This work was partly supported by National Natural Science Foundation of China (Nos. 61976110) and Natural Science Foundation of Shandong Province (Nos. ZR2018MF020).