Bipolar and Schizophrenia Disorders Diagnosis Using Artificial Neural Network

Motivation: Bipolar disorder (BD) and schizophrenia (SZ) has a difficult diagnosis, so the main objective of this article is to propose the use of Artificial Neural Networks (ANNs) to classify (diagnose) groups of patients with BD or SZ from a control group using sociodemographic and biochemical variables. Methods: Artificial neural networks are used as classifying tool. The data from this study were obtained from the array collection from Stanley Neuropathology Consortium databank. Inflammatory markers and characteristics of the sampled population were the inputs variables. Results: Our findings suggest that an artificial neural network could be trained with more than 90% accuracy, aiming the classification and diagnosis of bipolar, schizophrenia and control healthy group. Conclusion: Trained ANNs could be used to improve diagnosis in Schizophrenia and Bipolar disorders.


Introduction
Artificial neural network (ANN) attempts to use multiple layers of calculations to emulate the neuronal circuitry in the human brain by interpreting and drawing conclusions from several information.ANN algorithms are mathematical models based on biological neural systems that simulate the behavior of neurons.upon the activation function [1].The ANN inputs are multiplied by different weights to generate a predictive response.So, these responses in ANN are widely used for several applications such as classification and pattern recognition [2].This tool is effective modeling non-linear relationships that may be a promising candidate for differentiation for several biological processes [3].ANN are used in medical field to analysis of sleep disorders, cytopathology and histopathology such as classification of breast cancer images and others, in prediction of heart disease, CD4+ T cell differentiation and immune cell subset classification, combining clinical predictors of antidepressant response in mood disorder, and other classifications [4] [5].
Bipolar disorder (BD) and schizophrenia (SZ) are complex mental disorders with high genetic load, and are the largest global contributors to years with functional disability [6].These disorders are among the most severe psychiatric disorders that affects around 1% and 0.8% of the general population, respectively [7] [8] [9].BD is characterized by depressive, (hypo)manic, or mixed episodes [10].SZ is characterized by amotivational, disorganized, affective, delusional, hallucinatory, or catatonic symptoms [10].Both have been associated with negative health outcomes and progressive impairments [11] [12].Therefore improve the diagnosis may be associated with a better definition of the treatment and lower damages to subject.Several evidences point out that psychiatric disorders have several changes in the molecular and functional mechanisms of the neuron, leading to observable changes in the brain [13].Studies suggest that stressful events are important in the early stages of the disease [14], so the phenotypic manifestation of mood disorders is presumably the result of the interaction between the effects of environmental stress and genetic predisposition.
BD and SZ have been associated with alteration in inflammatory cytokine levels, including Interleukin (IL-1, IL-6, IL-18, and IL-10), tumor necrosis factor alpha (TNF-α) and beta (TNF-β), transforming growth factor beta (TGF-β), and interferon gamma (IFNγ), when compared to healthy controls, and has been associated to neurotrophic factors changes [15] [16] [17] [18] [19].These cytokines are produced by a variety of cell types including immune cells, muscular cells, glial cells and neurons; they mediate signaling between immune cells, and are mainly secreted from monocytes, macrophages or lymphocytes [20].Moreover, cytokines play a central role in the control and modulation of inflammatory responses, and modulate the neurotrophins, with a constant balance between proinflammatory and anti-inflammatory cytokines [21].
In the last years, there has been an upsurge of interest within the neuroscience community in the use of artificial intelligence (AI) methods, including ANN [22] [23].Moreover, ANN analyses are gaining traction in psychiatric research, providing predictive models for both clinical practice and public health systems.
Compared with traditional statistical methods that provide primarily average group-level results, machine-learning algorithms provide predictions and stratification of clinical outcomes at the level of an individual subject [24].However, for the best of our knowledge there is no ANN using an accessible peripheral biomarker (inflammatory interleukins) to classify the outcome in BP and SZ patients.In this way, the main objective of this article is to propose the use of ANN to classify (diagnose) groups of patients with BP or SZ from a control group.

Selection, Clinical Information, and Diagnosis
Briefly, donors for the brain collection are identified by investigators in original study.Individuals over age 65 are excluded because of the increased likelihood of comorbid neurological disorders.A preliminary diagnosis and requests permission for donation of the brain and for release of the deceased's medical records was solicited.Data regarding the sociodemographic, clinical and psychiatric history, substances use was collected.Medical and psychiatric records are requested for known hospitalizations and outpatient treatments to be made until sufficient information has been collected to make a clear diagnosis.All records was reviewed by one psychiatrist and the information is entered into a computerized database (demographic data, family history, education, age of onset, total duration of hospitalizations, psychiatric diagnosis, cause of death, medical diagnoses, medications at time of death, brain weight, interval between death and refrigeration of body, and interval between death and freezing of brain tissue [postmortem interval (PMI)]) by identifying number only (more details see [25]).After all information was collected, the DSM-IV [10] psychiatric diagnosis was made independently by two senior psychiatrists.If there was disagreement between them, the records were made by a third senior psychiatrist.

Processing of Brain Tissue
Trained medical examiners collected and processed the brain tissue.Half of brain was fixed in formalin while the other is cut into 1.5 cm thick coronal slices and frozen in a mixture of isopentane and dry ice.The frozen half was stored at −70˚C until analysis.

Neuropathology Consortium
The Stanley Medical Research Institute (SMRI) [25] provides postmortem brain tissue for research since 1994 facilitating the number and quality of neuropathology studies for the major psychiatric disorders and to identify possible targets for drug development.In this context, the arraycollection [26] provides samples with 35 cases in each of three groups: SZ, BD and unaffected controls.
The diagnostic groups in collections are matched for the descriptive variables, age, gender, race, postmortem interval, mRNA quality (RIN), brain pH and hemisphere [25] [27].All samples were collected between September 1994 and February 1997.The specimens that constitute the Neuropathology Consortium are made available without charge to research groups around the world.

Multiplex Immunoassay Analysis
All analytes were measured by multiplex immunoassay.Extracts (200 µL) were analyzed using the Discovery MAP™ multiplexed immunoassay panel at Myriad-RBM (Austin, TX, USA).Each assay was calibrated using duplicate 8-point standard curves, and raw intensity measurements were interpreted into final protein concentrations using proprietary software.Machine performance was verified using quality control samples at low, medium, and high levels for each analyte [28].

ANN Training and Statistical Analyses
In this study the inputs to the first layer of the neural network consist of 34 sociodemographic and biochemical variables while the target output consist of the following outputs trainings classifications: 1) control or case group; 2) control or BD group; 3) control or SZ group; and 4) BD or SZ group.The network is then trained to attempt to predict response from the set of variables.Supervised ANNs were applied in this work.Supervised ANNs means that the output is already know, in a training data bank.Supervised ANNs calculate an error function between the desired fixed output (target) and their own output, and adjust the connection strengths (weights) during the training process to minimize the result of the error function.The trained ANN can be seen as an equation, which translate the ANNs inputs into outputs, and rules by which the weights are modified to minimize the error of the equation [29].A general ANN can be identified in Figure 1, this topology has p inputs, one weight connected in each input, k neurons in parallel in a Hidden (or middle) layer, with a non-linear activation function, and one neuron in output layer, with linear activation function, for one output variable.This model of ANN can approximate the output of any continuous function [2], and was used in this work to classify the diagnosis.
To perform ANN training analyses, the OpenNN software was used.It is a multiplatform and open source software, for artificial neural networks [30].In this work, the weights were randomized at the start of each training, and trained until the performance increase was above 1e-6 with the quasi-newton method.Also, the data bank was divided for training and testing, being 80% for training and 20% of the bank for test.The ANN training gives as result a confusion matrix.The accuracy, sensitivity, specificity and F1 score (harmonic mean between Figure 1.Neural network representation with p inputs, k neurons in hidden (middle) layer and one output neuron for one output variable.Note that is possible to have p ≠ k, provided that each input has its weights.In this work we used 35 inputs (brain inflammatory markers and characteristics of the sampled population) and the output is binary as a classification of control group or BD or SZ disease.We vary the neurons in hidden layer to compare results and found the best suitable network.precision and sensitivity) were obtained from confusion matrix calculation.Some related works also show classification functions to analyze the ANN training [31] [32].In this work, we performed ten trainings for each number of neurons in hidden layer.They are show as mean and standard deviation (mean ± S.D.) and the best achieved result of these trainings is also presented.

Results
The analysis was conducted with 104 data samples.The original data bank was composed of 105 individuals, 35 in bipolar disorder, 35 in schizophrenia and 35 in control group.One case of bipolar disorder has no data available, so training and analysis with BD were matched to 34 individuals.
The characteristics of patients are shown in Table 1.In BD group, most subjects were female (67%) while in SZ and control groups were male (74% in both groups) (p = 0.025).44% of BD patients and 20% of SZ patients are suicide victims, although there are no cases in control group (p ≤ 0.001).The mean age was 45.4 ± 10.67, 42.57± 8.47, and 44.2 ± 7.58 years for BD, SZ and control respectively.The duration of illness was 20.00 ± 9.62 and 21.29 ± 10.14 years for BD and SZ, respectively.In addition, the lifetime antipsychotics were 1.13 ± 2.62 and 9.70 ± 11.45 years for BD and SZ, respectively.Smoking data was also available, but due to the more than forty percent of missing values, these data was not take intoaccount at the training or in Table 1.For the duration of illness, lifetime antipsychotics and suicide status, there were no event in control group, so these be seen as accuracy, F1 score, sensitivity and specificity in these tables.For the best results, the displayed data stands for the best accuracy case, taking the F1 score, sensitivity and specificity of this best case, using F1 score as tiebreaker when needed.ANN middle layer neurons were also compared from three to nine neurons.For less than three and more than nine neurons in ANN hidden layer, the accuracy results was below 50% (data not show).
A classification with cases and controls were performed.BD and SZ were grouped, (samples were randomly drawn to match the control group), in one patient cases group.An ANN training was done to classify cases and control groups.Results of these training can be seen in Table 2. Results in Table 2 show the best achieved accuracy, F1 score, sensitivity and specificity; they were respectively 0.93, 0.95, 1.00 and 0.80 for three neurons.
For differentiation of BD from healthy people, ANN training results can be seen in Table 3.One sample from control group was randomly drawn to match the BD group data sample.In Table 3

Discussion
In present study was applied an ANN to discriminate two mental disorders and health subjects in clinical and inflammatory-based data.The ANN model was able to identify correctly a high percentage of subjects with a psychiatric disorder in a sample with sick and healthy individuals.Moreover, the ANN model shows very specific and sensitive, in confusion matrix interpretation.In our knowledge, this is the first study that proposes an ANN model to improve the use of markers and clinical data in diagnoses of BD and SZ.Our analyses suggest that an ANN function could properly classify the cases and control groups of these disorders.
Studies investigating the impact of a variety of inflammatory stimuli on the brain and behavior have reported evidence that inflammation and the release of inflammatory cytokines affect the relevant circuits for BD and SZ [16] [28].Inflammatory cytokines reach the brain and are associated with increased expression of pro-inflammatory eicosanoids, nitric oxide, TNF-α, IL-1β, reactive oxygen species, as well as monocytes and macrophages in the brain [17].Several studies have suggested an imbalance in pro and anti-inflammatory responses in the pathogenesis of SZ and BD.However, the mechanisms involved in these processes remain unknown.Thus, the results found in our study agree with previous studies where there is an involvement of the immune system in these disorders.There is clinical evidence that mood disorders are immunoinflammatory disorders characterized, among other things, by the increase of proinflammatory cytokines [16] [17].These evidences have stimulated the search for relevant peripheral markers, and there are several indications of relationships between metabolic, pro and anti-inflammatory, pro-oxidant and antioxidant systems, among others.
Advances in technology and data acquisition have simplified the collection and storage of large datasets with long time series, finding increasingly frequent and varied fields of application, including biomedical and data mining areas.In this way, the process of evaluating large volumes of data is an invaluable process and the recent studies emphasize the use of AI methods with promising results [23] [24].Supervised ANN methods address individual differences, rather than considering differences between groups, as do more traditional statistical comparisons, and classifying individuals in order to contribute to the clinical decision making process.These methods generate a model using a training set that includes input and output data.After the classification process, the model is tested using external test data to estimate the predictive capacity of the model.
These methods are also sensitive to spatially distributed and subtle brain effects that would otherwise be indistinguishable by applying traditional univariate methods that focus on gross differences at the group level [23].Although ANN methods are used in biomedical studies, AI techniques in psychiatric disorders are still incipient.Several neuroimaging studies of [34] use AI techniques and neural networks to look for possible changes in BD patients.In addition, these authors have described, from the clinical point of view, findings relevant to the pathophysiological understanding of bipolar disorder.In this sense, our study has demonstrated that there is an interaction between several neurochemical and inflammatory factors that may be directly involved in BP and SZ.Regarding peripheral markers, there are still few studies that used AI techniques to identify biomarkers in patients with bipolar disorder or schizophrenia.A study by [35] was highlighted.The Space Vector Machine (SVM) algorithm differentiated patients with bipolar disorder from healthy controls with a predictive accuracy of 72.5%, and patients with schizophrenia from healthy subjects with a prediction accuracy of 77.5%.However, the algorithm was not able to differentiate patients with bipolar disorder from patients with schizophrenia (REF).In our study, although using a different technique, it found an accuracy of 92% when comparing patients with BD with healthy individuals, and 93% when we compared SZ with healthy individuals.Moreover, our findings differentiate patients with bipolar disorder from patients with schizophrenia; it was found an accuracy of 92%.It is necessary to point out that in the study of [35]; the evaluations were carried out on blood samples, whereas the sample of this study was brain tissue.This is a study to evaluate the feasibility of using a biomarker tool developed with ANN algorithms to identify a patient with bipolar disorder or schizophrenia when compared to healthy controls.However, the present work has some limitations: 1) Our sample was small as we used a brain from post-mortem tissue; 2) the majority of individuals were taking medication, a factor that influences the results obtained.Despite these limitations, future studies should assess larger samples from multiple centers; use advanced mathematical techniques combined with other biological and clinical variables to improve our knowledge about schizophrenia and bipolar disorder.Moreover, in the last years the use of ANNs has been growing as to a promise approach in basic and clinical studies.Here, our findings suggest that artificial neural network could be valid to detect the role of markers in the involvement of inflammatory mechanisms in the pathophysiology of bipolar disorder and schizophrenia.
Statistical analyzes were performed by the Statistical Program for Social Sciences (SPSS) 22.0.The Chi Square and Analysis of Variance (ANOVA) test were used to check the consistency of match between groups.
variables were not used in ANN training.Demographic data and complementary information were provided in Stanley Foundation research site [33].ANN training results changing the comparisons groups can be analyzed in Tables 2-5.They present the best result of ten different training and the mean and ±standard deviation of these trainings.The classification training results can

Table 4 .
best training result is for six neurons, using F1 score as tiebreaker.This gives 0.92 of accuracy, 0.95 of F1 score, 1.00 for sensitivity and 0.75 for specificity.So with six neurons in ANN hidden layer, 92% of accuracy diagnosis can be achieved, recognizing all BD patients.Schizophrenia patients group is differentiated from healthy group in Table 4 best results for accuracy, F1 score, sensitivity and specificity, were respectively 0.93, 0.93, 1.00, 0.86 for seven neurons in ANN hidden layer.So, one can classify with 93 of accuracy a SZ patient from healthy people, correctly identifying all SZ patients.

Table 5 shows
BD and SZ ANN training classification results.One sample from SZ group was randomly drawn to match the BD group data sample.It ex-

Table 1 .
Characteristics of the sampled population for cases and control group.
BD = Bipolar disorder, SZ = schizophrenia, and CTRL = control.a Simple and relative frequencies (%), b Mean and standard deviation, c 12 cases in BD with no use of Antipsychotics, while all cases in SZ used.

Table 2 .
Cases (BD and SZ grouped) and control groups ANN classification training results.Result for each number of Neurons and results shown for 10 different ANN training for each number of neurons in hidden layer (70 different trainings).Accuracy, F1 Score, Sensitivity and Specificity, are presented as mean and standard deviation (S.D.).

Table 3 .
Bipolar disorder and control groups ANN classification training results.
Results shown for 10 different ANN training for each number of neurons in hidden layer (70 different trainings).Accuracy, F1 Score, Sensitivity and Specificity, are presented as mean and standard deviation (S.D.), and the best result for each number of Neurons.

Table 4 .
Schizophrenia and control groups ANN classification training results.
Results shown for 10 different ANN training for each number of neurons in hidden layer (70 different trainings).Accuracy, F1 Score, Sensitivity and Specificity, are presented as mean and standard deviation (S.D.), and the best result for each number of Neurons.

Table 5 .
Bipolar disorder and Schizophrenia ANN classification training results.Results shown for 10 different ANN training for each number of neurons in hidden layer (70 different trainings).Accuracy, F1 Score, Sensitivity and Specificity, are presented as mean and standard deviation (S.D.), and the best result for each number of Neurons.