Statistical Features and Traditional SA-SVM Classification Algorithm for Crack Detection

In recent years, the interest in damage identification of structural components through innovative techniques has grown significantly. Damage identification has always been a crucial concern in quality assessment and load capacity rating of infrastructure. In this regard, researchers focus on proposing efficient tools to identify the damages in early stages to prevent the sudden failure in structural components, ensuring the public safety and reducing the asset management costs. The sensing technologies along with the data analysis through various techniques and machine learning approaches have been the area of interest for these innovative techniques. The purpose of this research is to develop a robust method for automatic condition assessment of real-life concrete structures for the detection of relatively small cracks at early stages. A damage identification algorithm is proposed using the hybrid approaches to analyze the sensors data. The data obtained from transducers mounted on concrete beams under static loading in laboratory. These data are used as the input parameters. The method relies only on the measured time responses. After filtering and normalization of the data, the damage sensitive statistical features are extracted from the signals and used as the inputs of Self-Advising Support Vector Machine (SA-SVM) for the classification purpose in civil Engineering area. Finally, the results are compared with traditional methods to investigate the feasibility of the hybrid proposed algorithm. It is demonstrated that the presented method can reliably detect the crack in the structure and thereby enable the real-time infrastructure health monitoring.


Introduction
The main concern in civil infrastructures is the condition-based maintenance which requires the consequent monitoring of critical structures in order to minimize the probability of failure leading major effects on national assets and public safety in every country [1]. Therefore, the structural health monitoring is absolutely an essential concept which can potentially provide effective solutions to assess the health state of infrastructure. It can reduce the asset management costs, effectively prolongs operational lifetime and ensures the public safety. In this regard, getting access to a robust paradigm to deal with aforementioned concerns is of high priority.
The considerable literature [2]- [10] on structural health monitoring has highlighted the crucial importance of damage detection in civil engineering systems in the earliest time.
One of the effective monitoring techniques in determining the health state of structures is an automatic diagnosis system using sensing data which is based on the principle that damage in a structure, e.g. crack or a loosened connection, can modifies the dynamic structural responses [11]. The automated condition assessment systems can interpret the large volumes of inspection data to detect and prevent potential structural failure in early stages by minimizing errors to ensure effective risk management, while reducing the asset management costs.
Therefore, the main objective of this paper is to propose a more robust scheme of automated condition assessment system to monitor and evaluate the health state of the structure. For the purpose of this detection system, non-destructive testing (NDT) method which has attracted lots of attention in structural health monitoring for characterizing and assessing the materials and structures is used to characterize the concrete members. Thereafter, the signal processing and artificial intelligent approaches are employed to analyze the data and enhance the accuracy of detection. However, the critical challenge here is the huge number of data which are obtained. The high priority is proposing the potential algorithm to reduce the loss of information while reducing the noise effect. The second concern is the proper extraction of features which can effectively determine the health state of the structure. These features need to be sensitive enough to detect any discontinuity happens in the structure. These features are directly affecting the performance of the classifier. The third challenge is choosing the appropriate classifier that identifies the crack as happened. The question raised here is what features and which classification technique are appropriate to determine the structural condition; what can be measured that correlates to damage, and how to measure and employ the raw measurements to determine the structural condition.
In this paper, the mounted SA based approach is utilized to detect and monitor the cracks in concrete members under loading for simple concrete beams.  set to feed the machine learning approaches, while in this paper the combination of the three damage indexes which are obtained between the healthy and target signal are considered as the feature set of the system. The Self-Advising Support Vector Machine [12] is used to classify and detect the damages in classification step. This machine learning approach has not been ever used in the application of crack detection in civil engineering to determine its effectiveness in the area.
The results will be analyzed and compared with other traditional algorithms to evaluate the accuracy and effectiveness of the system.

The Proposed Damage Detection Algorithm
This section explains the proposed damage identification algorithm for SHM.
This paradigm consists of four steps: Data Collection, Pre-Processing and Feature Extraction, Classification.

Data Collection and Pre-Processing
The Experimental analysis is performed on the data collected in Centre of Infrastructural Engineering laboratory (CIE) through using the piezoelectric-based transducers mounted on the concrete beams under static loading. These transducers are transforming the energy of transient stress wave to an electric waveform. The other transducer is used to receive the signal. Three standard concrete beams (400 × 100 × 100) mm 3 have been examined subjected to three-point bending. The recorded signals are pre-processed through signal processing approaches.

Feature Extraction
After de-noising the signal through the Wavelet signal denoiser tool, the following features are extracted from the signal in time-domain. 1) Difference of gradient between healthy and non-healthy state. In gradient, the central difference for interior data points is calculated.
2) Difference of mean between healthy and non-healthy state.
3) Difference of root mean square error between healthy and non-healthy

Classification
The Support Vector Machine (SVM) and the traditional Self-Advising Support Vector Machine (SA-SVM) is used to classify the extracted features. The brief descriptions of these two learning machines are expressed as follows.

Support Vector Machine
Support Vector Machine has been proven as a powerful technique of classification in many research areas. This statistical machine learning approach was firstly presented in 1995 by Vapnik [13]. This learning algorithm prevents over fitting by selecting a particular hyperplane among many which can separate the data in feature space. SVM applies a linear separating hyperplane for creating a classifier to maximize the margin. The width of margin between the classes is considered as the optimization criterion. Margin is defined as the distance of optimal hyperplane and nearest training data points of a class. In cases of non-linearly separation of original input space, in 1992, Guyon, Boser and Vapnik presented an approach to generate nonlinear classifiers using kernel functions [14]. SVM firstly transforms the original feature to a higher dimensional feature space. The transformation may be obtained through different nonlinear mappings. The kernel function K(x; y) may be chosen to suit the problem. After this transformation, the optimal hyperplane may easily found. The achieved hyperplane is the optimal case with respect to a maximal margin [14].
In this paper, SVM is selected as the method of choice as it linearly classifies data in a high dimensional feature space which is related nonlinearly to the input space using specific Radial Basis (RBF) kernel.

Traditional Self-Advising Support Vector Machine
The Self-Advising Support Vector Machine is a newly developed Support Vector Machine introduced in 2014 [12]. This machine uses the prior knowledge in its decision making to separate one class from the others. This classifier receive some additional information through finding the misclassified data in training phase and use this knowledge along with the training data to make the classification decision. This information are the weights assigned to the outliers.
The other contribution of this paper is to investigate the application of this algorithm in damage detection particularly crack detection in civil engineering area with the extracted features. In the SA-SVM, the outliers are defined in training phase as [12]. Although the misclassified data can be null, different experiments has proven that it mostly includes data. The neighbourhood length (NL) for each MD is formulated as [12], The distance is computed in higher dimension Finally, the proposed weighing for the data are computed as [12],

Results
In this section, the results of the proposed algorithm are investigated. Figure 1 shows the de-noised signal through wavelet de-noiser. The red signal is the signal obtained through the sensor on concrete beam and the blue signal shows the de-noised signal through the Wavelet de-noiser.
The result shows the effectiveness of this de-nosing tool on the experiment obtained data.      Therefore, in this paper to take the advantage of all three damage indexes, the combination of three damage indexes has been taken into account to provide the more affective damage recognition. In this regard, the extracted features through Equation (1) and Equation (4) and Equation (7) which computes these damage indexes between the healthy and non-healthy state of the concrete beam are considered as the feature vector. This feature vector is sensitive to any change that may happen in the structure. Figure 5 shows the comparison of gradient, mean and mean square error between healthy and non-healthy concrete beam in one graph. The comparison shows that the mean square error difference has the highest value than the other two mean and gradient differences.
The green curve which represents the mean square error reaches the highest value more than 0.02, while the other two curves values are less than 0.02. It shows that the mean square error shows the differences between the healthy and non-healthy concrete beam more accurately.  Table 1 indicates the average performance of the features in detecting the dam- Based on the results, average accuracy and F-score of "our proposed approach" are 83.54377 and 0.54242, respectively.

Comparing the Results of "Feature Extraction and SA-SVM Classification" and "Feature Extraction and SVM Classification"
For the purpose of comparison, the SVM is also trained with the same set of features employed for our proposed algorithm results in Table 1. The results of "GMM/SVM classification" are shown in Table 2  The results show the better performance of our proposed algorithm of "GMM/SA-SVM classification" than another algorithm. Our achieved results are confirmed by the results achieved by the reference paper we used for SA-SVM [9]. Both the reference paper and our results show a similar pattern in performance of SVM and SA-SVM. The t-test is also carried out as another analysis factor to show the effectiveness of the proposed algorithm rather than the traditional one. Table 3 shows the superiority of our proposed algorithm rather than the traditional one through the t-test.
A. N. Hoshyar et al.  The p-value is less than 0.05 which indicates that the difference between the obtained accuracy of the traditional and proposed algorithm in this paper are significant.

Conclusion
The structuring of automatic damage detection system in civil structures has been proposed and developed. The experimental study is performed to identify cracks in a concrete beam under static loading in laboratory condition. The piezoelectric-based transducers have been used to present the discontinuities inside Journal of Signal and Information Processing the structure. The hybrid statistical feature analysis algorithm based on the combination of three damage indexes which are the differences of gradient, mean, mean square error between the healthy and non-healthy state of the structure along with the Self-Advising Support Vector Machine is carried out on the obtained signals to determine the health state of the structure. The result shows that the proposed algorithm is able to determine the state of the structure more accurately when compared with traditional algorithms. This algorithm can be applicable in other area of research. Further improvements on accuracy can be achieved through utilizing the more features in the feature set vector.