Application of Extreme Learning Machine in Fault Classification of Power Transformer

Reliability of power system is very essential for every nation to generate and transmit power without interruption. Power transformer is one of the most significant electrical apparatus and hence it must be kept in good health. Identification and classification of faults in power transformer is a major research area. Conventional method of fault classification in transformer uses gas concentrations data and interprets them using international standards. These standards are not able to classify the faults correctly under certain conditions. To overcome this limitation, several soft computing tools namely artificial neural network (ANN), Support Vector Machine (SVM) etc. are used to automate the process of classification of faults in transformers. However, there is a scope exists to improve the classification accuracy. Hence, this research work focuses to design Extreme Learning Machine (ELM) method for classifying fault very accurately using enthalpy of dissolved gas content in transformer oil as an input feature. The ELM method is tested with two databases: one based on IEC TC10 database (DB1) and the other one based on data collected from utilities in India (DB2). The application of ELM to Power Transformer fault classification based on enthalpy as input feature outperforms over the conventional classification based on gas concentration as input feature.

power transformer in good health.Generally gases are produced during its operation and they get dissolved in transformer oil.These gases are extracted from oil and measured using gas chromatograph.Alternatively they are measured through gas sensors fitted in the power transformer.Dissolved gas analysis (DGA) is one of the oldest and reliable methods to evaluate whether there is any abnormality inside a power transformer or not [1] [2].A number of methods are in use for the interpretation of dissolved gas analysis.Key Gas method, Roger's ratio method, IEC ratio method, Dual triangle etc. are the methods used.The individual concentrations of six fault gases are used in key gas method.Four common fault conditions are distinguished by the quoted percentage concentrations of the six gases.The Roger's ratio method use four gas ratios resulting into 12 proposed diagnosis.In the revised standard, only six diagnoses are used.In the IEC ratio method, the same three ratios as per the revised Roger's ratio method is used but it suggests different ranges and interpretations.However, there is no universally accepted interpretation method for the DGA.To overcome the limitations of the conventional diagnostic methods, soft computing tools are used to develop automated transformer fault classification.
Artificial neural network (ANN) can be trained to acquire knowledge.However, lots of training data are required for proper training.ANN also has universal approximation property and it enables to approximate any given continuous function with required level of accuracy.These networks are considered as a "black box" models and hence it is very difficult to interpret in their trained state [3].Among various learning algorithms, the support vector machine (SVM) is one of the most important and widely used algorithms for transformer fault classifications.Yan Zhang [4] has reported the use of online DGA data using SVM.Four types of SVM models are used to classify transformer faults.Out of 60 dataset, 35 datasets are used for training and 25 for testing.Cross-validation method is used to calculate the free parameters of the SVM.
The state of art Extreme Learning Machine method offers a competitively good solution for complex tasks.The Extreme learning Machine is a recent second generation neural network algorithm.It is identified as the one which can give high performance in multifaceted problems and takes less computation time compared with other machine learning algorithms [5].The ELM based model proposed by Huang et al. [6] performs quite fast with high learning phase for problems of relatively large dataset which does not require iterative tuning used in other neural networks.ELM based model is used for classification of images in medical applications [7].The paper focuses on classification of tumour using three segment classifications of MR brain images in 3 dimensions.Using 60 neurons in the hidden layer, ELM model achieves 93.2% accuracy.The ELM is successfully applied to face recognition using Yale face database and Carnegie Mellon University face images database [8].It is shown that the state preserving extreme learning machine used for face recognition gives best performance than the other known classifiers.
Single layer feed forward Neural Network is modeled with hidden neurons and activation function by Guang-Bin et al. [9].Extreme learning machine for RBF (Radial Basis Function) network produces faster results.Huang et al. [10] proved that single layer feed-forward neural network with randomly assigned input weights and hidden layer biases and with any nonzero activation function can approximate any continuous functions.Thermodynamic estimation of fault severity is suggested in [11] for transformer fault energy dissipated within the transformer.It is based on the enthalpy of forming one substance from another substance.When a thermodynamic system undergoes transformation, change in enthalpy takes place.The enthalpy [12] for forming one substance from another is the amount of energy required to produce from one mole of one substance to form another expressed in grams per mole.
This research paper focuses on six segment classifications model with dissolved gas analysis data of DB1 database and four segment classifications model with DB2 database.One method based on the conventional approach using gas concentration and the other improved method based on the enthalpy of the gas concentration is used.Section 2 covers the various ELM models for transformer fault classification.Application of ELM for fault classification in transformer using DB1 database is elaborated in Section 3. Section 4 details the application of ELM for fault classification in transformer using DB2 database.Conclusions based on the present study are given in Section 5.

ELM Models
ELM is a single layer with feed forward neural network (SLFNN).It randomly selects input weights and hidden layer biases without training.The output weights are obtained analytically using the norm least square solution and Moore-Penrose inverse of a general linear system [7].Activation functions used are unipolar, bipolar and radial basis function (RBF) for the output neurons.Two databases (DB1 and DB2) are used.DB1 database contain 151 records published by IEC TC10 working group, whereas DB2 has 219 records based on data collected from utilities in INDIA.Four models namely ELM-I, ELM-II, ELM-III and ELM-IV are developed.ELM-I and ELM-III models are based on conventional approach using gas concentrations as input features.In ELM-II and ELM-IV, instead of gas concentrations as input features, enthalpy of the gas concentrations are used.Enthalpy is calculated from the gas concentrations as per the Table 1.

ELM Models Using DB1 Database
The database DB1 contains 151 records of IEC TC10 database.It contains methane, ethane, ethylene, hydrogen and acetylene carbon monoxide and carbon dioxide.The gas data are obtained from gas sensors fitted in the Power transformer or from gas chromatograph.Out of seven gases as shown in Figure 1, five gases namely acetylene, ethylene, ethane, methane and hydrogen are selected for ELM-I and ELM-II models.ELM-I model uses gas concentrations in parts per million (ppm) as input features.
Typical dataset of DB1 database showing input and output features are shown in Table 2. Partial discharge, discharges of low and high energy (D1 and D2), thermal faults (T1, T2 and T3) and normal (NF) form the output features.
For ELM-II model, the gas concentrations are converted into enthalpy and used as input feature.

ELM Models Using DB2 Database
Figure 2 shows the methodology of using DB2 database.DB2 has seven extracted features.Table 5 shows the training and testing dataset used for each output feature.Electrical and thermal fault as output feature has only 12 dataset.

Application of ELM on DB1 Database
The transformer fault classification of DB1 database is analyzed using MATLAB TM software on ELM-I and ELM-II models.In ELM-I model, transformer fault classification is based on the gas concentration present in the transformer from DB1 database.Five gases namely hydrogen, methane, ethane, ethylene and acetylene forms the input features for the model.The performance of the model is analyzed with unipolar, bipolar and radial basis function (RBF) as activation functions.The training and testing efficiencies are calculated as shown in Table 6.
ELM classifier displays a confusion matrix showing the actual and predicted class.In the confusion matrix shown in Table 7, of the 3 cases of actual PD class, the system predicted 2 cases as PD and one case as class T1&T2.The system predicted all D1 class correctly.Of the 15 D2 class, 1 is classified as (T1&T2) class and 1 is classified as NF.With total 4 actual class of T1&T2, the system predicted 1 case as class of T1&T2 and 2 case as class T3 and 1 case as class NF.Out of five actual class of T3, 1 case is wrongly classified as class D2.All 10 actual class of NF is classified correctly.The overall testing efficiency is 84.4% using ELM with DB1 database and gas content as input.
The energy dissipated within the transformer is directly related to the enthalpy of hydrocarbon gases.The gas concentrations are converted to the enthalpy.The enthalpy of hydrogen, methane, ethane, ethylene and acetylene are input features for the ELM-II model.The model is evaluated with different activation functions and number    of neurons in the hidden layer and the results are tabulated in Table 8.
The simulation results are analyzed based on activation function, number of hidden neurons and gas concentration with enthalpy of gas.With gas concentration as input feature, the activation function using unipolar and radial basis function perform better than the bipolar activation function.Similar results are obtained with enthalpy of gas as input features.Optimum results are obtained with 35 neurons in the hidden layer.
In the example confusion matrix shown in Table 9, of the 3 cases of actual class of PD, the system predicted all correctly as class PD.Similarly, the system predicted all cases of class D1 also correctly.However for 15 cases of actual D2 class, the system predicted 13 cases as D2 class and 2 cases are wrongly classified as class D1.Similarly for 10 cases of actual class NF, 9 cases are predicted correctly as class NF and one wrongly predicted as class D2.The overall testing efficiency is 93.3% using DB1 database and enthalpy of gas as input feature.
With gas concentration as input feature, the activation function using unipolar and radial basis function perform better than the bipolar activation function.Similar results are obtained with enthalpy of gas as input features.Optimum results are obtained with 35 neurons in the hidden layer.The simulation results obtained on DB1 database using enthalpy of gas as input feature is compared with conventional gas concentration as input feature and the results are shown in Figure 3.The overall testing efficiency is 84.4% using ELM with DB1 database and gas content as input feature.The testing efficiency improves to 93.3% using DB1 database and enthalpy of gas as input feature.

Application of ELM on DB2 Database
The DB2 database contains 219 records.Transformer fault classification of DB2 database is analyzed using MATLAB TM software on ELM-III and ELM-IV models.In the ELM-III model, gas concentration (ppm) is taken as input features and partial discharge, over hating, arcing and thermal and electrical faults are taken as output features.
The performance of the ELM model is evaluated using different activation functions and number of neurons in the hidden layer.The results showing training and testing efficiencies are shown in Table 10.Confusion matrix constructed for ELM-III model is given in Table 11.
The model predicted PD, OH, E&T class correctly whereas with 11 cases of actual class A (Arcing) , the system predicted 10 cases as class A and 1 case wrongly predicted as class E&T.
Model ELM-IV uses DB2 database and the enthalpy of gases are taken as input features.Partial discharge, overheating, arcing and electrical and thermal faults are treated as output features.The performance of ELM-IV is analyzed by varying the activation function and the number of neurons in the hidden layer and the results are shown in Table 12.
With 35 neurons in the hidden layer, the training efficiency is 95%.The testing efficiency achieved is 100% with activation function of unipolar and bipolar whereas it is 97% with RBF activation function.In the example confusion matrix in Table 13, the system predicted all actual classes of PD, OH, A, E&T correctly.The simulation results are analyzed based on the activation function, number of hidden neurons, gas concentration and enthalpy of gas.With gas concentration as input features, the unipolar as well as bipolar activation function perform better than the radial basis function activation function.Also, when enthalpy of gas is used as input features, the unipolar, bipolar activation function perform better than radial basis activation function.Optimum results are obtained with 35 neurons in the hidden layer.The simulation results obtained on database using enthalpy of gas as input feature is compared with conventional gas concentration as input feature and the results are shown in Figure 4.
ELM testing efficiency of 97.5% is achieved with gas concentration as input features and 100% with enthalpy of gas concentration as input features.

Conclusion
This research paper presented the application of ELM approach to transformer fault classification.The proposed method is simulated in MATLAB TM Software.ELM method is validated on DB1 and DB2 database using four models.Conventional transformer fault classification based on gas concentration as input features is compared with fault classification using enthalpy of gas as input feature.The simulation carried out using DB1 database shows 84.4% testing efficiency with gas concentration as input feature and 93.3% with enthalpy as input feature.Results on DB2 database show 97.5% testing efficiency with gas concentration and 100% with enthalpy as input feature.The results show that ELM with enthalpy as input feature in fault classification of power transformer outperform over the conventional approach.It is proposed to use IEC gas ratios using enthalpy of gas in place of gas concentration as input feature in the future research work.

Figure 1 .
Figure 1.Structure of ELM Models with DB1 database.

Figure 2 .
Figure 2. Structure of ELM Models with DB2 database.Five features are selected for ELM-III and ELM-IV model.Model ELM-III is based on the gas concentrations in ppm and Model-IV is based on the enthalpy of gases.Four types of faults namely partial discharge, overheating, arcing and electrical and thermal faults are taken as output features as in Table 4. Out of 219 dataset in DB2 database, 179 dataset is used for training and 40 dataset is used for testing.Total dataset used for partial discharge, overheating, arcing and Electrical & thermal (E&T) classes are 29, 120, 58 and 12 respectively.Table5shows the training and testing dataset used for each output feature.Electrical and thermal fault as output feature has only 12 dataset.

Table 3
shows the training and testing dataset used for each output feature.Partial discharge has only 9 dataset whereas discharge of high energy has total 48 dataset.Both ELM-I and ELM-II models use total 151 dataset.Out of 151 dataset, 106 dataset is used for training and 45 dataset is used for testing.

Table 1 .
Enthalpy of Transformer fault gases.

Table 2 .
Typical DB1 database with input and output features.
Input Features Gas concentrations (parts per million, ppm)

Table 4 .
Typical DB2 database with input and output features.
Input Features Gas

Table 6 .
Effect of ELM parameters on DB1 database with gas concentration.

Table 7 .
Confusion matrix of DB1 database with gas concentration.

Table 8 .
Effect of ELM parameters on DB1 database with enthalpy.

Table 9 .
Confusion matrix of DB1 database with enthalpy.

Table 10 .
Effect of ELM parameters on DB2 database with gas concentration.

Table 11 .
Confusion matrix of DB2 database with gas concentration.

Table 12 .
Effect of ELM parameters on DB2 database with enthalpy.

Table 13 .
Confusion matrix of DB2 database with enthalpy.