Application of Least Square Support Vector Machine (LSSVM) for Determination of Evaporation Losses in Reservoirs

This article adopts the Least Square Support Vector Machine (LSSVM) for prediction of Evaporation Losses (EL) in reservoirs. LSSVM is firmly based on statistical learning theory and is used here as a regression technique. The inputs of the LSSVM model are mean air temperature (T) (°C), average wind speed (WS) (m/sec), sunshine hours (SH) (hrs/day), and mean relative humidity (RH) (%). LSSVM has also been used to compute error bars for the predicted data. An equation has been developed for the determination of EL. A sensitivity analysis has been performed to investigate the importance of each input parameter. A comparative study has been presented between the LSSVM and artificial neural network (ANN) models. This study shows that LSSVM is a powerful tool for determination of EL in reservoirs.


Introduction
Evaporation is one of the most significant water loss processes in a reservoir, and it often occurs in large quantities. For example, in the late 1950s total evaporation from water surfaces in the United States was greater than the total amount of water withdrawn for domestic purposes by cities and towns [1]. Therefore, the determination of Evaporation Loss (EL) in reservoirs is an imperative task in earth science. Due to complex interactions among the components of the land-plant-atmosphere system, the determination of EL in a reservoir is a complicated task [2]. Researchers use different methods for prediction of EL in reservoirs [3][4][5][6][7]. Recently, Deswal and Pal [8] successfully employed an Artificial Neural Network (ANN) for determination of EL in a reservoir. However, ANN has some limitations, listed below:
• Unlike other statistical models, ANN does not provide information about the relative importance of the various parameters [9].
• The knowledge acquired during the training of the model is stored in an implicit manner, and hence it is very difficult to come up with a reasonable interpretation of the overall structure of the network [10].
• In addition, ANN has some inherent drawbacks such as slow convergence speed, poor generalization performance, convergence to local minima, and over-fitting problems.
This article adopts the Least Square Support Vector Machine (LSSVM) for prediction of EL in a reservoir. The database has been collected from the work of [8]. The database contains information about EL, mean air temperature (T) (°C), average wind speed (WS) (m/sec), sunshine hours (SH) (hrs/day), and mean relative humidity (RH) (%). LSSVM is a statistical learning method that adopts a least squares loss function, which leads to a linear system instead of the quadratic program of the standard SVM [11]. LSSVM is closely related to regularization networks [12]. With the quadratic cost function, the optimization problem reduces to finding the solution of a set of linear equations. LSSVM has been successfully applied for solving different problems in engineering [13][14][15]. This study has the following aims:
• To examine the capability of the LSSVM model for prediction of EL
• To determine the error bar of the predicted EL
• To develop an equation for prediction of EL
• To make a comparative study between the developed LSSVM and the ANN model developed by [8]
• To carry out a sensitivity analysis to determine the effect of each input parameter on EL.

Details of LSSVM
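LSSVM models are an alternate formulation of SVM regression [16] proposed by [17]. In feature space, LSSVM models take the form

$$y(x) = w^{T}\varphi(x) + b \tag{1}$$

where the nonlinear mapping $\varphi(\cdot)$ maps the input data into a higher dimensional feature space; $w \in R^{N}$; $b \in r$; w = an adjustable weight vector; b = the scalar threshold.
In LSSVM for function estimation the following optimization problem is formulated:

$$\text{Minimize: } \frac{1}{2} w^{T} w + \gamma \frac{1}{2} \sum_{k=1}^{N} e_{k}^{2} \tag{2}$$

$$\text{Subject to: } y(x_{k}) = w^{T}\varphi(x_{k}) + b + e_{k}, \quad k = 1, \ldots, N \tag{3}$$

where $e_{k}$ = error variable and γ = regularization parameter. The following equation for EL prediction has been obtained by solving the above optimization problem [18][19]:

$$EL = y(x) = \sum_{k=1}^{N} \alpha_{k} K(x, x_{k}) + b \tag{4}$$

where $\alpha_{k}$ are the Lagrange multipliers and $K(x, x_{k})$ is the kernel function. The radial basis function has been used as the kernel function in this analysis. The radial basis function is given by

$$K(x_{k}, x_{l}) = \exp\left\{-\frac{(x_{k} - x_{l})(x_{k} - x_{l})^{T}}{2\sigma^{2}}\right\} \tag{5}$$

where σ is the width of the radial basis function.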
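As an illustration of how the LSSVM training problem reduces to a set of linear equations, the following minimal Python sketch builds and solves the standard LSSVM system for the multipliers α and the threshold b, then evaluates the kernel expansion for a new input. It is not the MATLAB code used in this study. The function names, the small Gaussian elimination helper, and the toy data are assumptions introduced only for illustration.

```python
import math


def rbf_kernel(x, z, sigma):
    """Radial basis kernel exp(-||x - z||^2 / (2 sigma^2))."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, z))
    return math.exp(-sq_dist / (2.0 * sigma ** 2))


def solve_linear(A, rhs):
    """Solve A u = rhs by Gaussian elimination with partial pivoting."""
    n = len(rhs)
    M = [row[:] + [rhs[i]] for i, row in enumerate(A)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            factor = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= factor * M[col][c]
    u = [0.0] * n
    for i in reversed(range(n)):
        tail = sum(M[i][c] * u[c] for c in range(i + 1, n))
        u[i] = (M[i][n] - tail) / M[i][i]
    return u


def lssvm_fit(X, y, gamma, sigma):
    """Return (alpha, b) solving [[0, 1^T], [1, K + I/gamma]] [b; alpha] = [0; y]."""
    n = len(X)
    A = [[0.0] + [1.0] * n]
    for i in range(n):
        row = [1.0] + [rbf_kernel(X[i], X[j], sigma) for j in range(n)]
        row[i + 1] += 1.0 / gamma  # regularization term on the diagonal
        A.append(row)
    solution = solve_linear(A, [0.0] + list(y))
    return solution[1:], solution[0]


def lssvm_predict(x, X, alpha, b, sigma):
    """Evaluate sum_k alpha_k K(x, x_k) + b."""
    return sum(a * rbf_kernel(x, xk, sigma) for a, xk in zip(alpha, X)) + b


# Toy data (made up for illustration, not the reservoir dataset).
X_toy = [[0.0, 0.1], [0.5, 0.4], [1.0, 0.9], [0.2, 0.8]]
y_toy = [0.1, 0.4, 0.9, 0.3]
alpha_toy, b_toy = lssvm_fit(X_toy, y_toy, gamma=100.0, sigma=2.0)
print(lssvm_predict([0.6, 0.5], X_toy, alpha_toy, b_toy, sigma=2.0))
```

A dense direct solve is adequate at the scale of this study (a few dozen training cases); larger datasets would normally call for a numerical linear algebra library.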
The above LSSVM has been adopted for determination of EL. The data were collected from a reservoir at Anand Sagar, Shegaon (India). The evaporation loss data were collected for one year only, whereas the other data, covering a period of fifteen years (1990 to 2004), were obtained from a full climatic station at Manasgaon, about 9 km from Shegaon, lying under the water resources division, Amravati Hydrology Project (Maharashtra, India). The dataset contains 48 cases. The data have been divided into two subsets: a training dataset, to construct the model, and a testing dataset, to estimate the model performance. In this study, 34 cases are used as the training dataset and the remaining 14 cases as the testing dataset. The data are scaled between 0 and 1. This study uses the radial basis function, $K(x_{k}, x_{l}) = \exp\{-(x_{k} - x_{l})(x_{k} - x_{l})^{T} / (2\sigma^{2})\}$, where σ is the width of the radial basis function, as the kernel function. The design values of γ and σ are determined during the training of LSSVM.
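The data preparation described above can be sketched as follows. The text states only that the data are scaled between 0 and 1 and divided into 34 training and 14 testing cases. The min-max formula and the choice of the first 34 rows for training are assumptions made for this sketch, and the records are synthetic placeholders rather than the Anand Sagar measurements.

```python
def min_max_scale(values):
    """Scale a list of numbers linearly onto [0, 1] (assumed scaling rule)."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0 for _ in values]  # constant column: nothing to spread
    return [(v - lo) / (hi - lo) for v in values]


def scale_table(rows):
    """Apply min-max scaling to every column of a row-oriented table."""
    columns = [min_max_scale(list(col)) for col in zip(*rows)]
    return [list(row) for row in zip(*columns)]


def split_train_test(rows, n_train):
    """Assumed split: first n_train rows for training, the rest for testing."""
    return rows[:n_train], rows[n_train:]


# Synthetic placeholder records [T, WS, SH, RH, EL]; NOT the measured data.
records = [
    [20.0 + 0.4 * i, 1.0 + 0.05 * (i % 7), 6.0 + 0.1 * (i % 5),
     80.0 - 0.9 * i, 3.0 + 0.15 * i]
    for i in range(48)
]

scaled = scale_table(records)
train, test = split_train_test(scaled, n_train=34)
print(len(train), len(test))
```

If the testing cases are scaled separately in practice, the minimum and maximum from the training data should be reused so that both subsets share one scale.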
In this study, a sensitivity analysis has been done to extract the cause and effect relationship between the inputs and outputs of the LSSVM model. The basic idea is that each input of the model is offset slightly and the corresponding change in the output is reported. The procedure has been taken from the work of [20]. According to [20], the sensitivity (S) of each input parameter has been calculated by the following formula:

$$S(\%) = \frac{1}{N} \sum_{j=1}^{N} \left( \frac{\%\ \text{change in output}}{\%\ \text{change in input}} \right)_{j} \times 100 \tag{6}$$

where N is the number of data points. The analysis has been carried out on the trained LSSVM model by varying each input parameter, one at a time, at a constant rate of 20%. In the present study, training, testing and sensitivity analysis of LSSVM have been carried out using MATLAB.
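The one-at-a-time procedure described above can be sketched in Python as follows. The sketch assumes that the sensitivity is the mean ratio of percentage change in output to percentage change in input, expressed in percent. The function `sensitivity` and the stand-in `toy_model` are hypothetical and are not the trained LSSVM model of this study.

```python
def sensitivity(model, inputs, index, rate=0.20):
    """Mean of (% change in output / % change in input), times 100.

    Each data point has the input at position `index` increased by `rate`
    (20% by default) while the other inputs are held fixed.
    """
    ratios = []
    for x in inputs:
        base = model(x)
        if base == 0 or x[index] == 0:
            continue  # percentage change is undefined for a zero baseline
        shifted = list(x)
        shifted[index] = x[index] * (1.0 + rate)
        pct_output = 100.0 * (model(shifted) - base) / base
        pct_input = 100.0 * rate
        ratios.append(pct_output / pct_input)
    if not ratios:
        return 0.0
    return 100.0 * sum(ratios) / len(ratios)


def toy_model(x):
    """Hypothetical stand-in for a trained model with inputs [T, WS, SH, RH]."""
    return 0.6 * x[0] + 0.1 * x[1] + 0.2 * x[2] - 0.3 * x[3] + 0.5


# Made-up scaled inputs, for illustration only.
samples = [[0.2, 0.5, 0.7, 0.4], [0.8, 0.3, 0.6, 0.2], [0.5, 0.9, 0.1, 0.6]]
for name, idx in zip(["T", "WS", "SH", "RH"], range(4)):
    print(name, sensitivity(toy_model, samples, idx))
```

A positive value indicates that the output rises when the input is increased, a negative value that it falls; the magnitudes can be compared to rank the inputs.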

Results and Discussion
Different combinations of γ and σ have been tried to get the best result. The design values of γ and σ are 100 and 2, respectively. The performance on the training dataset has been computed using these design values of γ and σ. The value of the Coefficient of Correlation (R) for training has been determined using the following equation:

$$R = \frac{\sum_{i=1}^{n} (E_{ai} - \bar{E}_{a})(E_{pi} - \bar{E}_{p})}{\sqrt{\sum_{i=1}^{n} (E_{ai} - \bar{E}_{a})^{2}} \sqrt{\sum_{i=1}^{n} (E_{pi} - \bar{E}_{p})^{2}}} \tag{7}$$

where $E_{ai}$ and $E_{pi}$ are the actual and predicted E values, respectively, and $\bar{E}_{a}$ and $\bar{E}_{p}$ are the means of the actual and predicted E values corresponding to n patterns. For a good model, the value of R should be close to one. Figure 1 illustrates the performance on the training dataset. For the training dataset, the value of R is 0.998. Therefore, the developed LSSVM has captured the input and output relationship very well. The performance of the developed LSSVM has then been examined for the testing dataset. Figure 2 depicts the performance on the testing dataset. It is observed from Figure 2 that the value of R is 0.996 for the testing dataset. So, the developed LSSVM model can be used as a practical tool for determination of EL. The developed LSSVM model also gives the following equation for prediction of EL (obtained by putting the radial basis kernel with σ = 2, N = 34 and b = 0.3413 in (4)):

$$EL = \sum_{k=1}^{34} \alpha_{k} \exp\left\{-\frac{(x - x_{k})(x - x_{k})^{T}}{8}\right\} + 0.3413 \tag{8}$$

The values of α are given in Figure 3.
Error bars have also been computed using the developed LSSVM model.
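The two calculations used in this section can be sketched in Python as follows: the coefficient of correlation R between actual and predicted values, and the evaluation of the developed equation with σ = 2 (so that 2σ² = 8) and b = 0.3413. The α values are given only in Figure 3 and are not reproduced here, so the multipliers and support points in the sketch are hypothetical placeholders. The inputs are assumed to be on the same 0 to 1 scale as the training data.

```python
import math


def correlation_coefficient(actual, predicted):
    """Coefficient of correlation R between two equal-length sequences."""
    n = len(actual)
    mean_a = sum(actual) / n
    mean_p = sum(predicted) / n
    cov = sum((a - mean_a) * (p - mean_p) for a, p in zip(actual, predicted))
    var_a = sum((a - mean_a) ** 2 for a in actual)
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    return cov / math.sqrt(var_a * var_p)


def predict_el(x, support_points, alphas, b=0.3413, sigma=2.0):
    """Evaluate sum_k alpha_k exp(-||x - x_k||^2 / (2 sigma^2)) + b."""
    total = b
    for alpha_k, x_k in zip(alphas, support_points):
        sq_dist = sum((u - v) ** 2 for u, v in zip(x, x_k))
        total += alpha_k * math.exp(-sq_dist / (2.0 * sigma ** 2))
    return total


# Hypothetical placeholders: three scaled training points [T, WS, SH, RH]
# and made-up multipliers. The real model uses 34 points and the alpha
# values reported in Figure 3.
support_points = [[0.2, 0.5, 0.7, 0.4], [0.8, 0.3, 0.6, 0.2], [0.5, 0.9, 0.1, 0.6]]
alphas = [0.4, -0.1, -0.3]
print(predict_el([0.6, 0.4, 0.5, 0.3], support_points, alphas))
print(correlation_coefficient([0.1, 0.4, 0.9], [0.12, 0.38, 0.93]))
```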

Compared with an ANN, whose design requires choosing the number of hidden layers, number of hidden nodes, learning rate, momentum term, number of training epochs, transfer functions, and weight initialization methods, the LSSVM model uses only two parameters, γ and σ.

Figure 5. 95% error bar for testing dataset.

Figure 6. Sensitivity analysis of input parameters.

Conclusions
This study has described an LSSVM model for prediction of EL in reservoirs. A dataset of 48 cases has been used to develop the LSSVM model. The performance of the LSSVM model is encouraging. Users can apply the developed equation for prediction of EL. The developed LSSVM also gives the prediction uncertainty. The sensitivity analysis indicates that T has the maximum effect on EL. This article shows that the developed LSSVM is a robust model for prediction of EL.
The four input variables used for the LSSVM model in this study are T, WS, SH, and RH, and the output of the LSSVM model is EL. So, in this study, the input vector is x = [T, WS, SH, RH] and the output is y = EL. Here $R^{N}$ is the N-dimensional vector space and r is the one-dimensional vector space.