Application of Least Square Support Vector Machine (LSSVM) for Determination of Evaporation Losses in Reservoirs
1. Introduction
Evaporation is one of the most significant water loss processes in reservoirs, and the quantities involved are often large. For example, in the late 1950s total evaporation from water surfaces in the United States was greater than the total amount of water withdrawn for domestic purposes by cities and towns [1]. Therefore, the determination of Evaporation Loss (EL) in reservoirs is an imperative task in earth science. Because of the complex interactions among the components of the land-plant-atmosphere system, the determination of EL in reservoirs is a complicated task [2]. Researchers have used different methods for prediction of EL in reservoirs [3-7]. Recently, Deswal and Pal [8] successfully employed an Artificial Neural Network (ANN) for determination of EL in reservoirs. However, ANN has some limitations, which are listed below:
• Unlike other statistical models, ANN does not provide information about the relative importance of the various parameters [9].
• The knowledge acquired during the training of the model is stored in an implicit manner, and hence it is very difficult to come up with a reasonable interpretation of the overall structure of the network [10].
• In addition, ANN has some inherent drawbacks such as slow convergence, poor generalization performance, convergence to local minima, and over-fitting.
This article adopts the Least Square Support Vector Machine (LSSVM) for prediction of EL in reservoirs. The database has been collected from the work of [8]. The database contains information about EL, mean air temperature (T) (°C), average wind speed (WS) (m/s), sunshine hours (SH) (h/day), and mean relative humidity (RH) (%). LSSVM is a method based on statistical learning theory that adopts a least squares loss function, so that training requires the solution of a linear system instead of a quadratic program [11]. LSSVM is closely related to regularization networks [12]. With the quadratic cost function, the optimization problem reduces to finding the solution of a set of linear equations. LSSVM has been successfully applied to different problems in engineering [13-15]. This study has the following aims:
• To examine the capability of the LSSVM model for prediction of EL
• To determine the error bar of the predicted EL
• To develop an equation for prediction of EL
• To make a comparative study between the developed LSSVM and the ANN model developed by [8]
• To carry out a sensitivity analysis to determine the effect of each input parameter on EL.
2. Details of LSSVM
LSSVM models are an alternate formulation of SVM regression [16] proposed by [17]. Consider a given training set of N data points $\{x_k, y_k\}_{k=1}^{N}$ with input data $x_k \in R^n$ and output $y_k \in r$, where $R^n$ is the n-dimensional vector space and r is the one-dimensional vector space. The four input variables used for the LSSVM model in this study are T, WS, SH, and RH, so n = 4. The output of the LSSVM model is EL. So, in this study, $x = [T, WS, SH, RH]$ and $y = EL$.
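In feature space LSSVM models take the form
$y(x) = w^T \varphi(x) + b$ (1)
where the nonlinear mapping $\varphi(\cdot)$ maps the input data into a higher dimensional feature space, w is an adjustable weight vector, and b is the scalar threshold. In LSSVM for function estimation the following optimization problem is formulated:
Minimize: $\frac{1}{2} w^T w + \frac{\gamma}{2} \sum_{k=1}^{N} e_k^2$
Subject to: $y_k = w^T \varphi(x_k) + b + e_k, \quad k = 1, \ldots, N$. (2)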
where $e_k$ is the error variable and γ is the regularization parameter. Solving the above optimization problem gives the following equation for EL prediction [18-19]:
$EL = y(x) = \sum_{k=1}^{N} \alpha_k K(x, x_k) + b$ (3)
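where $\alpha_k$ are the Lagrange multipliers and $K(x, x_k)$ is the kernel function. The radial basis function has been used as the kernel function in this analysis. It is given by
$K(x, x_k) = \exp\left\{-\frac{(x - x_k)(x - x_k)^T}{2\sigma^2}\right\}$ (4)
where σ is the width of the radial basis function.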
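As a minimal sketch of how Equations (3) and (4) can be evaluated, the Python code below trains an LSSVM by solving the standard dual linear system of LSSVM regression: the first row enforces that the α values sum to zero, and the remaining rows state (K + I/γ)α + b = y. The function names, the small Gaussian elimination routine, and the toy data are illustrative assumptions, not the MATLAB code used in this study.

```python
import math

def rbf_kernel(x, z, sigma):
    """Radial basis function kernel: exp(-||x - z||^2 / (2 sigma^2))."""
    sq_dist = sum((a - c) ** 2 for a, c in zip(x, z))
    return math.exp(-sq_dist / (2.0 * sigma ** 2))

def solve_linear(A, rhs):
    """Solve A u = rhs by Gaussian elimination with partial pivoting."""
    n = len(rhs)
    M = [list(row) + [r] for row, r in zip(A, rhs)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            factor = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= factor * M[col][c]
    u = [0.0] * n
    for r in range(n - 1, -1, -1):  # back substitution
        s = sum(M[r][c] * u[c] for c in range(r + 1, n))
        u[r] = (M[r][n] - s) / M[r][r]
    return u

def lssvm_fit(X, y, gamma, sigma):
    """Return (alpha, b) by solving [[0, 1^T], [1, K + I/gamma]] [b; alpha] = [0; y]."""
    n = len(X)
    A = [[0.0] + [1.0] * n]
    for k in range(n):
        row = [1.0]
        for j in range(n):
            value = rbf_kernel(X[k], X[j], sigma)
            if k == j:
                value += 1.0 / gamma  # regularization on the diagonal
            row.append(value)
        A.append(row)
    solution = solve_linear(A, [0.0] + list(y))
    return solution[1:], solution[0]

def lssvm_predict(x, X, alpha, b, sigma):
    """Prediction: sum_k alpha_k K(x, x_k) + b."""
    return sum(a * rbf_kernel(x, xk, sigma) for a, xk in zip(alpha, X)) + b

# Hypothetical, already-scaled toy data; each row is [T, WS, SH, RH].
X_toy = [[0.1, 0.5, 0.2, 0.9], [0.4, 0.3, 0.6, 0.6],
         [0.7, 0.8, 0.7, 0.3], [0.9, 0.2, 0.9, 0.1]]
y_toy = [0.15, 0.40, 0.70, 0.95]
alpha, b = lssvm_fit(X_toy, y_toy, gamma=100.0, sigma=2.0)
y_fit = [lssvm_predict(x, X_toy, alpha, b, sigma=2.0) for x in X_toy]
```

Because the fit is regularized, the training residual at each point equals α_k/γ rather than zero, so a larger γ pulls the fitted values closer to the training targets.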
The above LSSVM has been adopted for determination of EL. The data were collected from a reservoir in Anand Sagar, Shegaon (India). The evaporation loss data were collected for one year only, whereas the other data, covering a period of fifteen years (from 1990 to 2004), were obtained from a full climatic station at Manasgaon, about 9 km from Shegaon, under the water resources division, Amravati Hydrology Project (Maharashtra, India). The dataset contains 48 cases. The data have been divided into two subsets: a training dataset, used to construct the model, and a testing dataset, used to estimate the model performance. In this study, 34 cases are used as the training dataset and the remaining 14 cases are used as the testing dataset. The data are scaled between 0 and 1. This study uses the radial basis function (Equation (4), where σ is the width of the radial basis function) as the kernel function. The design values of γ and σ are determined during the training of the LSSVM.
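The scaling step can be sketched as a column-wise min-max transformation. The code below is only an illustration: the records are hypothetical values rather than the study's data, the function names are made up for this sketch, and it assumes that the minima and maxima are taken over all available records, which the text does not state.

```python
def minmax_fit(rows):
    """Return the column-wise minima and maxima of equal-length rows."""
    columns = list(zip(*rows))
    return [min(c) for c in columns], [max(c) for c in columns]

def minmax_apply(rows, lo, hi):
    """Map every column to the range [0, 1] using the given minima and maxima."""
    return [[(v - l) / (h - l) for v, l, h in zip(row, lo, hi)] for row in rows]

# Hypothetical raw records: [T (deg C), WS (m/s), SH (h/day), RH (%)].
raw = [[24.0, 1.2, 8.5, 40.0], [31.5, 2.8, 9.6, 22.0],
       [28.0, 3.5, 6.1, 65.0], [20.5, 0.9, 7.4, 80.0]]
lo, hi = minmax_fit(raw)
scaled = minmax_apply(raw, lo, hi)
```

Scaling all four inputs to a common range keeps any single variable, such as RH in percent, from dominating the distance inside the radial basis function.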
In this study, a sensitivity analysis has been done to extract the cause and effect relationship between the inputs and outputs of the LSSVM model. The basic idea is that each input of the model is offset slightly and the corresponding change in the output is reported. The procedure has been taken from the work of [20]. According to [20], the sensitivity (S) of each input parameter has been calculated by the following formula
$S(\%) = \frac{1}{N} \sum_{j=1}^{N} \left( \frac{\%\ \text{change in output}}{\%\ \text{change in input}} \right)_j \times 100$ (5)
where N is the number of data points. The analysis has been carried out on the trained LSSVM model by varying each input parameter, one at a time, at a constant rate of 20%. In the present study, the training, testing and sensitivity analysis of the LSSVM have been carried out by using MATLAB.
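One way to read this procedure is sketched below in Python: each input is increased by 20% in turn, and the ratio of the percentage change in output to the percentage change in input is averaged over the data points and multiplied by 100. The exact form used in [20] is assumed rather than quoted here, and `toy_model` is a hypothetical stand-in for the trained LSSVM.

```python
def sensitivity(model, X, index, rate=0.20):
    """Average of (% change in output / % change in input) x 100 for one input."""
    ratios = []
    for x in X:
        base = model(x)  # must be non-zero for a percentage change to exist
        shifted = list(x)
        shifted[index] = x[index] * (1.0 + rate)  # offset one input only
        pct_out = (model(shifted) - base) / base * 100.0
        pct_in = rate * 100.0
        ratios.append(pct_out / pct_in)
    return sum(ratios) / len(ratios) * 100.0

def toy_model(x):
    """Hypothetical model: output proportional to the first input only."""
    return 3.0 * x[0] + 0.0 * x[1]

X_demo = [[0.2, 0.5], [0.6, 0.1], [0.9, 0.8]]
s_first = sensitivity(toy_model, X_demo, index=0)
s_second = sensitivity(toy_model, X_demo, index=1)
```

In this toy case the output is proportional to the first input, so its sensitivity is 100%, while the second input has no effect and its sensitivity is 0%.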
3. Results and Discussion
Different combinations of γ and σ have been tried to get the best result. The design values of γ and σ are 100 and 2, respectively. The performance on the training dataset has been computed by using these design values. The Coefficient of Correlation (R) has been determined by using the following equation:
$R = \frac{\sum_{i=1}^{n} (E_{ai} - \bar{E}_a)(E_{pi} - \bar{E}_p)}{\sqrt{\sum_{i=1}^{n} (E_{ai} - \bar{E}_a)^2} \sqrt{\sum_{i=1}^{n} (E_{pi} - \bar{E}_p)^2}}$ (6)
where $E_{ai}$ and $E_{pi}$ are the actual and predicted EL values, respectively, and $\bar{E}_a$ and $\bar{E}_p$ are the means of the actual and predicted EL values over the n patterns. For a good model, the value of R should be close to one. Figure 1 illustrates the performance on the training dataset, for which the value of R is 0.998. Therefore, the developed LSSVM has captured the input-output relationship very well. The performance of the developed LSSVM has then been examined for the testing dataset. Figure 2 depicts the performance on the testing dataset, for which the value of R is 0.996. So, the developed LSSVM model can be used as a practical tool for determination of EL. The developed LSSVM model also gives the following equation
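(obtained by putting $K(x, x_k) = \exp\{-(x - x_k)(x - x_k)^T / (2\sigma^2)\}$, σ = 2 and b = 0.3413 in Equation (3)) for prediction of EL:
$EL = \sum_{k=1}^{34} \alpha_k \exp\left\{-\frac{(x - x_k)(x - x_k)^T}{8}\right\} + 0.3413$ (7)
The values of α are given in Figure 3.
Figure 1. Performance of training dataset.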
Error bars have also been computed by using the developed LSSVM model. Figures 4 and 5 depict the error bars for the training and testing datasets, respectively. The predicted error bars can be used to determine confidence intervals. The results of the sensitivity analysis are shown in Figure 6. It is observed from Figure 6 that T has the maximum effect on EL, followed by RH, SH, and WS.
A comparative study has been made between the developed LSSVM and the ANN model developed by Deswal and Pal (2008) [8]. The ANN model gives R = 0.960 and a Root Mean Square Error (RMSE) of 0.865. The developed LSSVM model predicts EL with R = 0.996 and RMSE = 0.539. Therefore, the performance of the developed LSSVM model is slightly better than that of the ANN model on both measures. The ANN model uses many parameters, such as the number of hidden layers, number of hidden nodes, learning rate, momentum term, number of training epochs, transfer functions, and weight initialization methods, whereas the LSSVM model uses only two parameters, γ and σ.
Figure 4. 95% error bar for training dataset.
Figure 5. 95% error bar for testing dataset.
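The two measures used in this comparison can be computed as in the following Python sketch. R follows the usual correlation coefficient definition, and RMSE uses the standard definition, which is not written out in the text. The function names and the sample values are hypothetical and are not the study's data.

```python
import math

def correlation_coefficient(actual, predicted):
    """Coefficient of correlation R between actual and predicted values."""
    n = len(actual)
    mean_a = sum(actual) / n
    mean_p = sum(predicted) / n
    cov = sum((a - mean_a) * (p - mean_p) for a, p in zip(actual, predicted))
    var_a = sum((a - mean_a) ** 2 for a in actual)
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    return cov / (math.sqrt(var_a) * math.sqrt(var_p))

def rmse(actual, predicted):
    """Root mean square error between actual and predicted values."""
    squared = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    return math.sqrt(squared / len(actual))

# Hypothetical actual and predicted EL values.
actual = [2.0, 4.0, 6.0, 8.0]
predicted = [2.5, 3.5, 6.5, 7.5]
r_value = correlation_coefficient(actual, predicted)
rmse_value = rmse(actual, predicted)
```

R measures how closely the predictions follow the trend of the observations, while RMSE measures the size of the errors in the units of EL, so the two are best reported together.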
4. Conclusions
This study has described an LSSVM model for prediction of EL in reservoirs. A dataset of 48 cases has been utilized to develop the LSSVM model. The performance of the LSSVM model is encouraging. Users can apply the developed equation for prediction of EL. The developed LSSVM also gives the prediction uncertainty. Sensitivity analysis indicates that T has the maximum effect on EL. This article shows that the developed LSSVM is a robust model for prediction of EL.
Figure 6. Sensitivity analysis of input parameters.