Predicting Electric Energy Consumption for a Jerky Enterprise

Wholesale and retail markets for electricity and power require consumers to forecast electricity consumption at different time intervals. The study aims to increase economic efficiency of the enterprise through the introduction of algorithm for forecasting electric energy consumption unchanged in technological process. Qualitative forecast allows you to essentially reduce costs of electrical energy, because power cannot be stockpiled. Therefore, when buying excess electrical power, costs can increase either by selling it on the balancing energy market or by maintaining reserve capacity. If the purchased power is insuffi-cient, the costs increase is due to the purchase of additional capacity. This paper illustrates three methods of forecasting electric energy consumption: autoregressive integrated moving average method, artificial neural networks and classification and regression trees. Actual data from consuming of electrical energy was used to make day, week and month ahead prediction. The prediction effect of prediction model was proved in Statistica simulation environment. Analysis of estimation of the economic efficiency of prediction methods demonstrated that the use of the artificial neural networks method for short-term forecast allowed reducing the cost of electricity more efficiently. However, for mid-range predictions, the classification and regression tree was the most efficient method for a Jerky Enterprise. The results indicate that calculation error reduction allows decreases expenses for the purchase of electric energy.


Introduction
The amount of electricity consumed is increasing every year around. It is im-portant for energy producers to generate the amount of electricity that will be consumed by enterprises, because power cannot be stockpiled. Consequently, an accurate forecast of electricity demand is necessary for a stable power supply.
The electricity market makes heavy demands on consumers and forces companies to plan energy consumption in order to reduce energy costs. Many works were devoted to resolving predicting energy consumption [1] [2] [3]. Implementation of the forecasting system is considered by the example of a Russian enterprise.
The reliability of predicting energy consumption for large consumers (over 670 kW) plays a significant part, because of the emerging of the Russian wholesale market for electricity and power (WMEP) [4] [5]. For an enterprise to entry into the WMEP, the market players shall submit a qualitative request for the maximum hourly energy capacity to the trading system (pool) administrator and the System Operator of the Unified Power System (SOUPS) no later than 24 hours before starting energy supply [6]. The total cost ( Figure 1) of electric energy includes not only the actual consumed energy capacity, but also the payment for a deviation from the energy nomination (the transition of the enterprise to balancing energy market) [4]. An extra charge shall be paid for the purchase (supply) of missing (surplus) power [4]. Therefore, the best correct prediction shall be made to avoid the transition of the enterprise to balancing energy market (BEM) and to reduce extra energy expenses. Reducing the prediction error could make possible the enterprise to substantially reduce the operational expenses without purchasing large-ticket equipment.
There available six price ranges (rates). Enterprises have the possibility of choosing a price range. It is not advisable for large enterprises to use the 1 st and the 2 nd price ranges (PRs) (integrative price ranges) since they may not prove cost-efficient. If the total capacity of an entity exceeds 670 kW, it is advisable to use the 4 th and the 6 th PRs [7]. Since the total connected capacity of the power receivers of the enterprise under consideration exceeds 670 kW and hourly planning is intended, the 6 th PR is used.
The capacity of the energy consumed by a specific enterprise is governed by features of its operation, such as enterprise load, temperature conditions, illumination level, etc. Therefore, the prediction process, which can help to minimize the expenses, shall be determined on a case-by-case basis. Prediction makes it possible both to perform a cost-effectiveness analysis and to correct equipment operating modes.

The Practical Implementation of Predicting Electric Energy Consumption
Information on actual amount consumption of electric energy was received by electric energy metering devices installed on the territory of the enterprise.   consumption for October 2016 in which weekly cycle (load variations on the weekend) can be clearly seen.
We can determine seasonal component using a periodogram (Figure 4). There are two peaks, which equal to 24 and 168 hours (day and week) and one smaller peak, equal to 84 hours, that means that the process of electric energy consumption is cyclical. It should be taken into account when making a short-term electricity   consumption forecast. Typical days with similar daily load curve and operating schedule were selected to make day ahead prediction. The data for week ahead and month ahead predictions have been selected in the same manner.
We used the Statistica software to test the hypothesis on normality of distribution of the background data. Using the Kolmogorov-Smirnov test (p < 0.1) and the Shapiro-Wilk test (p = 0.000), the conclusion has been drawn that the time series is nonstationary, which complicates the prediction making process [8].
Currently, there are many methods for short-term forecasting of electricity consumption, which are used by industrial enterprises [9]- [14]. Since the time series is nonstationary and there are seasonal components, autoregressive integrated moving average method (ARIMA) can be applied. ARIMA is one of the most often-used forecasting methods which allows for a stationary series by differentiating the time series. ARIMA is used quite often to analyze initial data in which distinct and stable periods of operation can be seen. That is why this method requires typical time frames to be selected [9].
Recently, methods using artificial neural networks (ANNs) have been applied not only to make short-term forecasting of electricity consumption but also in the construction of medium-and long-term forecasts. Such networks consist of many input/output neurons that interact with each other through nerve endings (synapses). The application of ANNs allows one to process data with a larger degree of stochasticity. These networks can be learned, i.e. it is possible to reveal regularities and integrate background information [15]. The advantages of the neural network are fast learning algorithms and the ability to operate with noisy A multivariate prediction method of making a forecast has also been chosen, because in univariate models are built only on the basis of a one-dimensional series and does not take into account other indicators, such as the amount of output. The use of classification and regression trees (CART) enables to make predictions based on data that affect the dependent variable, which allows one to explore the model in more detail and to reveal the factors that have the maximum effect on electric energy consumption [16].
The efficiency of ARIMA method has been assessed. Because of the high degree of stochasticity of the background data, the following typical sequence of data analysis has been implemented: making an assessment of the behavior of the autocorrelation function (ACF) and the partial autocorrelation function of the load curve and selecting a model for prediction making, as well as adding a seasonal component (definite cyclicality). Taking into account these transformations, a day ahead prediction has been made. Then, the hypothesis on normality of distribution of the data obtained has been tested. It has been shown that the distribution is normal. To perform an analysis of the prediction, an analysis of the normal probability graph, the normal distribution of the random variable and the remainder series has been made, as well as the mean absolute percent error of the prediction (MAPE) has been calculated.
The mean absolute percentage error can be determined using the following formula [17] [18]: If MAPE is <10%, the precision accuracy is high, if MAPE is <20%, the precision accuracy is well and if MAPE is <50%, the precision accuracy is satisfactory.
The mean absolute percent error for the prediction under consideration is equal to 13.66%.
Week ahead and month ahead predictions have been also made. The mean absolute percent error is equal to 23.35% for the week ahead prediction of the enterprise load and to 37.19% for the month ahead prediction. The trend of increasing the mean absolute percent error of the prediction with increasing prediction interval has been revealed. Therefore, this type of prediction is not suitable for forward planning because of the excessive error value.
The efficiency of ANN method has been assessed. The assessment has been made using the regression analysis of time series. Upon completing the procedure of learning of the multilayer perceptron (MP), the network with the highest performance has been chosen. To assess the prediction quality, the remainder distribution histogram, the remainder normal probability graph and the normal probability graph have been analyzed. The performance of the model is equal to A comparison of the mean absolute percent errors of the prediction for the two methods has been made ( Figure 5). For the univariate prediction, ANN has been found to be the most suitable method since it is characterized by the lower error of 1.88%, which is smaller than the error of 13.66% for ARIMA method. Therefore, ANN is the most cost-efficient method.
It is known that the sum of predictions is more efficient than a univariate prediction. Therefore, to reduce the calculation error the multiple prediction method has been used. A few prediction models have been built in each interval and then the arithmetic mean has been found [19]. A comparison of the two multiple prediction methods has been made ( Figure 6) and the mean absolute percent error of the prediction has been determined.
The multiple univariate prediction for the ARIMA model yields a higher accuracy than the single prediction (the mean absolute percent error of the prediction is equal to 4.35%). The mean absolute percent error of the multiple prediction according to the ANN model is equal to 5.89%, which is higher than the er- is free of this drawback [16]. "The consumed power, kW" was used as the dependent variable and the energy-consuming factors, such as "Casting of SCh-15, t", "Pig iron, t", "Casting, t" and "Number of heats", were selected as continuous factors. Using the Statistica software, the classification and regression tree has been constructed ( Figure 7) for which the quality of the prediction was assessed using the remainder distribution histogram and the normal probability graph. The mean absolute percent error over a month is equal to 1.49% (Figure 8).

Estimation of the Economic Efficiency of Prediction Methods
A comparison of the cost efficiency of the four following day ahead predictions has been made: the single univariate prediction and the multiple univariate prediction made using ARIMA and ANN. The calculation has been performed on the basis of the data on non-regulated electricity price cap and the retail markup calculated by the last resort supplier for an enterprise using the 6 th PR and having the maximum capacity of power receivers of 10 MW min. over the period of August and November, 2016 [20].
The following is the example of calculation for the univariate prediction over the period of 00:00-01:00, October 1, 2016, which has been made using ARIMA [21]:  The purchased electric energy for each hour over October 1, 2016 was calculated in the same manner and then the results were summed up (Table 1).

Conclusions
Based on the analysis of the results, the following conclusions can be drawn: • For short-range predictions, the model constructed on the basis of the single univariate prediction using the ANN method is the most efficient because the mean absolute percent error and the purchased electric energy are lower than those obtained using other methods. This result is ensured owing to the possibility of learning the networks when stochastic background data are used; • Multiple univariate predictions are efficient for jerky enterprises with continually changing order package. In this case, shorter time intervals, which are closest to the prediction period, are used. The averaging of several predictions makes it possible to smooth "jumps" (data that do not fit a definite regularity for the background data); • For mid-range predictions, the classification and regression tree is the most efficient method because the background data include not only preceding measurement results, but also the enterprise load, which sufficiently affects the electric energy consumed by the enterprise; • If the calculation error decreases, expenses for the purchase (supply) of missing (surplus) electric energy also decrease, so do the retail markups. When the error decreases by 11.78%, the electric energy expenses reduce by 8.48%, which amounts to approx. 160,000 USD per year. That is, the decrease in the error by approx. 1% allows the enterprise to save approx. 13,500 USD; • Predictions shall be made with account for the electric energy price and an extra charge for a deviation from the planned value (during some hours, the extra charge is zero, which can be helpful in a slight reduction in losses).