A Multi-Objective Coordinated Charging and Discharging Strategy for Electric Vehicles Based on Stackelberg Game

For the negative impact of large-scale electric vehicles (EVs) disorderly charging on the power grid, a multi-objective optimization strategy for coordinated charging and discharging of EVs based on Stackelberg game is proposed. As the leader, the grid company aims to stabilize load fluctuations and formulate a reasonable electricity price strategy to guide EVs to participate in vehicle-to-grid (V2G); As followers, EV users optimize their charging plans based on electricity price information with the objective of reducing costs and obtaining good comfort. This paper uses the MOPSO algorithm to solve the proposed multi-objective Stackelberg problem, and calculates the optimization results under various preferences, which proves the effectiveness of the pro-posed model and method.


Introduction
With the increasingly serious problems of global environmental pollution and energy shortage, electric vehicle with the characteristics of environmental protection and energy saving has been widely concerned and rapidly developed. The increase in the number of EVs and their uncoordinated charging will cause problems such as grid overload, increased network losses, and harmonic pollution [1] [2] [3], which is not conducive to the security and stability of the power grid. V2G uses the mobile energy storage characteristic of EVs to rationally regulate the charging and discharging of EVs, which can not only alleviate fluctua-tions in the load on the power grid, but also create revenue for EV users [4].
The Stackelberg problem with the hierarchical structure was first raised by Von Stackelberg in 1952 when he studied market economy issues [5]. In [6], a Stackelberg game approach was used to deal with the charging strategy in the case of one aggregator and multiple aggregators. Similar approaches were used in [7] to stimulate the game between community charging agents and EVs, by transforming the game into linear programming, the pricing and power purchasing strategies of smart community agents were obtained. In [8], a Stackelberg game model is established to study the V2G problem in the parking lot and the battery loss and transmission loss are considered to optimize the robust characteristics of the system. Additionally, a charging guidance method based on electricity price signals.is proposed in [9] to maximize EVs benefits and to eventually flatten aggregated load curves. The work in [10] considered the uncertainty of EV behavior during the game, and proposed a real-time asynchronous DR algorithm, which effectively reduced the load fluctuation of the power grid.
A three-layer Stackelberg model including power grid, energy storage, and users is constructed in [11], and the Jaya-based Boltzmann machine algorithm was used to maximize the overall benefits of the game players.
The Stackelberg model in the above literature only considers the single objective of each subject, and in reality, the objectives of the game players are often diversified. Especially for EVs, only considering economic objective may ignores the user's requirement for comfort. Based on the previous research, this paper builds a Stackelberg multi-objective optimization model with the grid as the leader and the EV users as the followers. The MOPSO algorithm is used to calculate the charging and discharging strategies under different preferences. Main contributions of this paper include following key points: • Using a Stackelberg game between grid and EVs and considering their multiple objectives to make the model closer to reality.
• Designing examples with different preferences, and finding Stackelberg equilibrium with MOPSO algorithm and fuzzy theory.
The remaining of this paper is organized as follows: Section 2 discusses the system model and formulates the Stackelberg problem. In Section 3, the solution techniques adopted in this work have been discussed. The results obtained from our simulations are provided and discussed in Section 4. Section 5 concludes the paper

System Model
As shown in Figure 1, the power grid is upper-level leader, and EVs are lower-level followers. EV users adjust their charge and discharge plans to respond to electricity price signals from the power grid in order to reduce the cost of electricity, but they will also consider their original charging plan. Different types of EVs have different behavior characteristics and power requirements [12]. The research object of this article is private EV. Energy and Power Engineering

User Model
According to what has been s explained in Section 1, EV users participate in V2G and formulate their own charging and discharging plans to reduce charging costs. On the other hand, they also want to obtain good experience of consuming. Therefore, the objectives of EV users should include the two aspects of economy and comfort. The user problem can be formulated as follows: where , The comfort of EV users is closely related to their charging plans. When there is no difference in electricity prices in different time periods and the user has no economic considerations, the user will choose the charging plan that best meets his own habits. Therefore, the user's load curve without the difference in electricity prices can be used as the maximum comfort curve, and the deviation between the actual load curve and the maximum comfort curve can be used as an index to measure the electric comfort of the EV user. Therefore, the optimization problem is formulated as follows: ( where, , ,0 i t x is the original planned load of EV i in the interval t, and it depends on the driving habits and travel characteristics of the user. It can be simulated by Energy and Power Engineering Monte Carlo method after fitting the relevant statistical data [15] [16] [17]. t α is the influence coefficient, which measures the influence of the deviation on the user's comfort. A small value can be set at night and a large value can be set during the day.
• The constraints of users where (3) provides the upper and lower boundaries for charge and discharge power of EV and (4) gives the upper and lower boundaries of the battery SOC (State of Charge). (5) shows the relationship between the current SOC and the SOC of the previous period. In (6) the sequence of charging/discharging should be in a way that by the time that the EV owner decides to leaves the house, the SOC should reached maximum.

Grid Model
The objectives of the power grid include reducing the standard deviation of the load, reducing the peak-to-valley difference, and increasing the revenue from electricity sales. Some literatures [18] [19] [20] have studied the economic benefits of V2G to the grid, i.e., the effect of subsidizing EVs to participate in V2G is conducive to the reduction of the overall cost of the grid. Therefore, we no longer consider the objective of maximizing the electricity sales revenue of the power grid, and only considers the index of load stability.
• Objective 1: Reduce standard deviation of the load where 0,t L is the power of grid without EV load in the interval t. t L is the total load in the interval t. L is the daily average load after scheduling.
• Objective 2: Reduce peak-to-valley differences where, max L is the daily maximum load, min L is the daily minimum load.

Stackelberg Game Model Solution
The original load curve of an EV depends on the owner's behavior, the battery parameters of the EV, etc. The relevant statistics of the 2009 National Household Travel Survey (NHTS) [21] are shown in Figure 2. The daily charging time of the EV is obtained through the calculation of the daily driving distance, the power consumption of 100 kilometers, and the charging power. In combination with the moment when EVs start charging, Monte Carlo simulation can be used to obtain the original charging load when the EV did not participate in V2G. This paper uses the multi-objective particle swarm optimization (MOPSO) algorithm [22] to solve the multi-objective Stackelberg model. The MOPSO algorithm searches for the optimal solution by continuously updating its position and speed to obtain a set of Pareto optimal solution. After obtaining the Pareto optimal set, the method of fuzzy theory [23] is used to select the optimal solution. The objective function value of a non-inferior solution in the solution set is fuzzified by membership function. max max min where n k f is the K-th objective function value of non-inferior solution n x .
where K is the total number of objectives, N is the total number of non-inferior solutions, and k θ is the weight of the k-th objective. In Figure 3, MOPSO algorithm is illustrated via a flow chart.

Simulation Result
This paper uses 1000 EVs in an area as an example for calculation. Relevant research shows that [24], the average daily driving time of more than 90% of private cars is only about one hour and is generally not in the peak and valley hours of the power grid. The cars are idle in 95% of the time. On the other hand, in the future, centralized charging piles and V2G equipment can be built in the parking lot for centralized charging, which can ensure a higher connection rate for EVs. Therefore, it can be considered that EVs can participate in V2G optimized scheduling within 24 hours. The relevant parameter settings are shown in Table 1.
The typical daily load curve of an area and the disordered charging load of an EV obtained by Monte Carlo simulation are shown in Figure 4. It can be seen that the uncoordinated charging of EVs will increase the peak-to-valley difference of the total load, and the load fluctuation will be more serious, which will adversely affect the security and stability of the grid.

Strategy A: Considering Only Economy
Without considering the comfort of power consumption, i.e., the lower layer 1 1 θ = , 2 0 θ = , and set the upper layer 1 0.5 θ = , 2 0.5 θ = . According to the above algorithm, the optimization results are shown in Figure 5. From the simulation results, it can be known that under the strategy considering only Economy, the load of EVs is sensitive to changes in electricity prices, Energy and Power Engineering   and they can effectively adjust the charge and discharge load. The effect of peak-cutting and valley-filling is obvious. After optimization, the standard deviation of the total load of the power grid has been reduced by 60.06% compared to that of uncoordinated charging, the peak-to-valley difference has been reduced by 55.61% and the total charging cost has been reduced by 14,889.1 yuan.
However, in this case, the comfort level of electricity consumption is low, only −0.26. The electricity price shows obvious peak-to-valley characteristics, making full use of the owner's pursuit of economic benefits, and effectively guiding the users to orderly charge and discharge.

Strategy B: Considering Only Comfort
According to the analysis in section 2.1, if the user only considers the comfort and does not consider the economy, the user's comfort is the best when the user does not change his own charging plan. In this case, the load of the EV is consistent with the original uncoordinated charging. The comfort level is the highest, but the economy is the worst.
The optimization results are shown in Figure 6. After optimization, the standard deviation of the load is reduced by 49.44%, the peak-to-valley difference is reduced by 46.42%, and the total charging cost is reduced by 11,285.1 yuan. At this time, the comfort is −0.06, which is better than considering only economy. Compared with the strategy only considering economy, the strategy that comprehensively considers economy and comfort has a smaller electricity price difference. Because users have both economic and comfort objectives, more adjustments to charging and discharging may reduce overall satisfaction, resulting in low user response to electricity prices. Therefore, the higher electricity price difference has no obvious effect.
The data of the three strategies are shown in Table 2.

Conclusions
This paper builds a multi-objective Stackelberg game model to study the optimization strategy of EVs and power grids in terms of energy exchange and electricity pricing. The following conclusions are obtained: 1) Electric vehicles can effectively reduce their costs by participating in V2G, and the grid can guide users to orderly charge and discharge by setting appropriate electricity price to achieve the purpose of peak cutting and valley filling.  2) Under the strategy considering only Economy, the user's economic benefits are the best but the power comfort is the worst.
3) Under the strategy considering only Comfort, users charge disorderly, the comfort level is the best but the economy is the worst, and the load fluctuation of the power grid increases.
4) The strategy considering Economy and Comfort can reduce the load fluctuation of the grid and reduce the power cost of EV users. At the same time, it ensures a certain level of comfort, and the data in various aspects are relatively balanced.