Effect of Strategy-Homogeneity on the Prisoner’s Dilemma Game in a Square Lattice

We investigate the effect of strategy-homogeneity on the prisoner’s dilemma game in a square lattice. Strategy-homogeneity means that the population contains at least one connected group in which individuals maintain the same strategy at each iteration and may update according to updating rule at next iteration. The simulation results show that the introduction of strate-gy-homogeneity increases the cooperation in the evolutionary stable state. For any value of temptation to defect, the density of cooperators in equilibrium state increases firstly and then decreases as the level of strategy-homogeneity increases constantly, and there exists an appropriate level of strategy-homogeneity, maximizing the density of cooperators. The results may be favorable for comprehending cooperative behaviors in societies composed of connected groups with coherent strategy.

However, cooperation means sacrificing personal interest for the collective benefits when a defector obtains more from his cooperative opponent than a cooperator does. Therefore, it is vital to investigate why individuals choose to cooperate with others. The classical game theory is firstly proposed by Nash [11], forming a research paradigm framework based on classical game. In this theory, all of players are assumed to be of perfect rationality [11], indicating that they always make the best decisions. In practice, people are bounded rational in realistic world [12] [13] as a result of being influenced by others. In view of this, evolutionary game theory [14] [15] is put forward, aiming at studying about the evolutionary process of cooperation within a population on social dilemma [16] [17]. In evolutionary game theory, payoff is represented by fitness, and strategies are changing over iteration constantly until equilibrium state is reached. Subsequently, Axelrod and Hamilton [18] investigate the evolutional course of cooperation by considering repeated prisoner dilemma. Nowak and May combine spatial structure with evolutional classical game [19] [20] creatively, generating the networked evolutionary game. Above all, network evolutionary game provides a powerful framework for the research of the cooperation.
It is a matter of fact that societies, in almost every age and region, always contain at least one connected group in which individuals maintain the same strategy at each iteration and may update according to updating rule at next iteration. This is the strategy-homogeneity we will investigate in this paper. This idea comes from the following related pieces of literature, namely, Ref. [21] [22], studying strategy-assortativity previously, but the strategy-assortativity referred by them is used to describe people's general trend to interact with those who act like them, leading to the formation of relations. More specifically, in Ref. [21], an individual selects his partner from the assortative pool with a fixed probability that represents strategy-assortativity or from the random pool with the corresponding probability. In Ref. [22], the strategy-assortativity reflected by the fixed probability decides what kind of investors' partner in the game.
The strategy-homogeneity is equivalent to the initial strategy distribution, but the literature focused on the impact of initial strategy distribution on the evolution of cooperation for social dilemmas has so far given little attention to the influence of the level of strategy-homogeneity in the network. A part of the existing literature on initial strategy distribution is shown in the following. Szabó and Fáth unveil the fact that the result of evolutionary games is closely related to initial conditions [23]. For instance, in Ref. [24], the authors investigate not only the effect of different initial distribution of defectors on cooperators under the premise of same initial frequency of defectors but also the influence of dynamic initial frequency of defectors on cooperators under the same initial distribution of defectors. They found that the situation where defectors are located on lowest-degree vertices initially can display more robust cooperation than other situations. Then, the initial configuration that all S × S individuals are defectors except for an s × s (s = 1, 3, 5, …, 15 and 30) cluster of cooperators in the center of the lattice is investigated, and their simulations confirm that the probability of invasion is essentially independent of the initial number of cooperators provided that they form at least a 3 × 3 cluster [25]. By assuming a small fraction of zealous cooperators, Masuda shows that a large fraction of cooperation emerges in evolutionary dynamics of social dilemma games [26]. Other related studies on initial configurations can be seen in literature [27] [28] [29] [30].
In general, it is worthy of special consideration to investigate the effect of strategy-homogeneity on individual's behaviors. Therefore, this paper will focus on the effect of strategy-homogeneity on the prisoner's dilemma game in a square lattice.
The structure of the paper is as follows. Section 2 elaborates the strategy-homogeneity and corresponding strategy update rule. Section 3 shows the simulation results. Section 4 concludes.

Strategy-Homogeneity and Corresponding Strategy Update Rule
Strategy-homogeneity means that the population contains at least one connected group in which individuals maintain the same strategy at each iteration and may update according to updating rule at next iteration. For example, various connected groups are shown in bold black in Figure 1. We use strategy-homogeneity p, namely, the level of strategy-homogeneity, as an index to form different connected groups here. In addition, the strategy-homogenous edge is defined as this kind of edge through which individuals keep same strategy at each iteration and may update according to updating rule at next iteration. Before the idea of strategy-homogeneity is realized, we need know how to select strategy-homogenous edge according to strategy-homogeneity p and divide all agents into various groups. Firstly, given the strategy-homogeneity p, we will compare a random number generated for each edge with p, and the corresponding edge is then selected if the random number is less than p. Strategy-homogeneity p is about the proportion of strategy-homogenous edges. Secondly, nodes involved by connected strategy-homogenous edges constitute a group. Moreover, a single node that is not involved by any strategy-homogenous edge constitutes a group itself. So, the population is divided into various connected subgraphs in which the number of individuals may be 2, 3 and so on. Taking the strategy-homogenous edge XZ depicted in Figure 1 for example, the nodes X and Z constitute one connected group, while the single node such as Y constitutes another group. Individuals acquire their accumulated payoffs from games with their nearest neighbors. For a group, suppose the node X obtains maximum payoff P(X) within this group, and the node Y randomly selected from all neighbors of this group gains payoff P(Y). All individuals in this group maintain consistent strategy at each iteration, thus X and all individuals in this group will adopt the strategy of Y through Fermi updating rule with probability as follows:

Simulation Results in Square Lattice
The model we use in this paper is a two-player non-cooperative game called the Prisoner's Dilemma (PDG), firstly described in [31]. The PDG is considered a vital model for studying the emergence of cooperation among selfish individuals [14] [32] [33]. In the traditional PDG, the reward for mutual cooperation is R, the punishment for mutual defection is P, and the mixed strategies give the cooperator S and the defector T respectively. The dilemma holds the inequation T R P S > > > and constraint 2 T S R + < . Without loss of generality, we con-  In general, the density of cooperators increases first and then decreases in equilibrium state as p increases constantly. In detail, in equilibrium state, the density of cooperators keeps increasing when p increases from 0 to 0.35, and the density of cooperators then keeps decreasing when p increases from 0.35 to 0.45. Namely, the density of cooperators gets its maximum at p = 0.35. It is consistent with the literature of the classic prisoner's dilemma game in [34] [35] for the result of p = 0, and the introduction of p increases the cooperation in the evolutionary stable state. Here, for p ≥ 0.2, the proportion of cooperators in equilibrium state is higher than the initial state. It means that there is an increase in the number of cooperators, leading to a stable state in which cooperators dominate.
Considering the effect of p on cooperation under more b values, we explore cooperative level in dependence on the temptation to defect b for various p values, as shown in Figure 2(b). For a fixed p, the density of cooperators is a decreasing function of b. It is worth noting that with the increase of b, especially when b = 1.8, all cooperators for all p values are annihilated. It is certain that cooperative level for p = 0 is the lowest than other p values. Both the trend of cooperative level for various b in equilibrium state and b value making cooperation vanish are same with the result of [35] in the circumstance of p = 0. We can see clearly that there is an optimal p value for any b value. Particularly, for b = 1.1, the density of cooperators in equilibrium state increases first and then decreases as p increases, which is in accordance with Figure 2(a).
Next, we will explain results in Figure 2   We understand the effect of strategy-homogeneity p on the persistent of cooperative behavior through snapshots of strategic distribution in equilibrium state, as shown in Figure 4. Figure 4(d) shows that a small fraction of cooperators can prevail through forming clusters. When strategy-homogeneity p increases, both the size of clusters and the number of cooperators in Figure 4(h) and Figure 4(l) are bigger than Figure 4(d) at the end of evolution. Figure 5 shows the equilibrium average payoff of cooperators and defectors respectively for different p. We have known that defectors dominant in equilibrium state for p = 0.1, but cooperators earn more average payoff than defectors under this p value. We infer that the payoff of cooperators may be related to the formation of cooperative clusters. The payoff of defectors increases with the increase of their cooperative neighbors when p increases from 0.1 to 0.3. As p continues to increase to 0.35, defectors are hard to invade bigger cooperative clusters, and less cooperators appear in the neighbors of defectors, hence there is a descending trend in defector's payoff. In circumstance of p = 0 and p = 0.05, the payoff of all individuals is 0 because there is no cooperator in the population. Furthermore, as Figure 6 displays, we will use strategic perturbation rate during the process of iteration to measure whether strategies of all individuals remain unchanged or only small perturbations. In Figure 6, strategic perturbation rate is not zero from beginning to end for three p values. Although strategic perturbation rate decreases with the increase of p, the range of fluctuation of strategic perturbation rate is indeed increasing. The larger p is, the greater the    Considering the effect of p on cooperation under more κ values, we explore cooperative level in dependence on selection pressure κ for various p values, as shown in Figure 7. On the one hand, for any κ value, with the increase of p, the change trend of cooperative level in equilibrium state is consistent. On the other hand, compared with p = 0 and p = 0.05, cooperative level in equilibrium state under other p values are improved at different degree. For example, the cooperative level of p = 0.15 is higher than that of p = 0 and p = 0.05, the cooperative level of p = 0.2 is also higher than that of p = 0 and p = 0.05, but the degree of increase is different. In general, the change of κ will not affect the previous conclusion. Journal of Applied Mathematics and Physics

Conclusion
The strategy-homogeneity in this paper means that the population contains at least one connected group in which individuals maintain the same strategy at each iteration and may update according to updating rule at next iteration.
The strategy-homogeneity p is used to control the proportion of strategy-homogenous edges initially, thus affecting the evolution of strategies. We find that the introduction of strategy-homogeneity increases the cooperation in the evolutionary stable state. For any value of temptation to defect, the density of cooperators in equilibrium state increases firstly and then decreases as the level of strategy-homogeneity increases constantly, and there exists an appropriate level of strategy-homogeneity, maximizing the density of cooperators. The results may be favorable for comprehending cooperative behaviors in societies composed of connected groups with coherent strategy.