Entropy, biological evolution and the psychological arrow of time

We argue that in Universes where future and past differ only by the entropy content a psychological arrow of time pointing in the direction of entropy increase can arise from natural selection in biological evolution. We show that this effect can be demonstrated in very simple toy computer simulations of evolution in an entropy increasing or decreasing environment.


Introduction
While the laws of classical physics are perfectly time-symmetric. i.e. causality works equally well forward and backward in time both daily life and the microcosm governed by quantum laws are time-asymmetric. This asymmetry manifests itself in various arrows of time: • the thermodynamical arrow of time, as provided by the second law of thermodynamics: the entropy in a closed system always increases or remains constant and never decreases. For example, intact cups can fall off the table and splatter in thousand pieces, releasing their potential energy as heat radiation, while cup pieces on the floor never cool down their environment and use this energy to unbreak and jump onto the table. Of course for a system to undergo entropy increase it has to be out of equilibrium in the first place, i.e. starting from a low entropy state.
• the cosmological arrow of time: the Universe Hubble expands as time proceeds.
• the quantum mechanical arrow of time: pure states 'collapse' or decohere (depending on one's preferred interpretation of quantum mechanics) into mixed states but mixed states never evolve into pure states.
• the CP asymmetry: weak interaction processes in particle physics proceed with different rates if the direction of time is reversed.
• the biological arrow of time: living creatures die but corpses do not revive.
• the psychological arrow of time: it is possible remember the past but not to "pre-member" the future.
It is interesting to note, though, that not all of these arrows of time are independent. Most obvious, the biological arrow of time is just another example of entropy increase. After all, entropy in its statistical interpretation is nothing but a measure of how many microstates correspond to a given macrostate, and thereby a measure of how probable this macrostate is: there are simply many more possible ways for a cup to be broken than to be unbroken, and many more ways for being dead than for being alive (almost all organs intact, working and in their right places).
Beyond that, in modern physics it actually becomes more and more common to assume that all arrows of time are just different manifestations of entropy increase: • the cosmological expansion may be intimately related to the thermodynamical arrow of time by actually defining the direction in which entropy would increase by providing a low entropy state at the beginning of time. In such scenarios time would start to run backwards if the Universe would recollapse. Alternatively, the cosmological arrow may be no arrow at all, as a Universe shrinking with time could be possible as well, given different initial conditions. • if the quantum mechanical arrow of time is due to decoherence [1] as in the Everett-Many-Worlds interpretation [2] it is simply a consequence of entropy increase once a quantum system is entangled with its environment in a measurement process. This results from the fact that the microstates of the environment are not accessible to the observer.
• while the CP symmetry is violated in weak interaction processes CPT remains conserved and may be interpreted as the correct operation of time inversion [3].
The notion that time is an illusion, that it is nothing but entropy increase after all, is further supported by the fact that the Wheeler-de-Wit equation [4] H|ψ = 0 (1) for the quantum mechanical wave function of the Universe [5] in canonical quantum gravity models does not include any time coordinate. In quantum cosmology, time may thus rather come into being as an emergent phenomenon when the transition to a semi-classical description of Nature is made, i.e. quantum fields evolving in a background of classical gravity, since by separating out gravity as a classical background of spacetime a macrostate is defined corresponding to many possible microstates in the full quantum gravity description (see e.g. the discussions in [6,7,8,9]). In the following we thus adopt the hypothesis that entropy increase is the only manifestation of time. It should be stressed that this assumption makes time a psychological phenomenon. After all the entropy of a given macrostate depends on its definition in terms of microstates, a property that can not be derived from the fundamental laws of physics. If this hypothesis is correct, however, the most obvious manifestation of time becomes the most puzzling one: the psychological arrow of time -the fact that we remember the past but do not "pre-member" the future.
More concrete, if the entropy budget is the only difference between past and future one would expect that building up memory would correspond to entropy increase. Quite the opposite is true, though. By organizing our brains in a way to store memory the unordered high entropy state of the brain evolves into an organized low entropy state [13,14,15] 1 . Of course this is possible as the brain is an open system, and the process of building up memory and reducing entropy locally inside the brain can proceed at the cost of increasing entropy in the environment. But this consideration seems to suggest that if entropy increase is the only manifestation of time there would be no reason why we shouldn't pre-member the future, thereby naturally growing entropy in our brain instead of forcing a local entropy decrease by using up free energy.
In the following we will follow this line of reasoning and show that in the case the fundamental laws of physics would allow to pre-member the future, natural selection in the course of biological evolution could eliminate individuals developing memory of the future.
The basic argument here is simple. Memory has evolved in the course of biological evolution for only one reason: namely, in order to increase the fitness of an individual in a situation in the present. In its most basic form, memory is helpful by providing a set of initial conditions which allow to calculate (or at least estimate) the behavior of enemies, prey or prospective sex partners in the present. The fundamental laws of physics which govern this behavior are classical: they work as well forward as backward. However, in order to calculate a process in the present from initial conditions in the future one typically has to deal with a higher entropy state than by calculating the process from initial data in the past: For an illustrative example, just imagine you try to avoid being hit by a rock. You see the position of the rock on top of a hill and you can easily estimate how it will fall to the ground, where it disintegrates (see Fig. 1, left panel). The problem reduces to solving a very simple differential equation. If you want to achieve the same goal by relying on initial data in the future, you have to know the positions of all constituents of the disintegrated rock, and calculate how and where exactly they integrate to form the rock flying up onto the hill (Fig. 1, right panel). There are far more initial conditions to memorize, and the problem now transformed into a complicated multi-dimensional set of coupled differential equations.
As entropy is a measure of the information required to describe the microstate realized in a given macrostate, an individual would have to store and process more information. And, depending on the situation it wants to manage, the amount of information may not be a factor of few, but will typically grow exponentially the farther your memory lasts into the future. Storing and processing information requires storage, and -metaphorically oversimplified -the more storage one needs the larger one's brain has to be (see e.g. the comparison of various animal brain sizes in [16]). At some point (and this point will come pretty soon) such a large brain will make the individual's ability to move and react (the only reason to develop memory in the first place) more and more difficult and finally impossible. Such species will never prevail or even develop in the course of evolution that favors agility and swift decision-making.
If we consider the psychological arrow of time as a possible consequence of biological evolution though, we run into the difficulty that we typically understand evolution as a process in time while we now use it to establish an (at least seemingly) very fundamental property of time itself. The question is thus how we understand evolution and survival in a time-symmetric Universe.
As we point out here, though, it is not too difficult at all to imagine such a scenario. For sake of simplicity we model evolution by a computer simulation. A very simple realization would study two contrasting scenarios with one in which the lifeforms remember the past while in the other they pre-member the future.

A simple toy model
In order to address the question whether remembering populations have a superior fitness compared to pre-membering populations we have developed a very simple simulation as a toy model for artificial life and evolution of a The individuals of a given population are modeled as agents moving in a two-dimensional grid. A hostile environment is simulated by adding falling rocks with a given mass which move predictably according to the laws of classical physics. At the boundary of the grid the rocks disintegrate into two rocks with half of the mass of the original rock. The exact rules for collisions with the boundaries are depicted in Fig. 2. Once the mass of a rock reaches the value of 0.5 it constitutes no threat for individuals anymore and is removed from the simulation. New rocks are generated randomly with a given probability in order to ensure a stable environment with a more or less constant rock density oscillating around an average value. The rocks are responsible for both selection and for the irreversibility of processes in the environment which allows to define the direction of time as the direction of increasing entropy. The evolution of the rock environment is being recorded first and can subsequently be run both in forward and backward direction. Finally populations of individuals can be placed in either the forward or the backward running environment and their probabilities for extinction or survival can be compared.
The individuals possess a given number of lives which gets reduced by the mass value of a rock when they get hit. When the number of lives is reduced to zero the individual "dies" and gets removed from the simulation. The individuals reproduce asexually by reduplication with a defined probability p r with each time step. Furthermore the individuals scan the adjacent and nextto-adjacent cells (a two-cell Moore-neighborhood) of their respective position and avoid a threatening rock by moving out of its way once it is detected. The limited memory of an individual is modeled by allowing individuals to find and avoid only one single rock -whenever two or more rocks threaten the individual it is at danger of being hit. If no rock is observed the individuals move or rest randomly. For a screen shot of the simulation see Fig. 3.

Results
In order to compare the probabilities for extinction or survival of pastremembering and future-premembering populations a population of individuals is situated either in the forward or backward running environment of scattering rocks and evolved according to the simple rules discussed above. As a consequence, populations can remain stable over a given time span, die out or explode. For high p r or a low number of stones, populations are more likely to explode, for low p r or high numbers of stones, they are more likely to become extinct. In between there is a critical region. Due to statistical influences induced by the random creation of rocks and random replication of individuals as well as their random initial positions the simulation may result in all possible outcomes even for a single set of initial conditions within the critical region. To study the exact differences between the two settings we investigate (past remembering and future pre-membering lifeforms), we thus performed a statistical analysis, defining the value p 1/2 as the reproduction probability p r for which more than half of the simulations result in a stable population and survive a given running time of 5000 iterations (for an example of a stable population over a certain time span see Fig. 4). 2 The result is shown in Fig. 5 where the percentage of stable populations is plotted as a function of p r . Here red and green crosses correspond to past-remembering and future-premembering populations, respectively and the respective values for p 1/2 can be read off as a result of linear extrapolations to the 50% survival rate.
For populations in an entropy increasing environment the reproduction rate necessary for a stable population is (3.7 ± 0.002) · 10 −2 while for populations in an entropy-decreasing environment the necessary rate is (3.9 ± 0.009) · 10 −2 . 3 Figure 5: Percentage of stable populations as a function of the reproduction rate p r for past remembering (red crosses, upper fit) and futurepremembering populations (green crosses, lower fit). The mean reproduction rate with a 50 % survival chance p 1/2 is obtained by linear extrapolation.

Discussion
At this point a few remarks are in order: First, we chose to adopt in our simulation what may be called a psychological perspective, as it corresponds to the subjective perception of the pre-membering population (starting with birth and ending with death). That means, if we define the direction of time as the direction of increasing entropy, our pre-membering populations are not really living forward in time with a memory of the future, but rather live entirely backwards in time. We chose the psychological perspective here in order to have a simple, model that allows to compare re-membering and pre-membering populations in a transparent way. It could be argued though that the former, biological per- The parameter setting we used includes an initial number of agents of 30, an initial number of rocks of 10 with an initial mass of 3 each and a probability of 0.4 that a new rock is injected in each iteration on a rectangular grid of 43 × 23 cells. The simulation runs had a length of 5000 iterations; the simulations for each candidate p r were repeated 100 times in order to obtain a good estimate for the corresponding survival rate.
spective is more realistic as life typically is understood as a process evolving in the direction of entropy increase. As the mechanisms responsible for the time asymmetry are the same in both cases we do not expect a qualitative change in our results, if the biological perspective is chosen instead.
Second, in our simulation the cost of information storage in the memory is not modeled. We rather compared the capabilities of populations to survive in entropy increasing and entropy decreasing environments assuming they have the same skills in information processing. This can be motivated by the fact that the brain is an open system, and that recording memory in our brains works against the external entropy increase anyway, but it leaves open the question whether backward living lifeforms can store information at no or even negative cost. As a psychological arrow of time requires to sustain the physical basis for memory storage (a "big brain") over a large time period in parallel to the biological arrow of time corresponding to entropy increase though we consider any possible amount of free energy generation by storage of pre-membrance as negligible.
Third, and related to the point above, it has been pointed out by Sean Carroll [3], that if memory can be generated automatically at no cost in free energy, "false memory" in the sense of information stored inside the brain which has no correspondence to real information about the environment is more probable and thus such memory will be less reliable. This effect will strengthen the evolutionary argument for back-in-time remembrance but is more difficult to simulate and probably less important than the simple argument advocated here.

Conclusions
In this paper we were studying a very simple toy model modeling biological evolution of past-remembering and future-premembering populations in order to compare the probabilities for extinction or survival of these populations. While this work obviously does not prove that the real psychological arrow of time is a product of biological evolution and relies on many assumptions, is extremely simple and custom built, it does demonstrate, though, that -if the fundamental laws of physics would allow to pre-member the future -natural selection could be sufficient to eliminate this skill.
A critical argument often raised is that the experiment compares only backward memory with future memory, but not with hybrid memory con-cepts, typically phrased as "a limited knowledge of the future could be very helpful". This argument misses the fact that any kind of memory is costly. This is the reason why we don't remember the entire past but only an apparently most significant subset of it in the first place. Taking the extreme case, already by remembering the entire past a deterministic Universe would allow Laplace's demon to calculate the entire future with absolute precision at a lower or equal cost than by pre-membering the future. This argument suggests that any useful amount of memory of the future would be too costly to prevail in a natural selection process.
Finally, a well-motivated question is how to test these ideas. More realistic computer simulations beyond the simple toy model described here would definitely help. More convincing of course would be to probe the idea in an actual biological experiment. In principle one could study for example the evolution of a population of bacteria in a local environment where entropy is always decreasing. If the argument advocated in this work is correct it should be possible for such a population to develop memory of the future, but it is totally unclear whether such a development could occur fast enough to be observable and if the population could survive long enough under such conditions to evolve sufficiently. 4 To discuss how realistic such a setup is thus beyond the scope of this work.
In summary we thus consider this work as a proof of principle that it is possible to discuss a possible origin of the psychological arrow of time from natural selection in a meaningful scientific way.