When a bit is added to a file, its entropy increases by one bit (ln 2 nats) regardless of the length of the file or the value of the bit. However, when a node is added to a network, the entropy increase is a function of the ratio between the links and the nodes in the network. Therefore, the thermodynamic incentive for a node to join a highly linked network is higher than the incentive to join a poorly linked network.
Information Theory, Networks, Entropy
The propensity of networks to grow is a characteristic of life. It is observed in biological networks, e.g. blood vessels, neurons and fungi; in sociological networks, e.g. communities, companies and guilds; and in man-made networks such as electrical, transportation and communication networks. It seems that the more crowded the network, the stronger its tendency to grow. Here we argue that adding a node to a highly linked network generates more entropy than adding a node to a poorly linked network. Since the second law of thermodynamics states that entropy tends to increase to its maximum, we conclude that the second law gives a node a higher incentive to join highly linked networks.
Networks and files are different from gases or solutions. However, from a statistical-mechanics point of view, any physical system comprises particles, states and microstates. The particles may be atoms, molecules, links, energetic bits, etc.; correspondingly, the states are the possible spatial locations, nodes, bits, etc., and the microstates are the possible distinct configurations of the particles among the states. It should be noted that the numbers of particles, states and microstates are sometimes functions of energy and/or other physical quantities. The logarithm of the number of microstates is the entropy, which mysteriously tends to grow to its maximum.
Communication and networks are the cornerstones of life. Whittaker [
A file of N bits can carry
In a way, a network is similar to a file. If we define a link as a unidirectional connection between two nodes, then a network can be described as a sequence of N nodes, each having an integer number of links. If the total number of links in the net is P then the number of links per node can vary from 1 to
From a statistical-mechanics view, a file is a sequence of N states in which the particles are the “1” bits. A network is a sequence of N states in which the particles are the “1” links. There is a difference, however: in a file a state cannot hold more than one particle, whereas in a network the number of particles in a state can be any non-negative integer. In physics, particles that cannot share a state (the Pauli exclusion principle) are called fermions (e.g. electrons, protons, neutrons, quarks and leptons), and particles that have no such restriction are called bosons (e.g. photons, phonons and other particles with integer spin 0, 1, 2…). Fermions and bosons have well-known, distinct statistical properties. In fact, the name fermion was given to particles that obey Fermi-Dirac statistics, and the name boson to particles that obey Bose-Einstein statistics. The statistical differences between bosons and fermions originate in their different numbers of microstates.
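The difference in microstate counts can be checked numerically. The following is a minimal sketch that assumes the standard textbook counting formulas for indistinguishable particles: C(N, P) when a state holds at most one particle (fermions, the file) and C(N + P - 1, P) when occupation is unrestricted (bosons, the network). The function names are illustrative and do not come from the text.

```python
from itertools import product
from math import comb


def count_microstates(n_states, n_particles, max_per_state=None):
    """Brute-force count of the ways to distribute indistinguishable
    particles over distinguishable states.

    max_per_state=1 imposes the exclusion rule (fermions);
    max_per_state=None leaves occupation unrestricted (bosons).
    """
    cap = n_particles if max_per_state is None else max_per_state
    return sum(
        1
        for occupation in product(range(cap + 1), repeat=n_states)
        if sum(occupation) == n_particles
    )


def fermion_microstates(n_states, n_particles):
    # Assumed closed form: choose which states are occupied.
    return comb(n_states, n_particles)


def boson_microstates(n_states, n_particles):
    # Assumed closed form: "stars and bars" count for unrestricted occupation.
    return comb(n_states + n_particles - 1, n_particles)


if __name__ == "__main__":
    N, P = 4, 3  # small illustrative values, not taken from the text
    assert count_microstates(N, P, max_per_state=1) == fermion_microstates(N, P)
    assert count_microstates(N, P) == boson_microstates(N, P)
    print(fermion_microstates(N, P), boson_microstates(N, P))  # prints: 4 20
```

With the same N and P, the unrestricted (boson) case admits more configurations than the exclusive (fermion) case, which is the origin of the statistical difference described above.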
In many networks in nature the number of links is much higher than the number of nodes. For example, in a social network, where a person is a node and a connection to an acquaintance is a link, even a loner usually has a few links. The distribution of links among the nodes of a network has received considerable attention [
First we calculate the entropy
Next we calculate the entropy of a network having P bosons in N states. Then we show that, when one state is added, the entropy increase is a function of
We use the Boltzmann definition of entropy for fermions [
If we designate probability
It should be noted that the maximum entropy solution for Equation (1) for
Therefore, it is clear that Equation (1) is true. One may ask how p changes when a bit is added. The answer is that the added bit carries uncertainty. Since Alice does not know its value, she assumes that
For bosons, the number of microstates is given by [
We designate occupation number
Therefore,
In general, networks can be described by two-dimensional matrices in which any matrix element
or,
In general, the entropy increase will be between
We see that, unlike a bit in a binary file, which carries a constant amount of uncertainty that is independent of the file in which it is transmitted, a node gains an extra entropic benefit by joining a net with a high occupation number.
High occupation number boson gas statistics can be applied to many phenomena in life. In the Internet, the sites are the states and the surfers are the particles. In the publishing market, the titles are the states and the readers are the particles. In text, the words are the states and their occurrences are the particles, etc. In these examples one can find long-tail distributions (e.g. the Planck-Benford and Zipf distributions) [
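The contrast can be illustrated numerically. The sketch below rests on three assumptions that are not spelled out in the text: each bit of the file is equally likely to be 0 or 1, so an N-bit file has 2^N microstates; the network has the standard Bose-Einstein count C(N + P - 1, P); and the total number of links P is held fixed while one node is added. Under these assumptions the gain from a new node is exactly ln(1 + P/N). All function names are illustrative.

```python
from math import comb, log


def file_entropy(n_bits):
    """Entropy in nats of an n-bit file, assuming p = 1/2 for every bit."""
    return n_bits * log(2)


def network_entropy(n_nodes, n_links):
    """Entropy in nats of n_links bosons in n_nodes states
    (assumed Bose-Einstein microstate count)."""
    return log(comb(n_nodes + n_links - 1, n_links))


def entropy_gain_add_bit(n_bits):
    """Entropy increase when one bit is appended to the file."""
    return file_entropy(n_bits + 1) - file_entropy(n_bits)


def entropy_gain_add_node(n_nodes, n_links):
    """Entropy increase when one node is added and n_links is held fixed
    (an assumption of this sketch)."""
    return network_entropy(n_nodes + 1, n_links) - network_entropy(n_nodes, n_links)


if __name__ == "__main__":
    n_nodes = 100  # illustrative size, not taken from the text
    for n_links in (10, 100, 1000, 10000):
        occupation = n_links / n_nodes
        gain = entropy_gain_add_node(n_nodes, n_links)
        # The gain depends only on the links-to-nodes ratio.
        assert abs(gain - log(1 + occupation)) < 1e-9
        print(f"links/node = {occupation:7.1f}   node gain = {gain:.4f} nats")
    print(f"bit gain (any file length) = {entropy_gain_add_bit(n_nodes):.4f} nats")
```

In this sketch the gain from an added bit is ln 2 for any file length, while the gain from an added node grows with the occupation number P/N and equals ln 2 only when P = N.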
I thank R. D. Levine for his criticism and H. Kafri for her help.