1. Introduction
The connection between Number Theory and Dynamical Systems Theory is receiving recently a considerable attention. In this paper, we review some aspects of this connection focusing on the interplay between continued fractions and one dimensional dynamics. In Section 2, we review some known facts about fast and slow convergents, highlighting their relations both with irrational rotation dynamics and the ergodic theory of the Gauss map. In Section 3, after recalling the construction and the basic properties of the Farey tree, we describe different ways of coding the paths on it, as well as their dynamical counterparts obtained by combining fractional linear transformations. Deeper insights into these connections are provided by the Minkowski question mark function, whose properties are discussed in Section 4. Finally, in Section 5, we present some applications of the thermodynamical formalism based on the previous constructions.
2. Fast and Slow Convergents
We start by reviewing some well known facts about continued fractions1.
Let
(2.1)
be the continued fraction expansion of the number
. By applying Euclid’s algorithm one sees that the above expansion terminates if and only if x is a rational number. For x irrational one can construct recursively a sequence
of rational approximants of x as
(2.2)
We can write this recursion in matrix form as follows: letting
(2.3)
and noting that
(2.4)
we have
(2.5)
and
(2.6)
A short manipulation of (2.2) gives
. Since
one obtains inductively the Lagrange formula
(2.7)
Another useful formula which can be easily obtained from (2.2) is the following: for all
and
,
(2.8)
Letting
we get in particular
(2.9)
Note that

and so forth. We thus have the so called mirror formula (some consequences of which have been investigated in [4] ):
(2.10)
The numbers
are called continued fraction convergents (CFC) of x and it turns out that the n-th CFC
is the best rational approximation to
whose denominator does not exceed
[2] . One sees that
(2.11)
Putting
in (2.8) we get
(2.12)
But what happens if
in (2.8) takes on an intermediate value
?
Definition 2.1 For
the sets
for
are the n’th Farey convergents (FC) for the real number
.
Example. Let
. The first five CFC are





On the other hand, within the same accuracy, there are
FC’s. They are








We now need some notions.
Definition 2.2 The Farey sum over two rationals
and
is the mediant operation given by
(2.13)
It is easy to see that
falls in the interval
2. We say that
and
are Farey neighbours if
. Two Farey neighbours define a Farey interval and each Farey interval can be labeled uniquely according to the mediant (child)
of the neighbours.
Observe that given a pair of consecutive FC’s, say

for some
and
, we have
(2.14)
Moreover
(2.15)
by Lagrange’s formula. Therefore, for every
, each FC
for
is a Farey neighbour of
, the corresponding Farey interval getting smaller and smaller as
increases. More precisely, using again Lagrange’s formula, one easily obtains
(2.16)
We therefore see that the FC
is the best one-sided rational approximation to
whose denominator does not exceed
(although, if
, there might be a CFC with denominator less than
and closer to
on the other side of x). Increasing r, once we arrive at
we hit a new CFC on the current side of
, closer than the previous CFC. Finally, using matrix notation, the FC’s can be expressed in terms of intermediate products in (2.5) for
as
(2.17)
The algorithm which produces the sequence of
‘s of a given real number is called slow continued fraction algorithm (see, e.g., [6] [7] ).
Remark 2.3 The set
of Farey fractions of order
is the set of irreducible fractions in
with denominator
, listed in order of magnitude (see [8] ). Thus,
,

and so on. In particular
with Euler totient function
. Then we see that each
for
is consecutive to
in 
for
.
2.1. Connection to Rotations of the Circle
One can interpret the above construction in terms of a kind of renormalization procedure for rotations of the circle
through an angle
. With no loss we take the initial point to be the origin 0 and set
.
Since
we have
and thus

with
(2.18)
Moreover we have

and therefore
or, which is the same,

with
(2.19)
Iterating this procedure, we construct a family of nested intervals (see Figure 1)
,
, such that
(2.20)
and
(2.21)
where we have set
. Using (2.18), (2.19) and (2.21) one gets inductively the formula
Figure 1. The construction of nested intervals.
(2.22)
Note that
(2.23)
Now, if we denote by
the euclidean metric on
then
(2.24)
Therefore
(2.25)
That is, the sequence of arc-lengths
is but the sequence of successive closest distances to the initial point. This can be seen in the following way: starting from 0 and iterating
times one ends up at the point
which lies on the left of 0 and is the point closest to 0 up to now, being distant
from it. Iterating
more times one ends up at the point
which lies on the left of 0 at distance
, ... iterating
times one ends up at the point
which still lies on the left of 0, at distance
. One more iterate yields the point
which now lies on the right of 0 at distance
and is the point closest to 0 up to now, and so on and so forth (for more details see [9] ). The above implies that the first return map in the interval
(which is
or
according whether
is even or odd) is the rotation through the angle
. Finally, one has the equivalence:
(2.26)
In addition, for each
, it holds
(2.27)
The three distance theorem. The points
with
partition the unit circle into
intervals. A classical result (see e.g. [10] ), which can be easily obtained by induction using the above construction, is that the possible lengths of these intervals are organized according to the Farey convergents in the following way:
• If
then there are two distinct lengths:
and
(which become
and
when
).
• If
for some
and
then there are at most three lengths:
,
and
, the last of which disappears when
.
We point out that in the second case above there are two intervals, chosen from among those having the smallest lengths:

which have 0 as their common endpoint. We then see that the approximations (26) and (27) are the same as shrinking one of these intervals to zero. Moreover, the fractions
and
are the two successive elements of
having
between them (see also Remark 2.3).
2.2. Growth of Denominators
The Gauss map
is defined as
(2.28)
It is well known that
has an a.c. invariant ergodic probability measure
given by
(2.29)
A short reflection shows that
or else
(2.30)
From this we obtain at once
(2.31)
where the numbers
have been introduced in (2.22). Therefore

and, by the ergodic theorem, we have for
-almost all
and then almost everywhere,
(2.32)
Since
and thus
another consequence of (2.30) is that

and therefore using (2.31)
(2.33)
Putting together (2.32) and (2.33) we get the classical theorem of Lévy

On the other hand we may expect the growth of FC’s denominator to be subexponential. Indeed, let
with
be the m-th FC. Its denominator satisfies
. It is a result of Khinchin and Lévy (see [1] ) that

Combining the above we get the following
Lemma 2.4

Of course there are special behaviours: take
, then
and both are equal to the n-th Fibonacci number. Hence
converge to
.
3. A Walk on the Farey Tree
Having fixed
, let
be the ascending sequence of irreducible fractions between 0 and 1 constructed inductively in the following way: set first
, then
is obtained from
by inserting among each pair of neighbours
and
in
their child
as in (2.13). Thus

and so on. The elements of
are called again Farey fractions. Evidently
.
Remark 3.1 It has been shown in ([11] , Thm 2.6) that the set
becomes equidistributed as
. More specifically, the probability measure
converges to the Lebesgue measure on
.
Definition 3.2 For
we say that a Farey fraction
has rank
if
.
We also define the
. For
there are exactly
Farey fractions of rank 
and their sum is equal to
. Recall that every rational number
has a unique finite continued fraction expansion
with
[2] . The validity of the following relation will arise straightforwardly in the sequel:
Lemma 3.3

Remark 3.4 Note that, according to the above Lemma, the cardinality
of
can be interpreted as the number of choices of integers
, with
and so that
for
, 
and
. Indeed, for each fixed
the number of such choices is
, then sum over
.
It is also easy to realize that all Farey fractions which fall in the interval
have rank greater than or equal to
, whereas their continued fraction expansion starts with
.
An interesting object is the Farey tree
whose vertex-set is
and which is constructed as follows (see Figure 2):
• every column in
contains one entry (vertex or node);
• for
the
-th row is
;
• the node
, representing the interval
, is connected by edges to its left child
and right child
in the underlying row.
Figure 2. The first four levels of the Farey tree.
Note that the fractions
and
play the role of ancestors when using the Farey sum to obtain one row from the previous one. Besides the Farey sum, an alternative way to construct recursively the entries of
is as follows.
Definition 3.5 Given
its descendants are the symmetrical entries of
given by
and
respectively.
Lemma 3.6 The collection of all descendants of the entries of a given row in
is precisely the underlying row.
Proof. If
then
and
. Therefore
and the claim follows. 
Remark 3.7 If
and
then
and
.
3.1. The
Coding
Every rational number in
appears exactly once in the above construction and corresponds to a unique finite path on
starting at the root node
and whose number of vertices equals the rank of the rational number. We can code this path in the following way: first, any
can be uniquely decomposed as3
(3.1)
and the unimodular relations
(3.2)
plainly hold. The neighbours
and
are thus the ‘parents’ of
in
and we may accordingly identify
(3.3)
with
(3.4)
Note that the left column bears on the right parent and viceversa. Thus
(3.5)
On the other hand, any
as above has a unique pair of (left and right) children, given by
(3.6)
respectively. In order to generate them we set
(3.7)
Note that for 
(3.8)
and also
(3.9)
Moreover, we have
(3.10)
and
(3.11)
In other words, the matrices L and R, when acting from the right, move to the left and right child in
, respectively. Moreover, it is plain that given
we have
and
. We have thus proved the following Proposition 3.8 To each entry
there corresponds a unique element
which, in turn, can be uniquely presented as
(3.12)
where the number of terms in the product
is equal to
and Mi = L or Mi = R according whether the i-th turn, along the descending path in
which starts from the root node
and reaches xgoes left or right.
Remark 3.9 By the way, the matrices L and R induce the so called Farey tesselation of the upper half plane
(see [12] ).
Example.
is the right child of
, which is the right child of
, which is the left child of
, which is the left child of
. Thus

For
, which is the left child of
, we find

Note that
.
To any given irrational number
we may associate a unique infinite path on
, and thus a unique semi-infinite word in
. Bearing in mind the continued fraction expansion (2.1) of x, let

the first FC of x. In order to reach it from the top of
we need the block
. Whence we code x through the map
defined by
(3.13)
where
or
according whether the i-th turn along the infinite path in
which starts from
and approaches x along the sequence of successive FC’s goes left or right. This coding is faithful to the binary structure of
but apparently not so much to the continued fraction expansion of x. To make the latter more transparent we may note that, according to the characterization of the FC’s given above (see (2.15) and (2.16)), the symbols L and R in (3.13) come in blocks whose lengths are given by nothing but the partial quotients
of
. More precisely, a short reflection shows that the following rule is in force: the first block is such that
if
. Moreover, for
let

then we have

In other words, we have the coding
(3.14)
Furthermore we set
and
. More generally, we note that each rational x has two infinite paths which agree down to node
: they are those starting with the finite sequence coding the path to reach x from the root node and terminating with either
or
. We shall agree that
terminates with
or
according whether the number of its (finite) partial quotients is even or odd. On the other hand, for notational simplicity’ sake we shall assume this agreement only implicitly. We summarize the above in the following Theorem 3.10 To
with continued fraction expansion
there corresponds a unique sequence
given by
which represents an infinite path on
whose sequence of vertices starting from the
-th is precisely the sequence
of FC’s of x. Moreover, if
denotes the lexicographic order on
then

An simple consequence of the above construction is the following result.
Proposition 3.11 Let
with
and
even. Then its left and right children in
are given by
and
, respectively. If instead
is odd the expansions for
and
have to be interchanged.
Proof. Since
is even we can write
(3.15)
Therefore

which yield the claim. A similar reasoning applies for
odd. 
3.2. The {A, B} Coding
Using (3.9) we can write
(3.16)
On the other hand we have
and (see (2.4))
(3.17)
This defines a recoding
so that
(3.18)
The FC
of
, which has rank
, will then be expressed as
(3.19)
or else
(3.20)
Note that both expansions have exactly
terms and the latter agrees with (2.17) once we interpret the l.h.s. of
(2.17) as the FC
of x, that is taking the Farey sum of the columns in the same spirit as (3.3).
Example. The example with
discussed above, which yields

can be used to check step by step what we are claiming here. For example its FC
, which has rank 6, can be expressed as

3.3. The Farey Shift and Its Relatives
So far, a sequence in
starting with the symbol R has no image in
with
. Let us make the identification
(3.21)
and denote by
the half-space of
so obtained. We can write
(3.22)
We see that the map
is a bijection between
and
.
Let
be the Farey shift map defined by
(3.23)
Note that, besides
the only fixed point of
is given by the sequence
which is the image with
of
, the golden mean. This map acts on points in
by reducing their rank of one unit.
For example, since
, with the identifications made above we have





Let us define the Farey map
given by
(3.24)
Its name can be related to the easily verified observation that the set of pre-images
coincides with
for all
. Note also that the
-th row of the Farey tree is precisely
. In particularthis implies that
.
Proposition 3.12 Let
be the coding described above. Then

Proof. If
then
and
. If instead
then
and
. Therefore,
(3.25)
with
. The claim now follows from (3.23) and (3.21). 
3.3.1. The Gauss and Fibonacci Maps
The map F has (at least) two induced versions: the first one is the Gauss map
already introduced in (2.28), which for
can be written as
(3.26)
Recall that
(3.27)
Noting that
(3.28)
we see that G is obtained by iterating F once plus the number of times necessary to reach the interval
. The second one is the Fibonacci map H and is defined by iterating F once plus the number of times necessary to reach the interval
. Let
and
for
be the Fibonacci numbers. Then, for
,
(3.29)
with
(3.30)
In this case it is easy to check that if
then
(3.31)
A sketch of the map F along its induced versions G and H is given in Figure 3.
Given
we may define the Möbius transformation

By the above, given
the point
is but
and for
we have
(recall that
). But what happens if
so that
?
To see this we put
(3.32)
We have

Therefore, noting that
, for
we have
. To summarize we can represent the action of F as


Figure 3. The Farey map and its induced Fibonacci (upper) and Gauss (lower) maps.
that of G as

and that of H as

3.3.2. The Modified Farey Map
Finally we introduce the modified Farey map
given by
(3.33)
This map preserves orientation and has two indifferent fixed points, at 0 and 1. The advantage of using
instead of
is that one can retrace the path from a leaf
back to the root
. More precisely, for
let (cf. Proposition 3.8)
be the element which uniquely represents x in
. Then one easily sees that the following rule is in force: if
then
,
then
, for
with
so that
.
4. The Minkowski Question Mark
Given a number
with continued fraction expansion
, one may ask what is the number obtained by interpreting the sequence
(see (3.14)) as the binary expansion of a real number in
. The number so obtained is denoted
and writes
(4.1)
or, which is the same,
(4.2)
For instance
, for all
(see Figure 4). Setting
and
one has the following properties for the function
(see [13] -[16] ):
•
is strictly increasing from 0 to 1 and Hölder continuous of exponent
;
• x is rational iff
is of the form
, with k and s integers;
• x is a quadratic irrational iff
is a (non-dyadic) rational;
•
is a singular function: its derivative vanishes Lebesgue-almost everywhere.
The following additional properties easily follow from the definition.
Lemma 4.1
satisfies the functional equations


Proof. Assuming that
we write
with
and
. Setting moreover
we have
and
. The assertion now follows by direct application of (4.2). 
Let us now see how
acts on Farey fractions. We have already seen that

More generally, for any pair
and
of consecutive Farey fractions the function ? equates their child to the arithmetic average:
(4.3)
One sees that the function ? maps the Farey tree
to the dyadic tree
defined as follows: having fixed
, let
be the ascending sequence of fractions of the form
,
. We have

and so on. Then
is the same graph as
with the
-th row replaced by
. An immediate consequence of the fact that
is that
is the asymptotic distribution function of the sequence of Farey fractions:
Theorem 4.2 Since

then

Remark 4.3 This result can be also deduced as a consequence of a more general result obtained in [17] using a suitable enumeration of the rationals in
. As for the convergence of the atomic measure concentrated on
to
see [11] and [18] .
As a further immediate consequence we get that the Fourier-Stieltjes coefficients of
are as in the following
Corollary 4.4 Let

then

Finally, a short reflection using the definition (4.1) shows that ? conjugates the Farey map F and the modified Farey map
to the tent map
(4.4)
and the doubling map
, respectively. Indeed, for any
with
we have
(4.5)
and
(4.6)
where
and
. A similar reasoning applies for D. Putting together the above, (3.25) and (4.1) we then get the following commutative diagrams
Theorem 4.5

This implies that the measure
is invariant under both maps F and
, and its entropy is equal to
. This makes
the measure of maximal entropy for F and
. Being zero at every rational point
is of course singular w.r.t. Lebesgue. More specifically,
is concentrated on a subset
having Hausdorff dimension
(see [14] ). In view of (3.25), the above has the following straightforward consequence Lemma 4.6 If x is drawn from
according to the singular measure
, then the partial quotients
of
form a sequence of i.i.r.v.’s with
.
It is moreover easy to realize that F and
have also absolutely continuous (not normalizable) invariant measures, with densities
and
, respectively.
Finally, the conjugacy of Theorem 4.5 has been used in [19] to construct a correspondence between the parameter spaces of
-continued fraction transformations and unimodal maps.
5. Transfer Operators and Partition Functions
To a given matrix
and complex parameter
one can associate the positive operator
acting on the right as [20]
(5.1)
For example we have
(5.2)
The operator
associated in this way to the map
turns out to be the transfer operator acting as
(5.3)
Of special significance is the (Perron-Frobenius) operator
which satisfies
(5.4)
and has norm at most one in the Banach space
. A function
is the density of an absolutely continuous invariant measure for F if and only if
. In this case we find
, which however does not lie in
(see [21] ).
Let f be an eigenfunction of
analytic in the half-plane
. It satisfies
(5.5)
and also
(5.6)
Therefore the eigenvalue equation is equivalent to the three-term equation
(5.7)
which is a generalisation of the Lewis functional equation (with
) studied in number theory (see [20] [22] ). The study of this generalized equation has been initiated in [23] .
Remark 5.1 In the context of the thermodynamic formalism, once a one-sided shift
and a potential function
are given one defines a transfer operator
on
by

which plays a key role in the study of equilibrium states for
and their properties [24] [25] . In particular, one defines

and it turns out that if
decays exponentially then there is a unique mixing equilibrium state.
Relying on the above discussion it is now easy to see that
with

In order to compute
we have to consider points sharing the same path up to the k-th row of
. Take for instance
and
. Then a short reflection yields, for
,

We therefore see that although
(so that
is uniformly continuous)
it is not even of summable variation. This entails that
has indeed two equilibrium states, thus exhibiting a phase transition (see [26] ).
Next, we express the n-th iterate of
as
(5.8)
where
. We have
so that, in particular, putting
we get
(5.9)
and
(5.10)
Lemma 5.2 Let
be the sequence of functions defined by
and

For each fixed
we have that
determines a bijection between
and the set of denominators of the elements of
(considered as an ordered set).
Proof. The proof is just a straightforward verification. Suppose for instance that
with
, so that
. Then by (5.9) and (5.10)
is given by a product with
factors of the type
where r = a if
, r = b otherwise. The result now readily follows by lemma 3.6. 
Remark 5.3 The rank of the elements of
with denominator
is given by
with the convention
. The smallest of the above denominators is 1, it has rank 0 and is obtained as
. The two largest ones are equal to the
-st Fibonacci number
. They are symmetrical w.r.t
, have rank
and are obtained as
and
, respectively. More generally, it is not difficult to see that the following equivalence is in force: suppose that the element
has rank
so that
for some
and
, then the same denominator
, but corresponding to the symmetrical fraction
, is obtained as
with
.
A direct consequence of the above lemma is the following
Theorem 5.4

Remarkably, the above sum is equal to the partition function
at (inverse) temperature
of the number-theoretical spin chain introduced by Andreas Knauf in [27] . For
we have (see [28] )
(5.11)
Note that for
the above limit diverges. This reflects the fact that the invariant density for the Farey map F, that is the fixed point of the operator
, is the function
.
Let us define the pressure function
as
(5.12)
Since the sum in Thm. 5.4 has
terms we see that
(this is the topological entropy of the map
). More generally let
denote the sequence of denominators of the elements of
when the latters are arranged in increasing order in
, so that
(5.13)
The ratio
can be interpreted as the moment of order
of the size of the denominators in
.
is plainly non-increasing and for
satisfies
. Moreover we have

with
for all
. Noting that
we get for 

Since
this yields
(5.14)
Thus, for all
,
(5.15)
In addition, since
is non-increasing and
(because the spectral radius of
is 1, see above) we have
for
. Note that the same conclusion follows at once from the fact that
is finite for
(see (11)).
Remark 5.5 It holds
where
is the free energy of the Knauf model. In the context of thermodynamic formalism the pressure
is a central object. In particular it is used as a generator of averages: its first derivative
, wherever it exists, yields the mean of the function
w.r.t. the equilibrium measure
, which can be defined as the weak *-limit point of atomic measures supported on periodic points of F weighted with the function
[24] . Note that
for
and
as
. On the other hand we have already seen that
and
is called measure of maximal entropy. Higher derivatives of
are connected to (sums of) higher correlation functions, see [25] [29] .
Let us now study the asymptotic behaviour of
for
. To this end, we notice that if, instead of
, we evaluate the iterate
at
, all sequences
in (5.8) yield paths which end up at the same row of the Farey tree. The same argument leading to Theorem 5.4 now yields the following
Corollary 5.6
(5.16)
By Thm. 5.4 and (5.16) we obtain
(5.17)
so that we can directly apply the results obtained by Thaler in [30] to get4
Lemma 5.7

Lastly, noting that

one may then use
, along with Lemma 3.6, to proceed inductively with
in (5.8), and obtain the following general expression for
with
.
Theorem 5.8 For all
and
we have

We refer to [31] for further generalisations and applications (see also [32] ).
The Partition Function for Negative Integer Temperatures
Finally, we compute the value of the partition function
for some some specific value of the temperature. Related results are discussed in [33] (see also [34] ).
Lemma 5.9 We have, for all
,



Proof. The first identity is trivial. The second one follows immediately from
along with (5.14), which gives the recursion
. As for the third one, we can reason as follows: let us denote
and
. Then (5.14) yields
. Moreover, we have

This yields the recursion
with
and
and the claim easily follows
. The above result indicates a general argument to work out
for any
: setting
and
one has

and

This yields a k-dimensional recursion
(5.18)
with
matrix

and initial condition
(5.19)
By Perron-Frobenius theorem the matrix
has a simple real positive maximal eigenvalue
whose eigenvector
has strictly positive components. This immediately yields
(5.20)
More specifically, by the above the exact behaviour of
can be obtained by standard linear algebra. If for instance
can be diagonalized with spectrum
and corresponding eigenvectors
, then we can expand
so that (5.18) and (5.19) yield
(5.21)
where
denotes the first component of
. On the other hand, as we shall see in the forthcoming example,
is not always diagonalizable.
Examples. For
we find

so that by (5.20)
and using (5.21) one easily recover the result of Lemma 5.9 for
.
For
we get

In this case (5.21) does not hold but one easily finds

and
.
The case
is still different, yielding

NOTES
1Good general sources on this subject are -.

2The origin of these names traces back to Cauchy, who proved this property after it was observed by John Farey in 1816 , and named “Farey series” the numbers obtained in this way.

3All fractions are supposed in lowest terms.

4We say that an and bn are asymptotically equivalent, denoted as an ~ bn, if the quotient an/bn tends to unity as n approaches ∞.