Words and Numbers: A Dynamical Systems Perspective

Stefano Isola; Francesco Marchionni

doi:10.20944/preprints202603.0815.v1

Submitted:

09 March 2026

Posted:

10 March 2026

You are already at the latest version

Abstract

Along with some known and less known results, we discuss new insights relating combinatorics of words and the ordering of the rationals from a dynamical systems point of view, somehow continuing along the path started in [BI]. We obtain in particular a set of results that structure and enrich the correspondence between the Stern-Brocot (SB) ordering of rational numbers and the corresponding ordering of Farey-Christoffel (FC) words, a class of words that, since their appearance in literature at the end of the 18th century, have revealed numerous relationships with other fields of mathematics. Among the results obtained here is the construction of substitution rules that act on the FC words in a parallel way to the maps on the positive reals that generate the permuted SB tree both vertically and horizontally. A complete correspondence is obtained between the vertical and horizontal motions on the SB tree and the geodesic motions along scattering geodesics and the horocyclic motion along Ford circles in the upper half-plane, respectively.

Keywords:

Stern-Brocot tree

;

continued fractions

;

sturmian words

;

interval maps

;

horocycle flow

;

Ford circles

Subject:

Computer Science and Mathematics - Discrete Mathematics and Combinatorics

1. Preliminaries

The Stern-Brocot (SB) tree

T

is binary rooted tree which provides a way to order (and thus to count) the elements of

Q_{+}

, the set of positive rational numbers, so that every number appears (and thus is counted) exactly once (see [5,7,12,22]). To begin with, we say that a pair of nonnegative fractions

\frac{a}{b} < \frac{c}{d}

is a Farey pair if the unimodular relation

b c - a d = 1

holds (so that their distance is

1 / b d

). The basic operation needed to construct

T

associates to each Farey pair their mediant

\frac{a}{b} \oplus \frac{c}{d} = \frac{a + c}{b + d}

One readily sees that the child

\frac{a}{b} \oplus \frac{c}{d}

always lies somewhere in between its parents

\frac{a}{b}

and

\frac{c}{d}

, forming Farey pairs with them. Moreover, among all the fractions lying strictly between

\frac{a}{b}

and

\frac{c}{d}

it is the one (and only one) with the smallest denominator, and is always in lowest terms whenever the parents do (see [25]).

Remark 1.

Note that the mediant operation arises naturally in the following way: let L be the vertical half-line

{x = 1, y \geq 0}

in

R^{2}

, and denote by U the subspace of

R^{2}

given by of all vectors

u = (q, p)

with positive integer coordinates. Let

T : U \to Q_{+}

be the map given by

T (q, p) = p / q

, that is the ordinate of the intersection of u with L. Each reduced fraction on L is thus the image with T of a vector of U with coprime coordinates. Finally, given

u_{1}, u_{2} \in U

, we have

T (u_{1} + u_{2}) = T (u_{1}) \oplus T (u_{2})

Now, taking as initial pair

\frac{0}{1}

and

\frac{1}{0}

, we take their mediant

\frac{1}{1}

as the root of the tree. Then one writes one generation after the other using the above operation (a portion of this structure is depicted in Figure 1). As already observed,

Q_{+}

and

T

are in bijection. To a given

x \in Q_{+}

, we associate its depth, as the level of

T

it belongs to.

Lemma 1. ([5], Lemma 1.2)Let

x \in Q_{+}

then

x = [a_{0}; a_{1}, \dots, a_{n}] ⟹ depth (x) = \sum_{i = 0}^{n} a_{i}

Remark 2.

Note that the sub-tree

S

of

T

having

\frac{1}{2}

as root node and vertex set

Q_{+} \cap [0, 1]

(sometimes called Farey tree) can be obtained exactly in the same way as

T

taking as initial pair

\frac{0}{1}

and

\frac{1}{1}

instead of

\frac{0}{1}

and

\frac{1}{0}

. One easily sees ([5], Lemma 1.1) that

ϕ (T) = S

where

ϕ : [0, \infty) \to [0, 1]

is the invertible map defined by

ϕ (\infty) = 1

and

ϕ (x) = \frac{x}{x + 1}

.

One can also construct an equivalent tree whose vertex set is formed by binary strings, each fraction

p / q \in T

corresponding to a binary word

w_{\frac{p}{q}}

obtained by concatenation of its left and right parent as follows1.

Definition 1.(Farey-Christoffel (FC) words)Set

w_{\frac{0}{1}} = 0 and w_{\frac{1}{0}} = 1

If moreover

\frac{p^{'}}{q^{'}}

and

\frac{p^{''}}{q^{''}}

is a Farey pair and

\frac{p}{q} = \frac{p^{'}}{q^{'}} \oplus \frac{p^{''}}{q^{''}}

, we define

w_{\frac{p}{q}} = w_{\frac{p^{'}}{q^{'}}} w_{\frac{p^{''}}{q^{''}}}

Some notations: for

s \in {0, 1}

set

\hat{s} = 1 - s

. Then, for

w \in {0, 1}^{*}

given by

w = s_{1} \dots s_{n}

we set

\hat{w} = {\hat{s}}_{1} \dots {\hat{s}}_{n} and \tilde{w} = s_{n} \dots s_{1}

Also denote by

| w |

the length of w and by

{| w |}_{s}

the number of occurrence of the symbol

s \in {0, 1}

in w.

The above construction establishes a one to one correspondence between

Q_{+} ≃ T

and the set

F

of FC words.

Theorem 1.

We have the following properties:

1.: given $w \in F$ , we have $w = w_{\frac{p}{q}}$ with $\frac{p}{q} = \frac{{| w |}_{1}}{{| w |}_{0}}$ (so that $| w | = p + q$ ) ;
2.: given $\frac{p}{q} \in T$ with $p + q > 1$ we have $w_{\frac{p}{q}} = 0 c 1$ for some $c \in {0, 1}^{*}$ satisfying $c = \tilde{c}$ ;
3.: given $w_{\frac{p}{q}} = 0 c 1$ , we have $w_{\frac{q}{p}} = 0 \hat{c} 1$ ;
4.: given $w \in F$ with $| w | > 1$ , it can be uniquely factorized as $w = u v$ , where u and v are non-empty palindrome words. Moreover if $w = w_{\frac{p}{q}} = w_{\frac{p^{'}}{q^{'}}} w_{\frac{p^{''}}{q^{''}}}$ , then $| u | = p^{''} + q^{''}$ and $| v | = p^{'} + q^{'}$ .

Proof.

The first assertion follows from the definition, whereas the third easily follows from the second. Let us then prove 2. We proceed by induction in the depth. For the root node

\frac{1}{1}

we get

c = ϵ

, the empty word, so that the assertion is trivial. Suppose it is true up to depth

n > 1

, and consider

γ \in T

with depth

(γ) = n

. We have

w_{γ} = 0 c 1

with

c = \tilde{c}

. On the other hand

γ

is obtained as the child of a left and right parent, say

α

and

β

, one of depth

n - 1

and the other of depth

n - k

, for some

k = 2, \dots, n

(the case in which one parent is an ancestor is left to the reader). Set

w_{α} = 0 a 1

and

w_{β} = 0 b 1

, with

a = \tilde{a}

and

b = \tilde{b}

. Therefore

c = a 1 0 b = \tilde{b} 0 1 \tilde{a}

. Now consider a child

δ

of

γ

. If

δ

is the right child then by construction

w_{δ} = 0 c 1 0 b 1 = 0 a 1 0 b 1 0 b 1 = 0 d 1

with

d = a 1 0 b 1 0 b = \tilde{b} 0 1 \tilde{a} 1 0 b

, which is clearly palindromic. If

δ

is the left child, the same argument yields

w_{δ} = 0 d^{'} 1

with

d^{'} = a 1 0 \tilde{b} 0 1 \tilde{a}

.

To show the last statement, we note that from the above it follows that for

w = 0 c 1 \in F

, the palindrome c has always the structure

c = a 1 0 b = \tilde{b} 0 1 \tilde{a}

, with

a = \tilde{a}

and

b = \tilde{b}

. Therefore we can write

w = u v

with

u = 0 \tilde{b} 0

and

v = 1 \tilde{a} 1

, which are both palindrome words. As for the uniqueness, let

w = u v = t s

with

u, v, t, s

all palindromes. Assume without loss that

| u | > | t |

, so

u = t h

and

h v = s

, with

h \neq ϵ

. Since they are all palindromes, we have

v u = s t

, so that

v t h = h v t

. Then it readily follows that

w = h^{k}

for some positive

k \in N

. But this is absurd, since it should be

{| w |}_{0} = k {| h |}_{0}

and

{| w |}_{1} = k {| h |}_{1}

, but we already know that

{| w |}_{0} = p

and

{| w |}_{1} = q

with p and q coprime, and the case

k = 1

would imply

w = u = s = h

, absurd since

| w | > 1

and it couldn’t be palindromic. This holds true for each

w \in F

, except for the leftmost and rightmost nodes at each level, for which the uniqueness of the factorization is trivial since

w = 0 . . . 01

or

w = 01 . . . 1

. □

Remark 3.

The last statement of the above theorem yields two factorizations for

w \in F

with

| w | > 1

: thepalindromic factorization

w = u v

, with u and v both palindromes, and the so calledstandard factorization

w = w_{\frac{p}{q}} = w_{\frac{p^{'}}{q^{'}}} w_{\frac{p^{''}}{q^{''}}}

, in terms of FC sub-words. Both of them are unique.

Remark 4.

It follows from the definition that given a word with standard factorization

w = u v

, with

w_{\frac{p^{'}}{q^{'}}} = u

and

w_{\frac{p^{''}}{q^{''}}} = v

, then

u (u v)

and

(u v) v

are Christoffel words; in particular they are the children of w with the indicated standard factorization. Moreover, if

| w | \geq 3

, then either u is a proper prefix of v, and

v = u v^{'}

is the standard factorization of v, or v is a proper suffix of u, in which case

u = u^{'} v

.

Some rather immediate consequences of the above properties are formulated in the following corollaries (see also [2]).

Corollary 1.

Let

w = 0 c 1

be a FC word associated to some element of

T

. The FC words associated to its left and right children are given by

0 {(0 c)}^{-} 1 = 0 {(c 0)}^{+} 1 and 0 {(1 c)}^{-} 1 = 0 {(c 1)}^{+} 1

where

u^{-}

and

u^{+}

are the shortest palindrome with suffix, respectively prefix, given by u.

Corollary 2.

Let

w = 0 c 1

be a FC word associated to some element of

T

. The maximum among all its cyclic permutations is realized by the word

\tilde{w} = 1 c 0

.

Corollary 3.

The number of FC words of length n is given by Euler totient function

φ (n) = | {0 < i < n : \gcd (i, n) = 1} |

.

Proof.

From Theorem 1 we have that

{|w|}_{1} = p, {|w|}_{0} = q

. The totient function gives us the number of distinct p which are relatively prime with n, which coincides with the number of possible pairs

(p, q = n - p)

which are relatively prime. □

Figure 2. The first four level of the Christoffel words tree.

2. Relation with Cutting and Sturmian Sequences

Now, given

w \in F

we call

{| w |}_{1} / {| w |}_{0}

the slope of w. This is motivated by the following facts. To a given binary word

w = u_{1} \dots u_{n}

we can associate a stepwise walk on the lattice

Z^{2}

constructed by moving by a vertical step upwards (resp. horizontal step oriented on the right) for each occurrence of the symbol 1 (resp. 0). Clearly, the walks corresponding to

w = 0 c 1

and

\tilde{w} = 1 c 0

meet at the origin

(0, 0)

and at the point

(| w |_{0}, | w |_{1})

. Moreover, letting

α = {| w |}_{1} / {| w |}_{0}

, the central sequence c is nothing but the cutting sequence of the ray having slope

α

, where one writes 0 each time the ray cuts a vertical line, and 1 each time it cuts a horizontal line, on the open interval

(0, | w |_{0})

.

By the way, the FC word of slope

p / q

can be defined from the very beginning as a sequence of unitary steps joining points of integer lattice from

(0, 0)

to

(q, p)

so that (i) the corresponding path is the nearest path below the line segment joining these two points; (ii) there are no points of the integer lattice between the path and line segment (see [2]). When the slope is irrational, a similar definition leads to the notion of (infinite) Sturmian sequence.

In >Figure 3 we report the case with slope

3 / 5

(with

r (w) \equiv \tilde{w}

).

>Figure 4 shows the cutting sequences of the two parents of

3 / 5

, namely

1 / 2

and

2 / 3

(when concatenating two finite cutting sequences, one has to interpose the word 10, which corresponds to a cut with a corner).

Remark 5.

The standard factorization

w = w_{\frac{p}{q}} = w_{\frac{p^{'}}{q^{'}}} w_{\frac{p^{''}}{q^{''}}}

in terms of FC sub-words (cf. Remark 3), can be obtained geometrically by cutting the walk corresponding to w at the lattice point

(q^{'}, p^{'})

closest to the segment joining

(0, 0)

with

(q, p)

. The last property implies that

p q^{'} - q p^{'} = 1

and therefore

p (p^{'} + q^{'}) = p^{'} (p + q) + 1 = 1 (\mod p + q)

. In the same way, we can show that

q (p^{''} + q^{''}) = 1 (\mod p + q)

. We therefore see that the lengths of the factors

| w_{\frac{p^{'}}{q^{'}}} | = p^{'} + q^{'}

and

| w_{\frac{p^{''}}{q^{''}}} | = p^{''} + q^{''}

are the respective multiplicative inverses in

{0, 1, \dots, p + q - 1}

of p and q.

Now, putting together Remark 6 and, e.g., [6], Section 1 (or else [19], Chap. 6), one sees that the FC word

w \equiv w_{α}

can also be characterized as the symbolic representation of the orbit

{R_{β}^{k} (0)}_{k = 0}^{n - 1}

w.r.t. the partition

S^{1} = [0, 1 - β) \cup [1 - β, 1)

, with

n = | w |

and

R_{β} : S^{1} \to S^{1}

the rotation of angle

β = ϕ (α)

, sometimes also called the Sturm sequence of

β

. More specifically, set

ϵ (x) = \{\begin{matrix} 0, & 0 \leq x < 1 - β \\ 1, & 1 - β \leq x < 1 \end{matrix}

and note that

x + β = R_{β} (x) + ϵ (x)

, which can be iterated to give

x + n β = R_{β}^{n} (x) + ϵ (R_{β}^{n - 1} (x)) + ϵ (R_{β}^{n - 2} (x)) + \dots + ϵ (x) = R_{β}^{n} (x) + [n β]

Setting

w = u_{1} \dots u_{n}

, we then have

u_{k} = ϵ (R_{β}^{k} (x)) = [k β] - [(k - 1) β], k = 1, \dots, n .

(1)

Note that, since

β \in (0, 1)

we have

u_{k} \in {0, 1}

. More precisely, if

α > 1

(

β > \frac{1}{2}

) in w the symbol 0 is always isolated and between any two 0’s there are either

[α]

or

[α] + 1

1’s. If instead

α < 1

(

β < \frac{1}{2}

) in w the symbol 1 is isolated and between any two 1’s there are either

[1 / α]

or

[1 / α] + 1

0’s. The opposite plainly happens to

\hat{w}

.

The above generation rule can be further rephrased as follows (closely mirroring the original construction by Christoffel). Let

p / q \in T

and set

n = p + q

. Define the group translation

T_{p} : Z_{n} \to Z_{n}

as

T_{p} : x \mapsto x + p (\mod n)

Lemma 2.

Let

w = u_{1} \dots u_{n} \in F

, with

n > 1

, and

\frac{p}{q} = \frac{{| w |}_{1}}{{| w |}_{0}}

(so that

| w | = n = p + q

) be the corresponding element of

T

. Consider the partition

Z_{n} = Q_{0} \cup Q_{1}

with

Q_{0} = {0, 1, \dots, q - 1}

and

Q_{1} = {q, q + 1, \dots, n - 1}

.

u_{k} = ℓ ⟺ T_{p}^{(k - 1)} (0) \in Q_{ℓ}, ℓ \in {0, 1}, k = 1, \dots, n

Proof.

From the geometric interpretation of the FC words given above, one deduces the following rule: for any

k = 1, \dots, n

we have

u_{k} = 0

if

k \cdot p (\mod n) > (k - 1) \cdot p (\mod n)

and

u_{k} = 1

in the opposite case.

Now note that, setting

(k - 1) \cdot p (\mod n) = ℓ

, if

k \cdot p (\mod n) = ℓ + p

then

u_{k} = 0

, whereas if

k \cdot p (\mod n) = ℓ - q

then

u_{k} = 1

. In other words,

u_{k} = 0

if and only if

(k - 1) \cdot p (\mod n) \in Q_{0}

and

u_{k} = 1

if and only if

(k - 1) \cdot p (\mod n) \in Q_{1}

. □

Remark 6.

If one works with the sub-tree

S

instead of

T

(see Remark 2), assigning the initial symbols 0 and 1 to

0 / 1

and

1 / 1

(instead of

1 / 0

), then the above conclusions are unchanged provided

p / q

is replaced by

ϕ (p / q) = p / (p + q)

(and

q / p

by

q / (p + q)

), so that the denominator of the corresponding fraction always equals the length of the FC word. Moreover, the algorithm of Lemma 2, remains unchanged provided we let

T_{p}

act on

Z_{q}

instead of

Z_{p + q}

and we set

Q_{0} = {0, 1, \dots, q - p - 1}

and

Q_{1} = {q - p, q - p + 1, \dots, q - 1}

.

Finally, we note that the map ϕ induces the substitution map on FC words given by

0 \to 0

and

1 \to 01

. A short reflection shows that this rule can be used to obtain the FC word

w_{α} = u_{1} \dots u_{n}

constructed above from the Sturm sequence of α itself, that is the word

w_{α}^{'} = v_{1} \dots v_{q}

, with

q = {| w |}_{0}

and

v_{k} = [k α] - [(k - 1) α]

.

3. Relation with Continued Fractions

We have already seen (cf. Lemma 1) how the depth of each element

x \in T

is related to the partial quotients of its continued fraction expansion (c.f.e.)

x = [a_{0}; a_{1}, \dots, a_{n}]

. This connection can be further expanded. One starts by constructing a matrix representation of the positive rationals as follows: given

z \in C

and

X = (\begin{matrix} n & m \\ t & s \end{matrix}) \in S L (2, Z)

set

X (z) (n z + m) / (t z + s)

and identify

X ⟺ X (1) = \frac{n + m}{t + s} \in Q_{+}

(2)

Clearly

m / s

and

n / t

are but the parents of x. We have

\frac{1}{2} ⟺ (\begin{matrix} 1 & 0 \\ 1 & 1 \end{matrix}) = : A e \frac{2}{1} ⟺ (\begin{matrix} 1 & 1 \\ 0 & 1 \end{matrix}) = : B

(3)

and moreover

(\begin{matrix} n & m \\ t & s \end{matrix}) (\begin{matrix} 1 & 0 \\ 1 & 1 \end{matrix}) = (\begin{matrix} m + n & m \\ s + t & s \end{matrix}) ⟺ \frac{m}{s} \oplus \frac{m + n}{s + t}

and

(\begin{matrix} n & m \\ t & s \end{matrix}) (\begin{matrix} 1 & 1 \\ 0 & 1 \end{matrix}) = (\begin{matrix} n & m + n \\ t & s + t \end{matrix}) ⟺ \frac{m + n}{s + t} \oplus \frac{n}{t}

Hence the matrices A and B, when acting from the right, move downwards on

T

, respectively to the left and to the right.

Putting together the above, along with Lemma 1, we get:

Proposition 1.

Each

\frac{p}{q} = [a_{0}; a_{1}, \dots, a_{n}] \in T

, with

depth (\frac{p}{q}) > 1

, corresponds to a unique element

X \in S L (2, Z)

, for which there are only two possibilities:

n even $⟹$ $X = B^{a_{0}} A^{a_{1}} \dots A^{a_{n - 1}} B^{a_{n} - 1}$
n odd $⟹$ $X = B^{a_{0}} A^{a_{1}} B^{a_{2}} \dots A^{a_{n} - 1}$

Moreover, let

\frac{p}{q} = \frac{p^{'}}{q^{'}} \oplus \frac{p^{''}}{q^{''}}

and

w_{\frac{p}{q}} = w_{\frac{p^{'}}{q^{'}}} w_{\frac{p^{''}}{q^{''}}}

be the corresponding FC word, then

X = (\begin{matrix} | w_{\frac{p^{''}}{q^{''}}} |_{1} & | w_{\frac{p^{'}}{q^{'}}} |_{1} \\ | w_{\frac{p^{''}}{q^{''}}} |_{0} & | w_{\frac{p^{'}}{q^{'}}} |_{0} \end{matrix})

For a given element

x \in T

, the matrix product X can be used to code the descending path which reaches x starting from

\frac{1}{1}

as a binary string

σ (x) \in {0, 1}^{*}

, where each symbol 0 corresponds to an occurrence of A (down left move) and each symbol 1 to an occurrence of B (down right move).

We may now ask what kind of relation can be established between

σ (x)

and its FC word

w (x) \in F

(a reverse relation yielding the c.f.e. of x from the corresponding FC word w is discussed in Section 4 below).

The sought relation can be readily obtained from Corollary 1. Indeed, given a palindromic word

u \in {0, 1}^{*}

and a symbol

a \in {0, 1}

, we set

Φ_{a} (u) = {(u a)}^{+} = {(a u)}^{-}

(4)

For example we have

Φ_{0} (0110) = 01100110

and

Φ_{1} (0110) = 011010110

. Note moreover that

Φ_{a} (ϵ) = a

. A direct consequence of Corollary 1 is now the following rule.

Proposition 2.

Let

σ (x) = σ_{1} \dots σ_{k} \in {0, 1}^{*}

be the path of

x \in T

, and

w (x) = 0 c 1

its FC word. Then we have

c = Φ_{σ_{k}} \circ Φ_{σ_{k - 1}} \circ \dots \circ Φ_{σ_{1}} (ϵ)

(5)

Example. Taking

x = 3 / 5 = [0; 1, 1, 2]

, from Proposition 1 we have

σ (x) = 010

. Thus, applying rule (5) we get

c = Φ_{0} \circ Φ_{1} \circ Φ_{0} (ϵ) = Φ_{0} \circ Φ_{1} (0) = Φ_{0} (010) = 010010 .

Finally

w (x) = 0 c 1 = 00100101

(to be compared with the portions of the trees

T

and

F

reproduced above).

Remark 7.

The maps (4) have been introduced by Aldo de Luca in [13], who called thempalindromic closures. More generally, in combinatorial word theory literature the transformation mapping the word

σ (x)

to the central palindrome c of

w (x)

is usually encoded by a function

P a l : {0, 1}^{*} \to {0, 1}^{*}

defined recursively as follows [4]: set

P a l (ϵ) = ϵ

. If

u = v z \in {0, 1}^{*}

for some

z \in {0, 1}

then

P a l (u) = {(P a l (v) z)}^{+}

. Although the two approaches are of course equivalent, the one outlined above seems more transparently connected to the present construction.

3.1. Reversals and Duality

If we let A and B act on the left we get

(\begin{matrix} 1 & 0 \\ 1 & 1 \end{matrix}) (\begin{matrix} n & m \\ t & s \end{matrix}) = (\begin{matrix} n & m \\ n + t & m + s \end{matrix}) ⟺ \frac{n + m}{n + m + t + s}

and

(\begin{matrix} 1 & 1 \\ 0 & 1 \end{matrix}) (\begin{matrix} n & m \\ t & s \end{matrix}) = (\begin{matrix} n + t & m + s \\ t & s \end{matrix}) ⟺ \frac{n + m + t + s}{s + t}

That is, they move a fraction

\frac{p}{q}

respectively to its left and right descendants

\frac{p}{p + q}

and

\frac{p + q}{q}

on

T

. Now, if we associate to a given fraction

x \in T

a matrix product

X = \prod_{i = 1}^{d} M_{i}

where

d = depth (x)

, as above, then we can consider the involution

x \to \hat{x}

, where

\hat{x}

is the rational number represented by the reversed matrix product

\hat{X} = \prod_{i = d}^{1} M_{i}

. This map acts as a permutation on

Q_{+}

and the corresponding permuted tree

\hat{T}

can be constructed starting from the root node

\frac{1}{1}

and writing under each vertex

\frac{p}{q}

the set of its descendants

{\frac{p}{p + q}, \frac{p + q}{q}}

.

Note moreover that, according to Proposition 1, the following rule is in force: let

x = [a_{0}; a_{1}, \dots, a_{n}]

, then

n even $⟹$ $\hat{X} = B^{a_{n} - 1} A^{a_{n - 1}} \dots A^{a_{1}} B^{a_{0}}$
n odd $⟹$ $\hat{X} = A^{a_{n} - 1} B^{a_{n - 1}} \dots A^{a_{1}} B^{a_{0}}$

and therefore,

n even $⟹$ $\hat{x} = [a_{n} - 1; a_{n - 1}, \dots, a_{1}, a_{0} + 1]$
n odd $⟹$ $\hat{x} = [0; a_{n} - 1, a_{n - 1}, \dots, a_{1}, a_{0} + 1]$

Definition 2.

Let

σ (x) = σ_{1} \dots σ_{k} \in {0, 1}^{*}

be the path of

x \in T

, and

w (x) = 0 c 1

its FC word. The FC word

\hat{w} = 0 \hat{c} 1

associated to

\hat{x}

, for which

\hat{c} = Φ_{σ_{1}} \circ Φ_{σ_{1}} \circ \dots \circ Φ_{σ_{k}} (ϵ)

is called thedual wordto w. In the same vein, x and

\hat{x}

will be referred to asdual elementsin

T

.

It turns out (see [4]) that whenever w and

w^{*}

are dual words associated to the irreducible fractions

x = \frac{p}{q}

and

\hat{x} = \frac{\hat{p}}{\hat{q}}

, we have

p + q = \hat{p} + \hat{q}

and

\hat{p}

and

\hat{q}

are the respective multiplicative inverses in

{0, 1, \dots, p + q - 1}

of p and q, that is

p \hat{p}, q \hat{q} \equiv 1 (\mod n)

with

n = p + q

(these inverses exist because p and q are relatively prime and therefore are also relatively prime to

n = p + q

. Therefore

\hat{p}

and

\hat{q}

are relatively prime). A straightforward consequence of this property and the content of Remark 5 is the following:

Lemma 3.

Let

x = \frac{p}{q}

and

\hat{x} = \frac{\hat{p}}{\hat{q}}

be dual elements in

T

. Then

\frac{p}{q} = \frac{p^{'}}{q^{'}} \oplus \frac{p^{''}}{q^{''}} if and only if \frac{\hat{p}}{\hat{q}} = \frac{p^{'}}{p^{''}} \oplus \frac{q^{'}}{q^{''}}

3.2. Motions on $\hat{T}$ and $\hat{F}$ .

We start recalling some results discussed in [5] about dynamics on

\hat{T}

. We start observing that the descendants of a fraction

\frac{p}{q}

are just its pre-images w.r.t. the map

F : R_{+} \to R_{+}

given by

F : x \mapsto \{\begin{matrix} \frac{x}{1 - x}, & 0 \leq x \leq 1 \\ x - 1, & x > 1 \end{matrix}

(6)

The map F can thus be used to generate “vertically” the permuted tree

\hat{T}

. Moreover, according to ([5], Proposition 2.3),

\hat{T}

can also be generated “horizontally” by means of the map

R : R_{+} \to R_{+}

given by

R (0) = 1

,

R (\infty) = 0

and

R (x) = \frac{1}{1 - x + 2 [x]}, x \in R_{+}

(7)

More precisely, denoting with

r_{n}

the n-th rational number obtained by `reading’

T

row by row, from left to right, starting from the root, and letting

r_{n^{*}}

be the element of the permuted tree

\hat{T}

corresponding to

r_{n} \in T

, it holds

r_{\hat{n}} = R^{n - 1} (1)

(or else

r_{n} = R^{\hat{n} - 1} (1)

).

Turning now to consider the permuted FC tree

\hat{F}

, an easy consequence of the construction outlined above (see also [2], Lemma 2.2) is the following:

Lemma 4.

Let w be the FC word associated to some element

\frac{p}{q} \in T

. The FC words associated to its descendants

\frac{p}{p + q}

and

\frac{p + q}{q}

are obtained by applying to w the substitution rules:

\begin{matrix} S_{0} : (0, 1) \to (0, 01) \\ S_{1} : (0, 1) \to (01, 1) \end{matrix}

Now note that any FC word w of length n can be written in the form

w = 0^{n_{1}} 1 0^{n_{2}} \dots 0^{n_{p}} 1, n_{i} \geq 1, \sum_{i = 1}^{p} n_{i} = q

(8)

whenever its slope

{| w |}_{1} / | w_{0} | = p / q \in (0, 1)

, or else

w = 0 1^{n_{1}} 0 1^{n_{2}} \dots 0 1^{n_{q}}, n_{i} \geq 1, \sum_{i = 1}^{q} n_{i} = p

(9)

whenever

p / q > 1

. As noted before (cf.remark after eq. (1), see also [21]) the integers

n_{i}

may get only two values. They are

[q / p]

or

[q / p] + 1

, if the slope

p / q

is smaller than one;

[p / q]

or

[p / q] + 1

, otherwise. Following [21], we call the exponent

[q / p] \geq 1

(or

[p / q]

) the value of w.

This naturally induces a decomposition of

F

(or

\hat{F}

) as

F = F_{< 1} \cup F_{\geq 1}

(with obvious meaning of the notations), so that

S_{0} : F \to F_{< 1}

and

S_{1} : F \to F_{\geq 1}

, in particular

F_{< 1}

consists of all the left nodes of

\hat{F}

, while

F_{\geq 1}

consists of all the right node, plus the root.

We are now ready to introduce a map T on words which generates the “horizontal” motion on

\hat{F}

, namely the displacement row by row, from left to right, starting from the root, in a similar way to how R does it for

\hat{T}

.

Theorem 2.

The map T that moves from a given word

w \in \hat{F}

to the next one, can be written as

T = T_{0} \cup T_{1}

, where the maps

T_{0} : F_{< 1} \to F_{\geq 1}

and

T_{1} : F_{\geq 1} \to F_{< 1}

act as follows:

\begin{matrix} T_{0} : (0^{k + 1} 1, 0^{k} 1) \to ({(01)}^{k} 1, {(01)}^{k - 1} 1) \\ T_{1} : (01^{k + 1}, 01^{k}) \to (0^{k} 1, 0^{k + 1} 1) \end{matrix}

where k is the value of w.

Proof.

Let

w = 0^{n_{1}} 1 0^{n_{2}} \dots 0^{n_{p}}

with:

n_{i} = k o r k + 1 f o r i = 1, \dots, p, a n d \sum_{i = 1}^{p} n_{i} = q .

Let

w^{'}

be the parent node of w and

T (w)

, we have that

w^{'}

is given by

S_{0}^{- 1} (w)

and, recalling that

0^{0} = ϵ

, we have:

w^{'} = 0^{n_{1} - 1} 10^{n_{2} - 1} 1 \dots 0^{n_{p} - 1} 1 .

Then, thanks to

S_{1}

, we have

T (w) = S_{1} (w^{'}) = {(01)}^{n_{1} - 1} 1 {(01)}^{n_{2} - 1} 1 \dots {(01)}^{n_{p} - 1} 1,

and we have shown

T_{0} = T [F_{< 1}]

.

Now we will show that

T_{1} = T [F_{\geq 1}]

by induction on the depth m of the word w. For

m = 1

, that

T (01) = T_{1} (01) = 001

is trivial. Let’s then assume it holds true for each w at depth m, and we will prove it for

m + 1

. Let

w = 01^{n_{1}} 01^{n_{2}} \dots 01^{n_{q}}

with:

n_{i} = k o r k + 1 f o r i = 1, \dots, q, a n d \sum_{i = 1}^{q} n_{i} = p .

Let

w^{'}

be the parent node of w, and

w^{''} = T (w^{'})

the parent node of

T (w)

. Then

T (w) = S_{0} (w^{''})

. Clearly,

w^{'}

is given by

w^{'} = S_{1}^{- 1} (w) = 01^{n_{1} - 1} 01^{n_{2} - 1} \dots 01^{n_{q} - 1} .

Now, let us consider the q subwords

01^{n_{i} - 1}

individually, and we call

{\bar{n}}_{i}

the complement of

n_{i}

in the set

{k, k + 1}

. Then, if

k > 1

, we have, by the induction hypothesis, that

w^{''} = T_{1} (w^{'})

and so, by the action of

T_{1}

, the subword

01^{n_{i} - 1}

becomes

0^{{\bar{n}}_{i} - 1} 1

, and applying

S_{0}

, we get:

T (w) = S_{0} (w^{''}) = 0^{{\bar{n}}_{1}} 10^{{\bar{n}}_{2}} 1 \dots 0^{{\bar{n}}_{q}} 1

which we wanted to show.

On the other hand, if

k = 1

, then the subword

01^{n_{i} - 1}

is either 0 or 01, so that

w^{'} \in F_{< 1}

and

T (w^{'}) = T_{0} (w^{'})

. Thus, applying

T_{0}

, it is clear2 that

\forall i = 1, \dots, q

for which

n_{i} - 1 = 0

, we get 01, while

\forall i = 1, \dots, q

for which

n_{i} - 1 = 1

, we get 1. And, applying

S_{0}

, we get that 01 becomes 001, while 1 become 01. So, putting it all together, we have

01^{n_{i}} S^{- 1} 01^{n_{i} - 1} T 0^{{\bar{n}}_{i} - 1} 1 S_{0} 0^{{\bar{n}}_{i}} 1

which is what we needed to prove. □

The map T, defined for FC words, can be used to generate “horizontally" the tree

\hat{F}

as the map R can be used to generate “horizontally" the tree

\hat{T}

. Since R is defined on

R_{+}

we would like to find an extension of T such that the correspondence with R is not limited to

Q_{+}

.

Conjecture 3.

The map T, defined in Theorem 2, can be extended to the set of Sturmian sequences3 (cf. Section 2) and it corresponds to the map R, in the sense that if w is the Sturmian sequence of slope

x \in R_{+}

, then u, obtained as

0 u = T (0 w)

is the Sturmian sequence of slope

R (x)

.

By analyzing the frequencies of the letters 0 and 1 of

0 u = T (0 w)

, one finds that the ratio of 1’s and 0’s corresponds to

R (x)

. The problem is to show that

T (w)

is indeed a Sturmian sequence or, equivalently, that the set of Sturmian sequences is closed wrt T.

Remark 8.(Connection with S-adic systems)

On the permuted tree

\hat{T}

one can introduce a symmetric random walk

{(Z_{k})}_{k \geq 1}

in the following way: set

Z_{1} = \frac{1}{1}

and if

Z_{k} = \frac{p}{q}

then either

Z_{k + 1} = \frac{p}{p + q}

or

Z_{k + 1} = \frac{p + q}{q}

, both with probability

\frac{1}{2}

. In [5] it is proved that this process enters any non empty interval

I = (a, b) \subset R_{+}

almost surely (Thm. 1.12) and, more specifically, it does it with asymptotic frequency

ρ (I) = \int_{a}^{b} d ρ (x)

(Corollary 3.7), where

ρ : {\bar{R}}_{+} \to [0, 1]

encodes the infinite path of

x \in {\bar{R}}_{+}

by interpreting it as the binary expansion of a real number in

[0, 1]

. Differently said,

ρ (0) = 0

,

ρ (\infty) = 1

and, if

x = [a_{0}; a_{1}, a_{2}, \dots]

, then

ρ (x) = 0 . \underset{a_{0}}{\underset{︸}{11 \dots 1}} \underset{a_{1}}{\underset{︸}{00 \dots 0}} \underset{a_{2}}{\underset{︸}{11 \dots 1}} \dots

(10)

A similar study can be pursued on the permuted tree

\hat{F}

, starting from the observation that the substitutions

S_{0}

and

S_{1}

defined in Lemma 4, whose incidence matrices coincide with A and B, define a so called S-adic system (see [20], pp. 87-109, and [3]), which, however, are rarely considered as generating a random process. For an interesting analysis of the spectral properties of S-adic random system arising from an i.i.d. sequence of unimodular substitutions, see [23]. Besides, it would be also interesting to study the dynamics induced by the map T defined in Thm. 2 from a statistical point of view (see the next Section for some results for the map R).

Remark 9.(FC words and musical scales)

FC words that are dual to one another deserve an important role in the theory of well-formed scales in music theory [8] (see also [14]). Loosely speaking, we first say that a scale isgeneratedif its elements can be obtained by an iterated application of a generator4, i.e. a fixed transposition on a given pitch class, and then we say that a generated scale iswell-formed, if each generating interval spans the same number of scale steps (including the return to origin interval). A remarkable property brought into light by the recent developments in music and combinatorics on words [11] starts from the observation that, for example, the FC word

w = 0001001

, corresponding to the fraction 2/5, is the sequence of intervals corresponding to the ancient mixolydian (descending) mode B’-A-G-F-E-D-C-(B) (or else to the ascending lydian mode as a medieval ecclesiastical mode), where 0 stands for a tone and 1 for a semi-tone. If we now take the slope 4/3, where 4 and 3 are the multiplicative inverses of respectively 2 and 5 modulo 7, the dual Christoffel word

\hat{w} = 0101011

corresponds to the same mode B’-E-A-D-G-C-F-(Bb) but in a different presentation, where now 0 stands for a descending perfect fifth (the generator) and 1 for an ascending perfect fourth (the generator’s complement within the octave), so that the pitches reached thereby all lie within the octave under the initial B’. The two presentations are respectively called thescale-step patternand thescale foldingof the mode. The other seven diatonic modes forming of the diatonic 7-notes family can be obtained from this mode by conjugation, where we say that two elements w and

w^{'}

of

{0, 1}^{*}

areconjugateif there exist words u and v such that

w = u v

and

w^{'} = v u

(or equivalently if they are conjugated in the free group

< 0, 1 >

). In the same vein can be treated other musical scales, such as the pentatonic scales (starting from the scale-step pattern 01011, whose dual is 00101), or the so called `tetractys’ (starting with 011, which is self-dual). This quick sketch can hopefully give a sense of the richness lying in the folds of the interaction between these domains. One interpretation of this richness may come from thinking of the FC words as divisions into “almost equal” parts (cf. section 17.3 in [24]), in the following sense: if

d < n

are relatively prime, then

n = d q + r

with positive remainder r. Therefore n is not divisible into d equalintegerparts. On the other hand, the second-best solution is to divide n into

d - r

equal parts of size q, and the remaining r parts of size

q + 1

. By writing these parts as a word of length d, as evenly as possible, one obtains a FC word (cf. the geometric interpretation presented at the beginning of Section 2 and in Figure ).

4. Ordering and Dynamical Systems

We shall now discuss some further aspects of the relation between the c.f.e. of a given element of

x \in T

and its FC word

w \in F

. To this end we recall that any FC word w of length n can be written in the form shown in (8) or (9) depending on its slope (cf. Section 3.2).

Then, we can construct a derived word

w^{'}

via the following algorithm: suppose that the slope

p / q

of w is smaller than one and its value is k (that is

[q / p] = k

). Then the symbol 1 is isolated and we perform the substitution

0 \to 0

and

0^{k} 1 \to 1

. If, instead, the slope

p / q

is larger than one, and

[p / q] = k

, then the symbol 0 is isolated and we perform the substitution

1 \to 1

and

01^{k} \to 0

. We keep iterating this procedure until we end up with a single symbol, 0 or 1, while recording the values

a_{0}, a_{1}, \dots, a_{n}

of the derived sequences5. We have the following:

Proposition 3.

Let

x \in T

and

w \in F

be the corresponding FC word. The values of the successively derived words

w^{'}, w^{''}, \dots

coincide with the partial quotients of the c.f.e. of x.

Proof.

The proof amounts to noting that the reduction procedure corresponds to repeated applications to the slope of the map

F : R_{+} \to R_{+}

given by

F : x \mapsto \{\begin{matrix} \frac{x}{1 - x}, & 0 \leq x \leq 1 \\ x - 1, & x > 1 \end{matrix}

(11)

whose action of c.f.e.’s is6

F : [a_{0}; a_{1}, a_{2}, \dots] \mapsto \{\begin{matrix} [0; a_{1} - 1, a_{2}, \dots], & a_{0} = 0 \\ [a_{0} - 1; a_{1}, a_{2}, \dots], & a_{0} > 0 \end{matrix}

(12)

More precisely, if w has slope x and value k then the derived sequence

w^{'}

has slope

F^{k} (x)

, and value either

[F^{k} (x)]

or

[1 / F^{k} (x)]

. □

Example. For

p / q = 3 / 5 = [0; 1, 1, 2]

and

w = 00100101

we get the following table.

derivation step	FC word	slope	value
0	00100101	$3 / 5$	1
1	01011	$3 / 2$	1
2	001	$1 / 2$	2
3	1	$1 / 0$	∞

Now, any

\frac{p}{q} \in T

of depth

d \geq 1

is the descendant of another fraction

\frac{p^{'}}{q^{'}} \in T

of depth

d - 1

, which we call its antecedent, given by the following rule: if

p > q

then

q^{'} = q

and

p^{'} = p - q

; if instead

q > p

then

p^{'} = p

and

q^{'} = q - p

. Differently said,

\frac{p^{'}}{q^{'}} = F (\frac{p}{q})

. Therefore, according to what we have said in Section 3.1, the binary coding

σ (x) = σ_{1} \dots σ_{k}

of an element

x \in T

of depth

k + 1

can be computed in terms of the symbolic orbit of x with the map F:

σ_{i} (x) = \{\begin{matrix} 0, & F^{i - 1} (x) \leq 1, \\ 1, & F^{i - 1} (x) > 1, \end{matrix} i = 1, \dots, k

(13)

This rule can be immediately checked for the already discussed example

x = 3 / 5

. For a less trivial example consider the fraction

x = 65 / 19

, whose c.f.e. is

[3; 2, 2, 1, 2]

. It has depth

3 + 2 + 2 + 1 + 2 = 10

and from Proposition 1 its symbolic coding is

σ (x) = 111001101

. Without knowing the c.f.e. this binary sequence can be obtained from the antecedents, i.e. the F-images of x till the root of

T

. They are

\frac{65}{19}, \frac{46}{19}, \frac{27}{19}, \frac{8}{19}, \frac{8}{11}, \frac{8}{3}, \frac{5}{3}, \frac{2}{3}, \frac{2}{1}, (\frac{1}{1})

and one easily checks that the sequence obtained applying rule (13) is just

σ (x)

written above.

We have said that the tree

T

enumerates the positive rationals, but what is the ordering induced on

Q_{+}

? Denoting again with

r_{n}

the n-th rational number obtained by `reading’

T

row by row, from left to right, starting from the root, we have

r_{1} = \frac{1}{1}, r_{2} = \frac{1}{2}, r_{3} = \frac{2}{1}, r_{4} = \frac{1}{3}, r_{5} = \frac{2}{3}, r_{6} = \frac{3}{2}, r_{7} = \frac{3}{1}, r_{8} = \frac{1}{4}, \dots

The general rule is in the following:

Theorem 4.

Given

1 \neq x \in T

, let

σ (x) = σ_{1} \dots σ_{k}

be its binary coding. Then we have

x = r_{n}

with

n = 2^{k} + \sum_{l = 1}^{k} σ_{l} 2^{k - l}

.

Example. The number

x = 65 / 19

yields

n = 2^{9} + 2^{8} + 2^{7} + 2^{6} + 2^{3} + 2^{2} + 2^{0} = 973

, namely

65 / 19

is the nine hundred seventy-third rational number in the Stern-Brocot ordering.

Proof.

Let

r_{\hat{n}}

be the element of the permuted tree

\hat{T}

corresponding to

r_{n} \in T

(or else

r_{n}

and

r_{\hat{n}}

are dual elements in

T

). Then

n = 2^{k} + \sum_{l = 1}^{k} σ_{l} 2^{k - l}

if and only if

\hat{n} = 2^{k} + \sum_{l = 1}^{k} σ_{l} 2^{l - 1}

. According to the above, it holds

r_{\hat{n}} = R^{n - 1} (1)

(or else

r_{n} = R^{\hat{n} - 1} (1)

), where R is the map defined in (7). Furthermore, an easy adaptation of ([5], Theorem 2.3) shows that R is topologically conjugated with the dyadic odometer (or von Neumann-Kakutani transformation [26])

K : [0, 1] \to [0, 1]

, given by

K (1) 0

and

K (x) x + \frac{1}{2^{n - 1}} + \frac{1}{2^{n}} - 1, 1 - \frac{1}{2^{n - 1}} \leq x < 1 - \frac{1}{2^{n}}, n \geq 1,

via the map

ρ

defined in (10), i.e.

R = ρ^{- 1} \circ K \circ ρ .

(14)

Finally, it is well known (see, e.g., [17]) that the map K can be used to generate the Van der Corput sequence

ω = (t_{n})

, defined as follows: set first

t_{1} = 1 / 2

. Then, given

n \geq 2

, let

n = 2^{k} + \sum_{l = 1}^{k} s_{l} 2^{l - 1}

be its dyadic expansion and set

t_{n} = 2^{- k - 1} + \sum_{l = 1}^{k} s_{l} 2^{- l}

. The first terms of

ω

are

t_{1} = \frac{1}{2}, t_{2} = \frac{1}{4}, t_{3} = \frac{3}{4}, t_{4} = \frac{1}{8}, t_{5} = \frac{5}{8}, t_{6} = \frac{3}{8}, t_{7} = \frac{7}{8}, t_{8} = \frac{1}{16}, \dots

Accordingly, we have

t_{n} = K^{n - 1} (1 / 2)

,

n \geq 1

, and one readily gets the claim. □

Remark 10.

Note that the forward orbit of 1 with R is dense in

R_{+}

, but it grows only logarithmically, as

R^{2^{n} - 2} (1) = n

. Moreover, according to [10] and [18], the following representation is in force:

R^{n} (1) = b (n) / b (n + 1)

,

n \geq 0

, where

b (n)

is the number ofhyperbinaryrepresentations of n, that is the number of ways of writing the integer n as a sum of powers of 2, each power being used at most twice. For instance

8 = 2^{3} = 2^{2} + 2^{2} = 2^{2} + 2 + 2 = 2^{2} + 2 + 1 + 1

and thus

b (8) = 4

.

The two maps F and R introduced above satisfy the following remarkable commutation rule:

Proposition 4.

For all

x \in R_{+}

we have

R^{m} \circ F^{n} (x) = F^{n} \circ R^{2^{n} m} (x), n, m \geq 1

Proof.

For the case

n = m = 1

the proof amounts to a straightforward verification, either by direct inspection or through the action of F and R on c.f.e.’s, that is (12) and

R : [a_{0}; a_{1}, a_{2}, \dots] \mapsto \{\begin{matrix} [1; a_{1} - 1, a_{2}, \dots], & a_{0} = 0 \\ [0; a_{0}, 1, a_{1} - 1, a_{2}, \dots], & a_{0} > 0 \end{matrix}

(15)

The general case easily follows by induction. □

Note that the map R is invertible, with inverse

R^{- 1} (x) = 1 - \frac{1}{x} + 2 [\frac{1}{x}]

(16)

On the other hand, the map F is two-to-one, with

F^{- 1} (x) = \{\frac{x}{x + 1}, x + 1\}

(17)

In particular, the set of F-pre-images of

x = \frac{p}{q}

coincides with the set of the descendants

{\frac{p}{p + q}, \frac{p + q}{q}}

considered above (cf. Section 3.1).

Therefore, as an ordered set, the tree

\hat{T}

can be generated both `horizontally’, as the set of successive R-images of 1, and `vertically’, as the set of successive F-pre-images of 1:

\hat{T} = \cup_{n \geq 0} R^{n} (1) = \cup_{n \geq 0} F^{- n} (1)

, and, more specifically,

\cup_{k = 0}^{2^{n} - 2} R^{k} (1) = \cup_{k = 0}^{n - 1} F^{- k} (1), \forall n \geq 1 .

Regarding the ergodic properties of these maps, we start observing that F possesses an absolutely continuous invariant measure

ν

, which can be computed explicitly: first the invariance means that

ν = ν F^{- 1}

where the latter is the measure which assigns to each measurable set

A \subset R_{+}

the number

ν (F^{- 1} (A))

. Second, expressing this measure as

ν (d x) = h (x) d x

, the invariance property translates into the following functional equation for the density h:

h (x) = \sum_{y \in F^{- 1} (x)} \frac{h (y)}{| F^{'} (y) |} = \frac{1}{{(1 + x)}^{2}} h (\frac{x}{1 + x}) + h (x + 1)

and one immediately checks that a continuous solution is

h (x) = 1 / x

. Note that

h \notin L^{1} (R_{+}, d x)

, that is

ν

is an infinite F-invariant a.c. measure. On the other hand, as the function

ρ

establishes a topological conjugacy between R and the dyadic odometer K (see (14)), it provides a topological conjugacy also between F and the doubling map

D : [0, 1] \to [0, 1]

(as shown in [5]), i.e.

F = ρ^{- 1} \circ D \circ ρ, D (x) = 2 x (\mod 1)

(18)

The map D acts as a shift on binary expansions and preserves the Lebesgue measure on the unit interval7.

Since Lebesgue measure is preserved also by the invertible map K, the conjugacies (14) and (18) ensure that both F and R leave invariant the probability measure

d ρ

.

On the other hand, all orbits

{R^{i} (x) : i \geq 0}

,

x \in {\bar{R}}_{+}

being dense, the dynamical system

({\bar{R}}_{+}, R)

is uniquely ergodic and therefore

d ρ

is its unique invariant measure. In a different guise, the map F possesses several invariant measures, two of which are

d ν

and

d ρ

, which are of course singular with respect to one another. In particular, as the entropy of the doubling map D with respect to the Lebesgue measure is

log 2

, this same value is also the entropy of F with respect to the probability measure

d ρ

, which is therefore called the measure of maximal entropy for F.

4.1. An Alternative Ordering

Proposition 4 can be viewed as expressing the fact that the "horizontal" action of the map R respects the order induced by the "vertical" action of the map F on the tree. Moreover, the conjugation (18) between F and D can be obtained in two steps, passing via the map

ϕ

through the orientation preserving Farey map

\tilde{H}

, so that

F = ϕ^{- 1} \circ \tilde{H} \circ ϕ

. We can ask whether there is an orientation reversing version of the above constructions. For instance, if we consider the standard Farey map H, then the map

G = ϕ^{- 1} \circ H \circ ϕ

, given by

G : x \mapsto \{\begin{matrix} \frac{x}{1 - x}, & 0 \leq x \leq 1 \\ \frac{1}{x - 1}, & x > 1 \end{matrix}

(19)

is conjugated via

ρ

with the tent interval map T, i.e. (18) is replaced by

G = ρ^{- 1} \circ T \circ ρ

. Therefore,

d ρ

is the measure of maximal entropy for G as well. In addition, one easily verifies that G preserves also the a.c. measure with density

1 / (x (1 + x))

. We also note that

G (Φ) = Φ

where

Φ = (\sqrt{5} + 1) / 2

is the golden mean. Since

| G^{'} (Φ) | = 1 + Φ

is a repelling fixed point.

Now, what is the map

S : {\bar{R}}_{+} \to {\bar{R}}_{+}

which plays the role of R in this orientation reversing setting? A close inspection based on continued fraction expansions leads to the following expression:

\begin{matrix} S : x = & [a_{0}; a_{1}, a_{2}, \dots] t r i m = 0 m m 0 m m 2 m m - 1 m m, c l i p ⟼ t r i m = - 0.5 m m 0 m m 2 m m - 1 m m, c l i p ⤏ t r i m = - 0.5 m m 0 m m 2 m m - 1 m m, c l i p ⤏ \\ t r i m = 0 m m 0 m m 2 m m - 1 m m, c l i p ⤏ ⟶ & \{\begin{matrix} [0; n + 1, a_{n} - 1, a_{n + 1}, \dots], & a_{0} = a_{1} = \dots = a_{n - 1} = 1, a_{n} > 1 \\ [a_{1}; a_{2}, a_{3}, \dots], & a_{0} = 0 \\ [0; ℓ + 2], & x = [\underset{ℓ - 1}{\underset{︸}{1; 1, \dots, 1}}, 2] \end{matrix} \end{matrix}

We also set

S (0) = \infty

,

S (\infty) = 1

and

S (Φ) = 0

. Now note that

[\underset{ℓ - 1}{\underset{︸}{1; 1, \dots, 1}}, 2] = \frac{F_{ℓ + 2}}{F_{ℓ + 1}}

where

F_{ℓ}

be the ℓ-th Fibonacci number, given by

F_{- 1} = 1, F_{0} = 0 and F_{ℓ} = F_{ℓ - 1} + F_{ℓ - 2}, ℓ \geq 1

We then construct the sequence

{(x_{k})}_{k \geq 0}

as

x_{k} F_{k} / F_{k - 1}

, whose first elements are

x_{0} = 0, x_{1} = \infty, x_{2} = 1, x_{3} = 2, x_{4} = \frac{3}{2}, x_{5} = \frac{5}{3}, \dots

and observe that S is continuous everywhere but at the points

x_{k}

,

k \geq 1

, where it is right-continuous. An alternative expression for S is thus the following:

S : x \mapsto \frac{F_{k} x - F_{k + 1}}{(k F_{k} - F_{k - 1}) x - k F_{k + 1} + F_{k}}, x \in C_{k}

(20)

where

C_{2 r} = [x_{2 r}, x_{2 r + 2}), C_{2 r + 1} = [x_{2 r + 3}, x_{2 r + 1}), r \geq 0

(21)

One checks that for all

x \in R_{+}

it holds

S^{m} \circ G^{n} (x) = G^{n} \circ S^{2^{n} m} (x), n, m \geq 1 .

(22)

5. Motions on the Modular Surface

F can be obtained as the factor map of a first return map for the geodesic flow on the modular surface. Let us briefly recall what does this mean.

Let

H = \{z = x + i y : x \in R, y \in R_{+}\}

be the upper half-plane, viewed as a Riemmanian manifold with hyperbolic metric

d s^{2} = (d x^{2} + d y^{2}) / y^{2}

. Set moreover

M = Γ ∖ H = {Γ z : z \in H}

, with

Γ = P S L (2, Z)

, endowed with the quotient topology. We recall that the Fuchsian group

Γ

has two generators U and V, which can be chosen as

U = (\begin{matrix} 0 & 1 \\ - 1 & 0 \end{matrix})

and

V = U B^{- 1} = A U = (\begin{matrix} 0 & 1 \\ - 1 & 1 \end{matrix})

. It holds moreover

U^{2} = V^{3} = I

(so that

Γ

is not a free group).

Let

φ_{t} : S M \to S M

be the geodesic flow on the unit tangent bundle of M, and let us construct a subset of

S M

which is met infinitely many times by each

φ_{t}

-orbit. To this end set

I = \{z = x + i y : x = 0, y \in R^{+}\} \subset H

and consider the section C made by the projections on

S M

of all vectors of

S H

having base point on

I

and right-oriented, that is vectors of the form

v = (z, θ)

with

z \in I

and

θ \in (π, 2 π)

. One easily sees that the elements thus selected are all distinct. There are however

φ_{t}

-orbits which do not visit C infinitely often. These are exactly the projections of geodesics which either start or end in a cusp of

P S L (2, Z)

, that is a rational point on the real line. On

S M

these orbits converge towards (or come from) the cusp at infinity and for this reason they are called scattering geodesics. They form of course a set of zero measure.

Now, a vector

v \in S H

whose projection lies in C can be described by the two asymptotic coordinates u and w which identify the geodesic

γ (v, t)

having tangent vector v at

t = 0

. Whence,

C \{(u, w) : u < 0 < w\}

In turn C can be decomposed as

C = C_{1} \cup C_{2}

where

C_{1} = {(u, w) : u < 0 < w < 1}, C_{2} = {(u, w) : u < 0, w > 1}

The next figure shows a geodesic

γ

such that the projection on

S M

of

γ \cap I

belongs to

C_{2}

.

Figure 5.

We now construct the first return map

T_{C} : C \to C

which sends each intersection of a

φ_{t}

-orbit with C to the next one. To this end, we consider the geodesic triangle

G

with vertices 0, 1 and ∞, that is

G = {z \in H | 0 < Re z < 1, | z - \frac{1}{2} | > \frac{1}{2}}

Its three sides are equivalent w.r.t.

P S L (2, Z)

:

\hat{01}

and

\hat{1 \infty}

are mapped to

I

by the transformations

U V^{2} \equiv A^{- 1} : z \to z / (1 - z)

and

U V \equiv B^{- 1} : z \mapsto z - 1

respectively. Now, suppose that the projection of

v \in S H

lies in C and has coordinates

(u, w)

. There are two possibilities: if the projection of v lies in

C_{2}

(so that the geodesic

γ

determined by v leaves

G

through

\hat{1 \infty}

), then it is mapped by

B^{- 1}

to

(u - 1, w - 1)

; if instead the projection of v lies in

C_{1}

(so that

γ

leaves

G

through

\hat{01}

), then it gets mapped by

A^{- 1}

to

(\frac{u}{1 - u}, \frac{w}{1 - w})

. Therefore the first return map on

C = C_{1} \cup C_{2}

is

T_{C} : (u, w) \mapsto \{\begin{matrix} (\frac{u}{1 - u}, \frac{w}{1 - w}), & (u, w) \in C_{1} \\ (u - 1, w - 1), & (u, w) \in C_{2} \end{matrix}

(23)

The action of

T_{C}

on the second coordinate finally yields the factor map

F : R_{+} \to R_{+}

given by (6).

Figure 6.

Now, referring to the figure above, one can produce a tessellation of

H

by taking all the images of the geodesic triangle

G

with the isometries A and B (acting as Möbius transformations). Moreover, a direct consequence of the generating rule (13) is that, given

x = p / q

, the matrix product X dealt with in Proposition 1, as well as the corresponding binary sequence

σ (x) \in {0, 1}^{*}

, are in a one-to-one correspondence with the coding w.r.t. the above tessellation of the scattering geodesic

c_{p / q}

which converges to

p / q

, the central cusp of the geodesic triangle

X (G)

(see [16]).

Figure 7.

In a similar fashion as finite paths on

T

correspond to scattering geodesics on

H

, we can establish a correspondence between FC words and Ford circles. These are a countable family of circles orthogonal to the sides of the just mentioned geodesic triangles. Each of them, denoted

C_{\frac{p}{q}}

, is tangent to

R

in some rational point

p / q

, and has diameter

1 / q^{2}

. The largest circles have thus unit diameter and correspond to

C_{n}

,

n \in Z

(the following picture shows

C_{0}

,

C_{\frac{1}{3}}

,

C_{\frac{1}{2}}

,

C_{\frac{2}{3}}

and

C_{1}

).

Figure 8.

Clearly, each Ford circle

C_{\frac{p}{q}}

with

\frac{p}{q} \geq 0

corresponds to a unique FC word w with

\frac{p}{q} = \frac{{| w |}_{1}}{{| w |}_{0}}

, and vice versa.

Ford circles and scattering geodesics are related as follows:

first, the image with

X_{\frac{p}{q}} = (\begin{matrix} n & m \\ t & s \end{matrix}) \in S L (2, Z)

of the vertical geodesic

I = {z = i e^{τ} : τ \in R}

is a geodesic connecting

X_{\frac{p}{q}} (0) = \frac{m}{s}

and

X_{\frac{p}{q}} (\infty) = \frac{n}{t}

.

X_{\frac{p}{q}} (G)

is a Farey triangle with central cusp in

\frac{p}{q} = \frac{m + n}{s + t}

.

If, instead, we apply

X_{\frac{p}{q}}

to the positive and negative horocycles of

v = (i, 0) \in T H

, namely the horizontal line

H^{+} = {z = i + τ : τ \in R}

(B-invariant) and the circle

H^{-} = {z = \frac{i}{1 + i τ} : τ \in R}

(A-invariant) we obtain two Ford circles:

$C_{\frac{n}{t}}$ , of diameter $\frac{1}{t^{2}}$ and tangent to $R$ in $\frac{n}{t}$ ,
$C_{\frac{m}{s}}$ , of diameter $\frac{1}{s^{2}}$ and tangent to $R$ in $\frac{m}{s}$ ,

which touch each other at the point

X_{\frac{p}{q}} (i)

. The “child" circle

C_{\frac{p}{q}}

touches the cusp at

\frac{p}{q}

, and the “parents" circles

C_{\frac{n}{t}}

and

C_{\frac{m}{s}}

at

X_{\frac{p}{q}} B (i)

and

X_{\frac{p}{q}} A (i)

, respectively. Finally, the geodesics that cross

C_{\frac{p}{q}}

perpendicularly (in particular

c_{\frac{p}{q}}

) converge at the cusp.

Example.

X_{\frac{1}{2}} = A = (\begin{matrix} 1 & 0 \\ 1 & 1 \end{matrix})

,

C_{\frac{1}{2}} = A^{2} (H^{+}) = A B (H^{-})

(see the figure above).

One easily checks that two Ford circles

C_{\frac{p}{q}}

e

C_{\frac{p^{'}}{q^{'}}}

, with

\frac{p}{q} < \frac{p^{'}}{q^{'}}

, are either tangent to each other or they do not intersect, and the former situation occurs whenever

p^{'} q - p q^{'} = 1

. Moreover, three Ford circles

C_{\frac{p}{q}}

,

C_{\frac{p^{'}}{q^{'}}}

and

C_{\frac{p^{''}}{q^{''}}}

with

\frac{p}{q} < \frac{p^{''}}{q^{''}} < \frac{p^{'}}{q^{'}}

are tangent to each other if and only if

\frac{p^{''}}{q^{''}} = \frac{p}{q} \oplus \frac{p^{'}}{q^{'}}

(see, e.g., Theorems 5.6 and 5.7 in [1]).

We can say more, but first we briefly present the classical correspondence between a matrix

X \in PSL (2, R)

and

v = (z, θ) \in S H

. Given

v = (z, ζ) \in S H

, with

z \in H

and

ζ \in T_{z} H ≃ C

, we can identify

S H

with

PSL (2, R)

by corresponding v to the unique element

g \in PSL (2, R)

such that

z = g (i)

and

ζ = d g (ζ_{0}) = g^{'} (z) ζ_{0}

, where

ζ_{0}

is the unit vector tangent to the imaginary axis. One can also write the unit tangent vector as

ζ = Im (z) e^{i (θ + \frac{π}{2})}

where

θ

is the angle formed by

ζ

with the vertical line, measured counterclockwise. By identifying

ζ

with

θ

, we obtain the parametrization

v = (z, θ)

for the points in

S H

, and

(z, θ) = (g (i), β_{g} (0))

where

g = (\begin{matrix} a & b \\ c & d \end{matrix})

is given by

z = g (i) = \frac{b + i a}{d + i c}, θ = β_{g} (0) = - 2 arg (d + i c) = - 2 {tan}^{- 1} (\frac{c}{d})

(24)

In this way, the action of the positive and negative horocyclic flow

h_{t}^{+}

and

h_{t}^{-}

on

PSL (2, R)

corresponds to the right multiplication by one-parameter subgroups of matrices

n_{t}^{+} = (\begin{matrix} 1 & t \\ 0 & 1 \end{matrix}), h_{t}^{+} ⟷ g n_{t}^{+} a n d n_{t}^{-} = (\begin{matrix} 1 & 0 \\ t & 1 \end{matrix}), h_{t}^{-} ⟷ g n_{t}^{-}

(25)

This also assures us of the commutativity between isometries and flows, since the former act from the left while the latter act from the right. Finally we can say the following: consider the correspondence between an element

x \in T

and

X \in SL (2, Z)

, given by (2), and the correspondence between a matrix

X \in SL (2, Z)

, viewed as an element of

PSL (2, R)

, and

v = (z, θ) \in S H

, given by (24). This gives a correspondence between elements in

T

and points

z \in H

, as follows:

x = \frac{m}{s} \oplus \frac{n}{t} ⟶ X = (\begin{matrix} n & m \\ t & s \end{matrix}) ⟶ v = (X (i), β_{X} (i)) ⟶ X (i)

(26)

recalling that

β_{X} (i) = - 2 {tan}^{- 1} (t / s)

.

However, this correspondence is not a bijection since the same point in

H

can be associated to multiple point in

S H

and hence to multiple

X \in SL (2, Z)

which are not even associated to some

x \in T

. But considering the direction from

x \in T

to

z \in H

, which is well defined, we get a correspondence between x and

z = X (i)

.

Moreover, for our scope, we just need to prove that:

X_{1} = (\begin{matrix} n & m \\ t & s \end{matrix}) a n d X_{2} = (\begin{matrix} m & - n \\ s & - t \end{matrix})

correspond to

v_{1}, v_{2} \in S H

with

z_{1} = z_{2}

and opposite vectors

θ_{1}

and

θ_{2}

.

But this is easily shown considering:

\frac{- n + m i}{- t + s i} = \frac{- n + m i}{- t + s i} \cdot \frac{- i}{- i} = \frac{m + n i}{s + t i}

and, recalling that

{tan}^{- 1} (x) + {tan}^{- 1} (\frac{1}{x}) = \pm \frac{π}{2}

,

- 2 {tan}^{- 1} (\frac{t}{s}) + 2 {tan}^{- 1} (\frac{s}{- t}) = - 2 ({tan}^{- 1} (\frac{t}{s}) + {tan}^{- 1} (\frac{s}{t})) = \pm π .

So, we have a direct way to determine both x and z from

X \in PSL (2, Z)

, where z is obtained in the canonical way, and

x = \frac{m}{s} \oplus \frac{n}{t} = \frac{n}{t} \oplus \frac{m}{s} \frac{- n}{- t} \oplus \frac{m}{s}

(27)

Example. As in the previous example, we have

C_{\frac{1}{2}} = A^{2} (H^{+}) = A B (H^{-})

, which indeed is the negative horocycle for

v_{1} = (z_{1}, θ_{1})

, with

z_{1} \leftrightarrow A^{2} \leftrightarrow \frac{1}{3}

and the positive horocycle for

v_{2} = (z_{2}, θ_{2})

, with

z_{2} \leftrightarrow A B \leftrightarrow \frac{2}{3}

(see Figure ).

With the elements presented thus far, we can show that the horizontal movement on

T

corresponds to horocyclic flows along Ford circles. To this end we present first the following.

Lemma 5.

The horocyclic flow with unit time on a Ford circle moves from a tangency point with another Ford circle to the next one.

Proof.

From the content of this section, we know that the Ford circles associated with

\frac{1}{0}

(the horizontal line) and

\frac{0}{1}

can be mapped to any other Ford circle

C_{x}

via an isometry. We can consider the Ford circle

C_{x}

associated with

\frac{p}{q}

and the tangency point with another Ford circle

C_{x^{'}}

associated with

\frac{p^{'}}{q^{'}}

. Then, both horocyclic flows, with either negative or positive unit time, are mapped to the respective flows on the Ford circles

C_{\frac{1}{0}}

and

C_{\frac{0}{1}}

. For these, it can be directly checked that, moving with unit time (positive or negative), we are moving from the starting tangency point

z = i

to the next one in the corresponding direction along the corresponding horocycle. This proves the lemma. □

To state the next result, for any positive integer t we set:

A^{t} (\begin{matrix} 1 & 0 \\ t & 1 \end{matrix}) \equiv h_{t}^{-}, D^{t} B^{- t} = (\begin{matrix} 1 & - t \\ 0 & 1 \end{matrix}) \equiv h_{t}^{+}

so that, in particular,

A^{1} = h_{t = 1}^{-} = A

and

D \equiv D^{1} = h_{t = - 1}^{+} = B^{- 1}

.

Then, the horocyclic flows with time t correspond to either

A^{t}

or

B^{t}

, as in (25). Moreover, as shown in (26) and (27), we recall that each fraction x in

T

(and

\hat{T}

) corresponds to the tangency point between the parents of the Ford circle

C_{x}

, and vice versa.

We can now state the following:

Theorem 5.

The horizontal displacement on

T

, starting at the root 1 and moving from left to right on each level, corresponds to clockwise motion along Ford circles. More precisely, assume that we reached

x = r_{m}

, the m-th element of

T

, as in Theorem 4, with

d e p t h (x) = n

. Then, the move to the next element

y = r_{m + 1}

corresponds to the following displacement (via horocyclic flow) on Ford circles:

if x is the rightmost element in a level, i.e. $m = 2^{n} - 1$ , then moving to y corresponds to applying $D^{n - 1} A^{n}$ for n even, and $A^{n - 1} D^{n}$ for n odd;
if, instead, x is either the leftmost or an inner element in a level, i.e. $m = 2^{n - 1} + (k - 1)$ for some $1 \leq k < 2^{n - 1}$ and $k = 2^{p - 1} (\mod 2^{p})$ , with $1 \leq p \leq n - 2$ , then moving to y corresponds to applying $A^{1 + 2 (p - 1)}$ if $n = k (\mod 2)$ , $D^{1 + 2 (p - 1)}$ otherwise.

Proof.

Firstly, it is important to note that when considering the horocyclic flows, each time we move from one Ford circle to another tangent to it, the vector switches direction from inward to outward, or vice versa. This means that, since the movement is clockwise, we transition from the positive horocyclic flow with negative time

h_{- t}^{+} \equiv D^{t}

(to the left of the vector) to the negative horocyclic flow with positive time

h_{t}^{-} \equiv A^{t}

(to the right of the vector), or vice versa, from

h_{t}^{-} \equiv A^{t}

to

h_{- t}^{+} \equiv D^{t}

. Since each level

n > 1

of the tree contains an even number of elements, as we move along the level, we perform an odd number of swaps between horocycles before reaching the last element

\frac{n}{1} \in T

. This element corresponds to

z = (n - 1) + i \in H

, i.e. the point of tangency between

C_{\frac{1}{0}}

and

C_{\frac{n - 1}{1}}

(the parents of

C_{\frac{n}{1}}

). As a result, the vector

v_{\frac{n}{1}}

will point in the opposite direction compared to

v_{\frac{n - 1}{1}}

w.r.t.

C_{\frac{1}{0}}

. Therefore, when moving from one level to the next, say from n to

n + 1

, we alternate between

D^{n - 1} A^{n}

, when n is odd, and

A^{n - 1} D^{- n}

, when n is even. In this way, the direction of the vector v is reversed two more times, and the next level

n + 1

start from

\frac{1}{n + 1}

with the vector in the opposite direction compared to

\frac{1}{n}

. Thus, the horocyclic flow that begins at the start of a level n of the tree correspond to A if n is odd, and to D if n is even.

Now let

x = r_{m}

, where

m = 2^{n - 1} + (k - 1)

,with

1 \leq k \leq 2^{n - 1}

, so that it is the k-th element of the n-th level of

T

. If we want to move horizontally to the next element

r_{m + 1}

, we have two possibilities: either

k < 2^{n - 1}

, in which case we move to position

k + 1

on the same level, or

r_{m + 1}

is the first element of the next level

n + 1

. However, we have already discussed this case, so, from now on, we will consider

k < 2^{n - 1}

.

If k is odd, then x is the left child of its parent node

x^{'}

, and

r_{m + 1}

is the right child. In

H

, each of these two corresponds to the tangency points between the Ford circle

C_{x^{'}}

of

x^{'}

and the Ford circle of the other parent. Therefore, as in Lemma 5, moving from one point to the next along

C_{x^{'}}

corresponds to the horocyclic flow with

| t | = 1

, which, depending on the orientation of the vector v, corresponds to A if n is odd, or D if n is even.

If, instead, k is even, then we have a right child, and its parent is different from the parent of

r_{m + 1}

. Indeed, we need to go back at least two levels to find a common ancestor. Considering the structure of the tree, one can see that for

k = 1, 2, 3, \dots, 2^{n - 2}, \dots, 2^{n - 1}

, the number of steps needed to reach the common ancestor is 1, 2, 1, 3, 1, 2, 1,

4, \dots, 1

,

n - 1

, 1, …, 1. In general, for

k = 2^{p - 1} (mod 2^{p})

, for

1 \leq p \leq n - 2

we need p steps. This can be easily proven by induction on the level of the tree. For

n = 2

, it is trivially true. Assuming the formula holds for levels up to n, it follows that, by construction, for all the new left children, which correspond to

k = 1 (mod 2) = 2^{0} (mod 2^{1})

, the formula holds. For a given right child x, the common ancestor with the node directly to its right, which coincides with the common ancestor of its parent

x^{'}

with the node to its right, is one step further than the number of steps required from its parent

x^{'}

. By induction, from

x^{'}

, corresponding to

k^{'} = 2^{p - 1} (mod 2^{p})

, we need p steps, so from x we will need

p + 1

. From one level to the next the nodes duplicate, and x will be at the position

k = 2 k^{'}

so that

k = 2^{p} (mod 2^{p + 1})

, as required.

We have that both

r_{m}

and

r_{m + 1}

correspond to points on the Ford circle associated with the (nearest) common ancestor

y \in T

, specifically to the points of tangency with their respective parent. On the horocycle, between them, there are

2 (p - 1)

points, where p is the number of steps required to reach the common ancestor. Indeed, all the nodes traversed while moving up from

r_{m}

to the ancestor form a Farey pair with y, as do the nodes traversed to reach down to

r_{m + 1}

, and, by the properties of

T

and the Ford circles, these are all and only the points that lie between them. Thus, following the ideas in the proof of Lemma 5, this movement corresponds to the horocyclic flow with time

| t | = 1 + 2 (p - 1)

. The exact one, A or D, depends on m, and, more directly, on n and k. As we have seen, for n even, odd k corresponds to D and even k corresponds to A, while the reverse is true when n is odd. □

We already showed how the scattering goedesics in

H

are correlated with the vertical movement on the Stern-Brocot tree

T

. With this theorem, we established a parallel between Ford horocycles, which are orthogonal to the geodesics defined in the Farey tessellation, and the horizontal movement on

T

.

Remark 11.

The repeated horizontal movement on

T

can be interpreted geometrically as a cyclical movement along the upper arcs of the Ford circles and, dynamically, as a repeated composition of horocyclic flows. This corresponds to a repeated right multiplication of matrices, expressed as:

\begin{matrix} (A) D \\ (A D^{2}) A D^{3} A \\ (D^{2} A^{3}) D A^{3} D A^{5} D A^{3} D \\ (A^{3} D^{4}) A D^{3} A D^{5} A D^{3} A D^{7} A D^{3} A D^{5} A D^{3} A \\ (D^{4} A^{5}) \dots \end{matrix}

where the brackets correspond to the jump to the next level on

T

, or equivalently, to the return to i in

H

and subsequent descent towards

X_{\frac{1}{n + 1}} (i) \leftrightarrow \frac{1}{n + 1}

.

Remark 12.

If one want to consider the horizontal movement on the n-th level of

T

as composition of horocyclic flows but always resetting and starting from

(i, 0) \in S H

, we would have

\begin{matrix} (I_{2}) \\ (A) D \\ (A^{2}) D A^{3} D \\ (A^{3}) D A^{3} D A^{5} D A^{3} D; \\ (A^{4}) D A^{3} D A^{5} D A^{3} D A^{7} D A^{3} D A^{5} D A^{3} D \\ ⋮ \end{matrix}

which more clearly show the palindromic and symmetric nature of the movement along a level of

T

, obviously already present in Theorem 5.

To conclude, we provide figures to visualize the motions described in Theorem 5. In the first figure, we indicate the direction of traversal of the circles, which will be omitted in the subsequent figures, as it remains the same, i.e., clockwise. Additionally, clockwise is considered the negative direction along the horizontal line

C_{\frac{1}{0}}

. After the first two figures we will omit vectors and points to reduce clutter. Moreover, in all figures, we color-code the horocyclic flows: red for the negative horocycle

H^{-}

, associated with positive time, and blue for the positive horocycle

H^{+}

, associated with negative time. Specifically, red represents

A^{t}

, and blue represents

D^{t}

, where

t - 1

denotes the number of tangent points that must be surpassed to reach the end of the arc. A note is due: in the figures showing the movement on the n-th level, we have added, for completeness, the descent from

\frac{1}{1}

to the first element of the n-th level, which would not be included in the movement through the level. Visually, it correspond to the leftmost colored arc, descending from i along

C_{\frac{0}{1}}

.

Figure 9. Movement on the second level of

T

and transition to the third.

Figure 9. Movement on the second level of

T

and transition to the third.

Figure 10. Movement on the third level.

Figure 11. Transition to the fourth level.

Figure 12. Movement on the fourth level.

1	Defining FC words by reversed concatenation does not really change matters. In particular, it is easy to show by induction that FC words defined as above (resp. by reversed concatenation) are also Lyndon words, i.e. they are minimal (resp. maximal) w.r.t. cyclic permutations. We should also notice that what we call here Farey-Christoffel words, to emphasize their relation with the Farey order of the rationals, are commonly called just Christoffel words [2] since they have been studied for the first time by Christoffel in 1875, see [9].
2	The definition of $T_{0}$ given by the theorem is equivalent to saying that for each subword $0^{n} 1$ we substitute each of the first $n - 1$ zeros with 01, while what remains, i.e. 01, we substitute with 1.
3	A Sturmian sequence is defined as a billiard sequence, and to apply T one must add a 0 as a prefix, which corresponds to considering the path on the lattice $Z^{2}$ instead of the cutting sequence.
4	Western music, since its Greek origins, has primarily used the fifth interval as a generator of harmonic systems.
5	If the slope of the initial sequence w is smaller than one we set $a_{0} = 0$ . On the other hand the value of a single symbol can be taken to be ∞ (as it seems natural when passing to infinite sequences by indefinite repetition of the finite string).
6	In the first case, if $a_{1} = 1$ one sets $[0; a_{1} - 1, a_{2}, \dots] = [a_{2}; a_{3}, a_{4}, \dots]$ .
7	This in particular entails that F is chaotic : is topologically transitive, its periodic orbits are dense and has sensitive dependence on initial conditions.

References

Apostol, T. M. Modular functions and Dirichlet series in number theory; Graduate Text in Mathematics; Springer-Verlag, 1976. [Google Scholar]
Berstel, J; Lauve, A; Reutenauer, C; Saliola, F. Combinatorics on words: Christoffel words and repetitions in words; CRM Monograph Series; 2008; Volume 27. [Google Scholar]
Berthé, V; Delecroix, V. Beyond substitutive dynamical systems: S-adic expansions. RIMS Lecture note Ko^kyu^rokuBessatsu 2014, B46, 81–123. [Google Scholar]
Berthé, V; de Luca, A; Reutenauer, C. On an involution of Christoffel words and Sturmian morphisms. European Journal of Combinatorics 2008, 29, 535–553. [Google Scholar] [CrossRef]
Bonanno, C; Isola, S. Orderings of the rationals and dynamical systems. Colloquium Mathematicum 2009, 116, 165–189. [Google Scholar] [CrossRef]
Bugeaud, Y; Conze, J-P. Calcul de la dynamique de transformations linéaires contractantes mod 1 et arbre de Farey. Acta Arithmetica LXXXVIII 1999, 3, 201–218. [Google Scholar] [CrossRef]
A Brocot, Calcul des rouages par approximation, nouvelle méthode. Revue Chronométrique 1860, 6, 186–194.
Carey, N.; Clampitt, D. Aspects of well-formed scales. Music Theory Spectrum 1989, 11, 187–206. [Google Scholar] [CrossRef]
Christoffel, E B. Observatio arithmetica. Annali di Matematica Pura ed Applicata 1875, 6, 148–152. [Google Scholar] [CrossRef]
Calkin, N; Wilf, H S. Recounting the rationals. Amer. Math. Monthly 2000, 107, 360–363. [Google Scholar] [CrossRef]
Domínguez, M.; Clampitt, D.; Noll, T. WF Scales, ME Sets, and Christoffel Words; MCM 2007, CCIS 37; Klouche, T., Noll, T., Eds.; Springer-Verlag, 2009; Volume 37, pp. 477–488. [Google Scholar]
Graham, R L; Knuth, D E; Patashnik, O. Concrete Mathematics; Addison-Wesley, 1990. [Google Scholar]
A De Luca, Sturmian words: structure, combinatorics, and their arithmetics. Theoretical Computer Science 1997, 183, 45–82. [CrossRef]
Isola, S. Su alcuni rapporti tra matematica e scale musicali, La Matematica nella Società e nella Cultura. Rivista dell’Unione Matematica Italiana, Serie I 2016, Vol. 1(N. 1), 31–50. [Google Scholar]
Hellegouarch, Y. Gammes naturelles, first part in Gazette SMF 81 (1999) 25-39; second part in Gazette SMF 82 (1999), 13-25.
Knauf, A. Number theory, dynamical systems and statistical mechanics. Reviews in Mathematical Physics 1999, 11, 1027–1060. [Google Scholar] [CrossRef]
Kuipers, L; Neiderreiter, Neiderreiter. Uniform distribution of sequences; Wiley: New York, 1974. [Google Scholar]
Newman, M. Recounting the rationals. Continued, Amer. Math. Monthly 2003, 110, 642–643. [Google Scholar]
Fogg, N Pytheas. Substitutions in Dynamics, Arithmetics and Combinatorics; LNM 1794; Springer, 2002. [Google Scholar]
Queffélec, M. Dynamical systems arising from substitutions; Springer Berlin Heidelberg: Berlin, Heidelberg, 1987. [Google Scholar]
C Series, The geometry of Markoff numbers. The Mathematical Intelligencer 1985, 7, 20–29. [CrossRef]
Stern, M. Über eine zahlentheoretische Funktion. Journal für die reine und angewandte Mathematik 1858, 55, 193–220. [Google Scholar]
Solomyak, B. A note on spectral properties of random S-adic systems. 2025. Available online: https://arxiv.org/abs/2403.08884.
Reutenauer, C. From Christoffel Words to Markoff Numbers; Oxford University Press, 2018. [Google Scholar]
Richards, I. Continued fractions without tears. Mathematical Magazine 1981, 54(n. 4). [Google Scholar] [CrossRef]
J Von Neumann, Zur Operatorenmethode in klassischen Mechanik. Ann. Math. 1932, 33, 587–642. [CrossRef]

Figure 1. The first five levels of the Stern-Brocot tree.

Figure 3.

Figure 4.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.

Words and Numbers: A Dynamical Systems Perspective

Abstract

Keywords:

Subject:

1. Preliminaries

2. Relation with Cutting and Sturmian Sequences

3. Relation with Continued Fractions

3.1. Reversals and Duality

3.2. Motions on $\hat{T}$ and $\hat{F}$ .

4. Ordering and Dynamical Systems

4.1. An Alternative Ordering

5. Motions on the Modular Surface

References

MDPI Initiatives

Important Links

Subscribe

Words and Numbers: A Dynamical Systems Perspective

Abstract

Keywords:

Subject:

1. Preliminaries

2. Relation with Cutting and Sturmian Sequences

3. Relation with Continued Fractions

3.1. Reversals and Duality

3.2. Motions on T ^ and F ^ .

4. Ordering and Dynamical Systems

4.1. An Alternative Ordering

5. Motions on the Modular Surface

References

MDPI Initiatives

Important Links

Subscribe

3.2. Motions on $\hat{T}$ and $\hat{F}$ .