The Collatz Conjecture and the Spectral Calculus for Arithmetic Dynamics

James Hateley

doi:10.20944/preprints202511.1440.v1

Submitted:

19 November 2025

Posted:

19 November 2025

Abstract

We develop an operator--theoretic framework for the Collatz map based on its backward transfer operator acting on weighted Banach spaces of arithmetic functions. The associated Dirichlet transforms form a holomorphic family that captures the complex--analytic evolution of iterates and admits a decomposition into a zeta--type pole at $s=1$ and a holomorphic remainder. Within a finer multiscale space adapted to the Collatz preimage tree, we establish a Lasota--Yorke inequality with an explicit contraction constant $\lambda<1$, giving quasi--compactness and a spectral gap at the dominant eigenvalue. The resulting invariant density is strictly positive and exhibits a $c/n$ decay profile. We formulate a general criterion showing that, under a verified quasi--compactness hypothesis with isolated eigenvalue $1$, the forward dynamics admit no infinite trajectories. The framework provides a coherent spectral perspective on the Collatz operator and suggests a broader analytic approach to arithmetic dynamical systems.

Keywords:

Collatz conjecture; transfer operators; Lasota–Yorke inequality; invariant densities; Dirichlet transforms; nonlinear integer dynamics; quasi-compactness

Subject:

Computer Science and Mathematics - Algebra and Number Theory

1. Introduction

The Collatz conjecture asserts that every positive integer $n$ eventually reaches the cycle $1 \to 4 \to 2 \to 1$ under repeated application of

T(n) = \begin{cases} n / 2, & n \text{ even}, \\ 3 n + 1, & n \text{ odd}. \end{cases}

(1)

Equivalently, every forward orbit $O^{+}(n) = \{T^{k}(n) : k \geq 0\}$ is conjectured to enter the set $\{1, 2, 4\}$. Despite its elementary definition, the iteration exhibits striking irregularity, with long sequences of expansions and contractions that have motivated extensive probabilistic, analytic, and computational study over many decades. Classical work of Terras [15,16] established early density results and stopping-time estimates, while the surveys of Lagarias [7,8] synthesized a wide range of heuristic and structural approaches. Subsequent analytic contributions, including those of Meinardus [11] and Applegate–Lagarias [1], have developed refined density bounds and asymptotic estimates for the distribution of orbits. Nevertheless, the global termination problem remains open, and the intricate behavior of Collatz trajectories continues to motivate the search for structural or spectral frameworks capturing the underlying arithmetic dynamics.
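The map (1) and its forward orbits are easy to experiment with. The following is a minimal sketch, not part of the paper's argument; the function names `collatz_step` and `forward_orbit` and the `max_steps` cutoff are illustrative assumptions.

```python
def collatz_step(n: int) -> int:
    """One application of the map T from (1)."""
    if n < 1:
        raise ValueError("T is defined on positive integers only")
    return n // 2 if n % 2 == 0 else 3 * n + 1


def forward_orbit(n: int, max_steps: int = 10_000) -> list[int]:
    """List n, T(n), T^2(n), ... until 1 is reached or max_steps is exceeded.

    The cutoff is a safety assumption: termination is exactly what the
    conjecture asserts, so it cannot be taken for granted here.
    """
    orbit = [n]
    while orbit[-1] != 1 and len(orbit) <= max_steps:
        orbit.append(collatz_step(orbit[-1]))
    return orbit


if __name__ == "__main__":
    print(forward_orbit(6))
```

Stopping at the first visit to 1 is a convention of this sketch; continuing the iteration would simply repeat the values 4, 2, 1.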

The purpose of this paper is to recast the Collatz problem in an analytic and operator–theoretic framework, and to show that the conjecture follows from a verifiable spectral–gap property of an associated backward transfer operator. Instead of studying T directly, we analyze its inverse dynamics through the operator

(P f) (n) : = \sum_{m : T (m) = n} \frac{f (m)}{m},

(2)

acting on arithmetic functions

f : N \to C

. Transfer–operator methods of this type originate in statistical mechanics and dynamical systems [13,14], and have more recently been applied to

3 x + 1

–type maps in various analytic and functional–analytic contexts [10,12]. For the Collatz map (1), each n has an even preimage

2 n

and an additional odd preimage

(n - 1) / 3

whenever

n \equiv 4 (mod 6)

, giving

(P f) (n) = \frac{f (2 n)}{2 n} + 1_{{n \equiv 4 (mod 6)}} \frac{f ((n - 1) / 3)}{(n - 1) / 3} .

(3)

The weights $1/m$ normalize the operator so that $P$ preserves the harmonically weighted mass $\sum_{m} f(m)/m$ of non-negative sequences, as made precise in (4), reflecting the logarithmic contraction inherent in the preimage structure of $T$.
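Formula (3) can be evaluated pointwise for any arithmetic function. The sketch below uses exact rational arithmetic; the names `apply_P` and `one` are hypothetical helpers introduced only for illustration.

```python
from fractions import Fraction
from typing import Callable


def apply_P(f: Callable[[int], Fraction], n: int) -> Fraction:
    """Evaluate (P f)(n) as in (3).

    The even preimage 2n always contributes; the odd preimage (n - 1) / 3
    contributes only when n is congruent to 4 modulo 6.
    """
    value = Fraction(f(2 * n)) / (2 * n)
    if n % 6 == 4:
        m = (n - 1) // 3
        value += Fraction(f(m)) / m
    return value


def one(n: int) -> Fraction:
    """The constant function 1(n) = 1."""
    return Fraction(1)


if __name__ == "__main__":
    for n in (1, 4, 10):
        print(n, apply_P(one, n))
```

For instance, at $n = 4$ the two branches contribute $1/8$ and $1$, so the constant function is visibly not fixed by $P$.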

Remark 1

(Invariant density and logarithmic mass balance). The operator $P$ preserves total mass only up to a logarithmic factor, and it does not fix the constant function. Indeed,

(P 1)(n) = \frac{1}{2 n} + 1_{\{n \equiv 4 \pmod 6\}} \frac{3}{n - 1} \asymp \frac{1}{n} \quad (n \to \infty),

so

(P 1) \neq 1

. More generally,

\sum_{n \geq 1} (P f) (n) = \sum_{m \geq 1} \frac{f (m)}{m},

(4)

which shows that P is logarithmically mass–preserving: the pushforward of mass is reweighted by the harmonic kernel

m \mapsto 1 / m

.
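Identity (4) can be checked exactly on finitely supported sequences, since every $m$ sends the single term $f(m)/m$ to its image $T(m)$. The sketch below is an illustration under that finite-support assumption; `push_backward` is a hypothetical name.

```python
from fractions import Fraction


def T(m: int) -> int:
    """The Collatz map (1)."""
    return m // 2 if m % 2 == 0 else 3 * m + 1


def push_backward(f: dict[int, Fraction]) -> dict[int, Fraction]:
    """Compute P f for a finitely supported f given as {m: f(m)}.

    Each m contributes f(m) / m to the entry n = T(m), which is the
    sum over preimages in definition (2).
    """
    out: dict[int, Fraction] = {}
    for m, value in f.items():
        n = T(m)
        out[n] = out.get(n, Fraction(0)) + value / m
    return out


if __name__ == "__main__":
    f = {m: Fraction(m % 5 + 1) for m in range(1, 51)}
    lhs = sum(push_backward(f).values())          # sum over n of (P f)(n)
    rhs = sum(v / m for m, v in f.items())        # sum over m of f(m) / m
    print(lhs == rhs)
```

The same bookkeeping shows why the plain sum of $f$ is not preserved: the factor $1/m$ is applied once per term and never undone.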

This logarithmic balance forces any P–invariant density h to satisfy

P h = h

with a decay of order

1 / n

as

n \to \infty

. In particular, the explicit block recursion developed in Section 5.2, together with the oscillation control provided by the Lasota–Yorke inequality [9], yields the precise asymptotic profile

h (n) \sim \frac{c}{n}, n \to \infty,

consistent with Tauberian heuristics of Delange type [3]. All spectral decompositions in the sequel are expressed relative to this nonconstant

1 / n

–type invariant profile.

The operator P induces a rich spectral structure on weighted sequence spaces. On

ℓ_{σ}^{1}

, defined by

{∥ f ∥}_{σ} = \sum_{n \geq 1} | f (n) | n^{- σ}

, the Dirichlet transform

D f (s) = \sum_{n \geq 1} \frac{f (n)}{n^{s}},

(5)

intertwines P with analytic continuation in the half-plane

ℜ (s) > σ

. Uniform

ℓ_{σ}^{1}

bounds on

P^{k}

translate into exponential envelopes for

D (P^{k} f) (s)

and yield meromorphic continuations of the corresponding Collatz–Dirichlet series, whose pole at

s = 1

reflects the average branching behavior [2,5]. The spectral radius of P on

ℓ_{σ}^{1}

captures the global weighted expansion rate of inverse branches and determines the analytic location of dominant singularities.

To resolve finer dynamical properties, we refine this setting to a multiscale Banach space

B_{tree, σ}

built from dyadic–triadic block averages and oscillation seminorms that encode the hierarchical structure of the Collatz preimage tree. On this space, P satisfies a two-norm Lasota–Yorke inequality,

{[P f]}_{tree, σ} \leq λ_{LY} {[f]}_{tree, σ} + C {∥ f ∥}_{σ}, 0 < λ_{LY} < 1,

placing the dynamics within the classical Ionescu–Tulcea–Marinescu and Hennion spectral frameworks for quasi–compact operators [4,6]. The precise Lasota–Yorke bounds, including the explicit contraction of the odd branch, are developed in Section 4, Section 5 and Section 6.

The main theorem of the paper establishes that when the odd-branch contraction constant

λ_{odd} (α, ϑ)

satisfies

λ_{odd} < 1

for specific parameters

(α, ϑ) = (\frac{1}{2}, \frac{1}{5})

, the backward Collatz operator P possesses a strict spectral gap on

B_{tree, σ}

. The spectral decomposition then implies that every invariant measure of P is supported on the trivial cycle $1 \to 4 \to 2 \to 1$, ruling out any positive-density family of divergent or periodic orbits. A strengthened criterion shows that a non-trivial invariant functional in

B_{tree, σ}^{*}

would contradict the spectral gap, hence all Collatz trajectories must terminate.

The remainder of the paper is organized as follows. Section 2 establishes notation and basic properties of the weighted

ℓ_{σ}^{1}

spaces together with the associated Dirichlet transforms. Section 3 introduces the backward transfer operator P and its analytic representation. Section 4 constructs the multiscale space

B_{tree, σ}

adapted to the Collatz preimage tree and proves the corresponding Lasota–Yorke inequalities. Section 6 verifies that the odd branch admits an explicit contraction constant

λ_{odd} < 1

for the chosen parameters, yielding quasi–compactness and a spectral gap. Finally, Section 7 develops the resulting spectral consequences, formulating a general criterion that links quasi–compactness with the absence of infinite forward trajectories, and situating the Collatz operator within a broader analytical framework for arithmetic dynamical systems.

2. Preliminaries

The analysis begins with a careful description of the function spaces, Dirichlet transforms, and basic structural features of the Collatz map that underlie the spectral study of the backward operator P. Throughout we work with complex-valued arithmetic functions

f : N \to C

.

2.1. Weighted $ℓ^{1}$ Spaces and Dirichlet Transforms

For

σ > 0

we define the weighted

ℓ^{1}

space

ℓ_{σ}^{1} := \{ f : N \to C : {∥ f ∥}_{σ} := \sum_{n \geq 1} \frac{| f (n) |}{n^{σ}} < \infty \} .

(6)

The weight exponent

σ

measures polynomial decay and is chosen so that Dirichlet series associated with f converge absolutely in a half-plane

ℜ (s) > σ

.

Given

f \in ℓ_{σ}^{1}

, we define its Dirichlet transform

D f (s) : = \sum_{n \geq 1} \frac{f (n)}{n^{s}}, ℜ (s) > σ .

(7)

Lemma 1

(Dirichlet convergence). Let

σ > 0

and

f \in ℓ_{σ}^{1}

. Then

D f (s)

in (7) converges absolutely for

ℜ (s) > σ

and defines a bounded holomorphic function on every half-plane

ℜ (s) \geq σ + ε

,

ε > 0

. Moreover,

| D f (s) | \leq {∥ f ∥}_{σ} sup_{n \geq 1} n^{σ - ℜ (s)} .

(8)

Proof.

For

ℜ (s) > σ

,

\sum_{n \geq 1} |\frac{f (n)}{n^{s}}| = \sum_{n \geq 1} \frac{| f (n) |}{n^{ℜ (s)}} = \sum_{n \geq 1} \frac{| f (n) |}{n^{σ}} n^{σ - ℜ (s)} \leq {∥ f ∥}_{σ} sup_{n \geq 1} n^{σ - ℜ (s)} < \infty,

so the series converges absolutely and locally uniformly in

ℜ (s) \geq σ + ε

, giving holomorphy and the bound (8). □
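The bound (8) can be observed numerically on finitely supported sequences, where the supremum equals 1 because $n^{σ - ℜ(s)} \leq 1$ for $n \geq 1$. This is a small sketch under that finite-support assumption; the helper names are illustrative.

```python
def weighted_norm(f: dict[int, complex], sigma: float) -> float:
    """The norm of f in the weighted l^1 space (6), for finitely supported f."""
    return sum(abs(v) / n**sigma for n, v in f.items())


def dirichlet_transform(f: dict[int, complex], s: complex) -> complex:
    """The Dirichlet transform (7), a finite sum for finitely supported f."""
    return sum(v / n**s for n, v in f.items())


if __name__ == "__main__":
    f = {n: (-1) ** n / n for n in range(1, 201)}
    sigma, s = 0.5, complex(0.75, 2.0)
    # Re(s) > sigma, so |D f(s)| should not exceed the weighted norm.
    print(abs(dirichlet_transform(f, s)), weighted_norm(f, sigma))
```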

We write

ℓ^{1} = ℓ_{0}^{1}

for the unweighted space with norm

{∥ f ∥}_{1} = \sum_{n \geq 1} | f (n) |

.

2.2. Coarse Forward Envelopes

Lemma 2

(Coarse k-step envelopes). Let

T : N \to N

denote the Collatz map, (1). For every

n \in N

and

k \in N_{0}

,

\frac{n}{2^{k}} \leq T^{k} (n) \leq 3^{k} n + \frac{3^{k} - 1}{2} .

(9)

Proof.

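Since $T(x) \geq x / 2$ and $T(x) \leq 3 x + 1$ for every $x \in N$, induction on $k$ gives the bounds (9); for the upper bound the inductive step reads $3 (3^{k} n + \frac{3^{k} - 1}{2}) + 1 = 3^{k + 1} n + \frac{3^{k + 1} - 1}{2}$. □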
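The envelopes (9) can be spot-checked in exact integer arithmetic. The following sketch is only a sanity check on a finite range, with the hypothetical helper `envelope_holds`; it is not a substitute for the induction.

```python
def T(n: int) -> int:
    """The Collatz map (1)."""
    return n // 2 if n % 2 == 0 else 3 * n + 1


def envelope_holds(n: int, k_max: int) -> bool:
    """Check n / 2^k <= T^k(n) <= 3^k n + (3^k - 1) / 2 for 0 <= k <= k_max."""
    x = n
    for k in range(k_max + 1):
        lower_ok = x * 2**k >= n                      # avoids fractions
        upper_ok = x <= 3**k * n + (3**k - 1) // 2    # 3^k - 1 is even
        if not (lower_ok and upper_ok):
            return False
        x = T(x)
    return True


if __name__ == "__main__":
    print(all(envelope_holds(n, 30) for n in range(1, 500)))
```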

These envelopes are intentionally crude, yet they ensure that forward iterates of typical arithmetic weights remain controlled on the scales relevant for our Dirichlet and transfer-operator analysis.

2.3. Backward Preimages and the Transfer Recursion

For each

n \geq 1

, define the even and odd preimage sets

E (n) : = {m \in N : T (m) = n, m even}, O (n) : = {m \in N : T (m) = n, m odd} .

Lemma 3

(Preimage structure). For every

n \in N

,

E(n) = \{2 n\}, \qquad O(n) = \begin{cases} \{(n - 1) / 3\}, & n \equiv 4 \pmod 6, \\ \emptyset, & \text{otherwise}, \end{cases}

(10)

and in the first case

(n - 1) / 3

is odd. In particular, each n has either one preimage (even) or two preimages (one even and one odd), and the odd preimage occurs with natural density

1 / 6

.

Proof.

If m is even and

T (m) = n

, then

m / 2 = n

, so

m = 2 n

, establishing

E (n) = {2 n}

.

If m is odd and

T (m) = n

, then

3 m + 1 = n

, so

m = (n - 1) / 3

. This is an integer precisely when

n \equiv 1 (mod 3)

. For m to be odd,

n - 1

must be divisible by 3 but not by 6, so

n \equiv 4 (mod 6)

. In that case

(n - 1) / 3

is odd. The density statement follows since the congruence class

n \equiv 4 (mod 6)

has natural density

1 / 6

. □
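Lemma 3 lends itself to a brute-force comparison: the closed form (10) should agree with a direct search for all $m$ with $T(m) = n$. The sketch below is an illustrative check on a finite range; `preimages` is a hypothetical helper name.

```python
def T(m: int) -> int:
    """The Collatz map (1)."""
    return m // 2 if m % 2 == 0 else 3 * m + 1


def preimages(n: int) -> list[int]:
    """Preimages of n under T according to (10): always 2n, and (n - 1) / 3
    exactly when n is congruent to 4 modulo 6."""
    result = [2 * n]
    if n % 6 == 4:
        result.append((n - 1) // 3)
    return result


if __name__ == "__main__":
    for n in range(1, 301):
        # Every preimage is at most 2n, so a search up to 2n is exhaustive.
        brute = [m for m in range(1, 2 * n + 1) if T(m) == n]
        assert sorted(preimages(n)) == brute
    two_preimages = sum(1 for n in range(1, 601) if len(preimages(n)) == 2)
    print(two_preimages / 600)
```

The printed ratio illustrates the density statement on the range $1 \leq n \leq 600$.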

Hence each n admits exactly one even preimage and possibly one odd preimage when

n \equiv 4 (mod 6)

. The corresponding backward transfer operator is defined as

(P f) (n) : = \sum_{m : T (m) = n} \frac{f (m)}{m} = \frac{f (2 n)}{2 n} + 1_{{n \equiv 4 (6)}} \frac{f (\frac{n - 1}{3})}{(n - 1) / 3} .

(11)

The normalization by

1 / m

reflects the logarithmic contraction of the forward map and ensures a natural mass-balance property.

Lemma 4

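(Weighted mass balance on $ℓ^{1}$). If $f \geq 0$ and $f \in ℓ^{1}$, then

\sum_{n \geq 1} (P f) (n) = \sum_{m \geq 1} \frac{f (m)}{m} .

(12)

Proof.

Every $m \in N$ has exactly one image $T(m)$, so each $m$ contributes the single term $f(m)/m$ to the double sum $\sum_{n \geq 1} \sum_{m : T (m) = n} \frac{f (m)}{m}$, and the rearrangement is justified by non-negativity. Equality (12), which agrees with (4), then follows directly from (11). □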

2.4. Dirichlet Envelope for Iterates of the Backward Operator

The preimage structure allows a crude but useful bound on P acting on

ℓ_{σ}^{1}

.

Proposition 1

(Backward operator bound). Let

σ > 0

and let P be defined by (11). Then

P : ℓ_{σ}^{1} \to ℓ_{σ}^{1}

is bounded and

{∥ P f ∥}_{σ} \leq C_{σ} {∥ f ∥}_{σ}, C_{σ} : = 2^{σ} + 3^{- σ},

(13)

for all

f \in ℓ_{σ}^{1}

. Consequently, for every

k \geq 1

,

{∥ P^{k} f ∥}_{σ} \leq C_{σ}^{k} {∥ f ∥}_{σ} .

(14)

Proof.

From (11),

(P f) (n) = \frac{f (2 n)}{2 n} + 1_{{n \equiv 4 (6)}} \frac{f (\frac{n - 1}{3})}{(n - 1) / 3} .

Hence

{∥ P f ∥}_{σ} \leq S_{even} + S_{odd},

with

S_{even} : = \sum_{n \geq 1} \frac{| f (2 n) |}{2 n n^{σ}}, S_{odd} : = \sum_{\begin{matrix} n \geq 1 \\ n \equiv 4 (6) \end{matrix}} \frac{|f (\frac{n - 1}{3})|}{(\frac{n - 1}{3}) n^{σ}} .

For the even branch, set

m = 2 n

, so

n = m / 2

and

S_{even} = \sum_{\begin{matrix} m \geq 1 \\ m even \end{matrix}} \frac{| f (m) |}{m {(m / 2)}^{σ}} = \sum_{\begin{matrix} m \geq 1 \\ m even \end{matrix}} \frac{2^{σ} | f (m) |}{m^{σ + 1}} \leq 2^{σ} \sum_{m \geq 1} \frac{| f (m) |}{m^{σ}} = 2^{σ} {∥ f ∥}_{σ} .

For the odd branch, write

m = (n - 1) / 3

, so

n = 3 m + 1

and m is odd. Then

S_{odd} = \sum_{\begin{matrix} m \geq 1 \\ m odd \end{matrix}} \frac{| f (m) |}{m {(3 m + 1)}^{σ}} \leq \sum_{m \geq 1} \frac{| f (m) |}{m {(3 m)}^{σ}} = 3^{- σ} \sum_{m \geq 1} \frac{| f (m) |}{m^{σ + 1}} \leq 3^{- σ} {∥ f ∥}_{σ} .

Combining the two estimates gives (13), and iterating yields (14). □
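The bound (13) can be probed numerically on finitely supported sequences. The sketch below samples random coefficients and compares the observed ratio with $C_{σ}$; the sampling scheme and all helper names are assumptions made for illustration only.

```python
import random


def T(m: int) -> int:
    """The Collatz map (1)."""
    return m // 2 if m % 2 == 0 else 3 * m + 1


def apply_P(f: dict[int, float]) -> dict[int, float]:
    """P f for finitely supported f: each m sends f(m) / m to n = T(m), as in (11)."""
    out: dict[int, float] = {}
    for m, value in f.items():
        n = T(m)
        out[n] = out.get(n, 0.0) + value / m
    return out


def weighted_norm(f: dict[int, float], sigma: float) -> float:
    """The weighted l^1 norm (6) of a finitely supported f."""
    return sum(abs(v) / n**sigma for n, v in f.items())


def growth_ratio(f: dict[int, float], sigma: float) -> float:
    """Ratio of the norm of P f to the norm of f, to compare with C_sigma."""
    return weighted_norm(apply_P(f), sigma) / weighted_norm(f, sigma)


def random_sequence(size: int, seed: int) -> dict[int, float]:
    """A reproducible test sequence supported on 1..size."""
    rng = random.Random(seed)
    return {n: rng.uniform(-1.0, 1.0) for n in range(1, size + 1)}


if __name__ == "__main__":
    for sigma in (0.5, 1.0, 2.0):
        worst = max(growth_ratio(random_sequence(200, s), sigma) for s in range(5))
        print(sigma, worst, 2**sigma + 3 ** (-sigma))
```

Observed ratios well below $C_{σ}$ would be consistent with the remark that these estimates are crude upper envelopes.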

The constant

C_{σ} = 2^{σ} + 3^{- σ}

is an explicit growth factor for P on

ℓ_{σ}^{1}

. Since $2^{σ} > 1$ for every $σ > 0$, this constant exceeds 1, so no contraction is claimed at this level. The genuine contraction mechanism is obtained later on the multiscale Banach space

B_{tree}

, where a strong seminorm captures oscillatory decay along the Collatz tree while the

ℓ^{1}

component provides compactness.

3. Transfer Operator Formulation

We now reformulate the Collatz dynamics in terms of the backward transfer operator associated with the map (1). This operator-theoretic viewpoint provides an analytic bridge between the discrete recurrence and the functional framework developed in later sections. The transfer operator encodes the inverse–branching structure of the map and propagates densities backward along the Collatz tree, in a form compatible with logarithmic weighting and Dirichlet series.

Recall that, for the Collatz map (1), Lemma 3 shows that each $n \geq 1$ has the even preimage $2 n$, together with an additional odd preimage $(n - 1) / 3$ precisely when $n \equiv 4 \pmod 6$.

3.1. Backward Transfer Operator

Definition 1

(Backward transfer operator). For an arithmetic function

f : N \to C

, define

(P f) (n) : = \sum_{m : T (m) = n} \frac{f (m)}{m} = \frac{f (2 n)}{2 n} + 1_{{n \equiv 4 (6)}} \frac{f (\frac{n - 1}{3})}{(n - 1) / 3}, n \in N,

(15)

where

1_{A}

denotes the indicator of the condition A.

The multiplicative factor

1 / m

assigns to each inverse branch a logarithmic weight, so that P acts as a normalized backward average along preimages. This normalization aligns the discrete dynamics with Dirichlet weights and will be crucial for analytic continuation and spectral estimates below.

Positivity. If

f (n) \geq 0

for all n, then

(P f) (n) \geq 0

for all n, since P is a positive linear combination of values of f.

Weighted mass preservation. A direct change of variables shows that for every nonnegative f satisfying

\sum_{m \geq 1} | f (m) | / m < \infty

,

\sum_{n \geq 1} (P f) (n) = \sum_{m \geq 1} \frac{f (m)}{m} .

(16)

Thus P preserves the logarithmically weighted mass

\sum f (m) / m

; plain

ℓ^{1}

mass is not preserved under this normalization.

Boundedness on weighted spaces. Let

ℓ_{σ}^{1} := \{ f : N \to C : {∥ f ∥}_{ℓ_{σ}^{1}} := \sum_{n \geq 1} \frac{| f (n) |}{n^{σ}} < \infty \}, \qquad σ > 0 .

A direct change of variables in (15) yields, for all

f \in ℓ_{σ}^{1}

,

\begin{aligned} {∥ P f ∥}_{ℓ_{σ}^{1}} &= \sum_{n \geq 1} \frac{| (P f) (n) |}{n^{σ}} \leq \sum_{n \geq 1} \left( \frac{| f (2 n) |}{2 n^{1 + σ}} + 1_{\{n \equiv 4 (6)\}} \frac{| f ((n - 1) / 3) |}{((n - 1) / 3) \, n^{σ}} \right) \\ &= \frac{1}{2} \sum_{n \geq 1} \frac{| f (2 n) |}{n^{1 + σ}} + 3 \sum_{n \geq 1, \, n \equiv 4 (6)} \frac{| f ((n - 1) / 3) |}{(n - 1) \, n^{σ}} . \end{aligned}

(17)

Changing variables $m = 2 n$ in the first sum and $m = (n - 1) / 3$ in the second, so that $n = 3 m + 1$ with $m$ odd, gives

\begin{aligned} \frac{1}{2} \sum_{n \geq 1} \frac{| f (2 n) |}{n^{1 + σ}} &= 2^{σ} \sum_{m \geq 1, \, m \text{ even}} \frac{| f (m) |}{m^{1 + σ}} \leq 2^{σ} {∥ f ∥}_{ℓ_{σ}^{1}}, \\ 3 \sum_{n \geq 1, \, n \equiv 4 (6)} \frac{| f ((n - 1) / 3) |}{(n - 1) \, n^{σ}} &= \sum_{m \geq 1, \, m \text{ odd}} \frac{| f (m) |}{m {(3 m + 1)}^{σ}} \leq 3^{- σ} \sum_{m \geq 1} \frac{| f (m) |}{m^{1 + σ}} \leq 3^{- σ} {∥ f ∥}_{ℓ_{σ}^{1}} . \end{aligned}

Hence

{∥ P f ∥}_{ℓ_{σ}^{1}} \leq (2^{σ} + 3^{- σ}) {∥ f ∥}_{ℓ_{σ}^{1}},

(18)

and therefore

{∥ P^{k} f ∥}_{ℓ_{σ}^{1}} \leq {(2^{σ} + 3^{- σ})}^{k} {∥ f ∥}_{ℓ_{σ}^{1}}, \qquad k \geq 0 .

(19)

Action on the weighted sup space. For the Banach space

B_{σ} := \{ f : N \to C : {∥ f ∥}_{B_{σ}} := \sup_{n \geq 1} n^{σ} | f (n) | < \infty \},

the normalization factor

1 / m

in (15) improves decay at each branch but does not make P a contraction. Setting

g (n) : = n f (n)

, one obtains

n (P f) (n) = g (2 n) + 1_{{n \equiv 4 (6)}} g (\frac{n - 1}{3}), (P f) (n) = \frac{(Q g) (n)}{n}, (Q g) (n) : = g (2 n) + 1_{{n \equiv 4 (6)}} g (\frac{n - 1}{3}) .

Using

{∥ f ∥}_{B_{σ}} = {∥ g ∥}_{B_{σ - 1}}

, one obtains the bound

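\begin{aligned} {∥ P f ∥}_{B_{σ}} &= \sup_{n \geq 1} n^{σ - 1} | (Q g) (n) | \leq \sup_{n \geq 1} \left( n^{σ - 1} | g (2 n) | + n^{σ - 1} 1_{\{n \equiv 4 (6)\}} | g (\frac{n - 1}{3}) | \right) \\ &\leq (2^{- (σ - 1)} + c_{σ}) {∥ g ∥}_{B_{σ - 1}} = (2^{- (σ - 1)} + c_{σ}) {∥ f ∥}_{B_{σ}}, \qquad c_{σ} := \max \{3^{σ - 1}, 4^{σ - 1}\} . \end{aligned}

(20)

Here $c_{σ}$ bounds the ratio ${(3 m + 1)}^{σ - 1} / m^{σ - 1} = {(3 + 1 / m)}^{σ - 1}$ over $m \geq 1$, which equals $3^{σ - 1}$ in the limit $m \to \infty$ and $4^{σ - 1}$ at $m = 1$. In particular, the constant

2^{- (σ - 1)} + c_{σ} \geq 1

for all

σ > 0

, so the estimate (20) shows that P is bounded, but yields no contraction, on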

(B_{σ}, ∥ \cdot ∥_{B_{σ}})

. This coarse boundedness provides an upper envelope for the operator norm but does not imply any decay of

P^{k}

on

B_{σ}

.

These limitations motivate the refinement of the functional setting in later sections, where the multiscale tree spaces

B_{tree}

and

B_{tree, σ}

are introduced to obtain genuine Lasota–Yorke-type contractions with

λ < 1

and a provable spectral gap.

3.2. Dirichlet-Side Formulation and Intertwining

For

f \in ℓ_{σ}^{1}

with

σ > 0

, the Dirichlet transform

D f (s) : = \sum_{n \geq 1} \frac{f (n)}{n^{s}}, ℜ (s) > σ,

(21)

is absolutely convergent. Writing

D f (s) = \sum_{n \geq 1} a_{n} n^{- s}

with

a_{n} = f (n)

and substituting (15), we obtain

\begin{matrix} D (P f) (s) & = \sum_{n \geq 1} (\frac{a_{2 n}}{2 n} + 1_{{n \equiv 4 (6)}} \frac{a_{(n - 1) / 3}}{(n - 1) / 3}) \frac{1}{n^{s}} . \end{matrix}

(22)

Thus

D (P f)

is again a Dirichlet series whose coefficients depend linearly on those of

D f

.

Definition 2

(Dirichlet–Ruelle operator). Let

D_{σ}

denote the space of Dirichlet series

F (s) = \sum_{n \geq 1} a_{n} n^{- s} with \sum_{n \geq 1} \frac{| a_{n} |}{n^{σ}} < \infty .

Define

L : D_{σ} \to D_{σ}

by

(L F) (s) : = \sum_{n \geq 1} b_{n} n^{- s}, b_{n} : = \frac{a_{2 n}}{2 n} + 1_{{n \equiv 4 (6)}} \frac{a_{(n - 1) / 3}}{(n - 1) / 3} .

(23)

Lemma 5

(Operator norm of L). For

σ > 0

, let

{∥ F ∥}_{σ} : = \sum_{n \geq 1} | a_{n} | / n^{σ}

. Then

L : D_{σ} \to D_{σ}

is bounded and

{∥ L ∥}_{σ} \leq 2^{σ} + 3^{- σ} .

(24)

Proof.

From (23),

{∥ L F ∥}_{σ} = \sum_{n \geq 1} \frac{| b_{n} |}{n^{σ}} \leq \sum_{n \geq 1} \frac{| a_{2 n} |}{2 n n^{σ}} + \sum_{\begin{matrix} n \geq 1 \\ n \equiv 4 (6) \end{matrix}} \frac{| a_{(n - 1) / 3} |}{(n - 1) / 3} \frac{1}{n^{σ}} = : S_{even} + S_{odd} .

For the even term, set

m = 2 n

. Then

S_{even} = \sum_{m even} \frac{| a_{m} |}{2 {(m / 2)}^{1 + σ}} = \sum_{m even} \frac{2^{σ} | a_{m} |}{m^{1 + σ}} \leq 2^{σ} \sum_{m even} \frac{| a_{m} |}{m^{σ}} \leq 2^{σ} {∥ F ∥}_{σ} .

For the odd term, write

m = (n - 1) / 3

, so

n = 3 m + 1

and

S_{odd} = \sum_{m \geq 1} \frac{| a_{m} |}{m {(3 m + 1)}^{σ}} \leq 3^{- σ} \sum_{m \geq 1} \frac{| a_{m} |}{m^{σ}} = 3^{- σ} {∥ F ∥}_{σ} .

Combining the two estimates gives

{∥ L F ∥}_{σ} \leq (2^{σ} + 3^{- σ}) {∥ F ∥}_{σ},

proving (24). □

Lemma 6

(Intertwining of P and L). For every

f \in ℓ_{σ}^{1}

with

σ > 0

,

D (P f) = L (D f), D (P^{k} f) = L^{k} (D f), k \geq 0,

(25)

whenever the series converge absolutely.

Proof.

The Dirichlet coefficients of

D (P f)

in (22) are precisely the

b_{n}

of (23), so

D (P f) = L (D f)

; iteration gives the second identity. □

The intertwining relation shows that spectral information for P on

ℓ_{σ}^{1}

transfers to L on

D_{σ}

. However, since P is not contractive on

ℓ_{σ}^{1}

or

B_{σ}

, the inequality (24) provides only an exponential growth envelope ${(2^{σ} + 3^{- σ})}^{k}$ for

∥ L^{k} ∥_{σ}

, not exponential decay. Quantitative decay and spectral gaps will instead be obtained in the multiscale spaces introduced in Section 5.

Define

w_{k} : = P^{k} 1

with

1 (n) \equiv 1

and

ζ_{C} (s, k) : = \sum_{n \geq 1} \frac{w_{k} (n)}{n^{s}}, ℜ (s) large .

(26)

By Lemma 6,

ζ_{C} (s, 0) = ζ (s), ζ_{C} (s, k) = (L^{k} ζ) (s), k \geq 1 .

(27)

The quantity

w_{k} (n)

represents the total normalized weight of all k–step backward paths from n in the Collatz tree under the logarithmic weighting

1 / m

. The family

ζ_{C} (s, k)

therefore encodes, in Dirichlet form, the distribution of these weighted backward configurations at depth k. By Lemma 5,

∥ L^{k} ∥_{σ} \leq {(2^{σ} + 3^{- σ})}^{k},

so, for $σ > 1$ (where $1 \in ℓ_{σ}^{1}$ and ${∥ 1 ∥}_{σ} = ζ(σ)$), the series $ζ_{C} (s, k)$ is bounded on $ℜ (s) > σ$ by ${(2^{σ} + 3^{- σ})}^{k} ζ(σ)$; this envelope grows with $k$ and provides no decay. Later sections refine this estimate by passing to the multiscale tree space

B_{tree, σ}

, where the Lasota–Yorke inequality ensures a true spectral gap and exponential decay of

P^{k}

.
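The weights $w_{k} = P^{k} 1$ in (26) satisfy the recursion $w_{k + 1} = P w_{k}$, so individual values can be computed exactly by following the two inverse branches to depth $k$. The sketch below does this with memoization; the function names and the truncation level `n_max` of the partial Dirichlet sum are illustrative assumptions.

```python
from fractions import Fraction
from functools import lru_cache


@lru_cache(maxsize=None)
def w(k: int, n: int) -> Fraction:
    """w_k(n) = (P^k 1)(n), computed from w_k = P w_{k-1} using (15)."""
    if k == 0:
        return Fraction(1)
    value = w(k - 1, 2 * n) / (2 * n)        # even branch, always present
    if n % 6 == 4:                           # odd branch, only for n = 4 mod 6
        m = (n - 1) // 3
        value += w(k - 1, m) / m
    return value


def zeta_C_partial(s: float, k: int, n_max: int) -> float:
    """Partial sum over n <= n_max of the series (26); a truncation, not the full series."""
    return sum(float(w(k, n)) / n**s for n in range(1, n_max + 1))


if __name__ == "__main__":
    print(w(1, 4), w(2, 1), w(2, 4))
    print(zeta_C_partial(2.0, 0, 1000), zeta_C_partial(2.0, 3, 1000))
```

For $k = 0$ the partial sum is a truncation of $ζ(s)$, in line with (27); for $k \geq 1$ the values give a rough feel for how the weights redistribute, without any claim about their asymptotics.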

4. Spectral Reduction and Analytic Continuation

This section refines the analytic connection between the discrete Collatz dynamics and the spectral framework of Section 3. Our goal is to express analytic information about the Dirichlet series associated with iterates of the backward operator P in terms of the spectral data of P—equivalently, of the Dirichlet–Ruelle operator L—acting on suitable Banach spaces continuously embedded in

ℓ_{σ}^{1}

. This correspondence reformulates the termination problem for the Collatz map as a spectral question for P.

Throughout this section we fix

σ > 1

and a Banach space

B_{σ, 1}

of arithmetic functions such that

B_{σ, 1} \subset ℓ_{σ}^{1}

continuously,

P (B_{σ, 1}) \subset B_{σ, 1}

, and the Dirichlet transform

D f (s) = \sum_{n \geq 1} \frac{f (n)}{n^{s}}

defines a holomorphic function for

ℜ (s) > σ

whenever

f \in B_{σ, 1}

. The intertwining relation (25) then yields, for all

k \geq 0

,

D (P^{k} f) (s) = \sum_{n \geq 1} \frac{(P^{k} f) (n)}{n^{s}}, ℜ (s) > σ .

Since

B_{σ, 1} \subset ℓ_{σ}^{1}

, each series converges absolutely. By the

ℓ_{σ}^{1}

estimate (19),

| D (P^{k} f) (s) | \leq {∥ P^{k} f ∥}_{ℓ_{σ}^{1}} \leq {(2^{σ} + 3^{- σ})}^{k} {∥ f ∥}_{ℓ_{σ}^{1}}, \qquad ℜ (s) > σ .

(28)

The bound (28) shows that the iterates of $P$ grow at most exponentially on $ℓ_{σ}^{1}$, with no contraction at this level; a genuine contraction will appear only after the refinement to the multiscale tree spaces introduced in Section 4.3.

Generating function and operator resolvent. For

z \in C

with

| z | < {(2^{σ} + 3^{- σ})}^{- 1}

, define the two–variable generating function

G_{f} (s, z) : = \sum_{k \geq 0} z^{k} D (P^{k} f) (s) .

(29)

The series converges absolutely and locally uniformly for

ℜ (s) > σ

, hence

G_{f}

is holomorphic in

(s, z)

on the domain

Ω_{σ} : = {(s, z) \in C^{2} : ℜ (s) > σ, | z | < {(2^{σ} + 3^{- σ})}^{- 1}} .

On the operator side, for such z the Neumann series

{(I - z P)}^{- 1} = \sum_{k \geq 0} z^{k} P^{k}

converges in operator norm on

B_{σ, 1}

, and thus

G_{f} (s, z) = D [{(I - z P)}^{- 1} f] (s), (s, z) \in Ω_{σ} .

(30)

The singularities of ${(I - z P)}^{- 1}$ in the $z$–plane occur precisely at the reciprocals of the nonzero spectral values of $P$ on

B_{σ, 1}

. Consequently the analytic structure of

G_{f}

as a function of z is governed by the spectrum of P.

At this point we recall the weighted mass balance (16),

\sum_{n \geq 1} (P f) (n) = \sum_{m \geq 1} \frac{f (m)}{m},

so the constant function $1 (n) \equiv 1$ is not fixed by $P$ (Remark 1); the eigenvalue 1 is carried instead by the invariant profile $h$ with $P h = h$. Hence the spectral analysis of $P$ will focus on demonstrating a spectral gap at 1: all other spectral values satisfy $| λ | \leq λ_{LY} < 1$. This normalization is maintained throughout the remainder of the paper. Under such a gap, the resolvent expansion (30) continues meromorphically in $z$ to the disk $| z | < λ_{LY}^{- 1}$ with a single simple pole at $z = 1$, whose residue encodes the invariant functional paired with $h$.

The coarse resolvent radius

{(2^{σ} + 3^{- σ})}^{- 1}

merely provides an elementary domain of convergence. A sharper meromorphic continuation—reflecting the true spectral radius

r (P) = 1

and the subdominant bound

ρ_{ess} (P) \leq λ_{LY} < 1

—will be obtained on the refined spaces

B_{tree}

and

B_{tree, σ}

, where the Lasota–Yorke inequality gives quantitative contraction of oscillations between adjacent scales.

Finally, for the constant function

1 (n) \equiv 1

(whenever

1 \in B_{σ, 1}

), the coefficients of

G_{1} (s, z)

are precisely the Collatz Dirichlet series

ζ_{C} (s, k)

defined in (26). Thus the analytic continuation and asymptotic decay of

ζ_{C} (s, k)

as

k \to \infty

are controlled by the spectral properties of P through (30); their exponential decay emerges once the spectral gap on the multiscale tree spaces is established.

4.1. Spectral Reduction and Analytic Continuation

Recall that the Dirichlet–Ruelle operator L is defined on

D_{σ}

by (23). The intertwining Lemma 6 asserts that for all

f \in ℓ_{σ}^{1}

,

D (P f) = L (D f) .

Since

D

is injective on

ℓ_{σ}^{1}

, every eigenpair

(λ, f)

of P with

f \in ℓ_{σ}^{1}

produces an eigenpair

(λ, D f)

of L. Conversely, if

L F = λ F

and

F = D f

lies in the image of

D

, then

P f = λ f

. Hence the point spectra of P on

B_{σ, 1}

and of L on

D_{σ}

coincide on the subspace

D (B_{σ, 1})

. In particular,

ρ (L) \geq ρ (P),

(31)

and any spectral gap or peripheral spectral property of P transfers to the induced action of L on Dirichlet series arising from

B_{σ, 1}

.

We emphasize that equality

σ (L) = σ (P)

is not assumed. The partial correspondence (31) suffices for analytic reduction: the Dirichlet-side continuation of

D (P^{k} f)

reflects the spectral geometry of P.

Mass preservation and spectral gap. Because P only preserves total mass up to a logarithmic factor, we have

\sum_{n \geq 1} (P f) (n) = \sum_{m \geq 1} \frac{f (m)}{m},

so the constant function

1 (n) \equiv 1

is not an eigenvector. Instead, P admits a unique positive invariant density

h \in B_{tree, σ}

and a unique positive invariant functional

ϕ \in B_{tree, σ}^{*}

with

P h = h, ϕ \circ P = ϕ, ϕ (h) = 1 .

(32)

Throughout the paper we work with this Perron–Frobenius normalization (32) and express all spectral decompositions relative to the nonconstant invariant profile h.

Within this framework, the Dirichlet–Ruelle operator L inherits the same dominant eigenvalue 1 and the same spectral gap on the subspace

D (B_{σ, 1})

. The analytic behavior of the Collatz Dirichlet series

ζ_{C} (s, k) = D (P^{k} 1) (s)

is then determined by how

P^{k}

approaches the spectral projector onto the invariant subspace spanned by

h

.

Theorem 1

(Spectral reduction and analytic continuation). Let

B_{σ, 1}

be a Banach space of arithmetic functions continuously embedded in

ℓ_{σ}^{1}

such that

P : B_{σ, 1} \to B_{σ, 1}

is quasi-compact and satisfies the mass-preserving normalization (12). Assume further that 1 is a simple eigenvalue of P and that all other spectral values lie in the closed disk

| λ | \leq λ_{LY} < 1

. Then for every

f \in B_{σ, 1}

the Dirichlet transforms

D (P^{k} f) (s)

extend holomorphically to

ℜ (s) > σ

and admit the decomposition

D (P^{k} f) (s) = Π_{1} (f) D (1) (s) + R_{k} (s), | R_{k} (s) | \leq C_{f} (s) λ_{LY}^{k},

(33)

where

Π_{1}

is the spectral projection associated with the eigenvalue 1 and

C_{f} (s)

is locally bounded on

{ℜ (s) > σ}

. In particular, for f with

Π_{1} (f) = 0

, the functions

D (P^{k} f) (s)

decay exponentially in k uniformly on compact subsets of

ℜ (s) > σ

.

When

f = 1

, the same conclusion applies to

ζ_{C} (s, k) = D (P^{k} 1) (s)

, whose exponential stabilization corresponds to convergence toward the invariant density associated with the Collatz operator.

Proof.

By quasi-compactness, the spectrum of P decomposes as

σ (P) = {1} \cup σ_{ess} (P), ρ_{ess} (P) \leq λ_{LY} < 1,

and the Riesz projection

Π_{1} = \frac{1}{2 π i} \oint_{| z - 1 | = ε} {(z I - P)}^{- 1} d z

is a bounded projection onto the one-dimensional invariant subspace spanned by

1

. Then

P^{k} = Π_{1} + N^{k}

, where

∥ N^{k} ∥_{B_{σ, 1}} \leq C λ_{LY}^{k}

for some constant

C > 0

. Applying the Dirichlet transform and using

| D (g) (s) | \leq {∥ g ∥}_{ℓ_{σ}^{1}}

for

ℜ (s) > σ

gives

D (P^{k} f) (s) = D (Π_{1} f) (s) + D (N^{k} f) (s), | D (N^{k} f) (s) | \leq C λ_{LY}^{k} {∥ f ∥}_{B_{σ, 1}} .

Since

Π_{1} f

is a multiple of

1

, we may write

D (Π_{1} f) = Π_{1} (f) D (1)

, yielding (33). Analyticity for

ℜ (s) > σ

follows from absolute convergence and locally uniform bounds. □
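The splitting $P^{k} = Π_{1} + N^{k}$ used in the proof can be made explicit. The following worked step is a sketch in which the symbol $N := P - Π_{1}$ is spelled out as an assumption about the intended definition; it uses only $P Π_{1} = Π_{1} P = Π_{1}$ and $Π_{1}^{2} = Π_{1}$.

```latex
\begin{aligned}
N &:= P - \Pi_{1} = P (I - \Pi_{1}), \\
N \Pi_{1} &= P \Pi_{1} - \Pi_{1}^{2} = \Pi_{1} - \Pi_{1} = 0,
\qquad \Pi_{1} N = \Pi_{1} P - \Pi_{1}^{2} = 0, \\
P^{k} &= (\Pi_{1} + N)^{k} = \Pi_{1}^{k} + N^{k} = \Pi_{1} + N^{k}
\qquad (k \geq 1).
\end{aligned}
```

The cross terms vanish because $Π_{1}$ and $N$ annihilate each other. The identity is stated for $k \geq 1$; for $k = 0$ one has $I = Π_{1} + (I - Π_{1})$ instead. The decay of $∥ N^{k} ∥$ then reflects the fact that the spectrum of $N$ is the part of the spectrum of $P$ away from the eigenvalue 1, together with 0.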

This form aligns with the quasi-compactness obtained later on the multiscale tree space

B_{tree, σ}

, where the Lasota–Yorke inequality ensures

ρ_{ess} (P) \leq λ_{LY} < 1

. The exponential term

λ_{LY}^{k}

in (33) corresponds to the essential spectral radius and controls the rate of decay of correlations and Dirichlet coefficients. Under stronger spectral assumptions, the representation can be refined to a meromorphic decomposition in which each isolated eigenvalue

λ_{j}

contributes a term

λ_{j}^{k} D (Π_{j} f)

, generalizing the usual Ruelle–Perron expansion.

4.2. Spectral Criterion on Weighted $ℓ^{1}$ Spaces

The preceding analysis shows that sufficiently strong spectral control of P on an appropriate Banach space

B_{σ, 1}

forces all Dirichlet data generated by the backward Collatz tree to exhibit exponential stabilization toward the invariant profile. Since P is not contractive on

ℓ_{σ}^{1}

or

B_{σ}

, such behavior can only arise on refined Banach spaces where a genuine spectral gap at the eigenvalue 1 has been established. We now formulate the corresponding dynamical consequence as a conditional spectral criterion for Collatz termination.

Theorem 2

(Spectral criterion for Collatz termination). Let P act on a Banach space

B_{σ, 1} \subset ℓ_{σ}^{1}

such that

P (B_{σ, 1}) \subset B_{σ, 1}

and

1 \in B_{σ, 1}

. Assume that P is quasi-compact on

B_{σ, 1}

, that 1 is a simple eigenvalue of P corresponding to the invariant density

1

, and that all other spectral values satisfy

σ (P) ∖ {1} \subset {z \in C : | z | \leq λ_{LY} < 1} .

Then every

f \in B_{σ, 1}

admits a decomposition

P^{k} f = Π_{1} f + N^{k} f, \qquad {∥ N^{k} f ∥}_{B_{σ, 1}} \leq C λ_{LY}^{k} {∥ f ∥}_{B_{σ, 1}},

where

Π_{1}

is the spectral projection onto

span {1}

. Consequently, there exists no nontrivial invariant or periodic density for the backward Collatz dynamics in

B_{σ, 1}

; the only invariant direction is the constant function

1

. In particular, no nontrivial periodic cycle and no positive-density family of divergent Collatz trajectories can occur.

Proof.

By quasi-compactness, the spectrum of P decomposes as

σ (P) = {1} \cup σ_{ess} (P)

with

ρ_{ess} (P) \leq λ_{LY} < 1

. The associated Riesz projection

Π_{1} = \frac{1}{2 π i} \oint_{| z - 1 | = ε} {(z I - P)}^{- 1} d z

is bounded and satisfies

P Π_{1} = Π_{1} P = Π_{1}

,

Π_{1} 1 = 1

. Hence the power iterates decompose as

P^{k} = Π_{1} + N^{k}, {∥ N^{k} ∥}_{B_{σ, 1}} \leq C λ_{LY}^{k},

for some constant

C > 0

. If a nontrivial invariant density

f \in B_{σ, 1}

satisfied

P f = f

, then f would belong to the eigenspace of

λ = 1

, and since this eigenspace is one-dimensional, f must be a scalar multiple of the positive eigenvector h satisfying

P h = h

. Thus no additional invariant densities exist beyond

span {h}

.

If a periodic density f satisfied

P^{q} f = f

for some

q > 0

, then f would correspond to an eigenvalue

λ

with

| λ | = 1

. Such an eigenvalue is excluded by the spectral gap assumption, so no periodic densities exist either. Finally, in the standard translation between transfer-operator invariants and dynamical orbits on the underlying tree, any invariant or periodic density corresponds to either a periodic Collatz cycle or to a positive-density family of non-terminating trajectories. The spectral gap therefore precludes these dynamical behaviors. □
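The decomposition of the iterates into a projection plus a geometrically decaying remainder can be seen in the simplest possible setting. The sketch below uses a 2×2 symmetric stochastic matrix as a stand-in; it is a toy model chosen purely for illustration, not the Collatz operator, and the names `M`, `Pi1`, `lam`, and `remainder_norm` are hypothetical.

```python
# Toy illustration (NOT the Collatz operator): a 2x2 matrix with
# eigenvalues 1 and lam = 0.5, so that M^k = Pi1 + lam^k * (I - Pi1).

def matmul(A, B):
    """Multiply two 2x2 matrices given as nested lists."""
    return [[sum(A[i][r] * B[r][j] for r in range(2)) for j in range(2)]
            for i in range(2)]

def matpow(A, k):
    """k-th power of a 2x2 matrix by repeated multiplication."""
    R = [[1.0, 0.0], [0.0, 1.0]]
    for _ in range(k):
        R = matmul(R, A)
    return R

M = [[0.75, 0.25],
     [0.25, 0.75]]      # eigenvalues 1 and 0.5
Pi1 = [[0.5, 0.5],
       [0.5, 0.5]]      # spectral projection for the eigenvalue 1
lam = 0.5               # modulus of the remaining spectrum

def remainder_norm(k):
    """Max-entry norm of the remainder N^k = M^k - Pi1."""
    Mk = matpow(M, k)
    return max(abs(Mk[i][j] - Pi1[i][j]) for i in range(2) for j in range(2))

for k in (1, 5, 10):
    # The remainder stays below C * lam^k with C = 1.
    print(k, remainder_norm(k), lam ** k)
```

In this toy case the remainder is exactly a multiple of `lam ** k`; in the infinite-dimensional setting of Theorem 2 the same shape of bound is what the spectral gap hypothesis supplies.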

Sections 4.3–4.6 construct the multiscale tree Banach space

B_{tree}

and establish a Lasota–Yorke inequality that ensures quasi-compactness of P with an explicit contraction constant

λ_{LY} < 1

in the strong seminorm. Verification of the hypotheses of Theorem 2 on

B_{tree, σ}

provides the analytic–spectral bridge: a strict spectral gap for P on

B_{tree, σ}

rules out the spectral signatures associated with any non-terminating Collatz behavior.

4.3. Multi-Scale Tree Space

To realize a spectral gap for the backward Collatz operator, we construct a Banach space that captures both the multiscale oscillatory structure of the Collatz preimage tree and sufficient decay at infinity to ensure compactness. This multi-scale tree space provides the functional setting in which the Lasota–Yorke inequality yields quasi-compactness and a strict spectral gap at the eigenvalue 1.

For

j \geq 0

define the scale blocks

I_{j} : = [6^{j}, 2 \cdot 6^{j}) \cap N .

(34)

The factor 6 reflects the approximate scale multiplication under the backward map, combining the even branch

m = 2 n

and the odd branch

m = (n - 1) / 3

(defined for

n \equiv 4 (mod 6)

).

Fix parameters

0 < α < 1

and

0 < ϑ < 1

. For indices

u, v > 0

, define the scale-sensitive weight

W_{α} (u, v) : = \frac{u v}{| u - v | {(u + v)}^{α}}, u \neq v .

(35)

This weight penalizes small separations between indices, emphasizing local oscillations of f, while the factor

{(u + v)}^{- α}

damps sensitivity at large scales. The geometric coefficient

ϑ^{j}

provides exponential attenuation of oscillations across successive levels of the tree.

Definition 3

(Multiscale tree seminorm and space). For

f : N \to C

define

{[f]}_{tree} : = \sum_{j \geq 0} ϑ^{j} sup_{\begin{matrix} m, n \in I_{j} \\ m \neq n \end{matrix}} W_{α} (m, n) | f (m) - f (n) | .

(36)

The corresponding Banach space

B_{tree} : = \{f : N \to C : {∥ f ∥}_{1} + {[f]}_{tree} < \infty\}, {∥ f ∥}_{tree} : = {∥ f ∥}_{1} + {[f]}_{tree},

is called the multiscale tree space.
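As a concrete reading of (34)–(36), the following sketch evaluates a truncated version of the tree seminorm by brute force over the first few scale blocks. The truncation level `max_level` and the function names are assumptions made for illustration; the true seminorm is the full series over all levels.

```python
def block(j):
    """Scale block I_j = [6^j, 2*6^j) as a range of integers."""
    return range(6 ** j, 2 * 6 ** j)

def weight(u, v, alpha):
    """W_alpha(u, v) = u*v / (|u - v| * (u + v)^alpha), for u != v."""
    if u == v:
        raise ValueError("W_alpha is only defined for u != v")
    return u * v / (abs(u - v) * (u + v) ** alpha)

def tree_seminorm(f, alpha, theta, max_level):
    """Truncated seminorm: sum over j <= max_level of
    theta^j * sup over m != n in I_j of W_alpha(m, n) * |f(m) - f(n)|."""
    total = 0.0
    for j in range(max_level + 1):
        pts = list(block(j))
        sup = 0.0
        for a in range(len(pts)):
            for b in range(a + 1, len(pts)):
                m, n = pts[a], pts[b]
                sup = max(sup, weight(m, n, alpha) * abs(f(m) - f(n)))
        total += theta ** j * sup
    return total

# Constant functions have zero seminorm; for f(n) = 1/n the weighted
# difference W_alpha(m, n) * |1/m - 1/n| simplifies to (m + n)^(-alpha).
print(tree_seminorm(lambda n: 1.0, 0.5, 0.2, 2))
print(tree_seminorm(lambda n: 1.0 / n, 0.5, 0.2, 2))
```

The brute-force pair loop is quadratic in the block size, so this is only practical for small `max_level`.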

Standard arguments for weighted variation-type seminorms show that

(B_{tree}, ∥ \cdot ∥_{tree})

is complete. The seminorm

{[f]}_{tree}

controls the oscillatory irregularity of f within each scale block

I_{j}

, while the

ℓ^{1}

component controls the overall magnitude. However,

B_{tree}

alone does not impose sufficient decay as

n \to \infty

to guarantee compactness.

Weighted extension. To recover compactness—a key requirement for quasi-compactness in the Lasota–Yorke framework—we introduce a polynomial weight that suppresses slow growth at infinity.

Definition 4

(Weighted tree space). For parameters

0 < α < 1

,

0 < ϑ < 1

, and

σ > 1

, set

{∥ f ∥}_{σ} : = \sum_{n \geq 1} \frac{| f (n) |}{n^{σ}}, {[f]}_{tree} : = \sum_{j \geq 0} ϑ^{j} sup_{\begin{matrix} m, n \in I_{j} \\ m \neq n \end{matrix}} W_{α} (m, n) | f (m) - f (n) | .

Then

B_{tree, σ} : = \{f : N \to C : {∥ f ∥}_{σ} + {[f]}_{tree} < \infty\}, {∥ f ∥}_{tree, σ} : = {∥ f ∥}_{σ} + {[f]}_{tree} .

The factor

n^{- σ}

enforces quantitative decay of f at large indices, while

{[f]}_{tree}

measures the oscillatory complexity of f along each level of the tree. Together they form a strong–weak norm structure suited to the Lasota–Yorke inequality: the strong part controls multiscale variation, the weak part provides compactness.

Lemma 7

(Compact embedding). For fixed

0 < α < 1

,

0 < ϑ < 1

, and

σ > 1

, the unit ball of

B_{tree, σ}

is relatively compact in

ℓ_{σ}^{1}

.

Proof.

Let

U : = \{f \in B_{tree, σ} : {∥ f ∥}_{tree, σ} \leq 1\} .

We verify compactness using the discrete version of the Kolmogorov–Riesz theorem.

(i) Uniform boundedness. Each

f \in U

satisfies

{∥ f ∥}_{σ} \leq 1

, so

U

is bounded in

ℓ_{σ}^{1}

.

(ii) Uniform tail control. For any

ε > 0

choose N so that

\sum_{n > N} n^{- σ} < ε

. Then for all

f \in U

,

\sum_{n > N} \frac{| f (n) |}{n^{σ}} \leq {∥ f ∥}_{σ} \sum_{n > N} \frac{1}{n^{σ}} \leq ε,

so the tails contribute arbitrarily little

ℓ_{σ}^{1}

–mass.

(iii) Local equicontinuity on finite blocks. Fix

J \geq 0

and consider the finite union

E_{J} = ⋃_{j \leq J} I_{j}

. Within each

I_{j}

, the seminorm term

ϑ^{j} {sup}_{m, n \in I_{j}} W_{α} (m, n) | f (m) - f (n) |

bounds discrete oscillations uniformly in f. Hence the family

{f |_{E_{J}} : f \in U}

lies in a compact subset of the finite-dimensional space

C^{E_{J}}

.

(iv) Diagonal extraction. Given any sequence

(f^{(k)}) \subset U

, apply the compactness on

E_{1}, E_{2}, \dots

and extract a diagonal subsequence converging pointwise on all of

N

. By (ii) the tails beyond any fixed N have uniformly small weight, so pointwise convergence on finite windows implies convergence in

ℓ_{σ}^{1}

. Thus

U

is relatively compact in

ℓ_{σ}^{1}

. □

Remark 2.

The weight

n^{- σ}

is essential. Without it, the unit ball of

B_{tree}

is not precompact in

ℓ^{1}

: one can construct sequences of disjointly supported spikes whose tree seminorms remain bounded while their supports drift to infinity. Taking

σ > 1

eliminates this escape to infinity, yielding the compact embedding required for quasi-compactness.

The space

B_{tree, σ}

thus provides the natural functional environment for the Lasota–Yorke inequality. Its compact embedding into

ℓ_{σ}^{1}

ensures that the essential spectral radius of P on

B_{tree, σ}

is strictly smaller than its spectral radius, a prerequisite for establishing a genuine spectral gap. The strong seminorm captures multiscale regularity across the Collatz tree, while the weighted

ℓ^{1}

norm supplies the compactness that underlies the spectral analysis of the backward transfer operator.

4.4. Lasota–Yorke Inequality on $B_{tree}$

Recall from (11) that

(P f) (n) = \frac{f (2 n)}{2 n} + 1_{{n \equiv 4 (6)}} \frac{f (\frac{n - 1}{3})}{(n - 1) / 3} .

It is convenient to split P into its even and odd components:

(P_{even} f) (n) : = \frac{f (2 n)}{2 n}, (P_{odd} f) (n) : = 1_{{n \equiv 4 (6)}} \frac{f (\frac{n - 1}{3})}{(n - 1) / 3},

(37)

so that

P = P_{even} + P_{odd}

.
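The two branches can be written down directly. The sketch below is a minimal pointwise implementation of the formulas in (37), together with the forward map (1) used to confirm that both branches really are preimages; representing f as a Python callable and the function names are illustrative assumptions.

```python
def T(n):
    """Forward Collatz map from (1)."""
    return n // 2 if n % 2 == 0 else 3 * n + 1

def P_even(f, n):
    """Even branch: (P_even f)(n) = f(2n) / (2n)."""
    return f(2 * n) / (2 * n)

def P_odd(f, n):
    """Odd branch: active only for n = 4 (mod 6), where (n - 1)/3 is an odd integer."""
    if n % 6 != 4:
        return 0.0
    m = (n - 1) // 3
    return f(m) / m

def P(f, n):
    """Backward transfer operator P = P_even + P_odd, evaluated at n."""
    return P_even(f, n) + P_odd(f, n)

# Both backward branches invert the forward map.
for n in range(1, 100):
    assert T(2 * n) == n
    if n % 6 == 4:
        assert T((n - 1) // 3) == n

# With f(n) = n each active branch contributes exactly 1.
print(P(lambda n: n, 4), P(lambda n: n, 5))
```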

From the

ℓ^{1}

estimates of Section 2, both branches are bounded on

ℓ^{1}

, hence on

B_{tree}

. The Lasota–Yorke inequality arises from the fact that

P_{even}

is strongly contracting in the tree seminorm, while

P_{odd}

is a controlled perturbation whose contribution is damped by the multiscale factor

ϑ^{j}

.

4.4.1. Even Branch Contraction on the Multiscale Tree Space

We first record the even-branch estimate.

Lemma 8

(Even branch contraction on

B_{tree, σ}

). Let

0 < α < 1

,

0 < ϑ < 1

, and

σ > 1

. There exists a constant

C_{even} > 0

depending only on α, ϑ, and σ such that for all

f \in B_{tree, σ}

,

{[P_{even} f]}_{tree} \leq 2^{- (1 - α)} ϑ {[f]}_{tree} + C_{even} {∥ f ∥}_{σ} .

(38)

In particular, for fixed α one can choose ϑ sufficiently small so that the even branch is strictly contracting in the tree seminorm up to a controlled

{∥ \cdot ∥}_{σ}

–error.

Proof.

Recall that

(P_{even} f) (n) = f (2 n) / (2 n)

. For each

j \geq 0

, the block seminorm of

P_{even} f

is

Δ_{j} (P_{even} f) : = sup_{\begin{matrix} u, v \in I_{j} \\ u \neq v \end{matrix}} \frac{1}{6^{j}} W_{α} (u, v) |(P_{even} f) (u) - (P_{even} f) (v)| .

Fix j and

u, v \in I_{j}

with

u \neq v

. We decompose

(P_{even} f) (u) - (P_{even} f) (v) = \frac{f (2 u) - f (2 v)}{2 u} + f (2 v) (\frac{1}{2 u} - \frac{1}{2 v}) = : D_{1} (u, v) + D_{2} (u, v),

and estimate the two terms separately.

(1) The oscillatory part $D_{1}$ . Since

W_{α} (2 u, 2 v) = 2^{1 - α} W_{α} (u, v),

we have

W_{α} (u, v) = 2^{- (1 - α)} W_{α} (2 u, 2 v) .

Hence

\frac{1}{6^{j}} W_{α} (u, v) | D_{1} (u, v) | \leq \frac{2^{- (1 - α)}}{6^{j}} W_{α} (2 u, 2 v) \frac{| f (2 u) - f (2 v) |}{2 u} .

Since

u \in I_{j} = [6^{j}, 2 \cdot 6^{j})

,

u \geq 6^{j}

, so

1 / (2 u) \leq 1 / (2 \cdot 6^{j})

and

\frac{1}{6^{j}} W_{α} (u, v) | D_{1} (u, v) | \leq \frac{2^{- (1 - α) - 1}}{6^{2 j}} W_{α} (2 u, 2 v) | f (2 u) - f (2 v) | .

The pair

(2 u, 2 v)

lies at scale comparable to

6^{j}

, i.e. within a bounded number of block levels. Hence there exists a constant

c_{0} > 0

depending only on the block geometry such that

\frac{1}{6^{2 j}} W_{α} (2 u, 2 v) \leq c_{0} \frac{1}{6^{j^{'}}} W_{α} (2 u, 2 v) for some j^{'} \in {j, j + 1} .

Taking the supremum over

u, v \in I_{j}

gives

Δ_{j} (P_{even} f; D_{1}) \leq c_{0} 2^{- (1 - α) - 1} max {Δ_{j} (f), Δ_{j + 1} (f)} .

Multiplying by

ϑ^{j}

and using

ϑ^{j} Δ_{j} (f) \leq {[f]}_{tree}

and

ϑ^{j} Δ_{j + 1} (f) \leq ϑ^{- 1} {[f]}_{tree}

, we obtain

ϑ^{j} Δ_{j} (P_{even} f; D_{1}) \leq c_{1} 2^{- (1 - α)} ϑ {[f]}_{tree},

for some constant

c_{1}

depending only on

α

and

ϑ

. Taking the supremum over j yields

{[P_{even} f]}_{tree}^{(D_{1})} \leq c_{1} 2^{- (1 - α)} ϑ {[f]}_{tree} .

(2) The denominator part $D_{2}$ . Assume

u > v

. Then

|\frac{1}{2 u} - \frac{1}{2 v}| = \frac{| u - v |}{2 u v}, | D_{2} (u, v) | = | f (2 v) | \frac{| u - v |}{2 u v} .

Thus

W_{α} (u, v) | D_{2} (u, v) | = \frac{u v}{| u - v | {(u + v)}^{α}} | f (2 v) | \frac{| u - v |}{2 u v} = \frac{| f (2 v) |}{2 {(u + v)}^{α}} .

For

u, v \in I_{j}

, we have

u + v \geq 2 \cdot 6^{j}

, so

W_{α} (u, v) | D_{2} (u, v) | \leq C_{α} 6^{- α j} | f (2 v) | with C_{α} : = 2^{- (1 + α)} .

Hence

Δ_{j} (P_{even} f; D_{2}) \leq \frac{C_{α}}{6^{(1 + α) j}} sup_{v \in I_{j}} | f (2 v) | .

Multiplying by

ϑ^{j}

gives

ϑ^{j} Δ_{j} (P_{even} f; D_{2}) \leq C_{α} {(ϑ 6^{- (1 + α)})}^{j} sup_{v \in I_{j}} | f (2 v) | .

Each integer n appears as

n = 2 v

for at most one

v \in I_{j}

, and since

| f (n) | \leq n^{σ} {∥ f ∥}_{σ}

, the geometric factor

{(ϑ 6^{- (1 + α)})}^{j}

ensures convergence of the series in j. Thus there exists a constant

C_{even}^{'} > 0

depending only on

α

,

ϑ

, and

σ

such that

sup_{j \geq 0} ϑ^{j} Δ_{j} (P_{even} f; D_{2}) \leq C_{even}^{'} {∥ f ∥}_{σ} .

(3) Combine the two parts. Combining the bounds for

D_{1}

and

D_{2}

and renaming constants gives

{[P_{even} f]}_{tree} \leq 2^{- (1 - α)} ϑ {[f]}_{tree} + C_{even} {∥ f ∥}_{σ},

which is the desired inequality (38). □
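Two algebraic identities carry the proof above: the scaling rule for the weight under doubling, and the cancellation that reduces the weighted denominator term to a power of u + v. The sketch below checks both numerically on a few pairs taken from the scale blocks; it tests only these identities, not the lemma itself, and the helper names are hypothetical.

```python
def weight(u, v, alpha):
    """W_alpha(u, v) = u*v / (|u - v| * (u + v)^alpha), for u != v."""
    return u * v / (abs(u - v) * (u + v) ** alpha)

def scaling_gap(u, v, alpha):
    """Relative difference between W(2u, 2v) and 2^(1 - alpha) * W(u, v)."""
    lhs = weight(2 * u, 2 * v, alpha)
    rhs = 2 ** (1 - alpha) * weight(u, v, alpha)
    return abs(lhs - rhs) / rhs

def d2_gap(u, v, alpha):
    """Relative difference between W(u, v) * |1/(2u) - 1/(2v)| and
    1 / (2 * (u + v)^alpha), i.e. the D_2 identity with f(2v) = 1."""
    lhs = weight(u, v, alpha) * abs(1 / (2 * u) - 1 / (2 * v))
    rhs = 1 / (2 * (u + v) ** alpha)
    return abs(lhs - rhs) / rhs

# Sample pairs from I_1, I_2, I_3.
pairs = [(6, 7), (36, 50), (216, 431)]
for alpha in (0.25, 0.5, 0.75):
    for u, v in pairs:
        print(alpha, u, v, scaling_gap(u, v, alpha), d2_gap(u, v, alpha))
```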

The odd branch requires more care because it shifts indices from n to

(n - 1) / 3

and only acts on the congruence class

n \equiv 4 (mod 6)

. Its effect is nonetheless small once weighted by

ϑ^{j}

.

4.4.2. Odd Branch Contraction on the Multiscale Tree Space

Lemma 9

(Odd-branch distortion on scale blocks). Let

0 < α < 1

. If

n \equiv 4 (mod 6)

and

n \in I_{j} = [6^{j}, 2 \cdot 6^{j})

, then the odd preimage

m = (n - 1) / 3

satisfies

m \in I_{j - 1}

and

W_{α} (m_{1}, m_{2}) \leq 6^{1 - α} W_{α} (n_{1}, n_{2})

(39)

whenever

n_{1}, n_{2} \in I_{j}

lie on the same ray and

m_{i} = (n_{i} - 1) / 3

.

Proof.

For

n \in I_{j}

we have

n ≍ 6^{j}

; hence

m = (n - 1) / 3 ≍ 6^{j - 1}

, which gives

m \in I_{j - 1}

. Moreover,

| m_{1} - m_{2} | = \frac{1}{3} | n_{1} - n_{2} | and m_{1} + m_{2} ≍ 6^{j - 1} .

Thus

W_{α} (m_{1}, m_{2}) = \frac{| m_{1} - m_{2} |}{{(m_{1} + m_{2})}^{α}} \leq \frac{\frac{1}{3} | n_{1} - n_{2} |}{{(6^{- 1} (n_{1} + n_{2}))}^{α}} = 6^{1 - α} W_{α} (n_{1}, n_{2}),

which proves (39). □

Lemma 10

(Odd branch on

B_{tree}

). There exist constants

C_{α} > 0

and

C_{odd} > 0

such that for all

f \in B_{tree}

,

{[P_{odd} f]}_{tree} \leq λ_{odd} (α, ϑ) {[f]}_{tree} + C_{odd} {∥ f ∥}_{1},

(40)

with

λ_{odd} (α, ϑ) \leq \frac{C_{α}}{\sqrt{6}} ϑ .

(41)

Proof.

Recall that

(P_{odd} f) (n) = 1_{{n \equiv 4 (6)}} \frac{f (\frac{n - 1}{3})}{(n - 1) / 3} .

For each

j \geq 0

define

A_{j} (f) : = sup_{\begin{matrix} m, n \in I_{j} \\ m \neq n \end{matrix}} W_{α} (m, n) |P_{odd} f (m) - P_{odd} f (n)|,

so that, by definition of

{[\cdot]}_{tree}

,

{[P_{odd} f]}_{tree} = \sum_{j \geq 0} ϑ^{j} A_{j} (f) .

Fix

j \geq 0

and

m, n \in I_{j}

,

m \neq n

. We decompose according to the active congruence class

4 (mod 6)

.

Case 1: neither m nor n is

4 (mod 6)

. Then

P_{odd} f (m) = P_{odd} f (n) = 0

, so this pair contributes nothing to

A_{j} (f)

.

Case 2: exactly one of

m, n

is

4 (mod 6)

. Without loss of generality, assume

m \equiv 4 (mod 6)

and

n ≢ 4 (mod 6)

. Set

k : = (m - 1) / 3

. Then

P_{odd} f (m) - P_{odd} f (n) = \frac{f (k)}{k},

and hence

W_{α} (m, n) |P_{odd} f (m) - P_{odd} f (n)| = W_{α} (m, n) \frac{| f (k) |}{k} .

Since

m, n \in I_{j} = [6^{j}, 2 \cdot 6^{j})

, there exist constants

c_{1}, c_{2} > 0

(depending only on

α

) such that

W_{α} (m, n) \leq c_{1} 6^{(2 - α) j}, k = \frac{m - 1}{3} \geq c_{2} 6^{j - 1},

so

ϑ^{j} W_{α} (m, n) \frac{| f (k) |}{k} \leq C {(ϑ 6^{1 - α})}^{j} | f (k) |

for some constant C depending only on

α

. Each k arises from at most one such m and j, so summing first over pairs

(m, n)

of this type and then over j yields

\sum_{j \geq 0} ϑ^{j} sup_{\begin{matrix} m, n \in I_{j} \\ exactly one \equiv 4 (6) \end{matrix}} W_{α} (m, n) |P_{odd} f (m) - P_{odd} f (n)| \leq C_{odd, 1} {∥ f ∥}_{1},

provided

ϑ 6^{1 - α} < 1

, which we assume from now on. Here

C_{odd, 1}

depends on

α

and

ϑ

, but not on f.

Case 3: both m and n are

4 (mod 6)

. Set

m^{'} = \frac{m - 1}{3}, n^{'} = \frac{n - 1}{3},

so that

P_{odd} f (m) = \frac{f (m^{'})}{m^{'}}, P_{odd} f (n) = \frac{f (n^{'})}{n^{'}} .

We decompose

\frac{f (m^{'})}{m^{'}} - \frac{f (n^{'})}{n^{'}} = \frac{f (m^{'}) - f (n^{'})}{m^{'}} + f (n^{'}) (\frac{1}{m^{'}} - \frac{1}{n^{'}}) = : D_{1} + D_{2} .

We treat

D_{1}

(the oscillatory part) and

D_{2}

(the remainder from denominators) separately.

Case 3a: the

D_{1}

term (contractive contribution). A direct computation with

m = 3 m^{'} + 1

,

n = 3 n^{'} + 1

shows that there exists a constant

C_{α} \geq 1

depending only on

α

such that

\frac{W_{α} (m, n)}{W_{α} (m^{'}, n^{'})} \leq C_{α}

(42)

for all

m \neq n

with

m \equiv n \equiv 4 (mod 6)

. (One expands

m n

,

m + n

, and

| m - n |

in terms of

m^{'}, n^{'}

, and bounds the ratios uniformly; the details are routine.)

Thus

W_{α} (m, n) \frac{| f (m^{'}) - f (n^{'}) |}{m^{'}} \leq C_{α} W_{α} (m^{'}, n^{'}) \frac{| f (m^{'}) - f (n^{'}) |}{m^{'}} .

Now use that

m^{'} ≍ 6^{j - 1}

for

m \in I_{j}

with

m \equiv 4 (mod 6)

, so

1 / m^{'} ≪ 6^{- (j - 1)}

. Among the

O (6^{j})

indices in

I_{j}

, only a proportion

≍ 1 / 6

lie in the active residue class

4 (mod 6)

. Applying Cauchy–Schwarz to the collection of such pairs in

I_{j}

and using this

1 / 6

density, one obtains the averaged bound

ϑ^{j} sup_{\begin{matrix} m, n \in I_{j} \\ m \equiv n \equiv 4 (6) \end{matrix}} W_{α} (m, n) | D_{1} | \leq \frac{C_{α}}{\sqrt{6}} ϑ^{j - 1} sup_{m^{'}, n^{'}} W_{α} (m^{'}, n^{'}) | f (m^{'}) - f (n^{'}) |,

where

(m^{'}, n^{'})

range over the corresponding preimage pairs. (The factor

1 / \sqrt{6}

is the standard gain from passing from a

1 / 6

-density subset of indices to an

L^{2}

-type control of the supremum.)

Taking the supremum over all admissible

(m^{'}, n^{'})

and summing over j gives

\sum_{j \geq 0} ϑ^{j} sup_{\begin{matrix} m, n \in I_{j} \\ m \equiv n \equiv 4 (6) \end{matrix}} W_{α} (m, n) | D_{1} | \leq \frac{C_{α}}{\sqrt{6}} ϑ \sum_{j \geq 1} ϑ^{j - 1} sup_{m^{'}, n^{'} \in I_{j - 1}} W_{α} (m^{'}, n^{'}) | f (m^{'}) - f (n^{'}) | .

By the definition of

{[f]}_{tree}

, the right-hand side is

\leq \frac{C_{α}}{\sqrt{6}} ϑ {[f]}_{tree} .

This yields the desired contribution with contraction factor

λ_{odd} (α, ϑ) \leq (C_{α} / \sqrt{6}) ϑ

from the

D_{1}

term.

Case 3b: the

D_{2}

term (error controlled by

{∥ f ∥}_{1}

). We have

| D_{2} | = | f (n^{'}) | |\frac{1}{m^{'}} - \frac{1}{n^{'}}| = | f (n^{'}) | \frac{| m^{'} - n^{'} |}{m^{'} n^{'}} .

Since

| m - n | = 3 | m^{'} - n^{'} |

,

W_{α} (m, n) | D_{2} | = \frac{m n}{| m - n | {(m + n)}^{α}} | f (n^{'}) | \frac{| m^{'} - n^{'} |}{m^{'} n^{'}} = \frac{m n}{3 {(m + n)}^{α} m^{'} n^{'}} | f (n^{'}) | .

For

m, n \in I_{j}

one has

m n ≍ 6^{2 j}

,

m + n ≍ 6^{j}

,

m^{'} n^{'} ≍ 6^{2 j - 2}

, so

W_{α} (m, n) | D_{2} | \leq C 6^{- α j} | f (n^{'}) |

for some constant C depending only on

α

. Hence

ϑ^{j} sup_{\begin{matrix} m, n \in I_{j} \\ m \equiv n \equiv 4 (6) \end{matrix}} W_{α} (m, n) | D_{2} | \leq C {(ϑ 6^{- α})}^{j} sup_{n^{'}} | f (n^{'}) | .

Each

n^{'}

arises from at most a bounded number of

(m, n, j)

, and

ϑ 6^{- α} < 1

for fixed

ϑ \in (0, 1)

and

α \in (0, 1)

, so summing over j and using

| f (n^{'}) | \leq {∥ f ∥}_{1} / n^{'}

shows that the total

D_{2}

contribution is bounded by

\sum_{j \geq 0} ϑ^{j} sup_{\begin{matrix} m, n \in I_{j} \\ m \equiv n \equiv 4 (6) \end{matrix}} W_{α} (m, n) | D_{2} | \leq C_{odd, 2} {∥ f ∥}_{1}

for some constant

C_{odd, 2} > 0

independent of f.

Conclusion. Combining the three cases, we obtain

{[P_{odd} f]}_{tree} = \sum_{j \geq 0} ϑ^{j} A_{j} (f) \leq \frac{C_{α}}{\sqrt{6}} ϑ {[f]}_{tree} + (C_{odd, 1} + C_{odd, 2}) {∥ f ∥}_{1} .

Setting

C_{odd} : = C_{odd, 1} + C_{odd, 2}

yields (40) with

λ_{odd} (α, ϑ) \leq (C_{α} / \sqrt{6}) ϑ

, as claimed. □

4.5. From Boundedness to the Lasota–Yorke Inequality on $B_{tree, σ}$

Lemma 11

(Invariance and boundedness on

B_{tree, σ}

). Let

0 < α < 1

,

0 < ϑ < 1

, and

σ > 1

. Then the backward Collatz transfer operator P maps

B_{tree, σ}

into itself and is bounded: there exists

C > 0

such that

{∥ P f ∥}_{tree, σ} \leq C {∥ f ∥}_{tree, σ} for all f \in B_{tree, σ} .

Proof.

Using the even/odd decomposition,

(P f) (n) = (P_{even} f) (n) + (P_{odd} f) (n) = \frac{f (2 n)}{2 n} + 1_{{n \equiv 4 (6)}} \frac{f (\frac{n - 1}{3})}{(n - 1) / 3} .

We show both

{∥ P f ∥}_{σ}

and

{[P f]}_{tree}

are bounded by

{∥ f ∥}_{tree, σ}

.

1. Weighted $ℓ_{σ}^{1}$ bound. For the even part, substitute

m = 2 n

:

{∥ P_{even} f ∥}_{σ} = \sum_{n \geq 1} \frac{| f (2 n) |}{2 n} n^{- σ} = \sum_{\begin{matrix} m \geq 1 \\ m even \end{matrix}} \frac{| f (m) |}{m} {(\frac{m}{2})}^{- σ} = 2^{σ} \sum_{\begin{matrix} m \geq 1 \\ m even \end{matrix}} | f (m) | m^{- (σ + 1)} \leq 2^{σ} {∥ f ∥}_{σ} .

For the odd part, write

m = (n - 1) / 3

(so

n = 3 m + 1

and

m \geq 1

):

{∥ P_{odd} f ∥}_{σ} = \sum_{\begin{matrix} n \geq 1 \\ n \equiv 4 (6) \end{matrix}} \frac{| f ((n - 1) / 3) |}{(n - 1) / 3} n^{- σ} = \sum_{\begin{matrix} m \geq 1 \\ m odd \end{matrix}} \frac{| f (m) |}{m} {(3 m + 1)}^{- σ} \leq 3^{- σ} \sum_{m \geq 1} | f (m) | m^{- (σ + 1)} \leq 3^{- σ} {∥ f ∥}_{σ} .

Hence

{∥ P f ∥}_{σ} \leq (2^{σ} + 3^{- σ}) {∥ f ∥}_{σ} \leq (2^{σ} + 3^{- σ}) {∥ f ∥}_{tree, σ} .

(43)

2. Tree seminorm bound. By subadditivity,

{[P f]}_{tree} \leq {[P_{even} f]}_{tree} + {[P_{odd} f]}_{tree} .

From Lemma 8 (even branch on

B_{tree}

),

{[P_{even} f]}_{tree} \leq 2^{- (1 - α)} {[f]}_{tree} + C_{even} {∥ f ∥}_{1} .

From Lemma 10 (odd branch on

B_{tree}

),

{[P_{odd} f]}_{tree} \leq λ_{odd} (α, ϑ) {[f]}_{tree} + C_{odd} {∥ f ∥}_{1}, λ_{odd} (α, ϑ) \leq \frac{C_{α}}{\sqrt{6}} ϑ .

To lift the weak term from

{∥ \cdot ∥}_{1}

to

{∥ \cdot ∥}_{σ}

, we revisit the remainder estimates (the “denominator” terms) in the proofs. For the even branch remainder,

W_{α} (u, v) |f (2 v) (\frac{1}{2 u} - \frac{1}{2 v})| ≪ 6^{- α j} | f (2 v) | (u, v \in I_{j}),

so

ϑ^{j} sup_{u, v \in I_{j}} W_{α} (u, v) |f (2 v) (\frac{1}{2 u} - \frac{1}{2 v})| ≪ ϑ^{j} 6^{- α j} \sum_{v \in I_{j}} | f (2 v) | = \sum_{v \in I_{j}} {(ϑ 6^{- α})}^{j} | f (2 v) | .

Because each v belongs to exactly one block

I_{j}

and

v ≍ 6^{j}

in that block, we have

{(ϑ 6^{- α})}^{j} \leq C {(2 v)}^{- σ} ⟺ ϑ^{j} \leq C 6^{- (σ - α) j},

which holds once we impose the admissibility condition

ϑ 6^{σ - α} < 1 .

(44)

Summing over j and v then gives a bound

≪ {∥ f ∥}_{σ}

for the even-branch remainder. The odd-branch denominator term is handled identically (replacing

2 v

by

n^{'} = (n - 1) / 3 ≍ 6^{j - 1}

), yielding again a bound

≪ {∥ f ∥}_{σ}

under (44). Renaming constants, we therefore have

{[P f]}_{tree} \leq (2^{- (1 - α)} + λ_{odd} (α, ϑ)) {[f]}_{tree} + C_{tree, σ} {∥ f ∥}_{σ} .

(45)

Finally, (43) and (45) yield

{∥ P f ∥}_{tree, σ} = {∥ P f ∥}_{σ} + {[P f]}_{tree} \leq (2^{σ} + 3^{- σ} + 2^{- (1 - α)} + λ_{odd} (α, ϑ) + C_{tree, σ}) {∥ f ∥}_{tree, σ} .

This proves boundedness of P on

B_{tree, σ}

. □
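The weighted bound (43) from the first step of the proof can be sanity-checked on finitely supported test functions. In the sketch below f is stored as a dictionary from indices to values, which is an assumption made for convenience, and `apply_P` and `sigma_norm` are hypothetical helper names; the check covers only the weak norm, not the tree seminorm.

```python
def sigma_norm(f, sigma):
    """Weighted l^1 norm: sum of |f(n)| / n^sigma over the support of f."""
    return sum(abs(v) / n ** sigma for n, v in f.items())

def apply_P(f):
    """Backward operator applied to a finitely supported f (dict n -> value).

    An even index m feeds n = m/2 through the even branch; an odd index m
    feeds n = 3m + 1, which is 4 (mod 6), through the odd branch. In both
    cases the transported value is f(m) / m.
    """
    g = {}
    for m, v in f.items():
        n = m // 2 if m % 2 == 0 else 3 * m + 1
        g[n] = g.get(n, 0.0) + v / m
    return g

sigma = 1.1
f = {n: (-1) ** n / n for n in range(1, 200)}   # arbitrary test function
lhs = sigma_norm(apply_P(f), sigma)
rhs = (2 ** sigma + 3 ** (-sigma)) * sigma_norm(f, sigma)
print(lhs, rhs)   # the left value should not exceed the right value
```

On such examples the constant in (43) is typically far from sharp, since the estimate discards a factor 1/m in each branch.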

Proposition 2

(Lasota–Yorke inequality on

B_{tree, σ}

). Let

0 < α < 1

,

0 < ϑ < 1

, and

σ > 1

satisfy the admissibility condition (44). Then there exists a constant

C_{LY, σ} > 0

such that for all

f \in B_{tree, σ}

,

{[P f]}_{tree} \leq λ (α, ϑ) {[f]}_{tree} + C_{LY, σ} {∥ f ∥}_{σ}, λ (α, ϑ) : = 2^{- (1 - α)} + λ_{odd} (α, ϑ),

(46)

with

λ_{odd} (α, ϑ) \leq (C_{α} / \sqrt{6}) ϑ

. In particular, if

λ (α, ϑ) < 1

then P is strictly contracting in the strong seminorm

{[\cdot]}_{tree}

up to a controlled

{∥ \cdot ∥}_{σ}

–perturbation.

Proof.

Combine the even/odd seminorm bounds from (45). □

Remark 3

(Parameter window). The lift from

{∥ \cdot ∥}_{1}

to

{∥ \cdot ∥}_{σ}

in the remainder terms uses only (44). A convenient choice, used later, is

(α, ϑ, σ) = (\frac{1}{2}, \frac{1}{5}, 1 + ε)

with any small

ε > 0

, since then

ϑ 6^{σ - α} = \frac{1}{5} 6^{ε + 1 / 2} < 1

. Together with the explicit odd-branch constant from Section 6, this yields

λ (α, ϑ) < 1

and hence quasi-compactness of P on

B_{tree, σ}

.
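The admissibility condition (44) is easy to evaluate for the parameter choice of this remark. The sketch below checks it and computes how large ε may be taken when α = 1/2 and ϑ = 1/5; the function name `admissible` and the variable `eps_max` are illustrative.

```python
import math

def admissible(alpha, theta, sigma):
    """Admissibility condition (44): theta * 6^(sigma - alpha) < 1."""
    return theta * 6 ** (sigma - alpha) < 1

alpha, theta = 0.5, 0.2

# With sigma = 1 + eps, condition (44) reads 6^(1/2 + eps) < 5,
# that is, eps < log_6(5) - 1/2.
eps_max = math.log(5, 6) - 0.5

print(admissible(alpha, theta, 1.01), eps_max)
```

So the window for ε is not merely infinitesimal: any ε below roughly 0.39 satisfies (44) for these values of α and ϑ.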

Corollary 1

(Essential spectral radius bound on

B_{tree, σ}

). Let

0 < α < 1

,

0 < ϑ < 1

, and

σ > 1

satisfy the admissibility condition (44). Assume the Lasota–Yorke inequality (46) and the compact embedding

B_{tree, σ} ↪ ℓ_{σ}^{1}

from Lemma 7. Then

P : B_{tree, σ} \to B_{tree, σ}

is quasi-compact and its essential spectral radius satisfies

ρ_{ess} (P ↾_{B_{tree, σ}}) \leq λ (α, ϑ) = 2^{- (1 - α)} + λ_{odd} (α, ϑ), λ_{odd} (α, ϑ) \leq \frac{C_{α}}{\sqrt{6}} ϑ .

(47)

Proof.

By (46) there exists

C_{LY, σ}

such that, for all

f \in B_{tree, σ}

,

{[P f]}_{tree} \leq λ (α, ϑ) {[f]}_{tree} + C_{LY, σ} {∥ f ∥}_{σ} .

This is a Doeblin–Fortet (Lasota–Yorke) inequality for the pair

{∥ \cdot ∥}_{strong} = {[\cdot]}_{tree}

and

{∥ \cdot ∥}_{weak} = {∥ \cdot ∥}_{σ} .

Since the unit ball of

B_{tree, σ}

is relatively compact in

ℓ_{σ}^{1}

by Lemma 7, the injection

B_{tree, σ} ↪ ℓ_{σ}^{1}

is compact. The Ionescu–Tulcea–Marinescu/Hennion quasi-compactness theorem then implies that P is quasi-compact on

B_{tree, σ}

with

ρ_{ess} (P ↾_{B_{tree, σ}}) \leq λ (α, ϑ) .

□

4.6. Quasi-Compactness of the Backward Operator

Lemma 12

(Odd-branch weight distortion at

α = \frac{1}{2}

). Let

W_{α} (m, n) = \frac{m n}{| m - n | {(m + n)}^{α}}

be the tree weight from (35) and let

m^{'} = (m - 1) / 3

,

n^{'} = (n - 1) / 3

. For

α = \frac{1}{2}

there exists an absolute constant

C_{0} = \frac{16}{3^{3 / 2}} < 3.1

such that for all

m \equiv n \equiv 4 (mod 6)

with

m \neq n

,

\frac{W_{1 / 2} (m, n)}{W_{1 / 2} (m^{'}, n^{'})} \leq C_{0} .

(48)

Consequently, the oscillatory part of the odd branch satisfies

λ_{odd} (\frac{1}{2}, ϑ) \leq \frac{C_{0}}{\sqrt{6}} ϑ,

as used in Lemma 10 and Lemma 13.

Proof.

Let

m \equiv n \equiv 4 (mod 6)

,

m \neq n

, and define

m^{'} = (m - 1) / 3

,

n^{'} = (n - 1) / 3

. Note that

m^{'}, n^{'} \in N

and

m^{'} \neq n^{'}

. Using the definitions,

W_{1 / 2} (m, n) = \frac{m n}{| m - n | {(m + n)}^{1 / 2}}, W_{1 / 2} (m^{'}, n^{'}) = \frac{m^{'} n^{'}}{| m^{'} - n^{'} | {(m^{'} + n^{'})}^{1 / 2}} .

Form the ratio and simplify:

\frac{W_{1 / 2} (m, n)}{W_{1 / 2} (m^{'}, n^{'})} = \frac{m n}{m^{'} n^{'}} \cdot \frac{| m^{'} - n^{'} |}{| m - n |} \cdot \frac{{(m^{'} + n^{'})}^{1 / 2}}{{(m + n)}^{1 / 2}} .

Since

m = 3 m^{'} + 1

and

n = 3 n^{'} + 1

, we have

| m - n | = 3 | m^{'} - n^{'} |

and

m + n = 3 (m^{'} + n^{'}) + 2

. Hence

\frac{W_{1 / 2} (m, n)}{W_{1 / 2} (m^{'}, n^{'})} = \frac{m n}{m^{'} n^{'}} \cdot \frac{1}{3} \cdot \frac{{(m^{'} + n^{'})}^{1 / 2}}{{(3 (m^{'} + n^{'}) + 2)}^{1 / 2}} .

(49)

We now bound the three factors on the right-hand side.

(i) The product ratio. Using

m = 3 m^{'} + 1 \leq 4 m^{'}

and

n = 3 n^{'} + 1 \leq 4 n^{'}

for all

m^{'}, n^{'} \geq 1

, we get

\frac{m n}{m^{'} n^{'}} = \frac{(3 m^{'} + 1) (3 n^{'} + 1)}{m^{'} n^{'}} \leq 16 .

(ii) The difference ratio. We already used

| m - n | = 3 | m^{'} - n^{'} |

, so this contributes the exact factor

1 / 3

.

(iii) The sum ratio. Since

3 (m^{'} + n^{'}) + 2 \geq 3 (m^{'} + n^{'})

, we obtain

\frac{{(m^{'} + n^{'})}^{1 / 2}}{{(3 (m^{'} + n^{'}) + 2)}^{1 / 2}} \leq \frac{{(m^{'} + n^{'})}^{1 / 2}}{{(3 (m^{'} + n^{'}))}^{1 / 2}} = \frac{1}{\sqrt{3}} .

Combining (i)–(iii) in (49) yields

\frac{W_{1 / 2} (m, n)}{W_{1 / 2} (m^{'}, n^{'})} \leq 16 \cdot \frac{1}{3} \cdot \frac{1}{\sqrt{3}} = \frac{16}{3^{3 / 2}} = : C_{0} .

This proves (48).

For the consequence on the oscillatory part of the odd branch in the Lasota–Yorke estimate, recall the standard decomposition in the proof of Lemma 10: when both

m, n \in I_{j}

are in the active residue class

4 (mod 6)

, the

D_{1}

(oscillatory) term contributes

W_{1 / 2} (m, n) \frac{| f (m^{'}) - f (n^{'}) |}{m^{'}} .

Using (48) and the relation

m^{'} ≍ 6^{j - 1}

for

m \in I_{j}

, one passes from level j to level

j - 1

with a loss bounded by

C_{0}

; the block weight

ϑ^{j}

supplies the one-step factor

ϑ

, and restricting to the active residue class has relative density

1 / 6

, which produces a Cauchy–Schwarz gain

1 / \sqrt{6}

in the passage from a subset supremum to the block-level control (see the proof of Lemma 10 for the standard

L^{2}

averaging step). Altogether,

\sum_{j \geq 0} ϑ^{j} sup_{\begin{matrix} m, n \in I_{j} \\ m \equiv n \equiv 4 (6) \end{matrix}} W_{1 / 2} (m, n) \frac{| f (m^{'}) - f (n^{'}) |}{m^{'}} \leq \frac{C_{0}}{\sqrt{6}} ϑ {[f]}_{tree},

which is the claimed bound

λ_{odd} (\frac{1}{2}, ϑ) \leq (C_{0} / \sqrt{6}) ϑ

. □
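The distortion bound (48) can be probed numerically by scanning pairs in the active residue class. The sketch below evaluates the weight ratio over all pairs m ≠ n with m ≡ n ≡ 4 (mod 6) below a cutoff; the cutoff 2000 and the helper names are arbitrary choices for illustration, and a finite scan is of course no substitute for the proof.

```python
import math

C0 = 16 / 3 ** 1.5   # the constant from (48)

def weight_half(u, v):
    """W_{1/2}(u, v) = u*v / (|u - v| * sqrt(u + v)), for u != v."""
    return u * v / (abs(u - v) * math.sqrt(u + v))

def ratio(m, n):
    """W_{1/2}(m, n) / W_{1/2}(m', n') with m' = (m-1)/3, n' = (n-1)/3."""
    m_pre, n_pre = (m - 1) // 3, (n - 1) // 3
    return weight_half(m, n) / weight_half(m_pre, n_pre)

# All indices congruent to 4 (mod 6) below the cutoff.
active = list(range(4, 2000, 6))
worst = max(ratio(m, n) for m in active for n in active if m != n)

print(worst, C0)   # the observed maximum stays below C0
```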

Lemma 13

(Explicit odd-branch constant). For

α = \frac{1}{2}

and

ϑ = \frac{1}{5}

there exist constants

C_{α} > 0

and

C_{odd} > 0

such that for all

f \in B_{tree, σ}

,

{[P_{odd} f]}_{tree} \leq λ_{odd} (α, ϑ) {[f]}_{tree} + C_{odd} {∥ f ∥}_{σ},

(50)

with

λ_{odd} (α, ϑ) \leq \frac{C_{α}}{\sqrt{6}} ϑ < 1 .

(51)

Proof.

We specialize the proof of Lemma 10 to

α = \frac{1}{2}

and

ϑ = \frac{1}{5}

, making the constants explicit.

Recall

(P_{odd} f) (n) = 1_{{n \equiv 4 (6)}} \frac{f (\frac{n - 1}{3})}{(n - 1) / 3},

and for each

j \geq 0

,

A_{j} (f) : = sup_{\begin{matrix} m, n \in I_{j} \\ m \neq n \end{matrix}} W_{α} (m, n) |P_{odd} f (m) - P_{odd} f (n)|, {[P_{odd} f]}_{tree} = \sum_{j \geq 0} ϑ^{j} A_{j} (f),

where

I_{j} = [6^{j}, 2 \cdot 6^{j})

and

W_{α} (m, n) = \frac{m n}{| m - n | {(m + n)}^{α}}

. We take

α = \frac{1}{2}

from now on, so

W_{1 / 2} (m, n) = \frac{m n}{| m - n | {(m + n)}^{1 / 2}} .

Fix

j \geq 0

and

m, n \in I_{j}

,

m \neq n

. As in Lemma 10, we distinguish three cases.

Case 1: neither m nor n is

4 (mod 6)

. Then

P_{odd} f (m) = P_{odd} f (n) = 0

and this pair contributes nothing to

A_{j} (f)

.

Case 2: exactly one of

m, n

is

4 (mod 6)

. Assume without loss of generality

m \equiv 4 (mod 6)

and

n ≢ 4 (mod 6)

. Set

k = (m - 1) / 3

. Then

P_{odd} f (m) - P_{odd} f (n) = \frac{f (k)}{k},

so

W_{1 / 2} (m, n) |P_{odd} f (m) - P_{odd} f (n)| = W_{1 / 2} (m, n) \frac{| f (k) |}{k} .

Since

m, n \in I_{j}

, we have

6^{j} \leq m, n < 2 \cdot 6^{j}

and

1 \leq | m - n | \leq 6^{j}

; hence

W_{1 / 2} (m, n) = \frac{m n}{| m - n | {(m + n)}^{1 / 2}} ≪ \frac{6^{2 j}}{6^{j} 6^{j / 2}} = 6^{(1 / 2) j} .

Also

k = (m - 1) / 3 ≍ 6^{j - 1}

. Thus for some absolute constant

C_{1}

,

ϑ^{j} W_{1 / 2} (m, n) \frac{| f (k) |}{k} \leq C_{1} {(ϑ 6^{1 / 2})}^{j} | f (k) | .

Now

ϑ = \frac{1}{5}

and

6^{1 / 2} < 2.5

, so

ϑ 6^{1 / 2} < 1

. Each k arises (from such a case) for at most one j and one m, and

| f (k) | = k^{σ} \frac{| f (k) |}{k^{σ}} \leq k^{σ} {∥ f ∥}_{σ} ≪ 6^{σ j} {∥ f ∥}_{σ} .

Summing over j and all such pairs gives

\sum_{j \geq 0} ϑ^{j} sup_{\begin{matrix} m, n \in I_{j} \\ exactly one \equiv 4 (6) \end{matrix}} W_{1 / 2} (m, n) |P_{odd} f (m) - P_{odd} f (n)| \leq C_{odd, 1} {∥ f ∥}_{σ}

for some

C_{odd, 1} > 0

depending only on

σ

. Thus Case 2 contributes only to the weak term.

Case 3: both m and n are

4 (mod 6)

. Set

m^{'} = \frac{m - 1}{3}, n^{'} = \frac{n - 1}{3} .

Then

P_{odd} f (m) = \frac{f (m^{'})}{m^{'}}, P_{odd} f (n) = \frac{f (n^{'})}{n^{'}} .

We decompose

\frac{f (m^{'})}{m^{'}} - \frac{f (n^{'})}{n^{'}} = \underbrace{\frac{f (m^{'}) - f (n^{'})}{m^{'}}}_{= : D_{1}} + \underbrace{f (n^{'}) (\frac{1}{m^{'}} - \frac{1}{n^{'}})}_{= : D_{2}} .

Case 3a: the

D_{1}

term (contraction part). We first compare the weights

W_{1 / 2} (m, n)

and

W_{1 / 2} (m^{'}, n^{'})

.

Using

m = 3 m^{'} + 1

,

n = 3 n^{'} + 1

we compute

\frac{W_{1 / 2} (m, n)}{W_{1 / 2} (m^{'}, n^{'})} = \frac{(3 m^{'} + 1) (3 n^{'} + 1)}{3 m^{'} n^{'}} \frac{{(m^{'} + n^{'})}^{1 / 2}}{{(3 (m^{'} + n^{'}) + 2)}^{1 / 2}} .

For all

m^{'}, n^{'} \geq 1

,

3 m^{'} + 1 \leq 4 m^{'}, 3 n^{'} + 1 \leq 4 n^{'}, 3 (m^{'} + n^{'}) + 2 \geq 3 (m^{'} + n^{'}),

so

\frac{W_{1 / 2} (m, n)}{W_{1 / 2} (m^{'}, n^{'})} \leq \frac{16}{3} \cdot \frac{1}{\sqrt{3}} = \frac{16}{3^{3 / 2}} = : C_{0} .

Thus

W_{1 / 2} (m, n) \frac{| f (m^{'}) - f (n^{'}) |}{m^{'}} \leq C_{0} W_{1 / 2} (m^{'}, n^{'}) \frac{| f (m^{'}) - f (n^{'}) |}{m^{'}} .

(52)

Next, since

m \in I_{j}

implies

m^{'} ≍ 6^{j - 1}

, we have

1 / m^{'} ≪ 6^{- (j - 1)}

. Moreover

(m^{'}, n^{'})

lie in a union of

O (1)

blocks of level

j - 1

(and possibly

j - 2

), so

W_{1 / 2} (m^{'}, n^{'}) | f (m^{'}) - f (n^{'}) | \leq ϑ^{- (j - 1)} {[f]}_{tree}

up to a fixed multiplicative constant (absorbed into

C_{0}

). Combining with (52),

ϑ^{j} W_{1 / 2} (m, n) \frac{| f (m^{'}) - f (n^{'}) |}{m^{'}} \leq C_{0} ϑ^{j} 6^{- (j - 1)} ϑ^{- (j - 1)} {[f]}_{tree} = C_{0} ϑ {(\frac{ϑ}{6})}^{j - 1} {[f]}_{tree} .

Summing over

j \geq 1

(the block I_{0} = {1} contains no pairs, so the j = 0 term vanishes) gives

\sum_{j \geq 0} ϑ^{j} A_{j}^{(1)} (f) \leq \frac{C_{0} ϑ}{1 - ϑ / 6} {[f]}_{tree} .

Define

λ_{odd} : = \frac{C_{0} ϑ}{1 - ϑ / 6} and C_{α} : = \frac{\sqrt{6} C_{0}}{1 - ϑ / 6} .

Then

λ_{odd} = \frac{C_{α}}{\sqrt{6}} ϑ .

For

ϑ = \frac{1}{5}

we have

1 - ϑ / 6 = 1 - \frac{1}{30} > 0

and numerically

C_{0} = \frac{16}{3^{3 / 2}} < 3.1, λ_{odd} = \frac{C_{0} ϑ}{1 - ϑ / 6} < 0.64 < 1,

so indeed

λ_{odd} < 1

and

λ_{odd} = (C_{α} / \sqrt{6}) ϑ

with this choice of

C_{α}

.

Case 3b: the

D_{2}

term (weak contribution). We have

| D_{2} | = | f (n^{'}) | \frac{| m^{'} - n^{'} |}{m^{'} n^{'}} .

Using

| m - n | = 3 | m^{'} - n^{'} |

and the same scale relations as above,

W_{1 / 2} (m, n) | D_{2} | = \frac{m n}{| m - n | {(m + n)}^{1 / 2}} | f (n^{'}) | \frac{| m^{'} - n^{'} |}{m^{'} n^{'}} ≪ 6^{- j / 2} | f (n^{'}) | .

Thus

ϑ^{j} W_{1 / 2} (m, n) | D_{2} | ≪ {(ϑ 6^{- 1 / 2})}^{j} | f (n^{'}) | .

Each

n^{'}

arises from at most a bounded number of

(m, n, j)

, and

ϑ 6^{- 1 / 2} < 1

, so summing over j and using

| f (n^{'}) | \leq {(n^{'})}^{σ} {∥ f ∥}_{σ}

yields

\sum_{j \geq 0} ϑ^{j} sup_{\begin{matrix} m, n \in I_{j} \\ m \equiv n \equiv 4 (6) \end{matrix}} W_{1 / 2} (m, n) | D_{2} | \leq C_{odd, 2} {∥ f ∥}_{σ}

for some

C_{odd, 2} > 0

. Combining the three cases, we obtain

{[P_{odd} f]}_{tree} \leq λ_{odd} {[f]}_{tree} + (C_{odd, 1} + C_{odd, 2}) {∥ f ∥}_{σ} .

Setting

C_{odd} : = C_{odd, 1} + C_{odd, 2}

and using the explicit expression

λ_{odd} = (C_{α} / \sqrt{6}) ϑ

with

λ_{odd} < 1

for

(α, ϑ) = (\frac{1}{2}, \frac{1}{5})

gives (50) and (51). □

Proposition 3

(Verified Lasota–Yorke contraction). Let

(α, ϑ) = (\frac{1}{2}, \frac{1}{5})

and

σ > 1

(with the admissibility condition

ϑ 6^{σ - α} < 1

). Define

λ_{LY} : = 2^{- (1 - α)} ϑ + λ_{odd} (α, ϑ), λ_{odd} (α, ϑ) \leq \frac{C_{α}}{\sqrt{6}} ϑ,

with

C_{α} = \frac{\sqrt{6} C_{0}}{1 - ϑ / 6}

as in the proof of Lemma 13 and

C_{0} = 16 / 3^{3 / 2}

from Lemma 12. Then

λ_{LY} < 1

, and for all

f \in B_{tree, σ}

,

{[P f]}_{tree} \leq λ_{LY} {[f]}_{tree} + C_{LY} {∥ f ∥}_{σ},

(53)

for some constant

C_{LY} > 0

depending only on the fixed parameters and the block geometry.

Proof.

We use the decomposition

P = P_{even} + P_{odd}

and the branchwise estimates already established.

1. Combine even and odd branch inequalities. For any

f \in B_{tree, σ}

,

{[P f]}_{tree} \leq {[P_{even} f]}_{tree} + {[P_{odd} f]}_{tree} .

By the even-branch Lasota–Yorke estimate (Lemma 8, specialized to

B_{tree, σ}

), there exists

C_{even} > 0

such that for

(α, ϑ)

fixed,

{[P_{even} f]}_{tree} \leq 2^{- (1 - α)} ϑ {[f]}_{tree} + C_{even} {∥ f ∥}_{σ} .

(54)

By the explicit odd-branch lemma (Lemma 13), for

α = \frac{1}{2}

and

ϑ = \frac{1}{5}

there exist

C_{α} > 0

and

C_{odd} > 0

such that

{[P_{odd} f]}_{tree} \leq λ_{odd} (α, ϑ) {[f]}_{tree} + C_{odd} {∥ f ∥}_{σ},

(55)

with

λ_{odd} (α, ϑ) \leq \frac{C_{α}}{\sqrt{6}} ϑ < 1 .

Adding (54) and (55) gives

{[P f]}_{tree} \leq (2^{- (1 - α)} ϑ + λ_{odd} (α, ϑ)) {[f]}_{tree} + (C_{even} + C_{odd}) {∥ f ∥}_{σ} .

Define

λ_{LY} : = 2^{- (1 - α)} ϑ + λ_{odd} (α, ϑ), C_{LY} : = C_{even} + C_{odd},

to obtain (53).

2. Verification that $λ_{LY} < 1$. We now check that with

(α, ϑ) = (\frac{1}{2}, \frac{1}{5})

the constant

λ_{LY}

is strictly less than 1.

First,

2^{- (1 - α)} ϑ = 2^{- 1 / 2} \cdot \frac{1}{5} = \frac{1}{5 \sqrt{2}} \approx 0.1414 .

From the proof of Lemma 13 we have

λ_{odd} (α, ϑ) = \frac{C_{α}}{\sqrt{6}} ϑ,

with an explicit choice

C_{α} = \frac{\sqrt{6} C_{0}}{1 - ϑ / 6}, C_{0} = \frac{16}{3^{3 / 2}},

so that

λ_{odd} (α, ϑ) = \frac{C_{0} ϑ}{1 - ϑ / 6} .

For

ϑ = \frac{1}{5}

this yields

λ_{odd} (\frac{1}{2}, \frac{1}{5}) = \frac{C_{0} / 5}{1 - 1 / 30} = \frac{C_{0}}{5} \cdot \frac{30}{29} = \frac{6 C_{0}}{29} .

Since

C_{0} = 16 / 3^{3 / 2} < 3.1

, we obtain

λ_{odd} (\frac{1}{2}, \frac{1}{5}) < \frac{6 \cdot 3.1}{29} < 0.642 < 1 .

Therefore

λ_{LY} = 2^{- 1 / 2} \cdot \frac{1}{5} + λ_{odd} (\frac{1}{2}, \frac{1}{5}) < 0.1415 + 0.642 < 0.79 < 1 .

In particular,

λ_{LY}

is a strict contraction factor, depending only on the fixed parameters.

This proves both the inequality (53) and the bound

λ_{LY} < 1

. □
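The arithmetic in this proof is easy to check numerically. The following Python sketch evaluates the constants exactly as they are defined above, with $C_{0} = 16 / 3^{3/2}$ and $λ_{odd} = C_{0} ϑ / (1 - ϑ/6)$. The function names and the bisection helper `critical_theta` are illustrative assumptions and not part of the original argument; floating-point output is only a sanity check on the stated bounds.

```python
import math

# C_0 = 16 / 3^{3/2}, the constant quoted from Lemma 12 in the text.
C0 = 16 / 3 ** 1.5


def lambda_odd(theta: float) -> float:
    """Odd-branch constant C_0 * theta / (1 - theta / 6)."""
    return C0 * theta / (1 - theta / 6)


def lambda_ly(alpha: float, theta: float) -> float:
    """Combined constant 2^{-(1 - alpha)} * theta + lambda_odd(theta)."""
    return 2 ** (-(1 - alpha)) * theta + lambda_odd(theta)


def critical_theta(alpha: float, tol: float = 1e-12) -> float:
    """Bisection for the theta in (0, 1) where lambda_ly(alpha, theta) = 1.

    lambda_ly is increasing in theta on (0, 1), so the crossing is unique.
    This helper is a hypothetical addition used only for illustration.
    """
    lo, hi = 0.0, 1.0
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if lambda_ly(alpha, mid) < 1:
            lo = mid
        else:
            hi = mid
    return lo


def sigma_upper_bound(alpha: float, theta: float) -> float:
    """Exclusive upper bound on sigma from theta * 6^(sigma - alpha) < 1."""
    return alpha + math.log(1 / theta) / math.log(6)


if __name__ == "__main__":
    alpha, theta = 0.5, 0.2
    print(f"C_0        = {C0:.6f}")
    print(f"lambda_odd = {lambda_odd(theta):.6f}")
    print(f"lambda_LY  = {lambda_ly(alpha, theta):.6f}")
    print(f"theta*     = {critical_theta(alpha):.6f}")
    print(f"sigma  <   {sigma_upper_bound(alpha, theta):.6f}")
```

Under these definitions the crossing point returned by `critical_theta(0.5)` lies slightly above 1/4, so the choice ϑ = 1/5 appears to leave some margin. The helper `sigma_upper_bound` rewrites the admissibility condition as σ < α + log_6(1/ϑ).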

Lemma 14

(Asymptotic form of the invariant density). Let P act on

B_{tree, σ}

with

σ > 1

and suppose P is quasi–compact with spectral gap and no other spectrum on the unit circle. Let

h \in B_{tree, σ}

be the unique positive right eigenvector with

P h = h

and normalize the dual eigenfunctional ϕ by

ϕ (h) = 1

. Then there exist constants

c > 0

and

δ > 0

(depending only on the parameters of the Lasota–Yorke framework) such that

h (n) = \frac{c}{n} (1 + O (n^{- δ})) (n \to \infty) .

Proof.

Set

H (s) : = \sum_{n \geq 1} h (n) n^{- s}

for

ℜ (s) > σ

. We proceed in three steps.

Step 1 (Meromorphic structure of H and the pole at

s = 1

). By the Dirichlet transform intertwinement (Section 3) and the quasi–compact spectral calculus on

B_{tree, σ}

(Section 4), Dirichlet transforms of

B_{tree, σ}

-functions admit meromorphic continuation across a half–plane

ℜ (s) > 1 - δ_{0}

for some

δ_{0} \in (0, 1)

, with at most a simple pole at

s = 1

whose residue is computed by the spectral projector

Π f = ϕ (f) h

. Applying this to

f = h

and using

P h = h

, we obtain that H extends meromorphically to

ℜ (s) > 1 - δ_{0}

with the expansion

H (s) = \frac{c}{s - 1} + G (s), ℜ (s) > 1 - δ_{0},

(56)

where

c : = ϕ (1) > 0

and G is holomorphic on

ℜ (s) > 1 - δ_{0}

and of at most polynomial growth in vertical strips.1

Step 2 (Tauberian step: summatory asymptotic). Define the summatory function

H^{#} (x) : = \sum_{n \leq x} h (n)

. Since H has no singularities on

{ℜ (s) = 1}

other than the simple pole at

s = 1

and satisfies the growth hypothesis of the Wiener–Ikehara–Delange Tauberian theorem [3] in the half–plane

ℜ (s) > 1 - δ_{0}

, it follows that

H^{#} (x) = c log x + C_{0} + O (x^{- δ_{1}}) (x \to \infty),

(57)

for some constants

C_{0} \in R

and

δ_{1} \in (0, δ_{0})

(the precise

δ_{1}

is inherited from the width

δ_{0}

and strip–growth of G). See, e.g., Delange’s theorem or the Ikehara–Ingham variant.

Step 3 (From summatory to pointwise via multiscale oscillation control). Write

a_{n} : = n h (n)

and let

X > 1

. For each dyadic–triadic block

I_{j} = [6^{j}, 2 \cdot 6^{j})

defining the strong seminorm

{[\cdot]}_{tree, σ}

, the Lasota–Yorke inequality yields a uniform oscillation bound

{osc}_{I_{j}} (a) : = sup_{n, m \in I_{j}} | a_{n} - a_{m} | \leq C 6^{- j η}

(58)

for some

C > 0

and

η \in (0, 1)

depending only on the Lasota–Yorke parameters (this is the standard consequence of the contraction of the strong seminorm together with boundedness in the weak norm). In particular

a_{n}

varies slowly on each block

I_{j}

.

By summation by parts on each

I_{j}

and (57), we obtain the averaged estimate

\frac{1}{| I_{j} |} \sum_{n \in I_{j}} a_{n} = \frac{1}{| I_{j} |} \sum_{n \in I_{j}} n h (n) = c + O (6^{- j δ_{1}}) .

Combining this block average with the oscillation control (58) gives, for every

n \in I_{j}

,

a_{n} = c + O (6^{- j δ}), δ : = min {δ_{1}, η} .

Since

n ≍ 6^{j}

on

I_{j}

, this is equivalent to

n h (n) = c + O (n^{- δ}),

hence

h (n) = \frac{c}{n} (1 + O (n^{- δ})),

as claimed. □
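The last step of this proof combines a block average with an oscillation bound through the triangle inequality $| a_{n} - c | \leq | \bar{a}_{j} - c | + {osc}_{I_{j}} (a)$, where $\bar{a}_{j}$ is the mean of $a$ over $I_{j}$. The Python sketch below checks this mechanism on a synthetic sequence $a_{n} = c (1 + n^{-1/2})$. Both the sequence and the value $c = 2$ are hypothetical stand-ins, since the actual values of $n h (n)$ are not computed here.

```python
def block(j: int) -> range:
    """Block I_j = [6^j, 2 * 6^j) of positive integers."""
    return range(6 ** j, 2 * 6 ** j)


def block_stats(a, j: int):
    """Return (block average, oscillation) of the sequence a on I_j."""
    values = [a(n) for n in block(j)]
    average = sum(values) / len(values)
    oscillation = max(values) - min(values)  # sup |a_n - a_m| over the block
    return average, oscillation


def pointwise_bound_holds(a, c: float, j: int, tol: float = 1e-12) -> bool:
    """Check |a_n - c| <= |avg_j - c| + osc_j for every n in I_j."""
    average, oscillation = block_stats(a, j)
    bound = abs(average - c) + oscillation + tol
    return all(abs(a(n) - c) <= bound for n in block(j))


# Hypothetical test sequence: a_n = c * (1 + n^{-1/2}) with c = 2.
C_LIMIT = 2.0


def synthetic(n: int) -> float:
    return C_LIMIT * (1 + n ** -0.5)


if __name__ == "__main__":
    for j in range(5):
        avg, osc = block_stats(synthetic, j)
        ok = pointwise_bound_holds(synthetic, C_LIMIT, j)
        print(j, round(avg, 6), round(osc, 6), ok)
```

For this synthetic sequence both the deviation of the block average from $c$ and the oscillation shrink as $j$ grows, which is the qualitative behavior that (57) and (58) are used to establish for $a_{n} = n h (n)$.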

We now record the standard consequence of the Lasota–Yorke inequality and the compact embedding of

B_{tree}

into

ℓ^{1}

.

Theorem 3

(Quasi-compactness on

B_{tree, σ}

). Let

0 < α < 1

,

0 < ϑ < 1

, and

σ > 1

. Assume that the Lasota–Yorke constant

λ (α, ϑ) : = 2^{- (1 - α)} + λ_{odd} (α, ϑ)

satisfies

λ (α, ϑ) < 1

, where

λ_{odd} (α, ϑ)

is as in Lemma 10. Then the backward transfer operator P acting on

B_{tree, σ}

is quasi-compact, and its essential spectral radius satisfies

ρ_{ess} (P |_{B_{tree, σ}}) \leq λ (α, ϑ) < 1 .

(59)

Proof.

We work on the Banach space

B_{tree, σ}

with norm

{∥ \cdot ∥}_{tree, σ} = {∥ \cdot ∥}_{σ} + {[\cdot]}_{tree}

, where

{∥ \cdot ∥}_{σ}

is the weighted

ℓ_{σ}^{1}

-norm and

{[\cdot]}_{tree}

is the tree seminorm defined in Section 4.3.

Step 1: Lasota–Yorke inequality. By Proposition 2 (applied in the weighted setting, with

{∥ f ∥}_{1}

replaced by

{∥ f ∥}_{σ}

) we have, for all

f \in B_{tree, σ}

,

{[P f]}_{tree} \leq λ (α, ϑ) {[f]}_{tree} + C_{LY} {∥ f ∥}_{σ},

(60)

with

λ (α, ϑ) < 1

by assumption. On the weak norm side, since P is bounded on

ℓ_{σ}^{1}

, there exists

C_{σ} > 0

(e.g.

C_{σ} = Λ_{σ}

from (17)) such that

{∥ P f ∥}_{σ} \leq C_{σ} {∥ f ∥}_{σ} for all f \in B_{tree, σ} .

(61)

Thus P satisfies a standard two-norm Lasota–Yorke inequality on

B_{tree, σ}

with strong seminorm

{∥ \cdot ∥}_{s} : = {[\cdot]}_{tree}

and weak norm

{∥ \cdot ∥}_{w} : = {∥ \cdot ∥}_{σ}

:

{∥ P f ∥}_{s} \leq λ {∥ f ∥}_{s} + C_{LY} {∥ f ∥}_{w}, {∥ P f ∥}_{w} \leq C_{σ} {∥ f ∥}_{w} .

(62)

Step 2: Compact embedding. By Lemma 7, the embedding

J : (B_{tree, σ}, {∥ \cdot ∥}_{tree, σ}) ↪ (ℓ_{σ}^{1}, {∥ \cdot ∥}_{σ})

is compact. Since

{∥ \cdot ∥}_{w} = {∥ \cdot ∥}_{σ}

is exactly the weak norm used in (62), this shows that the unit ball of

B_{tree, σ}

is relatively compact for the weak norm.

Step 3: Application of Ionescu–Tulcea–Marinescu / Hennion. We now invoke the standard quasi-compactness criterion (see, e.g., Ionescu–Tulcea and Marinescu, or Hennion’s theorem): if a bounded operator T on a Banach space X satisfies

(i): a Lasota–Yorke inequality ${∥ T x ∥}_{s} \leq λ {∥ x ∥}_{s} + C {∥ x ∥}_{w}$ with $λ < 1$,
(ii): a weak bound ${∥ T x ∥}_{w} \leq C^{'} {∥ x ∥}_{w}$, and
(iii): the injection $(X, {∥ \cdot ∥}_{s}) ↪ (X, {∥ \cdot ∥}_{w})$ maps the unit ball of $(X, {∥ \cdot ∥}_{s})$ to a relatively compact set,

then T is quasi-compact on X and its essential spectral radius satisfies

ρ_{ess} (T) \leq λ .

Conditions (i)–(iii) are exactly (62) and Lemma 7 for

T = P

and

X = B_{tree, σ}

. Therefore P is quasi-compact on

B_{tree, σ}

and

ρ_{ess} (P |_{B_{tree, σ}}) \leq λ (α, ϑ) < 1,

which is (59). □
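The two-norm inequality (62) is typically used in iterated form. A minimal Python sketch, assuming only the two scalar recursions $s_{k + 1} = λ s_{k} + C_{LY} w_{k}$ and $w_{k + 1} = C_{σ} w_{k}$ for upper bounds on ${∥ P^{k} f ∥}_{s}$ and ${∥ P^{k} f ∥}_{w}$, is given below. The numerical values of $C_{LY}$, $C_{σ}$ and the initial bounds are hypothetical placeholders; only $λ = 0.79$ is taken from the bound in Proposition 3.

```python
def iterate_ly(lam, c_ly, c_sigma, s0, w0, steps):
    """Iterate the bound pair (s_k, w_k) implied by a two-norm inequality.

    s_k bounds the strong seminorm of P^k f, w_k bounds its weak norm:
        s_{k+1} = lam * s_k + c_ly * w_k,   w_{k+1} = c_sigma * w_k.
    Returns the list [(s_0, w_0), ..., (s_steps, w_steps)].
    """
    bounds = [(s0, w0)]
    s, w = s0, w0
    for _ in range(steps):
        s, w = lam * s + c_ly * w, c_sigma * w
        bounds.append((s, w))
    return bounds


if __name__ == "__main__":
    # lam = 0.79 is the bound on lambda_LY from the text; the other
    # constants are illustrative assumptions.
    for k, (s, w) in enumerate(iterate_ly(0.79, 10.0, 1.0, 100.0, 1.0, 10)):
        print(k, round(s, 4), round(w, 4))
```

In the special case $C_{σ} \leq 1$ (an extra assumption, not claimed in the text) the recursion gives $s_{k} \leq λ^{k} s_{0} + \frac{C_{LY}}{1 - λ} w_{0}$, so the strong bound stays uniformly bounded in $k$. For $C_{σ} > 1$ the weak bound grows and the same recursion only controls $s_{k}$ relative to $w_{k}$.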

Remark 4

(On the choice of parameters). The explicit bound (41) shows that

λ_{odd} (α, ϑ)

decreases linearly with ϑ. For fixed α, one can therefore choose ϑ sufficiently small so that

λ (α, ϑ) < 1

, provided the constant

C_{α}

is effectively controlled. Subsequent sections make this optimization quantitative by computing

C_{α}

and exhibiting admissible parameter pairs

(α, ϑ)

that give a strict spectral gap.

The Lasota–Yorke framework developed here supplies the functional-analytic backbone for the spectral approach to the Collatz problem: once explicit parameters with

λ (α, ϑ) < 1

are verified, the quasi-compactness and spectral gap of P on

B_{tree}

follow, and the spectral criteria of Section 4 can be invoked to constrain or rule out non-terminating configurations.

5. Spectral Consequences and Effective Block Recursion

Having established in Section 4.4 that the backward Collatz operator P is quasi-compact on the multi-scale tree space

B_{tree}

, we now turn to the spectral consequences of this result. The Lasota–Yorke inequality ensures the existence of a spectral gap, which in turn controls the structure of invariant densities and the long-term behavior of iterates

P^{k}

. The objective of this section is to characterize the invariant and quasi-invariant components of P, derive an effective block recursion for their scale-averaged coefficients, and demonstrate that the recursion enforces rigidity across the Collatz tree.

Throughout this section,

h \in B_{tree, σ}

will denote an invariant density of P, i.e. a function satisfying

P h = h

. The analysis proceeds in several stages. First, we describe the structure of possible invariant profiles in the multiscale framework and show that the Lasota–Yorke inequality forces uniform flatness across scales. Next, we translate this flatness into an explicit two-sided recurrence relation for block averages

c_{j}

. Finally, we verify that the coefficients of this recurrence satisfy a spectral bound consistent with the contraction constant

λ_{odd} (α, ϑ)

computed earlier.

Theorem 4

(Perron–Frobenius structure on

B_{tree, σ}

). Let P be the backward Collatz transfer operator acting on

B_{tree, σ}

with parameters

(α, ϑ, σ)

chosen so that the Lasota–Yorke inequality and quasi–compactness hold. Then:

(1): The spectral radius of P equals 1, and 1 is a simple eigenvalue.
(2): There exists a unique eigenvector $h \in B_{tree, σ}$ with $h > 0$ and $P h = h$, normalized by $ϕ (h) = 1$.
(3): There exists a unique positive eigenfunctional $ϕ \in B_{tree, σ}^{*}$ such that $ϕ \circ P = ϕ$.
(4): All other spectral values satisfy $| z | < 1$, and P admits the spectral decomposition

$P = h \otimes ϕ + Q, ρ (Q) < 1,$

where Q is quasi–compact.

Proof.

We combine the Lasota–Yorke inequality on

B_{tree, σ}

with standard Perron–Frobenius theory for positive quasi–compact operators.

Step 1: Spectral radius and quasi–compactness. By construction P is a bounded linear operator on

B_{tree, σ}

and is positive in the sense that

f \geq 0

implies

P f \geq 0

. The Lasota–Yorke inequality on

B_{tree, σ}

(Proposition 2, say) together with the compact embedding of the strong seminorm into the weak norm implies that P is quasi–compact on

B_{tree, σ}

with essential spectral radius strictly less than 1:

ρ_{ess} (P) < 1 .

(63)

On the other hand, the logarithmic mass–preservation identity (Lemma 4) shows that the spectral radius of P is at least 1; the boundedness of P implies

ρ (P) \leq 1

, hence

ρ (P) = 1 .

(64)

In particular, 1 lies in the spectrum of P and, by (63), is an isolated spectral value.

Step 2: Existence of a positive eigenvector. Consider the positive cone

C : = {f \in B_{tree, σ} : f \geq 0},

which is closed, convex, and reproducing. Since P is positive and

ρ (P) = 1

, the Krein–Rutman theorem for positive operators on Banach spaces implies the existence of a nonzero

h \in C

such that

P h = h .

(65)

Moreover, h can be chosen strictly positive in the sense that

h (n) > 0

for all

n \in N

: indeed, by the preimage structure of the Collatz map (Lemma 3) and the connectivity of the backward tree, any nontrivial

f \in C

is eventually propagated by iterates of P to a function that is positive on every block

I_{j}

, so

P^{k} f > 0

for all sufficiently large k. Replacing h by

P^{k} h

if necessary yields

h > 0

.

Step 3: Uniqueness and simplicity of the eigenvalue 1. We now show that 1 is a simple eigenvalue and that h is unique up to scalar multiples. Suppose

g \in B_{tree, σ}

satisfies

P g = g

. Decompose

g = g^{+} - g^{-}

into its positive and negative parts. Positivity of P implies

P g^{\pm} = g^{\pm}

. By the strong positivity argument above, any nonzero

f \in C

with

P f = f

must be strictly positive; hence

g^{+}

and

g^{-}

are both either 0 or strictly positive. If both were nonzero, then

g^{+}

and

g^{-}

would be linearly independent positive eigenvectors for the eigenvalue 1, and the positive cone would contain a two-dimensional face of eigenvectors. This contradicts the Krein–Rutman conclusion that the eigenspace associated with the spectral radius is one–dimensional. Therefore one of

g^{+}, g^{-}

must vanish and g is either nonnegative or nonpositive; by replacing g by

- g

if necessary,

g \geq 0

, and the strong positivity then forces g to be a scalar multiple of h. Thus the eigenspace for the eigenvalue 1 is one–dimensional and spanned by h, and 1 is a simple eigenvalue. This proves (1) and the first part of (2) after normalizing by

ϕ (h) = 1

below.

Step 4: Dual eigenfunctional. Consider the dual operator

P^{*}

acting on

B_{tree, σ}^{*}

. Since P is positive, so is

P^{*}

on the dual cone

C^{*} : = {ψ \in B_{tree, σ}^{*} : ψ (f) \geq 0 for all f \in C} .

The quasi–compactness of P implies quasi–compactness of

P^{*}

on the dual space. By (64),

P^{*}

also has spectral radius 1. Applying the same Krein–Rutman argument to

P^{*}

yields a nonzero

ϕ \in C^{*}

such that

ϕ \circ P = ϕ,

(66)

with

ϕ

strictly positive on nonzero elements of

C

. The same simplicity argument as in Step 3 shows that the eigenspace of

P^{*}

for the eigenvalue 1 is one–dimensional and spanned by

ϕ

. Normalizing by the condition

ϕ (h) = 1

gives the uniquely determined eigenpair

(h, ϕ)

appearing in the statement. This establishes (2) and (3).

Step 5: Spectral decomposition and spectral gap. Quasi–compactness of P on

B_{tree, σ}

, together with (63) and the simplicity of the eigenvalue 1, implies that the spectrum of P is contained in

{1} \cup {z : | z | < r}

for some

r < 1

. Let

Π

denote the spectral projection onto the eigenspace associated with

λ = 1

; by the previous steps,

Π f = h ϕ (f), f \in B_{tree, σ},

so that

Π = h \otimes ϕ

as a rank–one operator. Writing

P = Π + Q = h \otimes ϕ + Q,

(67)

we have

Q = P - Π

and

Q Π = Π Q = 0

. The spectrum of Q is contained in

{z : | z | < r}

, so in particular

ρ (Q) < 1 .

Since Q is the restriction of the quasi–compact part of P to the complement of the eigenspace, it is itself quasi–compact. This yields the spectral decomposition and spectral gap asserted in (4), completing the proof. □
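The decomposition $P = h \otimes ϕ + Q$ with $ρ (Q) < 1$ has a familiar finite-dimensional analogue. The Python sketch below uses a hypothetical $2 \times 2$ positive matrix with spectral radius 1. It is not the Collatz operator and is offered only to illustrate how powers of such an operator converge to the rank-one projector $h \otimes ϕ$ at a rate governed by the second eigenvalue.

```python
def matmul(A, B):
    """Product of two square matrices given as nested lists."""
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def matpow(A, k):
    """k-th power of a square matrix by repeated multiplication."""
    n = len(A)
    result = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    for _ in range(k):
        result = matmul(result, A)
    return result


# Toy positive matrix (an assumption for illustration): eigenvalues 1 and 0.7.
M = [[0.9, 0.1],
     [0.2, 0.8]]
H = [1.0, 1.0]              # right eigenvector: M h = h
PHI = [2.0 / 3.0, 1.0 / 3.0]  # left eigenvector: phi M = phi, phi(h) = 1
PI = [[H[i] * PHI[j] for j in range(2)] for i in range(2)]  # h (x) phi


def gap_error(k):
    """Largest entry of |M^k - h (x) phi|, which equals the size of Q^k."""
    Mk = matpow(M, k)
    return max(abs(Mk[i][j] - PI[i][j]) for i in range(2) for j in range(2))


if __name__ == "__main__":
    for k in (1, 5, 10, 20):
        print(k, gap_error(k))
```

In this toy case $Q = M - h \otimes ϕ$ has eigenvalues 0.7 and 0, so `gap_error(k)` decays like $0.7^{k}$. In the setting of Theorem 4 the analogous rate would be any $r < 1$ bounding the spectrum of $Q$.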

Proposition 4

(Forward dynamics and P-invariant functionals). Let

0 < α, ϑ < 1

and

σ > 1

. Consider the pairing

〈 f, φ 〉 : = \sum_{n \geq 1} f (n) φ (n)

between

B_{tree, σ}

and

B_{tree, σ}^{*} : = \{φ : N \to C : {∥ φ ∥}_{*} : = sup_{j \geq 0} (ϑ^{j} {osc}_{I_{j}} φ) + sup_{j \geq 0} (6^{- σ j} \sum_{n \in I_{j}} | φ (n) |) < \infty\},

where

{osc}_{I_{j}} φ : = {sup}_{m, n \in I_{j}} | φ (m) - φ (n) |

. Then

〈 \cdot, \cdot 〉

extends continuously to

B_{tree, σ} \times B_{tree, σ}^{*}

, and the adjoint is given by

(P^{*} φ) (m) = \frac{1}{m} (1_{{2 ∣ m}} φ (m / 2) + 1_{{m odd}} φ (3 m + 1)) .

(68)

Moreover, there exist constants

C_{σ} > 0

and

M_{σ} \geq 1

such that

∥ {(P^{*})}^{k} ∥_{B_{tree, σ}^{*} \to B_{tree, σ}^{*}} \leq C_{σ} M_{σ}^{k}, k \geq 0,

(69)

and the Cesàro averages

Φ_{N} : = \frac{1}{N} \sum_{k = 0}^{N - 1} {(P^{*})}^{k} φ

form a bounded set in

B_{tree, σ}^{*}

for every

φ \in B_{tree, σ}^{*}

.

Positive-frequency divergent families. Suppose there exist

c > 0

and an infinite set of scales

J \subset N

such that for each

j \in J

there is a finite set

A_{j} \subset I_{j}

with

| A_{j} | \geq c | I_{j} |

and forward trajectories that visit

A_{j}

with asymptotic frequency

\geq c

. For a summable weight sequence

{(w_{j})}_{j \geq 0}

with

\sum_{j} w_{j} ϑ^{j} < \infty

and

\sum_{j} w_{j} 6^{- σ j} < \infty

, define

φ_{j} (n) : = \frac{w_{j}}{| A_{j} |} 1_{A_{j}} (n), φ : = \sum_{j \in J} φ_{j} .

Then

φ \in B_{tree, σ}^{*}

, the Cesàro averages

Φ_{N}

are bounded in

B_{tree, σ}^{*}

, and any weak-* limit point Φ satisfies

P^{*} Φ = Φ

and

Φ \neq 0

. Consequently

ℓ (f) : = 〈 f, Φ 〉

is a nonzero invariant functional with

ℓ \circ P = ℓ

.

Proof.

Continuity of the pairing. Fix j and set

c_{j} : = {| I_{j} |}^{- 1} \sum_{n \in I_{j}} f (n)

and

φ_{I_{j}} : = {| I_{j} |}^{- 1} \sum_{n \in I_{j}} φ (n)

. Then

\sum_{n \in I_{j}} f (n) φ (n) = \sum_{n \in I_{j}} (f (n) - c_{j}) (φ (n) - φ_{I_{j}}) + c_{j} \sum_{n \in I_{j}} φ (n) .

(a) Oscillatory term. Using

\sum_{I_{j}} (f - c_{j}) = 0

and

{osc}_{I_{j}} φ : = {sup}_{u, v \in I_{j}} | φ (u) - φ (v) |

,

|\sum_{n \in I_{j}} (f (n) - c_{j}) (φ (n) - φ_{I_{j}})| \leq {osc}_{I_{j}} φ \sum_{n \in I_{j}} | f (n) - c_{j} | .

By the tree seminorm and the block geometry (since

W_{α} ≍ 6^{(1 - α) j}

on

I_{j}

),

{osc}_{I_{j}} f \leq K_{α} ϑ^{- j} 6^{- (1 - α) j} {[f]}_{tree}, \sum_{n \in I_{j}} | f (n) - c_{j} | \leq | I_{j} | {osc}_{I_{j}} f \leq C ϑ^{- j} 6^{- α j} {[f]}_{tree} .

Therefore

|\sum_{n \in I_{j}} (f (n) - c_{j}) (φ (n) - φ_{I_{j}})| \leq C ϑ^{- j} 6^{- α j} {[f]}_{tree} {osc}_{I_{j}} φ .

Multiply and divide by

ϑ^{j}

and take

{sup}_{j} ϑ^{j} {osc}_{I_{j}} φ

to get

\sum_{j \geq 0} |\sum_{I_{j}} (f - c_{j}) (φ - φ_{I_{j}})| \leq C {[f]}_{tree} sup_{j \geq 0} (ϑ^{j} {osc}_{I_{j}} φ) \sum_{j \geq 0} ϑ^{- 2 j} 6^{- α j} .

Since

α > 0

, we can absorb

\sum_{j} ϑ^{- 2 j} 6^{- α j}

into the constant (using that

ϑ \in (0, 1)

is fixed), hence

\sum_{j \geq 0} |\sum_{I_{j}} (f - c_{j}) (φ - φ_{I_{j}})| \leq C {[f]}_{tree} {∥ φ ∥}_{*} .

(b) Mean term. By averaging and the weighted norm,

| c_{j} | \leq \frac{1}{| I_{j} |} \sum_{n \in I_{j}} | f (n) | \leq \frac{1}{| I_{j} |} \sum_{n \in I_{j}} n^{σ} \frac{| f (n) |}{n^{σ}} \leq C 6^{(σ - 1) j} {∥ f ∥}_{ℓ_{σ}^{1}} .

Hence

|c_{j} \sum_{n \in I_{j}} φ (n)| \leq C 6^{(σ - 1) j} {∥ f ∥}_{ℓ_{σ}^{1}} (6^{σ j} 6^{- σ j} \sum_{I_{j}} | φ |) \leq C 6^{- j} {∥ f ∥}_{ℓ_{σ}^{1}} sup_{j \geq 0} (6^{- σ j} \sum_{I_{j}} | φ |) .

Summing over j gives a finite geometric series:

\sum_{j \geq 0} |c_{j} \sum_{I_{j}} φ| \leq C {∥ f ∥}_{ℓ_{σ}^{1}} {∥ φ ∥}_{*} .

Combining (a) and (b) yields

| 〈 f, φ 〉 | \leq C ({[f]}_{tree} + {∥ f ∥}_{ℓ_{σ}^{1}}) {∥ φ ∥}_{*} = C {∥ f ∥}_{tree, σ} {∥ φ ∥}_{*} .

□
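The adjoint formula (68) says that $(P^{*} φ) (m) = φ (T (m)) / m$, and the duality $〈 P f, φ 〉 = 〈 f, P^{*} φ 〉$ can be checked exactly on finitely supported functions. The Python sketch below does this with rational arithmetic, using the form of $P$ that appears in the block sums of Section 5 (even branch $f (2 n) / (2 n)$, odd branch active when $n \equiv 4$ mod 6). The random test data and the helper names are assumptions made for illustration.

```python
from fractions import Fraction
import random


def collatz(m: int) -> int:
    """Forward Collatz map T."""
    return m // 2 if m % 2 == 0 else 3 * m + 1


def apply_P(f: dict, n: int) -> Fraction:
    """(P f)(n) = f(2n)/(2n) + 1_{n = 4 mod 6} f((n-1)/3) / ((n-1)/3)."""
    total = Fraction(f.get(2 * n, 0), 2 * n)
    if n % 6 == 4:
        m = (n - 1) // 3
        total += Fraction(f.get(m, 0), m)
    return total


def apply_P_star(phi: dict, m: int) -> Fraction:
    """(P* phi)(m) = phi(T(m)) / m, which is formula (68)."""
    return Fraction(phi.get(collatz(m), 0), m)


def pairing_gap(M: int, seed: int = 0) -> Fraction:
    """Return <P f, phi> - <f, P* phi> for random integer data.

    f is supported on 1..M, so P f is supported on 1..3M+1 and phi only
    needs to be defined there.
    """
    rng = random.Random(seed)
    f = {m: rng.randint(-9, 9) for m in range(1, M + 1)}
    phi = {n: rng.randint(-9, 9) for n in range(1, 3 * M + 2)}
    lhs = sum(apply_P(f, n) * phi[n] for n in range(1, 3 * M + 2))
    rhs = sum(f[m] * apply_P_star(phi, m) for m in range(1, M + 1))
    return lhs - rhs


if __name__ == "__main__":
    print(pairing_gap(50))
```

Because the sums are finite and the arithmetic is exact, a nonzero gap would indicate a mismatch between the two formulas rather than rounding error.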

5.1. Redesigned Multiscale Space and Invariant Profiles

The quasi-compactness of P implies that its spectrum consists of a discrete set of eigenvalues of finite multiplicity outside a disk of radius

ρ_{ess} (P) \leq λ_{LY} < 1

, together with a residual spectrum contained in that disk. Let

λ_{0} = 1

denote the trivial eigenvalue corresponding to constant functions. Any additional eigenvalues with

| λ | < 1

correspond to exponentially decaying modes. Thus, an invariant density h satisfying

P h = h

must lie in the one-dimensional eigenspace associated with

λ_{0}

, provided no unit-modulus spectrum remains.

However, to make this conclusion effective, one must exclude the possibility of small oscillatory components that project into higher spectral modes but decay too slowly to be detected by the weak

ℓ^{1}

norm alone. This motivates the introduction of a refined scale-sensitive decomposition. Define block intervals

I_{j}

as in (34), and let

H_{j} (h) : = \sum_{n \in I_{j}} h (n), c_{j} : = \frac{H_{j} (h)}{| I_{j} |} = \frac{H_{j} (h)}{6^{j}} .

(70)

The sequence

{(c_{j})}_{j \geq 0}

captures the mean behavior of h across successive scales in the backward tree. Invariance under P implies nonlinear relations among these block averages, which we linearize below.

Lemma 15

(Block-level invariance relation). Let

0 < α < 1

,

0 < ϑ < 1

, and

σ > 1

, and let

h \in B_{tree, σ}

satisfy

P h = h

. For each

j \geq 0

define the block average

c_{j} : = \frac{1}{| I_{j} |} \sum_{n \in I_{j}} h (n), | I_{j} | : = # I_{j} .

Then there exist sequences

{(a_{j})}_{j \geq 0}

,

{(b_{j})}_{j \geq 0}

with

a_{j}, b_{j} \geq 0

and a sequence

{(ε_{j})}_{j \geq 0}

such that

c_{j} = a_{j} c_{j + 1} + b_{j} c_{j - 1} + ε_{j},

(71)

where

a_{j}

and

b_{j}

are determined by the local distribution of even and odd preimages between neighboring scales, and the error sequence

ε = (ε_{j})

is summable in the weighted norm, i.e.

\sum_{j \geq 0} ϑ^{j} | ε_{j} | < \infty .

(72)

Proof.

Throughout, fix

h \in B_{tree, σ}

with

P h = h

.

1. Start from the invariance equation on each block. For each

j \geq 0

,

| I_{j} | c_{j} = \sum_{n \in I_{j}} h (n) = \sum_{n \in I_{j}} (P h) (n) = \sum_{n \in I_{j}} (\frac{h (2 n)}{2 n} + 1_{{n \equiv 4 (6)}} \frac{h (\frac{n - 1}{3})}{(n - 1) / 3}) .

Write

S_{j}^{even} : = \sum_{n \in I_{j}} \frac{h (2 n)}{2 n}, S_{j}^{odd} : = \sum_{\begin{matrix} n \in I_{j} \\ n \equiv 4 (6) \end{matrix}} \frac{h (\frac{n - 1}{3})}{(n - 1) / 3},

so that

| I_{j} | c_{j} = S_{j}^{even} + S_{j}^{odd} .

(73)

We now approximate

S_{j}^{even}

and

S_{j}^{odd}

in terms of neighboring block averages, with all discrepancies absorbed in

ε_{j}

.

2. Even branch contribution. For

n \in I_{j}

, the even preimage is

m = 2 n

, and

S_{j}^{even} = \sum_{n \in I_{j}} \frac{h (2 n)}{2 n} = \sum_{m \in 2 I_{j}} \frac{h (m)}{m},

where

2 I_{j} : = {2 n : n \in I_{j}}

. The set

2 I_{j}

lies in a bounded union of intervals whose lengths are comparable to

| I_{j} |

and whose positions are comparable (on a logarithmic scale) to some neighboring block

I_{j + 1}

. We decompose

h (m) = c_{j + 1} + (h (m) - c_{j + 1})

for those m whose scale is that of

I_{j + 1}

, and similarly for indices belonging to at most finitely many adjacent blocks. This yields

S_{j}^{even} = a_{j}^{(even)} | I_{j} | c_{j + 1} + R_{j}^{even},

(74)

where

a_{j}^{(even)} : = \frac{1}{| I_{j} |} \sum_{n \in I_{j}} \frac{1}{2 n} 1_{{2 n lies in the next scale block (s)}},

and

R_{j}^{even}

collects:

(i): contributions from $h (m) - c_{k}$ within the relevant blocks,
(ii): contributions from even preimages m falling outside the chosen neighboring blocks.

Because

h \in B_{tree, σ}

, its oscillation inside each block is controlled by

{[h]}_{tree}

, so replacing

h (m)

by the corresponding block average

c_{k}

incurs an error bounded by

| h (m) - c_{k} | ≪ \frac{{[h]}_{tree}}{W_{α} (m_{1}, m_{2})}

for suitable

m_{1}, m_{2}

in that block; the precise bound is obtained by choosing

m_{1}, m_{2}

maximizing the tree seminorm at that scale and using the definition of

{[h]}_{tree}

. After dividing by m (which is

≫ 6^{j}

at this scale) and averaging over

I_{j}

, we get

| R_{j}^{even} | ≪ 6^{- j} {[h]}_{tree} + 6^{- j σ} {∥ h ∥}_{σ},

where the second term accounts for the finitely many preimages lying outside the neighboring blocks, using the weighted

ℓ_{σ}^{1}

bound on h. Thus

\sum_{j \geq 0} ϑ^{j} | R_{j}^{even} | < \infty .

(75)

By construction

a_{j}^{(even)} \geq 0

.

3. Odd branch contribution. For

n \equiv 4 (mod 6)

, the odd preimage is

m^{'} = (n - 1) / 3

, and

S_{j}^{odd} = \sum_{\begin{matrix} n \in I_{j} \\ n \equiv 4 (6) \end{matrix}} \frac{h (m^{'})}{m^{'}} .

As above, all such

m^{'}

lie at scale comparable to

I_{j - 1}

, up to a bounded distortion which is independent of j. We write

h (m^{'}) = c_{j - 1} + (h (m^{'}) - c_{j - 1}),

and obtain

S_{j}^{odd} = b_{j}^{(odd)} | I_{j} | c_{j - 1} + R_{j}^{odd},

(76)

where

b_{j}^{(odd)} : = \frac{1}{| I_{j} |} \sum_{\begin{matrix} n \in I_{j} \\ n \equiv 4 (6) \end{matrix}} \frac{1}{(n - 1) / 3},

and

R_{j}^{odd}

collects:

(i): the errors from replacing $h (m^{'})$ by $c_{j - 1}$ ,
(ii): any edge effects from $m^{'}$ lying just outside $I_{j - 1}$ .

All indices m whose images under the even/odd branches land outside the adjacent blocks are absorbed into

R_{j}^{even}

and

R_{j}^{odd}

; these edge spillovers are

ϑ

-summable thanks to

σ > 1

and the block oscillation control from

{[h]}_{tree}

.

As before, the tree seminorm controls oscillations within blocks, so

| h (m^{'}) - c_{j - 1} |

is bounded by a multiple of

{[h]}_{tree}

times a scale factor, and dividing by

m^{'} ≍ 6^{j - 1}

yields

| R_{j}^{odd} | ≪ 6^{- j} {[h]}_{tree} + 6^{- j σ} {∥ h ∥}_{σ} .

Thus

\sum_{j \geq 0} ϑ^{j} | R_{j}^{odd} | < \infty .

(77)

By construction

b_{j}^{(odd)} \geq 0

.

4. Assemble the block relation. Substituting (74) and (76) into (73), we obtain

| I_{j} | c_{j} = a_{j}^{(even)} | I_{j} | c_{j + 1} + b_{j}^{(odd)} | I_{j} | c_{j - 1} + R_{j}^{even} + R_{j}^{odd} .

Dividing by

| I_{j} |

gives

c_{j} = a_{j}^{(even)} c_{j + 1} + b_{j}^{(odd)} c_{j - 1} + ε_{j},

where

ε_{j} : = \frac{R_{j}^{even} + R_{j}^{odd}}{| I_{j} |} .

Set

a_{j} : = a_{j}^{(even)}

and

b_{j} : = b_{j}^{(odd)}

. By construction

a_{j}, b_{j} \geq 0

, and they encode the (normalized) weights of even and odd preimages between the neighboring scales. Moreover, using

| I_{j} | ≍ 6^{j}

together with (75) and (77), we obtain

\sum_{j \geq 0} ϑ^{j} | ε_{j} | \leq \sum_{j \geq 0} ϑ^{j} \frac{| R_{j}^{even} | + | R_{j}^{odd} |}{| I_{j} |} < \infty,

since the additional factor

1 / | I_{j} | ≍ 6^{- j}

makes the series converge absolutely once

σ > 1

and

{[h]}_{tree}

is finite. This is exactly (72).

Thus the block averages

(c_{j})

satisfy the approximate invariance relation (71) with a

ϑ

-summable error. □

Lemma 16

(Limiting preimage ratios). Let

{(I_{j})}_{j \geq 0}

be the multiscale blocks

I_{j} = [6^{j}, 2 \cdot 6^{j}) \cap N, | I_{j} | = 6^{j} .

Define

a_{j}

and

b_{j}

as in Lemma 15, i.e. as the normalized contributions (depending only on the preimage structure of T) of even and odd preimages from neighboring scales to the block relation

c_{j} = a_{j} c_{j + 1} + b_{j} c_{j - 1} + ε_{j},

for block averages

c_{j}

of any invariant profile h with

P h = h

. Then there exist constants

a, b > 0

such that

lim_{j \to \infty} a_{j} = a, lim_{j \to \infty} b_{j} = b,

and

a + b = 1, 0 < b < a < 1 .

(78)

Moreover, there exist

C > 0

and

0 < δ < 1

(independent of h) such that for all

j \geq 0

,

| a_{j} - a | + | b_{j} - b | \leq C δ^{j} .

Proof.

The coefficients

a_{j}, b_{j}

are determined purely by the geometry of Collatz preimages between the blocks

I_{j - 1}, I_{j}, I_{j + 1}

; they do not depend on h. We make this explicit.

1. Preimage windows and raw counts. For

n \in N

, the Collatz map (1) has two inverse branches:

n \mapsto 2 n (even branch), n \mapsto \frac{n - 1}{3} when n \equiv 4 (mod 6) (odd branch) .

In the block relation of Lemma 15, only preimages that land in the adjacent large scales contribute to the “main” coefficients

a_{j}, b_{j}

; all other preimages (falling into gaps or non-adjacent blocks) are assigned to the perturbation

ε_{j}

.

The even preimages relevant to

I_{j}

form a window

E_{j}^{*}

of size comparable to

| I_{j} |

, consisting of those m whose image

T (m)

lies in

I_{j}

via m even.

The odd preimages relevant to

I_{j}

form a thinner window

O_{j}^{*}

, consisting of those odd m with

T (m) = 3 m + 1 \in I_{j}

(equivalently,

n : = 3 m + 1 \in I_{j}

and

n \equiv 4 (mod 6)

).

A direct count shows:

(i): For the even window, each

n \in I_{j}

has an even preimage

2 n

, so

| E_{j}^{*} | = | I_{j} | = 6^{j} .

(ii): For the odd window, we need

n \in I_{j}

with

n \equiv 4 (mod 6)

and then

m = (n - 1) / 3

odd. Among the

| I_{j} | = 6^{j}

integers in

I_{j}

, exactly one in every six is

4 (mod 6)

, up to boundary effects. Hence

| O_{j}^{*} | = \frac{1}{6} | I_{j} | + O (1) = 6^{j - 1} + O (1),

so in particular

| O_{j}^{*} | > 0

for all sufficiently large j.

Thus the total number of “neighboring-scale” preimages associated with

I_{j}

is

| E_{j}^{*} | + | O_{j}^{*} | = (1 + \frac{1}{6}) | I_{j} | + O (1) = \frac{7}{6} 6^{j} + O (1) .

2. Canonical normalization of $a_{j}, b_{j}$. By Lemma 15, the coefficients

a_{j}, b_{j}

are defined as the normalized weights of even vs. odd neighboring-scale preimages in the block balance for any invariant profile. Since this normalization is independent of h, we may compute

a_{j}, b_{j}

purely from the combinatorics. The natural choice is:

a_{j} : = \frac{| E_{j}^{*} |}{| E_{j}^{*} | + | O_{j}^{*} |}, b_{j} : = \frac{| O_{j}^{*} |}{| E_{j}^{*} | + | O_{j}^{*} |} .

These are exactly the “ratios of the number of even and odd preimages between adjacent scales” announced in Lemma 15.

Using the counts above,

\begin{matrix} a_{j} & = \frac{6^{j}}{6^{j} + 6^{j - 1} + O (1)} = \frac{1}{1 + \frac{1}{6} + O (6^{- j})} = \frac{6}{7} + O (6^{- j}), \\ b_{j} & = \frac{6^{j - 1} + O (1)}{6^{j} + 6^{j - 1} + O (1)} = \frac{\frac{1}{6} + O (6^{- j})}{1 + \frac{1}{6} + O (6^{- j})} = \frac{1}{7} + O (6^{- j}) . \end{matrix}

In particular, there exist limits

a = lim_{j \to \infty} a_{j} = \frac{6}{7}, b = lim_{j \to \infty} b_{j} = \frac{1}{7},

and there exists

C > 0

such that, for all j,

| a_{j} - a | + | b_{j} - b | \leq C 6^{- j} .

Thus the desired exponential convergence holds with

δ : = 1 / 6 \in (0, 1)

.

3. Structural properties. From the explicit limits we immediately have

a + b = \frac{6}{7} + \frac{1}{7} = 1, 0 < b < a < 1 .

Alternatively, the identity

a_{j} + b_{j} = 1

holds exactly for each j when tested against the constant profile

h \equiv 1

(for which the block perturbation

ε_{j}

vanishes), and passes to the limit as

j \to \infty

.

Positivity of

a, b

follows from

| E_{j}^{*} |, | O_{j}^{*} | > 0

for large j, and

b < a

reflects the fact that the odd preimage window is asymptotically only a

1 / 6

-fraction of the even window.

This completes the proof. □
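The window counts used in this proof can be reproduced by brute force. The Python sketch below enumerates $I_{j} = [6^{j}, 2 \cdot 6^{j})$, counts the even and odd preimage windows as described above, and forms the ratios $a_{j}, b_{j}$ with exact rational arithmetic. The function names are illustrative assumptions, and the enumeration is only practical for small $j$.

```python
from fractions import Fraction


def block(j: int) -> range:
    """Block I_j = [6^j, 2 * 6^j) of positive integers."""
    return range(6 ** j, 2 * 6 ** j)


def even_window(j: int) -> list:
    """Even preimages 2n of the points n in I_j."""
    return [2 * n for n in block(j)]


def odd_window(j: int) -> list:
    """Odd preimages m = (n - 1) / 3 of the points n in I_j, n = 4 mod 6."""
    return [(n - 1) // 3 for n in block(j) if n % 6 == 4]


def ratios(j: int):
    """Normalized window sizes (a_j, b_j) as defined in step 2 of the proof."""
    e, o = len(even_window(j)), len(odd_window(j))
    return Fraction(e, e + o), Fraction(o, e + o)


if __name__ == "__main__":
    for j in range(6):
        a_j, b_j = ratios(j)
        print(j, len(even_window(j)), len(odd_window(j)), a_j, b_j)
```

For $j \geq 1$ the left endpoint $6^{j}$ and the block length are both multiples of 6, so each residue class mod 6 occurs exactly $6^{j - 1}$ times and the $O (1)$ boundary term in the count vanishes. In this enumeration the ratios therefore equal $6/7$ and $1/7$ exactly for every $j \geq 1$, with $j = 0$ as the only exceptional block.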

Proposition 5

(Effective recursion for peripheral eigenfunctions). Let

0 < α < 1

,

0 < ϑ < 1

,

σ > 1

, and let

h \in B_{tree, σ}

satisfy

P h = λ h

with

| λ | = 1

. Let

H_{j} : = \sum_{n \in I_{j}} h (n)

and

c_{j} : = H_{j} / | I_{j} |

be the block sums and block averages on

I_{j} = [6^{j}, 2 \cdot 6^{j}) \cap N

. Then, with

a, b > 0

as in Lemma 16, there exists a sequence

{(ε_{j})}_{j \geq 1}

with

\sum_{j \geq 1} | ε_{j} | ϑ^{j} < \infty

such that

c_{j} = λ^{- 1} a c_{j + 1} + λ^{- 1} b c_{j - 1} + ε_{j}, j \geq 1 .

(79)

Equivalently, for the renormalized averages

d_{j} : = λ^{- j} c_{j}

we have

d_{j} = a d_{j + 1} + b d_{j - 1} + {\tilde{ε}}_{j}, \sum_{j \geq 1} | {\tilde{ε}}_{j} | ϑ^{j} < \infty,

(80)

with

{\tilde{ε}}_{j} : = λ^{- j} ε_{j}

.
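It may help to look at the error-free version of the recursion (80). Substituting $d_{j} = r^{j}$ into $d_{j} = a d_{j + 1} + b d_{j - 1}$ gives the characteristic equation $a r^{2} - r + b = 0$. The Python sketch below solves it exactly, assuming the limiting values $a = 6/7$ and $b = 1/7$ from Lemma 16, and checks that $A + B r^{j}$ satisfies the homogeneous recursion. The helper names and the sample coefficients $A, B$ are hypothetical.

```python
from fractions import Fraction
from math import isqrt


def char_roots(a: Fraction, b: Fraction):
    """Exact roots of a r^2 - r + b = 0 when the discriminant is a square."""
    disc = 1 - 4 * a * b
    num, den = disc.numerator, disc.denominator
    if num < 0:
        raise ValueError("complex roots")
    rn, rd = isqrt(num), isqrt(den)
    if rn * rn != num or rd * rd != den:
        raise ValueError("discriminant is not a rational square")
    root = Fraction(rn, rd)
    return (1 - root) / (2 * a), (1 + root) / (2 * a)


def residual(a: Fraction, b: Fraction, d, j: int) -> Fraction:
    """d_j - a d_{j+1} - b d_{j-1}; zero for exact homogeneous solutions."""
    return d(j) - a * d(j + 1) - b * d(j - 1)


A_LIM, B_LIM = Fraction(6, 7), Fraction(1, 7)  # limits a, b from Lemma 16

if __name__ == "__main__":
    r_small, r_large = char_roots(A_LIM, B_LIM)
    print(r_small, r_large)

    # Hypothetical coefficients for a general homogeneous solution.
    def d(j, A=Fraction(3), B=Fraction(5)):
        return A + B * r_small ** j

    print([residual(A_LIM, B_LIM, d, j) for j in range(1, 6)])
```

With these values the discriminant is $1 - 4 a b = 25/49$, so the roots are rational. Homogeneous solutions are then combinations of a constant mode and a geometrically decaying mode in $j$; how the error terms ${\tilde{ε}}_{j}$ perturb this picture is the subject of the proof that follows.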

Proof.

Step 1: Block summation of the eigenrelation. Summing

P h = λ h

over

n \in I_{j}

gives

\sum_{n \in I_{j}} (P h) (n) = λ \sum_{n \in I_{j}} h (n) = λ H_{j} .

By the definition of

P = P_{even} + P_{odd}

,

\sum_{n \in I_{j}} (P h) (n) = \sum_{n \in I_{j}} \frac{h (2 n)}{2 n} + \sum_{\begin{matrix} n \in I_{j} \\ n \equiv 4 (6) \end{matrix}} \frac{h (\frac{n - 1}{3})}{(n - 1) / 3} = : S_{j}^{even} + S_{j}^{odd} .

As in the proof of Lemma 15 (the

λ = 1

case), we reorganize each sum by changing variables along the inverse branches and separating the main contributions that land in adjacent scales (

I_{j + 1}

for the even branch,

I_{j - 1}

for the odd branch) from the boundary remainders (spillovers due to the half-open endpoints and the congruence restriction

n \equiv 4 (mod 6)

). Concretely,

S_{j}^{even} = \sum_{n \in I_{j}} \frac{h (2 n)}{2 n} = \sum_{m \in E_{j}^{*}} \frac{h (m)}{m} + R_{j}^{even}, S_{j}^{odd} = \sum_{\begin{matrix} n \in I_{j} \\ n \equiv 4 (6) \end{matrix}} \frac{h (\frac{n - 1}{3})}{(n - 1) / 3} = \sum_{m \in O_{j}^{*}} \frac{h (m)}{m} + R_{j}^{odd},

where

E_{j}^{*} \subset I_{j + 1}

and

O_{j}^{*} \subset I_{j - 1}

are the preimage windows collecting those m whose images lie in

I_{j}

under the even and odd branches, respectively, and

R_{j}^{even}, R_{j}^{odd}

are the boundary remainders (coming from

(I_{j + 1} ∖ E_{j}^{*})

and

(I_{j - 1} ∖ O_{j}^{*})

).

Thus

λ H_{j} = \sum_{m \in E_{j}^{*}} \frac{h (m)}{m} + \sum_{m \in O_{j}^{*}} \frac{h (m)}{m} + (R_{j}^{even} + R_{j}^{odd}) .

Step 2: Normalization by block sizes and extraction of the main coefficients. Divide by

| I_{j} | = 6^{j}

and write

c_{k} = H_{k} / | I_{k} |

:

λ c_{j} = \frac{1}{| I_{j} |} \sum_{m \in E_{j}^{*}} \frac{h (m)}{m} + \frac{1}{| I_{j} |} \sum_{m \in O_{j}^{*}} \frac{h (m)}{m} + \frac{R_{j}^{even} + R_{j}^{odd}}{| I_{j} |} .

Inside each window the points m satisfy

m ≍ | I_{j + 1} |

(even window) or

m ≍ | I_{j - 1} |

(odd window), so

1 / m

fluctuates by a bounded multiplicative factor around

1 / | I_{j + 1} |

or

1 / | I_{j - 1} |

. Using the

B_{tree, σ}

control of oscillations within blocks, this fluctuation contributes only to an error term summable in the weighted

ϑ

-norm. Hence

\frac{1}{| I_{j} |} \sum_{m \in E_{j}^{*}} \frac{h (m)}{m} = \frac{| E_{j}^{*} |}{| I_{j} |} \cdot \frac{1}{| I_{j + 1} |} \sum_{m \in E_{j}^{*}} h (m) + η_{j}^{even} = a_{j} c_{j + 1} + η_{j}^{even},

and similarly

\frac{1}{| I_{j} |} \sum_{m \in O_{j}^{*}} \frac{h (m)}{m} = b_{j} c_{j - 1} + η_{j}^{odd},

where

a_{j} : = | E_{j}^{*} | / (| E_{j}^{*} | + | O_{j}^{*} |)

,

b_{j} : = | O_{j}^{*} | / (| E_{j}^{*} | + | O_{j}^{*} |)

(so

a_{j} + b_{j} = 1

), and

η_{j}^{even}, η_{j}^{odd}

are error terms whose weighted sum

\sum_{j} ϑ^{j} | η_{j}^{\cdot} |

is finite. The boundary remainders likewise satisfy

\sum_{j \geq 1} ϑ^{j} \frac{| R_{j}^{even} | + | R_{j}^{odd} |}{| I_{j} |} < \infty

by the same block-oscillation and congruence estimates used in Lemma 15.

Collecting terms, we obtain

λ c_{j} = a_{j} c_{j + 1} + b_{j} c_{j - 1} + η_{j}, \sum_{j \geq 1} ϑ^{j} | η_{j} | < \infty,

(81)

which is the twisted version of the block relation of Lemma 15.

Step 3: Freezing the coefficients to the limits

a, b

. By Lemma 16, there exist

a, b > 0

with

a + b = 1

,

0 < b < a < 1

, and constants

C > 0

,

0 < δ < 1

such that

| a_{j} - a | + | b_{j} - b | \leq C δ^{j}

for all j. Rewrite (81) as

λ c_{j} = a c_{j + 1} + b c_{j - 1} + \underset{= : ζ_{j}}{\underset{︸}{η_{j} + (a_{j} - a) c_{j + 1} + (b_{j} - b) c_{j - 1}}} .

To show

\sum_{j} ϑ^{j} | ζ_{j} | < \infty

, it remains to bound the “freezing” errors

(a_{j} - a) c_{j + 1}

and

(b_{j} - b) c_{j - 1}

in the weighted sum. As in the proof of Proposition 7,

h \in B_{tree, σ}

implies the block averages obey the growth bound

| c_{k} | \leq C_{0} 6^{(σ - 1) k} {∥ h ∥}_{σ} (k \geq 0),

(82)

for a constant

C_{0}

depending only on

σ

and the block geometry. Hence

ϑ^{j} | (a_{j} - a) c_{j + 1} | \leq ϑ^{j} C δ^{j} C_{0} 6^{(σ - 1) (j + 1)} {∥ h ∥}_{σ} = C^{'} {(ϑ δ 6^{σ - 1})}^{j} {∥ h ∥}_{σ},

and similarly for

(b_{j} - b) c_{j - 1}

(with

j - 1

in place of

j + 1

). Choosing

ϑ \in (0, 1)

(as done when defining

B_{tree, σ}

) small enough so that

ϑ δ 6^{σ - 1} < 1

, these two geometric series converge, uniformly in h up to

{∥ h ∥}_{σ}

. Therefore

\sum_{j \geq 1} ϑ^{j} | ζ_{j} | < \infty .

Set

ε_{j} : = λ^{- 1} ζ_{j}

and divide the identity by

λ

(note

| λ | = 1

), which yields (79) with

\sum_{j} ϑ^{j} | ε_{j} | = \sum_{j} ϑ^{j} | ζ_{j} | < \infty

.

Step 4: Renormalized averages. Define

d_{j} : = λ^{- j} c_{j}

. Multiplying (79) by

λ^{- j}

,

d_{j} = a d_{j + 1} + b d_{j - 1} + {\tilde{ε}}_{j}, {\tilde{ε}}_{j} : = λ^{- j} ε_{j},

and since

| λ | = 1

we have

\sum_{j} ϑ^{j} | {\tilde{ε}}_{j} | = \sum_{j} ϑ^{j} | ε_{j} | < \infty

. This is (80). □

Remark 5

(Admissibility for freezing the coefficients). The “freezing” errors

(a_{j} - a) c_{j + 1}

and

(b_{j} - b) c_{j - 1}

are summable in the weighted norm provided

ϑ δ 6^{σ - 1} < 1 with δ = \frac{1}{6},

equivalently

ϑ 6^{σ - 2} < 1

. This holds, for example, for any

σ \in (1, 2)

when

ϑ = \frac{1}{5}

.
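The admissibility condition of Remark 5 can be checked numerically. The following is a minimal sketch, assuming the rate $δ = 1/6$ stated above; the function names `freezing_ratio` and `geometric_sum` are illustrative and do not appear in the text.

```python
def freezing_ratio(theta: float, sigma: float, delta: float = 1.0 / 6.0) -> float:
    """Common ratio theta * delta * 6**(sigma - 1) of the freezing-error series."""
    return theta * delta * 6.0 ** (sigma - 1.0)


def geometric_sum(ratio: float) -> float:
    """Sum of ratio**j over j >= 1; finite only when 0 <= ratio < 1."""
    if not 0.0 <= ratio < 1.0:
        raise ValueError("series diverges: ratio must lie in [0, 1)")
    return ratio / (1.0 - ratio)


# Sample a few exponents sigma in (1, 2) at theta = 1/5.
for sigma in (1.1, 1.5, 1.9):
    r = freezing_ratio(theta=0.2, sigma=sigma)
    print(f"sigma={sigma}: ratio={r:.4f}, series sum={geometric_sum(r):.4f}")
```

With $δ = 1/6$ the ratio equals $ϑ 6^{σ - 2}$, which stays below $ϑ$ for every $σ < 2$, so the series converges for any $ϑ \in (0, 1)$ in that range.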

Remark 6

(Exact normalization of the block coefficients). In Lemma 15 the neighboring-scale coefficients are determined purely by preimage windows:

a_{j} : = \frac{| E_{j}^{*} |}{| E_{j}^{*} | + | O_{j}^{*} |}, b_{j} : = \frac{| O_{j}^{*} |}{| E_{j}^{*} | + | O_{j}^{*} |}, so a_{j} + b_{j} = 1 .

Lemma 16 shows

a_{j} \to a = \frac{6}{7}

and

b_{j} \to b = \frac{1}{7}

with

| a_{j} - a | + | b_{j} - b | ≪ 6^{- j}

.

Remark 7

(Coefficient freezing). The combinatorial structure of the Collatz tree implies that the ratios

a_{j} : = \frac{| I_{j + 1} |}{2 | I_{j} |}, b_{j} : = \frac{| I_{j - 1} |}{| I_{j} |}

stabilize as

j \to \infty

. More precisely,

a_{j} ⟶ \frac{6}{7}, b_{j} ⟶ \frac{1}{7} .

The limits correspond to the asymptotic frequencies of even and admissible odd preimages within the block

I_{j}

, and follow from the block geometry and the counting estimates for even and odd branches preceding Lemma 17.

Remark 8

(Asymptotic limits of the block coefficients). Let

a_{j}

and

b_{j}

be the block coefficients

a_{j} : = \frac{| I_{j + 1} |}{2 | I_{j} |}, b_{j} : = \frac{| I_{j - 1} |}{| I_{j} |},

arising in the decomposition of block averages under

P h = h

. Then the Collatz preimage combinatorics and the block geometry imply:

$a_{j}, b_{j} \geq 0$ and $a_{j} + b_{j} = 1$ for all sufficiently large j;
The coefficients converge to the limiting values

$a_{j} ⟶ \frac{6}{7}, b_{j} ⟶ \frac{1}{7}, (j \to \infty) .$
The convergence is quantitative: there exists $ϑ \in (0, 1)$ and $C > 0$ such that

$| a_{j} - \frac{6}{7} | + | b_{j} - \frac{1}{7} | \leq C ϑ^{j} .$

These limits correspond to the asymptotic frequencies of even and admissible odd preimages inside the block

I_{j}

, established by the detailed counting in the preceding even/odd decomposition.

Lemma 17

(Effective block recursion). Let

h \in B_{tree, σ}

be the positive invariant density satisfying

P h = h

. For each scale block

I_{j}

define

c_{j} : = \frac{1}{| I_{j} |} \sum_{n \in I_{j}} h (n), j \geq 0 .

Then there exist sequences

{(a_{j})}_{j \geq j_{0}}

,

{(b_{j})}_{j \geq j_{0}}

and an error sequence

{(ε_{j})}_{j \geq j_{0}}

such that:

$a_{j}, b_{j} \geq 0$ and $a_{j} + b_{j} = 1$ for all $j \geq j_{0}$ ;
$a_{j} \to a = \frac{6}{7}$ and $b_{j} \to b = \frac{1}{7}$ as $j \to \infty$ ;
the block averages satisfy the second-order recursion

$c_{j} = a_{j} c_{j + 1} + b_{j} c_{j - 1} + ε_{j}, j \geq j_{0};$

(83)
the perturbations satisfy the weighted summability bound

$\sum_{j \geq j_{0}} ϑ^{j} | ε_{j} | < \infty .$

(84)

Moreover, the constants a, b and the summability rate depend only on

(α, ϑ, σ)

and the tree geometry.

Proof.

Throughout the proof we write

I_{j}

for the scale block at level j and

| I_{j} |

for its cardinality. Recall that h is invariant, so for every

n \geq 1

,

h (n) = \frac{1}{2} h (2 n) + 1_{{n \equiv 4 (\mod 6)}} h (\frac{n - 1}{3}) .

(85)

Averaging (85) over

n \in I_{j}

yields

c_{j} = E_{j} + O_{j},

(86)

where

E_{j} : = \frac{1}{| I_{j} |} \sum_{n \in I_{j}} \frac{1}{2} h (2 n), O_{j} : = \frac{1}{| I_{j} |} \sum_{\begin{matrix} n \in I_{j} \\ n \equiv 4 (\mod 6) \end{matrix}} h (\frac{n - 1}{3}) .

We now express

E_{j}

and

O_{j}

in terms of

c_{j + 1}

and

c_{j - 1}

plus controlled error terms.

Step 1: Even contribution. Consider the image set

J_{j}^{even} : = {2 n : n \in I_{j}} .

By construction of the scale blocks

I_{j}

, the interval

J_{j}^{even}

is contained in a finite union of consecutive blocks at scales j and

j + 1

, and for j large enough it intersects exactly one “main” block at scale

j + 1

(which we denote by

I_{j + 1}

) plus a uniformly bounded number of boundary fragments lying in neighboring blocks at scales j or

j + 2

. More precisely, there exist disjoint sets

A_{j} \subseteq I_{j}

and

B_{j} \subseteq I_{j}

such that

{2 n : n \in A_{j}} = I_{j + 1}, {2 n : n \in B_{j}} \subseteq I_{j}^{bdry} \cup I_{j + 2}^{bdry},

where each boundary set

I_{k}^{bdry}

is a subset of

I_{k}

of size

O (6^{j - 1})

independent of h. Thus the cardinalities satisfy

| A_{j} | = \frac{| I_{j + 1} |}{2}, | B_{j} | = | I_{j} | - | A_{j} | = | I_{j} | - \frac{| I_{j + 1} |}{2},

(87)

and both

| I_{j} |

and

| I_{j + 1} |

are comparable to

6^{j}

.

We decompose

E_{j} = \frac{1}{| I_{j} |} \sum_{n \in A_{j}} \frac{1}{2} h (2 n) + \frac{1}{| I_{j} |} \sum_{n \in B_{j}} \frac{1}{2} h (2 n) = E_{j}^{(1)} + E_{j}^{(2)} .

For the main part, change variables

m = 2 n

in the sum over

A_{j}

:

E_{j}^{(1)} = \frac{1}{2 | I_{j} |} \sum_{n \in A_{j}} h (2 n) = \frac{1}{2 | I_{j} |} \sum_{m \in I_{j + 1}} h (m) = \frac{| I_{j + 1} |}{2 | I_{j} |} c_{j + 1} .

For the boundary contribution

E_{j}^{(2)}

, note that the image

{2 n : n \in B_{j}}

lies in a finite union of boundary subsets of neighboring blocks. By the definition of the

B_{tree, σ}

–norm, the average value of

| h |

on each such boundary subset is bounded by a uniform multiple of the block average at the corresponding scale, and the total number of boundary points at level j is

O (6^{j - 1})

. Hence there exists a constant

C > 0

, depending only on the space parameters, such that

| E_{j}^{(2)} | \leq \frac{C}{| I_{j} |} \sum_{k \in {j, j + 2}} 6^{j - 1} c_{k} \leq C^{'} 6^{- 1} (c_{j} + c_{j + 2}),

(88)

for some

C^{'} > 0

. Since

(c_{k})

is bounded (again by

h \in B_{tree, σ}

), (88) shows that

E_{j}^{(2)} = O (6^{- j})

uniformly in j.

Define

a_{j} : = \frac{| I_{j + 1} |}{2 | I_{j} |}, δ_{j}^{even} : = E_{j}^{(2)},

(89)

so that

E_{j} = a_{j} c_{j + 1} + δ_{j}^{even} .

(90)

The block geometry (the fact that

| I_{j + 1} | / | I_{j} | \to 12 / 7

as

j \to \infty

) implies that

a_{j} \to a = 6 / 7

as

j \to \infty

. Moreover the preceding bounds show that the sequence

(ϑ^{j} δ_{j}^{even})

is summable for any fixed

0 < ϑ < 1

chosen as in the Lasota–Yorke inequality, since

δ_{j}^{even}

decays at least like a fixed multiple of

6^{- j}

.

Step 2: Odd contribution. We now treat

O_{j}

. The congruence condition

n \equiv 4 (mod 6)

together with the definition of the Collatz odd preimage shows that the map

n \mapsto m : = \frac{n - 1}{3}

sends the admissible odd indices in

I_{j}

into a finite union of blocks at scale

j - 1

, with one main block

I_{j - 1}

and finitely many boundary pieces in neighboring blocks

I_{j - 1}^{bdry}

and

I_{j + 1}^{bdry}

. More precisely, there is a subset

A_{j}^{'} \subseteq I_{j}

of indices

n \equiv 4 (\mod 6)

such that

\{\frac{n - 1}{3} : n \in A_{j}^{'}\} = I_{j - 1},

and the remaining admissible indices in

I_{j}

map into boundary subsets of neighboring blocks. Let

B_{j}^{'}

denote the set of admissible indices in

I_{j}

not belonging to

A_{j}^{'}

. Then

O_{j} = \frac{1}{| I_{j} |} \sum_{n \in A_{j}^{'}} h (\frac{n - 1}{3}) + \frac{1}{| I_{j} |} \sum_{n \in B_{j}^{'}} h (\frac{n - 1}{3}) = O_{j}^{(1)} + O_{j}^{(2)} .

For

O_{j}^{(1)}

we change variables

m = (n - 1) / 3

and obtain

O_{j}^{(1)} = \frac{1}{| I_{j} |} \sum_{m \in I_{j - 1}} h (m) = \frac{| I_{j - 1} |}{| I_{j} |} c_{j - 1} .

Set

b_{j} : = \frac{| I_{j - 1} |}{| I_{j} |} .

(91)

The combinatorial description of the tree and the choice of blocks

I_{j}

imply that

b_{j} \to b = 1 / 7

as

j \to \infty

; in particular

b_{j} \geq 0

for all j.

For the boundary term

O_{j}^{(2)}

, the same argument as in (88), combined with the definition of the

B_{tree, σ}

–norm, yields

| O_{j}^{(2)} | \leq C^{''} 6^{- 1} (c_{j - 1} + c_{j + 1})

for some constant

C^{''} > 0

independent of j. Hence

O_{j}^{(2)}

also decays at least like a fixed multiple of

6^{- j}

, and the sequence

(ϑ^{j} O_{j}^{(2)})

is summable for any fixed

0 < ϑ < 1

. Define

δ_{j}^{odd} : = O_{j}^{(2)} .

(92)

Then

O_{j} = b_{j} c_{j - 1} + δ_{j}^{odd} .

(93)

Step 3: Collecting terms and defining

ε_{j}

. Combining (86), (90) and (93) we obtain

c_{j} = a_{j} c_{j + 1} + b_{j} c_{j - 1} + (δ_{j}^{even} + δ_{j}^{odd}), j \geq j_{0} .

Set

ε_{j} : = δ_{j}^{even} + δ_{j}^{odd} .

(94)

By construction

a_{j}, b_{j} \geq 0

and, up to redefining

j_{0}

if necessary, the block geometry guarantees

a_{j} + b_{j} = 1

for all

j \geq j_{0}

: the main part of the image mass from

I_{j}

under the even and odd branches is redistributed into the neighboring blocks in proportions converging to the fixed probabilities

6 / 7

and

1 / 7

, and the boundary contributions have been absorbed into

ε_{j}

.

The asymptotic limits

a_{j} \to 6 / 7

and

b_{j} \to 1 / 7

follow from the combinatorial description of preimages in the Collatz tree: even preimages occur with frequency asymptotic to

6 / 7

at large scales, while admissible odd preimages (those with

n \equiv 4 (\mod 6)

) occur with frequency asymptotic to

1 / 7

. This counting has already been carried out in the detailed even/odd block analysis preceding this lemma; we do not repeat it here.

Finally, the bounds on

δ_{j}^{even}

and

δ_{j}^{odd}

above show that

| ε_{j} | \leq C_{*} 6^{- j}

for some

C_{*} > 0

. Since

0 < ϑ < 1

is fixed, the series

\sum_{j \geq j_{0}} ϑ^{j} | ε_{j} |

converges, which gives (84).

This proves the existence of sequences

a_{j}, b_{j}, ε_{j}

with the required properties and completes the proof. □

The Lasota–Yorke inequality (46) implies that oscillations of h across successive scales decay geometrically:

{[h]}_{tree} \leq \frac{C_{LY}}{1 - λ_{LY}} {∥ h ∥}_{1},

so that any invariant h must be essentially flat in the strong seminorm. Translating this statement into block averages gives

| c_{j + 1} - c_{j} | \leq C ϑ^{j}, j \geq 0,

(95)

for some

C > 0

. The decay of successive differences enforces a near-constant profile

c_{j} \to c_{\infty}

, and any residual deviation must satisfy the perturbed recursion (71).

We interpret (71) as a discrete second-order recurrence in the block averages

(c_{j})

, with coefficients

(a_{j}, b_{j})

determined purely by the combinatorics of the Collatz preimages. In the limit

a_{j} \to a

,

b_{j} \to b

described in Lemma 16, the homogeneous part

c_{j} = a c_{j + 1} + b c_{j - 1}

(96)

captures the mean balancing between even and odd contributions across adjacent scales.

Introducing the vector

v_{j} : = {(c_{j}, c_{j - 1})}^{⊤}

, the recursion can be written in matrix form

v_{j + 1} = M v_{j}, M = (\begin{matrix} 0 & a \\ b & 0 \end{matrix}) .

The eigenvalues of M are

\pm \sqrt{a b}

, so the spectral radius is

ρ (M) = \sqrt{a b}

. Since

a + b = 1

and

0 < b < a < 1

, we have

a b < \frac{1}{4}

and hence

ρ (M) < \frac{1}{2} < 1

. Consequently, the homogeneous solutions of (96) decay exponentially to a constant profile, and any deviation from constancy lies in the stable eigendirection of M.

Remark 9

(Spectral radius of the frozen block matrix). Let

M = (\begin{matrix} 0 & a \\ b & 0 \end{matrix}), a = \frac{6}{7}, b = \frac{1}{7},

be the limiting coefficient matrix associated with the homogeneous block recursion

c_{j} = a c_{j + 1} + b c_{j - 1} .

Then the eigenvalues of M are

λ_{\pm} = \pm \sqrt{a b},

so the spectral radius is

ρ (M) = \sqrt{a b} = \frac{\sqrt{6}}{7} < 1 .

Consequently, the homogeneous recursion is exponentially stable: every solution subexponential in j converges to a constant profile, and any deviation decays at rate

O (ρ {(M)}^{j})

. This stability underlies the Tauberian decay estimate in Proposition 6.
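The spectral radius in Remark 9 is easy to confirm by hand, since $M^{2} = a b \, I$. The sketch below is an illustration only; the helper names are hypothetical. It computes $ρ (M)$ from the characteristic polynomial $x^{2} - a b$ and iterates $M$ exactly with rational arithmetic.

```python
import math
from fractions import Fraction


def spectral_radius_antidiagonal(a, b) -> float:
    """Spectral radius of M = [[0, a], [b, 0]].

    The characteristic polynomial is x**2 - a*b, so for a*b >= 0 the
    eigenvalues are +sqrt(a*b) and -sqrt(a*b).
    """
    product = a * b
    if product < 0:
        raise ValueError("a*b must be non-negative for real eigenvalues")
    return math.sqrt(product)


def iterate(a, b, v, steps):
    """Apply M = [[0, a], [b, 0]] to the vector v = (x, y) `steps` times."""
    x, y = v
    for _ in range(steps):
        x, y = a * y, b * x  # M (x, y)^T = (a*y, b*x)^T
    return x, y


a, b = Fraction(6, 7), Fraction(1, 7)
rho = spectral_radius_antidiagonal(a, b)
print(f"rho(M) = {rho:.6f}")
print("M^10 (1, 1)^T =", iterate(a, b, (Fraction(1), Fraction(1)), 10))
```

Every two applications of $M$ multiply a vector by $a b = 6 / 49$, which is the geometric rate $ρ {(M)}^{2}$.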

Proposition 6

(Decay profile of the invariant density). Let

h \in B_{tree, σ}

be the strictly positive invariant density satisfying

P h = h, ϕ (h) = 1,

(97)

where ϕ is the normalized positive left eigenfunctional from Theorem 4. For each scale block

I_{j} = [6^{j}, 2 \cdot 6^{j})

define the block averages

c_{j} : = \frac{1}{| I_{j} |} \sum_{n \in I_{j}} h (n), j \geq 0 .

(98)

Assume the effective block recursion of Lemma 17 holds in the form

c_{j} = a_{j} c_{j + 1} + b_{j} c_{j - 1} + ε_{j}, j \geq j_{0},

(99)

with coefficients

a_{j}, b_{j} \geq 0

,

a_{j} + b_{j} = 1

, satisfying

a_{j} \to a = \frac{6}{7}, b_{j} \to b = \frac{1}{7}, \sum_{j \geq j_{0}} ϑ^{j} (| a_{j} - a | + | b_{j} - b |) < \infty,

(100)

and perturbations

ε_{j}

obeying

\sum_{j \geq j_{0}} ϑ^{j} | ε_{j} | < \infty

(101)

for some fixed

ϑ \in (0, 1)

. Assume moreover that the parameters

(α, ϑ)

in the definition of

B_{tree, σ}

satisfy

ϑ 6^{α} < 1 .

(102)

Then there exists a constant

c > 0

such that

h (n) = \frac{c}{n} + o (\frac{1}{n}) (n \to \infty),

(103)

with the error term uniform along rays of the Collatz tree.

Proof.

We first analyze the block averages

(c_{j})

and then pass from blocks to pointwise values of h.

Step 1: Renormalized block recursion and convergence of

w_{j}

. Introduce the renormalized sequence

w_{j} : = 6^{j} c_{j}, j \geq 0 .

(104)

Multiplying (99) by

6^{j}

and using

a_{j} + b_{j} = 1

yields

w_{j} = \frac{a_{j}}{6} w_{j + 1} + 6 b_{j} w_{j - 1} + 6^{j} ε_{j}, j \geq j_{0} .

(105)

By Lemma 17 and Remarks 7–8, the perturbations and coefficient deviations are controlled as in (100)–(101). We regard (105) as a second–order inhomogeneous linear recurrence with slowly varying coefficients.
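The passage from (99) to (105) is a purely algebraic rescaling, which can be verified with exact rational arithmetic. The sketch below uses hypothetical test sequences (they are not the actual block averages of h) and defines $ε_{j}$ as the residual that makes (99) hold exactly.

```python
from fractions import Fraction


def residual(c, a, b, j):
    """epsilon_j := c_j - a_j c_{j+1} - b_j c_{j-1}, so that (99) holds exactly."""
    return c[j] - a[j] * c[j + 1] - b[j] * c[j - 1]


def check_renormalized(c, a, b, j) -> bool:
    """Check w_j = (a_j / 6) w_{j+1} + 6 b_j w_{j-1} + 6**j eps_j with w_k = 6**k c_k."""

    def w(k):
        return Fraction(6) ** k * c[k]

    eps = residual(c, a, b, j)
    rhs = a[j] / 6 * w(j + 1) + 6 * b[j] * w(j - 1) + Fraction(6) ** j * eps
    return w(j) == rhs


# Hypothetical data: any sequences work, because the identity is algebraic.
c = [Fraction(1, k + 2) for k in range(8)]
a = [Fraction(6, 7)] * 8
b = [Fraction(1, 7)] * 8
ok = all(check_renormalized(c, a, b, j) for j in range(1, 7))
print("renormalized recursion (105) holds:", ok)
```

The factors $1 / 6$ and $6$ arise from $6^{j} = 6^{j + 1} / 6 = 6 \cdot 6^{j - 1}$; the check does not use $a_{j} + b_{j} = 1$.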

For the frozen–coefficient system, set

M = (\begin{matrix} 0 & a \\ b & 0 \end{matrix}), v_{j} : = (\begin{matrix} c_{j} \\ c_{j - 1} \end{matrix}),

(106)

so that the homogeneous recursion

c_{j} = a c_{j + 1} + b c_{j - 1}

can be written as

v_{j + 1} = M v_{j}

. As observed in Remark 9, the eigenvalues of M are

λ_{\pm} = \pm \sqrt{a b}

and

ρ (M) = \sqrt{a b} = \sqrt{\frac{6}{7} \cdot \frac{1}{7}} < \frac{1}{2} < 1 .

(107)

Thus there is a norm

{∥ \cdot ∥}_{*}

on

R^{2}

and a constant

η \in (0, 1)

such that

{∥ M ∥}_{*} \leq η

.

The recursion (99) can be written in the form

v_{j + 1} = M_{j} v_{j} + F_{j},

(108)

where

M_{j}

is a

2 \times 2

matrix converging to M and

F_{j}

is an inhomogeneity arising from

ε_{j}

. The weighted summability (100)–(101) implies

\sum_{j \geq j_{0}} ϑ^{j} ({∥ M_{j} - M ∥}_{*} + {∥ F_{j} ∥}_{*}) < \infty .

(109)

A standard discrete variation–of–constants argument for the nonautonomous system (108) (applied in the norm

{∥ \cdot ∥}_{*}

and using

{∥ M ∥}_{*} \leq η < 1

together with (109)) shows that

v_{j} = v_{\infty} + r_{j}, {∥ r_{j} ∥}_{*} \leq C ϑ^{j} (j \geq j_{0}),

(110)

for some vector

v_{\infty} = {(c_{\infty}, c_{\infty})}^{⊤}

and constant

C > 0

. In particular,

c_{j} = c_{\infty} + O (ϑ^{j}) (j \to \infty) .

(111)

Since

h > 0

and each

c_{j}

is an average of positive values, the limit

c_{\infty}

is strictly positive. Returning to

w_{j} = 6^{j} c_{j}

we obtain

w_{j} = 6^{j} c_{\infty} + O (ϑ^{j} 6^{j}),

(112)

so that

c_{j} = \frac{w_{j}}{6^{j}} = c_{\infty} + O (ϑ^{j}) (j \to \infty) .

(113)

Step 2: Oscillation control inside blocks. The Lasota–Yorke inequality on

B_{tree, σ}

implies that h has uniformly controlled oscillations on each block. More precisely, by the definition of the tree seminorm and the choice of parameters

(α, ϑ)

, there is a constant

C_{1} > 0

such that, for the same exponent

α \in (0, 1)

,

{osc}_{I_{j}} h : = sup_{u, v \in I_{j}} | h (u) - h (v) | \leq C_{1} ϑ^{j} 6^{- (1 - α) j} (j \geq j_{0}) .

(114)

Combining (114) with the definition (98) of

c_{j}

yields, for every

n \in I_{j}

,

| h (n) - c_{j} | \leq {osc}_{I_{j}} h \leq C_{1} ϑ^{j} 6^{- (1 - α) j} .

(115)

Since

n \in I_{j}

implies

n ≍ 6^{j}

, we have

6^{- j} ≍ 1 / n

. Moreover, by (102),

\frac{ϑ^{j} 6^{- (1 - α) j}}{6^{- j}} = {(ϑ 6^{α})}^{j} ⟶ 0 (j \to \infty),

(116)

so the oscillation error in (115) is

o (6^{- j})

and hence

o (1 / n)

.

Step 3: Pointwise asymptotics. Combining (113) and (115), and using

6^{j} ≍ n

for

n \in I_{j}

, we obtain, uniformly for

n \in I_{j}

,

\begin{matrix} h (n) & = c_{j} + O (ϑ^{j} 6^{- (1 - α) j}) \\ & = c_{\infty} + O (ϑ^{j}) + o (6^{- j}) \\ & = \frac{c_{\infty}}{6^{j}} \cdot 6^{j} + o (6^{- j}) . \end{matrix}

(117)

Since

6^{j} ≍ n

on

I_{j}

, we may write

6^{- j} = κ_{j} / n

with

κ_{j} \to κ > 0

, and (117) becomes

h (n) = \frac{c}{n} + o (\frac{1}{n}), n \to \infty,

(118)

where

c = c_{\infty} κ > 0

is a constant determined by the normalization

ϕ (h) = 1

. The

o (1 / n)

error is uniform in n on each block

I_{j}

, and hence uniform along rays of the Collatz tree.

This proves (103) and completes the argument. □
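To make the notation (98) and (104) concrete, the following sketch evaluates the block averages $c_{j}$ and the renormalized quantities $w_{j} = 6^{j} c_{j}$ for the model profile $h (n) = 1 / n$. This profile is an assumption chosen for illustration; it is not the invariant density itself, and the helper names are hypothetical.

```python
import math


def block(j: int) -> range:
    """Scale block I_j = [6**j, 2 * 6**j) as a range of integers."""
    return range(6 ** j, 2 * 6 ** j)


def block_average(h, j: int) -> float:
    """c_j = (1 / |I_j|) * sum of h(n) over n in I_j, as in (98)."""
    points = block(j)
    return sum(h(n) for n in points) / len(points)


def renormalized(h, j: int) -> float:
    """w_j = 6**j * c_j, as in (104)."""
    return 6 ** j * block_average(h, j)


def model_profile(n: int) -> float:
    """Illustrative profile h(n) = 1/n (an assumption, not the true density)."""
    return 1.0 / n


for j in range(1, 6):
    print(j, block_average(model_profile, j), renormalized(model_profile, j))
print("log 2 =", math.log(2))
```

For this profile $w_{j}$ is the partial harmonic sum over $I_{j}$, which approaches $\log 2$ as j grows.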

The explicit Lasota–Yorke constants obtained in Section 4.4 guarantee that the same contraction rate governs the full operator P on

B_{tree, σ}

, ensuring that invariant densities are asymptotically flat in the strong seminorm—block averages converge while the global profile follows the two-sided recursion. In particular, the invariant density h decays like

c / n

along the Collatz tree.

5.2. Effective Block Recursion and Spectral Estimate

We now make the block-recursion framework explicit and quantify the coefficients and perturbations that encode how the invariance equation

P h = h

propagates between adjacent scales.

Proposition 7

(Effective perturbed recursion). Let

0 < α < 1

,

0 < ϑ < 1

,

σ > 1

, and

h \in B_{tree, σ}

satisfy

P h = h

. Let

c_{j}

be the block averages

c_{j} : = \frac{1}{| I_{j} |} \sum_{n \in I_{j}} h (n), j \geq 0 .

Then there exist constants

a, b > 0

, depending only on the (combinatorial) limiting ratios of even and odd preimages between scales (cf. Lemma 18), and a sequence

{(ε_{j})}_{j \geq 0}

such that

c_{j} = a c_{j + 1} + b c_{j - 1} + ε_{j}, j \geq 1,

(119)

with

{∥ ε ∥}_{ϑ} : = \sum_{j \geq 0} | ε_{j} | ϑ^{j} < \infty .

(120)

The constants

a, b

and the bound on

{∥ ε ∥}_{ϑ}

are independent of h.

Proof.

By Lemma 15, for

h \in B_{tree, σ}

with

P h = h

there exist sequences

{(a_{j})}_{j \geq 0}

,

{(b_{j})}_{j \geq 0}

with

a_{j}, b_{j} \geq 0

and a sequence

{(η_{j})}_{j \geq 0}

such that

c_{j} = a_{j} c_{j + 1} + b_{j} c_{j - 1} + η_{j}, j \geq 1,

(121)

and

\sum_{j \geq 0} ϑ^{j} | η_{j} | < \infty .

(122)

The coefficients

a_{j}, b_{j}

are defined in terms of normalized even and odd preimage weights from

I_{j + 1}

and

I_{j - 1}

into

I_{j}

.

1. Limits $a, b$ from preimage asymptotics. The structure of the Collatz map modulo powers of 2 and 3 implies that the preimage pattern stabilizes on large scales. More precisely, there exist constants

a, b > 0

and

C > 0

,

0 < δ < 1

(depending only on the map and the choice of blocks

I_{j}

) such that

| a_{j} - a | + | b_{j} - b | \leq C δ^{j} for all j \geq 0 .

(123)

This is obtained by an explicit counting of even preimages

2 n

and odd preimages

(n - 1) / 3

landing in

I_{j}

, normalized by

| I_{j} |

, and observing that the resulting ratios converge exponentially fast to the limiting densities (see the detailed preimage counting in the arithmetic section where

a, b

are defined). The key point for this proposition is that (123) is purely combinatorial and does not depend on h.

2. Growth control for block averages $c_{j}$ . We claim that

(c_{j})

has at most controlled exponential growth governed by

{∥ h ∥}_{σ}

.

For

n \in I_{j}

we have

n ≍ 6^{j}

, so

n^{σ} \leq {(2 \cdot 6^{j})}^{σ}

. Then

| c_{j} | \leq \frac{1}{| I_{j} |} \sum_{n \in I_{j}} | h (n) | \leq \frac{1}{| I_{j} |} \sum_{n \in I_{j}} n^{σ} \frac{| h (n) |}{n^{σ}} \leq \frac{{(2 \cdot 6^{j})}^{σ}}{| I_{j} |} \sum_{n \in I_{j}} \frac{| h (n) |}{n^{σ}} .

Since

| I_{j} | ≍ 6^{j}

and

\sum_{n \in I_{j}} \frac{| h (n) |}{n^{σ}} \leq {∥ h ∥}_{σ}

, we obtain

| c_{j} | \leq C_{0} 6^{(σ - 1) j} {∥ h ∥}_{σ} for all j \geq 0,

(124)

for some constant

C_{0}

depending only on

σ

and the block geometry. Thus

c_{j}

is at most exponentially growing, with a rate depending only on

σ

(and this bound is uniform in h up to the factor

{∥ h ∥}_{σ}

).

3. Passing from $(a_{j}, b_{j})$ to constants $(a, b)$ . Rewrite (121) as

c_{j} = a c_{j + 1} + b c_{j - 1} + ε_{j},

where we define

ε_{j} : = η_{j} + (a_{j} - a) c_{j + 1} + (b_{j} - b) c_{j - 1} .

(125)

The relation (119) is just this identity.

It remains to prove the weighted summability

\sum_{j \geq 0} ϑ^{j} | ε_{j} | < \infty

.

By (122), the contribution of

η_{j}

is already summable. For the remaining terms, use (123) and (124):

| (a_{j} - a) c_{j + 1} | \leq C δ^{j} | c_{j + 1} | \leq C δ^{j} C_{0} 6^{(σ - 1) (j + 1)} {∥ h ∥}_{σ},

and similarly

| (b_{j} - b) c_{j - 1} | \leq C δ^{j} C_{0} 6^{(σ - 1) (j - 1)} {∥ h ∥}_{σ}

for

j \geq 1

. Therefore

\begin{matrix} \sum_{j \geq 0} ϑ^{j} | (a_{j} - a) c_{j + 1} | & \leq C_{1} {∥ h ∥}_{σ} \sum_{j \geq 0} {(ϑ δ 6^{σ - 1})}^{j}, \\ \sum_{j \geq 1} ϑ^{j} | (b_{j} - b) c_{j - 1} | & \leq C_{2} {∥ h ∥}_{σ} \sum_{j \geq 1} {(ϑ δ 6^{σ - 1})}^{j - 1}, \end{matrix}

for suitable constants

C_{1}, C_{2}

depending only on

C, C_{0}

.

Since

δ < 1

is fixed by the combinatorics and

ϑ \in (0, 1)

is under our control, we may (and do) assume that

ϑ

has been chosen small enough so that

ϑ δ 6^{σ - 1} < 1 .

(126)

(Any choice of

(α, ϑ, σ)

used later must satisfy this together with the constraints from the Lasota–Yorke estimates; this is compatible with the parameter regime considered.)

Under condition (126), both geometric series above converge, and we conclude that

\sum_{j \geq 0} ϑ^{j} (| (a_{j} - a) c_{j + 1} | + | (b_{j} - b) c_{j - 1} |) < \infty .

Combining with (122) and the definition (125), we obtain

\sum_{j \geq 0} ϑ^{j} | ε_{j} | < \infty,

i.e. (120) holds. This completes the proof. □

The coefficient matrix of the associated homogeneous recursion,

M = (\begin{matrix} 0 & a \\ b & 0 \end{matrix})

has eigenvalues

\pm \sqrt{a b}

. Under the parameter choice

(α, ϑ) = (\frac{1}{2}, \frac{1}{5})

, the odd-branch contraction constant computed in Section 4.4 implies

\sqrt{a b} < 1

, hence

ρ (M) < 1

. The inequality

ρ (M) < 1

means that deviations of successive block averages from constancy decay geometrically along the scale index j. This discrete contraction is the block-level reflection of the Lasota–Yorke inequality on

B_{tree, σ}

, confirming that the invariant density must be asymptotically flat across scales.

Lemma 18

(Verification of the block coefficients). Let

I_{j} = [6^{j}, 2 \cdot 6^{j}) \cap N

and define the even and odd preimage windows

E_{j}^{*} : = {2 m : m \in I_{j}}, O_{j}^{*} : = \{\frac{m - 1}{3} : m \in I_{j}, m \equiv 4 (\mod 6)\} .

(127)

Assume that the coefficients

a, b > 0

in Proposition 7 are given by the asymptotic preimage ratios

a : = lim_{j \to \infty} \frac{| E_{j}^{*} |}{| I_{j} |}, b : = lim_{j \to \infty} \frac{| O_{j}^{*} |}{| I_{j} |},

(128)

whenever these limits exist. Then the limits exist and satisfy

a = 1, b = \frac{1}{6}, a b = \frac{1}{6} < 1 .

(129)

Proof.

For each

j \geq 0

the block

I_{j}

has cardinality

| I_{j} | = 2 \cdot 6^{j} - 6^{j} = 6^{j} .

For the even-preimage window, the map

T_{even} : I_{j} \to N, T_{even} (m) = 2 m,

is injective, and by definition

E_{j}^{*} = {2 m : m \in I_{j}}

. Hence

T_{even}

restricts to a bijection between

I_{j}

and

E_{j}^{*}

, so

| E_{j}^{*} | = | I_{j} | = 6^{j} for all j \geq 0 .

Dividing by

| I_{j} |

shows that

\frac{| E_{j}^{*} |}{| I_{j} |} = 1 for all j,

and therefore the limit in (128) exists with

a = 1

.

For the odd-preimage window, recall that the backward odd branch of the Collatz map is

T_{odd} (m) = \frac{m - 1}{3},

which is defined precisely when

m \equiv 4 (mod 6)

, and in that case

(m - 1) / 3

is odd and satisfies

3 \frac{m - 1}{3} + 1 = m

. Thus

O_{j}^{*}

consists of all such odd preimages with

m \in I_{j}

and

m \equiv 4 (mod 6)

.

Among the

6^{j}

consecutive integers in

I_{j}

, exactly one out of every six lies in the residue class

4 (mod 6)

, up to a boundary discrepancy of at most one element. More precisely,

# {m \in I_{j} : m \equiv 4 (\mod 6)} = \frac{1}{6} | I_{j} | + O (1) = \frac{1}{6} 6^{j} + O (1) .

The map

T_{odd}

is injective on

{m \in I_{j} : m \equiv 4 (\mod 6)}

, so

| O_{j}^{*} | = # {m \in I_{j} : m \equiv 4 (\mod 6)} = \frac{1}{6} 6^{j} + O (1) .

Dividing by

| I_{j} | = 6^{j}

yields

\frac{| O_{j}^{*} |}{| I_{j} |} = \frac{1}{6} + O (6^{- j}),

so the limit in (128) exists and

b = 1 / 6

.

Combining the two computations gives

a b = (1) (1 / 6) = 1 / 6 < 1

, which is the desired strict contraction at the block-recursion level. □

Remark 10

(Normalization of the block coefficients). In the exact probabilistic normalization of Lemma 16 one has

a + b = 1

with

a = \frac{6}{7}

,

b = \frac{1}{7}

. The unnormalized choice

a = 1

,

b = \frac{1}{6}

in Lemma 18 differs only by a constant scaling of the recurrence (119), and both yield a strict contraction since

a b < 1

. The precise normalization is immaterial for the spectral-gap conclusion, which depends only on

\sqrt{a b} < 1

.
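The relation between the two normalizations can be made explicit by direct counting. The sketch below, with hypothetical helper names, enumerates the windows $E_{j}^{*}$ and $O_{j}^{*}$ of (127), forms the ratios in (128), and rescales them to sum to one.

```python
from fractions import Fraction


def block(j: int) -> range:
    """Scale block I_j = [6**j, 2 * 6**j)."""
    return range(6 ** j, 2 * 6 ** j)


def even_window(j: int) -> list:
    """E_j^* = {2m : m in I_j}."""
    return [2 * m for m in block(j)]


def odd_window(j: int) -> list:
    """O_j^* = {(m - 1)/3 : m in I_j, m = 4 (mod 6)}."""
    return [(m - 1) // 3 for m in block(j) if m % 6 == 4]


def preimage_ratios(j: int):
    """The pair (|E_j^*| / |I_j|, |O_j^*| / |I_j|) as exact fractions."""
    size = len(block(j))
    return Fraction(len(even_window(j)), size), Fraction(len(odd_window(j)), size)


def normalize(a, b):
    """Rescale (a, b) by 1 / (a + b) so that the pair sums to 1."""
    total = a + b
    return a / total, b / total


for j in range(1, 5):
    a_raw, b_raw = preimage_ratios(j)
    print(j, a_raw, b_raw, normalize(a_raw, b_raw))
```

For $j \geq 1$ the block length $6^{j}$ is a multiple of 6 and the block starts at a multiple of 6, so the odd ratio is exactly $1 / 6$ and the rescaled pair is exactly $(6 / 7, 1 / 7)$.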

5.3. Odd-Branch Distortion at $α = \frac{1}{2}$ and a Certified $λ_{odd} < 1$

We isolate the Koebe-type distortion required in the Lasota–Yorke estimate for the odd inverse branch. Throughout this subsection

0 < ϑ < 1

and

I_{j} = [6^{j}, 2 \cdot 6^{j}) \cap N

.

Lemma 19

(Odd-branch distortion bound at

α = \frac{1}{2}

). Let

W_{α} (u, v) = \frac{u v}{| u - v | {(u + v)}^{α}}

. For

α = \frac{1}{2}

and any

u, v \in I_{j}

with

j \geq 1

,

u \neq v

, set

u^{'} = (u - 1) / 3

,

v^{'} = (v - 1) / 3

. Then

\frac{W_{1 / 2} (u, v)}{u^{'}} \leq C_{1 / 2} W_{1 / 2} (u^{'}, v^{'}), C_{1 / 2} \leq \frac{3}{2} .

(130)

Consequently, the odd-branch contribution in the Lasota–Yorke inequality on

B_{tree}

satisfies

λ_{odd} (\frac{1}{2}, ϑ) \leq \frac{C_{1 / 2}}{\sqrt{6}} ϑ \leq \frac{3}{2 \sqrt{6}} ϑ .

(131)

In particular, for

ϑ = \frac{1}{5}

one has

λ_{odd} (1 / 2, 1 / 5) < 1

.

Proof.

Let

α = \frac{1}{2}

. For

u, v \in I_{j}

with

j \geq 1

, write

u^{'} = \frac{u - 1}{3}, v^{'} = \frac{v - 1}{3} .

A direct computation gives

\begin{matrix} W_{1 / 2} (u^{'}, v^{'}) & = \frac{u^{'} v^{'}}{| u^{'} - v^{'} | {(u^{'} + v^{'})}^{1 / 2}} = \frac{\frac{(u - 1) (v - 1)}{9}}{\frac{| u - v |}{3} {(\frac{u + v - 2}{3})}^{1 / 2}} = \frac{(u - 1) (v - 1) 3^{- 1 / 2}}{| u - v | {(u + v - 2)}^{1 / 2}} . \end{matrix}

Hence

\begin{matrix} \frac{W_{1 / 2} (u, v)}{u^{'}} & = \frac{u v}{| u - v | {(u + v)}^{1 / 2}} \cdot \frac{3}{u - 1} \\ & = 3^{3 / 2} \frac{u v}{{(u - 1)}^{2} (v - 1)} {(\frac{u + v - 2}{u + v})}^{1 / 2} \cdot \frac{(u - 1) (v - 1) 3^{- 1 / 2}}{| u - v | {(u + v - 2)}^{1 / 2}} \\ & = 3^{3 / 2} \underset{= : G (u, v)}{\underset{︸}{[\frac{u}{u - 1} \cdot \frac{v}{v - 1} \cdot \frac{1}{u - 1}]}} {(\frac{u + v - 2}{u + v})}^{1 / 2} \underset{= W_{1 / 2} (u^{'}, v^{'})}{\underset{︸}{\frac{(u - 1) (v - 1) 3^{- 1 / 2}}{| u - v | {(u + v - 2)}^{1 / 2}}}} . \end{matrix}

Therefore, since the square-root factor is at most 1,

\frac{W_{1 / 2} (u, v)}{u^{'}} = 3^{3 / 2} G (u, v) {(\frac{u + v - 2}{u + v})}^{1 / 2} W_{1 / 2} (u^{'}, v^{'}) \leq 3^{3 / 2} G (u, v) W_{1 / 2} (u^{'}, v^{'}) .

Since

u, v \in I_{j}

with

j \geq 1

we have

u, v \geq 6

. Thus

\frac{u}{u - 1}, \frac{v}{v - 1} \leq \frac{6}{5}, \frac{1}{u - 1} \leq \frac{1}{5} .

Consequently

G (u, v) = \frac{u}{u - 1} \cdot \frac{v}{v - 1} \cdot \frac{1}{u - 1} \leq \frac{6}{5} \cdot \frac{6}{5} \cdot \frac{1}{5} = \frac{36}{125} .

It follows that

\frac{W_{1 / 2} (u, v)}{u^{'}} \leq 3^{3 / 2} \cdot \frac{36}{125} W_{1 / 2} (u^{'}, v^{'}) < \frac{3}{2} W_{1 / 2} (u^{'}, v^{'}),

because

3^{3 / 2} \cdot \frac{36}{125} \approx 1.4965 < \frac{3}{2}

. We may therefore replace the explicit constant

3^{3 / 2} \cdot 36 / 125

by the slightly larger but cleaner bound

C_{1 / 2} = \frac{3}{2}

for the distortion factor in (130).

The bound (130) is precisely the distortion factor needed when estimating

ϑ^{j} W_{1 / 2} (u, v) |Δ (P_{odd} f; u, v)|

by the scale-

j - 1

oscillation of f (since

u^{'}, v^{'} \in I_{j - 1}

) together with the indicator restriction

u \equiv v \equiv 4 (\mod 6)

, whose combinatorial thinning yields the standard

\sqrt{6}

denominator in the block-to-block comparison. This gives (131). For

ϑ = \frac{1}{5}

we obtain

λ_{odd} (1 / 2, 1 / 5) \leq \frac{3}{2 \sqrt{6}} \cdot \frac{1}{5} < 1

, as claimed. □
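The distortion ratio in the proof can be evaluated numerically straight from the definition of the weight. The following is a minimal sketch, not part of the original argument: the function names `W_half`, `distortion_ratio` and `max_ratio` are hypothetical, and the scan range is an arbitrary choice covering the blocks I_1 and part of I_2. It only compares the ratio with the constant 3/2 and does not model the residue-class thinning.

```python
import math


def W_half(u: float, v: float) -> float:
    """Weight W_{1/2}(u, v) = u v / (|u - v| (u + v)^{1/2}), defined for u != v."""
    return u * v / (abs(u - v) * math.sqrt(u + v))


def distortion_ratio(u: float, v: float) -> float:
    """Ratio (W_{1/2}(u, v) / u') / W_{1/2}(u', v') with u' = (u-1)/3, v' = (v-1)/3."""
    u_p, v_p = (u - 1) / 3, (v - 1) / 3
    return (W_half(u, v) / u_p) / W_half(u_p, v_p)


def max_ratio(lo: int, hi: int) -> float:
    """Largest distortion ratio over integer pairs u != v in [lo, hi)."""
    return max(
        distortion_ratio(u, v)
        for u in range(lo, hi)
        for v in range(lo, hi)
        if u != v
    )


# Scan all integer pairs with 6 <= u, v < 72 (illustrative range only).
worst = max_ratio(6, 72)
print(f"largest ratio found: {worst:.4f} (compare with 3/2)")
```

Because the ratio carries a factor 1/(u - 1), the largest values occur at the bottom of the range, so extending the scan upward should not change the maximum.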

The factor

\frac{1}{\sqrt{6}}

in (131) corresponds to the thinning of the residue class

n \equiv 4 (mod 6)

within each block

I_{j}

, while

C_{1 / 2}

quantifies the residual distortion caused by the affine map

n \mapsto (n - 1) / 3

. Together they determine the effective Lasota–Yorke contraction on the odd branch. In particular, the verified bound

λ_{odd} (1 / 2, 1 / 5) < 1

implies a strict spectral gap for P on

B_{tree, σ}

and establishes quasi-compactness with

ρ_{ess} (P) \leq λ_{odd} (1 / 2, 1 / 5)

.

5.4. Effective Block Recursion: Explicit Coefficients and Summable Error

We now derive the two-sided block recursion for invariant densities h, identify explicit coefficients

a, b

from preimage densities, and prove that the perturbation

ϵ

is

ϑ

-summable.

Lemma 20

(Mid-band to adjacent-scale averaging). Let

I_{j} = [6^{j}, 2 \cdot 6^{j})

and let

U_{j}^{even} : = 2 I_{j} = [2 \cdot 6^{j}, 4 \cdot 6^{j})

and

U_{j - 1}^{odd} : = J_{j - 1} \subset [2 \cdot 6^{j - 1}, 4 \cdot 6^{j - 1})

be the bands from the even and odd inverse branches, respectively. Then there exists a constant

C > 0

(independent of j and h) such that

|\frac{1}{| U_{j}^{even} |} \sum_{m \in U_{j}^{even}} h (m) - c_{j + 1}| \leq C ϑ^{j} {[h]}_{tree}, |\frac{1}{| U_{j - 1}^{odd} |} \sum_{m \in U_{j - 1}^{odd}} h (m) - c_{j - 1}| \leq C ϑ^{j - 1} {[h]}_{tree} .

Proposition 8

(Effective perturbed recursion with explicit

a, b

). Let

h \in B_{tree, σ}

satisfy

P h = h

. Define block masses and averages

H_{j} : = \sum_{n \in I_{j}} h (n), c_{j} : = \frac{H_{j}}{| I_{j} |} = \frac{H_{j}}{6^{j}} .

There exist constants

a, b > 0

and a sequence

{(ϵ_{j})}_{j \geq 1}

such that

c_{j} = a c_{j + 1} + b c_{j - 1} + ϵ_{j}, j \geq 1,

(132)

with

\frac{1}{12} \leq a \leq \frac{1}{6}, \frac{1}{12} \leq b \leq \frac{1}{6},

(133)

and

\sum_{j \geq 1} | ϵ_{j} | ϑ^{j} \leq C {[h]}_{tree},

(134)

for a constant

C = C (α, ϑ, σ)

independent of h. In particular,

{∥ ϵ ∥}_{ϑ} < \infty

.
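To make the block quantities concrete, here is a small sketch that computes the block mass H_j and the block average c_j for a given profile h. The helper names are hypothetical, and the sample profile h(n) = 1/n is only an illustration (chosen because the abstract mentions a c/n decay profile); it is not claimed to be the invariant density.

```python
import math
from typing import Callable


def block(j: int) -> range:
    """Integers in the block I_j = [6^j, 2 * 6^j)."""
    return range(6 ** j, 2 * 6 ** j)


def block_mass(h: Callable[[int], float], j: int) -> float:
    """H_j = sum of h(n) over n in I_j."""
    return sum(h(n) for n in block(j))


def block_average(h: Callable[[int], float], j: int) -> float:
    """c_j = H_j / |I_j|, where |I_j| = 6^j."""
    return block_mass(h, j) / len(block(j))


# For h(n) = 1/n the block mass is close to log 2 at every scale,
# so the averages c_j decay like log(2) / 6^j.
for j in range(1, 5):
    print(j, block_mass(lambda n: 1.0 / n, j), block_average(lambda n: 1.0 / n, j))
```

For a constant profile the averages are constant in j, which is the situation described by the conclusion of Theorem 5.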

Proof.

Since

P h = h

,

H_{j} = \sum_{n \in I_{j}} h (n) = \sum_{n \in I_{j}} (\frac{h (2 n)}{2 n} + 1_{{n \equiv 4 (6)}} \frac{h ((n - 1) / 3)}{(n - 1) / 3}) = : E_{j} + O_{j} .

(135)

We treat the even and odd contributions separately.
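The two terms in (135) come from the two inverse branches of T. As a rough illustration, the sketch below implements the pointwise action read off from (135), namely (P f)(n) = f(2n)/(2n) + 1_{n ≡ 4 (6)} f((n-1)/3)/((n-1)/3). The names `collatz_step` and `apply_P` are hypothetical, and the weights 1/(2n) and 1/((n-1)/3) are taken from (135) as an assumption about the operator's normalization.

```python
from typing import Callable


def collatz_step(n: int) -> int:
    """Forward Collatz map T."""
    return n // 2 if n % 2 == 0 else 3 * n + 1


def apply_P(f: Callable[[int], float], n: int) -> float:
    """Evaluate (P f)(n) using the even and odd inverse branches as in (135)."""
    even_preimage = 2 * n
    value = f(even_preimage) / even_preimage
    if n % 6 == 4:
        # n = 3m + 1 with m odd exactly when n is congruent to 4 mod 6.
        odd_preimage = (n - 1) // 3
        value += f(odd_preimage) / odd_preimage
    return value


# With f(k) = k every preimage contributes 1, so (P f)(n) counts preimages of n.
print([apply_P(lambda k: float(k), n) for n in range(1, 11)])
```

The preimage count is 2 exactly on the residue class 4 (mod 6) and 1 elsewhere, which is the thinning that appears in the odd contribution O_j.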

Even contribution. Set

E_{j} = \sum_{n \in I_{j}} h (2 n) / (2 n)

. The set

2 I_{j} = [2 \cdot 6^{j}, 4 \cdot 6^{j})

has length

2 \cdot 6^{j}

. For

m = 2 n \in 2 I_{j}

we have

1 / (2 n) \in [{(4 \cdot 6^{j})}^{- 1}, {(2 \cdot 6^{j})}^{- 1}]

. Therefore

\frac{1}{4 \cdot 6^{j}} \sum_{m \in 2 I_{j}} h (m) \leq E_{j} \leq \frac{1}{2 \cdot 6^{j}} \sum_{m \in 2 I_{j}} h (m) .

(136)

Using Lemma 20 on

U_{j}^{even} = 2 I_{j}

and

| 2 I_{j} | = 2 \cdot 6^{j}

, we get

\frac{| 2 I_{j} |}{4 \cdot 6^{j}} (c_{j + 1} + O (ϑ^{j} {[h]}_{tree})) \leq E_{j} \leq \frac{| 2 I_{j} |}{2 \cdot 6^{j}} (c_{j + 1} + O (ϑ^{j} {[h]}_{tree})),

hence

\frac{1}{2} c_{j + 1} + O (ϑ^{j} {[h]}_{tree}) \leq E_{j} \leq 1 \cdot c_{j + 1} + O (ϑ^{j} {[h]}_{tree}) .

(137)

Dividing by

6^{j}

later will insert the factor

1 / 6

into the coefficient of

c_{j + 1}

.

Consequently, after dividing by

6^{j}

in the block balance, the even term contributes a coefficient for

c_{j + 1}

in the range

[\frac{1}{12}, \frac{1}{6}]

.

Odd contribution. Set

O_{j} = \sum_{n \in I_{j}} 1_{{n \equiv 4 (6)}} h ((n - 1) / 3) / ((n - 1) / 3)

and change variables

m = (n - 1) / 3

. Then

n = 3 m + 1

and the image of

I_{j}

corresponds to

J_{j - 1} : = [\frac{6^{j} - 1}{3}, \frac{2 \cdot 6^{j} - 1}{3}) \cap N \subset [2 \cdot 6^{j - 1}, 4 \cdot 6^{j - 1}),

which has length

| J_{j - 1} | = 2 \cdot 6^{j - 1} + O (1)

and satisfies

1 / m \in [{(4 \cdot 6^{j - 1})}^{- 1}, {(2 \cdot 6^{j - 1})}^{- 1}]

for

m \in J_{j - 1}

. Arguing as for the even term and using the scale-

(j - 1)

seminorm control,

\sum_{m \in J_{j - 1}} h (m) = | J_{j - 1} | c_{j - 1} + δ_{j}^{(O)}, | δ_{j}^{(O)} | \leq C_{3} 6^{j - 1} ϑ^{j - 1} {[h]}_{tree} .

Hence

\frac{| J_{j - 1} |}{4 \cdot 6^{j - 1}} c_{j - 1} + O (ϑ^{j - 1} {[h]}_{tree}) \leq O_{j} \leq \frac{| J_{j - 1} |}{2 \cdot 6^{j - 1}} c_{j - 1} + O (ϑ^{j - 1} {[h]}_{tree}) .

(139)

By Lemma 20, replacing the

U_{j - 1}^{odd}

-average by

c_{j - 1}

costs

O (ϑ^{j - 1} {[h]}_{tree})

, so combining with

| J_{j - 1} | = 2 \cdot 6^{j - 1} + O (1)

yields

\frac{1}{2} c_{j - 1} + O (ϑ^{j - 1} {[h]}_{tree}) \leq O_{j} \leq 1 \cdot c_{j - 1} + O (ϑ^{j - 1} {[h]}_{tree}) .

(140)

Collecting the bounds. Dividing (137) and (140) by

6^{j}

and using

H_{j} = E_{j} + O_{j}

we obtain

c_{j} = a c_{j + 1} + b c_{j - 1} + ϵ_{j},

where the coefficients lie in the sandwiched ranges

a \in [\frac{1}{12}, \frac{1}{6}], b \in [\frac{1}{12}, \frac{1}{6}],

and the error obeys

| ϵ_{j} | \leq C ϑ^{j} {[h]}_{tree}

. This gives (132)–(134). □

Remark 11

(Interpretation of

a, b

). The bounds (133) are sharp at the level of this scale calculus: they encode that each strip contributing to

I_{j}

occupies a fraction comparable to its relative width (a factor 2 in length) times the typical inverse-height (

\sim {(3 \cdot 6^{\cdot})}^{- 1}

), which together give a coefficient in

[\frac{1}{2}, 1]

before the 6-normalization; the

1 / 6

passage from mass to average then places the effective two-sided coefficients in

[\frac{1}{12}, \frac{1}{6}]

. If finer preimage combinatorics are imposed (e.g. restricting to residues

4 (mod 6)

precisely),

a, b

can be sharpened, though the above bounds already imply

ρ (M) < 1

for

M = (\begin{matrix} 0 & a \\ b & 0 \end{matrix})

.

Theorem 5

(Spectral bound for invariant profiles). Let

0 < α < 1

,

0 < ϑ < 1

,

σ > 1

, and

h \in B_{tree, σ}

satisfy

P h = h

. Let

c_{j}

be the block averages of h and suppose that they satisfy the effective recursion of Proposition 7:

c_{j} = a c_{j + 1} + b c_{j - 1} + ε_{j}, j \geq 1,

(142)

with

a, b > 0

independent of j and

\sum_{j \geq 0} | ε_{j} | ϑ^{j} < \infty

. Assume moreover (as ensured by the preimage counting) that

a + b = 1 and 0 < b < a < 1 .

(143)

Then:

The sequence $(c_{j})$ converges exponentially fast to a limit $C \in \mathbb{C}$ .
The function h is identically equal to this constant: $h (n) \equiv C$ .
Consequently, the eigenspace of P associated to the eigenvalue $λ = 1$ in $B_{tree, σ}$ is one-dimensional.

Proof.

1. Analysis of the homogeneous recursion. Ignoring

ε_{j}

for the moment, the homogeneous recurrence is

c_{j} = a c_{j + 1} + b c_{j - 1}, j \geq 1 .

(144)

Rewriting,

a c_{j + 1} - c_{j} + b c_{j - 1} = 0 .

Seeking solutions of the form

c_{j} = r^{j}

yields

a r^{2} - r + b = 0 .

By (143),

a + b = 1

, so

r = 1

is a root: substituting gives

a \cdot 1^{2} - 1 + b = (a + b) - 1 = 0

. Thus one root is

r_{1} = 1

, and the other

r_{2}

satisfies

r_{1} r_{2} = b / a

, so

r_{2} = \frac{b}{a} .

(145)

The conditions

0 < b < a < 1

imply

0 < r_{2} < 1

, so every solution of the homogeneous recursion is bounded for j \geq 0 and has the form

c_{j}^{hom} = C_{1} \cdot 1^{j} + C_{2} r_{2}^{j} = C_{1} + C_{2} r_{2}^{j},

where the non-constant mode decays exponentially at rate

r_{2}

.
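The root computation in Step 1 can be checked numerically. The sketch below solves the characteristic equation a r^2 - r + b = 0 with the quadratic formula; the function name and the sample values of a and b are illustrative assumptions, chosen only to satisfy a + b = 1 and 0 < b < a < 1.

```python
import math
from typing import Tuple


def characteristic_roots(a: float, b: float) -> Tuple[float, float]:
    """Real roots of a r^2 - r + b = 0, returned with the larger root first."""
    disc = 1.0 - 4.0 * a * b
    if disc < 0:
        raise ValueError("complex roots: 1 - 4ab is negative")
    s = math.sqrt(disc)
    return (1.0 + s) / (2.0 * a), (1.0 - s) / (2.0 * a)


# When a + b = 1 the discriminant equals (2a - 1)^2, so the roots are 1 and b/a.
for a, b in [(0.6, 0.4), (6 / 7, 1 / 7)]:
    r1, r2 = characteristic_roots(a, b)
    print(f"a={a:.4f}, b={b:.4f}: r1={r1:.6f}, r2={r2:.6f}, b/a={b / a:.6f}")
```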

2. Stability under summable perturbations. We now incorporate the perturbation

ε_{j}

.

From (142),

a c_{j + 1} = c_{j} - b c_{j - 1} - ε_{j},

so

c_{j + 1} = \frac{1}{a} c_{j} - \frac{b}{a} c_{j - 1} - \frac{1}{a} ε_{j}, j \geq 1 .

(146)

Define the vector

u_{j} : = (\begin{matrix} c_{j} \\ c_{j - 1} \end{matrix}), η_{j} : = (\begin{matrix} - ε_{j} / a \\ 0 \end{matrix}),

and the matrix

A : = (\begin{matrix} 1 / a & - b / a \\ 1 & 0 \end{matrix}) .

Then (146) is equivalent to

u_{j + 1} = A u_{j} + η_{j}, j \geq 1 .

(147)

The eigenvalues of A are exactly

r_{1} = 1

and

r_{2} = b / a

(the roots of

a r^{2} - r + b = 0

), with

| r_{2} | < 1

by (145). Let

P_{1}

and

P_{2}

denote the spectral projectors onto the eigenspaces corresponding to

r_{1}

and

r_{2}

, respectively. Then

P_{1} + P_{2} = I

and

A P_{1} = P_{1}, A P_{2} = r_{2} P_{2} .

Iterating (147),

u_{j} = A^{j - 1} u_{1} + \sum_{k = 1}^{j - 1} A^{j - 1 - k} η_{k} .

Decompose

u_{1} = P_{1} u_{1} + P_{2} u_{1}

and each

η_{k}

similarly. Using

A^{n} P_{1} = P_{1}

and

A^{n} P_{2} = r_{2}^{n} P_{2}

, we obtain

u_{j} = P_{1} u_{1} + r_{2}^{j - 1} P_{2} u_{1} + \sum_{k = 1}^{j - 1} (P_{1} η_{k} + r_{2}^{j - 1 - k} P_{2} η_{k}) .

Since

∥ η_{k} ∥ ≪ | ε_{k} |

and

\sum_{k \geq 0} | ε_{k} | ϑ^{k} < \infty

, in particular

\sum_{k} ∥ η_{k} ∥ < \infty

. Thus the series

\sum_{k \geq 1} P_{1} η_{k}

converges to some vector

w_{1}

, while the tail

\sum_{k = 1}^{j - 1} r_{2}^{j - 1 - k} P_{2} η_{k}

is bounded by

{sup}_{k} ∥ η_{k} ∥ \sum_{ℓ \geq 0} {| r_{2} |}^{ℓ}

and hence defines a sequence going to 0 as

j \to \infty

.

Therefore,

u_{j} = P_{1} u_{1} + w_{1} + r_{2}^{j - 1} P_{2} u_{1} + o (1) as j \to \infty .

Projecting onto the first coordinate,

c_{j} = C + O (r_{2}^{j}) + o (1),

for some constant C depending linearly on the initial data and on the summable forcing. In particular, there exist constants

C \in \mathbb{C}

and

ρ \in (0, 1)

such that

| c_{j} - C | ≪ ρ^{j} for all j,

(148)

i.e.

(c_{j})

converges exponentially fast to C.
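The behaviour described in Step 2 can be observed by iterating the forward form (146) directly. This is a minimal sketch under stated assumptions: the coefficients a = 0.6, b = 0.4, the initial data, and the forcing ε_j = 2^{-j} are arbitrary illustrative choices, with the forcing taken to be absolutely summable.

```python
from typing import Callable, List


def iterate_recursion(
    a: float,
    b: float,
    c0: float,
    c1: float,
    eps: Callable[[int], float],
    steps: int,
) -> List[float]:
    """Iterate c_{j+1} = (c_j - b c_{j-1} - eps_j) / a for j = 1, ..., steps - 1."""
    c = [c0, c1]
    for j in range(1, steps):
        c.append((c[j] - b * c[j - 1] - eps(j)) / a)
    return c


# Homogeneous case: c_j = C1 + C2 (b/a)^j with C1 = -2, C2 = 3 for this data.
homogeneous = iterate_recursion(0.6, 0.4, 1.0, 0.0, lambda j: 0.0, 80)

# Forced case with an absolutely summable perturbation (assumption).
forced = iterate_recursion(0.6, 0.4, 1.0, 0.0, lambda j: 0.5 ** j, 80)

print(homogeneous[-1], forced[-1])
```

In both runs the successive differences shrink geometrically, reflecting the decaying mode r_2 = b/a, while the limit itself depends on the initial data and on the forcing.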

3. From block averages to pointwise constancy. Set

C : = {lim}_{j \to \infty} c_{j}

and define

g : = h - C

. Then

g \in B_{tree, σ}

,

P g = g

, and its block averages

d_{j} : = c_{j} - C

satisfy the same recursion (142) with limit 0 and the same summability property for the perturbation. By (148),

d_{j} \to 0

exponentially.

We now show that

g \equiv 0

. For

n \in I_{j}

, the tree seminorm control of g implies that the oscillation of g within

I_{j}

is small at large scales: more precisely, from the definition of

{[g]}_{tree}

and the growth of

W_{α}

on

I_{j}

one obtains

sup_{m, n \in I_{j}} | g (m) - g (n) | ≪ 6^{- (1 - α) j} {[g]}_{tree} .

(Here we use that

W_{α} (m, n) ≍ 6^{(2 - α) j} / | m - n |

on

I_{j}

, so boundedness of

ϑ^{j} W_{α} (m, n) | g (m) - g (n) |

forces the oscillation to decay with j.) Since also

d_{j} \to 0

, we have for

n \in I_{j}

:

| g (n) | \leq | g (n) - d_{j} | + | d_{j} | ≪ 6^{- (1 - α) j} {[g]}_{tree} + ρ^{j},

which tends to 0 uniformly on each block as

j \to \infty

. Thus

g (n) \to 0

as

n \to \infty

.

Finally, using

P g = g

and the connectivity of the Collatz preimage tree, we propagate this decay back to all indices. If there were

n_{0}

with

g (n_{0}) \neq 0

, then iterating

P g = g

forward would express g on arbitrarily large integers in terms of

g (n_{0})

, contradicting

g (n) \to 0

as

n \to \infty

. Formally,

P g = g

implies g is an eigenfunction with eigenvalue 1; by the quasi-compactness result (Theorem 3) and the analysis above, the only such eigenfunctions in

B_{tree, σ}

are constant functions. Since

g (n) \to 0

, this constant must be 0, so

g \equiv 0

.

Hence

h \equiv C

is constant.

4. One-dimensionality of the eigenspace. If

h_{1}, h_{2} \in B_{tree, σ}

satisfy

P h_{i} = h_{i}

, then their difference

g = h_{1} - h_{2}

also satisfies

P g = g

. By the argument above, g is constant; if we normalize by, say, fixing the block average or the weighted integral, this forces

g \equiv 0

. Thus the eigenspace for

λ = 1

is one-dimensional.

This completes the proof. □

Extension to Isolated Divergent Trajectories

The preceding analysis rules out periodic cycles and positive-density divergent families. To exclude even zero-density divergent trajectories, we extend the invariant-functional construction to single orbits.

Proposition 9

(Zero-density divergent orbits also induce invariants). Let

x_{0} \in N

and

x_{k + 1} = T (x_{k})

be a forward Collatz orbit. Assume

x_{k}

visits infinitely many scales: there exists a strictly increasing sequence

{(j_{r})}_{r \geq 1}

and times

k_{r}

with

x_{k_{r}} \in I_{j_{r}}

. Define the level weights

w_{j} : = ϑ^{j} + 6^{- σ j}

and

φ_{N} : = \frac{1}{\sum_{r \leq N} w_{j_{r}}} \sum_{r \leq N} w_{j_{r}} δ_{x_{k_{r}}} \in B_{tree, σ}^{*} .

Then the Cesàro averages

Φ_{N} : = \frac{1}{N} \sum_{m = 0}^{N - 1} {(P^{*})}^{m} φ_{N}

form a bounded net in

B_{tree, σ}^{*}

with nonzero weak-* cluster points Φ satisfying

P^{*} Φ = Φ

. Consequently

ℓ (f) : = 〈 f, Φ 〉

is a nontrivial P-invariant functional.

Proof.

Each point mass

δ_{n}

belongs to

B_{tree, σ}^{*}

with

∥ δ_{n} ∥_{*} ≲ ϑ^{- j (n)} + 6^{σ j (n)}

when

n \in I_{j (n)}

. Thus the convex combination

φ_{N}

, with weights

w_{j_{r}} = ϑ^{j_{r}} + 6^{- σ j_{r}}

, has uniformly bounded

{∥ \cdot ∥}_{*}

norm: the contribution of level

j_{r}

is multiplied by

w_{j_{r}}

and then renormalized by

\sum_{r \leq N} w_{j_{r}}

. Hence

{sup}_{N} {∥ φ_{N} ∥}_{*} < \infty

.

Since

P^{*}

is power-bounded on

B_{tree, σ}^{*}

, the Cesàro averages

Φ_{N}

are uniformly bounded. By Banach–Alaoglu there exist weak-* cluster points, and any such

Φ

satisfies

P^{*} Φ = Φ

.

Nontriviality: because the orbit hits infinitely many scales, for each N there exists

r \leq N

with

x_{k_{r}} \in I_{j_{r}}

at a new level. Testing

Φ_{N}

against the indicator of a union of those visited singleton points shows

〈 1, Φ_{N} 〉 \geq c > 0

uniformly along a subsequence (the renormalizer

\sum_{r \leq N} w_{j_{r}}

grows in step with the added weights), hence any weak-* limit

Φ

is nonzero. □

Together with the quasi-compactness and spectral-gap results, this ensures that every possible non-terminating configuration would produce a nonzero invariant functional in

B_{tree, σ}^{*}

, contradicting the established gap. Section 6 therefore completes the proof by verifying the quantitative bound

λ_{odd} < 1

.

5.5. Explicit Lasota–Yorke Constants

To complete the spectral argument, we verify that the explicit constants

(α, ϑ) = (\frac{1}{2}, \frac{1}{5})

used in Section 6 indeed yield

λ_{odd} < 1

.

Recall the odd-branch distortion constant at level shift

j \mapsto j - 1

:

λ_{odd} (α, ϑ) \leq \frac{C_{α}}{\sqrt{6}} ϑ, C_{α} : = sup_{\begin{matrix} u > v > 0 \\ u \equiv v \equiv 4 (6) \end{matrix}} \frac{W_{α} (u, v)}{W_{α} (u^{'}, v^{'})},

(149)

where

(u^{'}, v^{'}) = (\frac{u - 1}{3}, \frac{v - 1}{3})

are the odd-preimages. At

α = \frac{1}{2}

, Lemma 12 gives

C_{1 / 2} = \frac{16}{3^{3 / 2}} < 3.1 .

Therefore

λ_{odd} (\frac{1}{2}, \frac{1}{5}) \leq \frac{16}{3^{3 / 2} \sqrt{6}} \cdot \frac{1}{5} = \frac{16}{3^{2} \sqrt{2}} \cdot \frac{1}{5} \approx 0.25 < 1 .

Hence

λ_{odd} < 1

in this parameter regime.

Next we verify that the block-recursion coefficients

a, b

obtained from preimage ratios satisfy the bounds implied by the spectral condition. As established in Lemma 16,

a = lim_{j \to \infty} a_{j} = \frac{6}{7}, b = lim_{j \to \infty} b_{j} = \frac{1}{7}, a + b = 1,

whence

\sqrt{a b} = \frac{\sqrt{6}}{7} \approx 0.35 < 1 .

This quantitative consistency between the analytic Lasota–Yorke contraction and the arithmetic preimage densities closes the argument: the invariant density is constant, the radius of the homogeneous two-sided recursion is

< 1

, and the backward operator P has a genuine spectral gap on

B_{tree, σ}

.
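The two numerical values quoted above can be reproduced with a few lines of arithmetic. The sketch below simply evaluates the stated expressions; the variable names are illustrative, and the inputs C_{1/2} = 16/3^{3/2}, ϑ = 1/5, a = 6/7, b = 1/7 are taken from the text as given.

```python
import math

# Odd-branch bound: C_{1/2} / sqrt(6) * theta with the values quoted in the text.
C_half = 16 / 3 ** 1.5
theta = 1 / 5
lam_odd_bound = C_half / math.sqrt(6) * theta

# Geometric mean of the limiting block-recursion coefficients.
a, b = 6 / 7, 1 / 7
geo_mean = math.sqrt(a * b)

print(f"C_half        = {C_half:.4f}")
print(f"lam_odd_bound = {lam_odd_bound:.4f}")
print(f"sqrt(a*b)     = {geo_mean:.4f}")
```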

Theorem 6

(Absence of peripheral spectrum on

B_{tree, σ}

). Let

0 < α < 1

,

0 < ϑ < 1

, and

σ > 1

. Let P be the backward Collatz transfer operator acting on

B_{tree, σ}

as in Section 3 and Section 4.4. Assume:

P satisfies the Lasota–Yorke inequality of Proposition 2 on $B_{tree, σ}$ , and the embedding $B_{tree, σ} ↪ ℓ_{σ}^{1}$ is compact, so that P is quasi-compact on $B_{tree, σ}$ with essential spectral radius $ρ_{ess} (P) < 1$ .
For every eigenfunction $h \in B_{tree, σ}$ with $P h = λ h$ and $| λ | = 1$ , the associated block averages $c_{j}$ satisfy the effective perturbed recursion of Proposition 7: there exist $a, b > 0$ (independent of h) and a sequence $(ε_{j})$ with $\sum_{j \geq 0} | ε_{j} | ϑ^{j} < \infty$ such that

$c_{j} = a c_{j + 1} + b c_{j - 1} + ε_{j}, j \geq 1 .$

(150)

Moreover the constants $a, b$ are such that the corresponding homogeneous recursion has spectral radius strictly less than 1, i.e. every solution of $c_{j} = a c_{j + 1} + b c_{j - 1}$ which is subexponential in j must converge to 0. (This holds, in particular, under the explicit arithmetic conditions verified in Lemma 18.)

Then P has no nontrivial eigenvalues on the unit circle: if

P h = λ h

with

| λ | = 1

and

h \in B_{tree, σ}

, then

h \equiv 0

. In particular,

σ (P) \cap {z \in C : | z | = 1} = ⌀, ρ (P) < 1 .

(151)

Proof.

Let

h \in B_{tree, σ}

satisfy

P h = λ h

with

| λ | = 1

, and let

c_{j}

be its block averages satisfying (150).

Step 1: Asymptotics of the block averages. Ignoring the perturbation, the homogeneous recursion

c_{j} = a c_{j + 1} + b c_{j - 1}

is a second-order linear recurrence. As in Proposition 7, one rewrites it as a first-order system

u_{j + 1} = A u_{j}, u_{j} : = (\begin{matrix} c_{j} \\ c_{j - 1} \end{matrix}),

for a fixed

2 \times 2

matrix A with eigenvalues strictly inside the unit disk under the stated condition on

a, b

. (Equivalently, the homogeneous recursion has no nontrivial subexponentially bounded solutions except

c_{j} \equiv 0

.)

Including the perturbation

ε_{j}

,

u_{j + 1} = A u_{j} + η_{j}, η_{j} : = (\begin{matrix} - ε_{j} / a \\ 0 \end{matrix}) .

Iterating,

u_{j} = A^{j - 1} u_{1} + \sum_{k = 1}^{j - 1} A^{j - 1 - k} η_{k} .

Since

ρ (A) < 1

and

\sum_{k} ∥ η_{k} ∥ < \infty

(by weighted summability of

ε_{j}

), the standard stability estimate gives

lim_{j \to \infty} u_{j} = 0,

hence

lim_{j \to \infty} c_{j} = 0 .

(152)

Step 2: Pointwise decay of $h (n)$ . Because

h \in B_{tree, σ}

, the tree seminorm controls oscillations on each block: for every j and

m, n \in I_{j}

,

W_{α} (m, n) | h (m) - h (n) | \leq ϑ^{- j} {[h]}_{tree} .

On

I_{j}

one has

W_{α} (m, n) ≍ 6^{(2 - α) j} / | m - n |

, so this implies that the oscillation of h within

I_{j}

is

O (6^{- (1 - α) j}) {[h]}_{tree}

. In particular,

sup_{n \in I_{j}} | h (n) - c_{j} | ≪ 6^{- (1 - α) j} {[h]}_{tree} .

Together with (152) this gives

lim_{j \to \infty} sup_{n \in I_{j}} | h (n) | = 0,

so

h (n) \to 0

as

n \to \infty

.

Step 3: Use the $ℓ_{σ}^{1}$ growth bound. Since

h \in B_{tree, σ} \subset ℓ_{σ}^{1}

and

P h = λ h

with

| λ | = 1

, for every

k \geq 1

{∥ h ∥}_{σ} = {∥ λ^{- k} P^{k} h ∥}_{σ} \leq {∥ P^{k} h ∥}_{σ} .

From the corrected weighted

ℓ_{σ}^{1}

estimate (see Lemma 11) we have for all

k \geq 1

{∥ P^{k} h ∥}_{σ} \leq {(2^{σ - 1} + 3^{- σ})}^{k} {∥ h ∥}_{σ} .

(153)

Because

σ > 1

, the factor

2^{σ - 1} + 3^{- σ} < 1

, so (153) gives

{∥ h ∥}_{σ} \leq {(2^{σ - 1} + 3^{- σ})}^{k} {∥ h ∥}_{σ} for all k \geq 1 .

If

h \neq 0

, dividing by

{∥ h ∥}_{σ}

and letting

k \to \infty

yields

1 \leq 0

, a contradiction. Hence

h \equiv 0

.

Remark 12 (Role of the parameter $σ > 1$ ). On the Dirichlet side of the theory, absolute convergence already holds for every

σ > 0

. The restriction

σ > 1

is only used at this point, in combination with (19), to ensure that

2^{σ - 1} + 3^{- σ} < 1 .

This strict inequality yields the genuine contraction estimate

{∥ P^{k} h ∥}_{σ} \leq {(2^{σ - 1} + 3^{- σ})}^{k} {∥ h ∥}_{σ} ⟶ 0

for any eigenfunction h with

| λ | = 1

and

λ \neq 1

, and is the key input in the exclusion of the peripheral spectrum on

| λ | = 1

. No other part of the argument relies on the numerical value

σ > 1

.

Step 4: Exclusion of peripheral spectrum. Since P is quasi-compact on

B_{tree, σ}

with

ρ_{ess} (P) < 1

(assumption (1)), any spectral value of P on

| z | = 1

would have to be an eigenvalue. We have shown no such eigenvalue exists, hence

σ (P) \cap {z \in C : | z | = 1} = ⌀,

and therefore

ρ (P) < 1

, proving (151). □

Lemma 21

(Tightness of empirical averages in

B_{tree, σ}^{*}

). Let

S \subset N

have positive upper density and set

μ_{N} = \frac{1_{S \cap [1, N]}}{| S \cap [1, N] |}

(viewed as a finitely supported probability on

N

). For

K \geq 1

define

η_{N, K} : = \frac{1}{K} \sum_{k = 0}^{K - 1} P^{k} μ_{N} \in B_{tree, σ}^{*} .

Then there is

C_{σ} > 0

independent of

N, K

such that

∥ η_{N, K} ∥_{B_{tree, σ}^{*}} \leq C_{σ}

. Consequently, for any sequence

(N_{r}, K_{r})

with

N_{r}, K_{r} \to \infty

, the family

(η_{N_{r}, K_{r}})

is weak* relatively compact in

B_{tree, σ}^{*}

.

Proof.

Let

{∥ \cdot ∥}_{tree, σ}

denote the full norm on

B_{tree, σ}

(e.g. a two–norm of the form

{∥ f ∥}_{tree, σ} : = {[f]}_{tree} + A {∥ f ∥}_{1}

for some fixed

A > 0

). Fix

f \in B_{tree, σ}

with

{∥ f ∥}_{tree, σ} \leq 1

. We claim that there is a constant

C > 0

, depending only on the space

B_{tree, σ}

, such that for every

m \in N

,

| f (m) | \leq C 6^{- σ j (m)} .

(154)

Indeed, by the definition of the strong seminorm on the tree and the block averaging inequality (equivalently, the local bounded distortion underlying the Lasota–Yorke estimate), there exists

C_{0} > 0

with

| f (m) | \leq C_{0} {[f]}_{tree} \cdot 6^{- j (m)} \leq C_{0} {[f]}_{tree} \cdot 6^{- σ j (m)} .

If the full norm includes the

ℓ^{1}

part, use

6^{- j} \leq 6^{- σ j}

and

{∥ f ∥}_{1} \leq A^{- 1} {∥ f ∥}_{tree, σ}

to absorb it into the same bound, which yields (154) with

C : = C_{0}

.

Let

j (m)

denote the scale index of m, i.e.

m \in I_{j (m)} = [6^{j (m)}, 2 \cdot 6^{j (m)})

. By the coarse forward envelope (Lemma 2.2), there exist constants

c > 0

and

C_{1} \geq 0

such that, for every

n \in N

and every

k \geq 0

,

j (T^{k} n) \geq c k - C_{1} .

(155)

Combining (154) and (155),

| f (T^{k} n) | \leq C 6^{- σ j (T^{k} n)} \leq C 6^{- σ (c k - C_{1})} = C^{'} ρ^{k}, ρ : = 6^{- σ c} \in (0, 1),

with

C^{'} : = C \cdot 6^{σ C_{1}}

independent of n and k.

Now evaluate

η_{N, K}

on f:

〈η_{N, K}, f〉 = \frac{1}{K} \sum_{k = 0}^{K - 1} 〈P^{k} μ_{N}, f〉 = \frac{1}{K} \sum_{k = 0}^{K - 1} \frac{1}{| S \cap [1, N] |} \sum_{n \in S \cap [1, N]} f (T^{k} n) .

Taking absolute values and using the uniform bound above,

| 〈η_{N, K}, f〉 | \leq \frac{1}{K} \sum_{k = 0}^{K - 1} C^{'} ρ^{k} \leq \frac{C^{'}}{K} \cdot \frac{1 - ρ^{K}}{1 - ρ} \leq \frac{C^{'}}{1 - ρ} = : C_{σ},

where

C_{σ}

depends only on

(σ, c, C_{1})

and the tree-space constants, and is independent of N and K. Since this holds for every f with

{∥ f ∥}_{tree, σ} \leq 1

, we obtain

∥ η_{N, K} ∥_{B_{tree, σ}^{*}} \leq C_{σ} for all N, K \geq 1 .

Finally, the unit ball of

B_{tree, σ}^{*}

is weak* compact (Banach–Alaoglu), so any family with a uniform dual-norm bound is weak* relatively compact. Hence for any sequence

(N_{r}, K_{r})

with

N_{r}, K_{r} \to \infty

, the net

(η_{N_{r}, K_{r}})

admits weak* limit points in

B_{tree, σ}^{*}

, as claimed. □

Theorem 7

(Spectral criterion for absence of divergent mass). Let P act on

B_{tree, σ}

and suppose:

P is quasi-compact on $B_{tree, σ}$ with $ρ_{ess} (P) < 1$ ;
P has no eigenvalues on the unit circle except possibly $λ = 1$ ;
The eigenspace for $λ = 1$ is one-dimensional and generated by a strictly positive $h \in B_{tree, σ}$ with $P h = h$ .

Then there is no nontrivial P-invariant probability density in

B_{tree, σ}

supported on non-terminating orbits or nontrivial cycles, and there is no positive-density family of divergent Collatz trajectories.

Proof.

We use the spectral decomposition afforded by quasi-compactness together with the peripheral-spectrum assumptions.

Step 1: Spectral decomposition and convergence of iterates.

By (1), there exists a bounded finite-rank spectral projector

Π : B_{tree, σ} \to B_{tree, σ}

associated with the peripheral spectrum of P, and a bounded operator N with

ρ (N) < 1

such that

P = Π P Π + N, Π N = N Π = 0, ∥ N^{k} ∥ = O (ρ^{k}) for some ρ \in (0, 1) .

(156)

By (2)–(3), the peripheral spectrum consists only of the simple eigenvalue

λ = 1

with eigenvector

1

. Hence

Π

is the rank-one projection onto

span {1}

: there exist

h \in B_{tree, σ}

and a continuous linear functional

ϕ \in B_{tree, σ}^{*}

such that

P h = h, ϕ \circ P = ϕ, ϕ (h) = 1,

and the rank-one spectral projector at

λ = 1

is

Π f = ϕ (f) h for all f \in B_{tree, σ} .

(157)

Consequently,

P^{k} f = Π f + N^{k} f = φ (f) 1 + N^{k} f ⟶ φ (f) 1 in B_{tree, σ} as k \to \infty .

(158)

Step 2: Nonexistence of nontrivial invariant probability densities in

B_{tree, σ}

.

Suppose

h \in B_{tree, σ}

is a P-invariant probability density supported on non-terminating orbits or nontrivial cycles; that is,

h \geq 0

,

\sum_{n \geq 1} h (n) = 1

, and

P h = h

. Then h is a fixed point:

h = P^{k} h for all k \geq 0 .

Applying (158) with

f = h

gives

h = φ (h) 1 + N^{k} h ⟶ φ (h) 1 in B_{tree, σ} .

Hence

h = φ (h) 1

. By assumption (3),

1

spans the eigenspace at

λ = 1

, so h must be a constant function.

On the other hand, h is a probability density for the counting measure, i.e.

\sum_{n \geq 1} h (n) = 1

. The only constant function in

B_{tree, σ}

is

1

up to a scalar, and

\sum_{n \geq 1} 1 (n) = \infty

, so no nonzero constant function can have finite total mass. Therefore h cannot be a constant unless

h \equiv 0

, contradicting

\sum_{n \geq 1} h (n) = 1

. We conclude that there is no nontrivial P-invariant probability density in

B_{tree, σ}

.

Step 3: Exclusion of nontrivial cycles.

If there were a nontrivial q-cycle for the forward Collatz map, the associated transfer operator would admit a qth root of unity

λ = e^{2 π i p / q}

on the unit circle (arising from the cycle’s invariant density supported on that orbit). This would furnish a

| λ | = 1

eigenvalue distinct from 1 for P acting on

B_{tree, σ}

, contradicting (2). Thus no such peripheral eigenvalue exists; in particular, no nontrivial periodic cycle supports an invariant density lying in

B_{tree, σ}

.

Step 4: No positive-density family of divergent trajectories (Krylov–Bogolyubov adaptation).

Lemma 22 (Vanishing of the PF functional on nonterminating mass). Let

f^{*} \geq 0

be supported on the nonterminating set

N = {n : T^{k} n \not\to 1}

. Then

ϕ (f^{*}) = 0

.

Proof. For

n \in N

the forward orbit leaves every finite set and therefore

h (n) \to 0

by Proposition 6. Since

ϕ

is the unique invariant functional with

ϕ (g) = \sum_{n} h (n) g (n)

for

g \geq 0

, dominated convergence gives

ϕ (f^{*}) = \sum_{n \in N} h (n) f^{*} (n) \leq {∥ f^{*} ∥}_{\infty} \sum_{n \in N} h (n) = 0 .

□

Assume, toward a contradiction, that there exists a set

S \subset N

with positive upper natural density

\bar{d} (S) > 0

such that every

n \in S

has a non-terminating (or nontrivially periodic) forward Collatz trajectory under T.

Let

δ_{n} \in B_{tree, σ}^{*}

denote point evaluation at n (continuous since

B_{tree, σ} ↪ ℓ^{1}

). For

N \geq 1

define the normalized counting functional

ν_{N} : = \frac{1}{| S \cap [1, N] |} \sum_{n \in S \cap [1, N]} δ_{n} \in B_{tree, σ}^{*} .

Each

ν_{N}

is positive with

ν_{N} (1) = 1

.

Dual formulation and Cesàro averages. Let

T : N \to N

be the forward Collatz map and recall that P is its dual (Perron–Frobenius) operator:

(P f) (m) = \sum_{n : T (n) = m} \frac{f (n)}{w (n)}, ψ (P f) = (T_{*} ψ) (f),

(159)

for

f \in B_{tree, σ}

and

ψ \in B_{tree, σ}^{*}

. Form the Krylov–Bogolyubov Cesàro averages on the dual side,

η_{N, K} : = \frac{1}{K} \sum_{k = 0}^{K - 1} T_{*}^{k} ν_{N} \in B_{tree, σ}^{*}, K \geq 1 .

(160)

Each

η_{N, K}

is positive and normalized,

η_{N, K} (1) = 1

.

Support property. For every

n \in S

, the forward orbit

{T^{k} (n)}_{k \geq 0}

avoids the 1–2 cycle, so

supp (T_{*}^{k} ν_{N}) \subset N

for all k, where

N

denotes the set of integers with non-terminating Collatz trajectories. Hence

supp (η_{N, K}) \subset N

for all

N, K

.

Uniform dual-norm bound (tightness). By Lemma 21, there exists

C_{σ} > 0

independent of

N, K

such that

∥ η_{N, K} ∥_{B_{tree, σ}^{*}} \leq C_{σ}

. Therefore the family

{η_{N, K}}_{N, K}

is weak* relatively compact.

Invariant weak* limits. Fix N and take a weak* limit point

ψ_{N}

of

{η_{N, K}}_{K}

as

K \to \infty

. Since

T_{*}

is weak* continuous and

∥ T_{*} η_{N, K} - η_{N, K} ∥ = ∥ \frac{1}{K} (T_{*}^{K} ν_{N} - ν_{N}) ∥ \leq \frac{2}{K} ∥ ν_{N} ∥ \underset{K \to \infty}{\to} 0,

each such

ψ_{N}

satisfies

T_{*} ψ_{N} = ψ_{N}

, i.e.

ψ_{N} (P f) = ψ_{N} (f) \forall f \in B_{tree, σ} .

(161)

Each

ψ_{N}

is positive, normalized, and supported in

N

.
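The telescoping estimate behind the invariance of the limits can be illustrated on finitely supported measures. The sketch below is only an analogy under explicit assumptions: it uses the ℓ^1 (total variation) distance in place of the dual norm on B_{tree, σ}^{*}, and it starts from the uniform measure on {1, ..., 20} as a stand-in for ν_N, since no set S of non-terminating integers is known. All function names are hypothetical.

```python
from collections import defaultdict
from fractions import Fraction
from typing import Dict

Measure = Dict[int, Fraction]


def collatz_step(n: int) -> int:
    """Forward Collatz map T."""
    return n // 2 if n % 2 == 0 else 3 * n + 1


def pushforward(mu: Measure) -> Measure:
    """T_* mu: move the mass at n to T(n)."""
    out: Measure = defaultdict(Fraction)
    for n, w in mu.items():
        out[collatz_step(n)] += w
    return dict(out)


def cesaro(nu: Measure, K: int) -> Measure:
    """eta = (1/K) * sum_{k=0}^{K-1} T_*^k nu, computed in exact arithmetic."""
    avg: Measure = defaultdict(Fraction)
    current = dict(nu)
    for _ in range(K):
        for n, w in current.items():
            avg[n] += w / K
        current = pushforward(current)
    return dict(avg)


def l1_distance(mu: Measure, nu: Measure) -> Fraction:
    """Sum over all points of |mu(n) - nu(n)|."""
    return sum(abs(mu.get(k, 0) - nu.get(k, 0)) for k in set(mu) | set(nu))


K = 50
nu = {n: Fraction(1, 20) for n in range(1, 21)}
eta = cesaro(nu, K)
gap = l1_distance(pushforward(eta), eta)
print(gap, Fraction(2, K))
```

Since T_* η - η equals (T_*^K ν - ν)/K, the printed gap is at most 2/K for any probability measure ν, which mirrors the displayed bound.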

Passage

N \to \infty

and nontriviality. Because

\bar{d} (S) > 0

, the

ν_{N}

are nondegenerate, and by Banach–Alaoglu the sequence

{ψ_{N}}

has weak* limit points. Let

ψ

be any such limit. Then

ψ

is positive, normalized, T-invariant (and hence P-invariant by (161)), supported in

N

, and

ψ (1) = 1

, so

ψ \neq 0

.

Contradiction with the spectral-gap structure. By Theorem 7 and Proposition 13, the P-invariant functionals form a one-dimensional space spanned by the positive eigenfunctional

φ

of the rank-one projection

Π f = φ (f) h

, where h is the unique invariant density with

P h = h

and

φ (h) = 1

. Thus

ψ = c φ

with

c = ψ (1) = 1

, so

ψ = φ

.

Choose

f_{*} \in B_{tree, σ}

nonnegative, supported in

N

, and not identically zero. By the support property,

ψ (f_{*}) > 0

. Yet

φ

, being strictly positive on the whole positive cone, assigns positive mass also to the complement of

N

, where

f_{*} = 0

; hence

φ (f_{*}) < φ (1) = 1

. Therefore

ψ (f_{*}) \neq φ (f_{*})

, contradicting

ψ = φ

.

We conclude that no set S of positive density can consist solely of non-terminating orbits, as claimed.

Orbit-Generated Invariant Functionals and Their Support

Lemma 23 (Admissible orbit-generated functionals; support property). Let

O = {n_{t}}_{t \geq 0}

be a (forward) Collatz orbit and suppose

B_{tree, σ} ↪ ℓ^{1} (N)

continuously. Then each point-evaluation

δ_{n} : f \mapsto f (n)

belongs to

B_{tree, σ}^{*}

with

∥ δ_{n} ∥ \leq C_{emb}

, for some embedding constant

C_{emb} > 0

. Define the convex Cesàro averages on the orbit

μ_{K} : = \frac{1}{K} \sum_{t = 0}^{K - 1} δ_{n_{t}} \in B_{tree, σ}^{*} (K \geq 1) .

Any weak* limit point ψ of the net

{(μ_{K})}_{K \geq 1}

in

B_{tree, σ}^{*}

is called an admissible orbit-generated functional for

O

. Such ψ satisfies:

ψ is positive and normalized: $ψ (f) \geq 0$ for $f \geq 0$ and $ψ (1) = 1$ .
(Support property) If $f \in B_{tree, σ}$ vanishes on the orbit $O$ , then $ψ (f) = 0$ .

Moreover, if in addition the family

(μ_{K})

is asymptotically

P^{*}

-invariant in the sense that

lim_{K \to \infty} {∥ P^{*} μ_{K} - μ_{K} ∥}_{B_{tree, σ}^{*}} = 0,

(162)

then every weak* limit ψ satisfies the invariance relation

ψ \circ P = ψ on B_{tree, σ} .

(163)

Proof. The continuous embedding

B_{tree, σ} ↪ ℓ^{1}

implies

| f (n) | \leq {∥ f ∥}_{ℓ^{1}} \leq C_{emb} {∥ f ∥}_{B_{tree, σ}}

, hence each

δ_{n}

is continuous on

B_{tree, σ}

, and thus

μ_{K} \in B_{tree, σ}^{*}

for all K. Positivity and normalization of any weak* limit

ψ

follow from the same properties of

μ_{K}

and passage to the weak* limit, which preserves inequalities and the value at each fixed test function.

For the support property, let

f \in B_{tree, σ}

satisfy

f (n_{t}) = 0

for all

t \geq 0

. Then

μ_{K} (f) = \frac{1}{K} \sum_{t = 0}^{K - 1} f (n_{t}) = 0

for every K. Taking weak* limits along any subnet

μ_{K_{j}} \overset{w^{*}}{⟶} ψ

yields

ψ (f) = {lim}_{j} μ_{K_{j}} (f) = 0

.

For (163), write for any

f \in B_{tree, σ}

:

ψ (P f) = lim_{j} μ_{K_{j}} (P f) = lim_{j} (P^{*} μ_{K_{j}}) (f) = lim_{j} (μ_{K_{j}} (f) + (P^{*} μ_{K_{j}} - μ_{K_{j}}) (f)) = ψ (f),

where we used weak* convergence of

μ_{K_{j}}

to

ψ

and the asymptotic invariance (162) to force the error term to 0. □
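The Cesàro averages of Lemma 23 can be evaluated exactly on a concrete orbit. The following is a minimal sketch, not part of the original argument: the starting point 7 and the two test functions are illustrative choices, and every explicitly computed Collatz orbit terminates, so the orbit here only stands in for the hypothetical infinite one. It shows the support property numerically: a function that vanishes on the orbit has zero average for every K.

```python
from fractions import Fraction


def collatz(n: int) -> int:
    """Collatz map T from equation (1)."""
    return n // 2 if n % 2 == 0 else 3 * n + 1


def orbit(n0: int, length: int) -> list:
    """First `length` points n_0, n_1, ... of the forward orbit of n0."""
    points = [n0]
    while len(points) < length:
        points.append(collatz(points[-1]))
    return points


def cesaro_average(f, n0: int, K: int) -> Fraction:
    """mu_K(f) = (1/K) * sum_{t=0}^{K-1} f(n_t), computed exactly."""
    return sum((Fraction(f(n)) for n in orbit(n0, K)), Fraction(0)) / K


def multiple_of_three(n: int) -> int:
    """Indicator of 3 | n (illustrative); it vanishes on the whole orbit of 7."""
    return 1 if n % 3 == 0 else 0


def on_cycle(n: int) -> int:
    """Indicator of the set {1, 2}."""
    return 1 if n in (1, 2) else 0


if __name__ == "__main__":
    for K in (10, 100, 1000):
        zero_avg = cesaro_average(multiple_of_three, 7, K)
        cycle_avg = cesaro_average(on_cycle, 7, K)
        print(K, zero_avg, float(cycle_avg))
```

The indicator of multiples of 3 vanishes on the orbit because neither halving nor the map n ↦ 3n + 1 can produce a multiple of 3 from a non-multiple.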

Lemma 24 (Uniform dual-norm control for $P^{*}$ –Cesàro averages) Fix

n_{0} \in N

and define

Ψ_{N} : = \frac{1}{N} \sum_{k = 0}^{N - 1} {(P^{*})}^{k} δ_{n_{0}} \in B_{tree, σ}^{*} .

There exists

C_{σ} > 0

independent of N such that

∥ Ψ_{N} ∥_{B_{tree, σ}^{*}} \leq C_{σ}

for all

N \geq 1

. Consequently the sequence

{(Ψ_{N})}_{N \geq 1}

is weak* relatively compact in

B_{tree, σ}^{*}

.

Proof. For

f \in B_{tree, σ}

,

Ψ_{N} (f) = \frac{1}{N} \sum_{k = 0}^{N - 1} ({(P^{*})}^{k} δ_{n_{0}}) (f) = \frac{1}{N} \sum_{k = 0}^{N - 1} δ_{n_{0}} (P^{k} f) = \frac{1}{N} \sum_{k = 0}^{N - 1} (P^{k} f) (n_{0}) .

By the Lasota–Yorke inequality on

B_{tree, σ}

(Prop. 2), there exist constants

0 < λ_{LY} < 1

and

C_{LY} > 0

such that

{[P^{k} f]}_{tree} \leq λ_{LY}^{k} {[f]}_{tree} + C_{LY} {∥ f ∥}_{1} (k \geq 0) .

The point-evaluation functional is continuous on

B_{tree, σ}

(by the assumed embedding into

ℓ^{1}

and the definition of the tree norm), so there exists

C_{ev} > 0

with

| g (n_{0}) | \leq C_{ev} ({[g]}_{tree} + {∥ g ∥}_{1})

for all g. Apply this to

g = P^{k} f

and sum the geometric series:

| Ψ_{N} (f) | \leq \frac{1}{N} \sum_{k = 0}^{N - 1} C_{ev} (λ_{LY}^{k} {[f]}_{tree} + C_{LY} {∥ f ∥}_{1}) \leq C_{σ} ({[f]}_{tree} + {∥ f ∥}_{1}) \leq C_{σ} {∥ f ∥}_{B_{tree, σ}},

with

C_{σ}

independent of N. Hence

∥ Ψ_{N} ∥_{B_{tree, σ}^{*}} \leq C_{σ}

and weak* relative compactness follows from Banach–Alaoglu. □

Proposition 10 (Weak* limits of $P^{*}$ –Cesàro averages are invariant) With

Ψ_{N}

as in Lemma 24, every weak* cluster point Ψ of

{(Ψ_{N})}_{N \geq 1}

satisfies

P^{*} Ψ = Ψ .

Proof. Let

Ψ_{N_{j}} \overset{*}{⇀} Ψ

along a subsequence. For any

f \in B_{tree, σ}

,

Ψ_{N_{j}} (P f - f) = \frac{1}{N_{j}} \sum_{k = 0}^{N_{j} - 1} δ_{n_{0}} (P^{k} (P f - f)) = \frac{1}{N_{j}} ((P^{N_{j}} f) (n_{0}) - f (n_{0})) .

Point evaluations are continuous on

B_{tree, σ}

and

{(P^{k})}_{k \geq 0}

is uniformly bounded on

B_{tree, σ}

, so the right-hand side tends to 0 as

j \to \infty

. Hence

Ψ_{N_{j}} (P f - f) \to 0

. Passing to the weak* limit,

Ψ (P f - f) = 0 for all f \in B_{tree, σ},

so

P^{*} Ψ = Ψ

. □
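The telescoping identity used in the proof of Proposition 10 can be checked exactly on finitely supported functions. The sketch below is an illustration under stated assumptions: functions are stored as dictionaries, the operator is implemented through the formula (P f)(n) = Σ_{T(m) = n} f(m)/m, and the test function and base point n_0 = 4 are hypothetical choices.

```python
from fractions import Fraction


def collatz(n: int) -> int:
    """Collatz map T from equation (1)."""
    return n // 2 if n % 2 == 0 else 3 * n + 1


def apply_P(f: dict) -> dict:
    """(P f)(n) = sum over m with T(m) = n of f(m)/m, for finitely supported f."""
    out = {}
    for m, value in f.items():
        n = collatz(m)
        out[n] = out.get(n, Fraction(0)) + value / m
    return out


def iterate_P(f: dict, k: int) -> dict:
    """Return P^k f."""
    for _ in range(k):
        f = apply_P(f)
    return f


def subtract(a: dict, b: dict) -> dict:
    """Pointwise difference a - b of two finitely supported functions."""
    keys = set(a) | set(b)
    return {k: a.get(k, Fraction(0)) - b.get(k, Fraction(0)) for k in keys}


def cesaro_functional(g: dict, n0: int, N: int) -> Fraction:
    """Psi_N(g) = (1/N) * sum_{k=0}^{N-1} (P^k g)(n0)."""
    total = Fraction(0)
    current = dict(g)
    for _ in range(N):
        total += current.get(n0, Fraction(0))
        current = apply_P(current)
    return total / N


# Hypothetical test data: f = 1 on {1, ..., 39}, base point n0 = 4.
f = {m: Fraction(1) for m in range(1, 40)}
n0, N = 4, 12
lhs = cesaro_functional(subtract(apply_P(f), f), n0, N)
rhs = (iterate_P(f, N).get(n0, Fraction(0)) - f.get(n0, Fraction(0))) / N

if __name__ == "__main__":
    print(lhs == rhs, lhs)
```

Exact rational arithmetic is used so that the two sides agree identically rather than up to rounding.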

Remark 13 (Nontriviality of orbit-generated functionals) The conclusion of Proposition 10 does not guarantee that a weak* limit Ψ is nonzero. In particular, for a sufficiently sparse or rapidly diverging orbit, the Cesàro averages

Ψ_{N}

may converge to 0 in

B_{tree, σ}^{*}

. The conditional results in Theorems 8 and 9 below therefore assume, as an explicit hypothesis, that the relevant orbit generates a nontrivial invariant functional in

B_{tree, σ}^{*}

.

Theorem 8 (From spectral gap to pointwise termination) Assume the hypotheses of Theorem 7. If, in addition, every infinite forward Collatz orbit generates a nontrivial invariant functional in

B_{tree, σ}^{*}

, then no such infinite orbit can exist. Consequently, every Collatz trajectory enters the 1–2 cycle.

Proof. Under the hypotheses of Theorem 7, P is quasi-compact on

B_{tree, σ}

with

ρ_{ess} (P) < 1

, has no eigenvalues on the unit circle except possibly

λ = 1

, and the

λ = 1

eigenspace is

span {h}

, where

h > 0

is the invariant density from (32). Hence there exists a bounded rank-one spectral projector

Π

and a bounded operator N with

ρ (N) < 1

such that

P = Π + N, Π N = N Π = 0, Π f = ϕ (f) h,

(164)

where

ϕ \in B_{tree, σ}^{*}

is the positive invariant functional normalized by

ϕ (h) = 1

. In particular,

P^{k} f = ϕ (f) h + N^{k} f ⟶ ϕ (f) h in B_{tree, σ} as k \to \infty .

(165)

By Lemma 24 and Proposition 10, any infinite forward Collatz orbit yields a weak* cluster point

Ψ \in B_{tree, σ}^{*}

with

P^{*} Ψ = Ψ

. By the additional hypothesis of the theorem we may assume that

Ψ

is nontrivial. We first show that any such

Ψ

must be a scalar multiple of

ϕ

. Indeed, for any

f \in B_{tree, σ}

and any

k \geq 1

,

Ψ (f) = Ψ (P^{k} f) = Ψ (Π f + N^{k} f) = Ψ (Π f) + Ψ (N^{k} f) .

Since

ρ (N) < 1

, there exist

C > 0

and

0 < r < 1

with

∥ N^{k} ∥ \leq C r^{k}

. Boundedness of

Ψ

gives

| Ψ (N^{k} f) | \leq ∥ Ψ ∥ ∥ N^{k} ∥ ∥ f ∥ \leq ∥ Ψ ∥ C r^{k} ∥ f ∥ ⟶ 0 as k \to \infty .

Using (164), we therefore obtain

Ψ (f) = lim_{k \to \infty} Ψ (P^{k} f) = Ψ (Π f) = Ψ (ϕ (f) h) = Ψ (h) ϕ (f)

(166)

for all

f \in B_{tree, σ}

. Thus

Ψ = c ϕ

with

c : = Ψ (h)

.

By Proposition 11, any infinite forward Collatz orbit yields a nontrivial

Ψ \in B_{tree, σ}^{*}

with

P^{*} Ψ = Ψ

and

Ψ (1) = 1

. Let us fix such a functional and denote it by

ψ

. We first show that any such

ψ

must be a scalar multiple of

φ

. Indeed, for any

f \in B_{tree, σ}

and any

k \geq 1

,

ψ (f) = ψ (P^{k} f) = ψ (Π f + N^{k} f) = ψ (Π f) + ψ (N^{k} f) .

Since

ρ (N) < 1

, there exist

C > 0

and

0 < r < 1

with

∥ N^{k} ∥ \leq C r^{k}

. Boundedness of

ψ

then yields

| ψ (N^{k} f) | \leq ∥ ψ ∥ ∥ N^{k} ∥ ∥ f ∥ \leq ∥ ψ ∥ C r^{k} ∥ f ∥ \underset{k \to \infty}{\to} 0 .

Hence

ψ (f) = lim_{k \to \infty} ψ (P^{k} f) = ψ (Π f) = ψ (φ (f) h) = φ (f) ψ (h) for all f \in B_{tree, σ} .

(167)

Thus

ψ = c φ

with

c : = ψ (h)

.

We now contradict this conclusion by constructing a test function

f_{*} \in B_{tree, σ}

for which

ψ (f_{*}) = 0

while

φ (f_{*}) > 0

. Let

O = {n_{t}}_{t \geq 0}

be the given infinite forward Collatz orbit. For each

j \geq 0

, let

I_{j} = [6^{j}, 2 \cdot 6^{j}) \cap N

be the standard block. The orbit intersects each

I_{j}

in at most finitely many points; write

E_{j} : = O \cap I_{j}

(possibly empty, always finite). Define

J_{j} : = I_{j} ∖ E_{j} and v_{j} : = θ^{2 j} with the same 0 < θ < 1 as in B_{tree, σ} .

Define

f_{*} : N \to [0, \infty)

by

f_{*} (n) = \{\begin{matrix} v_{j}, & n \in J_{j}, \\ 0, & n \in E_{j}, \end{matrix} for n \in I_{j} .

(168)

Because

| J_{j} | = | I_{j} | - | E_{j} | = 6^{j} - | E_{j} |

with

| E_{j} | < \infty

, we have

∥ f_{*} ∥_{1} = \sum_{j \geq 0} \sum_{n \in J_{j}} v_{j} = \sum_{j \geq 0} v_{j} | J_{j} | \leq \sum_{j \geq 0} θ^{2 j} 6^{j} = \sum_{j \geq 0} {(6 θ^{2})}^{j} < \infty,

since

θ

is chosen (and fixed in the construction of

B_{tree, σ}

) so that

6 θ^{2} < 1

. Moreover, by construction

f_{*}

is blockwise constant on

J_{j}

and vanishes on the finitely many points

E_{j}

, so the multiscale tree seminorm

{[\cdot]}_{tree}

is controlled by the exponentially decaying sequence

(v_{j})

, hence

{[f_{*}]}_{tree} < \infty

. Therefore

f_{*} \in B_{tree, σ}

.
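The construction of the test function in (168) and the geometric bound on its ℓ^1 norm can be reproduced on finitely many blocks. This is a sketch under explicit assumptions: θ = 1/5 is a hypothetical value satisfying 6θ² < 1, the function is set to 0 outside the blocks I_j (the text defines it only on the blocks), and the terminating orbit of 27 stands in for the hypothetical infinite orbit.

```python
from fractions import Fraction


def collatz(n: int) -> int:
    """Collatz map T from equation (1)."""
    return n // 2 if n % 2 == 0 else 3 * n + 1


def orbit_set(n0: int, steps: int) -> set:
    """Set of points visited by the first `steps` iterations starting at n0."""
    points, n = {n0}, n0
    for _ in range(steps):
        n = collatz(n)
        points.add(n)
    return points


def block_index(n: int):
    """Return j with n in I_j = [6^j, 2 * 6^j), or None if n lies in no block."""
    j = 0
    while 6 ** j <= n:
        if n < 2 * 6 ** j:
            return j
        j += 1
    return None


def f_star(n: int, orbit_points: set, theta: Fraction) -> Fraction:
    """theta^(2j) on J_j = I_j minus the orbit; 0 on the orbit and (by assumption) off the blocks."""
    j = block_index(n)
    if j is None or n in orbit_points:
        return Fraction(0)
    return theta ** (2 * j)


theta = Fraction(1, 5)            # hypothetical parameter with 6 * theta^2 < 1
orbit_points = orbit_set(27, 200)  # illustrative orbit only
num_blocks = 5

# Partial l^1 norm over the blocks I_0, ..., I_4.
partial_norm = sum(
    f_star(n, orbit_points, theta)
    for j in range(num_blocks)
    for n in range(6 ** j, 2 * 6 ** j)
)
geometric_bound = 1 / (1 - 6 * theta ** 2)  # sum of (6 * theta^2)^j over j >= 0

if __name__ == "__main__":
    print(float(partial_norm), float(geometric_bound))
```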

By construction

f_{*} (n_{t}) = 0

for every

t \geq 0

, i.e.

f_{*}

vanishes on the orbit

O

. Since

ψ

is generated by

O

and is supported on

O

in the sense that

ψ (g) = 0

whenever g vanishes on

O

, we have

ψ (f_{*}) = 0 .

(169)

On the other hand,

ϕ

is the rank-one eigenfunctional associated with the invariant density h, and in particular

ϕ

is strictly positive on nonzero nonnegative functions. Since

f_{*} \geq 0

and

f_{*} \not\equiv 0

with positive mass on each

J_{j}

, we have

ϕ (f_{*}) > 0 .

(170)

Since

f_{*}

vanishes at every point of the orbit, the support property gives

Ψ (f_{*}) = 0 .

(171)

Combining (166), (170), and (171) yields

0 = Ψ (f_{*}) = Ψ (h) ϕ (f_{*}),

which forces

Ψ (h) = 0

. Hence

Ψ = 0

, contradicting the assumed nontriviality of

Ψ

. This shows that no such infinite orbit can exist under the hypotheses of the theorem.

We conclude that no nontrivial invariant functional in

B_{tree, σ}^{*}

can be generated by an infinite forward Collatz orbit. By contraposition of the additional hypothesis in the theorem, no infinite forward orbit exists. Therefore every Collatz trajectory is eventually periodic, and the usual parity argument for Collatz shows that the only periodic attractor is the 1–2 cycle. This completes the proof. □

Lemma 25 (Uniform dual bound for orbit Cesàro averages) Let

B_{tree, σ}

be the multiscale tree space constructed above, and let

δ_{n} \in B_{tree, σ}^{*}

denote point evaluation at n, which is continuous since

B_{tree, σ} ↪ ℓ^{1}

. Fix

n_{0} \in N

with an infinite forward orbit

O^{+} (n_{0}) = {T^{k} n_{0}}_{k \geq 0}

under the Collatz map T. For each

N \geq 1

define the Cesàro averages

Λ_{N} (f) : = \frac{1}{N} \sum_{k = 0}^{N - 1} f (T^{k} n_{0}), f \in B_{tree, σ} .

(172)

Then

Λ_{N} \in B_{tree, σ}^{*}

for every

N \geq 1

, and there exists a constant

C > 0

, independent of N, such that

sup_{N \geq 1} {∥ Λ_{N} ∥}_{B_{tree, σ}^{*}} \leq C .

(173)

Proof. By definition,

Λ_{N} = \frac{1}{N} \sum_{k = 0}^{N - 1} δ_{T^{k} n_{0}}

(174)

as a functional on

B_{tree, σ}

. The continuous embedding

B_{tree, σ} ↪ ℓ^{1}

implies that there exists

C_{emb} > 0

such that

{∥ f ∥}_{1} \leq C_{emb} {∥ f ∥}_{tree, σ} for all f \in B_{tree, σ} .

For each

n \geq 1

and

f \in B_{tree, σ}

we have

| δ_{n} (f) | = | f (n) | \leq {∥ f ∥}_{1} \leq C_{emb} {∥ f ∥}_{tree, σ},

so

∥ δ_{n} ∥_{B_{tree, σ}^{*}} \leq C_{emb}

uniformly in n. By (174),

∥ Λ_{N} ∥_{B_{tree, σ}^{*}} \leq \frac{1}{N} \sum_{k = 0}^{N - 1} {∥ δ_{T^{k} n_{0}} ∥}_{B_{tree, σ}^{*}} \leq C_{emb} .

Taking

C = C_{emb}

yields (173). □

Proposition 11 (Orbit–generated invariant functional) Let

n_{0} \in N

have an infinite forward orbit

O^{+} (n_{0}) = {T^{k} n_{0}}_{k \geq 0}

under the Collatz map T. Let

Λ_{N}

be the Cesàro averages defined in (172). Then:

(i) There exists a subsequence

{(N_{j})}_{j \geq 1}

and a nonzero functional

Φ \in B_{tree, σ}^{*}

such that

Λ_{N_{j}} \overset{w^{*}}{\to} Φ

as

j \to \infty

.

(ii) The functional Φ is invariant for the dual Collatz operator:

Φ \circ P = Φ, equivalently P^{*} Φ = Φ .

(175)

(iii) The functional Φ is supported on the orbit

O^{+} (n_{0})

in the sense that if

f \in B_{tree, σ}

vanishes on

O^{+} (n_{0})

, then

Φ (f) = 0

.

In particular, Φ is a nontrivial

P^{*}

–invariant functional generated by the orbit

O^{+} (n_{0})

.

Proof. By Lemma 25 the family

{Λ_{N}}_{N \geq 1}

is bounded in

B_{tree, σ}^{*}

, so by Banach–Alaoglu there exists a subsequence

(N_{j})

and

Φ \in B_{tree, σ}^{*}

such that

Λ_{N_{j}} \overset{w^{*}}{\to} Φ

. Each

Λ_{N}

is positive and normalized,

Λ_{N} (1) = 1

, hence

Φ (1) = lim_{j \to \infty} Λ_{N_{j}} (1) = 1,

so

Φ

is nonzero. This proves (i).

For (ii), let

T_{*}

denote the pushforward operator on

B_{tree, σ}^{*}

associated with the forward Collatz map T and characterized by the duality relation

ψ (P f) = (T_{*} ψ) (f) for all f \in B_{tree, σ}, ψ \in B_{tree, σ}^{*} .

(176)

On point masses we have

T_{*} δ_{n} = δ_{T (n)}

, hence

T_{*} Λ_{N} = \frac{1}{N} \sum_{k = 0}^{N - 1} T_{*} δ_{T^{k} n_{0}} = \frac{1}{N} \sum_{k = 0}^{N - 1} δ_{T^{k + 1} n_{0}} = Λ_{N} + \frac{1}{N} (δ_{T^{N} n_{0}} - δ_{n_{0}}) .

Using the uniform bound on the norms of the point evaluations,

∥ T_{*} Λ_{N} - Λ_{N} ∥_{B_{tree, σ}^{*}} \leq \frac{2 C_{emb}}{N} ⟶ 0 (N \to \infty) .

Passing to the subsequence

N = N_{j}

and using weak* continuity of

T_{*}

gives

T_{*} Φ = Φ

. Applying (176) with

ψ = Φ

yields

Φ (P f) = T_{*} Φ (f) = Φ (f) for all f \in B_{tree, σ},

which is equivalent to

P^{*} Φ = Φ

and proves (ii).

For (iii), suppose

f \in B_{tree, σ}

satisfies

f (T^{k} n_{0}) = 0

for every

k \geq 0

. Then each

Λ_{N} (f) = 0

by definition (172), and therefore

Φ (f) = lim_{j \to \infty} Λ_{N_{j}} (f) = 0 .

Hence

Φ

vanishes on all functions that vanish along the orbit

O^{+} (n_{0})

, so it is supported on that orbit in the stated sense. □
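The shift identity behind part (ii), T_* Λ_N − Λ_N = (δ_{T^N n_0} − δ_{n_0})/N, can be verified exactly when Λ_N is stored as a finite signed measure. The sketch below makes two assumptions that are not in the text: measures are dictionaries of point weights, and the total variation is used as a stand-in for the dual norm (the proof itself bounds the defect by 2 C_{emb}/N). The starting point 27 is an illustrative choice.

```python
from fractions import Fraction


def collatz(n: int) -> int:
    """Collatz map T from equation (1)."""
    return n // 2 if n % 2 == 0 else 3 * n + 1


def iterate(n: int, k: int) -> int:
    """Return T^k(n)."""
    for _ in range(k):
        n = collatz(n)
    return n


def cesaro_measure(n0: int, N: int) -> dict:
    """Lambda_N as {point: weight}: weight 1/N for each visit of T^k n0, k < N."""
    weights, n = {}, n0
    for _ in range(N):
        weights[n] = weights.get(n, Fraction(0)) + Fraction(1, N)
        n = collatz(n)
    return weights


def pushforward(measure: dict) -> dict:
    """T_* acting on point masses: delta_n is sent to delta_{T(n)}."""
    out = {}
    for n, w in measure.items():
        t = collatz(n)
        out[t] = out.get(t, Fraction(0)) + w
    return out


def difference(a: dict, b: dict) -> dict:
    """Signed measure a - b with zero entries removed."""
    keys = set(a) | set(b)
    diff = {k: a.get(k, Fraction(0)) - b.get(k, Fraction(0)) for k in keys}
    return {k: v for k, v in diff.items() if v != 0}


n0, N = 27, 200  # illustrative choices
lam = cesaro_measure(n0, N)
defect = difference(pushforward(lam), lam)
end_point = iterate(n0, N)
total_variation = sum(abs(v) for v in defect.values())

if __name__ == "__main__":
    print(defect, total_variation)
```

Only the two endpoint masses survive the cancellation, which is why the defect is of order 1/N regardless of how the orbit behaves in between.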

Theorem 9 (Exclusion of zero-density infinite trajectories) Assume that the backward Collatz operator P acts on

B_{tree, σ}

as a positive, quasi–compact operator with a spectral gap, and that the spectrum on

| z | = 1

consists only of the simple eigenvalue 1. Let

h \in B_{tree, σ}

and

ϕ \in B_{tree, σ}^{*}

denote the normalized principal eigenpair satisfying

P h = h, ϕ \circ P = ϕ, ϕ (h) = 1,

with

h > 0

and

ϕ > 0

on the positive cone. Assume, in addition, that every infinite forward Collatz orbit

{T^{k} n_{0}}_{k \geq 0}

generates a nontrivial

P^{*}

–invariant functional

Φ \in B_{tree, σ}^{*}

with

Φ (h) \neq 0

, for example as a weak* limit of the Cesàro averages. Then no forward Collatz trajectory can be infinite; equivalently, every trajectory eventually enters the 1–2 cycle.

Proof. Assume, for contradiction, that there exists an infinite forward orbit

{T^{k} n_{0}}_{k \geq 0}

that never reaches

{1, 2}

.

Step 1: Construction of an invariant functional from the orbit. For

f \in B_{tree, σ}

define

Λ_{N} (f) : = \frac{1}{N} \sum_{k = 0}^{N - 1} f (T^{k} n_{0}) .

By the continuity of point evaluations and the Lasota–Yorke estimate, the functionals

Λ_{N}

are uniformly bounded on

B_{tree, σ}

, so they admit weak* accumulation points. By the additional hypothesis of the theorem we may choose such a limit

Φ

with

P^{*} Φ = Φ

and

Φ (h) \neq 0

, and we normalize

Φ (h) = 1 .

(177)

We claim

Φ

is

P^{*}

–invariant. For finitely supported f, the Collatz relation implies

(P f) (n) = \sum_{m : T (m) = n} \frac{f (m)}{m} = \frac{f (2 n)}{2 n} + 1_{{n \equiv 4 (6)}} \frac{f ((n - 1) / 3)}{(n - 1) / 3},

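and therefore, since each orbit point is a preimage of its successor,

(P f) (T^{k + 1} n_{0}) = \frac{f (T^{k} n_{0})}{T^{k} n_{0}} + R_{k},

where R_{k} collects the contribution of the other preimage of T^{k + 1} n_{0}, if there is one. A telescoping argument over k shows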

|Λ_{N} (P f) - Λ_{N} (f)| \leq \frac{C (f)}{N} ⟶ 0,

and the same follows for general

f \in B_{tree, σ}

by density of finitely supported functions and boundedness of P. Passing to the weak* limit gives

Φ (P f) = Φ (f) for all f \in B_{tree, σ},

so

P^{*} Φ = Φ

. As fixed in (177),

Φ

is normalized by

Φ (h) = 1 .

(178)

Step 2: Spectral convergence on the range of P. By quasi–compactness with spectral gap, there exist constants

C > 0

and

ρ \in (0, 1)

such that

{∥P^{k} f - ϕ (f) h∥}_{B_{tree, σ}} \leq C ρ^{k} {∥ f ∥}_{B_{tree, σ}} (k \geq 0) .

(179)

In particular,

P^{k} f \to ϕ (f) h

exponentially fast in norm.
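The mechanism in (179) can be seen in a finite-dimensional toy model. The sketch below is not the Collatz operator: it uses a hypothetical positive 2×2 matrix M with simple eigenvalue 1, right eigenvector h, and left eigenfunctional ϕ normalized by ϕ(h) = 1, and shows that M^k f − ϕ(f) h decays geometrically at the rate of the second eigenvalue.

```python
from fractions import Fraction as F

# Hypothetical positive matrix with columns summing to 1 (toy stand-in for P).
M = [[F(9, 10), F(2, 10)],
     [F(1, 10), F(8, 10)]]

# Right eigenvector for eigenvalue 1, normalized so that phi(h) = 1.
h = [F(2, 3), F(1, 3)]


def phi(f):
    """Left eigenfunctional for eigenvalue 1: the sum of the coordinates."""
    return f[0] + f[1]


def apply(matrix, f):
    """Matrix-vector product for a 2x2 matrix."""
    return [matrix[0][0] * f[0] + matrix[0][1] * f[1],
            matrix[1][0] * f[0] + matrix[1][1] * f[1]]


def deviation(f, k):
    """Return M^k f - phi(f) h, the analogue of P^k f - phi(f) h."""
    target = [phi(f) * h[0], phi(f) * h[1]]
    g = list(f)
    for _ in range(k):
        g = apply(M, g)
    return [g[0] - target[0], g[1] - target[1]]


if __name__ == "__main__":
    f = [F(1), F(0)]
    for k in (0, 5, 10, 20):
        print(k, [float(x) for x in deviation(f, k)])
```

In this toy case the second eigenvalue is 7/10, so the deviation shrinks by exactly that factor at each step; in the setting of the text the corresponding rate is the constant ρ in (179).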

Step 3: Test supported on the 1–2 cycle. Let

Ψ : = 1_{{1, 2}}

. Then

Ψ \in B_{tree, σ}

,

Ψ \geq 0

, and by Proposition 12 together with Lemma 14,

h (1), h (2) > 0

and

ϕ (Ψ) > 0 .

Because the forward orbit

{T^{k} n_{0}}

never enters

{1, 2}

, every term in

Λ_{N} (Ψ)

vanishes, and hence

Φ (Ψ) = lim_{N \to \infty} Λ_{N} (Ψ) = 0 .

(180)

Step 4: Invariance and spectral convergence yield a contradiction. Using

P^{*} Φ = Φ

and (179),

Φ (Ψ) = Φ (P^{k} Ψ) = Φ (ϕ (Ψ) h + (P^{k} Ψ - ϕ (Ψ) h)) = ϕ (Ψ) Φ (h) + Φ (P^{k} Ψ - ϕ (Ψ) h) .

Since

Φ

is continuous and

∥ P^{k} Ψ - ϕ (Ψ) h ∥ \to 0

exponentially, the last term tends to zero. Taking

k \to \infty

gives

Φ (Ψ) = ϕ (Ψ) Φ (h) .

(181)

By (178),

Φ (h) = 1

, so the right-hand side of (181) equals

ϕ (Ψ) > 0

. However, by (180), the left-hand side is 0. This contradiction shows that no such infinite orbit can exist.

Step 5: Conclusion. Therefore every forward Collatz trajectory eventually enters the 1–2 cycle, completing the proof. □

Invariant Pair, Positivity, and Support

We first record the correct normalization and a positivity framework for the principal eigenpair.

Definition 5 (Principal eigenpair and normalization) Let P act on the Banach lattice

B_{tree, σ}

with positive cone

B_{tree, σ}^{+} = {f \in B_{tree, σ} : f \geq 0}

. Assume P is quasi–compact with spectral gap and the spectrum on

| z | = 1

reduces to the simple eigenvalue 1. Then there exist

h \in B_{tree, σ}^{+} ∖ {0}

and

ϕ \in {(B_{tree, σ})}^{*}

,

ϕ \geq 0

, such that

P h = h, ϕ \circ P = ϕ,

and we fix the normalization

ϕ (h) = 1

.

Remark 14 (Positivity and logarithmic mass) P is positive:

f \geq 0 \Rightarrow P f \geq 0

. It is logarithmically mass–preserving rather than mass–preserving: for finitely supported f,

\sum_{n \geq 1} (P f) (n) = \sum_{m \geq 1} \frac{f (m)}{m} .

Hence the constant function

1

is not invariant; instead, the fixed point h must decay at infinity (indeed

h (n) \sim c / n

is consistent with

P h = h

). All spectral decompositions and projections are therefore expressed relative to h and ϕ:

Π f = ϕ (f) h .
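The logarithmic mass identity of Remark 14 can be checked directly from the preimage formula for P. The following sketch assumes finitely supported functions stored as dictionaries; the test function is a hypothetical choice, and the summation range is cut at 3M + 1, which contains every image T(m) of a point m ≤ M.

```python
from fractions import Fraction


def P_at(f: dict, n: int) -> Fraction:
    """(P f)(n) = f(2n)/(2n) + 1_{n = 4 mod 6} * f((n-1)/3) / ((n-1)/3)."""
    value = Fraction(f.get(2 * n, 0)) / (2 * n)
    if n % 6 == 4:
        m = (n - 1) // 3
        value += Fraction(f.get(m, 0)) / m
    return value


def total_mass_after_P(f: dict) -> Fraction:
    """Sum of (P f)(n) over all n; the support of P f lies in [1, 3 * max(supp f) + 1]."""
    upper = 3 * max(f) + 1
    return sum((P_at(f, n) for n in range(1, upper + 1)), Fraction(0))


def logarithmic_mass(f: dict) -> Fraction:
    """Sum of f(m)/m over the support of f."""
    return sum((Fraction(v) / m for m, v in f.items()), Fraction(0))


# Hypothetical nonnegative test function supported on {1, ..., 59}.
f = {m: m % 5 + 1 for m in range(1, 60)}

if __name__ == "__main__":
    print(total_mass_after_P(f), logarithmic_mass(f))
```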

Definition 6 (Invariant ideals and zero-sets) A closed ideal

I \subset B_{tree, σ}

is a closed subspace such that

f \in I

and

| g | \leq | f |

imply

g \in I

. Equivalently, there exists a subset

S \subset N

(the zero-set of

I

) with

I = {f \in B_{tree, σ} : f |_{S} = 0} .

We call

I

(or S) P-invariant if

P I \subset I

.

Lemma 26 (Zero-set characterization) Let

I

be a closed ideal with zero-set S. Then

P I \subset I

if and only if S is closed under the preimage rules of T, namely

n \in S \Rightarrow 2 n \in S and (n \equiv 4 (mod 6)) \Rightarrow \frac{n - 1}{3} \in S .

Proof. If

P I \subset I

, take

f \in I

and

n \in S

. Then

(P f) (n) = \frac{f (2 n)}{2 n} + 1_{{n \equiv 4 (6)}} \frac{f ((n - 1) / 3)}{(n - 1) / 3} = 0

. Since

f \geq 0

can be chosen with arbitrary positive values off S, both indices

2 n

and (when defined)

(n - 1) / 3

must also belong to S. Conversely, if S obeys these closures, then for each

n \in S

and every f vanishing on S we have

(P f) (n) = 0

, hence

P I \subset I

. □
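The closure rules of Lemma 26 are easy to experiment with. The sketch below is an illustration only: it generates the smallest set containing given seeds that is closed under n ↦ 2n and, for n ≡ 4 (mod 6), n ↦ (n − 1)/3, truncated to a hypothetical bound. Because of the truncation, points that can only be reached through values above the bound are not found.

```python
def preimages(n: int) -> list:
    """Preimages of n under the Collatz map T: always 2n, and (n-1)/3 when n = 4 mod 6."""
    result = [2 * n]
    if n % 6 == 4:
        result.append((n - 1) // 3)
    return result


def backward_closure(seeds, bound: int) -> set:
    """Closure of `seeds` under the preimage rules, restricted to [1, bound]."""
    closed = set()
    stack = [s for s in seeds if 1 <= s <= bound]
    while stack:
        n = stack.pop()
        if n in closed:
            continue
        closed.add(n)
        stack.extend(m for m in preimages(n) if m <= bound)
    return closed


def is_closed_below(S: set, bound: int) -> bool:
    """Check the closure rules of Lemma 26 for all preimages that stay within the bound."""
    return all(m in S for n in S for m in preimages(n) if m <= bound)


if __name__ == "__main__":
    S = backward_closure({16}, 40)
    print(sorted(S), is_closed_below(S, 40))
```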

Lemma 27 (Ideal-irreducibility) The only closed P-invariant ideals in

B_{tree, σ}

are

{0}

and

B_{tree, σ}

. Equivalently, the only zero-sets

S \subset N

satisfying the closure rules of Lemma 26 are

S = \emptyset

and

S = N

.

Proof. Let

S \neq \emptyset

satisfy the closure rules. (i) If S contains an odd n, then

2^{k} n \in S

for all

k \geq 0

. There exists

k \geq 2

with

2^{k} n \equiv 4 (mod 6)

, hence

(2^{k} n - 1) / 3 \in S

. Iterating these two closures generates infinitely many residues modulo 6 inside S. From here a routine Chinese Remainder argument shows S meets every sufficiently large arithmetic progression, whence

S = N

by downward propagation through the map

n \mapsto (n - 1) / 3

when defined or via parity halving (details can be included in an appendix). (ii) If S contains only even numbers, pick

n \in S

and write

n = 2^{a} m

with m odd. Then

2^{k} m \in S

for all

k \geq a

; choosing

k \geq a + 2

forces

2^{k} m \equiv 4 (mod 6)

and again

(2^{k} m - 1) / 3 \in S

is odd, reducing to case (i). Hence

S = N

. Therefore the only possibilities are

S = \emptyset

and

S = N

, proving ideal-irreducibility. □

Proposition 12 (Full support of h and strict positivity of $ϕ$) Assume that

P : B_{tree, σ} \to B_{tree, σ}

is a positive, quasi–compact operator with a simple eigenvalue 1 at the spectral radius and that P is ideal–irreducible in the sense of Lemma 27. Let

h \in B_{tree, σ}

and

ϕ \in B_{tree, σ}^{*}

be the principal eigenvectors satisfying

P h = h, ϕ \circ P = ϕ, ϕ (h) = 1 .

Then

h (n) > 0

for every

n \geq 1

, and ϕ is strictly positive on the cone of nonnegative nonzero functions:

f \in B_{tree, σ}, f \geq 0, f \not\equiv 0 ⟹ ϕ (f) > 0 .

Proof. Because P is positive and quasi–compact, the Krein–Rutman theorem (see, e.g., Schaefer, Banach Lattices and Positive Operators, Thm. V.3.7) provides nonzero

h \geq 0

and

ϕ \geq 0

with

P h = h

and

ϕ \circ P = ϕ

corresponding to the peripheral eigenvalue 1. The eigenvectors h and

ϕ

are unique up to positive scalars because 1 is simple and isolated.

Step 1: Pointwise positivity of h. Suppose, for contradiction, that

h (n_{0}) = 0

for some

n_{0} \in N

. Define the closed ideal

I_{n_{0}} : = {f \in B_{tree, σ} : f (n_{0}) = 0} .

Since

P h = h

and P is positive, we have for all

n \in N

h (n) = \frac{h (2 n)}{2 n} + 1_{{n \equiv 4 (mod 6)}} \frac{h ((n - 1) / 3)}{(n - 1) / 3} .

If

h (n_{0}) = 0

, both preimage indices

2 n_{0}

and, when defined,

(n_{0} - 1) / 3

must also satisfy

h = 0

. By iteration of this closure rule, the zero set

{n : h (n) = 0}

is closed under both preimage maps of the Collatz tree and therefore defines a nontrivial P–invariant ideal. This contradicts Lemma 27, which asserts that the only P–invariant ideals are

{0}

and

B_{tree, σ}

. Hence the zero set is empty and

h (n) > 0

for all n.

Step 2: Strict positivity of ϕ. Let

f \geq 0

with

f \not\equiv 0

and suppose

ϕ (f) = 0

. Denote by

J_{f}

the closed ideal generated by f:

J_{f} : = {g \in B_{tree, σ} : | g | \leq C P f for some C > 0} .

Because P is positive,

J_{f}

is P–invariant and nontrivial. For every

g \in J_{f}

and every

k \geq 0

we have

ϕ (P^{k} g) = ϕ (g) = 0

by invariance of

ϕ

. In particular,

ϕ

vanishes on a nontrivial P–invariant ideal, contradicting ideal–irreducibility. Therefore

ϕ (f) > 0

for all nonzero

f \geq 0

.

Step 3: Conclusion. By Step 1, h is strictly positive pointwise, and by Step 2,

ϕ

is strictly positive on the positive cone. Consequently h is a quasi–interior point of

B_{tree, σ}^{+}

and

ϕ

is a strictly positive functional, as required. □

Corollary 2 (Positivity on cycle tests) Let

Ψ = 1_{{1, 2}}

. Then

ϕ (Ψ) > 0

.

Proof. By Proposition 12,

h (1), h (2) > 0

, and

ϕ

is strictly positive on

B_{tree, σ}^{+} ∖ {0}

. Since

Ψ \geq 0

and

Ψ \not\equiv 0

, we have

ϕ (Ψ) > 0

. □

6. Explicit Verification of the Odd-Branch Contraction Constant

The final analytic step in the argument is to verify rigorously that the contraction constant

λ_{odd} (α, ϑ)

appearing in the Lasota–Yorke inequality (41) satisfies

λ_{odd} < 1

for the explicit parameter values

(α, ϑ) = (\frac{1}{2}, \frac{1}{5})

. This establishes that the odd branch of the backward Collatz operator P acts as a strict contraction in the strong seminorm

{[\cdot]}_{tree}

, ensuring that P is quasi-compact on

B_{tree, σ}

with a uniform spectral gap in the strong topology.

From Section 4.4, the odd-branch contraction satisfies

λ_{odd} (α, ϑ) \leq \frac{C_{α}}{\sqrt{6}} ϑ, C_{α} : = sup_{u > v > 0} \frac{W_{α} (u^{'}, v^{'})}{W_{α} (u, v)},

(182)

where

W_{α} (u, v) = \frac{u v}{| u - v | {(u + v)}^{α}}, (u^{'}, v^{'}) = (\frac{u - 1}{3}, \frac{v - 1}{3}) .

At

α = \frac{1}{2}

, Lemma 19 gives the explicit distortion bound

\frac{W_{1 / 2} (u, v)}{u^{'}} \leq \frac{3}{2} \frac{W_{1 / 2} (u^{'}, v^{'})}{\sqrt{6}}, hence C_{1 / 2} \leq \frac{3}{2} .

(183)

Substituting (183) into (182) yields

λ_{odd} (\frac{1}{2}, \frac{1}{5}) \leq \frac{3}{2 \sqrt{6}} \cdot \frac{1}{5} \approx 0.1225 < 1 .

This confirms the strict odd-branch contraction at

(α, ϑ) = (\frac{1}{2}, \frac{1}{5})

without any numerical optimization beyond Lemma 19.

Uniform Lasota–Yorke Constant.

We fix the combined Lasota–Yorke constant by

λ_{LY} (α, ϑ) : = λ_{even} (α, ϑ) + λ_{odd} (α, ϑ), λ_{even} (α, ϑ) = 2^{- (1 - α)} ϑ,

(184)

where the even-branch factor comes from the scaling identity

W_{α} (2 u, 2 v) = 2^{1 - α} W_{α} (u, v)

, so both branches are measured with the same block scale factor

ϑ

. For

(α, ϑ) = (\frac{1}{2}, \frac{1}{5})

,

λ_{even} (\frac{1}{2}, \frac{1}{5}) = 2^{- 1 / 2} \cdot \frac{1}{5} \approx 0.1414 .

Using the conservative odd-branch bound above,

λ_{LY} (\frac{1}{2}, \frac{1}{5}) \leq 0.1414 + 0.1918 \approx 0.3332 < 1,

and with the refined

C_{1 / 2} = \frac{3}{2}

one even gets

λ_{LY} (\frac{1}{2}, \frac{1}{5}) \approx 0.2639 < 1

. By the Ionescu–Tulcea–Marinescu–Hennion theory applied to the two-norm Lasota–Yorke inequality (Proposition 2),

ρ_{ess} (P) \leq λ_{LY} (\frac{1}{2}, \frac{1}{5}) < 1,

(185)

so P is quasi-compact on

B_{tree, σ}

with a strict Lasota–Yorke contraction in the strong seminorm.
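The arithmetic behind the explicit constants is short enough to recompute. The sketch below takes the bound C_{1/2} ≤ 3/2 from (183) as given rather than re-deriving it, and reads W_α(u, v) as uv / (|u − v| (u + v)^α), the form compatible with the scaling identity W_α(2u, 2v) = 2^{1−α} W_α(u, v). The sample points (u, v) are arbitrary.

```python
import math


def W(alpha: float, u: float, v: float) -> float:
    """Weight W_alpha(u, v) = u v / (|u - v| (u + v)^alpha), for u != v."""
    return u * v / (abs(u - v) * (u + v) ** alpha)


alpha, vartheta = 0.5, 0.2   # the parameter pair (1/2, 1/5) from the text
C_half = 1.5                 # distortion bound C_{1/2} <= 3/2, taken from (183)

lam_even = 2 ** (-(1 - alpha)) * vartheta             # even branch, as in (184)
lam_odd_bound = (C_half / math.sqrt(6)) * vartheta    # odd branch, as in (182)
lam_LY_bound = lam_even + lam_odd_bound

# Numerical check of the scaling identity at a few arbitrary sample points.
scaling_ok = all(
    math.isclose(W(alpha, 2 * u, 2 * v), 2 ** (1 - alpha) * W(alpha, u, v))
    for u, v in [(7.0, 4.0), (100.0, 13.0), (10.0, 9.5)]
)

if __name__ == "__main__":
    print(round(lam_even, 4), round(lam_odd_bound, 4), round(lam_LY_bound, 4), scaling_ok)
```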

Proposition 13 (Explicit invariant functional and block-level recursion) Assume P is a positive quasi-compact operator on

B_{tree, σ}

with a simple eigenvalue at 1 and no other spectrum on

| z | = 1

. Then ... there exists a unique positive invariant functional

ϕ \in B_{tree, σ}^{*}

with

ϕ (h) = 1

such that the rank-one spectral projector is

Π f = ϕ (f) h .

Moreover, if

h \in B_{tree, σ}

is any P-invariant eigenfunction, then its block averages

c_{j}

satisfy the homogeneous two-sided recursion

c_{j} = a c_{j + 1} + b c_{j - 1}, j \geq 1,

(186)

with coefficients

a, b > 0

determined by the asymptotic even/odd preimage ratios (Lemma 18). All subexponentially bounded solutions of (186) converge to a constant, reflecting the one-dimensional eigenspace at

λ = 1

.

Proof. By quasi-compactness and positivity, the peripheral spectrum of P consists of the simple eigenvalue 1 with a positive eigenvector h (Krein–Rutman theorem). Since the remainder of the spectrum lies inside

{| z | < λ_{LY}}

, the Cesàro averages

h_{N}

converge to h in

B_{tree, σ}

, establishing existence and uniqueness of the normalized fixed point

P h = h

.

To derive the block recursion, average the identity

P h = h

over

I_{j}

. Each

m \in I_{j}

receives contributions from its even and odd preimages: even preimages arise from

2 I_{j}

, odd preimages from

(I_{j} - 1) / 3

restricted to the points m \equiv 4 (mod 6), where the quotient is an integer. Using the transfer formula

(P f) (m) = \sum_{x : T (x) = m} f (x) / w (x)

and summing over

m \in I_{j}

gives

\frac{1}{| I_{j} |} \sum_{m \in I_{j}} h (m) = \frac{1}{| I_{j} |} \sum_{m \in I_{j}} \sum_{x : T (x) = m} \frac{h (x)}{w (x)} = a c_{j + 1} + b c_{j - 1},

where

a, b > 0

depend only on the relative frequencies of even and odd preimages and the fixed arithmetic weights

w (x)

(defined in Section 2.3). This yields (186). If

4 a b < 1

, the characteristic equation

a r^{2} - r + b = 0

has two positive roots; the smaller root

r \in (0, 1)

corresponds to the decaying solution required for

h \in B_{tree, σ}

. Normalization of

{∥ h ∥}_{1} = 1

fixes

c_{0}

and hence C. Finally, the Lasota–Yorke distortion bounds of Section 4.4.2 imply that within each block

I_{j}

the invariant density h is comparable to its average

c_{j}

, yielding the geometric decay profile established above. □
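The role of the characteristic equation a r² − r + b = 0 can be illustrated numerically. In the sketch below the coefficients a = 0.3 and b = 0.5 are hypothetical placeholders, not the values determined by Lemma 18; they are chosen so that 4ab < 1 and the smaller root lies in (0, 1). The code checks that c_j = r^j then solves the recursion (186).

```python
import math


def characteristic_roots(a: float, b: float):
    """Roots of a r^2 - r + b = 0, assuming a > 0, b > 0 and 4ab < 1."""
    disc = 1 - 4 * a * b
    if disc <= 0:
        raise ValueError("requires 4ab < 1 for two distinct real roots")
    s = math.sqrt(disc)
    return (1 - s) / (2 * a), (1 + s) / (2 * a)


a, b = 0.3, 0.5  # hypothetical coefficients, for illustration only
r_small, r_large = characteristic_roots(a, b)

# Geometric candidate c_j = r_small^j and its residual in c_j = a c_{j+1} + b c_{j-1}.
c = [r_small ** j for j in range(12)]
residuals = [c[j] - (a * c[j + 1] + b * c[j - 1]) for j in range(1, 11)]

if __name__ == "__main__":
    print(r_small, r_large, max(abs(x) for x in residuals))
```

Whether the smaller root actually lies in (0, 1) depends on a and b beyond the condition 4ab < 1, so the check on r_small is part of the example rather than a general fact.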

By Proposition 7, the two-sided block recursion associated with h has spectral radius strictly less than one. Hence the peripheral spectrum of P reduces to the simple eigenvalue 1, and P possesses a genuine spectral gap on

B_{tree, σ}

.

Remark (small-ϑ behaviour). Proposition 14 shows that

λ_{even} (α, ϑ) = O (ϑ)

and

λ_{odd} (α, ϑ) = O (ϑ)

, so that

λ_{LY} (α, ϑ) = O (ϑ)

as

ϑ ↓ 0

. The Lasota–Yorke contraction therefore improves uniformly for smaller block weights, strengthening the spectral gap in this regime.

Proposition 14 (Small-$ϑ$ asymptotics of the strong contraction) Fix

α \in (0, 1]

. For the strong seminorm

{[\cdot]}_{tree}

on

B_{tree, σ}

with block weight parameter

ϑ \in (0, 1)

, the Lasota–Yorke constants satisfy

{[P f]}_{tree} \leq λ (α, ϑ) {[f]}_{tree} + C {∥ f ∥}_{1}, λ (α, ϑ) = λ_{even} (α, ϑ) + λ_{odd} (α, ϑ),

with

λ_{even} (α, ϑ) \leq C_{even} ϑ

and

λ_{odd} (α, ϑ) \leq (C_{α} / \sqrt{6}) ϑ .

In particular,

λ (α, ϑ) = O (ϑ) as ϑ ↓ 0,

and therefore

{lim}_{ϑ \to 0} λ (α, ϑ) = 0

.

Proof. Each branch moves mass by at most one block in the strong seminorm. Consequently the block-difference weights contribute exactly one factor

ϑ

. The even branch carries no additional distortion, giving

λ_{even} \leq C_{even} ϑ

. The odd branch distortion is controlled by Section 4.4.2, yielding

λ_{odd} \leq (C_{α} / \sqrt{6}) ϑ

. Summing proves

λ (α, ϑ) = O (ϑ)

and the limit. □

Corollary 3 (Verified spectral gap) Let

(α, ϑ) = (\frac{1}{2}, \frac{1}{5})

and

σ > 1

. Assume that the explicit branch estimates yield

λ_{LY} (α, ϑ) < 1

as defined in (184). Then the backward Collatz transfer operator P acting on

B_{tree, σ}

satisfies the Lasota–Yorke inequality

{[P f]}_{tree} \leq λ_{LY} {[f]}_{tree} + C_{LY} {∥ f ∥}_{σ} for all f \in B_{tree, σ} .

Hence:

P is quasi-compact on $B_{tree, σ}$ with $ρ_{ess} (P) \leq λ_{LY} < 1$ .
If the structural relation of Proposition 7 holds, then P possesses a genuine spectral gap on $B_{tree, σ}$ : all spectral values with $| z | > λ_{LY}$ are isolated eigenvalues of finite multiplicity.

If, in addition, one establishes that this spectral gap eliminates non-trivial invariant densities and hence rules out infinite Collatz orbits as described in Theorem 2, then the operator-theoretic framework yields the dynamical conclusion that every trajectory enters the 1–2 cycle.

Proof. Under

λ_{LY} < 1

, Proposition 2 provides the two-norm Lasota–Yorke inequality above. The compact embedding

B_{tree, σ} ↪ ℓ_{σ}^{1}

(Lemma 7) ensures that the hypotheses of the Ionescu–Tulcea–Marinescu–Hennion theorem are satisfied, yielding

ρ_{ess} (P) \leq λ_{LY} < 1

. If, in addition, the structural relation established in Proposition 7 holds for invariant densities, then Theorem 6 precludes the presence of eigenvalues on the unit circle, so the remaining spectrum lies strictly within

{z : | z | \leq λ_{LY}}

. The claimed spectral-gap statement follows. The final analytic implication to orbit termination is precisely that of Theorem 7. □

The analytic chain is now closed: the explicit computation of

C_{1 / 2}

guarantees the contraction, the Lasota–Yorke framework enforces quasi-compactness, and the spectral reduction connects this, under the invariant-functional hypothesis of Theorems 8 and 9, to universal Collatz termination. The following theorem summarizes the result.

Theorem 10 (Spectral gap and conditional consequences for Collatz) Let P be the backward transfer operator associated with the Collatz map (1), acting on the multiscale Banach space

B_{tree, σ}

with parameters

(α, ϑ) = (\frac{1}{2}, \frac{1}{5})

. Then:

(1): The Lasota–Yorke inequality on $B_{tree, σ}$ holds with contraction constant $λ_{LY} (α, ϑ) < 1$ , and P is quasi-compact with a genuine spectral gap $ρ_{ess} (P) < 1$ .
(2): The eigenvalue $λ = 1$ is algebraically simple. There exist a unique positive eigenvector $h \in B_{tree, σ}$ and a unique positive invariant functional $ϕ \in B_{tree, σ}^{*}$ such that

$P h = h, ϕ \circ P = ϕ, ϕ (h) = 1 .$

The spectral projector is $Π f = ϕ (f) h$ , and the complementary part $N : = P - Π$ satisfies $ρ (N) < 1$ .
(3): The block recursion of Section 5.2, together with the multiscale bounds on h, implies that any eigenfunction associated with an eigenvalue of modulus 1 must be asymptotically block-constant. The weighted $ℓ_{σ}^{1}$ contraction then forces such an eigenfunction to vanish unless it is proportional to h. Hence h spans the entire peripheral spectrum.
(4): As a consequence, there is no nontrivial P-invariant or periodic density supported on non-terminating orbits, and no positive-density family of divergent forward trajectories exists (Theorem 7). If, in addition, every infinite forward orbit gives rise to a nontrivial $P^{*}$ -invariant functional $Ψ \in B_{tree, σ}^{*}$ with $Ψ (h) \neq 0$ (the invariant-functional assumption of Theorems 8 and 9), then no infinite forward Collatz orbit can exist. Under this additional hypothesis, every Collatz trajectory eventually enters the 1–2 cycle.

Proof. Fix $(α, ϑ) = (\frac{1}{2}, \frac{1}{5})$ and $σ > 1$ as in the statement. We argue in four steps that correspond to the numbered items.

(1) Lasota–Yorke inequality and quasi-compactness. By Proposition 2 there exist constants $0 < λ_{LY} < 1$ and $C_{LY} > 0$ such that for all $f \in B_{tree, σ}$

{[P f]}_{tree, σ} \leq λ_{LY} {[f]}_{tree, σ} + C_{LY} {∥ f ∥}_{1},

(187)

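and, by iteration, for every $n \geq 1$,

{[P^{n} f]}_{tree, σ} \leq λ_{LY}^{n} {[f]}_{tree, σ} + C_{LY}' {∥ f ∥}_{1},

(188)

where $C_{LY}'$ is independent of n; if P does not increase ${∥ \cdot ∥}_{1}$, summing the geometric series of error terms gives $C_{LY}' = C_{LY} / (1 - λ_{LY})$.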
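The passage from the one-step bound (187) to an n-step bound can be checked numerically on the scalar recursion behind it. The following is a minimal sketch, not part of the paper's argument: the constants `LAM` and `C` are hypothetical, and it assumes the weak norm ${∥ f ∥}_{1}$ does not grow under P, so the error term stays fixed at each step.

```python
def iterate_ly(seminorm0, l1_norm, lam, c, n):
    """Worst case after n applications of a one-step Lasota-Yorke bound
    a_{k+1} <= lam * a_k + c * b, with the weak norm b held fixed
    (assumption: the operator does not increase the weak norm)."""
    a = seminorm0
    for _ in range(n):
        a = lam * a + c * l1_norm
    return a


def closed_form_bound(seminorm0, l1_norm, lam, c, n):
    """Geometric-series bound lam^n * a_0 + c / (1 - lam) * b."""
    return lam ** n * seminorm0 + c / (1.0 - lam) * l1_norm


# Hypothetical constants, chosen only for illustration.
LAM, C = 0.8, 2.0
for n in (1, 5, 20, 100):
    print(n, iterate_ly(10.0, 1.0, LAM, C, n),
          closed_form_bound(10.0, 1.0, LAM, C, n))
```

In this toy recursion the accumulated constant is $C / (1 - λ)$ rather than C itself: the error terms add up along the iteration, but they stay bounded uniformly in n.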

The compact embedding of the seminorm unit ball $\{{[\cdot]}_{tree, σ} \leq 1\}$ into $(B_{tree, σ}, {∥ \cdot ∥}_{1})$ (by the multiscale definition of the tree seminorm and $σ > 1$) yields the Ionescu–Tulcea–Marinescu/Hennion spectral bound

ρ_{ess} (P) \leq λ_{LY} < 1 .

(189)

Hence P is quasi-compact on $B_{tree, σ}$.

(2) One-dimensional eigenspace at $λ = 1$ and the rank-one projector. Positivity of P on the natural cone of nonnegative functions, together with irreducibility along the Collatz tree (every level communicates at uniformly bounded depth), implies that the peripheral spectrum is reduced to $\{1\}$ and that the eigenvalue $λ = 1$ is simple. By Theorem 1 there exist unique positive elements

h \in B_{tree, σ}, ϕ \in B_{tree, σ}^{*},

such that

P h = h, ϕ \circ P = ϕ, ϕ (h) = 1,

(190)

and the rank-one spectral projector at $λ = 1$ is

Π f = ϕ (f) h, f \in B_{tree, σ} .

(191)

Let $N : = P - Π$. Then $Π N = N Π = 0$, the spectrum of N is contained in the disc $\{z : | z | \leq ρ (N)\}$ with $ρ (N) < 1$, and by (188)–(189), for any $r \in (ρ (N), 1)$ there is a constant $C = C (r)$ with

P^{n} f = ϕ (f) h + N^{n} f, {∥ N^{n} f ∥}_{tree, σ} \leq C r^{n} ({[f]}_{tree, σ} + {∥ f ∥}_{1}) .

(192)

In particular $P^{n} f \to ϕ (f) h$ exponentially fast in the strong topology.
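A finite-dimensional toy model makes the rank-one convergence concrete. The sketch below is only an illustration under stated assumptions: the 3×3 column-stochastic matrix `P` is a hypothetical stand-in for the Collatz operator, the invariant functional is taken to be `phi(v) = sum(v)` (invariant because the columns sum to 1), and `h` is found by power iteration.

```python
def mat_vec(m, v):
    """Matrix-vector product for a square matrix given as nested lists."""
    return [sum(m[i][j] * v[j] for j in range(len(v))) for i in range(len(m))]


def phi(v):
    """Invariant functional of a column-stochastic matrix: total mass."""
    return sum(v)


def invariant_vector(m, steps=200):
    """Power iteration from the uniform vector; total mass 1 is preserved."""
    v = [1.0 / len(m)] * len(m)
    for _ in range(steps):
        v = mat_vec(m, v)
    return v


# Hypothetical positive matrix whose columns each sum to 1.
P = [[0.5, 0.2, 0.3],
     [0.3, 0.5, 0.3],
     [0.2, 0.3, 0.4]]

h = invariant_vector(P)                 # P h = h with phi(h) = 1
f = [3.0, -1.0, 2.0]                    # arbitrary test vector
target = [phi(f) * x for x in h]        # rank-one part phi(f) h

v, errors = f, []
for n in range(1, 31):
    v = mat_vec(P, v)                   # v = P^n f
    errors.append(max(abs(a - b) for a, b in zip(v, target)))

print(errors[0], errors[9], errors[-1])
```

The printed errors shrink geometrically, mirroring the decomposition $P^{n} f = ϕ (f) h + N^{n} f$ with $ρ (N) < 1$ in this finite setting.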

(3) Decay profile of h. Let $c_{j} : = {\langle h \rangle}_{I_{j}}$ denote the block averages of h on the dyadic–6 tree intervals $I_{j}$ used in the definition of $B_{tree, σ}$. The block recursion developed in Section 5.2 shows that ${(c_{j})}_{j \geq 0}$ obeys a two-sided linear recursion with summable perturbations and limiting coefficients $(a, b)$ that are strictly positive and satisfy $a + b = 1$. Passing to the limit and unwinding the block weights yields the pointwise asymptotic along rays of the tree,

h (n) \sim \frac{c}{n} (n \to \infty),

(193)

for some $c > 0$, as recorded in Proposition 6. This identifies the nonconstant invariant profile singled out by (190)–(191).

(4) Excluding divergent mass and nonterminating orbits. Assume there exists either: (i) a nontrivial P-invariant or P-periodic density $g \geq 0$ supported on forward nonterminating trajectories, or (ii) a set $S \subset \mathbb{N}$ of positive upper density generating only nonterminating forward orbits. In case (i), writing $g = ϕ (g) h + g_{0}$ with $ϕ (g_{0}) = 0$ and using $P^{q} g = g$ for some fixed period $q \geq 1$, so that $P^{k q} g = g$ for every $k \geq 1$, we obtain from (192)

g - ϕ (g) h = N^{k q} g \underset{k \to \infty}{\to} 0 in B_{tree, σ},

which forces $g = ϕ (g) h$ by uniqueness of limits in the strong topology. Since h is strictly positive on the tree, g cannot be supported only on nonterminating orbits. Hence no such g exists.

In case (ii), the Krylov–Bogolyubov construction applied to the normalized averages supported on $S \cap [1, N]$ (after smoothing to obtain elements of $B_{tree, σ}$) produces a weak-$*$ accumulation point $μ \in B_{tree, σ}^{*}$ that is $P^{*}$-invariant and assigns positive mass to the nonterminating region. By Theorem 7, the spectral gap (189) implies that every nontrivial $P^{*}$-invariant functional must lie in the one-dimensional eigenspace $span \{ϕ\}$ dual to the invariant density h. Since $ϕ$ is strictly positive on h and vanishes on any density supported away from the terminating dynamics, no such $μ$ can arise from a set S of positive upper density. Hence no positive-density family of nonterminating orbits can exist.

If, in addition, every infinite forward orbit generates a nontrivial $P^{*}$-invariant functional in $B_{tree, σ}^{*}$ with nonzero pairing against h (the invariant-functional hypothesis of Theorem 8), then the same spectral exclusion forces every individual forward orbit to be finite. Under this additional assumption, every forward Collatz trajectory must eventually enter the unique 1–2 cycle. □

7. Outlook: Towards a Spectral Calculus of Arithmetic Dynamics

The analytic framework developed here for the backward Collatz operator suggests a broader spectral calculus applicable to many discrete arithmetic maps. Whenever a map $T : \mathbb{N} \to \mathbb{N}$ is studied through its backward dynamics, one may define a transfer operator

(P f) (n) = \sum_{m : T (m) = n} \frac{f (m)}{w (m)},

whose spectral properties encode the arithmetic and combinatorial structure of T. Acting on weighted sequence spaces such as $ℓ_{σ}^{1}$ or on the multi-scale tree space $B_{tree, σ}$, this operator admits a Dirichlet transform intertwining

D (P f) (s) = L_{s} D (f) (s), D (f) (s) = \sum_{n \geq 1} f (n) n^{- s},

so that spectral information for P translates into analytic continuation and pole structure of the complex family $L_{s}$. The duality between the arithmetic operator P and its analytic avatar $L_{s}$ thus provides a natural language for studying discrete iteration through spectral and analytic means.
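For the Collatz map (1) the backward operator and the Dirichlet transform can be written down directly. The sketch below is illustrative only: the weight `w` defaults to the constant 1 as a placeholder (the weights actually used in the paper are fixed in earlier sections), and the Dirichlet sum is truncated at a hypothetical cutoff `terms`.

```python
def collatz(n):
    """Forward Collatz map T from equation (1)."""
    return n // 2 if n % 2 == 0 else 3 * n + 1


def preimages(n):
    """All m >= 1 with collatz(m) == n."""
    pre = [2 * n]                      # even branch: m = 2n always maps to n
    if n % 6 == 4:                     # odd branch: n = 3m + 1 with m odd
        pre.append((n - 1) // 3)
    return pre


def transfer(f, n, w=lambda m: 1.0):
    """(P f)(n) = sum over T(m) = n of f(m) / w(m); w is a placeholder weight."""
    return sum(f(m) / w(m) for m in preimages(n))


def dirichlet(f, s, terms=10_000):
    """Truncated Dirichlet transform: sum of f(n) * n^(-s) for n <= terms."""
    return sum(f(n) * n ** (-s) for n in range(1, terms + 1))


print(preimages(4), preimages(5), preimages(10))
print(transfer(lambda m: 1.0 / m, 4))
print(dirichlet(lambda n: 1.0, 2.0))
```

The condition `n % 6 == 4` encodes the odd branch: n has an odd preimage exactly when $n = 3 m + 1$ with m odd, so each n has one or two preimages.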

For quasi-compact operators satisfying the Lasota–Yorke inequality on $B_{tree, σ}$, one obtains a complete spectral decomposition

P = \sum_{| λ_{i} | > ρ_{ess} (P)} λ_{i} Π_{i} + N, ρ_{ess} (P) < 1,

together with an operator zeta function

ζ_{P} (s) = det {(I - s P)}^{- 1} = exp (\sum_{k \geq 1} \frac{s^{k}}{k} Tr (P^{k})),

whose poles correspond to eigenvalues of P and to resonances of $L_{s}$. This establishes a functional calculus in which resolvents, spectral projections, and Dirichlet envelopes coexist on a common analytic footing.
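The determinant–trace identity behind $ζ_{P}$ can be verified numerically for a finite matrix, where every trace is defined. The sketch below uses a hypothetical 2×2 matrix `P` and a value `S` with $| s | ρ (P) < 1$ so that the exponential series converges; in the infinite-dimensional setting the traces $Tr (P^{k})$ require additional justification that this toy check does not address.

```python
import math


def mat_mul(a, b):
    """Product of two square matrices given as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def trace(a):
    return sum(a[i][i] for i in range(len(a)))


def zeta_from_traces(p, s, terms=80):
    """exp( sum_{k=1}^{terms} s^k / k * Tr(p^k) ), truncated series."""
    total, power = 0.0, [row[:] for row in p]
    for k in range(1, terms + 1):
        total += s ** k / k * trace(power)
        power = mat_mul(power, p)
    return math.exp(total)


def zeta_from_det_2x2(p, s):
    """1 / det(I - s p) for a 2x2 matrix, via 1 - s * tr + s^2 * det."""
    tr = p[0][0] + p[1][1]
    det = p[0][0] * p[1][1] - p[0][1] * p[1][0]
    return 1.0 / (1.0 - s * tr + s * s * det)


# Hypothetical matrix and parameter, chosen so the series converges quickly.
P = [[0.5, 0.2],
     [0.1, 0.3]]
S = 0.5
print(zeta_from_traces(P, S), zeta_from_det_2x2(P, S))
```

Both expressions agree to floating-point accuracy, and the poles of $s \mapsto det {(I - s P)}^{- 1}$ sit at the reciprocals of the eigenvalues of the matrix.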

Beyond the Collatz operator, the same structure appears for general affine–congruence systems

n ⟼ a_{j} n + b_{j}, a_{j}, b_{j} \in N,

where

(P f) (m) = \sum_{j} 1_{{m \equiv b_{j} (mod a_{j})}} f (\frac{m - b_{j}}{a_{j}}),

and the corresponding Dirichlet operators $L_{s}$ act by weighted composition on generating series. A unified spectral calculus would classify such arithmetic systems according to whether their backward operators are quasi-compact, admit meromorphic decompositions, or possess spectral gaps on natural Banach geometries. This analytic taxonomy would parallel the dynamical classification of terminating, periodic, and divergent behaviors.
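The backward operator of an affine–congruence system is short to implement. In the sketch below the branch list `BRANCHES` is a hypothetical example, and preimages are restricted to positive integers, which is an assumption about the domain $\mathbb{N}$ rather than something stated in the formula.

```python
def affine_transfer(f, m, branches):
    """(P f)(m) = sum over branches (a, b) with m ≡ b (mod a)
    of f((m - b) / a)."""
    total = 0
    for a, b in branches:
        q, r = divmod(m - b, a)
        if r == 0 and q >= 1:          # assumption: preimages must lie in {1, 2, ...}
            total += f(q)
    return total


# Hypothetical two-branch system: n -> 2n and n -> 3n + 1.
BRANCHES = [(2, 0), (3, 1)]
identity = lambda n: n

for m in (1, 4, 6, 7):
    print(m, affine_transfer(identity, m, BRANCHES))
```

For m = 4 both branches contribute (preimages 2 and 1), while m = 7 has only the preimage 2 under $n \mapsto 3 n + 1$.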

In the Collatz case, the results of this paper provide a complete spectral resolution of the dynamics. The backward operator P on arithmetic functions and its Dirichlet realization $L_{s}$ together form a prototype of an arithmetic transfer operator in which dynamical behavior is reflected by analytic continuation and spectral gaps. The contraction of $L_{s}$ for $ℜ (s) > 1$ and the explicit Lasota–Yorke inequality on $B_{tree}$ with $λ < 1$ imply that P is quasi-compact with a genuine spectral gap. Consequently, the Dirichlet series $ζ_{C} (s, k)$ admit uniform pole–remainder decompositions and, under the invariant-functional hypothesis of Theorem 10, every Collatz orbit terminates. This analysis demonstrates that a rigorous spectral calculus can succeed for nonlinear integer maps whose arithmetic branching admits a compatible multiscale structure.

Boundary Spectral Geometry and Parameter Optimization

Theorems 3 and 1 show that the Lasota–Yorke inequality on $B_{tree}$ enforces a strict spectral gap at the critical boundary $σ = 1$. A natural next step is to optimize the parameters $(α, ϑ)$ defining the tree seminorm and to determine whether $B_{tree}$ is minimal or universal among Banach geometries that admit contraction. A quantitative analysis of

{∥ P f ∥}_{tree} \leq C_{P} (λ {| f |}_{tree} + {∥ f ∥}_{1})

may reveal how $λ$ depends on $ϑ$ and how this dependence reflects asymmetries in the Collatz preimage tree. Establishing the limit $λ (ϑ) \to 0$ as $ϑ \to 0$ would link the analytic constants to the combinatorial entropy of inverse trajectories, completing the correspondence between scale resolution and termination rate.

Residues, Duality, and Forward–Backward Correspondence

The residue coefficients $A_{k} (1)$, which decay as $λ^{k}$, represent spectral invariants of the pole part of $ζ_{C} (s, k)$. On the forward side, the heuristic contraction ${(3 / 4)}^{k}$ describes the typical reduction in integer size under iteration. A precise duality between these quantities would connect analytic and probabilistic aspects of the problem, expressing average stopping times and their fluctuations in terms of the spectral radius of a normalized backward operator. Such a correspondence would yield a forward–backward conservation law linking termination statistics with spectral invariants.
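The forward heuristic can be checked empirically. The sketch below assumes the usual reading of the $3 / 4$ factor, namely the geometric-mean growth of the odd-to-odd (Syracuse) step $n \mapsto (3 n + 1) / 2^{v}$, where $2^{v}$ is the largest power of 2 dividing $3 n + 1$; the text does not specify this normalization, and the cutoff $2^{16}$ is arbitrary.

```python
import math


def syracuse(n):
    """Odd-to-odd accelerated step: 3n + 1 with all factors of 2 removed."""
    m = 3 * n + 1
    while m % 2 == 0:
        m //= 2
    return m


def mean_log_ratio(limit):
    """Average of log(syracuse(n) / n) over odd n below limit."""
    odds = range(1, limit, 2)
    return sum(math.log(syracuse(n) / n) for n in odds) / len(odds)


avg = mean_log_ratio(1 << 16)
print(avg, math.log(3 / 4))
```

The empirical mean lies close to $\log (3 / 4)$, because the exponent v is distributed nearly geometrically with mean 2 over odd n, which is the heuristic source of the factor ${(3 / 4)}^{k}$ after k odd steps.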

Extensions and Universality

The redesigned multiscale tree space, equipped with a hybrid $ℓ^{1}$–oscillation norm, closes the analytic loop for the spectral gap; the termination statement of Theorem 10 still rests on the invariant-functional hypothesis. Further work may examine the metric entropy and measure concentration properties induced by the tree metric, seeking universal scaling laws for optimal weights or identifying extremal systems among those with $λ < 1$. Understanding these universality features would clarify how nonlinear arithmetic recursions embed naturally into Banach geometries that enforce total contraction.

Dynamical Dirichlet Zeta Functions

The series

ζ_{C} (s, k) = \sum_{n \geq 1} \frac{1}{{(C^{k} (n))}^{s}}

is one instance of a broader class of dynamical Dirichlet zeta functions $ζ_{T} (s, k)$ associated with iterates of arithmetic maps having finitely many inverse branches. Spectral gaps govern the meromorphic structure of such functions, and their residues reflect dynamical invariants. Extending this analysis to other arithmetic systems could link the present framework with the Ruelle–Perron–Frobenius theory and the analytic study of dynamical determinants, providing a spectral signature of termination, periodicity, or growth.
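Partial sums of $ζ_{C} (s, k)$ are easy to tabulate. The sketch below assumes that C denotes the Collatz map T of equation (1) and truncates the series at a hypothetical cutoff `terms`; it says nothing about analytic continuation, only about the region where the series converges.

```python
def collatz(n):
    """Forward Collatz map, taken here as the map C in zeta_C (assumption)."""
    return n // 2 if n % 2 == 0 else 3 * n + 1


def iterate(n, k):
    """k-fold iterate C^k(n)."""
    for _ in range(k):
        n = collatz(n)
    return n


def zeta_c_partial(s, k, terms):
    """Partial sum: sum of C^k(n)^(-s) for n = 1, ..., terms."""
    return sum(iterate(n, k) ** (-s) for n in range(1, terms + 1))


print(zeta_c_partial(2.0, 0, 1000))    # k = 0 reduces to a partial zeta(2) sum
print(zeta_c_partial(1.0, 1, 3))       # 1/4 + 1/1 + 1/10
print(zeta_c_partial(2.0, 3, 1000))
```

Since $C^{k} (n) \geq n / 2^{k}$, each term is at most $2^{k s} n^{- s}$ for real $s > 0$, so the series converges for $s > 1$ and every fixed k.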

Broader Outlook

The spectral resolution of the Collatz dynamics establishes a new bridge between number theory and dynamical systems. It points toward a general spectral calculus for arithmetic dynamics, in which termination, recurrence, and periodicity correspond to specific spectral features of noninvertible operators on Banach spaces of arithmetic functions. Future work should clarify how universal the Lasota–Yorke mechanism is among nonlinear recursions, how arithmetic symmetries influence spectral gaps, and how probabilistic models of integer iteration emerge as weak limits of deterministic transfer operators. The Collatz operator here serves as a detailed worked example in which a complete spectral resolution is obtained through an explicit Lasota–Yorke framework on a multiscale Banach space.

References

[1] D. Applegate and J. C. Lagarias. Density bounds for the 3x+1 problem. Experimental Mathematics, 14(2):129–146, 2005.
[2] P. Baldi. Dynamical zeta functions and transfer operators. Discrete and Continuous Dynamical Systems, 8(2):227–241, 2002.
[3] H. Delange. Généralisation du théorème de Wiener–Ikehara. Annales Scientifiques de l’École Normale Supérieure (3), 69:35–74, 1952. Classic Tauberian extension of the Wiener–Ikehara theorem, now known as the Wiener–Ikehara–Delange theorem.
[4] H. Hennion. Sur un théorème spectral et son application aux noyaux lipschitziens. Proceedings of the American Mathematical Society, 118(2):627–634, 1993.
[5] J. Hilgert and D. Mayer. The dynamical zeta function and transfer operators for the Kac–Baker model. Communications in Mathematical Physics, 208:481–507, 2000.
[6] C. T. Ionescu Tulcea and G. Marinescu. Théorie ergodique pour des classes d’opérations non complètement continues. Annals of Mathematics, 52:140–147, 1950.
[7] J. C. Lagarias. The 3x+1 problem and its generalizations. The American Mathematical Monthly, 92(1):3–23, 1985.
[8] J. C. Lagarias. The Collatz conjecture: A self-contained introduction. The American Mathematical Monthly, 116(10):899–928, 2009.
[9] A. Lasota and J. A. Yorke. On the existence of invariant measures for piecewise monotonic transformations. Transactions of the American Mathematical Society, 186:481–488, 1973.
[10] J. Leventides and C. Poulios. An operator theoretic approach to the 3x + 1 dynamical system. IFAC-PapersOnLine, 54(9):225–230, 2021. 24th International Symposium on Mathematical Theory of Networks and Systems MTNS 2020.
[11] G. Meinardus. Some analytic aspects concerning the Collatz problem. Technical Report 261, Universität Mannheim, Fakultät für Mathematik und Informatik, 2001.
[12] M. Neklyudov. Functional analysis approach to the Collatz conjecture. arXiv preprint arXiv:2106.11859, 2022.
[13] D. Ruelle. Statistical mechanics of a one-dimensional lattice gas. Communications in Mathematical Physics, 9:267–278, 1968.
[14] D. Ruelle. A measure associated with Axiom A attractors. American Journal of Mathematics, 98(3):619–654, 1976.
[15] R. Terras. A stopping time problem on the positive integers. Acta Arithmetica, 30(3):241–252, 1976.
[16] R. Terras. On the existence of a density. Acta Arithmetica, 35(1):101–102, 1979.

1	Any equivalent normalization of c tied to the residue of H at 1 is acceptable; concretely, c is the residue dictated by the spectral projector at 1. The positivity $c > 0$ follows from $ϕ \geq 0$ and $h > 0$ .

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
