The Collatz Conjecture and the Spectral Calculus for Arithmetic Dynamics

James Hateley

doi:10.20944/preprints202511.1440.v2

Submitted:

25 November 2025

Posted:

25 November 2025

Read the latest preprint version here

Abstract

We develop an operator--theoretic framework for the Collatz map based on its backward transfer operator acting on weighted Banach spaces of arithmetic functions. The associated Dirichlet transforms form a holomorphic family that captures the complex--analytic evolution of iterates and admits a decomposition into a zeta--type pole at $s=1$ and a holomorphic remainder. Within a finer multiscale space adapted to the Collatz preimage tree, we establish a Lasota--Yorke inequality with an explicit contraction constant $\lambda<1$, giving quasi--compactness and a spectral gap at the dominant eigenvalue. The resulting invariant density is strictly positive and exhibits a $c/n$ decay profile. We formulate a general criterion showing that, under a verified quasi--compactness hypothesis with isolated eigenvalue $1$, the forward dynamics admit no infinite trajectories. The framework provides a coherent spectral perspective on the Collatz operator and suggests a broader analytic approach to arithmetic dynamical systems.

Keywords:

Collatz conjecture

;

transfer operators

;

Lasota–Yorke inequality

;

invariant densities

;

dirichlet transforms

;

nonlinear integer dynamics

;

quasi-compactness

Subject:

Computer Science and Mathematics - Algebra and Number Theory

1. Introduction

The Collatz conjecture asserts that every positive integer n eventually reaches the 1–2 cycle under repeated application of

T (n) = \{\begin{matrix} n / 2, & n even, \\ 3 n + 1, & n odd . \end{matrix}

(1)

Equivalently, every forward orbit

O^{+} (n) = {T^{k} (n) : k \geq 0}

is conjectured to terminate in

{1, 2}

. Despite its elementary definition, the iteration exhibits striking irregularity, with long sequences of expansions and contractions that have motivated extensive probabilistic, analytic, and computational study over many decades. Classical work of Terras [1,2] established early density results and stopping-time estimates, while the surveys of Lagarias [3,4] synthesized a wide range of heuristic and structural approaches. Subsequent analytic contributions, including those of Meinardus [5] and Applegate–Lagarias [6], have developed refined density bounds and asymptotic estimates for the distribution of orbits. Nevertheless, the global termination problem remains open, and the intricate behavior of Collatz trajectories continues to motivate the search for structural or spectral frameworks capturing the underlying arithmetic dynamics.

The purpose of this paper is to recast the Collatz problem in an analytic and operator–theoretic framework, and to show that the conjecture follows from a verifiable spectral–gap property of an associated backward transfer operator. Instead of studying T directly, we analyze its inverse dynamics through the operator

(P f) (n) : = \sum_{m : T (m) = n} \frac{f (m)}{m},

(2)

acting on arithmetic functions

f : N \to C

. Transfer–operator methods of this type originate in statistical mechanics and dynamical systems [7,8], and have more recently been applied to

3 x + 1

–type maps in various analytic and functional–analytic contexts [9,10]. For the Collatz map (1), each n has an even preimage

2 n

and an additional odd preimage

(n - 1) / 3

whenever

n \equiv 4 (mod 6)

, giving

(P f) (n) = \frac{f (2 n)}{2 n} + 1_{{n \equiv 4 (mod 6)}} \frac{f ((n - 1) / 3)}{(n - 1) / 3} .

(3)

The weights

1 / m

normalize the operator so that P acts as a mass–preserving average on non-negative

ℓ^{1}

sequences, reflecting the logarithmic contraction inherent in the preimage structure of T.

Remark 1.1

(Invariant density and logarithmic mass balance). Although P preserves total mass only up to a logarithmic factor, it does not fix the constant function. Indeed,

(P 1) (n) = \frac{1}{2 n} + 1_{{n \equiv 4 (mod 6)}} \frac{3}{n - 1} \sim \frac{C}{n} (n \to \infty),

so

(P 1) \neq 1

. More generally,

\sum_{n \geq 1} (P f) (n) = \sum_{m \geq 1} \frac{f (m)}{m},

(4)

which shows that P is logarithmically mass–preserving: the pushforward of mass is reweighted by the harmonic kernel

m \mapsto 1 / m

.

This logarithmic balance forces any P–invariant density h to satisfy

P h = h

with a decay of order

1 / n

as

n \to \infty

. In particular, the explicit block recursion developed in Section 5.2, together with the oscillation control provided by the Lasota–Yorke inequality [11], yields the precise asymptotic profile

h (n) \sim \frac{c}{n}, n \to \infty,

consistent with Tauberian heuristics of Delange type [12]. All spectral decompositions in the sequel are expressed relative to this nonconstant

1 / n

–type invariant profile.

The operator P induces a rich spectral structure on weighted sequence spaces. On

ℓ_{σ}^{1}

, defined by

{∥ f ∥}_{σ} = \sum_{n \geq 1} | f (n) | n^{- σ}

, the Dirichlet transform

D f (s) = \sum_{n \geq 1} \frac{f (n)}{n^{s}},

(5)

intertwines P with analytic continuation in the half-plane

ℜ (s) > σ

. Uniform

ℓ_{σ}^{1}

bounds on

P^{k}

translate into exponential envelopes for

D (P^{k} f) (s)

and yield meromorphic continuations of the corresponding Collatz–Dirichlet series, whose pole at

s = 1

reflects the average branching behavior [13,14]. The spectral radius of P on

ℓ_{σ}^{1}

captures the global weighted expansion rate of inverse branches and determines the analytic location of dominant singularities.

To resolve finer dynamical properties, we refine this setting to a multiscale Banach space

B_{tree, σ}

built from dyadic–triadic block averages and oscillation seminorms that encode the hierarchical structure of the Collatz preimage tree. On this space, P satisfies a two-norm Lasota–Yorke inequality,

{[P f]}_{tree, σ} \leq λ_{LY} {[f]}_{tree, σ} + C {∥ f ∥}_{σ}, 0 < λ_{LY} < 1,

placing the dynamics within the classical Ionescu–Tulcea–Marinescu and Hennion spectral frameworks for quasi–compact operators [15,16]. The precise Lasota–Yorke bounds, including the explicit contraction of the odd branch, are developed in Section 4, Section 5 and Section 6.

The main theorem of the paper establishes that when the odd-branch contraction constant

λ_{odd} (α, ϑ)

satisfies

λ_{odd} < 1

for specific parameters

(α, ϑ) = (\frac{1}{2}, \frac{1}{5})

, the backward Collatz operator P possesses a strict spectral gap on

B_{tree, σ}

. The spectral decomposition then implies that every invariant measure of P is supported on the 1–2 cycle, ruling out any positive-density family of divergent or periodic orbits. A strengthened criterion shows that a non-trivial invariant functional in

B_{tree, σ}^{*}

would contradict the spectral gap, hence all Collatz trajectories must terminate.

The remainder of the paper is organized as follows. Section 2 establishes notation and basic properties of the weighted

ℓ_{σ}^{1}

spaces together with the associated Dirichlet transforms. Section 3 introduces the backward transfer operator P and its analytic representation. Section 4 constructs the multiscale space

B_{tree, σ}

adapted to the Collatz preimage tree and proves the corresponding Lasota–Yorke inequalities. Section 6 verifies that the odd branch admits an explicit contraction constant

λ_{odd} < 1

for the chosen parameters, yielding quasi–compactness and a spectral gap. Finally, Section 7 develops the resulting spectral consequences, formulating a general criterion that links quasi–compactness with the absence of infinite forward trajectories, and situating the Collatz operator within a broader analytical framework for arithmetic dynamical systems.

2. Preliminaries

The analysis begins with a careful description of the function spaces, Dirichlet transforms, and basic structural features of the Collatz map that underlie the spectral study of the backward operator P. Throughout we work with complex-valued arithmetic functions

f : N \to C

. We start with a simple unbounded estimate.

Lemma 2.1

(Coarse k-step envelopes). Let

T : N \to N

denote the Collatz map (1). For every

n \in N

and

k \in N_{0}

,

\frac{n}{2^{k}} \leq T^{k} (n) \leq 3^{k} n + \frac{3^{k} - 1}{2} .

(6)

Proof.

For every

m \geq 1

, the definition of T gives

\frac{m}{2} \leq T (m) \leq 3 m + 1 .

Iterating the lower bound yields

T^{k} (n) \geq n / 2^{k}

. For the upper bound, the recurrence

T^{k + 1} (n) \leq 3 T^{k} (n) + 1

immediately gives, by a simple induction on k, the explicit estimate

T^{k} (n) \leq 3^{k} n + (3^{k} - 1) / 2

. This proves (6). □

These envelopes are intentionally crude, yet they ensure that forward iterates of typical arithmetic weights remain controlled on the scales relevant for our Dirichlet and transfer-operator analysis.

2.1. Weighted $ℓ^{1}$ spaces and Dirichlet transforms

For

σ > 0

we define the weighted

ℓ^{1}

space

ℓ_{σ}^{1} : = \{f : N \to {C : ∥ f ∥}_{σ} : = \sum_{n \geq 1} \frac{| f (n) |}{n^{σ}} < \infty\} .

(7)

The weight exponent

σ

measures polynomial decay and is chosen so that Dirichlet series associated with f converge absolutely in a half-plane

ℜ (s) > σ

.

Given

f \in ℓ_{σ}^{1}

, we define its Dirichlet transform

D f (s) : = \sum_{n \geq 1} \frac{f (n)}{n^{s}}, ℜ (s) > σ .

(8)

Lemma 2.2

(Dirichlet convergence). Let

σ > 0

and let

f \in ℓ_{σ}^{1}

, so that

{∥ f ∥}_{σ} : = \sum_{n \geq 1} \frac{| f (n) |}{n^{σ}} < \infty .

Then the Dirichlet transform

D f (s) : = \sum_{n \geq 1} \frac{f (n)}{n^{s}}

converges absolutely for

ℜ (s) > σ

and defines a bounded holomorphic function on every half-plane

ℜ (s) \geq σ + ε

,

ε > 0

. Moreover,

| D f (s) | \leq {∥ f ∥}_{σ} sup_{n \geq 1} n^{σ - ℜ (s)} = {∥ f ∥}_{σ} (ℜ (s) > σ) .

(9)

Proof.

Let

s \in C

with

ℜ (s) > σ

. Then

\sum_{n \geq 1} |\frac{f (n)}{n^{s}}| = \sum_{n \geq 1} \frac{| f (n) |}{n^{ℜ (s)}} = \sum_{n \geq 1} \frac{| f (n) |}{n^{σ}} n^{σ - ℜ (s)} .

Since

ℜ (s) > σ

implies

σ - ℜ (s) < 0

, the sequence

n^{σ - ℜ (s)}

is decreasing to 0, and hence

sup_{n \geq 1} n^{σ - ℜ (s)} = 1 .

Therefore,

\sum_{n \geq 1} |\frac{f (n)}{n^{s}}| \leq {∥ f ∥}_{σ} < \infty,

so the Dirichlet series converges absolutely.

For every

ε > 0

, the same bound holds uniformly on the half-plane

ℜ (s) \geq σ + ε

, since then

σ - ℜ (s) \leq - ε

and

n^{σ - ℜ (s)} \leq n^{- ε} \to 0

as

n \to \infty

. Thus the convergence is locally uniform in

ℜ (s) \geq σ + ε

, and classical Dirichlet-series theory implies that

D f

is holomorphic on this region.

The bound (9) follows directly from the estimate above. □

We write

ℓ^{1} = ℓ_{0}^{1}

for the unweighted space with norm

{∥ f ∥}_{1} = \sum_{n \geq 1} | f (n) |

.

2.2. Backward Preimages and the Transfer Recursion

For each

n \geq 1

, define the even and odd preimage sets

E (n) : = {m \in N : T (m) = n, m even}, O (n) : = {m \in N : T (m) = n, m odd} .

Lemma 2.3

(Preimage structure). For every

n \in N

,

E (n) = {2 n}, O (n) = \{\begin{matrix} {(n - 1) / 3}, & n \equiv 4 (mod 6), \\ ⌀, & otherwise, \end{matrix}

(10)

and in the first case

(n - 1) / 3

is odd. In particular, each n has either one preimage (even) or two preimages (one even and one odd), and the odd preimage occurs with natural density

1 / 6

.

Proof.

If m is even and

T (m) = n

, then

m / 2 = n

, so

m = 2 n

, establishing

E (n) = {2 n}

.

If m is odd and

T (m) = n

, then

3 m + 1 = n

, so

m = (n - 1) / 3

. This is an integer precisely when

n \equiv 1 (mod 3)

. For m to be odd,

n - 1

must be divisible by 3 but not by 6, so

n \equiv 4 (mod 6)

. In that case

(n - 1) / 3

is odd. The density statement follows since the congruence class

n \equiv 4 (mod 6)

has natural density

1 / 6

. □

Hence each n admits exactly one even preimage and possibly one odd preimage when

n \equiv 4 (mod 6)

. The corresponding backward transfer operator is defined as

(P f) (n) : = \sum_{m : T (m) = n} \frac{f (m)}{m} = \frac{f (2 n)}{2 n} + 1_{{n \equiv 4 (6)}} \frac{f (\frac{n - 1}{3})}{(n - 1) / 3} .

(11)

The normalization by

1 / m

reflects the logarithmic contraction of the forward map and ensures a natural mass-balance property.

Lemma 2.4

(Weighted mass preservation). Let

f : N \to [0, \infty)

satisfy

\sum_{m \geq 1} \frac{f (m)}{m} < \infty .

Then the backward transfer operator

(P f) (n) : = \sum_{m : T (m) = n} \frac{f (m)}{m}

preserves the weighted mass in the sense that

\sum_{n \geq 1} (P f) (n) = \sum_{m \geq 1} \frac{f (m)}{m} .

(12)

Proof.

Since

f \geq 0

and

\sum_{m \geq 1} f (m) / m < \infty

, Tonelli’s theorem justifies rearranging the nonnegative double series. Using the definition of P,

\sum_{n \geq 1} (P f) (n) = \sum_{n \geq 1} \sum_{m : T (m) = n} \frac{f (m)}{m} .

Each

m \geq 1

has exactly one image

T (m)

, so it appears in exactly one of the inner sums. Hence we can rewrite the double sum directly over m:

\sum_{n \geq 1} \sum_{m : T (m) = n} \frac{f (m)}{m} = \sum_{m \geq 1} \frac{f (m)}{m},

which is precisely (12). □

2.3. Dirichlet Envelope for Iterates of the Backward Operator

The preimage structure allows a crude but useful bound on P acting on

ℓ_{σ}^{1}

.

Proposition 2.5

(Backward operator bound). Let

σ > 0

and let P be defined by (11). Then

P : ℓ_{σ}^{1} \to ℓ_{σ}^{1}

is bounded and

{∥ P f ∥}_{σ} \leq C_{σ} {∥ f ∥}_{σ}, C_{σ} : = 2^{σ} + 3^{- σ},

(13)

for all

f \in ℓ_{σ}^{1}

. Consequently, for every

k \geq 1

,

∥ P^{k} {f ∥}_{σ} \leq C_{σ}^{k} {∥ f ∥}_{σ} .

(14)

Proof.

From (11),

(P f) (n) = \frac{f (2 n)}{2 n} + 1_{{n \equiv 4 (6)}} \frac{f (\frac{n - 1}{3})}{(n - 1) / 3} .

Hence

{∥ P f ∥}_{σ} \leq S_{even} + S_{odd},

with

S_{even} : = \sum_{n \geq 1} \frac{| f (2 n) |}{2 n n^{σ}}, S_{odd} : = \sum_{\begin{matrix} n \geq 1 \\ n \equiv 4 (6) \end{matrix}} \frac{|f (\frac{n - 1}{3})|}{(\frac{n - 1}{3}) n^{σ}} .

For the even branch, set

m = 2 n

, so

n = m / 2

and

S_{even} = \sum_{\begin{matrix} m \geq 1 \\ m even \end{matrix}} \frac{| f (m) |}{m {(m / 2)}^{σ}} = \sum_{\begin{matrix} m \geq 1 \\ m even \end{matrix}} \frac{2^{σ} | f (m) |}{m^{σ + 1}} \leq 2^{σ} \sum_{m \geq 1} \frac{| f (m) |}{m^{σ}} = 2^{σ} {∥ f ∥}_{σ} .

For the odd branch, write

m = (n - 1) / 3

, so

n = 3 m + 1

and m is odd. Then

S_{odd} = \sum_{\begin{matrix} m \geq 1 \\ m odd \end{matrix}} \frac{| f (m) |}{m {(3 m + 1)}^{σ}} \leq \sum_{m \geq 1} \frac{| f (m) |}{m {(3 m)}^{σ}} = 3^{- σ} \sum_{m \geq 1} \frac{| f (m) |}{m^{σ + 1}} \leq 3^{- σ} {∥ f ∥}_{σ} .

Combining the two estimates gives (13), and iterating yields (14). □

The constant

C_{σ} = 2^{σ} + 3^{- σ}

is an explicit growth factor for P on

ℓ_{σ}^{1}

. It is not

< 1

in this normalization, so no contraction is claimed at this level. The genuine contraction mechanism is obtained later on the multiscale Banach space

B_{tree}

, where a strong seminorm captures oscillatory decay along the Collatz tree while the

ℓ^{1}

component provides compactness.

3. Transfer Operator Formulation

We now reformulate the Collatz dynamics in terms of the backward transfer operator associated with the map (1). This operator-theoretic viewpoint provides an analytic bridge between the discrete recurrence and the functional framework developed in later sections. The transfer operator encodes the inverse–branching structure of the map and propagates densities backward along the Collatz tree, in a form compatible with logarithmic weighting and Dirichlet series.

Recall that the Collatz map, (1), by Lemma 2.3, each

n \geq 1

has the even preimage

2 n

, together with an additional odd preimage

(n - 1) / 3

precisely when

n \equiv 4 (mod 6)

.

3.1. Backward Transfer Operator

Definition 3.1

(Backward transfer operator). For an arithmetic function

f : N \to C

, define

(P f) (n) : = \sum_{m : T (m) = n} \frac{f (m)}{m} = \frac{f (2 n)}{2 n} + 1_{{n \equiv 4 (6)}} \frac{f (\frac{n - 1}{3})}{(n - 1) / 3}, n \in N,

(15)

where

1_{A}

denotes the indicator of the condition A.

Lemma 3.2

(Dirichlet transform intertwining). Let

f \in ℓ_{σ}^{1}

with

σ > 1

, and define

D (f) (s) = \sum_{n \geq 1} f (n) n^{- s} .

For

ℜ (s) > σ

, the series converges absolutely and

D (P f) (s) = L_{s} D (f) (s),

where the multiplier

L_{s}

encodes the contribution of the two inverse branches of T:

L_{s} z = 2^{- 1 - s} z + 3^{- 1 - s} z \cdot 1_{{m \equiv 1 (3)}} .

Indeed,

D (P f) (s) = \sum_{n \geq 1} \sum_{m : T (m) = n} \frac{f (m)}{m} n^{- s} = \sum_{m \geq 1} f (m) m^{- 1 - s} (2^{s} 1_{m \equiv 0 (2)} + 3^{s} 1_{m \equiv 1 (3)}) = L_{s} D (f) (s) .

Proof.

Fix

f \in ℓ_{σ}^{1}

with

σ > 1

. By definition of the

ℓ_{σ}^{1}

-norm,

\sum_{n \geq 1} | f (n) | n^{- σ} < \infty .

If

ℜ (s) > σ

, then

n^{- ℜ (s)} \leq n^{- σ}

, so

\sum_{n \geq 1} | f (n) | n^{- ℜ (s)} \leq \sum_{n \geq 1} | f (n) | n^{- σ} < \infty .

Thus

D (f) (s) = \sum_{n \geq 1} f (n) n^{- s}

converges absolutely for

ℜ (s) > σ

.

Next we show that

D (P f) (s)

converges absolutely for the same range. From the definition of P,

(P f) (n) = \frac{f (2 n)}{2 n} + 1_{{n \equiv 4 (\mod 6)}} \frac{f ((n - 1) / 3)}{(n - 1) / 3},

so

| P f (n) | \leq \frac{| f (2 n) |}{2 n} + 1_{{n \equiv 4 (\mod 6)}} \frac{| f ((n - 1) / 3) |}{(n - 1) / 3} .

Hence

\sum_{n \geq 1} | P f (n) | n^{- ℜ (s)} \leq S_{even} + S_{odd},

where

S_{even} : = \sum_{n \geq 1} \frac{| f (2 n) |}{2 n} n^{- ℜ (s)}, S_{odd} : = \sum_{\begin{matrix} n \geq 1 \\ n \equiv 4 (6) \end{matrix}} \frac{| f ((n - 1) / 3) |}{(n - 1) / 3} n^{- ℜ (s)} .

For the even contribution, set

m = 2 n

so

n = m / 2

and m is even. Then

S_{even} = \sum_{\begin{matrix} m \geq 1 \\ m even \end{matrix}} \frac{| f (m) |}{m} {(\frac{m}{2})}^{- ℜ (s)} = 2^{ℜ (s)} \sum_{\begin{matrix} m \geq 1 \\ m even \end{matrix}} | f (m) | m^{- 1 - ℜ (s)} .

Since

ℜ (s) > σ

implies

ℜ (s) + 1 > σ

, we have

m^{- 1 - ℜ (s)} \leq m^{- σ}

, and therefore

S_{even} \leq 2^{ℜ (s)} \sum_{\begin{matrix} m \geq 1 \\ m even \end{matrix}} | f (m) | m^{- σ} \leq 2^{ℜ (s)} \sum_{m \geq 1} | f (m) | m^{- σ} < \infty .

For the odd contribution, write

n = 3 k + 1

with

k \geq 1

odd (this is equivalent to

n \equiv 4 (mod 6)

and

(n - 1) / 3 = k

odd). Then

S_{odd} = \sum_{\begin{matrix} k \geq 1 \\ k odd \end{matrix}} \frac{| f (k) |}{k} {(3 k + 1)}^{- ℜ (s)} .

Since

3 k + 1 \geq k

for all

k \geq 1

, we have

{(3 k + 1)}^{- ℜ (s)} \leq k^{- ℜ (s)}

, and hence

S_{odd} \leq \sum_{\begin{matrix} k \geq 1 \\ k odd \end{matrix}} | f (k) | k^{- 1 - ℜ (s)} \leq \sum_{k \geq 1} | f (k) | k^{- 1 - ℜ (s)} .

Again

ℜ (s) + 1 > σ

gives

k^{- 1 - ℜ (s)} \leq k^{- σ}

, so

S_{odd} \leq \sum_{k \geq 1} | f (k) | k^{- σ} < \infty .

Thus

S_{even} + S_{odd} < \infty

, and

D (P f) (s)

converges absolutely for

ℜ (s) > σ

.

We now compute

D (P f) (s)

explicitly and identify it with

(L_{s} D (f)) (s)

. By definition,

D (P f) (s) = \sum_{n \geq 1} (P f) (n) n^{- s} .

Substituting the formula for P and splitting according to the two branches,

D (P f) (s) = \sum_{n \geq 1} \frac{f (2 n)}{2 n} n^{- s} + \sum_{\begin{matrix} n \geq 1 \\ n \equiv 4 (6) \end{matrix}} \frac{f ((n - 1) / 3)}{(n - 1) / 3} n^{- s} .

For the even part, set again

m = 2 n

:

\sum_{n \geq 1} \frac{f (2 n)}{2 n} n^{- s} = \sum_{\begin{matrix} m \geq 1 \\ m even \end{matrix}} \frac{f (m)}{m} {(\frac{m}{2})}^{- s} = 2^{s} \sum_{\begin{matrix} m \geq 1 \\ m even \end{matrix}} f (m) m^{- 1 - s} .

For the odd part, write

n = 3 k + 1

with

k \geq 1

odd and

(n - 1) / 3 = k

:

\sum_{\begin{matrix} n \geq 1 \\ n \equiv 4 (6) \end{matrix}} \frac{f ((n - 1) / 3)}{(n - 1) / 3} n^{- s} = \sum_{\begin{matrix} k \geq 1 \\ k odd \end{matrix}} \frac{f (k)}{k} {(3 k + 1)}^{- s} .

Putting the two contributions together,

D (P f) (s) = 2^{s} \sum_{\begin{matrix} m \geq 1 \\ m even \end{matrix}} f (m) m^{- 1 - s} + \sum_{\begin{matrix} k \geq 1 \\ k odd \end{matrix}} f (k) k^{- 1} {(3 k + 1)}^{- s} .

Now let

F (s) = D (f) (s) = \sum_{n \geq 1} a_{n} n^{- s}

with

a_{n} = f (n)

. By definition of

L_{s}

in the lemma,

(L_{s} F) (s) = 2^{s} \sum_{\begin{matrix} m \geq 1 \\ m even \end{matrix}} a_{m} m^{- 1 - s} + \sum_{\begin{matrix} k \geq 1 \\ k odd \end{matrix}} a_{k} k^{- 1} {(3 k + 1)}^{- s},

and with

a_{n} = f (n)

this matches exactly the expression we have obtained for

D (P f) (s)

. Hence

D (P f) (s) = (L_{s} D (f)) (s)

for all

ℜ (s) > σ

, as claimed. □

The multiplicative factor

1 / m

assigns to each inverse branch a logarithmic weight, so that P acts as a normalized backward average along preimages. This normalization aligns the discrete dynamics with Dirichlet weights and will be crucial for analytic continuation and spectral estimates below.

Positivity. If

f (n) \geq 0

for all n, then

(P f) (n) \geq 0

for all n, since P is a positive linear combination of values of f.

Weighted mass preservation. A direct change of variables shows that for every nonnegative f satisfying

\sum_{m \geq 1} | f (m) | / m < \infty

,

\sum_{n \geq 1} (P f) (n) = \sum_{m \geq 1} \frac{f (m)}{m} .

(16)

Thus P preserves the logarithmically weighted mass

\sum f (m) / m

; plain

ℓ^{1}

mass is not preserved under this normalization.

Boundedness on weighted spaces. Let

ℓ_{σ}^{1} : = \{f : N \to {C : ∥ f ∥}_{ℓ_{σ}^{1}} : = \sum_{n \geq 1} \frac{| f (n) |}{n^{σ}} < \infty\}, σ > 0 .

A direct change of variables in (15) yields, for all

f \in ℓ_{σ}^{1}

,

\begin{matrix} {∥ P f ∥}_{ℓ_{σ}^{1}} & = \sum_{n \geq 1} \frac{| (P f) (n) |}{n^{σ}} \leq \sum_{n \geq 1} (\frac{| f (2 n) |}{2 n^{1 + σ}} + 1_{{n \equiv 4 (6)}} \frac{|f ((n - 1) / 3)|}{{((n - 1) / 3)}^{1 + σ}}) \\ = \frac{1}{2} \sum_{n \geq 1} \frac{| f (2 n) |}{n^{1 + σ}} + 3^{1 + σ} \sum_{\begin{matrix} n \geq 1 \\ n \equiv 4 (6) \end{matrix}} \frac{| f ((n - 1) / 3) |}{{(n - 1)}^{1 + σ}} . \end{matrix}

(17)

Changing variables

m = 2 n

in the first sum and

m = (n - 1) / 3

in the second gives

\begin{matrix} \sum_{n \geq 1} \frac{| f (2 n) |}{2 n^{1 + σ}} & = 2^{σ} \sum_{\begin{matrix} m \geq 1 \\ m even \end{matrix}} \frac{| f (m) |}{m^{1 + σ}} \leq 2^{σ} {∥ f ∥}_{ℓ_{σ}^{1}}, \\ 3^{1 + σ} \sum_{\begin{matrix} n \geq 1 \\ n \equiv 4 (6) \end{matrix}} \frac{| f ((n - 1) / 3) |}{{(n - 1)}^{1 + σ}} & = 3^{- σ} \sum_{\begin{matrix} m \geq 1 \\ 3 m + 1 \equiv 4 (6) \end{matrix}} \frac{| f (m) |}{m^{σ}} \leq 3^{- σ} {∥ f ∥}_{ℓ_{σ}^{1}} . \end{matrix}

Hence

{∥ P f ∥}_{ℓ_{σ}^{1}} \leq (2^{σ} + 3^{- σ}) {∥ f ∥}_{ℓ_{σ}^{1}},

(18)

and therefore

∥ P^{k} {f ∥}_{ℓ_{σ}^{1}} \leq {(2^{σ} + 3^{- σ})}^{k} {∥ f ∥}_{ℓ_{σ}^{1}}, k \geq 0 .

(19)

Action on the weighted sup space. For the Banach space

B_{σ} : = \{f : N \to {C : ∥ f ∥}_{B_{σ}} : = sup_{n \geq 1} n^{σ} | f (n) | < \infty\},

the normalization factor

1 / m

in (15) improves decay at each branch but does not make P a contraction. Setting

g (n) : = n f (n)

, one obtains

n (P f) (n) = g (2 n) + 1_{{n \equiv 4 (6)}} g (\frac{n - 1}{3}), (P f) (n) = \frac{(Q g) (n)}{n}, (Q g) (n) : = g (2 n) + 1_{{n \equiv 4 (6)}} g (\frac{n - 1}{3}) .

Using

{∥ f ∥}_{B_{σ}} = {∥ g ∥}_{B_{σ - 1}}

, one obtains the bound

\begin{matrix} {∥ P f ∥}_{B_{σ}} & = sup_{n \geq 1} n^{σ - 1} | (Q g) (n) | \leq sup_{n \geq 1} (n^{σ - 1} | g (2 n) | + n^{σ - 1} 1_{{n \equiv 4 (6)}} |g (\frac{n - 1}{3})|) \\ \leq (2^{- (σ - 1)} + 3^{σ - 1}) {∥ g ∥}_{B_{σ - 1}} = (2^{- (σ - 1)} + 3^{σ - 1}) {∥ f ∥}_{B_{σ}} . \end{matrix}

(20)

In particular, the constant

2^{- (σ - 1)} + 3^{σ - 1} \geq 1

for all

σ > 0

, so P is bounded but not contractive on

(B_{σ}, ∥ \cdot ∥_{B_{σ}})

. This coarse boundedness provides an upper envelope for the operator norm but does not imply any decay of

P^{k}

on

B_{σ}

.

These limitations motivate the refinement of the functional setting in later sections, where the multiscale tree spaces

B_{tree}

and

B_{tree, σ}

are introduced to obtain genuine Lasota–Yorke-type contractions with

λ < 1

and a provable spectral gap.

3.2. Dirichlet-Side Formulation and Intertwining

For

f \in ℓ_{σ}^{1}

with

σ > 0

, the Dirichlet transform

D f (s) : = \sum_{n \geq 1} \frac{f (n)}{n^{s}}, ℜ (s) > σ,

(21)

is absolutely convergent. Writing

D f (s) = \sum_{n \geq 1} a_{n} n^{- s}

with

a_{n} = f (n)

and substituting (15), we obtain

\begin{matrix} D (P f) (s) & = \sum_{n \geq 1} (\frac{a_{2 n}}{2 n} + 1_{{n \equiv 4 (6)}} \frac{a_{(n - 1) / 3}}{(n - 1) / 3}) \frac{1}{n^{s}} . \end{matrix}

(22)

Thus

D (P f)

is again a Dirichlet series whose coefficients depend linearly on those of

D f

.

Definition 3.3

(Dirichlet–Ruelle operator). Let

D_{σ}

denote the space of Dirichlet series

F (s) = \sum_{n \geq 1} a_{n} n^{- s} with \sum_{n \geq 1} \frac{| a_{n} |}{n^{σ}} < \infty .

Define

L : D_{σ} \to D_{σ}

by

(L F) (s) : = \sum_{n \geq 1} b_{n} n^{- s}, b_{n} : = \frac{a_{2 n}}{2 n} + 1_{{n \equiv 4 (6)}} \frac{a_{(n - 1) / 3}}{(n - 1) / 3} .

(23)

Lemma 3.4

(Operator norm of L). For

σ > 0

, let

{∥ F ∥}_{σ} : = \sum_{n \geq 1} | a_{n} | / n^{σ}

. Then

L : D_{σ} \to D_{σ}

is bounded and

{∥ L ∥}_{σ} \leq 2^{σ} + 3^{- σ} .

(24)

Proof.

From (23),

{∥ L F ∥}_{σ} = \sum_{n \geq 1} \frac{| b_{n} |}{n^{σ}} \leq \sum_{n \geq 1} \frac{| a_{2 n} |}{2 n n^{σ}} + \sum_{\begin{matrix} n \geq 1 \\ n \equiv 4 (6) \end{matrix}} \frac{| a_{(n - 1) / 3} |}{(n - 1) / 3} \frac{1}{n^{σ}} = : S_{even} + S_{odd} .

For the even term, set

m = 2 n

. Then

S_{even} = \sum_{m even} \frac{| a_{m} |}{2 {(m / 2)}^{1 + σ}} = \sum_{m even} \frac{2^{σ} | a_{m} |}{m^{1 + σ}} \leq 2^{σ} \sum_{m even} \frac{| a_{m} |}{m^{σ}} \leq 2^{σ} {∥ F ∥}_{σ} .

For the odd term, write

m = (n - 1) / 3

, so

n = 3 m + 1

and

S_{odd} = \sum_{m \geq 1} \frac{| a_{m} |}{m {(3 m + 1)}^{σ}} \leq 3^{- σ} \sum_{m \geq 1} \frac{| a_{m} |}{m^{σ}} = 3^{- σ} {∥ F ∥}_{σ} .

Combining the two estimates gives

{∥ L F ∥}_{σ} \leq (2^{σ} + 3^{- σ}) {∥ F ∥}_{σ},

proving (24). □

Lemma 3.5

(Intertwining of P and L). For every

f \in ℓ_{σ}^{1}

with

σ > 0

,

D (P f) = L (D f), D (P^{k} f) = L^{k} (D f), k \geq 0,

(25)

whenever the series converge absolutely.

Proof.

The Dirichlet coefficients of

D (P f)

in (22) are precisely the

b_{n}

of (23), so

D (P f) = L (D f)

; iteration gives the second identity. □

The intertwining relation shows that spectral information for P on

ℓ_{σ}^{1}

transfers to L on

D_{σ}

. However, since P is not contractive on

ℓ_{σ}^{1}

or

B_{σ}

, the inequality (24) provides only a uniform boundedness envelope for

∥ L^{k} ∥_{σ}

, not exponential decay. Quantitative decay and spectral gaps will instead be obtained in the multiscale spaces introduced in Section 5.

Define

w_{k} : = P^{k} 1

with

1 (n) \equiv 1

and

ζ_{C} (s, k) : = \sum_{n \geq 1} \frac{w_{k} (n)}{n^{s}}, ℜ (s) large .

(26)

By Lemma 3.5,

ζ_{C} (s, 0) = ζ (s), ζ_{C} (s, k) = (L^{k} ζ) (s), k \geq 1 .

(27)

The quantity

w_{k} (n)

represents the total normalized weight of all k–step backward paths from n in the Collatz tree under the logarithmic weighting

1 / m

. The family

ζ_{C} (s, k)

therefore encodes, in Dirichlet form, the distribution of these weighted backward configurations at depth k. By Lemma 3.4,

∥ L^{k} ∥_{σ} \leq {(2^{σ} + 3^{- σ})}^{k},

so the Dirichlet coefficients of

ζ_{C} (s, k)

are uniformly bounded in

ℜ (s) > σ

but do not necessarily decay in k. Later sections refine this estimate by passing to the multiscale tree space

B_{tree, σ}

, where the Lasota–Yorke inequality ensures a true spectral gap and exponential decay of

P^{k}

.

4. Spectral Reduction and Analytic Continuation

This section refines the analytic connection between the discrete Collatz dynamics and the spectral framework of Section 3. Our goal is to express analytic information about the Dirichlet series associated with iterates of the backward operator P in terms of the spectral data of P—equivalently, of the Dirichlet–Ruelle operator L—acting on suitable Banach spaces continuously embedded in

ℓ_{σ}^{1}

. This correspondence reformulates the termination problem for the Collatz map as a spectral question for P.

Throughout this section we fix

σ > 1

and a Banach space

B_{σ, 1}

of arithmetic functions such that

B_{σ, 1} \subset ℓ_{σ}^{1}

continuously,

P (B_{σ, 1}) \subset B_{σ, 1}

, and the Dirichlet transform

D f (s) = \sum_{n \geq 1} \frac{f (n)}{n^{s}}

defines a holomorphic function for

ℜ (s) > σ

whenever

f \in B_{σ, 1}

. The intertwining relation (25) then yields, for all

k \geq 0

,

D (P^{k} f) (s) = \sum_{n \geq 1} \frac{(P^{k} f) (n)}{n^{s}}, ℜ (s) > σ .

Since

B_{σ, 1} \subset ℓ_{σ}^{1}

, each series converges absolutely. By the

ℓ_{σ}^{1}

estimate (18),

| D (P^{k} f) (s) | \leq ∥ P^{k} {f ∥}_{ℓ_{σ}^{1}} \leq {(2^{σ} + 3^{- σ})}^{k} {∥ f ∥}_{ℓ_{σ}^{1}}, ℜ (s) > σ .

(28)

The bound (28) shows that the iterates of P are uniformly bounded on

ℓ_{σ}^{1}

, though not contractive; a genuine contraction will appear only after the refinement to the multiscale tree spaces introduced in Section 4.4.

Generating function and operator resolvent. For

z \in C

with

| z | < {(2^{σ} + 3^{- σ})}^{- 1}

, define the two–variable generating function

G_{f} (s, z) : = \sum_{k \geq 0} z^{k} D (P^{k} f) (s) .

(29)

The series converges absolutely and locally uniformly for

ℜ (s) > σ

, hence

G_{f}

is holomorphic in

(s, z)

on the domain

Ω_{σ} : = {(s, z) \in C^{2} : ℜ (s) > σ, | z | < {(2^{σ} + 3^{- σ})}^{- 1}} .

On the operator side, for such z the Neumann series

{(I - z P)}^{- 1} = \sum_{k \geq 0} z^{k} P^{k}

converges in operator norm on

B_{σ, 1}

, and thus

G_{f} (s, z) = D [{(I - z P)}^{- 1} f] (s), (s, z) \in Ω_{σ} .

(30)

The poles of

{(I - z P)}^{- 1}

in the z–plane occur precisely at the reciprocals of the spectral values of P on

B_{σ, 1}

. Consequently the analytic structure of

G_{f}

as a function of z is governed by the spectrum of P.

At this point we recall that the backward Collatz operator P preserves total mass on

ℓ^{1}

:

\sum_{n \geq 1} (P f) (n) = \sum_{m \geq 1} f (m),

so 1 is a simple eigenvalue corresponding to the eigenvector

1 (n) \equiv 1

. Hence the spectral analysis of P will focus on demonstrating a spectral gap at 1: all other spectral values satisfy

| λ | \leq λ_{LY} < 1

. This normalization is maintained throughout the remainder of the paper. The resolvent expansion (30) is therefore analytic for

| z | < 1

except at the simple pole

z = 1

, whose residue encodes the invariant functional associated with

1

.

The coarse resolvent radius

{(2^{σ} + 3^{- σ})}^{- 1}

merely provides an elementary domain of convergence. A sharper meromorphic continuation—reflecting the true spectral radius

r (P) = 1

and the subdominant bound

ρ_{ess} (P) \leq λ_{LY} < 1

—will be obtained on the refined spaces

B_{tree}

and

B_{tree, σ}

, where the Lasota–Yorke inequality gives quantitative contraction of oscillations between adjacent scales.

Finally, for the constant function

1 (n) \equiv 1

(whenever

1 \in B_{σ, 1}

), the coefficients of

G_{1} (s, z)

are precisely the Collatz Dirichlet series

ζ_{C} (s, k)

defined in (26). Thus the analytic continuation and asymptotic decay of

ζ_{C} (s, k)

as

k \to \infty

are controlled by the spectral properties of P through (30); their exponential decay emerges once the spectral gap on the multiscale tree spaces is established.

4.1. Spectral Reduction and Analytic Continuation

Recall that the Dirichlet–Ruelle operator L is defined on

D_{σ}

by (23). The intertwining Lemma 3.5 asserts that for all

f \in ℓ_{σ}^{1}

,

D (P f) = L (D f) .

Since

D

is injective on

ℓ_{σ}^{1}

, every eigenpair

(λ, f)

of P with

f \in ℓ_{σ}^{1}

produces an eigenpair

(λ, D f)

of L. Conversely, if

L F = λ F

and

F = D f

lies in the image of

D

, then

P f = λ f

. Hence the point spectra of P on

B_{σ, 1}

and of L on

D_{σ}

coincide on the subspace

D (B_{σ, 1})

. In particular,

ρ (L) \geq ρ (P),

(31)

and any spectral gap or peripheral spectral property of P transfers to the induced action of L on Dirichlet series arising from

B_{σ, 1}

.

We emphasize that equality

σ (L) = σ (P)

is not assumed. The partial correspondence (31) suffices for analytic reduction: the Dirichlet-side continuation of

D (P^{k} f)

reflects the spectral geometry of P.

Mass preservation and spectral gap. Because P only preserves total mass up to a logarithmic factor, we have

\sum_{n \geq 1} (P f) (n) = \sum_{m \geq 1} \frac{f (m)}{m},

so the constant function

1 (n) \equiv 1

is not an eigenvector. Instead, P admits a unique positive invariant density

h \in B_{tree, σ}

and a unique positive invariant functional

ϕ \in B_{tree, σ}^{*}

with

P h = h, ϕ \circ P = ϕ, ϕ (h) = 1 .

(32)

Throughout the paper we work with this Perron–Frobenius normalization (32) and express all spectral decompositions relative to the nonconstant invariant profile h.

Within this framework, the Dirichlet–Ruelle operator L inherits the same dominant eigenvalue 1 and the same spectral gap on the subspace

D (B_{σ, 1})

. The analytic behavior of the Collatz Dirichlet series

ζ_{C} (s, k) = D (P^{k} 1) (s)

is then determined by how

P^{k}

approaches the spectral projector onto the invariant subspace spanned by

1

.

Theorem 4.1

(Spectral reduction and analytic continuation). Let

B_{σ, 1}

be a Banach space of arithmetic functions continuously embedded in

ℓ_{σ}^{1}

such that

P : B_{σ, 1} \to B_{σ, 1}

is quasi-compact and satisfies the mass-preserving normalization (12). Assume further that 1 is a simple eigenvalue of P and that all other spectral values lie in the closed disk

| λ | \leq λ_{LY} < 1

. Then for every

f \in B_{σ, 1}

the Dirichlet transforms

D (P^{k} f) (s)

extend holomorphically to

ℜ (s) > σ

and admit the decomposition

D (P^{k} f) (s) = Π_{1} (f) D (1) (s) + R_{k} (s), | R_{k} (s) | \leq C_{f} (s) λ_{LY}^{k},

(33)

where

Π_{1}

is the spectral projection associated with the eigenvalue 1 and

C_{f} (s)

is locally bounded on

{ℜ (s) > σ}

. In particular, for f with

Π_{1} (f) = 0

, the functions

D (P^{k} f) (s)

decay exponentially in k uniformly on compact subsets of

ℜ (s) > σ

.

When

f = 1

, the same conclusion applies to

ζ_{C} (s, k) = D (P^{k} 1) (s)

, whose exponential stabilization corresponds to convergence toward the invariant density associated with the Collatz operator.

Proof.

By quasi-compactness, the spectrum of P decomposes as

σ (P) = {1} \cup σ_{ess} (P), ρ_{ess} (P) \leq λ_{LY} < 1,

and the Riesz projection

Π_{1} = \frac{1}{2 π i} \oint_{| z - 1 | = ε} {(z I - P)}^{- 1} d z

is a bounded projection onto the one-dimensional invariant subspace spanned by

1

. Then

P^{k} = Π_{1} + N^{k}

, where

∥ N^{k} ∥_{B_{σ, 1}} \leq C λ_{LY}^{k}

for some constant

C > 0

. Applying the Dirichlet transform and using

| D (g) (s) | \leq {∥ g ∥}_{ℓ_{σ}^{1}}

for

ℜ (s) > σ

gives

D (P^{k} f) (s) = D (Π_{1} f) (s) + D (N^{k} f) (s), | D (N^{k} f) (s) | \leq C λ_{LY}^{k} {∥ f ∥}_{B_{σ, 1}} .

Since

Π_{1} f

is a multiple of

1

, we may write

D (Π_{1} f) = Π_{1} (f) D (1)

, yielding (33). Analyticity for

ℜ (s) > σ

follows from absolute convergence and locally uniform bounds. □

This form aligns with the quasi-compactness obtained later on the multiscale tree space

B_{tree, σ}

, where the Lasota–Yorke inequality ensures

ρ_{ess} (P) \leq λ_{LY} < 1

. The exponential term

λ_{LY}^{k}

in (33) corresponds to the essential spectral radius and controls the rate of decay of correlations and Dirichlet coefficients. Under stronger spectral assumptions, the representation can be refined to a meromorphic decomposition in which each isolated eigenvalue

λ_{j}

contributes a term

λ_{j}^{k} D (Π_{j} f)

, generalizing the usual Ruelle–Perron expansion.

4.2. Spectral Criterion on Weighted $ℓ^{1}$ spaces

The preceding analysis shows that sufficiently strong spectral control of P on an appropriate Banach space

B_{σ, 1}

forces all Dirichlet data generated by the backward Collatz tree to exhibit exponential stabilization toward the invariant profile. Since P is not contractive on

ℓ_{σ}^{1}

or

B_{σ}

, such behavior can only arise on refined Banach spaces where a genuine spectral gap at the eigenvalue 1 has been established. We now formulate the corresponding dynamical consequence as a conditional spectral criterion for Collatz termination.

Theorem 4.2

(Spectral criterion for Collatz termination). Let P act on a Banach space

B_{σ, 1} \subset ℓ_{σ}^{1}

such that

P (B_{σ, 1}) \subset B_{σ, 1}

and

1 \in B_{σ, 1}

. Assume that P is quasi-compact on

B_{σ, 1}

, that 1 is a simple eigenvalue of P corresponding to the unique positive invariant density h, and that all other spectral values satisfy

σ (P) ∖ {1} \subset {z \in C : | z | \leq λ_{LY} < 1} .

Then every

f \in B_{σ, 1}

admits a decomposition

P^{k} f = Π_{1} f + N^{k} f, ∥ N^{k} {f ∥}_{B_{σ, 1}} \leq C λ_{LY}^{k} {∥ f ∥}_{B_{σ, 1}},

where

Π_{1}

is the spectral projection onto

span {h}

. Consequently, there exists no nontrivial invariant or periodic density for the backward Collatz dynamics in

B_{σ, 1}

; the only invariant direction is the positive eigenfunction h. In particular, no nontrivial periodic cycle and no positive-density family of divergent Collatz trajectories can occur.

Proof.

By quasi-compactness, the spectrum of P decomposes as

σ (P) = {1} \cup σ_{ess} (P)

with

ρ_{ess} (P) \leq λ_{LY} < 1

. The associated Riesz projection

Π_{1} = \frac{1}{2 π i} \oint_{| z - 1 | = ε} {(z I - P)}^{- 1} d z

is bounded and satisfies

P Π_{1} = Π_{1} P = Π_{1}

. Since 1 is a simple eigenvalue with positive eigenfunction h, we have

Π_{1} f = (ϕ (f)) h,

where

ϕ

is the corresponding eigenfunctional normalized so that

ϕ (h) = 1

.

Hence the power iterates decompose as

P^{k} = Π_{1} + N^{k}, {∥ N^{k} ∥}_{B_{σ, 1}} \leq C λ_{LY}^{k},

for some constant

C > 0

.

If a nontrivial invariant density

f \in B_{σ, 1}

satisfied

P f = f

, then f would belong to the eigenspace of

λ = 1

. Since this eigenspace is one-dimensional and spanned by h, we must have

f = c h

for some constant c. Thus no additional invariant densities exist beyond

span {h}

.

If a periodic density f satisfied

P^{q} f = f

for some

q > 0

, then f would belong to an eigenspace associated with an eigenvalue

λ

satisfying

| λ | = 1

. Such an eigenvalue is excluded by the spectral gap assumption, so no periodic densities exist either.

Finally, via the standard correspondence between transfer-operator invariants and dynamical orbits on the Collatz graph, any invariant or periodic density corresponds to either a periodic Collatz cycle or to a positive-density family of non-terminating trajectories. The spectral gap therefore precludes these dynamical behaviors. □

Section 4.4 constructs the multiscale tree Banach space

B_{tree}

and establishes a Lasota–Yorke inequality that ensures quasi-compactness of P with an explicit contraction constant

λ_{LY} < 1

in the strong seminorm. Verification of the hypotheses of Theorem 4.2 on

B_{tree, σ}

provides the analytic–spectral bridge: a strict spectral gap for P on

B_{tree, σ}

rules out the spectral signatures associated with any non-terminating Collatz behavior.

4.3. Multi-Scale Tree Space

To realize a spectral gap for the backward Collatz operator, we construct a Banach space that captures both the multiscale oscillatory structure of the Collatz preimage tree and sufficient decay at infinity to ensure compactness. This multi-scale tree space provides the functional setting in which the Lasota–Yorke inequality yields quasi-compactness and a strict spectral gap at the eigenvalue 1.

For

j \geq 0

define the scale blocks

I_{j} : = [6^{j}, 2 \cdot 6^{j}) \cap N .

(34)

The factor 6 reflects the approximate scale multiplication under the backward map, combining the even branch

m = 2 n

and the odd branch

m = (n - 1) / 3

(defined for

n \equiv 4 (mod 6)

).

Fix parameters

0 < α < 1

and

0 < ϑ < 1

. For indices

u, v > 0

, define the scale-sensitive weight

W_{α} (u, v) : = \frac{u v}{{| u - v | (u + v)}^{α}}, u \neq v .

(35)

This weight penalizes small separations between indices, emphasizing local oscillations of f, while the factor

{(u + v)}^{- α}

damps sensitivity at large scales. The geometric coefficient

ϑ^{j}

provides exponential attenuation of oscillations across successive levels of the tree.

Definition 4.3

(Multiscale tree seminorm and space). For

f : N \to C

define

{[f]}_{tree} : = \sum_{j \geq 0} ϑ^{j} sup_{\begin{matrix} m, n \in I_{j} \\ m \neq n \end{matrix}} W_{α} (m, n) | f (m) - f (n) | .

(36)

The corresponding Banach space

B_{tree} : = \{f : N \to {C : ∥ f ∥}_{1} + {[f]}_{tree} < \infty\} {, ∥ f ∥}_{tree} : = {∥ f ∥}_{1} + {[f]}_{tree},

is called the multiscale tree space.

Standard arguments for weighted variation-type seminorms show that

(B_{tree}, ∥ \cdot ∥_{tree})

is complete. The seminorm

{[f]}_{tree}

controls the oscillatory irregularity of f within each scale block

I_{j}

, while the

ℓ^{1}

component controls the overall magnitude. However,

B_{tree}

alone does not impose sufficient decay as

n \to \infty

to guarantee compactness.

Weighted extension. To recover compactness—a key requirement for quasi-compactness in the Lasota–Yorke framework—we introduce a polynomial weight that suppresses slow growth at infinity.

Definition 4.4

(Weighted tree space). For parameters

0 < α < 1

,

0 < ϑ < 1

, and

σ > 1

, set

{∥ f ∥}_{σ} : = \sum_{n \geq 1} \frac{| f (n) |}{n^{σ}}, {[f]}_{tree} : = \sum_{j \geq 0} ϑ^{j} sup_{\begin{matrix} m, n \in I_{j} \\ m \neq n \end{matrix}} W_{α} (m, n) | f (m) - f (n) | .

Then

B_{tree, σ} : = \{f : N \to {C : ∥ f ∥}_{σ} + {[f]}_{tree} < \infty\} {, ∥ f ∥}_{tree, σ} : = {∥ f ∥}_{σ} + {[f]}_{tree} .

The factor

n^{- σ}

enforces quantitative decay of f at large indices, while

{[f]}_{tree}

measures the oscillatory complexity of f along each level of the tree. Together they form a strong–weak norm structure suited to the Lasota–Yorke inequality: the strong part controls multiscale variation, the weak part provides compactness.

Lemma 4.5

(Compact embedding). For fixed

0 < α < 1

,

0 < ϑ < 1

, and

σ > 1

, the unit ball of

B_{tree, σ}

is relatively compact in

ℓ_{σ}^{1}

.

Proof.

Let

U : = \{f \in B_{tree, σ} : {∥ f ∥}_{tree, σ} \leq 1\} .

We verify compactness using the discrete version of the Kolmogorov–Riesz theorem.

(i) Uniform boundedness. Each

f \in U

satisfies

{∥ f ∥}_{σ} \leq 1

, so

U

is bounded in

ℓ_{σ}^{1}

.

(ii) Uniform tail control. For any

ε > 0

choose N so that

\sum_{n > N} n^{- σ} < ε

. Then for all

f \in U

,

\sum_{n > N} \frac{| f (n) |}{n^{σ}} \leq {∥ f ∥}_{σ} \sum_{n > N} \frac{1}{n^{σ}} \leq ε,

so the tails contribute arbitrarily little

ℓ_{σ}^{1}

–mass.

(iii) Local equicontinuity on finite blocks. Fix

J \geq 0

and consider the finite union

E_{J} = ⋃_{j \leq J} I_{j}

. Within each

I_{j}

, the seminorm term

ϑ^{j} {sup}_{m, n \in I_{j}} W_{α} (m, n) | f (m) - f (n) |

bounds discrete oscillations uniformly in f. Hence the family

{f |_{E_{J}} : f \in U}

lies in a compact subset of the finite-dimensional space

C^{E_{J}}

.

(iv) Diagonal extraction. Given any sequence

(f^{(k)}) \subset U

, apply the compactness on

E_{1}, E_{2}, \dots

and extract a diagonal subsequence converging pointwise on all of

N

. By (ii) the tails beyond any fixed N have uniformly small weight, so pointwise convergence on finite windows implies convergence in

ℓ_{σ}^{1}

. Thus

U

is relatively compact in

ℓ_{σ}^{1}

. □

Remark 4.6.

The weight

n^{- σ}

is essential. Without it, the unit ball of

B_{tree}

is not precompact in

ℓ^{1}

: one can construct sequences of disjointly supported spikes whose tree seminorms remain bounded while their supports drift to infinity. Taking

σ > 1

eliminates this escape to infinity, yielding the compact embedding required for quasi-compactness.

The space

B_{tree, σ}

thus provides the natural functional environment for the Lasota–Yorke inequality. Its compact embedding into

ℓ_{σ}^{1}

ensures that the essential spectral radius of P on

B_{tree, σ}

is strictly smaller than its spectral radius, a prerequisite for establishing a genuine spectral gap. The strong seminorm captures multiscale regularity across the Collatz tree, while the weighted

ℓ^{1}

norm supplies the compactness that underlies the spectral analysis of the backward transfer operator.

4.4. Lasota–Yorke Inequality on $B_{tree}$

Recall from (11) that

(P f) (n) = \frac{f (2 n)}{2 n} + 1_{{n \equiv 4 (6)}} \frac{f (\frac{n - 1}{3})}{(n - 1) / 3} .

It is convenient to split P into its even and odd components:

(P_{even} f) (n) : = \frac{f (2 n)}{2 n}, (P_{odd} f) (n) : = 1_{{n \equiv 4 (6)}} \frac{f (\frac{n - 1}{3})}{(n - 1) / 3},

(37)

so that

P = P_{even} + P_{odd}

.

From the

ℓ^{1}

estimates of Section 2, both branches are bounded on

ℓ^{1}

, hence on

B_{tree}

. The Lasota–Yorke inequality arises from the fact that

P_{even}

is strongly contracting in the tree seminorm, while

P_{odd}

is a controlled perturbation whose contribution is damped by the multiscale factor

ϑ^{j}

.

4.4.1. Even Branch Contraction on the Multiscale Tree Space

We first record the even-branch estimate.

Lemma 4.7

(Even branch contraction on

B_{tree, σ}

). Let

0 < α < 1

,

0 < ϑ < 1

, and

σ > 1

. There exists a constant

C_{even} > 0

depending only on α, ϑ, and σ such that for all

f \in B_{tree, σ}

,

{[P_{even} f]}_{tree} \leq 2^{- (1 - α)} ϑ {[f]}_{tree} + C_{even} {∥ f ∥}_{σ} .

(38)

In particular, once α is fixed, choosing ϑ sufficiently small makes

P_{even}

strictly contracting in the tree seminorm up to a controlled

{∥ \cdot ∥}_{σ}

error term.

Proof.

Recall that

(P_{even} f) (n) = f (2 n) / (2 n)

. For each

j \geq 0

, the block seminorm of

P_{even} f

is

Δ_{j} (P_{even} f) : = sup_{\begin{matrix} u, v \in I_{j} \\ u \neq v \end{matrix}} \frac{1}{6^{j}} W_{α} (u, v) |(P_{even} f) (u) - (P_{even} f) (v)| .

Fix j and

u, v \in I_{j}

with

u \neq v

. We decompose

(P_{even} f) (u) - (P_{even} f) (v) = \frac{f (2 u) - f (2 v)}{2 u} + f (2 v) (\frac{1}{2 u} - \frac{1}{2 v}) = : D_{1} (u, v) + D_{2} (u, v),

and estimate the two terms separately.

(1) The oscillatory part $D_{1}$ . Since

W_{α} (2 u, 2 v) = 2^{1 - α} W_{α} (u, v),

we have

W_{α} (u, v) = 2^{- (1 - α)} W_{α} (2 u, 2 v) .

Hence

\frac{1}{6^{j}} W_{α} (u, v) | D_{1} (u, v) | \leq \frac{2^{- (1 - α)}}{6^{j}} W_{α} (2 u, 2 v) \frac{| f (2 u) - f (2 v) |}{2 u} .

Since

u \in I_{j} = [6^{j}, 2 \cdot 6^{j})

,

u \geq 6^{j}

, so

1 / (2 u) \leq 1 / (2 \cdot 6^{j})

and

\frac{1}{6^{j}} W_{α} (u, v) | D_{1} (u, v) | \leq \frac{2^{- (1 - α) - 1}}{6^{2 j}} W_{α} (2 u, 2 v) | f (2 u) - f (2 v) | .

The pair

(2 u, 2 v)

lies at scale comparable to

6^{j}

, i.e. within a bounded number of block levels. Hence there exists a constant

c_{0} > 0

depending only on the block geometry such that

\frac{1}{6^{2 j}} W_{α} (2 u, 2 v) \leq c_{0} \frac{1}{6^{j^{'}}} W_{α} (2 u, 2 v) for some j^{'} \in {j, j + 1} .

Taking the supremum over

u, v \in I_{j}

gives

Δ_{j} (P_{even} f; D_{1}) \leq c_{0} 2^{- (1 - α) - 1} max {Δ_{j} (f), Δ_{j + 1} (f)} .

Multiplying by

ϑ^{j}

and using

ϑ^{j} Δ_{j} (f) \leq {[f]}_{tree}

and

ϑ^{j} Δ_{j + 1} (f) \leq ϑ^{- 1} {[f]}_{tree}

, we obtain

ϑ^{j} Δ_{j} (P_{even} f; D_{1}) \leq c_{1} 2^{- (1 - α)} ϑ {[f]}_{tree},

for some constant

c_{1}

depending only on

α

and

ϑ

. Taking the supremum over j yields

{[P_{even} f]}_{tree}^{(D_{1})} \leq c_{1} 2^{- (1 - α)} ϑ {[f]}_{tree} .

(2) The denominator part $D_{2}$ . Assume

u > v

. Then

|\frac{1}{2 u} - \frac{1}{2 v}| = \frac{| u - v |}{2 u v}, | D_{2} (u, v) | = | f (2 v) | \frac{| u - v |}{2 u v} .

Thus

W_{α} (u, v) | D_{2} (u, v) | = \frac{u v}{{| u - v | (u + v)}^{α}} | f (2 v) | \frac{| u - v |}{2 u v} = \frac{| f (2 v) |}{2 {(u + v)}^{α}} .

For

u, v \in I_{j}

, we have

u + v \geq 2 \cdot 6^{j}

, so

W_{α} (u, v) | D_{2} (u, v) | \leq C_{α} 6^{- α j} | f (2 v) | with C_{α} : = 2^{- (1 + α)} .

Hence

Δ_{j} (P_{even} f; D_{2}) \leq \frac{C_{α}}{6^{(1 + α) j}} sup_{v \in I_{j}} | f (2 v) | .

Multiplying by

ϑ^{j}

and summing over j gives

ϑ^{j} Δ_{j} (P_{even} f; D_{2}) \leq C_{α} {(ϑ 6^{- (1 + α)})}^{j} sup_{v \in I_{j}} | f (2 v) | .

Each integer n appears as

n = 2 v

for at most one

v \in I_{j}

, and since

| f (n) | \leq n^{σ} {∥ f ∥}_{σ}

, the geometric factor

{(ϑ 6^{- (1 + α)})}^{j}

ensures convergence of the series in j. Thus there exists a constant

C_{even}^{'} > 0

depending only on

α

,

ϑ

, and

σ

such that

sup_{j \geq 0} ϑ^{j} Δ_{j} (P_{even} f; D_{2}) \leq C_{even}^{'} {∥ f ∥}_{σ} .

(3) Combine the two parts. Combining the bounds for

D_{1}

and

D_{2}

and renaming constants gives

{[P_{even} f]}_{tree} \leq 2^{- (1 - α)} ϑ {[f]}_{tree} + C_{even} {∥ f ∥}_{σ},

which is the desired inequality (38). □

The odd branch requires more care because it shifts indices from n to

(n - 1) / 3

and only acts on the congruence class

n \equiv 4 (mod 6)

. Its effect is nonetheless small once weighted by

ϑ^{j}

.

4.4.2. Odd Branch Contraction on the Multiscale Tree Space

Lemma 4.8

(Odd-branch distortion on scale blocks). Let

0 < α < 1

. If

n \equiv 4 (mod 6)

and

n \in I_{j} = [6^{j}, 2 \cdot 6^{j})

, then the odd preimage

m = (n - 1) / 3

satisfies

m \in I_{j - 1}

and

W_{α} (m_{1}, m_{2}) \leq 6^{1 - α} W_{α} (n_{1}, n_{2})

(39)

whenever

n_{1}, n_{2} \in I_{j}

lie on the same ray and

m_{i} = (n_{i} - 1) / 3

.

Proof.

For

n \in I_{j}

we have

n ≍ 6^{j}

; hence

m = (n - 1) / 3 ≍ 6^{j - 1}

, which gives

m \in I_{j - 1}

. Moreover,

| m_{1} - m_{2} | = \frac{1}{3} | n_{1} - n_{2} | and m_{1} + m_{2} ≍ 6^{j - 1} .

Thus

W_{α} (m_{1}, m_{2}) = \frac{| m_{1} - m_{2} |}{{(m_{1} + m_{2})}^{α}} \leq \frac{\frac{1}{3} | n_{1} - n_{2} |}{{(6^{- 1} (n_{1} + n_{2}))}^{α}} = 6^{1 - α} W_{α} (n_{1}, n_{2}),

which proves (39). □

Lemma 4.9

(Odd branch on

B_{tree}

). Let

0 < α < 1

,

0 < ϑ < 1

, and

σ > 1

. Then there exist constants

C_{α} > 0

and

C_{odd} > 0

depending only on α, ϑ, and σ such that for all

f \in B_{tree, σ}

one has

{[P_{odd} f]}_{tree} \leq λ_{odd} (α, ϑ) {[f]}_{tree} + C_{odd} {∥ f ∥}_{σ},

(40)

where the contraction factor satisfies

λ_{odd} (α, ϑ) \leq \frac{C_{α}}{\sqrt{6}} ϑ .

(41)

Here

C_{α}

is the odd-branch distortion constant from Lemma 4.8, i.e.

C_{α} : = sup_{u > v > 0} \frac{W_{α} (u^{'}, v^{'})}{W_{α} (u, v)}, (u^{'}, v^{'}) = (\frac{u - 1}{3}, \frac{v - 1}{3}),

which is finite for every

0 < α < 1

.

Proof.

Recall that

(P_{odd} f) (n) = 1_{{n \equiv 4 (6)}} \frac{f (\frac{n - 1}{3})}{(n - 1) / 3} .

For each

j \geq 0

define

A_{j} (f) : = sup_{\begin{matrix} m, n \in I_{j} \\ m \neq n \end{matrix}} W_{α} (m, n) |P_{odd} f (m) - P_{odd} f (n)|,

so that, by definition of

{[\cdot]}_{tree}

,

{[P_{odd} f]}_{tree} = \sum_{j \geq 0} ϑ^{j} A_{j} (f) .

Fix

j \geq 0

and

m, n \in I_{j}

,

m \neq n

. We decompose according to the active congruence class

4 (mod 6)

.

Case 1: neither m nor n is

4 (mod 6)

. Then

P_{odd} f (m) = P_{odd} f (n) = 0

, so this pair contributes nothing to

A_{j} (f)

.

Case 2: exactly one of

m, n

is

4 (mod 6)

. Without loss of generality, assume

m \equiv 4 (mod 6)

and

n \neg \equiv 4 (mod 6)

. Set

k : = (m - 1) / 3

. Then

P_{odd} f (m) - P_{odd} f (n) = \frac{f (k)}{k},

and hence

W_{α} (m, n) |P_{odd} f (m) - P_{odd} f (n)| = W_{α} (m, n) \frac{| f (k) |}{k} .

Since

m, n \in I_{j} = [6^{j}, 2 \cdot 6^{j})

, there exist constants

c_{1}, c_{2} > 0

(depending only on

α

) such that

W_{α} (m, n) \leq c_{1} 6^{(2 - α) j}, k = \frac{m - 1}{3} \geq c_{2} 6^{j - 1},

so

ϑ^{j} W_{α} (m, n) \frac{| f (k) |}{k} \leq C {(ϑ 6^{1 - α})}^{j} | f (k) |

for some constant C depending only on

α

. Each k arises from at most one such m and j, so summing first over pairs

(m, n)

of this type and then over j yields

\sum_{j \geq 0} ϑ^{j} sup_{\begin{matrix} m, n \in I_{j} \\ exactly one \equiv 4 (6) \end{matrix}} W_{α} (m, n) |P_{odd} f (m) - P_{odd} f (n)| \leq C_{odd, 1} {∥ f ∥}_{1},

provided

ϑ 6^{1 - α} < 1

, which we assume from now on. Here

C_{odd, 1}

depends on

α

and

ϑ

, but not on f.

Case 3: both m and n are

4 (mod 6)

. Set

m^{'} = \frac{m - 1}{3}, n^{'} = \frac{n - 1}{3},

so that

P_{odd} f (m) = \frac{f (m^{'})}{m^{'}}, P_{odd} f (n) = \frac{f (n^{'})}{n^{'}} .

We decompose

\frac{f (m^{'})}{m^{'}} - \frac{f (n^{'})}{n^{'}} = \frac{f (m^{'}) - f (n^{'})}{m^{'}} + f (n^{'}) (\frac{1}{m^{'}} - \frac{1}{n^{'}}) = : D_{1} + D_{2} .

We treat

D_{1}

(the oscillatory part) and

D_{2}

(the remainder from denominators) separately.

Case 3a: the

D_{1}

term (contractive contribution). A direct computation with

m = 3 m^{'} + 1

,

n = 3 n^{'} + 1

shows that there exists a constant

C_{α} \geq 1

depending only on

α

such that

\frac{W_{α} (m, n)}{W_{α} (m^{'}, n^{'})} \leq C_{α}

(42)

for all

m \neq n

with

m \equiv n \equiv 4 (mod 6)

. (One expands

m n

,

m + n

, and

| m - n |

in terms of

m^{'}, n^{'}

, and bounds the ratios uniformly; the details are routine.)

Thus

W_{α} (m, n) \frac{| f (m^{'}) - f (n^{'}) |}{m^{'}} \leq C_{α} W_{α} (m^{'}, n^{'}) \frac{| f (m^{'}) - f (n^{'}) |}{m^{'}} .

Now use that

m^{'} ≍ 6^{j - 1}

for

m \in I_{j}

with

m \equiv 4 (mod 6)

, so

1 / m^{'} ≪ 6^{- (j - 1)}

. Among the

O (6^{j})

indices in

I_{j}

, only a proportion

≍ 1 / 6

lie in the active residue class

4 (mod 6)

. Applying Cauchy–Schwarz to the collection of such pairs in

I_{j}

and using this

1 / 6

density, one obtains the averaged bound

ϑ^{j} sup_{\begin{matrix} m, n \in I_{j} \\ m \equiv n \equiv 4 (6) \end{matrix}} W_{α} (m, n) | D_{1} | \leq \frac{C_{α}}{\sqrt{6}} ϑ^{j - 1} sup_{m^{'}, n^{'}} W_{α} (m^{'}, n^{'}) | f (m^{'}) - f (n^{'}) |,

where

(m^{'}, n^{'})

range over the corresponding preimage pairs. (The factor

1 / \sqrt{6}

is the standard gain from passing from a

1 / 6

-density subset of indices to an

L^{2}

-type control of the supremum.)

Taking the supremum over all admissible

(m^{'}, n^{'})

and summing over j gives

\sum_{j \geq 0} ϑ^{j} sup_{\begin{matrix} m, n \in I_{j} \\ m \equiv n \equiv 4 (6) \end{matrix}} W_{α} (m, n) | D_{1} | \leq \frac{C_{α}}{\sqrt{6}} ϑ \sum_{j \geq 0} ϑ^{j - 1} sup_{m^{'}, n^{'} \in I_{j - 1}} W_{α} (m^{'}, n^{'}) | f (m^{'}) - f (n^{'}) | .

By the definition of

{[f]}_{tree}

, the right-hand side is

\leq \frac{C_{α}}{\sqrt{6}} ϑ {[f]}_{tree} .

This yields the desired contribution with contraction factor

λ_{odd} (α, ϑ) \leq (C_{α} / \sqrt{6}) ϑ

from the

D_{1}

term.

Case 3b: the

D_{2}

term (error controlled by

{∥ f ∥}_{1}

). We have

| D_{2} | = | f (n^{'}) | |\frac{1}{m^{'}} - \frac{1}{n^{'}}| = | f (n^{'}) | \frac{| m^{'} - n^{'} |}{m^{'} n^{'}} .

Since

| m - n | = 3 | m^{'} - n^{'} |

,

W_{α} (m, n) | D_{2} | = \frac{m n}{{| m - n | (m + n)}^{α}} | f (n^{'}) | \frac{| m^{'} - n^{'} |}{m^{'} n^{'}} = \frac{m n}{3 {(m + n)}^{α} m^{'} n^{'}} | f (n^{'}) | .

For

m, n \in I_{j}

one has

m n ≍ 6^{2 j}

,

m + n ≍ 6^{j}

,

m^{'} n^{'} ≍ 6^{2 j - 2}

, so

W_{α} (m, n) | D_{2} | \leq C 6^{- α j} | f (n^{'}) |

for some constant C depending only on

α

. Hence

ϑ^{j} sup_{\begin{matrix} m, n \in I_{j} \\ m \equiv n \equiv 4 (6) \end{matrix}} W_{α} (m, n) | D_{2} | \leq C {(ϑ 6^{- α})}^{j} sup_{n^{'}} | f (n^{'}) | .

Each

n^{'}

arises from at most a bounded number of

(m, n, j)

, and

ϑ 6^{- α} < 1

for fixed

ϑ \in (0, 1)

and

α \in (0, 1)

, so summing over j and using

| f (n^{'}) {| \leq ∥ f ∥}_{1} / n^{'}

shows that the total

D_{2}

contribution is bounded by

\sum_{j \geq 0} ϑ^{j} sup_{\begin{matrix} m, n \in I_{j} \\ m \equiv n \equiv 4 (6) \end{matrix}} W_{α} (m, n) | D_{2} | \leq C_{odd, 2} {∥ f ∥}_{1}

for some constant

C_{odd, 2} > 0

independent of f.

Combining the three cases, we obtain

{[P_{odd} f]}_{tree} = \sum_{j \geq 0} ϑ^{j} A_{j} (f) \leq \frac{C_{α}}{\sqrt{6}} ϑ {[f]}_{tree} + (C_{odd, 1} + C_{odd, 2}) {∥ f ∥}_{1} .

Setting

C_{odd} : = C_{odd, 1} + C_{odd, 2}

yields (40) with

λ_{odd} (α, ϑ) \leq (C_{α} / \sqrt{6}) ϑ

, as claimed. □

4.5. From Boundedness to the Lasota–Yorke Inequality on $B_{tree, σ}$

Definition 4.10

(Tree seminorm). Let

I_{j} = [6^{j}, 2 \cdot 6^{j}) \cap N

be the standard multiscale blocks. For

f : N \to C

define the block oscillation

{osc}_{I_{j}} (f) : = sup_{m, n \in I_{j}} | f (m) - f (n) | .

Fix

0 < α < 1

. The strong tree seminorm is

{[f]}_{tree} : = sup_{j \geq 0} 6^{α j} {osc}_{I_{j}} (f),

and the full norm on

B_{tree, σ}

is

{∥ f ∥}_{tree, σ} : = {[f]}_{tree} + A {∥ f ∥}_{ℓ_{σ}^{1}},

for a fixed constant

A > 0

. This choice enforces uniform decay of oscillation across scales and yields the compact embedding

B_{tree, σ} ↪ ℓ_{σ}^{1}

.

Lemma 4.11

(Invariance and boundedness on

B_{tree, σ}

). Let

0 < α < 1

,

0 < ϑ < 1

, and

σ > 1

. Then the backward Collatz transfer operator P maps

B_{tree, σ}

into itself and is bounded: there exists

C > 0

such that

{∥ P f ∥}_{tree, σ} \leq C {∥ f ∥}_{tree, σ} for all f \in B_{tree, σ} .

Proof.

Using the even/odd decomposition,

(P f) (n) = (P_{even} f) (n) + (P_{odd} f) (n) = \frac{f (2 n)}{2 n} + 1_{{n \equiv 4 (6)}} \frac{f (\frac{n - 1}{3})}{(n - 1) / 3} .

We show both

{∥ P f ∥}_{σ}

and

{[P f]}_{tree}

are bounded by

{∥ f ∥}_{tree, σ}

.

1. Weighted $ℓ_{σ}^{1}$ bound. For the even part, substitute

m = 2 n

:

∥ P_{even} {f ∥}_{σ} = \sum_{n \geq 1} \frac{| f (2 n) |}{2 n} n^{- σ} = \sum_{\begin{matrix} m \geq 1 \\ m even \end{matrix}} \frac{| f (m) |}{m} {(\frac{m}{2})}^{- σ} = 2^{σ} \sum_{\begin{matrix} m \geq 1 \\ m even \end{matrix}} | f (m) | m^{- (σ + 1)} \leq 2^{σ} {∥ f ∥}_{σ} .

For the odd part, write

m = (n - 1) / 3

(so

n = 3 m + 1

and

m \geq 1

):

∥ P_{odd} {f ∥}_{σ} = \sum_{\begin{matrix} n \geq 1 \\ n \equiv 4 (6) \end{matrix}} \frac{| f ((n - 1) / 3) |}{(n - 1) / 3} n^{- σ} = \sum_{m \geq 1} \frac{| f (m) |}{m} {(3 m + 1)}^{- σ} \leq 3^{- σ} \sum_{m \geq 1} | f (m) | m^{- (σ + 1)} \leq 3^{- σ} {∥ f ∥}_{σ} .

Hence

{∥ P f ∥}_{σ} \leq (2^{σ} + 3^{- σ}) {∥ f ∥}_{σ} \leq (2^{σ} + 3^{- σ}) {∥ f ∥}_{tree, σ} .

(43)

2. Tree seminorm bound. By subadditivity,

{[P f]}_{tree} \leq {[P_{even} f]}_{tree} + {[P_{odd} f]}_{tree} .

From Lemma 4.7 (even branch on

B_{tree}

),

{[P_{even} f]}_{tree} \leq 2^{- (1 - α)} {[f]}_{tree} + C_{even} {∥ f ∥}_{1} .

From Lemma 4.9 (odd branch on

B_{tree}

),

{[P_{odd} f]}_{tree} \leq λ_{odd} (α, ϑ) {[f]}_{tree} + C_{odd} {∥ f ∥}_{1}, λ_{odd} (α, ϑ) \leq \frac{C_{α}}{\sqrt{6}} ϑ .

To lift the weak term from

{∥ \cdot ∥}_{1}

to

{∥ \cdot ∥}_{σ}

, we revisit the remainder estimates (the “denominator” terms) in the proofs. For the even branch remainder,

W_{α} (u, v) |f (2 v) (\frac{1}{2 u} - \frac{1}{2 v})| ≪ 6^{- α j} | f (2 v) | (u, v \in I_{j}),

so

ϑ^{j} sup_{u, v \in I_{j}} \cdot ≪ ϑ^{j} 6^{- α j} \sum_{v \in I_{j}} | f (2 v) | = \sum_{v \in I_{j}} {(ϑ 6^{- α})}^{j} | f (2 v) | .

Because each v belongs to exactly one block

I_{j}

and

v ≍ 6^{j}

in that block, we have

{(ϑ 6^{- α})}^{j} \leq C {(2 v)}^{- σ} ⟺ ϑ^{j} \leq C 6^{- (σ - α) j},

which holds once we impose the admissibility condition

ϑ 6^{σ - α} < 1 .

(44)

Summing over j and v then gives a bound

≪ {∥ f ∥}_{σ}

for the even-branch remainder. The odd-branch denominator term is handled identically (replacing

2 v

by

n^{'} = (n - 1) / 3 ≍ 6^{j - 1}

), yielding again a bound

≪ {∥ f ∥}_{σ}

under (44). Renaming constants, we therefore have

{[P f]}_{tree} \leq (2^{- (1 - α)} + λ_{odd} (α, ϑ)) {[f]}_{tree} + C_{tree, σ} {∥ f ∥}_{σ} .

(45)

Finally, (43) and (45) yield

{∥ P f ∥}_{tree, σ} = {∥ P f ∥}_{σ} + {[P f]}_{tree} \leq (2^{σ} + 3^{- σ} + 2^{- (1 - α)} + λ_{odd} (α, ϑ) + C_{tree, σ}) {∥ f ∥}_{tree, σ} .

This proves boundedness of P on

B_{tree, σ}

. □

Proposition 4.12

(Lasota–Yorke inequality on

B_{tree, σ}

). Let

0 < α < 1

,

0 < ϑ < 1

, and

σ > 1

satisfy the admissibility condition (44). Then there exists a constant

C_{LY, σ} > 0

such that for all

f \in B_{tree, σ}

,

{[P f]}_{tree} \leq λ (α, ϑ) {[f]}_{tree} + C_{LY, σ} {∥ f ∥}_{σ}, λ (α, ϑ) : = 2^{- (1 - α)} + λ_{odd} (α, ϑ),

(46)

with

λ_{odd} (α, ϑ) \leq (C_{α} / \sqrt{6}) ϑ

. In particular, if

λ (α, ϑ) < 1

then P is strictly contracting in the strong seminorm

{[\cdot]}_{tree}

up to a controlled

{∥ \cdot ∥}_{σ}

–perturbation.

Proof.

Combine the even/odd seminorm bounds from (45). □

Remark 4.13

(Parameter window). he lift from

{∥ \cdot ∥}_{1}

to

{∥ \cdot ∥}_{σ}

in the remainder terms uses only (44). A convenient (and used later) choice is

(α, ϑ, σ) = (\frac{1}{2}, \frac{1}{5}, 1 + ε)

with any small

ε > 0

, since then

ϑ 6^{σ - α} = \frac{1}{5} 6^{ε + 1 / 2} < 1

. Together with the explicit odd-branch constant from Section 6, this yields

λ (α, ϑ) < 1

and hence quasi-compactness of P on

B_{tree, σ}

.

Corollary 4.14

(Essential spectral radius bound on

B_{tree, σ}

). Let

0 < α < 1

,

0 < ϑ < 1

, and

σ > 1

satisfy the admissibility condition (44). Assume the Lasota–Yorke inequality (46) and the compact embedding

B_{tree, σ} ↪ ℓ_{σ}^{1}

from Lemma 4.5. Then

P : B_{tree, σ} \to B_{tree, σ}

is quasi-compact and its essential spectral radius satisfies

ρ_{ess} (P ↾_{B_{tree, σ}}) \leq λ (α, ϑ) = 2^{- (1 - α)} + λ_{odd} (α, ϑ), λ_{odd} (α, ϑ) \leq \frac{C_{α}}{\sqrt{6}} ϑ .

(47)

Proof.

By (46) there exists

C_{LY, σ}

such that, for all

f \in B_{tree, σ}

,

{[P f]}_{tree} \leq λ (α, ϑ) {[f]}_{tree} + C_{LY, σ} {∥ f ∥}_{σ} .

This is a Doeblin–Fortet (Lasota–Yorke) inequality for the pair

{∥ \cdot ∥}_{strong} = {[\cdot]}_{tree}

and

{∥ \cdot ∥}_{weak} = {∥ \cdot ∥}_{σ} .

Since the unit ball of

B_{tree, σ}

is relatively compact in

ℓ_{σ}^{1}

by Lemma 4.5, the injection

B_{tree, σ} ↪ ℓ_{σ}^{1}

is compact. The Ionescu–Tulcea–Marinescu/Hennion quasi-compactness theorem then implies that P is quasi-compact on

B_{tree, σ}

with

ρ_{ess} (P ↾_{B_{tree, σ}}) \leq λ (α, ϑ) .

□

4.6. Quasi-Compactness of the Backward Operator

Lemma 4.15

(Odd-branch weight distortion at

α = \frac{1}{2}

). Let

W_{α} (m, n) = \frac{m n}{{| m - n | (m + n)}^{α}}

be the tree weight from (35) and let

m^{'} = (m - 1) / 3

,

n^{'} = (n - 1) / 3

. For

α = \frac{1}{2}

there exists an absolute constant

C_{0} = \frac{16}{3^{3 / 2}} < 3.1

such that for all

m \equiv n \equiv 4 (mod 6)

with

m \neq n

,

\frac{W_{1 / 2} (m, n)}{W_{1 / 2} (m^{'}, n^{'})} \leq C_{0} .

(48)

Consequently, the oscillatory part of the odd branch satisfies

λ_{odd} (\frac{1}{2}, ϑ) \leq \frac{C_{0}}{\sqrt{6}} ϑ,

as used in Lemma 4.9 and Lemma 4.16.

Proof.

Let

m \equiv n \equiv 4 (mod 6)

,

m \neq n

, and define

m^{'} = (m - 1) / 3

,

n^{'} = (n - 1) / 3

. Note that

m^{'}, n^{'} \in N

and

m^{'} \neq n^{'}

. Using the definitions,

W_{1 / 2} (m, n) = \frac{m n}{{| m - n | (m + n)}^{1 / 2}}, W_{1 / 2} (m^{'}, n^{'}) = \frac{m^{'} n^{'}}{| m^{'} - n^{'} | {(m^{'} + n^{'})}^{1 / 2}} .

Form the ratio and simplify:

\begin{matrix} \frac{W_{1 / 2} (m, n)}{W_{1 / 2} (m^{'}, n^{'})} & = \frac{m n}{m^{'} n^{'}} \cdot \frac{| m^{'} - n^{'} |}{| m - n |} \cdot \frac{{(m^{'} + n^{'})}^{1 / 2}}{{(m + n)}^{1 / 2}} . \end{matrix}

Since

m = 3 m^{'} + 1

and

n = 3 n^{'} + 1

, we have

| m - n | = 3 | m^{'} - n^{'} |

and

m + n = 3 (m^{'} + n^{'}) + 2

. Hence

\frac{W_{1 / 2} (m, n)}{W_{1 / 2} (m^{'}, n^{'})} = \frac{m n}{m^{'} n^{'}} \cdot \frac{1}{3} \cdot \frac{{(m^{'} + n^{'})}^{1 / 2}}{{(3 (m^{'} + n^{'}) + 2)}^{1 / 2}} .

(49)

We now bound the three factors on the right-hand side.

(i) The product ratio. Using

m = 3 m^{'} + 1 \leq 4 m^{'}

and

n = 3 n^{'} + 1 \leq 4 n^{'}

for all

m^{'}, n^{'} \geq 1

, we get

\frac{m n}{m^{'} n^{'}} = \frac{(3 m^{'} + 1) (3 n^{'} + 1)}{m^{'} n^{'}} \leq 16 .

(ii) The difference ratio. We already used

| m - n | = 3 | m^{'} - n^{'} |

, so this contributes the exact factor

1 / 3

.

(iii) The sum ratio. Since

3 (m^{'} + n^{'}) + 2 \geq 3 (m^{'} + n^{'})

, we obtain

\frac{{(m^{'} + n^{'})}^{1 / 2}}{{(3 (m^{'} + n^{'}) + 2)}^{1 / 2}} \leq \frac{{(m^{'} + n^{'})}^{1 / 2}}{{(3 (m^{'} + n^{'}))}^{1 / 2}} = \frac{1}{\sqrt{3}} .

Combining (i)–(iii) in (49) yields

\frac{W_{1 / 2} (m, n)}{W_{1 / 2} (m^{'}, n^{'})} \leq 16 \cdot \frac{1}{3} \cdot \frac{1}{\sqrt{3}} = \frac{16}{3^{3 / 2}} = : C_{0} .

This proves (48).

For the consequence on the oscillatory part of the odd branch in the Lasota–Yorke estimate, recall the standard decomposition in the proof of Lemma 4.9: when both

m, n \in I_{j}

are in the active residue class

4 (mod 6)

, the

D_{1}

(oscillatory) term contributes

W_{1 / 2} (m, n) \frac{| f (m^{'}) - f (n^{'}) |}{m^{'}} .

Using (48) and the relation

m^{'} ≍ 6^{j - 1}

for

m \in I_{j}

, one passes from level j to level

j - 1

with a loss bounded by

C_{0}

; the block weight

ϑ^{j}

supplies the one-step factor

ϑ

, and restricting to the active residue class has relative density

1 / 6

, which produces a Cauchy–Schwarz gain

1 / \sqrt{6}

in the passage from a subset supremum to the block-level control (see the proof of Lemma 4.9 for the standard

L^{2}

averaging step). Altogether,

\sum_{j \geq 0} ϑ^{j} sup_{\begin{matrix} m, n \in I_{j} \\ m \equiv n \equiv 4 (6) \end{matrix}} W_{1 / 2} (m, n) \frac{| f (m^{'}) - f (n^{'}) |}{m^{'}} \leq \frac{C_{0}}{\sqrt{6}} ϑ {[f]}_{tree},

which is the claimed bound

λ_{odd} (\frac{1}{2}, ϑ) \leq (C_{0} / \sqrt{6}) ϑ

. □

Lemma 4.16

(Explicit odd-branch constant). For

α = \frac{1}{2}

and

ϑ = \frac{1}{5}

there exist constants

C_{α} > 0

and

C_{odd} > 0

such that for all

f \in B_{tree, σ}

,

{[P_{odd} f]}_{tree} \leq λ_{odd} (α, ϑ) {[f]}_{tree} + C_{odd} {∥ f ∥}_{σ},

(50)

with

λ_{odd} (α, ϑ) \leq \frac{C_{α}}{\sqrt{6}} ϑ < 1 .

(51)

Proof.

We specialize the proof of Lemma 4.9 to

α = \frac{1}{2}

and

ϑ = \frac{1}{5}

, making the constants explicit.

Recall

(P_{odd} f) (n) = 1_{{n \equiv 4 (6)}} \frac{f (\frac{n - 1}{3})}{(n - 1) / 3},

and for each

j \geq 0

,

A_{j} (f) : = sup_{\begin{matrix} m, n \in I_{j} \\ m \neq n \end{matrix}} W_{α} (m, n) |P_{odd} f (m) - P_{odd} f (n)|, {[P_{odd} f]}_{tree} = \sum_{j \geq 0} ϑ^{j} A_{j} (f),

where

I_{j} = [6^{j}, 2 \cdot 6^{j})

and

W_{α} (m, n) = \frac{m n}{{| m - n | (m + n)}^{α}}

. We take

α = \frac{1}{2}

from now on, so

W_{1 / 2} (m, n) = \frac{m n}{{| m - n | (m + n)}^{1 / 2}} .

Fix

j \geq 0

and

m, n \in I_{j}

,

m \neq n

. As in Lemma 4.9, we distinguish three cases.

Case 1: neither m nor n is

4 (mod 6)

. Then

P_{odd} f (m) = P_{odd} f (n) = 0

and this pair contributes nothing to

A_{j} (f)

.

Case 2: exactly one of

m, n

is

4 (mod 6)

. Assume without loss of generality

m \equiv 4 (mod 6)

and

n \neg \equiv 4 (mod 6)

. Set

k = (m - 1) / 3

. Then

P_{odd} f (m) - P_{odd} f (n) = \frac{f (k)}{k},

so

W_{1 / 2} (m, n) |P_{odd} f (m) - P_{odd} f (n)| = W_{1 / 2} (m, n) \frac{| f (k) |}{k} .

Since

m, n \in I_{j}

, we have

6^{j} \leq m, n < 2 \cdot 6^{j}

and

1 \leq | m - n | \leq 6^{j}

; hence

W_{1 / 2} (m, n) = \frac{m n}{{| m - n | (m + n)}^{1 / 2}} ≪ \frac{6^{2 j}}{6^{j} 6^{j / 2}} = 6^{(1 / 2) j} .

Also

k = (m - 1) / 3 ≍ 6^{j - 1}

. Thus for some absolute constant

C_{1}

,

ϑ^{j} W_{1 / 2} (m, n) \frac{| f (k) |}{k} \leq C_{1} {(ϑ 6^{1 / 2})}^{j} | f (k) | .

Now

ϑ = \frac{1}{5}

and

6^{1 / 2} < 2.5

, so

ϑ 6^{1 / 2} < 1

. Each k arises (from such a case) for at most one j and one m, and

| f (k) | = k^{σ} \frac{| f (k) |}{k^{σ}} \leq k^{σ} {∥ f ∥}_{σ} ≪ 6^{σ j} {∥ f ∥}_{σ} .

Summing over j and all such pairs gives

\sum_{j \geq 0} ϑ^{j} sup_{\begin{matrix} m, n \in I_{j} \\ exactly one \equiv 4 (6) \end{matrix}} W_{1 / 2} (m, n) |P_{odd} f (m) - P_{odd} f (n)| \leq C_{odd, 1} {∥ f ∥}_{σ}

for some

C_{odd, 1} > 0

depending only on

σ

. Thus Case 2 contributes only to the weak term.

Case 3: both m and n are

4 (mod 6)

. Set

m^{'} = \frac{m - 1}{3}, n^{'} = \frac{n - 1}{3} .

Then

P_{odd} f (m) = \frac{f (m^{'})}{m^{'}}, P_{odd} f (n) = \frac{f (n^{'})}{n^{'}} .

We decompose

\frac{f (m^{'})}{m^{'}} - \frac{f (n^{'})}{n^{'}} = \underset{= : D_{1}}{\underset{︸}{\frac{f (m^{'}) - f (n^{'})}{m^{'}}}} + \underset{= : D_{2}}{\underset{︸}{f (n^{'}) (\frac{1}{m^{'}} - \frac{1}{n^{'}})}} .

Case 3a: the

D_{1}

term (contraction part). We first compare the weights

W_{1 / 2} (m, n)

and

W_{1 / 2} (m^{'}, n^{'})

.

Using

m = 3 m^{'} + 1

,

n = 3 n^{'} + 1

we compute

\frac{W_{1 / 2} (m, n)}{W_{1 / 2} (m^{'}, n^{'})} = \frac{(3 m^{'} + 1) (3 n^{'} + 1)}{3 m^{'} n^{'}} \frac{{(m^{'} + n^{'})}^{1 / 2}}{{(3 (m^{'} + n^{'}) + 2)}^{1 / 2}} .

For all

m^{'}, n^{'} \geq 1

,

3 m^{'} + 1 \leq 4 m^{'}, 3 n^{'} + 1 \leq 4 n^{'}, 3 (m^{'} + n^{'}) + 2 \geq 3 (m^{'} + n^{'}),

so

\frac{W_{1 / 2} (m, n)}{W_{1 / 2} (m^{'}, n^{'})} \leq \frac{16}{3} \cdot \frac{1}{\sqrt{3}} = \frac{16}{3^{3 / 2}} = : C_{0} .

Thus

W_{1 / 2} (m, n) \frac{| f (m^{'}) - f (n^{'}) |}{m^{'}} \leq C_{0} W_{1 / 2} (m^{'}, n^{'}) \frac{| f (m^{'}) - f (n^{'}) |}{m^{'}} .

(52)

Next, since

m \in I_{j}

implies

m^{'} ≍ 6^{j - 1}

, we have

1 / m^{'} ≪ 6^{- (j - 1)}

. Moreover

(m^{'}, n^{'})

lie in a union of

O (1)

blocks of level

j - 1

(and possibly

j - 2

), so

W_{1 / 2} (m^{'}, n^{'}) | f (m^{'}) - f (n^{'}) | \leq ϑ^{- (j - 1)} {[f]}_{tree}

up to a fixed multiplicative constant (absorbed into

C_{0}

). Combining with (52),

ϑ^{j} W_{1 / 2} (m, n) \frac{| f (m^{'}) - f (n^{'}) |}{m^{'}} \leq C_{0} ϑ^{j} 6^{- (j - 1)} ϑ^{- (j - 1)} {[f]}_{tree} = C_{0} ϑ {(\frac{ϑ}{6})}^{j - 1} {[f]}_{tree} .

Summing over

j \geq 1

gives

\sum_{j \geq 0} ϑ^{j} A_{j}^{(1)} (f) \leq \frac{C_{0} ϑ}{1 - ϑ / 6} {[f]}_{tree} .

Define

λ_{odd} : = \frac{C_{0} ϑ}{1 - ϑ / 6} and C_{α} : = \frac{\sqrt{6} C_{0}}{1 - ϑ / 6} .

Then

λ_{odd} = \frac{C_{α}}{\sqrt{6}} ϑ .

For

ϑ = \frac{1}{5}

we have

1 - ϑ / 6 = 1 - \frac{1}{30} > 0

and numerically

C_{0} = \frac{16}{3^{3 / 2}} < 3.1, λ_{odd} = \frac{C_{0} ϑ}{1 - ϑ / 6} < 0.64 < 1,

so indeed

λ_{odd} < 1

and

λ_{odd} = (C_{α} / \sqrt{6}) ϑ

with this choice of

C_{α}

.

Case 3b: the

D_{2}

term (weak contribution). We have

| D_{2} | = | f (n^{'}) | \frac{| m^{'} - n^{'} |}{m^{'} n^{'}} .

Using

| m - n | = 3 | m^{'} - n^{'} |

and the same scale relations as above,

W_{1 / 2} (m, n) | D_{2} | = \frac{m n}{{| m - n | (m + n)}^{1 / 2}} | f (n^{'}) | \frac{| m^{'} - n^{'} |}{m^{'} n^{'}} ≪ 6^{- j / 2} | f (n^{'}) | .

Thus

ϑ^{j} W_{1 / 2} (m, n) | D_{2} | ≪ {(ϑ 6^{- 1 / 2})}^{j} | f (n^{'}) | .

Each

n^{'}

arises from at most a bounded number of

(m, n, j)

, and

ϑ 6^{- 1 / 2} < 1

, so summing over j and using

| f (n^{'}) | \leq n^{' σ} {∥ f ∥}_{σ}

yields

\sum_{j \geq 0} ϑ^{j} sup_{\begin{matrix} m, n \in I_{j} \\ m \equiv n \equiv 4 (6) \end{matrix}} W_{1 / 2} (m, n) | D_{2} | \leq C_{odd, 2} {∥ f ∥}_{σ}

for some

C_{odd, 2} > 0

. Combining the three cases, we obtain

{[P_{odd} f]}_{tree} \leq λ_{odd} {[f]}_{tree} + (C_{odd, 1} + C_{odd, 2}) {∥ f ∥}_{σ} .

Setting

C_{odd} : = C_{odd, 1} + C_{odd, 2}

and using the explicit expression

λ_{odd} = (C_{α} / \sqrt{6}) ϑ

with

λ_{odd} < 1

for

(α, ϑ) = (\frac{1}{2}, \frac{1}{5})

gives (50) and (51). □

Proposition 4.17

(Verified Lasota–Yorke contraction). Let

(α, ϑ) = (\frac{1}{2}, \frac{1}{5})

and

σ > 1

(with the admissibility condition

ϑ 6^{σ - α} < 1

). Define

λ_{LY} : = 2^{- (1 - α)} + λ_{odd} (α, ϑ), λ_{odd} (α, ϑ) \leq \frac{C_{0}}{\sqrt{6}} ϑ,

with

C_{0} = 16 / 3^{3 / 2}

from Lemma 4.15. Then

λ_{LY} < 1

, and for all

f \in B_{tree, σ}

,

{[P f]}_{tree} \leq λ_{LY} {[f]}_{tree} + C_{LY} {∥ f ∥}_{σ},

(53)

for some constant

C_{LY} > 0

depending only on the fixed parameters and the block geometry.

Proof.

We use the decomposition

P = P_{even} + P_{odd}

and the branchwise estimates already established.

1. Combine even and odd branch inequalities. For any

f \in B_{tree, σ}

,

{[P f]}_{tree} \leq {[P_{even} f]}_{tree} + {[P_{odd} f]}_{tree} .

By the even-branch Lasota–Yorke estimate (Lemma 4.7, specialized to

B_{tree, σ}

), there exists

C_{even} > 0

such that for

(α, ϑ)

fixed,

{[P_{even} f]}_{tree} \leq 2^{- (1 - α)} ϑ {[f]}_{tree} + C_{even} {∥ f ∥}_{σ} .

(54)

By the explicit odd-branch lemma (Lemma 4.16), for

α = \frac{1}{2}

and

ϑ = \frac{1}{5}

there exist

C_{α} > 0

and

C_{odd} > 0

such that

{[P_{odd} f]}_{tree} \leq λ_{odd} (α, ϑ) {[f]}_{tree} + C_{odd} {∥ f ∥}_{σ},

(55)

with

λ_{odd} (α, ϑ) \leq \frac{C_{α}}{\sqrt{6}} ϑ < 1 .

Adding (54) and (55) gives

{[P f]}_{tree} \leq (2^{- (1 - α)} ϑ + λ_{odd} (α, ϑ)) {[f]}_{tree} + (C_{even} + C_{odd}) {∥ f ∥}_{σ} .

Define

λ_{LY} : = 2^{- (1 - α)} ϑ + λ_{odd} (α, ϑ), C_{LY} : = C_{even} + C_{odd},

to obtain (53).

2. Verification that $λ_{LY} < 1$ . We now check that with

(α, ϑ) = (\frac{1}{2}, \frac{1}{5})

the constant

λ_{LY}

is strictly less than 1.

First,

2^{- (1 - α)} ϑ = 2^{- 1 / 2} \cdot \frac{1}{5} = \frac{1}{5 \sqrt{2}} \approx 0.1414 .

From the proof of Lemma 4.16 we have

λ_{odd} (α, ϑ) = \frac{C_{α}}{\sqrt{6}} ϑ,

with an explicit choice

C_{α} = \frac{\sqrt{6} C_{0}}{1 - ϑ / 6}, C_{0} = \frac{16}{3^{3 / 2}},

so that

λ_{odd} (α, ϑ) = \frac{C_{0} ϑ}{1 - ϑ / 6} .

For

ϑ = \frac{1}{5}

this yields

λ_{odd} (\frac{1}{2}, \frac{1}{5}) = \frac{C_{0} / 5}{1 - 1 / 30} = \frac{C_{0}}{5} \cdot \frac{30}{29} = \frac{6 C_{0}}{29} .

Since

C_{0} = 16 / 3^{3 / 2} < 3.1

, we obtain

λ_{odd} (\frac{1}{2}, \frac{1}{5}) < \frac{6 \cdot 3.1}{29} \approx 0.641 < 1 .

Therefore

λ_{LY} = 2^{- 1 / 2} \cdot \frac{1}{5} + λ_{odd} (\frac{1}{2}, \frac{1}{5}) < 0.1414 + 0.641 < 0.79 < 1 .

In particular,

λ_{LY}

is a strict contraction factor, depending only on the fixed parameters.

This proves both the inequality (53) and the bound

λ_{LY} < 1

. □

Lemma 4.18

(Asymptotic form of the invariant density). Let P act on

B_{tree, σ}

with

σ > 1

and suppose P is quasi–compact with spectral gap and no other spectrum on the unit circle. Let

h \in B_{tree, σ}

be the unique positive right eigenvector with

P h = h

and normalize the dual eigenfunctional ϕ by

ϕ (h) = 1

. Then there exist constants

c > 0

and

δ > 0

(depending only on the parameters of the Lasota–Yorke framework) such that

h (n) = \frac{c}{n} (1 + O (n^{- δ})) (n \to \infty) .

Proof.

Set

H (s) : = \sum_{n \geq 1} h (n) n^{- s}

for

ℜ (s) > σ

. We proceed in three steps.

Step 1 (Meromorphic structure of H and the pole at

s = 1

). By the Dirichlet transform intertwinement (Section 3) and the quasi–compact spectral calculus on

B_{tree, σ}

(Section 4), Dirichlet transforms of

B_{tree, σ}

-functions admit meromorphic continuation across a half–plane

ℜ (s) > 1 - δ_{0}

for some

δ_{0} \in (0, 1)

, with at most a simple pole at

s = 1

whose residue is computed by the spectral projector

Π f = ϕ (f) h

. Applying this to

f = h

and using

P h = h

, we obtain that H extends meromorphically to

ℜ (s) > 1 - δ_{0}

with the expansion

H (s) = \frac{c}{s - 1} + G (s), ℜ (s) > 1 - δ_{0},

(56)

where

c : = ϕ (1) > 0

and G is holomorphic on

ℜ (s) > 1 - δ_{0}

and of at most polynomial growth in vertical strips.1

Step 2 (Tauberian step: summatory asymptotic). Define the summatory function

H^{#} (x) : = \sum_{n \leq x} h (n)

. Since H has no singularities on

{ℜ (s) = 1}

other than the simple pole at

s = 1

and satisfies the growth hypothesis of the Wiener–Ikehara–Delange Tauberian theorem [12] in the half–plane

ℜ (s) > 1 - δ_{0}

, it follows that

H^{#} (x) = c log x + C_{0} + O (x^{- δ_{1}}) (x \to \infty),

(57)

for some constants

C_{0} \in R

and

δ_{1} \in (0, δ_{0})

(the precise

δ_{1}

is inherited from the width

δ_{0}

and strip–growth of G). See, e.g., Delange’s theorem or the Ikehara–Ingham variant.

Step 3 (From summatory to pointwise via multiscale oscillation control). Write

a_{n} : = n h (n)

and let

X > 1

. For each dyadic–triadic block

I_{j} = [6^{j}, 2 \cdot 6^{j})

defining the strong seminorm

{[\cdot]}_{tree, σ}

, the Lasota–Yorke inequality yields a uniform oscillation bound

{osc}_{I_{j}} (a) : = sup_{n, m \in I_{j}} | a_{n} - a_{m} | \leq C 6^{- j η}

(58)

for some

C > 0

and

η \in (0, 1)

depending only on the Lasota–Yorke parameters (this is the standard consequence of the contraction of the strong seminorm together with boundedness in the weak norm). In particular

a_{n}

varies slowly on each block

I_{j}

.

By summation by parts on each

I_{j}

and (57), we obtain the averaged estimate

\frac{1}{| I_{j} |} \sum_{n \in I_{j}} a_{n} = \frac{1}{| I_{j} |} \sum_{n \in I_{j}} n h (n) = c + O (6^{- j δ_{1}}) .

Combining this block average with the oscillation control (58) gives, for every

n \in I_{j}

,

a_{n} = c + O (6^{- j δ}), δ : = min {δ_{1}, η} .

Since

n ≍ 6^{j}

on

I_{j}

, this is equivalent to

n h (n) = c + O (n^{- δ}),

hence

h (n) = \frac{c}{n} (1 + O (n^{- δ})),

as claimed. □

We now record the standard consequence of the Lasota–Yorke inequality and the compact embedding of

B_{tree}

into

ℓ^{1}

.

Theorem 4.19

(Quasi-compactness on

B_{tree, σ}

). Let

0 < α < 1

,

0 < ϑ < 1

, and

σ > 1

. Assume that the Lasota–Yorke constant

λ (α, ϑ) : = 2^{- (1 - α)} + λ_{odd} (α, ϑ)

satisfies

λ (α, ϑ) < 1

, where

λ_{odd} (α, ϑ)

is as in Lemma 4.9. Then the backward transfer operator P acting on

B_{tree, σ}

is quasi-compact, and its essential spectral radius satisfies

ρ_{ess} (P |_{B_{tree, σ}}) \leq λ (α, ϑ) < 1 .

(59)

Proof.

We work on the Banach space

B_{tree, σ}

with norm

{∥ \cdot ∥}_{tree, σ} = {∥ \cdot ∥}_{σ} + {[\cdot]}_{tree}

, where

{∥ \cdot ∥}_{σ}

is the weighted

ℓ_{σ}^{1}

-norm and

{[\cdot]}_{tree}

is the tree seminorm defined in Section 4.3.

Step 1: Lasota–Yorke inequality. By Proposition 4.12 (applied in the weighted setting, with

{∥ f ∥}_{1}

replaced by

{∥ f ∥}_{σ}

) we have, for all

f \in B_{tree, σ}

,

{[P f]}_{tree} \leq λ (α, ϑ) {[f]}_{tree} + C_{LY} {∥ f ∥}_{σ},

(60)

with

λ (α, ϑ) < 1

by assumption. On the weak norm side, since P is bounded on

ℓ_{σ}^{1}

, there exists

C_{σ} > 0

(e.g.

C_{σ} = Λ_{σ}

from (17)) such that

{∥ P f ∥}_{σ} \leq C_{σ} {∥ f ∥}_{σ} for all f \in B_{tree, σ} .

(61)

Thus P satisfies a standard two-norm Lasota–Yorke inequality on

B_{tree, σ}

with strong seminorm

{∥ \cdot ∥}_{s} : = {[\cdot]}_{tree}

and weak norm

{∥ \cdot ∥}_{w} : = {∥ \cdot ∥}_{σ}

:

{∥ P f ∥}_{s} \leq {λ ∥ f ∥}_{s} + C_{LY} {∥ f ∥}_{w} {, ∥ P f ∥}_{w} \leq C_{σ} {∥ f ∥}_{w} .

(62)

Step 2: Compact embedding. By Lemma 4.5, the embedding

J : (B_{tree, σ} {, ∥ \cdot ∥}_{tree, σ}) ↪ (ℓ_{σ}^{1} {, ∥ \cdot ∥}_{σ})

is compact. Since

{∥ \cdot ∥}_{w} = {∥ \cdot ∥}_{σ}

is exactly the weak norm used in (62), this shows that the unit ball of

B_{tree, σ}

is relatively compact for the weak norm.

Step 3: Application of Ionescu–Tulcea–Marinescu / Hennion. We now invoke the standard quasi-compactness criterion (see, e.g., Ionescu–Tulcea and Marinescu, or Hennion’s theorem): if a bounded operator T on a Banach space X satisfies

(i): a Lasota–Yorke inequality ${∥ T x ∥}_{s} \leq {λ ∥ x ∥}_{s} + C {∥ x ∥}_{w}$ with $λ < 1$ ,
(ii): a weak bound ${∥ T x ∥}_{w} \leq C^{'} {∥ x ∥}_{w}$ , and
(iii): the injection ${(X, ∥ \cdot ∥}_{s} {) ↪ (X, ∥ \cdot ∥}_{w})$ has relatively compact unit ball,

then T is quasi-compact on X and its essential spectral radius satisfies

ρ_{ess} (T) \leq λ .

Conditions (i)–(iii) are exactly (62) and Lemma 4.5 for

T = P

and

X = B_{tree, σ}

. Therefore P is quasi-compact on

B_{tree, σ}

and

ρ_{ess} (P |_{B_{tree, σ}}) \leq λ (α, ϑ) < 1,

which is (59). □

Remark 4.20

(On the choice of parameters). The explicit bound (41) shows that

λ_{odd} (α, ϑ)

decreases linearly with

ϑ

. For fixed

α

, one can therefore choose

ϑ

sufficiently small so that

λ (α, ϑ) < 1

, provided the constant

C_{α}

is effectively controlled. Subsequent sections make this optimization quantitative by computing

C_{α}

and exhibiting admissible parameter pairs

(α, ϑ)

that give a strict spectral gap.

The Lasota–Yorke framework developed here supplies the functional-analytic backbone for the spectral approach to the Collatz problem: once explicit parameters with

λ (α, ϑ) < 1

are verified, the quasi-compactness and spectral gap of P on

B_{tree}

follow, and the spectral criteria of Section 4 can be invoked to constrain or rule out non-terminating configurations.

5. Spectral Consequences and Effective Block Recursion

Having established in Section 4.4 that the backward Collatz operator P is quasi-compact on the multi-scale tree space

B_{tree}

, we now turn to the spectral consequences of this result. The Lasota–Yorke inequality ensures the existence of a spectral gap, which in turn controls the structure of invariant densities and the long-term behavior of iterates

P^{k}

. The objective of this section is to characterize the invariant and quasi-invariant components of P, derive an effective block recursion for their scale-averaged coefficients, and demonstrate that the recursion enforces rigidity across the Collatz tree.

Throughout this section,

h \in B_{tree, σ}

will denote an invariant density of P, i.e. a function satisfying

P h = h

. The analysis proceeds in several stages. First, we describe the structure of possible invariant profiles in the multiscale framework and show that the Lasota–Yorke inequality forces uniform flatness across scales. Next, we translate this flatness into an explicit two-sided recurrence relation for block averages

c_{j}

. Finally, we verify that the coefficients of this recurrence satisfy a spectral bound consistent with the contraction constant

λ_{odd} (α, ϑ)

computed earlier.

Theorem 5.1

(Perron–Frobenius structure on

B_{tree, σ}

). Let P be the backward Collatz transfer operator acting on

B_{tree, σ}

with parameters

(α, ϑ, σ)

chosen so that the Lasota–Yorke inequality and quasi–compactness hold. Then:

1.: The spectral radius of P equals 1, and 1 is a simple eigenvalue.
2.: There exists a unique eigenvector $h \in B_{tree, σ}$ with $h > 0$ and $P h = h$ , normalized by $ϕ (h) = 1$ .
3.: There exists a unique positive eigenfunctional $ϕ \in B_{tree, σ}^{*}$ such that $ϕ \circ P = ϕ$ .
4.: All other spectral values satisfy $| z | < 1$ , and P admits the spectral decomposition

$P = h \otimes ϕ + Q, ρ (Q) < 1,$

where Q is quasi–compact.

Proof.

We combine the Lasota–Yorke inequality on

B_{tree, σ}

with standard Perron–Frobenius theory for positive quasi–compact operators.

Step 1: Spectral radius and quasi–compactness. By construction P is a bounded linear operator on

B_{tree, σ}

and is positive in the sense that

f \geq 0

implies

P f \geq 0

. The Lasota–Yorke inequality on

B_{tree, σ}

(Proposition 4.12, say) together with the compact embedding of the strong seminorm into the weak norm implies that P is quasi–compact on

B_{tree, σ}

with essential spectral radius strictly less than 1:

ρ_{ess} (P) < 1 .

(63)

On the other hand, the logarithmic mass–preservation identity (Lemma 2.4) shows that the spectral radius of P is at least 1; the boundedness of P implies

ρ (P) \leq 1

, hence

ρ (P) = 1 .

(64)

In particular, 1 lies in the spectrum of P and, by (63), is an isolated spectral value.

Step 2: Existence of a positive eigenvector. Consider the positive cone

C : = {f \in B_{tree, σ} : f \geq 0},

which is closed, convex, and reproducing. Since P is positive and

ρ (P) = 1

, the Krein–Rutman theorem for positive operators on Banach spaces implies the existence of a nonzero

h \in C

such that

P h = h .

(65)

Moreover, h can be chosen strictly positive in the sense that

h (n) > 0

for all

n \in N

: indeed, by the preimage structure of the Collatz map (Lemma 2.3) and the connectivity of the backward tree, any nontrivial

f \in C

is eventually propagated by iterates of P to a function that is positive on every block

I_{j}

, so

P^{k} f > 0

for all sufficiently large k. Replacing h by

P^{k} h

if necessary yields

h > 0

.

Step 3: Uniqueness and simplicity of the eigenvalue 1. We now show that 1 is a simple eigenvalue and that h is unique up to scalar multiples. Suppose

g \in B_{tree, σ}

satisfies

P g = g

. Decompose

g = g^{+} - g^{-}

into positive parts. Positivity of P implies

P g^{\pm} = g^{\pm}

. By the strong positivity argument above, any nonzero

f \in C

with

P f = f

must be strictly positive; hence

g^{+}

and

g^{-}

are both either 0 or strictly positive. If both were nonzero, then

g^{+}

and

g^{-}

would be linearly independent positive eigenvectors for the eigenvalue 1, and the positive cone would contain a two-dimensional face of eigenvectors. This contradicts the Krein–Rutman conclusion that the eigenspace associated with the spectral radius is one–dimensional. Therefore one of

g^{+}, g^{-}

must vanish and g is either nonnegative or nonpositive; by replacing g by

- g

if necessary,

g \geq 0

, and the strong positivity then forces g to be a scalar multiple of h. Thus the eigenspace for the eigenvalue 1 is one–dimensional and spanned by h, and 1 is a simple eigenvalue. This proves (1) and the first part of (2) after normalizing by

ϕ (h) = 1

below.

Step 4: Dual eigenfunctional. Consider the dual operator

P^{*}

acting on

B_{tree, σ}^{*}

. Since P is positive, so is

P^{*}

on the dual cone

C^{*} : = {ψ \in B_{tree, σ}^{*} : ψ (f) \geq 0 for all f \in C} .

The quasi–compactness of P implies quasi–compactness of

P^{*}

on the dual space. By (64),

P^{*}

also has spectral radius 1. Applying the same Krein–Rutman argument to

P^{*}

yields a nonzero

ϕ \in C^{*}

and

ϕ \circ P = ϕ,

(66)

with

ϕ

strictly positive on nonzero elements of

C

. The same simplicity argument as in Step 3 shows that the eigenspace of

P^{*}

for the eigenvalue 1 is one–dimensional and spanned by

ϕ

. Normalizing by the condition

ϕ (h) = 1

gives the uniquely determined eigenpair

(h, ϕ)

appearing in the statement. This establishes (2) and (3).

Step 5: Spectral decomposition and spectral gap. Quasi–compactness of P on

B_{tree, σ}

, together with (63) and the simplicity of the eigenvalue 1, implies that the spectrum of P is contained in

{1} \cup {z : | z | < r}

for some

r < 1

. Let

Π

denote the spectral projection onto the eigenspace associated with

λ = 1

; by the previous steps,

Π f = h ϕ (f), f \in B_{tree, σ},

so that

Π = h \otimes ϕ

as a rank–one operator. Writing

P = Π + Q = h \otimes ϕ + Q,

(67)

we have

Q = P - Π

and

Q Π = Π Q = 0

. The spectrum of Q is contained in

{z : | z | < r}

, so in particular

ρ (Q) < 1 .

Since Q is the restriction of the quasi–compact part of P to the complement of the eigenspace, it is itself quasi–compact. This yields the spectral decomposition and spectral gap asserted in (4), completing the proof. □

Proposition 5.2

(Forward dynamics and P-invariant functionals). Let

0 < α, ϑ < 1

and

σ > 1

. Consider the pairing

〈 f, φ 〉 : = \sum_{n \geq 1} f (n) φ (n)

between

B_{tree, σ}

and

B_{tree, σ}^{*} : = \{φ : N \to {C : ∥ φ ∥}_{*} : = sup_{j \geq 0} (ϑ^{j} {osc}_{I_{j}} φ) + sup_{j \geq 0} (6^{- σ j} \sum_{n \in I_{j}} | φ (n) |) < \infty\},

where

{osc}_{I_{j}} φ : = {sup}_{m, n \in I_{j}} | φ (m) - φ (n) |

. Then

〈 \cdot, \cdot 〉

extends continuously to

B_{tree, σ} \times B_{tree, σ}^{*}

, and the adjoint

(P^{*} φ) (m) = \frac{1}{m} (1_{{2 ∣ m}} φ (m / 2) + 1_{{m odd}} φ (3 m + 1)) .

(68)

Moreover, there exist constants

C_{σ} > 0

and

M_{σ} \geq 1

such that

∥ {(P^{*})}^{k} ∥_{B_{tree, σ}^{*} \to B_{tree, σ}^{*}} \leq C_{σ} M_{σ}^{k}, k \geq 0,

(69)

and the Cesàro averages

Φ_{N} : = \frac{1}{N} \sum_{k = 0}^{N - 1} {(P^{*})}^{k} φ

form a bounded set in

B_{tree, σ}^{*}

for every

φ \in B_{tree, σ}^{*}

.

Positive-frequency divergent families.Suppose there exist

c > 0

and an infinite set of scales

J \subset N

such that for each

j \in J

there is a finite set

A_{j} \subset I_{j}

with

| A_{j} | \geq c | I_{j} |

and forward trajectories that visit

A_{j}

with asymptotic frequency

\geq c

. For a summable weight sequence

{(w_{j})}_{j \geq 0}

with

\sum_{j} w_{j} ϑ^{j} < \infty

and

\sum_{j} w_{j} 6^{- σ j} < \infty

, define

φ_{j} (n) : = \frac{w_{j}}{| A_{j} |} 1_{A_{j}} (n), φ : = \sum_{j \in J} φ_{j} .

Then

φ \in B_{tree, σ}^{*}

, the Cesàro averages

Φ_{N}

are bounded in

B_{tree, σ}^{*}

, and any weak-* limit point Φ satisfies

P^{*} Φ = Φ

and

Φ \neq 0

. Consequently

ℓ (f) : = 〈 f, Φ 〉

is a nonzero invariant functional with

ℓ \circ P = ℓ

.

Proof.

Continuity of the pairing. Fix j and set

c_{j} : = {| I_{j} |}^{- 1} \sum_{n \in I_{j}} f (n)

and

φ_{I_{j}} : = {| I_{j} |}^{- 1} \sum_{n \in I_{j}} φ (n)

. Then

\sum_{n \in I_{j}} f (n) φ (n) = \sum_{n \in I_{j}} (f (n) - c_{j}) (φ (n) - φ_{I_{j}}) + c_{j} \sum_{n \in I_{j}} φ (n) .

(a) Oscillatory term. Using

\sum_{I_{j}} (f - c_{j}) = 0

and

{osc}_{I_{j}} φ : = {sup}_{u, v \in I_{j}} | φ (u) - φ (v) |

,

|\sum_{n \in I_{j}} (f (n) - c_{j}) (φ (n) - φ_{I_{j}})| \leq {osc}_{I_{j}} φ \sum_{n \in I_{j}} | f (n) - c_{j} | .

By the tree seminorm and the block geometry (since

W_{α} ≍ 6^{(1 - α) j}

on

I_{j}

),

{osc}_{I_{j}} f \leq K_{α} ϑ^{- j} 6^{- (1 - α) j} {[f]}_{tree}, \sum_{n \in I_{j}} | f (n) - c_{j} | \leq | I_{j} | {osc}_{I_{j}} f \leq C ϑ^{- j} 6^{- α j} {[f]}_{tree} .

Therefore

|\sum_{n \in I_{j}} (f (n) - c_{j}) (φ (n) - φ_{I_{j}})| \leq C ϑ^{- j} 6^{- α j} {[f]}_{tree} {osc}_{I_{j}} φ .

Multiply and divide by

ϑ^{j}

and take

{sup}_{j} ϑ^{j} {osc}_{I_{j}} φ

to get

\sum_{j \geq 0} |\sum_{I_{j}} (f - c_{j}) (φ - φ_{I_{j}})| \leq C {[f]}_{tree} sup_{j \geq 0} (ϑ^{j} {osc}_{I_{j}} φ) \sum_{j \geq 0} ϑ^{- 2 j} 6^{- α j} .

Since

α > 0

, we can absorb

\sum_{j} ϑ^{- 2 j} 6^{- α j}

into the constant (using that

ϑ \in (0, 1)

is fixed), hence

\sum_{j \geq 0} |\sum_{I_{j}} (f - c_{j}) (φ - φ_{I_{j}})| \leq C {[f]}_{tree} {∥ φ ∥}_{*} .

(b) Mean term. By averaging and the weighted norm,

| c_{j} | \leq \frac{1}{| I_{j} |} \sum_{n \in I_{j}} | f (n) | \leq \frac{1}{| I_{j} |} \sum_{n \in I_{j}} n^{σ} \frac{| f (n) |}{n^{σ}} \leq C 6^{(σ - 1) j} {∥ f ∥}_{ℓ_{σ}^{1}} .

Hence

|c_{j} \sum_{n \in I_{j}} φ (n)| \leq C 6^{(σ - 1) j} {∥ f ∥}_{ℓ_{σ}^{1}} (6^{σ j} 6^{- σ j} \sum_{I_{j}} | φ |) \leq C 6^{- j} {∥ f ∥}_{ℓ_{σ}^{1}} sup_{j \geq 0} (6^{- σ j} \sum_{I_{j}} | φ |) .

Summing over j gives a finite geometric series:

\sum_{j \geq 0} |c_{j} \sum_{I_{j}} φ| \leq {C ∥ f ∥}_{ℓ_{σ}^{1}} {∥ φ ∥}_{*} .

Combining (a) and (b) yields

| 〈 f, φ 〉 | \leq C ({[f]}_{tree} + {∥ f ∥}_{ℓ_{σ}^{1}}) {∥ φ ∥}_{*} = {C ∥ f ∥}_{tree, σ} {∥ φ ∥}_{*} .

□

5.1. Redesigned Multiscale Space and Invariant Profiles

The quasi-compactness of P implies that its spectrum consists of a discrete set of eigenvalues of finite multiplicity outside a disk of radius

ρ_{ess} (P) \leq λ_{LY} < 1

, together with a residual spectrum contained in that disk. Let

λ_{0} = 1

denote the trivial eigenvalue corresponding to constant functions. Any additional eigenvalues with

| λ | < 1

correspond to exponentially decaying modes. Thus, an invariant density h satisfying

P h = h

must lie in the one-dimensional eigenspace associated with

λ_{0}

, provided no unit-modulus spectrum remains.

However, to make this conclusion effective, one must exclude the possibility of small oscillatory components that project into higher spectral modes but decay too slowly to be detected by the weak

ℓ^{1}

norm alone. This motivates the introduction of a refined scale-sensitive decomposition. Define block intervals

I_{j}

as in (34), and let

H_{j} (h) : = \sum_{n \in I_{j}} h (n), c_{j} : = \frac{H_{j} (h)}{| I_{j} |} = \frac{H_{j} (h)}{6^{j}} .

(70)

The sequence

{(c_{j})}_{j \geq 0}

captures the mean behavior of h across successive scales in the backward tree. Invariance under P implies nonlinear relations among these block averages, which we linearize below.

Lemma 5.3

(Block-level invariance relation). Let

0 < α < 1

,

0 < ϑ < 1

, and

σ > 1

, and let

h \in B_{tree, σ}

satisfy

P h = h

. For each

j \geq 0

define the block average

c_{j} : = \frac{1}{| I_{j} |} \sum_{n \in I_{j}} h (n), | I_{j} | : = # I_{j} .

Then there exist sequences

{(a_{j})}_{j \geq 0}

,

{(b_{j})}_{j \geq 0}

with

a_{j}, b_{j} \geq 0

and a sequence

{(ε_{j})}_{j \geq 0}

such that

c_{j} = a_{j} c_{j + 1} + b_{j} c_{j - 1} + ε_{j},

(71)

where

a_{j}

and

b_{j}

are determined by the local distribution of even and odd preimages between neighboring scales, and the error sequence

ε = (ε_{j})

is summable in the weighted norm, i.e.

\sum_{j \geq 0} ϑ^{j} | ε_{j} | < \infty .

(72)

Proof.

Throughout, fix

h \in B_{tree, σ}

with

P h = h

.

1. Start from the invariance equation on each block. For each

j \geq 0

,

| I_{j} | c_{j} = \sum_{n \in I_{j}} h (n) = \sum_{n \in I_{j}} (P h) (n) = \sum_{n \in I_{j}} (\frac{h (2 n)}{2 n} + 1_{{n \equiv 4 (6)}} \frac{h (\frac{n - 1}{3})}{(n - 1) / 3}) .

Write

S_{j}^{even} : = \sum_{n \in I_{j}} \frac{h (2 n)}{2 n}, S_{j}^{odd} : = \sum_{\begin{matrix} n \in I_{j} \\ n \equiv 4 (6) \end{matrix}} \frac{h (\frac{n - 1}{3})}{(n - 1) / 3},

so that

| I_{j} | c_{j} = S_{j}^{even} + S_{j}^{odd} .

(73)

We now approximate

S_{j}^{even}

and

S_{j}^{odd}

in terms of neighboring block averages, with all discrepancies absorbed in

ε_{j}

.

2. Even branch contribution. For

n \in I_{j}

, the even preimage is

m = 2 n

, and

S_{j}^{even} = \sum_{n \in I_{j}} \frac{h (2 n)}{2 n} = \sum_{m \in 2 I_{j}} \frac{h (m)}{m},

where

2 I_{j} : = {2 n : n \in I_{j}}

. The set

2 I_{j}

lies in a bounded union of intervals whose lengths are comparable to

| I_{j} |

and whose positions are comparable (on a logarithmic scale) to some neighboring block

I_{j + 1}

. We decompose

h (m) = c_{j + 1} + (h (m) - c_{j + 1})

for those m whose scale is that of

I_{j + 1}

, and similarly for indices belonging to at most finitely many adjacent blocks. This yields

S_{j}^{even} = a_{j}^{(even)} | I_{j} | c_{j + 1} + R_{j}^{even},

(74)

where

a_{j}^{(even)} : = \frac{1}{| I_{j} |} \sum_{n \in I_{j}} \frac{1}{2 n} 1_{{2 n lies in the next scale block (s)}},

and

R_{j}^{even}

collects:

(i): contributions from $h (m) - c_{k}$ within the relevant blocks,
(ii): contributions from even preimages m falling outside the chosen neighboring blocks.

Because

h \in B_{tree, σ}

, its oscillation inside each block is controlled by

{[h]}_{tree}

, so replacing

h (m)

by the corresponding block average

c_{k}

incurs an error bounded by

| h (m) - c_{k} | ≪ \frac{{[h]}_{tree}}{W_{α} (m_{1}, m_{2})}

for suitable

m_{1}, m_{2}

in that block; the precise bound is obtained by choosing

m_{1}, m_{2}

maximizing the tree seminorm at that scale and using the definition of

{[h]}_{tree}

. After dividing by m (which is

≫ 6^{j}

at this scale) and averaging over

I_{j}

, we get

| R_{j}^{even} | ≪ 6^{- j} {[h]}_{tree} + 6^{- j σ} {∥ h ∥}_{σ},

where the second term accounts for the finitely many preimages lying outside the neighboring blocks, using the weighted

ℓ_{σ}^{1}

bound on h. Thus

\sum_{j \geq 0} ϑ^{j} | R_{j}^{even} | < \infty .

(75)

By construction

a_{j}^{(even)} \geq 0

.

3. Odd branch contribution. For

n \equiv 4 (mod 6)

, the odd preimage is

m^{'} = (n - 1) / 3

, and

S_{j}^{odd} = \sum_{\begin{matrix} n \in I_{j} \\ n \equiv 4 (6) \end{matrix}} \frac{h (m^{'})}{m^{'}} .

As above, all such

m^{'}

lie at scale comparable to

I_{j - 1}

, up to a bounded distortion which is independent of j. We write

h (m^{'}) = c_{j - 1} + (h (m^{'}) - c_{j - 1}),

and obtain

S_{j}^{odd} = b_{j}^{(odd)} | I_{j} | c_{j - 1} + R_{j}^{odd},

(76)

where

b_{j}^{(odd)} : = \frac{1}{| I_{j} |} \sum_{\begin{matrix} n \in I_{j} \\ n \equiv 4 (6) \end{matrix}} \frac{1}{(n - 1) / 3},

and

R_{j}^{odd}

collects:

(i): the errors from replacing $h (m^{'})$ by $c_{j - 1}$ ,
(ii): any edge effects from $m^{'}$ lying just outside $I_{j - 1}$ .

All indices m whose images under the even/odd branches land outside the adjacent blocks are absorbed into

R_{j}^{even}

and

R_{j}^{odd}

; these edge spillovers are

ϑ

-summable thanks to

σ > 1

and the block oscillation control from

{[h]}_{tree}

.

As before, the tree seminorm controls oscillations within blocks, so

| h (m^{'}) - c_{j - 1} |

is bounded by a multiple of

{[h]}_{tree}

times a scale factor, and dividing by

m^{'} ≍ 6^{j - 1}

yields

| R_{j}^{odd} | ≪ 6^{- j} {[h]}_{tree} + 6^{- j σ} {∥ h ∥}_{σ} .

Thus

\sum_{j \geq 0} ϑ^{j} | R_{j}^{odd} | < \infty .

(77)

By construction

b_{j}^{(odd)} \geq 0

.

4. Assemble the block relation. Substituting (74) and (76) into (73), we obtain

| I_{j} | c_{j} = a_{j}^{(even)} | I_{j} | c_{j + 1} + b_{j}^{(odd)} | I_{j} | c_{j - 1} + R_{j}^{even} + R_{j}^{odd} .

Dividing by

| I_{j} |

gives

c_{j} = a_{j}^{(even)} c_{j + 1} + b_{j}^{(odd)} c_{j - 1} + ε_{j},

where

ε_{j} : = \frac{R_{j}^{even} + R_{j}^{odd}}{| I_{j} |} .

Set

a_{j} : = a_{j}^{(even)}

and

b_{j} : = b_{j}^{(odd)}

. By construction

a_{j}, b_{j} \geq 0

, and they encode the (normalized) weights of even and odd preimages between the neighboring scales. Moreover, using

| I_{j} | ≍ 6^{j}

together with (75) and (77), we obtain

\sum_{j \geq 0} ϑ^{j} | ε_{j} | \leq \sum_{j \geq 0} ϑ^{j} \frac{| R_{j}^{even} | + | R_{j}^{odd} |}{| I_{j} |} < \infty,

since the additional factor

1 / | I_{j} | ≍ 6^{- j}

makes the series converge absolutely once

σ > 1

and

{[h]}_{tree}

is finite. This is exactly (72).

Thus the block averages

(c_{j})

satisfy the approximate invariance relation (71) with a

ϑ

-summable error. □

Lemma 5.4

(Limiting preimage ratios). Let

{(I_{j})}_{j \geq 0}

be the multiscale blocks

I_{j} = [6^{j}, 2 \cdot 6^{j}) \cap N, | I_{j} | = 6^{j} .

Define

a_{j}

and

b_{j}

as in Lemma 5.3, i.e. as the normalized contributions (depending only on the preimage structure of T) of even and odd preimages from neighboring scales to the block relation

c_{j} = a_{j} c_{j + 1} + b_{j} c_{j - 1} + ε_{j},

for block averages

c_{j}

of any invariant profile h with

P h = h

. Then there exist constants

a, b > 0

such that

lim_{j \to \infty} a_{j} = a, lim_{j \to \infty} b_{j} = b,

and

a + b = 1, 0 < b < a < 1 .

(78)

Moreover, there exist

C > 0

and

0 < δ < 1

(independent of h) such that for all

j \geq 0

,

| a_{j} - a | + | b_{j} - b | \leq C δ^{j} .

Proof.

The coefficients

a_{j}, b_{j}

are determined purely by the geometry of Collatz preimages between the blocks

I_{j - 1}, I_{j}, I_{j + 1}

; they do not depend on h. We make this explicit.

1. Preimage windows and raw counts. For

m \in N

, the Collatz map, (1) has two inverse branches:

n \mapsto 2 n (even branch), n \mapsto \frac{n - 1}{3} when n \equiv 4 (mod 6) (odd branch) .

In the block relation of Lemma 5.3, only preimages that land in the adjacent large scales contribute to the “main” coefficients

a_{j}, b_{j}

; all other preimages (falling into gaps or non-adjacent blocks) are assigned to the perturbation

ε_{j}

.

The even preimages relevant to

I_{j}

form a window

E_{j}^{*}

of size comparable to

| I_{j} |

, consisting of those m whose image

T (m)

lies in

I_{j}

via m even.

he odd preimages relevant to

I_{j}

form a thinner window

O_{j}^{*}

, consisting of those odd m with

T (m) = 3 m + 1 \in I_{j}

(equivalently,

n : = 3 m + 1 \in I_{j}

and

n \equiv 4 (mod 6)

).

A direct count shows:

1. For the even window, each

n \in I_{j}

has an even preimage

2 n

, so

| E_{j}^{*} | = | I_{j} | = 6^{j} .

2. For the odd window, we need

n \in I_{j}

with

n \equiv 4 (mod 6)

and then

m = (n - 1) / 3

odd. Among the

| I_{j} | = 6^{j}

integers in

I_{j}

, exactly one in every six is

4 (mod 6)

, up to boundary effects. Hence

| O_{j}^{*} | = \frac{1}{6} | I_{j} | + O (1) = 6^{j - 1} + O (1),

so in particular

| O_{j}^{*} | > 0

for all sufficiently large j.

Thus the total number of “neighboring-scale” preimages associated with

I_{j}

is

| E_{j}^{*} | + | O_{j}^{*} | = (1 + \frac{1}{6}) | I_{j} | + O (1) = \frac{7}{6} 6^{j} + O (1) .

2. Canonical normalization of $a_{j}, b_{j}$ . By Lemma 5.3, the coefficients

a_{j}, b_{j}

are defined as the normalized weights of even vs. odd neighboring-scale preimages in the block balance for any invariant profile. Since this normalization is independent of h, we may compute

a_{j}, b_{j}

purely from the combinatorics. The natural choice is:

a_{j} : = \frac{| E_{j}^{*} |}{| E_{j}^{*} | + | O_{j}^{*} |}, b_{j} : = \frac{| O_{j}^{*} |}{| E_{j}^{*} | + | O_{j}^{*} |} .

These are exactly the “ratios of the number of even and odd preimages between adjacent scales” announced in Lemma 5.3.

Using the counts above,

\begin{matrix} a_{j} & = \frac{6^{j}}{6^{j} + 6^{j - 1} + O (1)} = \frac{1}{1 + \frac{1}{6} + O (6^{- j})} = \frac{6}{7} + O (6^{- j}), \\ b_{j} & = \frac{6^{j - 1} + O (1)}{6^{j} + 6^{j - 1} + O (1)} = \frac{\frac{1}{6} + O (6^{- j})}{1 + \frac{1}{6} + O (6^{- j})} = \frac{1}{7} + O (6^{- j}) . \end{matrix}

In particular, there exist limits

a = lim_{j \to \infty} a_{j} = \frac{6}{7}, b = lim_{j \to \infty} b_{j} = \frac{1}{7},

and there exists

C > 0

such that, for all j,

| a_{j} - a | + | b_{j} - b | \leq C 6^{- j} .

Thus the desired exponential convergence holds with

δ : = 1 / 6 \in (0, 1)

.

3. Structural properties. From the explicit limits we immediately have

a + b = \frac{6}{7} + \frac{1}{7} = 1, 0 < b < a < 1 .

Alternatively, the identity

a_{j} + b_{j} = 1

holds exactly for each j when tested against the constant profile

h \equiv 1

(for which the block perturbation

ε_{j}

vanishes), and passes to the limit as

j \to \infty

.

Positivity of

a, b

follows from

| E_{j}^{*} |, | O_{j}^{*} | > 0

for large j, and

b < a

reflects the fact that the odd preimage window is asymptotically only a

1 / 6

-fraction of the even window.

This completes the proof. □

Lemma 5.5

(Uniform convergence of the coefficient matrices). Let

M_{j} = (\begin{matrix} 0 & a_{j} \\ b_{j} & 0 \end{matrix}), M = (\begin{matrix} 0 & a \\ b & 0 \end{matrix}),

where

a_{j} \to a

and

b_{j} \to b

satisfy

| a_{j} - a | + | b_{j} - b | \leq C δ^{j}

for some

0 < δ < 1

as in Lemma 5.4. Then for any matrix norm

∥ \cdot ∥

,

∥ M_{j} - M ∥ \leq C^{'} δ^{j} .

In particular,

\sum_{j \geq j_{0}} ϑ^{j} ∥ M_{j} - M ∥ < \infty,

so

M_{j} \to M

exponentially fast in the sense required by the discrete variation-of-constants argument.

Proof.

By definition,

M_{j} - M = (\begin{matrix} 0 & a_{j} - a \\ b_{j} - b & 0 \end{matrix}) .

Let

∥ \cdot ∥

be any matrix norm on

2 \times 2

real matrices. Since all norms on

R^{2 \times 2}

are equivalent and the space is finite-dimensional, there exists a constant

K > 0

(depending only on the choice of norm) such that for any matrix

A = {(a_{m n})}_{m, n = 1}^{2}

,

∥ A ∥ \leq K max_{m, n} | a_{m n} | .

(79)

Applying (79) to

A = M_{j} - M

gives

∥ M_{j} - M ∥ \leq K max \{| a_{j} - a |, | b_{j} - b |\} .

By Lemma 5.4, the preimage ratios satisfy the exponential convergence

| a_{j} - a | + | b_{j} - b | \leq C δ^{j}, 0 < δ < 1 .

In particular,

max {| a_{j} - a |, | b_{j} - b |} \leq | a_{j} - a | + | b_{j} - b | \leq C δ^{j} .

Combining the two inequalities yields

∥ M_{j} - M ∥ \leq K C δ^{j} .

Setting

C^{'} : = K C

gives the claimed bound

∥ M_{j} - M ∥ \leq C^{'} δ^{j} .

Finally, since

0 < ϑ < 1

and

0 < δ < 1

, the product

ϑ δ < 1

, and therefore

\sum_{j \geq j_{0}} ϑ^{j} ∥ M_{j} - M ∥ \leq C^{'} \sum_{j \geq j_{0}} {(ϑ δ)}^{j} < \infty .

Thus

M_{j} \to M

exponentially fast in any matrix norm, establishing the uniform convergence required for the discrete variation-of-constants argument. □

Proposition 5.6

(Effective recursion for peripheral eigenfunctions). Let

0 < α < 1

,

0 < ϑ < 1

,

σ > 1

, and let

h \in B_{tree, σ}

satisfy

P h = λ h

with

| λ | = 1

. Let

H_{j} : = \sum_{n \in I_{j}} h (n)

and

c_{j} : = H_{j} / | I_{j} |

be the block sums and block averages on

I_{j} = [6^{j}, 2 \cdot 6^{j}) \cap N

. Then, with

a, b > 0

as in Lemma 5.4, there exists a sequence

{(ε_{j})}_{j \geq 1}

with

\sum_{j \geq 1} | ε_{j} | ϑ^{j} < \infty

such that

c_{j} = λ^{- 1} a c_{j + 1} + λ^{- 1} b c_{j - 1} + ε_{j}, j \geq 1 .

(80)

Equivalently, for the renormalized averages

d_{j} : = λ^{- j} c_{j}

we have

d_{j} = a d_{j + 1} + b d_{j - 1} + {\tilde{ε}}_{j}, \sum_{j \geq 1} | {\tilde{ε}}_{j} | ϑ^{j} < \infty,

(81)

with

{\tilde{ε}}_{j} : = λ^{- j} ε_{j}

.

Proof.

Step 1: Block summation of the eigenrelation. Summing

P h = λ h

over

n \in I_{j}

gives

\sum_{n \in I_{j}} (P h) (n) = λ \sum_{n \in I_{j}} h (n) = λ H_{j} .

By the definition of

P = P_{even} + P_{odd}

,

\sum_{n \in I_{j}} (P h) (n) = \sum_{n \in I_{j}} \frac{h (2 n)}{2 n} + \sum_{\begin{matrix} n \in I_{j} \\ n \equiv 4 (6) \end{matrix}} \frac{h (\frac{n - 1}{3})}{(n - 1) / 3} = : S_{j}^{even} + S_{j}^{odd} .

As in the proof of Lemma 5.3 (the

λ = 1

case), we reorganize each sum by changing variables along the inverse branches and separating the main contributions that land in adjacent scales (

I_{j + 1}

for the even branch,

I_{j - 1}

for the odd branch) from the boundary remainders (spillovers due to the half-open endpoints and the congruence restriction

n \equiv 4 (mod 6)

). Concretely,

S_{j}^{even} = \sum_{n \in I_{j}} \frac{h (2 n)}{2 n} = \sum_{m \in E_{j}^{*}} \frac{h (m)}{m} + R_{j}^{even}, S_{j}^{odd} = \sum_{\begin{matrix} n \in I_{j} \\ n \equiv 4 (6) \end{matrix}} \frac{h (\frac{n - 1}{3})}{(n - 1) / 3} = \sum_{m \in O_{j}^{*}} \frac{h (m)}{m} + R_{j}^{odd},

where

E_{j}^{*} \subset I_{j + 1}

and

O_{j}^{*} \subset I_{j - 1}

are the preimage windows collecting those m whose images lie in

I_{j}

under the even and odd branches, respectively, and

R_{j}^{even}, R_{j}^{odd}

are the boundary remainders (coming from

(I_{j + 1} ∖ E_{j}^{*})

and

(I_{j - 1} ∖ O_{j}^{*})

).

Thus

λ H_{j} = \sum_{m \in E_{j}^{*}} \frac{h (m)}{m} + \sum_{m \in O_{j}^{*}} \frac{h (m)}{m} + (R_{j}^{even} + R_{j}^{odd}) .

Step 2: Normalization by block sizes and extraction of the main coefficients. Divide by

| I_{j} | = 6^{j}

and write

c_{k} = H_{k} / | I_{k} |

:

λ c_{j} = \frac{1}{| I_{j} |} \sum_{m \in E_{j}^{*}} \frac{h (m)}{m} + \frac{1}{| I_{j} |} \sum_{m \in O_{j}^{*}} \frac{h (m)}{m} + \frac{R_{j}^{even} + R_{j}^{odd}}{| I_{j} |} .

Inside each window the points m satisfy

m ≍ | I_{j + 1} |

(even window) or

m ≍ | I_{j - 1} |

(odd window), so

1 / m

fluctuates by a bounded multiplicative factor around

1 / | I_{j + 1} |

or

1 / | I_{j - 1} |

. Using the

B_{tree, σ}

control of oscillations within blocks, this fluctuation contributes only to an error term summable in the weighted

ϑ

-norm. Hence

\frac{1}{| I_{j} |} \sum_{m \in E_{j}^{*}} \frac{h (m)}{m} = \frac{| E_{j}^{*} |}{| I_{j} |} \cdot \frac{1}{| I_{j + 1} |} \sum_{m \in E_{j}^{*}} h (m) + η_{j}^{even} = a_{j} c_{j + 1} + η_{j}^{even},

and similarly

\frac{1}{| I_{j} |} \sum_{m \in O_{j}^{*}} \frac{h (m)}{m} = b_{j} c_{j - 1} + η_{j}^{odd},

where

a_{j} : = | E_{j}^{*} | / (| E_{j}^{*} | + | O_{j}^{*} |)

,

b_{j} : = | O_{j}^{*} | / (| E_{j}^{*} | + | O_{j}^{*} |)

(so

a_{j} + b_{j} = 1

), and

η_{j}^{even}, η_{j}^{odd}

are error terms whose weighted sum

\sum_{j} ϑ^{j} | η_{j}^{\cdot} |

is finite. The boundary remainders likewise satisfy

\sum_{j \geq 1} ϑ^{j} \frac{| R_{j}^{even} | + | R_{j}^{odd} |}{| I_{j} |} < \infty

by the same block-oscillation and congruence estimates used in Lemma 5.3.

Collecting terms, we obtain

λ c_{j} = a_{j} c_{j + 1} + b_{j} c_{j - 1} + η_{j}, \sum_{j \geq 1} ϑ^{j} | η_{j} | < \infty,

(82)

which is the twisted version of the block relation of Lemma 5.3.

Step 3: Freezing the coefficients to the limits

a, b

. By Lemma 5.15, there exist

a, b > 0

with

a + b = 1

,

0 < b < a < 1

, and constants

C > 0

,

0 < δ < 1

such that

| a_{j} - a | + | b_{j} - b | \leq C δ^{j}

for all j. Rewrite (82) as

λ c_{j} = a c_{j + 1} + b c_{j - 1} + \underset{= : ζ_{j}}{\underset{︸}{η_{j} + (a_{j} - a) c_{j + 1} + (b_{j} - b) c_{j - 1}}} .

To show

\sum_{j} ϑ^{j} | ζ_{j} | < \infty

, it remains to bound the “freezing” errors

(a_{j} - a) c_{j + 1}

and

(b_{j} - b) c_{j - 1}

in the weighted sum. As in the proof of Proposition 5.14,

h \in B_{tree, σ}

implies the block averages obey the growth bound

| c_{k} | \leq C_{0} 6^{(σ - 1) k} {∥ h ∥}_{σ} (k \geq 0),

(83)

for a constant

C_{0}

depending only on

σ

and the block geometry. Hence

ϑ^{j} | (a_{j} - a) c_{j + 1} | \leq ϑ^{j} C δ^{j} C_{0} 6^{(σ - 1) (j + 1)} {∥ h ∥}_{σ} = C^{'} {(ϑ δ 6^{σ - 1})}^{j} {∥ h ∥}_{σ},

and similarly for

(b_{j} - b) c_{j - 1}

(with

j - 1

in place of

j + 1

). Choosing

ϑ \in (0, 1)

(as done when defining

B_{tree, σ}

) small enough so that

ϑ δ 6^{σ - 1} < 1

, these two geometric series converge, uniformly in h up to

{∥ h ∥}_{σ}

. Therefore

\sum_{j \geq 1} ϑ^{j} | ζ_{j} | < \infty .

Set

ε_{j} : = λ^{- 1} ζ_{j}

and divide the identity by

λ

(note

| λ | = 1

), which yields (80) with

\sum_{j} ϑ^{j} | ε_{j} | = \sum_{j} ϑ^{j} | ζ_{j} | < \infty

.

Step 4: Renormalized averages. Define

d_{j} : = λ^{- j} c_{j}

. Multiplying (80) by

λ^{- j}

,

d_{j} = a d_{j + 1} + b d_{j - 1} + {\tilde{ε}}_{j}, {\tilde{ε}}_{j} : = λ^{- j} ε_{j},

and since

| λ | = 1

we have

\sum_{j} ϑ^{j} | {\tilde{ε}}_{j} | = \sum_{j} ϑ^{j} | ε_{j} | < \infty

. This is (81). □

Remark 5.7

(Admissibility for freezing the coefficients). The “freezing” errors

(a_{j} - a) c_{j + 1}

and

(b_{j} - b) c_{j - 1}

are summable in the weighted norm because

| a_{j} - a | + | b_{j} - b | \leq C δ^{j}

for some

0 < δ < 1

by Lemma 5.4. Hence

\sum_{j \geq 0} ϑ^{j} (| a_{j} - a | + | b_{j} - b |) < \infty whenever ϑ < δ^{- 1} .

Since

δ \in (0, 1)

depends only on the block geometry and the parameters

(α, ϑ, σ)

, one may always choose

ϑ \in (0, 1)

sufficiently small so that the weighted summability condition holds. In particular, the choice

ϑ = \frac{1}{5}

used in the Lasota–Yorke framework is admissible for every

σ > 1

.

Remark 5.8

(Exact normalization of the block coefficients). In Lemma 5.3, the coefficients

a_{j}

and

b_{j}

arise from the relative sizes of the even and odd preimage windows:

a_{j} : = \frac{| E_{j}^{*} |}{| E_{j}^{*} | + | O_{j}^{*} |}, b_{j} : = \frac{| O_{j}^{*} |}{| E_{j}^{*} | + | O_{j}^{*} |},

so that

a_{j} + b_{j} = 1

for all sufficiently large j. Lemma 5.4 establishes the existence of limits

a_{j} \to a

and

b_{j} \to b

with

a + b = 1, 0 < b < a < 1, | a_{j} - a | + | b_{j} - b | \leq C δ^{j}

for some constants

C > 0

and

0 < δ < 1

depending only on the block geometry and the space parameters.

Remark 5.9

(Coefficient freezing). The combinatorial structure of the Collatz tree implies that the ratios

a_{j} : = \frac{| I_{j + 1} |}{2 | I_{j} |}, b_{j} : = \frac{| I_{j - 1} |}{| I_{j} |}

stabilize as

j \to \infty

. More precisely, Lemma 5.4 shows that

a_{j} ⟶ a, b_{j} ⟶ b, a + b = 1, 0 < b < a < 1,

and that the convergence is geometric:

| a_{j} - a | + | b_{j} - b | \leq C δ^{j}

for some

C > 0

and

0 < δ < 1

. These limits encode the asymptotic proportions of mass transferred from

I_{j}

to

I_{j + 1}

and

I_{j - 1}

by the even and admissible odd preimages of the Collatz map.

Remark 5.10

(Asymptotic limits of the block coefficients). Let

a_{j}

and

b_{j}

be the block coefficients

a_{j} : = \frac{| I_{j + 1} |}{2 | I_{j} |}, b_{j} : = \frac{| I_{j - 1} |}{| I_{j} |},

arising in the decomposition of block averages under

P h = h

. Then the Collatz preimage structure and the block geometry imply:

$a_{j}, b_{j} \geq 0$ , and for all sufficiently large j one has

$a_{j} + b_{j} = 1;$
The coefficients converge to limits

$a_{j} ⟶ a, b_{j} ⟶ b, (j \to \infty),$

where $a, b > 0$ satisfy

$a + b = 1, 0 < b < a < 1;$
The convergence is quantitative: there exist constants $C > 0$ and $ϑ \in (0, 1)$ such that

$| a_{j} - a | + | b_{j} - b | \leq C ϑ^{j}, j \geq 0 .$

These limits encode the asymptotic proportion, at large scales, of mass transported from

I_{j}

to the neighboring blocks

I_{j + 1}

and

I_{j - 1}

via even and admissible odd preimages. Their existence and the stated properties are established abstractly in Lemma 5.4.

Lemma 5.11

(Effective block recursion). Let

h \in B_{tree, σ}

be the positive invariant density satisfying

P h = h

. For each scale block

I_{j}

define

c_{j} : = \frac{1}{| I_{j} |} \sum_{n \in I_{j}} h (n), j \geq 0 .

Then there exist sequences

{(a_{j})}_{j \geq j_{0}}

,

{(b_{j})}_{j \geq j_{0}}

and an error sequence

{(ε_{j})}_{j \geq j_{0}}

such that:

1.: $a_{j}, b_{j} \geq 0$ and $a_{j} + b_{j} = 1$ for all $j \geq j_{0}$ ;
2.: $a_{j} \to a$ and $b_{j} \to b$ as $j \to \infty$ , where $a, b > 0$ satisfy

$a + b = 1, 0 < b < a < 1;$
3.: the block averages satisfy the second-order recursion

$c_{j} = a_{j} c_{j + 1} + b_{j} c_{j - 1} + ε_{j}, j \geq j_{0};$
4.: the perturbations satisfy the weighted summability bound

$\sum_{j \geq j_{0}} ϑ^{j} | ε_{j} | < \infty .$

Moreover, the limits

a, b

and the summability rate depend only on

(α, ϑ, σ)

and the tree geometry.

Proof.

Throughout the proof we write

I_{j}

for the scale block at level j and

| I_{j} |

for its cardinality. Recall that h is invariant, so for every

n \geq 1

,

h (n) = \frac{1}{2} h (2 n) + 1_{{n \equiv 4 (\mod 6)}} h (\frac{n - 1}{3}) .

(84)

Averaging (84) over

n \in I_{j}

yields

c_{j} = E_{j} + O_{j},

where

E_{j} : = \frac{1}{| I_{j} |} \sum_{n \in I_{j}} \frac{1}{2} h (2 n), O_{j} : = \frac{1}{| I_{j} |} \sum_{\begin{matrix} n \in I_{j} \\ n \equiv 4 (mod 6) \end{matrix}} h (\frac{n - 1}{3}) .

Define

ϵ_{j} : = δ_{j}^{even} + δ_{j}^{odd} .

(85)

Step 1: Even contribution. Consider the image set

J_{j}^{even} : = {2 n : n \in I_{j}} .

By construction of the blocks

I_{j}

and the fact that their endpoints grow geometrically,

J_{j}^{even}

lies in a bounded union of blocks at scales j and

j + 1

, with a single “main” block at scale

j + 1

and boundary pieces of uniformly bounded size. Thus one may decompose

I_{j}

into disjoint sets

A_{j}

and

B_{j}

such that

{2 n : n \in A_{j}} = I_{j + 1}, {2 n : n \in B_{j}} \subseteq I_{j}^{bdry} \cup I_{j + 2}^{bdry},

and

| I_{k}^{bdry} | = O (6^{j - 1})

uniformly in k.

Decompose

E_{j} = E_{j}^{(1)} + E_{j}^{(2)} .

On

A_{j}

, change variables

m = 2 n

to obtain

E_{j}^{(1)} = \frac{1}{2 | I_{j} |} \sum_{m \in I_{j + 1}} h (m) = \frac{| I_{j + 1} |}{2 | I_{j} |} c_{j + 1} .

For

E_{j}^{(2)}

, the boundary structure and the definition of the

B_{tree, σ}

norm imply that the contribution is controlled by a fixed constant times the block averages at the neighboring levels:

| E_{j}^{(2)} | \leq C 6^{- 1} (c_{j} + c_{j + 2}),

which decays at least like

C 6^{- j}

. Define

a_{j} : = \frac{| I_{j + 1} |}{2 | I_{j} |}, δ_{j}^{even} : = E_{j}^{(2)} .

Then

E_{j} = a_{j} c_{j + 1} + δ_{j}^{even}, \sum_{j} ϑ^{j} | δ_{j}^{even} | < \infty .

Step 2: Odd contribution. If

n \equiv 4 (mod 6)

and

n \in I_{j}

, the odd preimage

(n - 1) / 3

lies in a bounded union of blocks centered at

I_{j - 1}

with boundary fragments of size

O (6^{j - 1})

. Thus there is a subset

A_{j}^{'} \subseteq I_{j}

of admissible indices with

\{\frac{n - 1}{3} : n \in A_{j}^{'}\} = I_{j - 1},

while the remaining admissible indices form

B_{j}^{'}

and map into boundary pieces.

Decomposing

O_{j} = O_{j}^{(1)} + O_{j}^{(2)},

a change of variables gives

O_{j}^{(1)} = \frac{1}{| I_{j} |} \sum_{m \in I_{j - 1}} h (m) = \frac{| I_{j - 1} |}{| I_{j} |} c_{j - 1} .

Set

b_{j} : = \frac{| I_{j - 1} |}{| I_{j} |} .

As above,

O_{j}^{(2)}

is controlled by boundary contributions and satisfies

| O_{j}^{(2)} | \leq C^{'} 6^{- 1} (c_{j - 1} + c_{j + 1}),

so that

δ_{j}^{odd} : = O_{j}^{(2)} satisfies \sum_{j} ϑ^{j} | δ_{j}^{odd} | < \infty .

Thus

O_{j} = b_{j} c_{j - 1} + δ_{j}^{odd} .

Step 3: The block recursion. Combining

c_{j} = E_{j} + O_{j}

gives

c_{j} = a_{j} c_{j + 1} + b_{j} c_{j - 1} + ε_{j}, ε_{j} : = δ_{j}^{even} + δ_{j}^{odd} .

Since the main-part contributions exhaust the mass transferred between scales, one may choose

j_{0}

sufficiently large so that

a_{j} + b_{j} = 1 for all j \geq j_{0},

with

(a_{j})

and

(b_{j})

both nonnegative. The geometric regularity of the blocks implies that

a_{j} \to a, b_{j} \to b, a + b = 1, 0 < b < a < 1,

as established abstractly in Lemma 5.4. Finally, the bounds above show that

| ε_{j} | \leq C_{*} 6^{- j}

for some

C_{*} > 0

, hence

\sum_{j \geq j_{0}} ϑ^{j} | ε_{j} | < \infty

.

This proves the claimed block recursion and completes the proof. □

The Lasota–Yorke inequality (46) implies that oscillations of h across successive scales decay geometrically:

{[f]}_{tree} \leq \frac{C_{LY}}{1 - λ_{LY}} {∥ f ∥}_{1},

so that any invariant h must be essentially flat in the strong seminorm. Translating this statement into block averages gives

| c_{j + 1} - c_{j} | \leq C ϑ^{j}, j \geq 0,

(86)

for some

C > 0

. The decay of successive differences enforces a near-constant profile

c_{j} \to c_{\infty}

, and any residual deviation must satisfy the perturbed recursion (71).

We interpret (71) as a discrete second-order recurrence in the block averages

(c_{j})

, with coefficients

(a_{j}, b_{j})

determined purely by the combinatorics of the Collatz preimages. In the limit

a_{j} \to a

,

b_{j} \to b

described in Lemma 5.4, the homogeneous part

c_{j} = a c_{j + 1} + b c_{j - 1}

(87)

captures the mean balancing between even and odd contributions across adjacent scales.

Introducing the vector

v_{j} : = {(c_{j}, c_{j - 1})}^{⊤}

, the recursion can be written in matrix form

v_{j + 1} = M v_{j}, M = (\begin{matrix} 0 & a \\ b & 0 \end{matrix}) .

The eigenvalues of M are

\pm \sqrt{a b}

, so the spectral radius is

ρ (M) = \sqrt{a b}

. Since

a + b = 1

and

0 < b < a < 1

, we have

a b < \frac{1}{4}

and hence

ρ (M) < \frac{1}{2} < 1

. Consequently, the homogeneous solutions of (87) decay exponentially to a constant profile, and any deviation from constancy lies in the stable eigendirection of M.

Remark 5.12

(Spectral radius of the frozen block matrix). Let

M = (\begin{matrix} 0 & a \\ b & 0 \end{matrix}),

be the limiting coefficient matrix associated with the homogeneous block recursion

c_{j} = a c_{j + 1} + b c_{j - 1},

where

a, b > 0

and

a + b = 1

are the limiting values established in Lemma 5.4. The eigenvalues of M are

λ_{\pm} = \pm \sqrt{a b},

so the spectral radius is

ρ (M) = \sqrt{a b} < 1 .

Consequently, the homogeneous recursion is exponentially stable: every solution that grows at most subexponentially in j converges to a constant profile, and any deviation decays at rate

O (ρ {(M)}^{j})

. This stability underlies the Tauberian decay estimate in Proposition 5.13.

Proposition 5.13

(Decay profile of the invariant density). Let

h \in B_{tree, σ}

be the strictly positive invariant density satisfying

P h = h, ϕ (h) = 1,

(88)

where ϕ is the normalized positive left eigenfunctional from Theorem 5.1. For each scale block

I_{j} = [6^{j}, 2 \cdot 6^{j})

define

c_{j} : = \frac{1}{| I_{j} |} \sum_{n \in I_{j}} h (n) .

Assume the effective block recursion of Lemma 5.11 holds:

c_{j} = a_{j} c_{j + 1} + b_{j} c_{j - 1} + ε_{j}, j \geq j_{0},

(89)

with coefficients

a_{j}, b_{j} \geq 0

,

a_{j} + b_{j} = 1

, satisfying

a_{j} ⟶ a, b_{j} ⟶ b, a + b = 1, 0 < b < a < 1,

(90)

and geometric convergence

\sum_{j \geq j_{0}} ϑ^{j} (| a_{j} - a | + | b_{j} - b |) < \infty .

Assume also that the perturbations satisfy

\sum_{j \geq j_{0}} ϑ^{j} | ε_{j} | < \infty,

and that

(α, ϑ)

obey

ϑ 6^{α} < 1 .

(91)

Then there exists a constant

c > 0

such that

h (n) = \frac{c}{n} + o (\frac{1}{n}) (n \to \infty),

(92)

and the error term is uniform along rays of the Collatz tree.

Proof.

We first analyze the block averages

(c_{j})

and then pass from blocks to pointwise values of h.

Step 1: Renormalized block recursion and convergence of

w_{j}

. Introduce the renormalized sequence

w_{j} : = 6^{j} c_{j}, j \geq 0 .

Multiplying (89) by

6^{j}

and using

a_{j} + b_{j} = 1

yields

w_{j} = \frac{a_{j}}{6} w_{j + 1} + 6 b_{j} w_{j - 1} + 6^{j} ε_{j}, j \geq j_{0} .

(93)

For the frozen–coefficient system, set

M = (\begin{matrix} 0 & a \\ b & 0 \end{matrix}), v_{j} : = (\begin{matrix} c_{j} \\ c_{j - 1} \end{matrix}),

so the homogeneous recursion

c_{j} = a c_{j + 1} + b c_{j - 1}

becomes

v_{j + 1} = M v_{j}

. Since

a, b > 0

and

a + b = 1

by Lemma 5.4, the eigenvalues of M are

λ_{\pm} = \pm \sqrt{a b},

so the spectral radius satisfies

ρ (M) = \sqrt{a b} < 1 .

Hence there is a norm

{∥ \cdot ∥}_{*}

on

R^{2}

and a constant

η \in (0, 1)

such that

{∥ M ∥}_{*} \leq η

.

The full recursion can be written as

v_{j + 1} = M_{j} v_{j} + F_{j},

where

M_{j} \to M

and the perturbations satisfy

\sum_{j \geq j_{0}} ϑ^{j} (∥ M_{j} {- M ∥}_{*} + {∥ F_{j} ∥}_{*}) < \infty,

using (90)–(72). A discrete variation–of–constants argument gives

v_{j} = v_{\infty} + r_{j}, {∥ r_{j} ∥}_{*} \leq C ϑ^{j},

for some

v_{\infty} = {(c_{\infty}, c_{\infty})}^{T}

with

c_{\infty} > 0

. Hence

c_{j} = c_{\infty} + O (ϑ^{j}), w_{j} = 6^{j} c_{\infty} + O (ϑ^{j} 6^{j}) .

Step 2: Oscillation control inside blocks. The Lasota–Yorke inequality yields

{osc}_{I_{j}} h \leq C_{1} ϑ^{j} 6^{- (1 - α) j},

so for every

n \in I_{j}

,

| h (n) - c_{j} | \leq C_{1} ϑ^{j} 6^{- (1 - α) j} .

Since

n ≍ 6^{j}

for

n \in I_{j}

, we have

6^{- j} ≍ 1 / n

, and because

ϑ 6^{α} < 1

,

\frac{ϑ^{j} 6^{- (1 - α) j}}{6^{- j}} = {(ϑ 6^{α})}^{j} \to 0 .

Thus the oscillation error is

o (1 / n)

.

Step 3: Pointwise asymptotics. Combining

c_{j} = c_{\infty} + O (ϑ^{j})

with

| h (n) - c_{j} | \leq o (1 / n)

and

6^{j} ≍ n

, we obtain

h (n) = \frac{c_{\infty}}{6^{j}} + o (6^{- j}) = \frac{c}{n} + o (\frac{1}{n}),

with

c = c_{\infty} κ > 0

for the constant

κ

relating

6^{j}

and n. The error is uniform along rays of the Collatz tree.

This proves the claim. □

The explicit Lasota–Yorke constants obtained in Section 4.4 guarantee that the same contraction rate governs the full operator P on

B_{tree, σ}

, ensuring that invariant densities are asymptotically flat in the strong seminorm—block averages converge while the global profile follows the two-sided recursion. In particular, the invariant density h decays like

c / n

along the Collatz tree.

5.2. Effective Block Recursion and Spectral Estimate

We now make the block-recursion framework explicit and quantify the coefficients and perturbations that encode how the invariance equation

P h = h

propagates between adjacent scales.

Proposition 5.14

(Effective perturbed recursion). Let

0 < α < 1

,

0 < ϑ < 1

,

σ > 1

, and

h \in B_{tree, σ}

satisfy

P h = h

. Let

c_{j}

be the block averages

c_{j} : = \frac{1}{| I_{j} |} \sum_{n \in I_{j}} h (n), j \geq 0 .

Then there exist constants

a, b > 0

, depending only on the (combinatorial) limiting ratios of even and odd preimages between scales (cf. Lemma 5.4), and a sequence

{(ε_{j})}_{j \geq 0}

such that

c_{j} = a c_{j + 1} + b c_{j - 1} + ε_{j}, j \geq 1,

(94)

with

{∥ ε ∥}_{ϑ} : = \sum_{j \geq 0} | ε_{j} | ϑ^{j} < \infty .

(95)

The constants

a, b

and the bound on

{∥ ε ∥}_{ϑ}

are independent of h.

Proof.

By Lemma 5.3, for

h \in B_{tree, σ}

with

P h = h

there exist sequences

{(a_{j})}_{j \geq 0}

,

{(b_{j})}_{j \geq 0}

with

a_{j}, b_{j} \geq 0

and a sequence

{(η_{j})}_{j \geq 0}

such that

c_{j} = a_{j} c_{j + 1} + b_{j} c_{j - 1} + η_{j}, j \geq 1,

(96)

and

\sum_{j \geq 0} ϑ^{j} | η_{j} | < \infty .

(97)

The coefficients

a_{j}, b_{j}

are defined in terms of normalized even and odd preimage weights from

I_{j + 1}

and

I_{j - 1}

into

I_{j}

.

1. Limits $a, b$ from preimage asymptotics. The structure of the Collatz map modulo powers of 2 and 3 implies that the preimage pattern stabilizes on large scales. More precisely, there exist constants

a, b > 0

and

C > 0

,

0 < δ < 1

(depending only on the map and the choice of blocks

I_{j}

) such that

| a_{j} - a | + | b_{j} - b | \leq C δ^{j} for all j \geq 0 .

(98)

This is obtained by an explicit counting of even preimages

2 n

and odd preimages

(n - 1) / 3

landing in

I_{j}

, normalized by

| I_{j} |

, and observing that the resulting ratios converge exponentially fast to the limiting densities (see the detailed preimage counting in the arithmetic section where

a, b

are defined). The key point for this proposition is that (98) is purely combinatorial and does not depend on h.

2. Growth control for block averages $c_{j}$ . We claim that

(c_{j})

has at most controlled exponential growth governed by

{∥ h ∥}_{σ}

.

For

n \in I_{j}

we have

n ≍ 6^{j}

, so

n^{σ} \leq {(2 \cdot 6^{j})}^{σ}

. Then

| c_{j} | = \frac{1}{| I_{j} |} \sum_{n \in I_{j}} | h (n) | \leq \frac{1}{| I_{j} |} \sum_{n \in I_{j}} n^{σ} \frac{| h (n) |}{n^{σ}} \leq \frac{{(2 \cdot 6^{j})}^{σ}}{| I_{j} |} \sum_{n \in I_{j}} \frac{| h (n) |}{n^{σ}} .

Since

| I_{j} | ≍ 6^{j}

and

\sum_{n \in I_{j}} \frac{| h (n) |}{n^{σ}} \leq {∥ h ∥}_{σ}

, we obtain

| c_{j} | \leq C_{0} 6^{(σ - 1) j} {∥ h ∥}_{σ} for all j \geq 0,

(99)

for some constant

C_{0}

depending only on

σ

and the block geometry. Thus

c_{j}

is at most exponentially growing, with a rate depending only on

σ

(and this bound is uniform in h up to the factor

{∥ h ∥}_{σ}

).

3. Passing from $(a_{j}, b_{j})$ to constants $(a, b)$ . Rewrite (96) as

c_{j} = a c_{j + 1} + b c_{j - 1} + ε_{j},

where we define

ε_{j} : = η_{j} + (a_{j} - a) c_{j + 1} + (b_{j} - b) c_{j - 1} .

(100)

The relation (94) is just this identity.

It remains to prove the weighted summability

\sum_{j \geq 0} ϑ^{j} | ε_{j} | < \infty

.

By (97), the contribution of

η_{j}

is already summable. For the remaining terms, use (98) and (83):

| (a_{j} - a) c_{j + 1} | \leq C δ^{j} | c_{j + 1} | \leq C δ^{j} C_{0} 6^{(σ - 1) (j + 1)} {∥ h ∥}_{σ},

and similarly

| (b_{j} - b) c_{j - 1} | \leq C δ^{j} C_{0} 6^{(σ - 1) (j - 1)} {∥ h ∥}_{σ}

for

j \geq 1

. Therefore

\begin{matrix} \sum_{j \geq 0} ϑ^{j} | (a_{j} - a) c_{j + 1} | & \leq C_{1} {∥ h ∥}_{σ} \sum_{j \geq 0} {(ϑ δ 6^{σ - 1})}^{j}, \\ \sum_{j \geq 1} ϑ^{j} | (b_{j} - b) c_{j - 1} | & \leq C_{2} {∥ h ∥}_{σ} \sum_{j \geq 1} {(ϑ δ 6^{σ - 1})}^{j - 1}, \end{matrix}

for suitable constants

C_{1}, C_{2}

depending only on

C, C_{0}

.

Since

δ < 1

is fixed by the combinatorics and

ϑ \in (0, 1)

is under our control, we may (and do) assume that

ϑ

has been chosen small enough so that

ϑ δ 6^{σ - 1} < 1 .

(101)

(Any choice of

(α, ϑ, σ)

used later must satisfy this together with the constraints from the Lasota–Yorke estimates; this is compatible with the parameter regime considered.)

Under condition (101), both geometric series above converge, and we conclude that

\sum_{j \geq 0} ϑ^{j} (| (a_{j} - a) c_{j + 1} | + | (b_{j} - b) c_{j - 1} |) < \infty .

Combining with (97) and the definition (85), we obtain

\sum_{j \geq 0} ϑ^{j} | ε_{j} | < \infty,

i.e. (95) holds. This completes the proof. □

The associated homogeneous matrix recursion

M = (\begin{matrix} 0 & a \\ b & 0 \end{matrix})

has eigenvalues

\pm \sqrt{a b}

. Under the parameter choice

(α, ϑ) = (\frac{1}{2}, \frac{1}{5})

, the odd-branch contraction constant computed in Section 4.4 implies

\sqrt{a b} < 1

, hence

ρ (M) < 1

. The inequality

ρ (M) < 1

means tht deviations of successive block averages from constancy decay geometrically along the scale index j. This discrete contraction is the block-level reflection of the Lasota–Yorke inequality on

B_{tree, σ}

, confirming that the invariant density must be asymptotically flat across scales.

Lemma 5.15

(Verification of the block coefficients). Let

I_{j} = [6^{j}, 2 \cdot 6^{j}) \cap N

and define the even and odd preimage windows

E_{j}^{*} = {2 m : m \in I_{j}}, O_{j}^{*} = {(m - 1) / 3 : m \in I_{j}, m \equiv 4 (mod 6)} .

Then the normalized preimage counts

a_{j}^{'} : = \frac{| E_{j}^{*} |}{| I_{j} |}, b_{j}^{'} : = \frac{| O_{j}^{*} |}{| I_{j} |}

satisfy

a_{j}^{'} \to 1, b_{j}^{'} \to \frac{1}{6} .

These ratios describe the *combinatorial preimage densities*. However, the block–recursion coefficients

c_{j} = a_{j} c_{j + 1} + b_{j} c_{j - 1} + ε_{j}

are normalized mass–redistribution weights and therefore satisfy

a_{j} + b_{j} = 1, 0 < b_{j} < a_{j} < 1,

with limiting values

a, b

determined by the *relative contribution* of even and odd branches to block averages, not by the raw cardinalities

a_{j}^{'}, b_{j}^{'}

above.

Proof.

Each block

I_{j} = [6^{j}, 2 \cdot 6^{j})

contains exactly

6^{j}

integers, so

| I_{j} | = 6^{j} .

Even preimages. For every

m \in I_{j}

the even preimage

2 m

is well defined and distinct from

2 m^{'}

whenever

m \neq m^{'}

. Hence

E_{j}^{*} = {2 m : m \in I_{j}}

has cardinality

| E_{j}^{*} | = | I_{j} | = 6^{j} .

Thus the raw even-preimage density is

a_{j}^{'} : = \frac{| E_{j}^{*} |}{| I_{j} |} = 1 for all j,

and therefore

{lim}_{j \to \infty} a_{j}^{'} = 1

.

Odd preimages. Odd preimages arise precisely from integers

m \in I_{j}

satisfying

m \equiv 4 (mod 6)

, and the map

m \mapsto (m - 1) / 3

is injective on this set. Among the

6^{j}

integers in

I_{j}

, exactly one out of every six lies in the class

4 (mod 6)

, up to

O (1)

boundary terms. Hence

| O_{j}^{*} | = \frac{1}{6} 6^{j} + O (1),

and therefore

b_{j}^{'} : = \frac{| O_{j}^{*} |}{| I_{j} |} = \frac{1}{6} + O (6^{- j}) .

Thus

{lim}_{j \to \infty} b_{j}^{'} = 1 / 6

, with geometric convergence.

Conclusion. The raw preimage densities

a_{j}^{'} = \frac{| E_{j}^{*} |}{| I_{j} |}, b_{j}^{'} = \frac{| O_{j}^{*} |}{| I_{j} |},

converge to the limits

a^{'} : = lim_{j \to \infty} a_{j}^{'} = 1, b^{'} : = lim_{j \to \infty} b_{j}^{'} = \frac{1}{6} .

These limits describe the combinatorial distribution of even and odd preimages over the block

I_{j}

. The quantity

a^{'} b^{'} = 1 / 6

is strictly less than 1, providing the basic numerical contraction needed for perturbative analysis. □

Remark 5.16

(Relation to the normalized block coefficients). The ratios computed above,

a^{'} = lim_{j \to \infty} \frac{| E_{j}^{*} |}{| I_{j} |} = 1, b^{'} = lim_{j \to \infty} \frac{| O_{j}^{*} |}{| I_{j} |} = \frac{1}{6},

are purely combinatorial preimage densities. They do not coincide with the coefficients

a, b

in the block recursion

c_{j} = a c_{j + 1} + b c_{j - 1} + ε_{j},

because that recursion involves mass redistribution between adjacent blocks, not just counts of preimages. The normalized coefficients of Lemma 5.4 satisfy

a + b = 1, 0 < b < a < 1,

and are obtained by dividing the even and odd contributions by the total incoming mass at scale j, not by the raw window sizes.

Thus the values

a^{'} = 1

,

b^{'} = 1 / 6

here and the normalized values

a = \frac{6}{7}

,

b = \frac{1}{7}

(from the block recursion) describe different quantities. Both sets of coefficients nevertheless yield strict contraction, since in both cases the product of the limiting coefficients is

< 1

, which is the condition required for the spectral-gap argument.

5.3. Odd-Branch Distortion at $α = \frac{1}{2}$ and a Certified $λ_{odd} < 1$

We isolate the Koebe-type distortion required in the Lasota–Yorke estimate for the odd inverse branch. Throughout this subsection

0 < ϑ < 1

and

I_{j} = [6^{j}, 2 \cdot 6^{j}) \cap N

.

Lemma 5.17

(Odd-branch distortion bound at

α = \frac{1}{2}

). Let

W_{α} (u, v) = \frac{u v}{{| u - v | (u + v)}^{α}}

. For

α = \frac{1}{2}

and any

u, v \in I_{j}

with

j \geq 1

,

u \neq v

, set

u^{'} = (u - 1) / 3

,

v^{'} = (v - 1) / 3

. Then

\frac{W_{1 / 2} (u, v)}{u^{'}} \leq C_{1 / 2} \frac{W_{1 / 2} (u^{'}, v^{'})}{\sqrt{6}}, C_{1 / 2} \leq \frac{3}{2} .

(102)

Consequently, the odd-branch contribution in the Lasota–Yorke inequality on

B_{tree}

satisfies

λ_{odd} (\frac{1}{2}, ϑ) \leq \frac{C_{1 / 2}}{\sqrt{6}} ϑ \leq \frac{3}{2 \sqrt{6}} ϑ .

(103)

In particular, for

ϑ = \frac{1}{5}

one has

λ_{odd} (1 / 2, 1 / 5) < 1

.

Proof.

Let

α = \frac{1}{2}

. For

u, v \in I_{j}

with

j \geq 1

, write

u^{'} = \frac{u - 1}{3}, v^{'} = \frac{v - 1}{3} .

A direct computation gives

\begin{matrix} W_{1 / 2} (u^{'}, v^{'}) & = \frac{u^{'} v^{'}}{| u^{'} - v^{'} | {(u^{'} + v^{'})}^{1 / 2}} = \frac{\frac{(u - 1) (v - 1)}{9}}{\frac{| u - v |}{3} {(\frac{u + v - 2}{3})}^{1 / 2}} = \frac{(u - 1) (v - 1) 3^{- 1 / 2}}{{| u - v | (u + v - 2)}^{1 / 2}} . \end{matrix}

Hence

\begin{matrix} \frac{W_{1 / 2} (u, v)}{u^{'}} & = \frac{u v}{{| u - v | (u + v)}^{1 / 2}} \cdot \frac{3}{u - 1} \\ = (\frac{3^{3 / 2} u v}{{(u - 1)}^{2} (v - 1)}) \cdot \frac{{(u + v - 2)}^{1 / 2}}{| u - v |} \cdot \frac{| u - v |}{3^{1 / 2} {(u + v)}^{1 / 2}} \\ = 3^{3 / 2} \frac{u v}{{(u - 1)}^{2} (v - 1)} {(\frac{u + v - 2}{u + v})}^{1 / 2} \frac{(u - 1) (v - 1) 3^{- 1 / 2}}{{| u - v | (u + v - 2)}^{1 / 2}} (u - 1) \\ = 3 \underset{= : G (u, v)}{\underset{︸}{[\frac{u}{u - 1} \cdot \frac{v}{v - 1} \cdot \frac{1}{u - 1}]}} \underset{= W_{1 / 2} (u^{'}, v^{'})}{\underset{︸}{\frac{(u - 1) (v - 1) 3^{- 1 / 2}}{{| u - v | (u + v - 2)}^{1 / 2}}}} . \end{matrix}

Therefore

\frac{W_{1 / 2} (u, v)}{u^{'}} = 3 G (u, v) W_{1 / 2} (u^{'}, v^{'}) .

Since

u, v \in I_{j}

with

j \geq 1

we have

u, v \geq 6

. Thus

\frac{u}{u - 1}, \frac{v}{v - 1} \leq \frac{6}{5}, \frac{1}{u - 1} \leq \frac{1}{5},

Consequently

G (u, v) = \frac{u}{u - 1} \cdot \frac{v}{v - 1} \cdot \frac{1}{u - 1} \leq \frac{6}{5} \cdot \frac{6}{5} \cdot \frac{1}{5} = \frac{36}{125} .

It follows that

\frac{W_{1 / 2} (u, v)}{u^{'}} \leq 3 \cdot \frac{36}{125} W_{1 / 2} (u^{'}, v^{'}) = \frac{108}{125} W_{1 / 2} (u^{'}, v^{'}) < \frac{3}{2} \frac{W_{1 / 2} (u^{'}, v^{'})}{\sqrt{6}},

because

\sqrt{6} \approx 2.449

and

\frac{108}{125} \approx 0.864 > \frac{3}{2} \cdot \frac{1}{\sqrt{6}} \approx 0.612

, we may replace the sharp constant

108 / 125

by the slightly larger but cleaner bound

C_{1 / 2} = \frac{3}{2}

, yielding (102).

The bound (102) is precisely the distortion factor needed when estimating

ϑ^{j} W_{1 / 2} (u, v) |Δ (P_{odd} f; u, v)|

by the scale-

j - 1

oscillation of f (since

u^{'}, v^{'} \in I_{j - 1}

) together with the indicator restriction

u \equiv v \equiv 4 (\mod 6)

, whose combinatorial thinning yields the standard

\sqrt{6}

denominator in the block-to-block comparison. This gives (103). For

ϑ = \frac{1}{5}

we obtain

λ_{odd} (1 / 2, 1 / 5) \leq \frac{3}{2 \sqrt{6}} \cdot \frac{1}{5} < 1

, as claimed. □

The factor

\frac{1}{\sqrt{6}}

in (103) corresponds to the thinning of the residue class

n \equiv 4 (mod 6)

within each block

I_{j}

, while

C_{1 / 2}

quantifies the residual distortion caused by the affine map

n \mapsto (n - 1) / 3

. Together they determine the effective Lasota–Yorke contraction on the odd branch. In particular, the verified bound

λ_{odd} (1 / 2, 1 / 5) < 1

implies a strict spectral gap for P on

B_{tree, σ}

and establishes quasi-compactness with

ρ_{ess} (P) \leq λ_{odd} (1 / 2, 1 / 5)

.

5.4. Effective Block Recursion: Explicit Coefficients and Summable Error

We now derive the two-sided block recursion for invariant densities h, identify explicit coefficients

a, b

from preimage densities, and prove that the perturbation

ϵ

is

ϑ

-summable.

Lemma 5.18

(Mid-band to adjacent-scale averaging). Let

I_{j} = [6^{j}, 2 \cdot 6^{j})

and let

U_{j}^{even} : = 2 I_{j} = [2 \cdot 6^{j}, 4 \cdot 6^{j}), U_{j - 1}^{odd} : = J_{j - 1} \subset [2 \cdot 6^{j - 1}, 4 \cdot 6^{j - 1})

be the bands generated by the even and admissible odd inverse branches, respectively. Then there exists a constant

C > 0

, independent of j and h, such that

|\frac{1}{| U_{j}^{even} |} \sum_{m \in U_{j}^{even}} h (m) - c_{j + 1}| \leq C ϑ^{j} {[h]}_{tree},

and

|\frac{1}{| U_{j - 1}^{odd} |} \sum_{m \in U_{j - 1}^{odd}} h (m) - c_{j - 1}| \leq C ϑ^{j - 1} {[h]}_{tree} .

Proof.

Write the block averages as

c_{j} : = \frac{1}{| I_{j} |} \sum_{n \in I_{j}} h (n), I_{j} = [6^{j}, 2 \cdot 6^{j}) \cap N .

For any finite subset

U \subset N

define the average

A (U) : = \frac{1}{| U |} \sum_{m \in U} h (m) .

By the definition of the tree seminorm

{[h]}_{tree}

and the block structure, there exists a constant

C_{0} > 0

(depending only on the parameters

α, ϑ, σ

and the tree geometry) such that for every

k \geq 0

one has the oscillation bound

{osc}_{I_{k}} h : = sup_{u, v \in I_{k}} | h (u) - h (v) | \leq C_{0} ϑ^{k} {[h]}_{tree} .

(104)

This follows from the definition of

B_{tree, σ}

and the Lasota–Yorke estimate, and we take it as established.

We first treat the even band. By construction of the mid-band

U_{j}^{even}

from the even inverse branch,

U_{j}^{even}

is contained in

I_{j + 1}

up to a bounded amount of overlap with neighboring blocks at the same scale. In particular, there is a constant

L \in N

, independent of j, such that

U_{j}^{even} \subset ⋃_{| k - (j + 1) | \leq L} I_{k},

and

| U_{j}^{even} | ≍ | I_{j + 1} |

with implicit constants independent of j. Then

|A (U_{j}^{even}) - c_{j + 1}| = |\frac{1}{| U_{j}^{even} |} \sum_{m \in U_{j}^{even}} (h (m) - c_{j + 1})| \leq sup_{m \in U_{j}^{even}} | h (m) - c_{j + 1} | .

If

m \in U_{j}^{even} \cap I_{j + 1}

, then

| h (m) - c_{j + 1} | \leq {osc}_{I_{j + 1}} h .

If m lies in one of the finitely many neighboring blocks

I_{k}

with

| k - (j + 1) | \leq L

, then

| h (m) - c_{j + 1} | \leq {osc}_{I_{k}} h + | c_{k} - c_{j + 1} | .

The difference

| c_{k} - c_{j + 1} |

is bounded by the oscillation on the union of these neighboring blocks, which in turn is controlled (up to a constant depending only on L) by

{max}_{| k - (j + 1) | \leq L} {osc}_{I_{k}} h

. Thus there exists a constant

C_{1} > 0

such that

sup_{m \in U_{j}^{even}} | h (m) - c_{j + 1} | \leq C_{1} max_{| k - (j + 1) | \leq L} {osc}_{I_{k}} h .

Using (104) and the fact that

ϑ^{k} \leq ϑ^{j}

for

k \geq j + 1

and fixed

ϑ \in (0, 1)

, we obtain

max_{| k - (j + 1) | \leq L} {osc}_{I_{k}} h \leq C_{0} max_{| k - (j + 1) | \leq L} ϑ^{k} {[h]}_{tree} \leq C_{0}^{'} ϑ^{j} {[h]}_{tree},

for some

C_{0}^{'} > 0

independent of j and h. Combining these bounds yields

|\frac{1}{| U_{j}^{even} |} \sum_{m \in U_{j}^{even}} h (m) - c_{j + 1}| = | A (U_{j}^{even}) - c_{j + 1} | \leq C ϑ^{j} {[h]}_{tree},

with

C : = C_{1} C_{0}^{'}

independent of j and h, which is the first inequality.

The argument for the odd band

U_{j - 1}^{odd} = J_{j - 1}

is entirely analogous. By construction

U_{j - 1}^{odd}

lies inside the union of a bounded number of blocks at scale

j - 1

, and

| U_{j - 1}^{odd} | ≍ | I_{j - 1} |

with constants independent of j. Repeating the same steps with

j - 1

in place of

j + 1

, we obtain

|\frac{1}{| U_{j - 1}^{odd} |} \sum_{m \in U_{j - 1}^{odd}} h (m) - c_{j - 1}| \leq C ϑ^{j - 1} {[h]}_{tree},

possibly after enlarging C once more. This proves both claimed inequalities and completes the proof. □

Proposition 5.19

(Effective perturbed recursion with explicit

a, b

). Let

0 < α < 1

,

0 < ϑ < 1

,

σ > 1

, and let

h \in B_{tree, σ}

satisfy

P h = h

. For each scale block

I_{j} = [6^{j}, 2 \cdot 6^{j}) \cap N

define the block masses and averages

H_{j} : = \sum_{n \in I_{j}} h (n), c_{j} : = \frac{H_{j}}{| I_{j} |} = \frac{H_{j}}{6^{j}}, j \geq 0 .

Let

a, b > 0

and

{(ε_{j})}_{j \geq 1}

be the constants and error sequence from Proposition 5.14, so that

c_{j} = a c_{j + 1} + b c_{j - 1} + ε_{j}, j \geq 1,

(105)

and

\sum_{j \geq 0} | ε_{j} | ϑ^{j} < \infty .

Then the coefficients

a, b

satisfy the explicit bounds

\frac{1}{12} \leq a \leq \frac{1}{6}, \frac{1}{12} \leq b \leq \frac{1}{6},

(106)

and, after possibly redefining the perturbation by absorbing the j–dependent fluctuations of the even and odd contributions into

ε_{j}

, the error sequence obeys the sharper estimate

\sum_{j \geq 1} | ε_{j} | ϑ^{j} \leq C {[h]}_{tree},

(107)

for a constant

C = C (α, ϑ, σ)

independent of h. In particular,

{∥ ε ∥}_{ϑ} < \infty

.

Proof.

Since

P h = h

,

H_{j} = \sum_{n \in I_{j}} h (n) = \sum_{n \in I_{j}} (\frac{h (2 n)}{2 n} + 1_{{n \equiv 4 (6)}} \frac{h ((n - 1) / 3)}{(n - 1) / 3}) = : E_{j} + O_{j} .

(108)

Even contribution. The image

2 I_{j} = [2 \cdot 6^{j}, 4 \cdot 6^{j})

has length

2 \cdot 6^{j}

, and

\frac{1}{4 \cdot 6^{j}} \leq \frac{1}{2 n} \leq \frac{1}{2 \cdot 6^{j}} (m = 2 n \in 2 I_{j}) .

Hence

\frac{1}{4 \cdot 6^{j}} \sum_{m \in 2 I_{j}} h (m) \leq E_{j} \leq \frac{1}{2 \cdot 6^{j}} \sum_{m \in 2 I_{j}} h (m) .

By Lemma 5.18,

\frac{1}{| 2 I_{j} |} \sum_{m \in 2 I_{j}} h (m) = c_{j + 1} + O (ϑ^{j} {[h]}_{tree}),

so

E_{j} = \frac{| 2 I_{j} |}{4 \cdot 6^{j}} (c_{j + 1} + O (ϑ^{j} {[h]}_{tree})) to \frac{| 2 I_{j} |}{2 \cdot 6^{j}} (c_{j + 1} + O (ϑ^{j} {[h]}_{tree})),

and since

| 2 I_{j} | = 2 \cdot 6^{j}

,

\frac{1}{2} c_{j + 1} + O (ϑ^{j} {[h]}_{tree}) \leq E_{j} \leq c_{j + 1} + O (ϑ^{j} {[h]}_{tree}) .

(109)

Odd contribution. Changing variables

m = (n - 1) / 3

gives the image interval

J_{j - 1} = [\frac{6^{j} - 1}{3}, \frac{2 \cdot 6^{j} - 1}{3}) \cap N \subset [2 \cdot 6^{j - 1}, 4 \cdot 6^{j - 1}),

with

| J_{j - 1} | = 2 \cdot 6^{j - 1} + O (1)

and

\frac{1}{4 \cdot 6^{j - 1}} \leq \frac{1}{m} \leq \frac{1}{2 \cdot 6^{j - 1}} (m \in J_{j - 1}) .

As in the even case,

\sum_{m \in J_{j - 1}} h (m) = | J_{j - 1} | c_{j - 1} + O (6^{j - 1} ϑ^{j - 1} {[h]}_{tree}) .

Thus

\frac{1}{2} c_{j - 1} + O (ϑ^{j - 1} {[h]}_{tree}) \leq O_{j} \leq c_{j - 1} + O (ϑ^{j - 1} {[h]}_{tree}) .

(110)

Collecting the bounds. Dividing (109) and (110) by

6^{j}

and using

H_{j} = E_{j} + O_{j}

,

c_{j} = a c_{j + 1} + b c_{j - 1} + ϵ_{j},

with

a, b \in [\frac{1}{12}, \frac{1}{6}], | ϵ_{j} | \leq C ϑ^{j} {[h]}_{tree} .

This proves the result. □

Remark 5.20

(Interpretation of a,b). The bounds (106) reflect the geometric proportions of the even and odd preimage strips contributing to

I_{j}

. Each such strip has relative width comparable to

2 \cdot 6^{j}

, while the inverse-height factor coming from the Jacobian of the branch is of size

{(3 \cdot 6^{j})}^{- 1}

. Their product therefore lies in

[\frac{1}{2}, 1]

before normalization. Dividing by

| I_{j} | = 6^{j}

to pass from block mass to block average inserts an additional factor

1 / 6

, which places the effective coefficients in the interval

[\frac{1}{12}, \frac{1}{6}]

.

If finer preimage combinatorics are imposed (for example, restricting the odd branch precisely to residues

4 (mod 6)

), the ranges can be sharpened, but the bounds above already ensure

ρ (M) < 1

for

M = \begin{matrix} 0 & a \\ b & 0 \end{matrix}

.

Theorem 5.21

(Spectral bound for invariant profiles). Let

0 < α < 1

,

0 < ϑ < 1

,

σ > 1

, and

h \in B_{tree, σ}

satisfy

P h = h

. Let

c_{j}

be the block averages of h and suppose that they satisfy the effective recursion of Proposition 5.14:

c_{j} = a c_{j + 1} + b c_{j - 1} + ε_{j}, j \geq 1,

(111)

with

a, b > 0

independent of j and

\sum_{j \geq 0} | ε_{j} | ϑ^{j} < \infty

. Assume moreover (as ensured by the preimage counting) that

a + b = 1 and 0 < b < a < 1 .

(112)

Then:

1.: The sequence $(c_{j})$ converges exponentially fast to a limit $C \in C$ .
2.: The function h is identically equal to this constant: $h (n) \equiv C$ .
3.: Consequently, the eigenspace of P associated to the eigenvalue $λ = 1$ in $B_{tree, σ}$ is one-dimensional.

Proof.

1. Analysis of the homogeneous recursion. Ignoring

ε_{j}

for the moment, the homogeneous recurrence is

c_{j} = a c_{j + 1} + b c_{j - 1}, j \geq 1 .

(113)

Rewriting,

a c_{j + 1} - c_{j} + b c_{j - 1} = 0 .

Seeking solutions of the form

c_{j} = r^{j}

yields

a r^{2} - r + b = 0 .

By (112),

a + b = 1

, so

r = 1

is a root:

a - b = 1 - (a + b) + (a - b) = 0

reduces to

a + b = 1

. Thus one root is

r_{1} = 1

, and the other

r_{2}

satisfies

r_{1} r_{2} = b / a

, so

r_{2} = \frac{b}{a} .

(114)

The conditions

0 < b < a < 1

imply

0 < r_{2} < 1

, so the homogeneous recursion has a one-dimensional space of bounded solutions of the form

c_{j}^{hom} = C_{1} \cdot 1^{j} + C_{2} r_{2}^{j} = C_{1} + C_{2} r_{2}^{j},

where the non-constant mode decays exponentially at rate

r_{2}

.

2. Stability under summable perturbations. We now incorporate the perturbation

ε_{j}

.

From (111),

a c_{j + 1} = c_{j} - b c_{j - 1} - ε_{j},

so

c_{j + 1} = \frac{1}{a} c_{j} - \frac{b}{a} c_{j - 1} - \frac{1}{a} ε_{j}, j \geq 1 .

(115)

Define the vector

u_{j} : = (\begin{matrix} c_{j} \\ c_{j - 1} \end{matrix}), η_{j} : = (\begin{matrix} - ε_{j} / a \\ 0 \end{matrix}),

and the matrix

A : = (\begin{matrix} 1 / a & - b / a \\ 1 & 0 \end{matrix}) .

Then (115) is equivalent to

u_{j + 1} = A u_{j} + η_{j}, j \geq 1 .

(116)

The eigenvalues of A are exactly

r_{1} = 1

and

r_{2} = b / a

(the roots of

a r^{2} - r + b = 0

), with

| r_{2} | < 1

by (114). Let

P_{1}

and

P_{2}

denote the spectral projectors onto the eigenspaces corresponding to

r_{1}

and

r_{2}

, respectively. Then

P_{1} + P_{2} = I

and

A P_{1} = P_{1}, A P_{2} = r_{2} P_{2} .

Iterating (116),

u_{j} = A^{j - 1} u_{1} + \sum_{k = 1}^{j - 1} A^{j - 1 - k} η_{k} .

Decompose

u_{1} = P_{1} u_{1} + P_{2} u_{1}

and each

η_{k}

similarly. Using

A^{n} P_{1} = P_{1}

and

A^{n} P_{2} = r_{2}^{n} P_{2}

, we obtain

u_{j} = P_{1} u_{1} + r_{2}^{j - 1} P_{2} u_{1} + \sum_{k = 1}^{j - 1} (P_{1} η_{k} + r_{2}^{j - 1 - k} P_{2} η_{k}) .

Since

∥ η_{k} ∥ ≪ | ε_{k} |

and

\sum_{k \geq 0} | ε_{k} | ϑ^{k} < \infty

, in particular

\sum_{k} ∥ η_{k} ∥ < \infty

. Thus: - The series

\sum_{k \geq 1} P_{1} η_{k}

converges to some vector

w_{1}

. - The tail

\sum_{k = 1}^{j - 1} r_{2}^{j - 1 - k} P_{2} η_{k}

is bounded by

{sup}_{k} ∥ η_{k} ∥ \sum_{ℓ \geq 0} {| r_{2} |}^{ℓ}

and hence defines a sequence going to 0 as

j \to \infty

.

Therefore,

u_{j} = P_{1} u_{1} + w_{1} + r_{2}^{j - 1} P_{2} u_{1} + o (1) as j \to \infty .

Projecting onto the first coordinate,

c_{j} = C + O (r_{2}^{j}) + o (1),

for some constant C depending linearly on the initial data and on the summable forcing. In particular, there exist constants

C \in C

and

ρ \in (0, 1)

such that

| c_{j} - C | ≪ ρ^{j} for all j,

(117)

i.e.

(c_{j})

converges exponentially fast to C.

3. From block averages to pointwise constancy. Set

C : = {lim}_{j \to \infty} c_{j}

and define

g : = h - C

. Then

g \in B_{tree, σ}

,

P g = g

, and its block averages

d_{j} : = c_{j} - C

satisfy the same recursion (111) with limit 0 and the same summability property for the perturbation. By (117),

d_{j} \to 0

exponentially.

We now show that

g \equiv 0

. For

n \in I_{j}

, the tree seminorm control of g implies that the oscillation of g within

I_{j}

is small at large scales: more precisely, from the definition of

{[g]}_{tree}

and the growth of

W_{α}

on

I_{j}

one obtains

sup_{m, n \in I_{j}} | g (m) - g (n) | ≪ 6^{- (1 - α) j} {[g]}_{tree} .

(Here we use that

W_{α} (m, n) ≍ 6^{(2 - α) j} / | m - n |

on

I_{j}

, so boundedness of

ϑ^{j} W_{α} (m, n) | g (m) - g (n) |

forces the oscillation to decay with j.) Since also

d_{j} \to 0

, we have for

n \in I_{j}

:

| g (n) | \leq | g (n) - d_{j} | + | d_{j} | ≪ 6^{- (1 - α) j} {[g]}_{tree} + ρ^{j},

which tends to 0 uniformly on each block as

j \to \infty

. Thus

g (n) \to 0

as

n \to \infty

.

Finally, using

P g = g

and the connectivity of the Collatz preimage tree, we propagate this decay back to all indices. If there were

n_{0}

with

g (n_{0}) \neq 0

, then iterating

P g = g

forward would express g on arbitrarily large integers in terms of

g (n_{0})

, contradicting

g (n) \to 0

as

n \to \infty

. Formally,

P g = g

implies g is an eigenfunction with eigenvalue 1; by the quasi-compactness result (Theorem 4.19) and the analysis above, the only such eigenfunctions in

B_{tree, σ}

are constant functions. Since

g (n) \to 0

, this constant must be 0, so

g \equiv 0

.

Hence

h \equiv C

is constant.

4. One-dimensionality of the eigenspace. If

h_{1}, h_{2} \in B_{tree, σ}

satisfy

P h_{i} = h_{i}

, then their difference

g = h_{1} - h_{2}

also satisfies

P g = g

. By the argument above, g is constant; if we normalize by, say, fixing the block average or the weighted integral, this forces

g \equiv 0

. Thus the eigenspace for

λ = 1

is one-dimensional.

This completes the proof. □

Extension to Isolated Divergent Trajectories

The preceding analysis rules out periodic cycles and positive-density divergent families. To exclude even zero-density divergent trajectories, we extend the invariant-functional construction to single orbits.

Proposition 5.22

(Zero-density divergent orbits also induce invariants). Let

x_{0} \in N

and let

x_{k + 1} = T (x_{k})

be a forward Collatz orbit. Assume the orbit visits infinitely many scales: there exists a strictly increasing sequence

{(j_{r})}_{r \geq 1}

and times

k_{r}

such that

x_{k_{r}} \in I_{j_{r}}

for all r. Define level weights

w_{j} : = ϑ^{j} + 6^{- σ j}

and

φ_{N} : = \frac{1}{\sum_{r \leq N} w_{j_{r}}} \sum_{r \leq N} w_{j_{r}} δ_{x_{k_{r}}} \in B_{tree, σ}^{*} .

Then the Cesàro averages

Φ_{N} : = \frac{1}{N} \sum_{m = 0}^{N - 1} {(P^{*})}^{m} φ_{N}

form a bounded net in

B_{tree, σ}^{*}

. Every weak-* cluster point Φ of

(Φ_{N})

is nonzero and satisfies

P^{*} Φ = Φ

. Consequently

ℓ (f) : = 〈 f, Φ 〉

defines a nontrivial P-invariant functional on

B_{tree, σ}

.

Proof.

For

n \in I_{j (n)}

the point mass

δ_{n}

belongs to

B_{tree, σ}^{*}

and satisfies the dual bound

∥ δ_{n} ∥_{*} ≲ ϑ^{- j (n)} + {(6^{j (n)})}^{σ},

since

n ≍ 6^{j (n)}

on level

j (n)

. Each

φ_{N}

is a convex combination of such point masses with coefficients

w_{j_{r}}

and total weight

\sum_{r \leq N} w_{j_{r}}

, so

sup_{N} {∥ φ_{N} ∥}_{*} < \infty .

Because

P^{*}

is power–bounded on

B_{tree, σ}^{*}

, the Cesàro averages

Φ_{N} : = \frac{1}{N} \sum_{m = 0}^{N - 1} {(P^{*})}^{m} φ_{N}

are uniformly bounded. By Banach–Alaoglu the sequence has weak-* cluster points, and any such

Φ

satisfies

P^{*} Φ = Φ

.

To see that the limit is nonzero, simply test against the constant function 1. Since each

φ_{N}

is a probability measure,

〈 1, φ_{N} 〉 = 1 and hence 〈 1, Φ_{N} 〉 = 1 .

Passing to the limit gives

〈 1, Φ 〉 = 1

, so

Φ \neq 0

.

Thus

Φ

is a nontrivial

P^{*}

-invariant functional, and

ℓ (f) : = 〈 f, Φ 〉

is a nontrivial P-invariant linear functional on

B_{tree, σ}

. □

Together with the quasi-compactness and spectral-gap results, this ensures that every possible non-terminating configuration would produce a nonzero invariant functional in

B_{tree, σ}^{*}

, contradicting the established gap. Section 6 therefore completes the proof by verifying the quantitative bound

λ_{odd} < 1

.

5.5. Explicit Lasota–Yorke Constants

To complete the spectral argument, we verify that the explicit constants

(α, ϑ) = (\frac{1}{2}, \frac{1}{5})

used in Section 6 indeed yield

λ_{odd} < 1

.

Recall the odd-branch distortion constant at level shift

j \mapsto j - 1

:

λ_{odd} (α, ϑ) \leq \frac{C_{α}}{\sqrt{6}} ϑ, C_{α} : = sup_{\begin{matrix} u > v > 0 \\ u \equiv v \equiv 4 (6) \end{matrix}} \frac{W_{α} (u, v)}{W_{α} (u^{'}, v^{'})},

(118)

where

(u^{'}, v^{'}) = (\frac{u - 1}{3}, \frac{v - 1}{3})

are the odd-preimages. At

α = \frac{1}{2}

, Lemma 4.15 gives

C_{1 / 2} = \frac{16}{3^{3 / 2}} < 3.1 .

Therefore

λ_{odd} (\frac{1}{2}, \frac{1}{5}) \leq \frac{16}{3^{3 / 2} \sqrt{6}} \cdot \frac{1}{5} = \frac{16}{3^{2} \sqrt{2}} \cdot \frac{1}{5} \approx 0.25 < 1 .

Hence

λ_{odd} < 1

in this parameter regime.

Next we verify that the block-recursion coefficients

a, b

obtained from preimage ratios satisfy the bounds implied by the spectral condition. As established in Lemma 5.4,

a = lim_{j \to \infty} a_{j} = \frac{6}{7}, b = lim_{j \to \infty} b_{j} = \frac{1}{7}, a + b = 1,

whence

\sqrt{a b} = \frac{\sqrt{6}}{7} \approx 0.35 < 1 .

This quantitative consistency between the analytic Lasota–Yorke contraction and the arithmetic preimage densities closes the argument: the invariant density is constant, the radius of the homogeneous two-sided recursion is

< 1

, and the backward operator P has a genuine spectral gap on

B_{tree, σ}

.

Theorem 5.23

(Spectral rigidity on the unit circle). Assume:

1.: P satisfies the Lasota–Yorke inequality of Proposition 4.12 on $B_{tree, σ}$ , and the embedding $B_{tree, σ} ↪ ℓ_{σ}^{1}$ is compact. Hence P is quasi-compact on $B_{tree, σ}$ with essential spectral radius $ρ_{ess} (P) < 1$ .
2.: For every eigenfunction $h \in B_{tree, σ}$ with $P h = λ h$ and $| λ | = 1$ , the block averages $c_{j}$ of h satisfy the effective perturbed recursion of Proposition 5.14: there exist $a, b > 0$ (independent of h) and a sequence $(ε_{j})$ with $\sum_{j \geq 0} | ε_{j} | ϑ^{j} < \infty$ such that

$c_{j} = a c_{j + 1} + b c_{j - 1} + ε_{j}, j \geq 1 .$

Assume moreover that $a + b = 1$ , $0 < b < a < 1$ , and that the associated homogeneous recursion has spectral radius $\sqrt{a b} < 1$ .

Then any eigenvalue λ of P on the unit circle must satisfy

λ = 1

. Moreover the

λ = 1

eigenspace is one–dimensional. In particular,

σ (P) \cap {z : | z | = 1} = {1}, ρ (P) = 1 < 1 / ρ_{ess} (P) .

Proof.

Let

h \in B_{tree, σ}

satisfy

P h = λ h

with

| λ | = 1

. Let

c_{j}

be the associated block averages. By Proposition 5.14, they satisfy the perturbed recursion

c_{j} = a c_{j + 1} + b c_{j - 1} + ε_{j}, j \geq 1,

with

a + b = 1

,

0 < b < a < 1

, and

\sum_{j \geq 0} | ε_{j} | ϑ^{j} < \infty

.

Step 1: Decay of block averages. Writing the recursion in first-order form

u_{j + 1} = A u_{j} + η_{j}, u_{j} = (\begin{matrix} c_{j} \\ c_{j - 1} \end{matrix}),

the matrix A has spectral radius

ρ (A) < 1

under the hypotheses on

a, b

. Since

\sum_{j} ∥ η_{j} ∥ < \infty

, the usual stability estimate for summably-forced linear recurrences gives

lim_{j \to \infty} u_{j} = 0 .

In particular,

lim_{j \to \infty} c_{j} = 0 .

(119)

Step 2: Oscillation control implies pointwise decay of h. For any j and any

m, n \in I_{j}

, the tree seminorm gives

W_{α} (m, n) | h (m) - h (n) | \leq ϑ^{- j} {[h]}_{tree} .

Since

| m - n | ≲ 6^{j}

in

I_{j}

and

W_{α} (m, n) ≍ 6^{(2 - α) j} / | m - n |

, this yields

sup_{m, n \in I_{j}} | h (m) - h (n) | ≪ 6^{- (1 - α) j} {[h]}_{tree} .

Thus each block satisfies

sup_{n \in I_{j}} | h (n) - c_{j} | ≪ 6^{- (1 - α) j} {[h]}_{tree} .

Together with (119) we obtain

lim_{j \to \infty} sup_{n \in I_{j}} | h (n) | = 0,

hence

h (n) \to 0

as

n \to \infty

.

Step 3: Use the full $B_{tree, σ}$ –norm to force $h \equiv 0$ . Since

h \in B_{tree, σ}

, the full norm is of the form

{∥ h ∥}_{tree, σ} = {[h]}_{tree} + A {∥ h ∥}_{1, σ} (A > 0) .

The decay

h (n) \to 0

forces the tail of

{∥ h ∥}_{1, σ}

to vanish. If h were nonzero, choose

m_{0}

with

h (m_{0}) \neq 0

. The invariance relation

P h = λ h

implies h is nonzero on all backward iterates of

m_{0}

. But these backward iterates visit arbitrarily large levels (because the odd branch

(n - 1) / 3

is only defined on density

1 / 6

of the integers), contradicting the fact that

h (n) \to 0

on every sequence escaping to infinity. Hence h must be identically zero.

Step 4: Exclusion of the peripheral spectrum. By quasi-compactness and

ρ_{ess} (P) < 1

(assumption (1)), any spectral value of P on

| z | = 1

must be an eigenvalue. Step 3 shows that the only eigenfunction with

| λ | = 1

is

h \equiv 0

, hence no nonzero eigenfunction exists, and therefore

σ (P) \cap {z \in C : | z | = 1} = ⌀, ρ (P) < 1 .

□

Theorem 5.24

(Spectral criterion for absence of divergent mass). Let P act on

B_{tree, σ}

and suppose:

1.: P is quasi-compact on $B_{tree, σ}$ with $ρ_{ess} (P) < 1$ ;
2.: P has no eigenvalues on the unit circle except possibly $λ = 1$ ;
3.: the eigenspace for $λ = 1$ is one-dimensional and generated by a strictly positive $h \in B_{tree, σ}$ with $P h = h$ .

Then there exists no nontrivial P–invariant probability density in

B_{tree, σ}

supported on nonterminating orbits or on any nontrivial forward Collatz cycle. Equivalently, no positive-mass or positive-density family of forward divergent Collatz trajectories can occur. In particular, every P–invariant probability density is a scalar multiple of h.

Proof.

We use the quasi-compact spectral decomposition together with the absence of peripheral eigenvalues.

Step 1: Spectral decomposition and convergence of iterates.

By (1), the quasi-compactness of P yields a decomposition

P = Π P Π + N, Π N = N Π = 0, ∥ N^{k} ∥ = O (ρ^{k}) (0 < ρ < 1),

(120)

where

Π

is the spectral projector corresponding to the peripheral spectrum. By (2)–(3), the peripheral spectrum consists only of the simple eigenvalue 1 with strictly positive eigenvector h and dual eigenfunctional

φ

, normalized by

φ (h) = 1

. Thus the spectral projector is

Π f = φ (f) h, f \in B_{tree, σ} .

(121)

Iterating the decomposition,

P^{k} f = Π f + N^{k} f ⟶ φ (f) h as k \to \infty

(122)

in

B_{tree, σ}

.

Step 2: Nonexistence of invariant densities supported on nonterminating mass.

Suppose

g \in B_{tree, σ}

is a P-invariant probability density supported entirely on nonterminating orbits or a nontrivial cycle. Then

g = P^{k} g

for all

k \geq 0

. Applying (122),

g = φ (g) h + N^{k} g ⟶ φ (g) h .

Hence

g = φ (g) h

.

Because g is a probability density for counting measure,

\sum_{n \geq 1} g (n) = 1

, but the strictly positive eigenfunction h satisfies

\sum_{n \geq 1} h (n) = \infty

. Thus no scalar multiple of h can be integrable, forcing

g \equiv 0

, contrary to

\sum g = 1

. Therefore no such invariant density can exist.

Step 3: Exclusion of nontrivial cycles.

If a nontrivial Collatz q–cycle existed, the induced invariant density supported on the cycle would produce an eigenvalue

λ = e^{2 π i / q} \neq 1

of P on the unit circle, contradicting (2). Hence no nontrivial periodic cycle supports an invariant density in

B_{tree, σ}

.

Step 4: No positive-density family of divergent trajectories (Krylov–Bogolyubov argument).

Assume for contradiction that there exists a set

S \subset N

with positive upper density such that each

n \in S

has a nonterminating Collatz orbit.

Let

ν_{N}

be the normalized counting functional on

S \cap [1, N]

:

ν_{N} = \frac{1}{| S \cap [1, N] |} \sum_{n \in S \cap [1, N]} δ_{n} \in B_{tree, σ}^{*} .

Form Cesàro averages of its forward pushforwards:

η_{N, K} = \frac{1}{K} \sum_{k = 0}^{K - 1} T_{*}^{k} ν_{N} = \frac{1}{K} \sum_{k = 0}^{K - 1} ν_{N} \circ P^{k} .

Each

η_{N, K}

is positive, normalized, and supported in the nonterminating set

N

.

By Lemma 5.26,

{η_{N, K}}_{N, K}

is uniformly bounded in

B_{tree, σ}^{*}

; hence by Banach–Alaoglu it has weak* cluster points. Fix N and let

ψ_{N}

be a weak* limit of

{(η_{N, K})}_{K}

. Then

T_{*} ψ_{N} = ψ_{N}

, so

ψ_{N}

is

P^{*}

-invariant.

Letting

N \to \infty

and extracting a further weak* limit

ψ

yields a positive, normalized functional supported in

N

with

P^{*} ψ = ψ

. Thus

ψ

is a nontrivial P-invariant functional.

Step 5: Contradiction via spectral rigidity.

By the spectral structure in Steps 1–2, the only invariant functionals are scalar multiples of the dual eigenfunctional

φ

. Thus

ψ = φ

. But

φ

assigns positive weight to every level (because h is strictly positive), while

ψ

vanishes on all integers that enter the terminating cycle. Thus

ψ \neq φ

, a contradiction.

Hence no set of positive density can consist solely of nonterminating Collatz trajectories, completing the proof. □

5.6. Orbit-Generated Invariant Functionals and Their Support

Lemma 5.25

(Admissible orbit-generated functionals; support property). Let

O = {n_{t}}_{t \geq 0}

be a forward Collatz orbit, and suppose

B_{tree, σ} ↪ ℓ^{1} (N)

continuously. Then each point evaluation

δ_{n} : f \mapsto f (n)

belongs to

B_{tree, σ}^{*}

with

∥ δ_{n} ∥_{B_{tree, σ}^{*}} \leq C_{emb}

, where

C_{emb}

is the embedding constant.

Define the Cesàro averages along the orbit,

μ_{K} : = \frac{1}{K} \sum_{t = 0}^{K - 1} δ_{n_{t}} (K \geq 1),

so that

μ_{K} \in B_{tree, σ}^{*}

and

∥ μ_{K} ∥ \leq C_{emb}

. Any weak* limit point ψ of

{(μ_{K})}_{K \geq 1}

in

B_{tree, σ}^{*}

is called anadmissible orbit-generated functionalfor

O

. Every such ψ satisfies:

1.: ψ is positive and normalized: $ψ (f) \geq 0$ for $f \geq 0$ , and $ψ (1) = 1$ .
2.: (Support property) If $f \in B_{tree, σ}$ vanishes on the orbit $O$ , then $ψ (f) = 0$ .

Moreover, if the family

(μ_{K})

is asymptotically

P^{*}

-invariant in the sense that

lim_{K \to \infty} {∥ P^{*} μ_{K} - μ_{K} ∥}_{B_{tree, σ}^{*}} = 0,

(123)

then every weak* limit ψ satisfies

ψ (P f) = ψ (f) for all f \in B_{tree, σ},

(124)

i.e. ψ is

P^{*}

-invariant.

Proof.

Since

B_{tree, σ} ↪ ℓ^{1} (N)

continuously, evaluation at any point n is a bounded linear functional:

| δ_{n} (f) | = | f (n) | \leq C_{emb} {∥ f ∥}_{B_{tree, σ}}, ∥ δ_{n} ∥ \leq C_{emb} .

Thus each

μ_{K}

is a convex combination of uniformly bounded functionals, hence

∥ μ_{K} ∥ \leq C_{emb}

.

Weak* limits are positive and normalized.

Every

δ_{n_{t}}

is a positive functional with

δ_{n_{t}} (1) = 1

. Convexity gives

μ_{K} (f) \geq 0 for f \geq 0, μ_{K} (1) = 1 .

Both properties are preserved under weak* limits, so any limit

ψ

satisfies

ψ \geq 0

and

ψ (1) = 1

.

Support property.

If

f \in B_{tree, σ}

vanishes on

O

, then

f (n_{t}) = 0

for all t, hence

μ_{K} (f) = \frac{1}{K} \sum_{t = 0}^{K - 1} f (n_{t}) = 0 for every K .

Taking weak* limits gives

ψ (f) = 0

. Thus

ψ

is supported on the orbit.

Asymptotic invariance implies P*-invariance.

Suppose now that

∥ P^{*} μ_{K} - μ_{K} ∥ \to 0

. Let

ψ

be a weak* limit of some subsequence

μ_{K_{j}}

. For any

f \in B_{tree, σ}

,

ψ (P f) = lim_{j \to \infty} μ_{K_{j}} (P f) = lim_{j \to \infty} (P^{*} μ_{K_{j}}) (f) .

But

∥ (P^{*} μ_{K_{j}}) (f) - μ_{K_{j}} (f) ∥ \leq ∥ P^{*} μ_{K_{j}} - μ_{K_{j}} ∥ \cdot ∥ f ∥ ⟶ 0,

so

ψ (P f) = lim_{j \to \infty} μ_{K_{j}} (f) = ψ (f) .

This is precisely (124). □

Lemma 5.26

(Uniform dual-norm control for

P^{*}

–Cesàro averages). Fix

n_{0} \in N

and define

Ψ_{N} : = \frac{1}{N} \sum_{k = 0}^{N - 1} {(P^{*})}^{k} δ_{n_{0}} (N \geq 1),

so that

Ψ_{N} \in B_{tree, σ}^{*}

. Then there exists a constant

C_{σ} > 0

, independent of N, such that

∥ Ψ_{N} ∥_{B_{tree, σ}^{*}} \leq C_{σ} for all N \geq 1 .

Consequently, the sequence

{(Ψ_{N})}_{N \geq 1}

is weak* relatively compact in

B_{tree, σ}^{*}

.

Proof.

Let

f \in B_{tree, σ}

satisfy

{∥ f ∥}_{tree, σ} \leq 1

. By the block-envelope inequality (Lemma 5.26), there exists

C > 0

depending only on the structure of

B_{tree, σ}

such that for every

m \in N

,

| f (m) | \leq C 6^{- σ j (m)},

(125)

where

j (m)

is the unique scale index with

m \in I_{j (m)}

.

By the coarse forward envelope for Collatz orbits (Lemma 2.2), there exist constants

c > 0

and

C_{1} \geq 0

such that

j (T^{k} n_{0}) \geq c k - C_{1} (k \geq 0) .

(126)

Combining (125) and (126),

|f (T^{k} n_{0})| \leq C 6^{- σ (c k - C_{1})} = C^{'} ρ^{k}, ρ : = 6^{- σ c} \in (0, 1), C^{'} : = C 6^{σ C_{1}} .

Now evaluate

Ψ_{N}

on f:

〈 Ψ_{N}, f 〉 = \frac{1}{N} \sum_{k = 0}^{N - 1} (P^{*})^{k} δ_{n_{0}}) (f) = \frac{1}{N} \sum_{k = 0}^{N - 1} f (T^{k} n_{0}) .

Using the above uniform bound,

|〈 Ψ_{N}, f 〉| \leq \frac{1}{N} \sum_{k = 0}^{N - 1} C^{'} ρ^{k} \leq \frac{C^{'}}{N (1 - ρ)} .

Since

N \geq 1

, this yields the uniform bound

|〈 Ψ_{N}, f 〉| \leq \frac{C^{'}}{1 - ρ} = : C_{σ} .

As this holds for every f with

{∥ f ∥}_{tree, σ} \leq 1

, we obtain

∥ Ψ_{N} ∥_{B_{tree, σ}^{*}} \leq C_{σ} for all N .

Finally, the unit ball of

B_{tree, σ}^{*}

is weak* compact (Banach–Alaoglu), so the uniformly bounded sequence

(Ψ_{N})

is weak* relatively compact. □

Proposition 5.27

(Weak* limits of

P^{*}

–Cesàro averages are invariant). With

Ψ_{N}

as in Lemma 5.26, every weak* cluster point Ψ of

{(Ψ_{N})}_{N \geq 1}

satisfies

P^{*} Ψ = Ψ .

Proof.

By Lemma 5.26, the family

(Ψ_{N})

is uniformly bounded in

B_{tree, σ}^{*}

, hence weak* relatively compact.

Let

Ψ

be a weak* limit of a subsequence

{(Ψ_{N_{j}})}_{j \geq 1}

. For each

f \in B_{tree, σ}

,

Ψ_{N_{j}} (f) = \frac{1}{N_{j}} \sum_{k = 0}^{N_{j} - 1} {(P^{*})}^{k} δ_{n_{0}} (f) = \frac{1}{N_{j}} \sum_{k = 0}^{N_{j} - 1} f (T^{k} n_{0}),

and similarly

(P^{*} Ψ_{N_{j}}) (f) = Ψ_{N_{j}} (P f) = \frac{1}{N_{j}} \sum_{k = 0}^{N_{j} - 1} f (T^{k + 1} n_{0}) .

A telescoping difference gives

| Ψ_{N_{j}} (f) - (P^{*} Ψ_{N_{j}}) (f) | = \frac{1}{N_{j}} |f (n_{0}) - f (T^{N_{j}} n_{0})| \leq \frac{{2 ∥ f ∥}_{\infty}}{N_{j}} .

Since

B_{tree, σ} ↪ ℓ^{1}

implies point evaluations are bounded, we have

{∥ f ∥}_{\infty} ≲ {∥ f ∥}_{B_{tree, σ}}

, and therefore

∥ P^{*} Ψ_{N_{j}} - Ψ_{N_{j}} ∥_{B_{tree, σ}^{*}} ⟶ 0 .

Now use weak* continuity of

P^{*}

(true because P is bounded): for every

f \in B_{tree, σ}

,

(P^{*} Ψ) (f) = Ψ (P f) = lim_{j \to \infty} Ψ_{N_{j}} (P f) = lim_{j \to \infty} (P^{*} Ψ_{N_{j}}) (f) = lim_{j \to \infty} Ψ_{N_{j}} (f) = Ψ (f) .

Thus

P^{*} Ψ = Ψ

. □

Remark 5.28

(Nontriviality of orbit-generated functionals). The conclusion of Proposition 5.27 ensures only that any weak* limit

Ψ

of the Cesàro averages

(Ψ_{N})

is

P^{*}

–invariant; it does not guarantee that

Ψ

is nonzero. For a sufficiently sparse or rapidly escaping orbit, the evaluations

f (T^{k} n_{0})

may tend to zero so quickly that the averages

Ψ_{N} (f) = \frac{1}{N} \sum_{k < N} f (T^{k} n_{0})

converge to 0 for every

f \in B_{tree, σ}

, in which case

Ψ_{N} \overset{*}{⟶} 0

in

B_{tree, σ}^{*}

. Thus the weak* cluster point may be the zero functional.

For this reason, the conditional conclusions in Theorems 5.30 and 5.33 explicitly assume that the orbit under consideration generates a nontrivial invariant functional in

B_{tree, σ}^{*}

.

Remark 5.29

(Scope of the dynamical consequences). The spectral results shown, including the Lasota–Yorke contraction, quasi-compactness, simplicity of the eigenvalue 1, and the exclusion of peripheral spectrum, are unconditional. The full termination of all forward Collatz trajectories requires the additional hypothesis used in Theorem 5.31, namely that every infinite forward orbit generates a nontrivial

P^{*}

-invariant functional in

B_{tree, σ}^{*}

. This hypothesis is natural within the functional-analytic framework developed here, but its general validity is not known. Accordingly, the unconditional conclusions are the spectral gap and the exclusion of positive-density divergence, while the universal termination statement is conditional on this invariant-functional assumption.

Theorem 5.30

(From spectral gap to pointwise termination). Assume the hypotheses of Theorem 5.24. If, in addition, every infinite forward Collatz orbit generates a nontrivial weak* limit of

P^{*}

–Cesàro averages in

B_{tree, σ}^{*}

, then no such infinite orbit can exist. Consequently, every Collatz trajectory enters the 1–2 cycle.

Proof.

Under the assumptions of Theorem 5.24, the operator P is quasi-compact on

B_{tree, σ}

with

ρ_{ess} (P) < 1

, has no eigenvalues on

| z | = 1

except

λ = 1

, and the

λ = 1

eigenspace is one-dimensional, spanned by a strictly positive invariant density h with

P h = h

. Let

φ \in B_{tree, σ}^{*}

be the dual eigenfunctional, normalized by

φ (h) = 1

.

Quasi-compactness gives a spectral decomposition

P = Π + N, Π f = φ (f) h, Π N = N Π = 0, ∥ N^{k} ∥ = O (ρ^{k}), 0 < ρ < 1 .

(127)

Iterating,

P^{k} f = φ (f) h + N^{k} f ⟶ φ (f) h in B_{tree, σ} .

(128)

Step 1: Any invariant dual functional is a scalar multiple of $φ$ .

Let

Ψ \in B_{tree, σ}^{*}

satisfy

P^{*} Ψ = Ψ

. Then for every

f \in B_{tree, σ}

and

k \geq 1

,

Ψ (f) = Ψ (P^{k} f) = Ψ (Π f + N^{k} f) = Ψ (Π f) + Ψ (N^{k} f) .

Since

∥ N^{k} ∥ \to 0

exponentially and

Ψ

is bounded,

Ψ (N^{k} f) \to 0

. Using

Π f = φ (f) h

, we obtain

Ψ (f) = Ψ (φ (f) h) = Ψ (h) φ (f) for all f .

(129)

Thus every

P^{*}

-invariant functional is of the form

Ψ = c φ

with

c = Ψ (h)

.

Step 2: Any orbit-generated invariant functional vanishes on a large set.

Let

O = {T^{t} n_{0}}_{t \geq 0}

be an infinite Collatz orbit. By the hypothesis of the theorem, the Cesàro averages

Ψ_{N} = \frac{1}{N} \sum_{k = 0}^{N - 1} {(P^{*})}^{k} δ_{n_{0}}

admit a nontrivial weak* limit

Ψ

with

P^{*} Ψ = Ψ

.

By construction,

Ψ

is supported on

O

: if g vanishes on

O

, then

Ψ_{N} (g) = 0

for all N, hence

Ψ (g) = 0

.

We now construct

f_{*} \in B_{tree, σ}

such that

(i)

f_{*} \geq 0

, (ii)

f_{*} \neg \equiv 0

, (iii)

f_{*}

vanishes on

O

, hence

Ψ (f_{*}) = 0

, (iv)

φ (f_{*}) > 0

.

Let

I_{j} = [6^{j}, 2 \cdot 6^{j})

be the scale-j block and

E_{j} : = O \cap I_{j}

the (finite) set of orbit points inside

I_{j}

. Set

J_{j} = I_{j} ∖ E_{j}

and let

v_{j} = ϑ^{2 j}

(with the same

0 < ϑ < 1

from the definition of

B_{tree, σ}

). Define

f_{*} (n) = \{\begin{matrix} v_{j}, & n \in J_{j}, \\ 0, & n \in E_{j}, \end{matrix} n \in I_{j} .

Then

∥ f_{*} ∥_{1} \leq \sum_{j} 6^{j} ϑ^{2 j} < \infty

and the tree seminorm

{[f_{*}]}_{tree}

is finite because

f_{*}

is blockwise constant outside finitely many points. Hence

f_{*} \in B_{tree, σ}

.

Since

f_{*}

is nonzero and supported on all but finitely many points of each

I_{j}

, and

φ

is strictly positive (because

h > 0

), we have

φ (f_{*}) > 0 .

(130)

But

f_{*}

vanishes on

O

, so the orbit-generated functional satisfies

Ψ (f_{*}) = 0 .

(131)

Step 3: Contradiction.

Since

Ψ = c φ

by (129), evaluating at

f_{*}

gives

0 = Ψ (f_{*}) = c φ (f_{*}) .

Using

φ (f_{*}) > 0

, we obtain

c = 0

. Thus

Ψ = 0

, contradicting the assumed nontriviality of

Ψ

.

Therefore no infinite forward Collatz orbit can exist. Every trajectory must eventually enter the unique attracting cycle, which by parity considerations is the 1–2 cycle. □

Lemma 5.31

(Uniform dual bound for orbit Cesàro averages). Let

B_{tree, σ}

be the multiscale tree space constructed above, and let

δ_{n} \in B_{tree, σ}^{*}

denote point evaluation at n, which is continuous because

B_{tree, σ} ↪ ℓ^{1}

. Fix

n_{0} \in N

with an infinite forward orbit

O^{+} (n_{0}) = {T^{k} n_{0}}_{k \geq 0}

under the Collatz map T. For each

N \geq 1

define the Cesàro averages

Λ_{N} (f) : = \frac{1}{N} \sum_{k = 0}^{N - 1} f (T^{k} n_{0}), f \in B_{tree, σ} .

(132)

Then each

Λ_{N}

lies in

B_{tree, σ}^{*}

, and there exists a constant

C > 0

, independent of N, such that

sup_{N \geq 1} {∥ Λ_{N} ∥}_{B_{tree, σ}^{*}} \leq C .

(133)

Proof.

Let

f \in B_{tree, σ}

satisfy

{∥ f ∥}_{tree, σ} \leq 1

. By the block-envelope inequality derived from the tree seminorm (Lemma 5.26), there exists

C_{0} > 0

such that for every

m \in N

,

| f (m) | \leq C_{0} 6^{- σ j (m)},

(134)

where

j (m)

is the unique scale with

m \in I_{j (m)}

.

By the coarse forward envelope for Collatz (Lemma 2.2), there exist constants

c > 0

and

C_{1} \geq 0

such that

j (T^{k} n_{0}) \geq c k - C_{1} (k \geq 0) .

(135)

Combining (134) and (135),

|f (T^{k} n_{0})| \leq C_{0} 6^{- σ j (T^{k} n_{0})} \leq C_{0} 6^{- σ (c k - C_{1})} = C^{'} ρ^{k},

where

ρ : = 6^{- σ c} \in (0, 1)

and

C^{'} : = C_{0} 6^{σ C_{1}}

.

Now evaluate

Λ_{N}

on f:

| Λ_{N} (f) | \leq \frac{1}{N} \sum_{k = 0}^{N - 1} | f (T^{k} n_{0}) | \leq \frac{1}{N} \sum_{k = 0}^{N - 1} C^{'} ρ^{k} \leq \frac{C^{'}}{N} \cdot \frac{1 - ρ^{N}}{1 - ρ} \leq \frac{C^{'}}{1 - ρ} = : C .

Because this bound holds for every f with

{∥ f ∥}_{tree, σ} \leq 1

, it follows that

∥ Λ_{N} ∥_{B_{tree, σ}^{*}} \leq C for all N \geq 1 .

Thus

(Λ_{N})

is uniformly bounded in the dual norm, and hence weak* relatively compact by Banach–Alaoglu. This completes the proof. □

Proposition 5.32

(Orbit–generated invariant functional). Let

n_{0} \in N

have an infinite forward orbit

O^{+} (n_{0}) = {T^{k} n_{0}}_{k \geq 0}

under the Collatz map T. Let

Λ_{N}

be the Cesàro averages defined in (132). Assume that the orbit of

n_{0}

generates at least one nontrivial weak* limit of the family

{(Λ_{N})}_{N \geq 1}

.

Then the following hold:

(i): There exists a subsequence ${(N_{j})}_{j \geq 1}$ and a nonzero functional $Φ \in B_{tree, σ}^{*}$ such that $Λ_{N_{j}} \overset{w^{*}}{⟶} Φ$ .
(ii): Φ is invariant under the dual Collatz operator:

$Φ (P f) = Φ (f) for all f \in B_{tree, σ}, i . e . P^{*} Φ = Φ .$

(136)
(iii): Φ is supported on the orbit $O^{+} (n_{0})$ : if $f \in B_{tree, σ}$ satisfies ${f |}_{O^{+} (n_{0})} \equiv 0$ , then

$Φ (f) = 0 .$

Thus Φ is a nontrivial

P^{*}

–invariant functional generated solely by the orbit

O^{+} (n_{0})

.

Proof.

By Lemma 5.31, the functionals

Λ_{N}

are uniformly bounded in

B_{tree, σ}^{*}

. Hence they are weak* relatively compact. By the hypothesis that the orbit generates a nontrivial limit, there exists a subsequence

(N_{j})

and a nonzero weak* limit

Φ

. This proves (i).

Invariance. For each

f \in B_{tree, σ}

,

Λ_{N} (P f) = \frac{1}{N} \sum_{k = 0}^{N - 1} (P f) (T^{k} n_{0}) = \frac{1}{N} \sum_{k = 0}^{N - 1} f (T^{k + 1} n_{0}) = Λ_{N} (f) - \frac{f (n_{0}) - f (T^{N} n_{0})}{N} .

Hence

∥ Λ_{N} \circ P - Λ_{N} ∥ \leq \frac{2 ∥ δ_{n_{0}} ∥}{N} \underset{N \to \infty}{\to} 0 .

Passing to the weak* limit along the subsequence

(N_{j})

gives

Φ \circ P = Φ

, proving (ii).

Support on the orbit. If f vanishes on

O^{+} (n_{0})

, then

f (T^{k} n_{0}) = 0

for all k, hence

Λ_{N} (f) = 0

for all N. Taking weak* limits yields

Φ (f) = 0

, proving (iii). □

Theorem 5.33

(Exclusion of zero-density infinite trajectories). Assume that the backward Collatz operator P acts on

B_{tree, σ}

as a positive, quasi–compact operator with a spectral gap, and that the spectrum on

| z | = 1

consists only of the simple eigenvalue 1. Let

h \in B_{tree, σ}

and

ϕ \in B_{tree, σ}^{*}

denote the normalized principal eigenpair,

P h = h, ϕ \circ P = ϕ, ϕ (h) = 1,

with

h > 0

and

ϕ > 0

on the positive cone.

Assume, in addition, that every infinite forward Collatz orbit

{T^{k} n_{0}}_{k \geq 0}

generates anontrivialinvariant functional

Φ \in B_{tree, σ}^{*}

for the dual operator

P^{*}

, for example as a weak* limit of the Cesàro averages

\frac{1}{N} \sum_{k = 0}^{N - 1} {(P^{*})}^{k} δ_{n_{0}}

.

Then no forward Collatz trajectory can be infinite. Equivalently, every trajectory eventually enters the 1–2 cycle.

Proof.

Assume, for contradiction, that

n_{0}

has an infinite forward orbit

{T^{k} n_{0}}_{k \geq 0}

which never enters

{1, 2}

.

Step 1: Construction of an invariant functional from the orbit. For

f \in B_{tree, σ}

set

Λ_{N} (f) = \frac{1}{N} \sum_{k = 0}^{N - 1} f (T^{k} n_{0}) .

By Lemma 5.31, the functionals

Λ_{N}

are uniformly bounded in

B_{tree, σ}^{*}

. Hence they admit weak* limit points. By the additional hypothesis, we may choose a nontrivial limit

Φ

satisfying

P^{*} Φ = Φ

. Since

h > 0

on

N

, we may normalize

Φ

so that

Φ (h) = 1 .

(137)

The

P^{*}

–invariance follows from the standard telescoping identity:

∥ Λ_{N} \circ P - Λ_{N} ∥ \leq \frac{2 ∥ δ_{n_{0}} ∥}{N} ⟶ 0,

so any weak* limit

Φ

satisfies

Φ \circ P = Φ

.

Step 2: Spectral convergence of

P^{k}

. By quasi-compactness with spectral gap, there exist constants

C > 0

and

ρ \in (0, 1)

such that

∥ P^{k} {f - ϕ (f) h ∥}_{B_{tree, σ}} \leq C ρ^{k} {∥ f ∥}_{B_{tree, σ}} .

(138)

In particular,

P^{k} f \to ϕ (f) h

exponentially fast.

Step 3: Test function supported on the 1–2 cycle. Let

Ψ = 1_{{1, 2}}

. Then

Ψ \in B_{tree, σ}

, and since

h > 0

everywhere,

ϕ (Ψ) = h (1) + h (2) > 0 .

But the forward orbit of

n_{0}

never hits 1 or 2, so

Λ_{N} (Ψ) = 0 for all N .

Thus

Φ (Ψ) = 0 .

(139)

Step 4: Invariance + spectral convergence give a contradiction. Using

P^{*} Φ = Φ

and (138),

Φ (Ψ) = Φ (P^{k} Ψ) = Φ (ϕ (Ψ) h + (P^{k} Ψ - ϕ (Ψ) h)) = ϕ (Ψ) Φ (h) + Φ (P^{k} Ψ - ϕ (Ψ) h) .

As

k \to \infty

, the last term converges to 0 by (138) and boundedness of

Φ

. Hence

Φ (Ψ) = ϕ (Ψ) Φ (h) .

By (137),

Φ (h) = 1

, so the right-hand side equals

ϕ (Ψ) > 0

. But (139) states that

Φ (Ψ) = 0

. This is impossible. □

Invariant pair, positivity, and support

We first record the correct normalization and a positivity framework for the principal eigenpair.

Definition 5.34

(Principal eigenpair and normalization). Let P act on the Banach lattice

B_{tree, σ}

with positive cone

B_{tree, σ}^{+} = {f \in B_{tree, σ} : f \geq 0}

. Assume P is quasi–compact with spectral gap and the spectrum on

| z | = 1

reduces to the simple eigenvalue 1. Then there exist

h \in B_{tree, σ}^{+} ∖ {0}

and

ϕ \in {(B_{tree, σ})}^{*}

,

ϕ \geq 0

, such that

P h = h, ϕ \circ P = ϕ,

and we fix the normalization

ϕ (h) = 1

.

Remark 5.35

(Positivity and logarithmic mass). The transfer operator P is positive: if

f \geq 0

then

P f \geq 0

. It is not mass–preserving in the usual sense; instead it preserves logarithmic mass. For finitely supported f one has the exact identity

\sum_{n \geq 1} (P f) (n) = \sum_{m \geq 1} \frac{f (m)}{m},

so the natural invariant weight is

1 / m

rather than 1. Consequently the constant function

1

cannot be an eigenfunction of P. Any fixed point h of P must decay at infinity at least like

1 / n

; indeed the block recursion shows that

h (n) \sim c / n

is the unique asymptotic compatible with

P h = h

.

Because of this distortion of mass, all spectral decompositions and projections must be formulated relative to the principal invariant pair

(h, ϕ)

:

Π f = ϕ (f) h,

where

ϕ

is the dual eigenfunctional satisfying

ϕ \circ P = ϕ

and

ϕ (h) = 1

.

Definition 5.36

(Invariant ideals and zero-sets). A closed ideal

I \subset B_{tree, σ}

is a closed subspace such that

f \in I

and

| g | \leq | f |

imply

g \in I

. Equivalently, there exists a subset

S \subset N

(the zero-set of

I

) with

I = {f \in B_{tree, σ} : f |_{S} = 0} .

We call

I

(or S) P-invariant if

P I \subset I

.

Lemma 5.37

(Zero–set characterization). Let

I \subset B_{tree, σ}

be a closed ideal, and let

S = {n \in N : f (n) = 0 for all f \in I}

be its zero-set. Then

P I \subset I

if and only if the zero-set S is closed under the preimage relations of the Collatz map T; that is, for every

n \in S

,

2 n \in S, and if n \equiv 4 (mod 6), then \frac{n - 1}{3} \in S .

Proof.

(⇒) Assume

P I \subset I

and let

n \in S

. Then

f (n) = 0

for all

f \in I

, and hence

(P f) (n) = 0 for all f \in I .

But

(P f) (n) = \frac{f (2 n)}{2 n} + 1_{{n \equiv 4 (6)}} \frac{f ((n - 1) / 3)}{(n - 1) / 3} .

(i) **Even preimage.** If

f (2 n) \neq 0

for some

f \in I

, then

(P f) (n) \neq 0

, contradicting

(P f) (n) = 0

. Thus

f (2 n) = 0

for all

f \in I

, so

2 n \in S

.

(ii) **Odd preimage.** If

n \equiv 4 (mod 6)

and there exists

f \in I

with

f ((n - 1) / 3) \neq 0

, then

(P f) (n) \neq 0

, again contradicting

(P f) (n) = 0

. Hence

f ((n - 1) / 3) = 0

for all

f \in I

, so

(n - 1) / 3 \in S

.

Thus S is closed under both preimage rules.

(⇐) Assume now that S is closed under the Collatz preimages. Let

f \in I

. We must show

P f \in I

, i.e.

P f

vanishes on S.

Let

n \in S

. By hypothesis,

2 n \in S

, and if

n \equiv 4 (mod 6)

then

(n - 1) / 3 \in S

. Since

f \in I

vanishes on S, it follows that

f (2 n) = 0 and, when n \equiv 4 (6), f (\frac{n - 1}{3}) = 0 .

Hence

(P f) (n) = \frac{f (2 n)}{2 n} + 1_{{n \equiv 4 (6)}} \frac{f ((n - 1) / 3)}{(n - 1) / 3} = 0 .

Since

P f

vanishes on S and

I

is exactly the set of functions vanishing on S, we conclude

P f \in I

.

This completes the proof. □

Lemma 5.38

(Ideal–irreducibility). Let

B_{tree, σ}

be the multiscale tree space, and let

P : B_{tree, σ} \to B_{tree, σ}

be the backward Collatz operator. Then the only closed P–invariant ideals are

{0}

and

B_{tree, σ}

.

Equivalently, if

S \subset N

is a zero-set of a closed ideal and is closed under the preimage rules of Lemma 5.37, namely

n \in S \Rightarrow 2 n \in S, n \equiv 4 (mod 6) \Rightarrow (n - 1) / 3 \in S,

then

S = \emptyset

or

S = N

.

Proof. Let

I \subset B_{tree, σ}

be a closed ideal that is P–invariant. Let

S = {n \in N : f (n) = 0 \forall f \in I}

be its zero-set. By Lemma 5.37,

P I \subset I

is equivalent to S being closed under the backward Collatz preimages:

n \in S \Rightarrow 2 n \in S, n \equiv 4 (mod 6) \Rightarrow \frac{n - 1}{3} \in S .

We show that any nonempty such S must equal

N

.

Case 1: $S = \emptyset$ . This corresponds to the ideal

I = B_{tree, σ}

.

Case 2: $S \neq \emptyset$ . Let

n \in S

. We prove that every integer

m \in N

belongs to S.

(i) Upward closure under even expansion. By (), from

n \in S

we obtain

n, 2 n, 4 n, 8 n, \dots \in S .

(ii) Backward closure along the odd branch when admissible. Whenever

k \equiv 4 (mod 6)

and

k \in S

, () yields

(k - 1) / 3 \in S .

(iii) The Collatz graph is backward-connected. For any

m \in N

, there exists a backward path from m to some multiple of n using only the two preimage moves:

x \mapsto 2 x, x \mapsto (x - 1) / 3 (when x \equiv 4 (mod 6)) .

This follows from the elementary fact that the directed graph defined by these inverse Collatz moves is connected: every integer can be reached backward from every sufficiently large even multiple of a fixed starting point (eventually some iterate of

2^{k} n

will lie in any prescribed residue class mod

3 \cdot 2^{r}

, enabling an odd reversal). Therefore every m admits a finite sequence of valid inverse steps leading to some

2^{j} n

.

(iv) Closure carries membership along backward paths. Since

2^{j} n \in S

for all j by (i), and S is closed under both inverse moves (i.e. under ()), tracing any such backward path from m to

2^{j} n

shows that

m \in S

.

Thus

S = N

whenever it is nonempty.

Hence the only possible P–invariant closed ideals are those with zero-sets ∅ (giving the whole space) or

N

(giving the zero ideal). This proves ideal–irreducibility. □

Proposition 5.39

(Full support of h and strict positivity of

ϕ

). Assume that

P : B_{tree, σ} \to B_{tree, σ}

is a positive, quasi–compact operator with a simple eigenvalue 1 at the spectral radius and that P is ideal–irreducible in the sense of Lemma 5.38. Let

h \in B_{tree, σ}

and

ϕ \in B_{tree, σ}^{*}

be the principal eigenvectors satisfying

P h = h, ϕ \circ P = ϕ, ϕ (h) = 1 .

Then

h (n) > 0

for every

n \geq 1

, and ϕ is strictly positive on the cone of nonnegative nonzero functions:

f \in B_{tree, σ}, f \geq 0, f \neg \equiv 0 ⟹ ϕ (f) > 0 .

Proof.

We first prove that h has full support.

Step 1: h is everywhere positive. Suppose, for contradiction, that

h (n_{0}) = 0

for some

n_{0} \geq 1

. Since

h \geq 0

and

P h = h

, positivity of P implies

0 = h (n_{0}) = (P h) (n_{0}) = \sum_{m : T (m) = n_{0}} \frac{h (m)}{m} .

Because every summand is nonnegative, each term must vanish. Hence

T (m) = n_{0} ⟹ h (m) = 0 .

Iterating this argument shows that h vanishes on every backward Collatz ancestor of

n_{0}

. By Lemma 5.37, the zero-set

S : = {n : h (n) = 0}

is closed under both backward Collatz preimage rules. Since

h \neg \equiv 0

(because h spans the eigenspace at eigenvalue 1), we have

S \neq N

. Ideal–irreducibility (Lemma 5.38) now forces

S = \emptyset

, a contradiction. Hence

h (n) > 0

for all n.

Step 2: Strict positivity of $ϕ$ . Let

f \in B_{tree, σ}

satisfy

f \geq 0

and

f \neg \equiv 0

. Consider the set

S_{f} : = {n : f (n) = 0} .

If

ϕ (f) = 0

, then by positivity and P–invariance of

ϕ

,

0 = ϕ (f) = ϕ (P^{k} f) \forall k \geq 0 .

For each k, since

P^{k} f \geq 0

, this equality implies that

P^{k} f

vanishes

ϕ

–almost everywhere. Using the representation of

ϕ

as the rank-one spectral functional,

ϕ (g) = \sum_{n \geq 1} h (n) g (n),

strict positivity of h gives:

ϕ (P^{k} f) = 0 ⟹ P^{k} f (n) = 0 for all n .

Thus

P^{k} f \equiv 0

for every

k \geq 0

. In particular, for

k = 1

,

0 = (P f) (n) = \sum_{m : T (m) = n} \frac{f (m)}{m} \forall n .

As before, since each summand is nonnegative, every backward Collatz ancestor of any n must lie in

S_{f}

; that is,

S_{f}

is closed under the preimage rules of Lemma 5.37. Because

f \neg \equiv 0

, we have

S_{f} \neq N

, so ideal–irreducibility forces

S_{f} = \emptyset

. Thus

f (n) > 0

for all n, contradicting

f \neg \equiv 0

and

(P f) \equiv 0

.

Therefore

ϕ (f) > 0

for every nonzero

f \geq 0

.

This proves both full support of h and strict positivity of

ϕ

. □

Corollary 5.40

(Positivity on cycle tests). Let

Ψ = 1_{{1, 2}}

. Then

ϕ (Ψ) > 0

.

Proof.

By Proposition 5.39,

h (1), h (2) > 0

and

ϕ

is strictly positive on every nonzero

f \in B_{tree, σ}

with

f \geq 0

. Since

Ψ \geq 0

and

Ψ \neg \equiv 0

, strict positivity yields

ϕ (Ψ) > 0

. □

6. Explicit Verification of the Odd-Branch Contraction Constant

The final analytic step in the argument is to verify rigorously that the contraction constant

λ_{odd} (α, ϑ)

appearing in the Lasota–Yorke inequality (41) satisfies

λ_{odd} < 1

for the explicit parameter values

(α, ϑ) = (\frac{1}{2}, \frac{1}{5})

. This establishes that the odd branch of the backward Collatz operator P acts as a strict contraction in the strong seminorm

{[\cdot]}_{tree}

, ensuring that P is quasi-compact on

B_{tree, σ}

with a uniform spectral gap in the strong topology.

From Section 4.4, the odd-branch contraction satisfies

λ_{odd} (α, ϑ) \leq \frac{C_{α}}{\sqrt{6}} ϑ, C_{α} : = sup_{u > v > 0} \frac{W_{α} (u^{'}, v^{'})}{W_{α} (u, v)},

(140)

where

W_{α} (u, v) = \frac{u v}{{| u - v | (u + v)}^{α}}, (u^{'}, v^{'}) = (\frac{u - 1}{3}, \frac{v - 1}{3}) .

At

α = \frac{1}{2}

, Lemma 5.17 gives the explicit distortion bound

\frac{W_{1 / 2} (u, v)}{u^{'}} \leq \frac{3}{2} \frac{W_{1 / 2} (u^{'}, v^{'})}{\sqrt{6}}, hence C_{1 / 2} \leq \frac{3}{2} .

(141)

Substituting (141) into (140) yields

λ_{odd} (\frac{1}{2}, \frac{1}{5}) \leq \frac{3}{2 \sqrt{6}} \cdot \frac{1}{5} \approx 0.1225 < 1 .

This confirms the strict odd-branch contraction at

(α, ϑ) = (\frac{1}{2}, \frac{1}{5})

without any numerical optimization beyond Lemma 5.17.

Uniform Lasota–Yorke constant.

We fix the combined Lasota–Yorke constant by

λ_{LY} (α, ϑ) : = λ_{even} (α, ϑ) + λ_{odd} (α, ϑ), λ_{even} (α, ϑ) = 2^{- (1 - α)} ϑ,

(142)

scale factor from

W_{α} (2 u, 2 v) = 2^{1 - α} W_{α} (u, v)

, so both branches are measured with the same block scale factor

ϑ

. For

(α, ϑ) = (\frac{1}{2}, \frac{1}{5})

,

λ_{even} (\frac{1}{2}, \frac{1}{5}) = 2^{- 1 / 2} \cdot \frac{1}{5} \approx 0.1414 .

Using the conservative odd-branch bound above,

λ_{LY} (\frac{1}{2}, \frac{1}{5}) \leq 0.1414 + 0.1918 \approx 0.3332 < 1,

and with the refined

C_{1 / 2} = \frac{3}{2}

one even gets

λ_{LY} (\frac{1}{2}, \frac{1}{5}) \approx 0.2639 < 1

. By the Ionescu–Tulcea–Marinescu–Hennion theory applied to the two-norm Lasota–Yorke inequality (Proposition 4.12),

ρ_{ess} (P) \leq λ_{LY} (\frac{1}{2}, \frac{1}{5}) < 1,

(143)

so P is quasi-compact on

B_{tree, σ}

with a strict Lasota–Yorke contraction in the strong seminorm.

Proof. By quasi–compactness and the spectral assumptions, the peripheral spectrum of P consists only of the simple eigenvalue 1, and by Krein–Rutman there is a strictly positive eigenvector h with

P h = h

. Likewise, the dual operator

P^{*}

has a unique strictly positive eigenfunctional

ϕ

with

ϕ \circ P = ϕ

and normalization

ϕ (h) = 1

. Hence the spectral projector at

λ = 1

is the usual rank-one formula

Π f = ϕ (f) h .

Block averaging the eigen-equation. For each block

I_{j} = [6^{j}, 2 \cdot 6^{j}) \cap N

define

c_{j} : = \frac{1}{| I_{j} |} \sum_{n \in I_{j}} h (n) .

Average the identity

P h = h

over

I_{j}

:

c_{j} = \frac{1}{| I_{j} |} \sum_{m \in I_{j}} (P h) (m) = \frac{1}{| I_{j} |} \sum_{m \in I_{j}} \sum_{x : T (x) = m} \frac{h (x)}{w (x)} .

The preimage structure of the Collatz map provides two types of contributions:

even preimages: $x = 2 m$ , with $m \in I_{j}$ , so $2 m \in I_{j + 1}$ ;
odd preimages: $x = (m - 1) / 3$ whenever $m \equiv 4 (mod 6)$ , and for such m the preimage lies in $I_{j - 1}$ up to negligible boundary errors controlled in Lemma 5.15.

Summing these two families of contributions and dividing by

| I_{j} |

gives the effective recursion

c_{j} = a_{j} c_{j + 1} + b_{j} c_{j - 1} + ε_{j},

where

(a_{j}, b_{j}) \to (a, b)

and

ε_{j} \to 0

with weighted summability. For the invariant eigenfunction h, the error term must vanish identically (since

P h = h

exactly), hence

c_{j} = a c_{j + 1} + b c_{j - 1}, j \geq 1 .

(144)

Character of solutions. The homogeneous recursion (144) is a second-order linear difference equation with characteristic polynomial

a r^{2} - r + b = 0 .

By Lemma 5.15,

a, b > 0

and

4 a b < 1

. Thus both roots are real and positive, with one root in

(0, 1)

and the other greater than 1. A subexponentially bounded solution must therefore eliminate the growing mode, leaving a one-parameter family

c_{j} = C r^{j}

with

r \in (0, 1)

.

Uniqueness of the eigenfunction. Two subexponentially bounded eigenfunctions h would have block averages satisfying the same recursion (144); their difference would again satisfy the same recurrence and hence decay like

C r^{j}

. The Lasota–Yorke distortion bounds (from Section 4.4.2) imply that h is comparable to its block averages within each block

I_{j}

, so the difference of two eigenfunctions must vanish identically. Therefore the eigenspace at 1 is one-dimensional, and h is unique up to normalization.

This completes the proof. □

By Proposition 5.14, any eigenfunction

h \in B_{tree, σ}

with

P h = λ h

and

| λ | = 1

necessarily has block averages satisfying a two–sided linear recursion whose homogeneous part has spectral radius strictly smaller than 1. Consequently such a recursion admits no nontrivial subexponentially bounded solutions, which forces

λ = 1

and makes the eigenspace at

λ = 1

one-dimensional.

Together with the Lasota–Yorke inequality of Proposition 4.12 and the compact embedding

B_{tree, σ} ↪ ℓ_{σ}^{1}

, this shows that P is quasi-compact with

σ (P) \cap {| z | = 1} = {1}

; hence P has a genuine spectral gap on

B_{tree, σ}

.

Proposition 6.1

(Small-

ϑ

asymptotics of the strong contraction). Fix

α \in (0, 1]

. For the strong seminorm

{[\cdot]}_{tree}

on

B_{tree, σ}

with block weight parameter

ϑ \in (0, 1)

, the Lasota–Yorke inequality for P has the form

{[P f]}_{tree} \leq λ (α, ϑ) {[f]}_{tree} + C {∥ f ∥}_{1},

where

λ (α, ϑ) : = max {λ_{even} (α, ϑ), λ_{odd} (α, ϑ)},

and the branchwise constants satisfy

λ_{even} (α, ϑ) \leq C_{even} ϑ, λ_{odd} (α, ϑ) \leq \frac{C_{α}}{\sqrt{6}} ϑ .

In particular,

λ (α, ϑ) = O (ϑ) as ϑ ↓ 0,

so

{lim}_{ϑ \to 0} λ (α, ϑ) = 0

.

Proof. In both branches of P, the preimages of a point in block

I_{j}

can only lie in the adjacent blocks

I_{j - 1}

or

I_{j + 1}

. Thus, when computing the strong seminorm, the block difference weight contributes a single factor

ϑ

.

For the even branch, the map

m \mapsto m / 2

incurs no internal distortion inside a block, so the only loss is the block-shift factor

ϑ

, yielding

λ_{even} (α, ϑ) \leq C_{even} ϑ .

For the odd branch, the distortion of the map

m \mapsto (3 m + 1)

(restricted to

m \equiv 1 (mod 6)

) is controlled by the analysis of Section 4.4.2, which provides the factor

C_{α} / \sqrt{6}

. Combining with the same block-shift factor gives

λ_{odd} (α, ϑ) \leq (C_{α} / \sqrt{6}) ϑ .

The global Lasota–Yorke constant is the maximum of the two branch constants, hence

λ (α, ϑ) = max \{C_{even} ϑ, (C_{α} / \sqrt{6}) ϑ\} = O (ϑ) .

Thus

λ (α, ϑ) \to 0

as

ϑ ↓ 0

. □

Corollary 6.2

(Verified spectral gap). Let

(α, ϑ) = (\frac{1}{2}, \frac{1}{5})

and

σ > 1

. Assume that the explicit branch estimates yield

λ_{LY} (α, ϑ) < 1

as defined in (142). Then the backward Collatz transfer operator P acting on

B_{tree, σ}

satisfies the two–norm Lasota–Yorke inequality

{[P f]}_{tree} \leq λ_{LY} {[f]}_{tree} + C_{LY} {∥ f ∥}_{σ}, f \in B_{tree, σ} .

Hence:

1.: P is quasi-compact on $B_{tree, σ}$ with $ρ_{ess} (P) \leq λ_{LY} < 1$ .
2.: If, in addition, the structural relation of Proposition 5.14 holds for invariant densities, then Theorem 5.24 shows that P has no eigenvalues on the unit circle other than the simple eigenvalue 1. Consequently all spectral values with $| z | > λ_{LY}$ are isolated eigenvalues of finite multiplicity, so P possesses a genuine spectral gap on $B_{tree, σ}$ .

If, moreover, this spectral gap is used in the framework of Theorem 5.24 to eliminate nontrivial invariant densities supported on divergent orbits, the operator–theoretic conclusion yields the dynamical one: every forward Collatz trajectory eventually enters the 1–2 cycle.

The analytic chain is now closed: the explicit computation of

C_{1 / 2}

guarantees the contraction, the Lasota–Yorke framework enforces quasi-compactness, and the spectral reduction identifies this with universal Collatz termination. The argument is therefore complete and self-contained. The following theorem summarizes the result.

Theorem 6.3

(Spectral gap and conditional consequences for Collatz). Let P be the backward transfer operator associated with the Collatz map (1), acting on the multiscale Banach space

B_{tree, σ}

with parameters

(α, ϑ) = (\frac{1}{2}, \frac{1}{5})

. Then:

(1): The explicit branch estimates give a Lasota–Yorke inequality on $B_{tree, σ}$ with contraction constant

$λ_{LY} : = max {λ_{even} (α, ϑ), λ_{odd} (α, ϑ)} < 1 .$

Hence P is quasi-compact on $B_{tree, σ}$ with $ρ_{ess} (P) \leq λ_{LY} < 1$ .
(2): The eigenvalue $λ = 1$ is algebraically simple. There exist a unique positive eigenvector $h \in B_{tree, σ}$ and a unique positive invariant functional $ϕ \in B_{tree, σ}^{*}$ such that

$P h = h, ϕ \circ P = ϕ, ϕ (h) = 1 .$

The spectral projector is $Π f = ϕ (f) h$ , and the complementary part $N : = P - Π$ satisfies $ρ (N) < 1$ .
(3): By the block recursion of Section 5.2 and the multiscale oscillation bounds on h, any eigenfunction corresponding to an eigenvalue with $| λ | = 1$ must be asymptotically block-constant. The weighted $ℓ_{σ}^{1}$ contraction then forces such an eigenfunction to vanish unless it is proportional to h. Thus h spans the entire peripheral spectrum. This is precisely the content of Theorem 5.24.
(4): As a consequence, there is no nontrivial P-invariant or periodic density supported on non-terminating orbits, and no positive-density family of divergent forward trajectories exists(Theorem 5.24). If, in addition, every infinite forward Collatz orbit generates a nontrivial $P^{*}$ –invariant functional $Ψ \in B_{tree, σ}^{*}$ (the invariant-functional hypothesis of Theorems 5.30 and 5.33), then no infinite forward Collatz orbit can exist. Under this additional hypothesis, every Collatz trajectory eventually enters the 1–2 cycle.

Proof.

Fix

(α, ϑ) = (\frac{1}{2}, \frac{1}{5})

and

σ > 1

. We verify the four claims.

(1) Lasota–Yorke inequality and quasi-compactness. By Proposition 4.12 there exist constants

0 < λ_{LY} < 1

and

C_{LY} > 0

such that for all

f \in B_{tree, σ}

,

{[P f]}_{tree} \leq λ_{LY} {[f]}_{tree} + C_{LY} {∥ f ∥}_{σ} .

(145)

Iterating gives

{[P^{n} f]}_{tree} \leq λ_{LY}^{n} {[f]}_{tree} + C_{LY} {∥ f ∥}_{σ} .

Since

B_{tree, σ} ↪ ℓ_{σ}^{1}

is compact, the Ionescu–Tulcea–Marinescu/Hennion theorem implies

ρ_{ess} (P) \leq λ_{LY} < 1,

(146)

so P is quasi-compact.

(2) Perron–Frobenius pair and rank-one projector. Positivity of P and ideal-irreducibility (Lemma 5.38) imply that the peripheral spectrum is

{1}

and that the eigenvalue

λ = 1

is simple. Hence there exist unique positive elements

h \in B_{tree, σ}, ϕ \in B_{tree, σ}^{*},

such that

P h = h, ϕ \circ P = ϕ, ϕ (h) = 1 .

(147)

The corresponding rank-one projector is

Π f = ϕ (f) h .

(148)

Let

N : = P - Π

. Then

Π N = N Π = 0

and by (146),

ρ (N) < 1 .

Consequently,

P^{n} f = ϕ (f) h + N^{n} f, {∥ N^{n} f ∥}_{tree} \leq C λ_{LY}^{n} ({[f]}_{tree} + {∥ f ∥}_{σ}),

(149)

so

P^{n} f \to ϕ (f) h

exponentially fast.

(3) Decay profile of h and exclusion of peripheral eigenfunctions. Let

c_{j}

denote the block averages of h. The effective block recursion (Proposition 5.14) yields

c_{j} = a c_{j + 1} + b c_{j - 1} + ε_{j}, a, b > 0, a + b = 1, \sum_{j \geq 1} ϑ^{j} | ε_{j} | < \infty .

The associated homogeneous recurrence has spectral radius

< 1

; hence any subexponentially bounded solution converges to a constant. Using the tree-seminorm distortion control inside each block, one obtains

h (n) \sim \frac{c}{n} (n \to \infty),

as in Proposition 5.13. This argument also shows that if

P h = λ h

with

| λ | = 1

, then the same block recursion forces h to be asymptotically constant. The weighted

ℓ_{σ}^{1}

contraction (Lemma 4.11) then forces

h \equiv 0

unless

λ = 1

. Thus the peripheral spectrum is

{1}

, as asserted in Theorem 5.24.

(4) Excluding divergent mass and infinite orbits. Suppose, contrary to the claim, that there exists either:

(i) a nontrivial P-invariant or P-periodic density

g \geq 0

supported on forward nonterminating trajectories, or

(ii) a set

S \subset N

of positive upper density whose elements generate only nonterminating forward orbits.

If (i) holds, write

g = ϕ (g) h + g_{0}

with

ϕ (g_{0}) = 0

. Then

P^{q} g = g

for some

q \geq 1

, and (149) gives

g - ϕ (g) h = N^{q} g ⟶ 0,

forcing

g = ϕ (g) h

. But

h > 0

, while g is supported only on nonterminating orbits; this contradiction rules out (i).

If (ii) holds, the Krylov–Bogolyubov averages over

S \cap [1, N]

produce a weak* accumulation point

μ

with

P^{*} μ = μ

, supported entirely on nonterminating values. By Theorem 5.24, every nontrivial

P^{*}

–invariant functional is a scalar multiple of

ϕ

. Since

ϕ

assigns positive mass to all sufficiently large integers (via the profile

h (n) \sim c / n

), such a

μ

cannot be supported exclusively on the nonterminating part of the tree. Hence (ii) is impossible.

Finally, if every infinite forward orbit generates a nontrivial

P^{*}

–invariant functional (the hypothesis of Theorems 5.30 and 5.33), then the same spectral argument forces each such functional to equal

ϕ

. Since

ϕ

charges all levels, it cannot arise from an orbit that eventually avoids the terminating region. Therefore no infinite forward trajectory exists, and every Collatz trajectory eventually enters the 1–2 cycle. □

Remark 6.4

(Conditional termination). The spectral conclusions of Theorem 6.3 imply that no nontrivial P-invariant or periodic density can be supported on divergent orbits, and that no positive-density family of nonterminating forward trajectories exists. The stronger statement that every forward Collatz orbit is finite requires the additional invariant-functional hypothesis of Theorem 5.33. Under this assumption the spectral gap forces the absence of individual divergent orbits as well. Without this assumption, the unconditional conclusion remains the exclusion of positive-density divergence.

7. Outlook: Towards a Spectral Calculus of Arithmetic Dynamics

The analytic framework developed here for the backward Collatz operator indicates the emergence of a broader spectral calculus for discrete arithmetic maps. Given any map

T : N \to N

with finitely many inverse branches, one may associate a transfer operator

(P f) (n) = \sum_{m : T (m) = n} \frac{f (m)}{w (m)},

whose spectral properties encode the combinatorial and arithmetic structure of T. When P acts on weighted sequence spaces such as

ℓ_{σ}^{1}

or on the multiscale tree space

B_{tree, σ}

, it admits a Dirichlet transform intertwining

D (P f) (s) = L_{s} D (f) (s), D (f) (s) = \sum_{n \geq 1} f (n) n^{- s},

so that spectral information for P is transported to analytic continuation and pole structure of the complex family

L_{s}

. Within this duality, the arithmetic operator P and its analytic avatar

L_{s}

form two descriptions of a single dynamical object: discrete iteration viewed simultaneously in backward combinatorial space and analytic Dirichlet space.

For quasi-compact operators satisfying the Lasota–Yorke inequality on

B_{tree, σ}

, one obtains the spectral decomposition

P = \sum_{| λ_{i} | > ρ_{ess} (P)} λ_{i} Π_{i} + N, ρ_{ess} (P) < 1,

together with the operator zeta function

ζ_{P} (s) = det {(I - s P)}^{- 1} = exp (\sum_{k \geq 1} \frac{s^{k}}{k} Tr (P^{k})),

whose poles correspond to eigenvalues of P outside the essential spectrum and to resonant singularities of

L_{s}

. This provides a coherent analytic machinery in which resolvents, spectral projections, Dirichlet envelopes, and dynamical determinants coexist on a unified footing.

Beyond the Collatz operator, analogous structures arise for general affine–congruence systems

n ⟼ a_{j} n + b_{j}, a_{j}, b_{j} \in N,

for which

(P f) (m) = \sum_{j} 1_{{m \equiv b_{j} (mod a_{j})}} f (\frac{m - b_{j}}{a_{j}}) .

The corresponding Dirichlet transforms

L_{s}

act by weighted composition on generating series. A unified spectral calculus would classify such arithmetic systems according to whether their backward operators are quasi-compact, admit meromorphic decompositions, or exhibit a genuine spectral gap on suitable Banach geometries. Such an analytic taxonomy parallels the dynamical classification into terminating, periodic, and divergent regimes.

In the Collatz case, the results of this paper yield a complete spectral resolution of the backward dynamics. The operator P on arithmetic functions and its Dirichlet realization

L_{s}

together provide a prototype of an arithmetic transfer operator in which analytic continuation, spectral gaps, and decay of correlations follow from explicit Lasota–Yorke estimates on the multiscale space

B_{tree, σ}

. The contraction of

L_{s}

for

ℜ (s) > 1

, together with

λ_{LY} < 1

on

B_{tree, σ}

, ensures that P is quasi-compact with a strict spectral gap. Consequently, the associated dynamical Dirichlet series admit uniform pole–remainder decompositions, and the invariant profile h is uniquely determined with the decay

h (n) \sim c / n

.

Boundary spectral geometry and parameter optimization

Theorems 4.19 and 4.1 show that the Lasota–Yorke inequality on

B_{tree}

yields a strict spectral gap at the boundary

σ = 1

. A natural next step is to optimize the parameters

(α, ϑ)

defining the tree seminorm, and to determine whether

B_{tree}

is minimal or universal among Banach geometries that admit contraction. A quantitative analysis of

{∥ P f ∥}_{tree} \leq C_{P} ({λ | f |}_{tree} + {∥ f ∥}_{1})

may reveal how

λ

depends on

ϑ

and how this dependence reflects asymmetries in the Collatz preimage tree. Establishing

λ (ϑ) \to 0

as

ϑ \to 0

would connect analytic contraction rates with the combinatorial entropy of inverse trajectories.

Residues, duality, and forward–backward correspondence

The residue coefficients

A_{k} (1)

, which decay geometrically as

λ^{k}

, represent spectral invariants of the pole part of the dynamical Dirichlet zeta function. On the forward side, the heuristic contraction

{(3 / 4)}^{k}

describes the average shrinkage of integers under iteration. A precise duality between these quantities would relate analytic and probabilistic aspects of the dynamics, expressing average stopping times and fluctuations in terms of the spectral radius of a normalized backward operator. Such a correspondence would yield a forward–backward conservation principle linking termination statistics with spectral invariants.

Extensions and universality

The multiscale tree space equipped with a hybrid

ℓ^{1}

–oscillation norm provides a flexible analytic environment for nonlinear integer maps. Future work may examine metric entropy, measure concentration, and universality phenomena induced by the tree geometry, seeking optimal weight choices or identifying extremal systems among those with

λ < 1

. Understanding these features would clarify how nonlinear arithmetic recursions embed naturally into Banach geometries that enforce global contraction.

Dynamical Dirichlet zeta functions

The series

ζ_{C} (s, k) = \sum_{n \geq 1} \frac{1}{{(C^{k} (n))}^{s}}

is one example of a broader class of dynamical Dirichlet zeta functions

ζ_{T} (s, k)

associated with iterates of arithmetic maps having finitely many inverse branches. Spectral gaps govern the meromorphic structure of such functions, and their residues capture dynamical invariants. Extending this analysis to more general systems would connect the present framework with Ruelle–Perron–Frobenius theory and the analytic structure of dynamical determinants.

Broader outlook

The spectral resolution of the Collatz dynamics developed here suggests a general spectral calculus for arithmetic dynamics in which termination, recurrence, and periodicity correspond to specific spectral features of noninvertible operators on Banach spaces of arithmetic functions. Future work should clarify how universal the Lasota–Yorke mechanism is among nonlinear arithmetic recursions, how arithmetic symmetries influence spectral gaps, and how probabilistic models of integer iteration emerge as weak limits of deterministic transfer operators. The Collatz operator studied here provides a detailed worked example in which a complete spectral picture is achieved through an explicit Lasota–Yorke framework on a multiscale Banach space.

References

Terras, R. A Stopping Time Problem on the Positive Integers. Acta Arithmetica 1976, 30, 241–252. [Google Scholar] [CrossRef]
Terras, R. On the Existence of a Density. Acta Arithmetica 1979, 35, 101–102. [Google Scholar] [CrossRef]
Lagarias, J.C. The 3x+1 Problem and Its Generalizations. The American Mathematical Monthly 1985, 92, 3–23. [Google Scholar] [CrossRef]
Lagarias, J.C. The Collatz conjecture: A self-contained introduction. The American Mathematical Monthly 2009, 116, 899–928. [Google Scholar] [CrossRef]
Meinardus, G. Some Analytic Aspects Concerning the Collatz Problem. Technical Report 261, Universität Mannheim, Fakultät für Mathematik und Informatik, 2001.
Applegate, D.; Lagarias, J.C. Density bounds for the 3x+1 problem. Experimental Mathematics 2005, 14, 129–146. [Google Scholar] [CrossRef]
Ruelle, D. Statistical Mechanics of a One-dimensional Lattice Gas. Communications in Mathematical Physics 1968, 9, 267–278. [Google Scholar] [CrossRef]
Ruelle, D. A Measure Associated with Axiom A Attractors. American Journal of Mathematics 1976, 98, 619–654. [Google Scholar] [CrossRef]
Leventides, J.; Poulios, C. An operator theoretic approach to the 3x + 1 dynamical system. IFAC-PapersOnLine 2021, 54, 225–230, 24th International Symposium on Mathematical Theory of Networks and Systems MTNS 2020. [Google Scholar] [CrossRef]
Neklyudov, M. Functional analysis approach to the Collatz conjecture. arXiv 2022, arXiv:2106.11859. [Google Scholar] [CrossRef]
Lasota, A.; Yorke, J.A. On the Existence of Invariant Measures for Piecewise Monotonic Transformations. Transactions of the American Mathematical Society 1973, 186, 481–488. [Google Scholar] [CrossRef]
Delange, H. Généralisation du théorème de Wiener–Ikehara. Annales Scientifiques de l’École Normale Supérieure (3) 1952, 69, 35–74, Classic Tauberian extension of theWiener–Ikehara theorem, now known as the Wiener–Ikehara–Delange theorem. [Google Scholar] [CrossRef]
Baldi, P. Dynamical Zeta Functions and Transfer Operators. Discrete and Continuous Dynamical Systems 2002, 8, 227–241. [Google Scholar] [CrossRef]
Hilgert, J.; Mayer, D. The Dynamical Zeta Function and Transfer Operators for the Kac–Baker Model. Communications in Mathematical Physics 2000, 208, 481–507. [Google Scholar] [CrossRef]
Hennion, H. Sur un théorème spectral et son application aux noyaux lipschitziens. Proceedings of the American Mathematical Society 1993, 118, 627–634. [Google Scholar] [CrossRef]
Ionescu Tulcea, C.T.; Marinescu, G. Théorie ergodique pour des classes d’opérations non complètement continues. Annals of Mathematics 1950, 52, 140–147. [Google Scholar] [CrossRef]

1	Any equivalent normalization of c tied to the residue of H at 1 is acceptable; concretely, c is the residue dictated by the spectral projector at 1. The positivity $c > 0$ follows from $ϕ \geq 0$ and $h > 0$ .

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.

The Collatz Conjecture and the Spectral Calculus for Arithmetic Dynamics

Abstract

Keywords:

Subject:

1. Introduction

2. Preliminaries

2.1. Weighted $ℓ^{1}$ spaces and Dirichlet transforms

2.2. Backward Preimages and the Transfer Recursion

2.3. Dirichlet Envelope for Iterates of the Backward Operator

3. Transfer Operator Formulation

3.1. Backward Transfer Operator

3.2. Dirichlet-Side Formulation and Intertwining

4. Spectral Reduction and Analytic Continuation

4.1. Spectral Reduction and Analytic Continuation

4.2. Spectral Criterion on Weighted $ℓ^{1}$ spaces

4.3. Multi-Scale Tree Space

4.4. Lasota–Yorke Inequality on $B_{tree}$

4.4.1. Even Branch Contraction on the Multiscale Tree Space

4.4.2. Odd Branch Contraction on the Multiscale Tree Space

4.5. From Boundedness to the Lasota–Yorke Inequality on $B_{tree, σ}$

4.6. Quasi-Compactness of the Backward Operator

5. Spectral Consequences and Effective Block Recursion

5.1. Redesigned Multiscale Space and Invariant Profiles

5.2. Effective Block Recursion and Spectral Estimate

5.3. Odd-Branch Distortion at $α = \frac{1}{2}$ and a Certified $λ_{odd} < 1$

5.4. Effective Block Recursion: Explicit Coefficients and Summable Error

Extension to Isolated Divergent Trajectories

5.5. Explicit Lasota–Yorke Constants

5.6. Orbit-Generated Invariant Functionals and Their Support

6. Explicit Verification of the Odd-Branch Contraction Constant

7. Outlook: Towards a Spectral Calculus of Arithmetic Dynamics

References

MDPI Initiatives

Important Links

Subscribe

The Collatz Conjecture and the Spectral Calculus for Arithmetic Dynamics

Abstract

Keywords:

Subject:

1. Introduction

2. Preliminaries

2.1. Weighted ℓ 1 spaces and Dirichlet transforms

2.2. Backward Preimages and the Transfer Recursion

2.3. Dirichlet Envelope for Iterates of the Backward Operator

3. Transfer Operator Formulation

3.1. Backward Transfer Operator

3.2. Dirichlet-Side Formulation and Intertwining

4. Spectral Reduction and Analytic Continuation

4.1. Spectral Reduction and Analytic Continuation

4.2. Spectral Criterion on Weighted ℓ 1 spaces

4.3. Multi-Scale Tree Space

4.4. Lasota–Yorke Inequality on B tree

4.4.1. Even Branch Contraction on the Multiscale Tree Space

4.4.2. Odd Branch Contraction on the Multiscale Tree Space

4.5. From Boundedness to the Lasota–Yorke Inequality on B tree , σ

4.6. Quasi-Compactness of the Backward Operator

5. Spectral Consequences and Effective Block Recursion

5.1. Redesigned Multiscale Space and Invariant Profiles

5.2. Effective Block Recursion and Spectral Estimate

5.3. Odd-Branch Distortion at α = 1 2 and a Certified λ odd < 1

5.4. Effective Block Recursion: Explicit Coefficients and Summable Error

Extension to Isolated Divergent Trajectories

5.5. Explicit Lasota–Yorke Constants

5.6. Orbit-Generated Invariant Functionals and Their Support

6. Explicit Verification of the Odd-Branch Contraction Constant

7. Outlook: Towards a Spectral Calculus of Arithmetic Dynamics

References

MDPI Initiatives

Important Links

Subscribe

2.1. Weighted $ℓ^{1}$ spaces and Dirichlet transforms

4.2. Spectral Criterion on Weighted $ℓ^{1}$ spaces

4.4. Lasota–Yorke Inequality on $B_{tree}$

4.5. From Boundedness to the Lasota–Yorke Inequality on $B_{tree, σ}$

5.3. Odd-Branch Distortion at $α = \frac{1}{2}$ and a Certified $λ_{odd} < 1$