Preprint
Article

This version is not peer-reviewed.

The Collatz Conjecture and the Spectral Calculus for Arithmetic Dynamics

Submitted:

25 November 2025

Posted:

25 November 2025

Read the latest preprint version here

Abstract
We develop an operator--theoretic framework for the Collatz map based on its backward transfer operator acting on weighted Banach spaces of arithmetic functions. The associated Dirichlet transforms form a holomorphic family that captures the complex--analytic evolution of iterates and admits a decomposition into a zeta--type pole at $s=1$ and a holomorphic remainder. Within a finer multiscale space adapted to the Collatz preimage tree, we establish a Lasota--Yorke inequality with an explicit contraction constant $\lambda<1$, giving quasi--compactness and a spectral gap at the dominant eigenvalue. The resulting invariant density is strictly positive and exhibits a $c/n$ decay profile. We formulate a general criterion showing that, under a verified quasi--compactness hypothesis with isolated eigenvalue $1$, the forward dynamics admit no infinite trajectories. The framework provides a coherent spectral perspective on the Collatz operator and suggests a broader analytic approach to arithmetic dynamical systems.
Keywords: 
;  ;  ;  ;  ;  ;  

1. Introduction

The Collatz conjecture asserts that every positive integer n eventually reaches the 1–2 cycle under repeated application of
T ( n ) = n / 2 , n even , 3 n + 1 , n odd .
Equivalently, every forward orbit O + ( n ) = { T k ( n ) : k 0 } is conjectured to terminate in { 1 , 2 } . Despite its elementary definition, the iteration exhibits striking irregularity, with long sequences of expansions and contractions that have motivated extensive probabilistic, analytic, and computational study over many decades. Classical work of Terras [1,2] established early density results and stopping-time estimates, while the surveys of Lagarias [3,4] synthesized a wide range of heuristic and structural approaches. Subsequent analytic contributions, including those of Meinardus [5] and Applegate–Lagarias [6], have developed refined density bounds and asymptotic estimates for the distribution of orbits. Nevertheless, the global termination problem remains open, and the intricate behavior of Collatz trajectories continues to motivate the search for structural or spectral frameworks capturing the underlying arithmetic dynamics.
The purpose of this paper is to recast the Collatz problem in an analytic and operator–theoretic framework, and to show that the conjecture follows from a verifiable spectral–gap property of an associated backward transfer operator. Instead of studying T directly, we analyze its inverse dynamics through the operator
( P f ) ( n ) : = m : T ( m ) = n f ( m ) m ,
acting on arithmetic functions f : N C . Transfer–operator methods of this type originate in statistical mechanics and dynamical systems [7,8], and have more recently been applied to 3 x + 1 –type maps in various analytic and functional–analytic contexts [9,10]. For the Collatz map (1), each n has an even preimage 2 n and an additional odd preimage ( n 1 ) / 3 whenever n 4 ( mod 6 ) , giving
( P f ) ( n ) = f ( 2 n ) 2 n + 1 { n 4 ( mod 6 ) } f ( ( n 1 ) / 3 ) ( n 1 ) / 3 .
The weights 1 / m normalize the operator so that P acts as a mass–preserving average on non-negative 1 sequences, reflecting the logarithmic contraction inherent in the preimage structure of T.
Remark 1.1 
(Invariant density and logarithmic mass balance). Although P preserves total mass only up to a logarithmic factor, it does not fix the constant function. Indeed,
( P 1 ) ( n ) = 1 2 n + 1 { n 4 ( mod 6 ) } 3 n 1 C n ( n ) ,
so ( P 1 ) 1 . More generally,
n 1 ( P f ) ( n ) = m 1 f ( m ) m ,
which shows that P is logarithmically mass–preserving: the pushforward of mass is reweighted by the harmonic kernel m 1 / m .
This logarithmic balance forces any P–invariant density h to satisfy P h = h with a decay of order 1 / n as n . In particular, the explicit block recursion developed in Section 5.2, together with the oscillation control provided by the Lasota–Yorke inequality [11], yields the precise asymptotic profile
h ( n ) c n , n ,
consistent with Tauberian heuristics of Delange type [12]. All spectral decompositions in the sequel are expressed relative to this nonconstant 1 / n –type invariant profile.
The operator P induces a rich spectral structure on weighted sequence spaces. On σ 1 , defined by f σ = n 1 | f ( n ) | n σ , the Dirichlet transform
D f ( s ) = n 1 f ( n ) n s ,
intertwines P with analytic continuation in the half-plane ( s ) > σ . Uniform σ 1 bounds on P k translate into exponential envelopes for D ( P k f ) ( s ) and yield meromorphic continuations of the corresponding Collatz–Dirichlet series, whose pole at s = 1 reflects the average branching behavior [13,14]. The spectral radius of P on σ 1 captures the global weighted expansion rate of inverse branches and determines the analytic location of dominant singularities.
To resolve finer dynamical properties, we refine this setting to a multiscale Banach space B tree , σ built from dyadic–triadic block averages and oscillation seminorms that encode the hierarchical structure of the Collatz preimage tree. On this space, P satisfies a two-norm Lasota–Yorke inequality,
[ P f ] tree , σ λ LY [ f ] tree , σ + C f σ , 0 < λ LY < 1 ,
placing the dynamics within the classical Ionescu–Tulcea–Marinescu and Hennion spectral frameworks for quasi–compact operators [15,16]. The precise Lasota–Yorke bounds, including the explicit contraction of the odd branch, are developed in Section 4, Section 5 and Section 6.
The main theorem of the paper establishes that when the odd-branch contraction constant λ odd ( α , ϑ ) satisfies λ odd < 1 for specific parameters ( α , ϑ ) = ( 1 2 , 1 5 ) , the backward Collatz operator P possesses a strict spectral gap on B tree , σ . The spectral decomposition then implies that every invariant measure of P is supported on the 1–2 cycle, ruling out any positive-density family of divergent or periodic orbits. A strengthened criterion shows that a non-trivial invariant functional in B tree , σ * would contradict the spectral gap, hence all Collatz trajectories must terminate.
The remainder of the paper is organized as follows. Section 2 establishes notation and basic properties of the weighted σ 1 spaces together with the associated Dirichlet transforms. Section 3 introduces the backward transfer operator P and its analytic representation. Section 4 constructs the multiscale space B tree , σ adapted to the Collatz preimage tree and proves the corresponding Lasota–Yorke inequalities. Section 6 verifies that the odd branch admits an explicit contraction constant λ odd < 1 for the chosen parameters, yielding quasi–compactness and a spectral gap. Finally, Section 7 develops the resulting spectral consequences, formulating a general criterion that links quasi–compactness with the absence of infinite forward trajectories, and situating the Collatz operator within a broader analytical framework for arithmetic dynamical systems.

2. Preliminaries

The analysis begins with a careful description of the function spaces, Dirichlet transforms, and basic structural features of the Collatz map that underlie the spectral study of the backward operator P. Throughout we work with complex-valued arithmetic functions f : N C . We start with a simple unbounded estimate.
Lemma 2.1 
(Coarse k-step envelopes). Let T : N N denote the Collatz map (1). For every n N and k N 0 ,
n 2 k T k ( n ) 3 k n + 3 k 1 2 .
Proof. 
For every m 1 , the definition of T gives
m 2 T ( m ) 3 m + 1 .
Iterating the lower bound yields T k ( n ) n / 2 k . For the upper bound, the recurrence
T k + 1 ( n ) 3 T k ( n ) + 1
immediately gives, by a simple induction on k, the explicit estimate T k ( n ) 3 k n + ( 3 k 1 ) / 2 . This proves (6). □
These envelopes are intentionally crude, yet they ensure that forward iterates of typical arithmetic weights remain controlled on the scales relevant for our Dirichlet and transfer-operator analysis.

2.1. Weighted 1 spaces and Dirichlet transforms

For σ > 0 we define the weighted 1 space
σ 1 : = f : N C : f σ : = n 1 | f ( n ) | n σ < .
The weight exponent σ measures polynomial decay and is chosen so that Dirichlet series associated with f converge absolutely in a half-plane ( s ) > σ .
Given f σ 1 , we define its Dirichlet transform
D f ( s ) : = n 1 f ( n ) n s , ( s ) > σ .
Lemma 2.2 
(Dirichlet convergence). Let σ > 0 and let f σ 1 , so that
f σ : = n 1 | f ( n ) | n σ < .
Then the Dirichlet transform
D f ( s ) : = n 1 f ( n ) n s
converges absolutely for ( s ) > σ and defines a bounded holomorphic function on every half-plane ( s ) σ + ε , ε > 0 . Moreover,
| D f ( s ) | f σ sup n 1 n σ ( s ) = f σ ( ( s ) > σ ) .
Proof. 
Let s C with ( s ) > σ . Then
n 1 f ( n ) n s = n 1 | f ( n ) | n ( s ) = n 1 | f ( n ) | n σ n σ ( s ) .
Since ( s ) > σ implies σ ( s ) < 0 , the sequence n σ ( s ) is decreasing to 0, and hence
sup n 1 n σ ( s ) = 1 .
Therefore,
n 1 f ( n ) n s f σ < ,
so the Dirichlet series converges absolutely.
For every ε > 0 , the same bound holds uniformly on the half-plane ( s ) σ + ε , since then σ ( s ) ε and n σ ( s ) n ε 0 as n . Thus the convergence is locally uniform in ( s ) σ + ε , and classical Dirichlet-series theory implies that D f is holomorphic on this region.
The bound (9) follows directly from the estimate above. □
We write 1 = 0 1 for the unweighted space with norm f 1 = n 1 | f ( n ) | .

2.2. Backward Preimages and the Transfer Recursion

For each n 1 , define the even and odd preimage sets
E ( n ) : = { m N : T ( m ) = n , m even } , O ( n ) : = { m N : T ( m ) = n , m odd } .
Lemma 2.3 
(Preimage structure). For every n N ,
E ( n ) = { 2 n } , O ( n ) = { ( n 1 ) / 3 } , n 4 ( mod 6 ) , , otherwise ,
and in the first case ( n 1 ) / 3 is odd. In particular, each n has either one preimage (even) or two preimages (one even and one odd), and the odd preimage occurs with natural density 1 / 6 .
Proof. 
If m is even and T ( m ) = n , then m / 2 = n , so m = 2 n , establishing E ( n ) = { 2 n } .
If m is odd and T ( m ) = n , then 3 m + 1 = n , so m = ( n 1 ) / 3 . This is an integer precisely when n 1 ( mod 3 ) . For m to be odd, n 1 must be divisible by 3 but not by 6, so n 4 ( mod 6 ) . In that case ( n 1 ) / 3 is odd. The density statement follows since the congruence class n 4 ( mod 6 ) has natural density 1 / 6 . □
Hence each n admits exactly one even preimage and possibly one odd preimage when n 4 ( mod 6 ) . The corresponding backward transfer operator is defined as
( P f ) ( n ) : = m : T ( m ) = n f ( m ) m = f ( 2 n ) 2 n + 1 { n 4 ( 6 ) } f n 1 3 ( n 1 ) / 3 .
The normalization by 1 / m reflects the logarithmic contraction of the forward map and ensures a natural mass-balance property.
Lemma 2.4 
(Weighted mass preservation). Let f : N [ 0 , ) satisfy
m 1 f ( m ) m < .
Then the backward transfer operator
( P f ) ( n ) : = m : T ( m ) = n f ( m ) m
preserves the weighted mass in the sense that
n 1 ( P f ) ( n ) = m 1 f ( m ) m .
Proof. 
Since f 0 and m 1 f ( m ) / m < , Tonelli’s theorem justifies rearranging the nonnegative double series. Using the definition of P,
n 1 ( P f ) ( n ) = n 1 m : T ( m ) = n f ( m ) m .
Each m 1 has exactly one image T ( m ) , so it appears in exactly one of the inner sums. Hence we can rewrite the double sum directly over m:
n 1 m : T ( m ) = n f ( m ) m = m 1 f ( m ) m ,
which is precisely (12). □

2.3. Dirichlet Envelope for Iterates of the Backward Operator

The preimage structure allows a crude but useful bound on P acting on σ 1 .
Proposition 2.5 
(Backward operator bound). Let σ > 0 and let P be defined by (11). Then P : σ 1 σ 1 is bounded and
P f σ C σ f σ , C σ : = 2 σ + 3 σ ,
for all f σ 1 . Consequently, for every k 1 ,
P k f σ C σ k f σ .
Proof. 
From (11),
( P f ) ( n ) = f ( 2 n ) 2 n + 1 { n 4 ( 6 ) } f n 1 3 ( n 1 ) / 3 .
Hence
P f σ S even + S odd ,
with
S even : = n 1 | f ( 2 n ) | 2 n n σ , S odd : = n 1 n 4 ( 6 ) f n 1 3 n 1 3 n σ .
For the even branch, set m = 2 n , so n = m / 2 and
S even = m 1 m even | f ( m ) | m ( m / 2 ) σ = m 1 m even 2 σ | f ( m ) | m σ + 1 2 σ m 1 | f ( m ) | m σ = 2 σ f σ .
For the odd branch, write m = ( n 1 ) / 3 , so n = 3 m + 1 and m is odd. Then
S odd = m 1 m odd | f ( m ) | m ( 3 m + 1 ) σ m 1 | f ( m ) | m ( 3 m ) σ = 3 σ m 1 | f ( m ) | m σ + 1 3 σ f σ .
Combining the two estimates gives (13), and iterating yields (14). □
The constant C σ = 2 σ + 3 σ is an explicit growth factor for P on σ 1 . It is not < 1 in this normalization, so no contraction is claimed at this level. The genuine contraction mechanism is obtained later on the multiscale Banach space B tree , where a strong seminorm captures oscillatory decay along the Collatz tree while the 1 component provides compactness.

3. Transfer Operator Formulation

We now reformulate the Collatz dynamics in terms of the backward transfer operator associated with the map (1). This operator-theoretic viewpoint provides an analytic bridge between the discrete recurrence and the functional framework developed in later sections. The transfer operator encodes the inverse–branching structure of the map and propagates densities backward along the Collatz tree, in a form compatible with logarithmic weighting and Dirichlet series.
Recall that the Collatz map, (1), by Lemma 2.3, each n 1 has the even preimage 2 n , together with an additional odd preimage ( n 1 ) / 3 precisely when n 4 ( mod 6 ) .

3.1. Backward Transfer Operator

Definition 3.1 
(Backward transfer operator). For an arithmetic function f : N C , define
( P f ) ( n ) : = m : T ( m ) = n f ( m ) m = f ( 2 n ) 2 n + 1 { n 4 ( 6 ) } f n 1 3 ( n 1 ) / 3 , n N ,
where 1 A denotes the indicator of the condition A.
Lemma 3.2 
(Dirichlet transform intertwining). Let f σ 1 with σ > 1 , and define
D ( f ) ( s ) = n 1 f ( n ) n s .
For ( s ) > σ , the series converges absolutely and
D ( P f ) ( s ) = L s D ( f ) ( s ) ,
where the multiplier L s encodes the contribution of the two inverse branches of T:
L s z = 2 1 s z + 3 1 s z · 1 { m 1 ( 3 ) } .
Indeed,
D ( P f ) ( s ) = n 1 m : T ( m ) = n f ( m ) m n s = m 1 f ( m ) m 1 s 2 s 1 m 0 ( 2 ) + 3 s 1 m 1 ( 3 ) = L s D ( f ) ( s ) .
Proof. 
Fix f σ 1 with σ > 1 . By definition of the σ 1 -norm,
n 1 | f ( n ) | n σ < .
If ( s ) > σ , then n ( s ) n σ , so
n 1 | f ( n ) | n ( s ) n 1 | f ( n ) | n σ < .
Thus D ( f ) ( s ) = n 1 f ( n ) n s converges absolutely for ( s ) > σ .
Next we show that D ( P f ) ( s ) converges absolutely for the same range. From the definition of P,
( P f ) ( n ) = f ( 2 n ) 2 n + 1 { n 4 ( mod 6 ) } f ( ( n 1 ) / 3 ) ( n 1 ) / 3 ,
so
| P f ( n ) | | f ( 2 n ) | 2 n + 1 { n 4 ( mod 6 ) } | f ( ( n 1 ) / 3 ) | ( n 1 ) / 3 .
Hence
n 1 | P f ( n ) | n ( s ) S even + S odd ,
where
S even : = n 1 | f ( 2 n ) | 2 n n ( s ) , S odd : = n 1 n 4 ( 6 ) | f ( ( n 1 ) / 3 ) | ( n 1 ) / 3 n ( s ) .
For the even contribution, set m = 2 n so n = m / 2 and m is even. Then
S even = m 1 m even | f ( m ) | m m 2 ( s ) = 2 ( s ) m 1 m even | f ( m ) | m 1 ( s ) .
Since ( s ) > σ implies ( s ) + 1 > σ , we have m 1 ( s ) m σ , and therefore
S even 2 ( s ) m 1 m even | f ( m ) | m σ 2 ( s ) m 1 | f ( m ) | m σ < .
For the odd contribution, write n = 3 k + 1 with k 1 odd (this is equivalent to n 4 ( mod 6 ) and ( n 1 ) / 3 = k odd). Then
S odd = k 1 k odd | f ( k ) | k ( 3 k + 1 ) ( s ) .
Since 3 k + 1 k for all k 1 , we have ( 3 k + 1 ) ( s ) k ( s ) , and hence
S odd k 1 k odd | f ( k ) | k 1 ( s ) k 1 | f ( k ) | k 1 ( s ) .
Again ( s ) + 1 > σ gives k 1 ( s ) k σ , so
S odd k 1 | f ( k ) | k σ < .
Thus S even + S odd < , and D ( P f ) ( s ) converges absolutely for ( s ) > σ .
We now compute D ( P f ) ( s ) explicitly and identify it with ( L s D ( f ) ) ( s ) . By definition,
D ( P f ) ( s ) = n 1 ( P f ) ( n ) n s .
Substituting the formula for P and splitting according to the two branches,
D ( P f ) ( s ) = n 1 f ( 2 n ) 2 n n s + n 1 n 4 ( 6 ) f ( ( n 1 ) / 3 ) ( n 1 ) / 3 n s .
For the even part, set again m = 2 n :
n 1 f ( 2 n ) 2 n n s = m 1 m even f ( m ) m m 2 s = 2 s m 1 m even f ( m ) m 1 s .
For the odd part, write n = 3 k + 1 with k 1 odd and ( n 1 ) / 3 = k :
n 1 n 4 ( 6 ) f ( ( n 1 ) / 3 ) ( n 1 ) / 3 n s = k 1 k odd f ( k ) k ( 3 k + 1 ) s .
Putting the two contributions together,
D ( P f ) ( s ) = 2 s m 1 m even f ( m ) m 1 s + k 1 k odd f ( k ) k 1 ( 3 k + 1 ) s .
Now let F ( s ) = D ( f ) ( s ) = n 1 a n n s with a n = f ( n ) . By definition of L s in the lemma,
( L s F ) ( s ) = 2 s m 1 m even a m m 1 s + k 1 k odd a k k 1 ( 3 k + 1 ) s ,
and with a n = f ( n ) this matches exactly the expression we have obtained for D ( P f ) ( s ) . Hence
D ( P f ) ( s ) = ( L s D ( f ) ) ( s )
for all ( s ) > σ , as claimed. □
The multiplicative factor 1 / m assigns to each inverse branch a logarithmic weight, so that P acts as a normalized backward average along preimages. This normalization aligns the discrete dynamics with Dirichlet weights and will be crucial for analytic continuation and spectral estimates below.
Positivity. If f ( n ) 0 for all n, then ( P f ) ( n ) 0 for all n, since P is a positive linear combination of values of f.
Weighted mass preservation. A direct change of variables shows that for every nonnegative f satisfying m 1 | f ( m ) | / m < ,
n 1 ( P f ) ( n ) = m 1 f ( m ) m .
Thus P preserves the logarithmically weighted mass f ( m ) / m ; plain 1 mass is not preserved under this normalization.
Boundedness on weighted spaces. Let
σ 1 : = f : N C : f σ 1 : = n 1 | f ( n ) | n σ < , σ > 0 .
A direct change of variables in (15) yields, for all f σ 1 ,
P f σ 1 = n 1 | ( P f ) ( n ) | n σ n 1 | f ( 2 n ) | 2 n 1 + σ + 1 { n 4 ( 6 ) } f ( ( n 1 ) / 3 ) ( ( n 1 ) / 3 ) 1 + σ = 1 2 n 1 | f ( 2 n ) | n 1 + σ + 3 1 + σ n 1 n 4 ( 6 ) | f ( ( n 1 ) / 3 ) | ( n 1 ) 1 + σ .
Changing variables m = 2 n in the first sum and m = ( n 1 ) / 3 in the second gives
n 1 | f ( 2 n ) | 2 n 1 + σ = 2 σ m 1 m even | f ( m ) | m 1 + σ 2 σ f σ 1 , 3 1 + σ n 1 n 4 ( 6 ) | f ( ( n 1 ) / 3 ) | ( n 1 ) 1 + σ = 3 σ m 1 3 m + 1 4 ( 6 ) | f ( m ) | m σ 3 σ f σ 1 .
Hence
P f σ 1 2 σ + 3 σ f σ 1 ,
and therefore
P k f σ 1 2 σ + 3 σ k f σ 1 , k 0 .
Action on the weighted sup space. For the Banach space
B σ : = f : N C : f B σ : = sup n 1 n σ | f ( n ) | < ,
the normalization factor 1 / m in (15) improves decay at each branch but does not make P a contraction. Setting g ( n ) : = n f ( n ) , one obtains
n ( P f ) ( n ) = g ( 2 n ) + 1 { n 4 ( 6 ) } g n 1 3 , ( P f ) ( n ) = ( Q g ) ( n ) n , ( Q g ) ( n ) : = g ( 2 n ) + 1 { n 4 ( 6 ) } g n 1 3 .
Using f B σ = g B σ 1 , one obtains the bound
P f B σ = sup n 1 n σ 1 | ( Q g ) ( n ) | sup n 1 n σ 1 | g ( 2 n ) | + n σ 1 1 { n 4 ( 6 ) } g n 1 3 2 ( σ 1 ) + 3 σ 1 g B σ 1 = 2 ( σ 1 ) + 3 σ 1 f B σ .
In particular, the constant 2 ( σ 1 ) + 3 σ 1 1 for all σ > 0 , so P is bounded but not contractive on ( B σ , · B σ ) . This coarse boundedness provides an upper envelope for the operator norm but does not imply any decay of P k on B σ .
These limitations motivate the refinement of the functional setting in later sections, where the multiscale tree spaces B tree and B tree , σ are introduced to obtain genuine Lasota–Yorke-type contractions with λ < 1 and a provable spectral gap.

3.2. Dirichlet-Side Formulation and Intertwining

For f σ 1 with σ > 0 , the Dirichlet transform
D f ( s ) : = n 1 f ( n ) n s , ( s ) > σ ,
is absolutely convergent. Writing D f ( s ) = n 1 a n n s with a n = f ( n ) and substituting (15), we obtain
D ( P f ) ( s ) = n 1 a 2 n 2 n + 1 { n 4 ( 6 ) } a ( n 1 ) / 3 ( n 1 ) / 3 1 n s .
Thus D ( P f ) is again a Dirichlet series whose coefficients depend linearly on those of D f .
Definition 3.3 
(Dirichlet–Ruelle operator). Let D σ denote the space of Dirichlet series
F ( s ) = n 1 a n n s with n 1 | a n | n σ < .
Define L : D σ D σ by
( L F ) ( s ) : = n 1 b n n s , b n : = a 2 n 2 n + 1 { n 4 ( 6 ) } a ( n 1 ) / 3 ( n 1 ) / 3 .
Lemma 3.4 
(Operator norm of L). For σ > 0 , let F σ : = n 1 | a n | / n σ . Then L : D σ D σ is bounded and
L σ 2 σ + 3 σ .
Proof. 
From (23),
L F σ = n 1 | b n | n σ n 1 | a 2 n | 2 n n σ + n 1 n 4 ( 6 ) | a ( n 1 ) / 3 | ( n 1 ) / 3 1 n σ = : S even + S odd .
For the even term, set m = 2 n . Then
S even = m even | a m | 2 ( m / 2 ) 1 + σ = m even 2 σ | a m | m 1 + σ 2 σ m even | a m | m σ 2 σ F σ .
For the odd term, write m = ( n 1 ) / 3 , so n = 3 m + 1 and
S odd = m 1 | a m | m ( 3 m + 1 ) σ 3 σ m 1 | a m | m σ = 3 σ F σ .
Combining the two estimates gives
L F σ ( 2 σ + 3 σ ) F σ ,
proving (24). □
Lemma 3.5 
(Intertwining of P and L). For every f σ 1 with σ > 0 ,
D ( P f ) = L ( D f ) , D ( P k f ) = L k ( D f ) , k 0 ,
whenever the series converge absolutely.
Proof. 
The Dirichlet coefficients of D ( P f ) in (22) are precisely the b n of (23), so D ( P f ) = L ( D f ) ; iteration gives the second identity. □
The intertwining relation shows that spectral information for P on σ 1 transfers to L on D σ . However, since P is not contractive on σ 1 or B σ , the inequality (24) provides only a uniform boundedness envelope for L k σ , not exponential decay. Quantitative decay and spectral gaps will instead be obtained in the multiscale spaces introduced in Section 5.
Define w k : = P k 1 with 1 ( n ) 1 and
ζ C ( s , k ) : = n 1 w k ( n ) n s , ( s ) large .
By Lemma 3.5,
ζ C ( s , 0 ) = ζ ( s ) , ζ C ( s , k ) = L k ζ ( s ) , k 1 .
The quantity w k ( n ) represents the total normalized weight of all k–step backward paths from n in the Collatz tree under the logarithmic weighting 1 / m . The family ζ C ( s , k ) therefore encodes, in Dirichlet form, the distribution of these weighted backward configurations at depth k. By Lemma 3.4,
L k σ ( 2 σ + 3 σ ) k ,
so the Dirichlet coefficients of ζ C ( s , k ) are uniformly bounded in ( s ) > σ but do not necessarily decay in k. Later sections refine this estimate by passing to the multiscale tree space B tree , σ , where the Lasota–Yorke inequality ensures a true spectral gap and exponential decay of P k .

4. Spectral Reduction and Analytic Continuation

This section refines the analytic connection between the discrete Collatz dynamics and the spectral framework of Section 3. Our goal is to express analytic information about the Dirichlet series associated with iterates of the backward operator P in terms of the spectral data of P—equivalently, of the Dirichlet–Ruelle operator L—acting on suitable Banach spaces continuously embedded in σ 1 . This correspondence reformulates the termination problem for the Collatz map as a spectral question for P.
Throughout this section we fix σ > 1 and a Banach space B σ , 1 of arithmetic functions such that B σ , 1 σ 1 continuously, P ( B σ , 1 ) B σ , 1 , and the Dirichlet transform
D f ( s ) = n 1 f ( n ) n s
defines a holomorphic function for ( s ) > σ whenever f B σ , 1 . The intertwining relation (25) then yields, for all k 0 ,
D ( P k f ) ( s ) = n 1 ( P k f ) ( n ) n s , ( s ) > σ .
Since B σ , 1 σ 1 , each series converges absolutely. By the σ 1 estimate (18),
| D ( P k f ) ( s ) | P k f σ 1 2 σ + 3 σ k f σ 1 , ( s ) > σ .
The bound (28) shows that the iterates of P are uniformly bounded on σ 1 , though not contractive; a genuine contraction will appear only after the refinement to the multiscale tree spaces introduced in Section 4.4.
Generating function and operator resolvent. For z C with | z | < ( 2 σ + 3 σ ) 1 , define the two–variable generating function
G f ( s , z ) : = k 0 z k D ( P k f ) ( s ) .
The series converges absolutely and locally uniformly for ( s ) > σ , hence G f is holomorphic in ( s , z ) on the domain
Ω σ : = { ( s , z ) C 2 : ( s ) > σ , | z | < ( 2 σ + 3 σ ) 1 } .
On the operator side, for such z the Neumann series
( I z P ) 1 = k 0 z k P k
converges in operator norm on B σ , 1 , and thus
G f ( s , z ) = D ( I z P ) 1 f ( s ) , ( s , z ) Ω σ .
The poles of ( I z P ) 1 in the z–plane occur precisely at the reciprocals of the spectral values of P on B σ , 1 . Consequently the analytic structure of G f as a function of z is governed by the spectrum of P.
At this point we recall that the backward Collatz operator P preserves total mass on 1 :
n 1 ( P f ) ( n ) = m 1 f ( m ) ,
so 1 is a simple eigenvalue corresponding to the eigenvector 1 ( n ) 1 . Hence the spectral analysis of P will focus on demonstrating a spectral gap at 1: all other spectral values satisfy | λ | λ LY < 1 . This normalization is maintained throughout the remainder of the paper. The resolvent expansion (30) is therefore analytic for | z | < 1 except at the simple pole z = 1 , whose residue encodes the invariant functional associated with 1 .
The coarse resolvent radius ( 2 σ + 3 σ ) 1 merely provides an elementary domain of convergence. A sharper meromorphic continuation—reflecting the true spectral radius r ( P ) = 1 and the subdominant bound ρ ess ( P ) λ LY < 1 —will be obtained on the refined spaces B tree and B tree , σ , where the Lasota–Yorke inequality gives quantitative contraction of oscillations between adjacent scales.
Finally, for the constant function 1 ( n ) 1 (whenever 1 B σ , 1 ), the coefficients of G 1 ( s , z ) are precisely the Collatz Dirichlet series ζ C ( s , k ) defined in (26). Thus the analytic continuation and asymptotic decay of ζ C ( s , k ) as k are controlled by the spectral properties of P through (30); their exponential decay emerges once the spectral gap on the multiscale tree spaces is established.

4.1. Spectral Reduction and Analytic Continuation

Recall that the Dirichlet–Ruelle operator L is defined on D σ by (23). The intertwining Lemma 3.5 asserts that for all f σ 1 ,
D ( P f ) = L ( D f ) .
Since D is injective on σ 1 , every eigenpair ( λ , f ) of P with f σ 1 produces an eigenpair ( λ , D f ) of L. Conversely, if L F = λ F and F = D f lies in the image of D , then P f = λ f . Hence the point spectra of P on B σ , 1 and of L on D σ coincide on the subspace D ( B σ , 1 ) . In particular,
ρ ( L ) ρ ( P ) ,
and any spectral gap or peripheral spectral property of P transfers to the induced action of L on Dirichlet series arising from B σ , 1 .
We emphasize that equality σ ( L ) = σ ( P ) is not assumed. The partial correspondence (31) suffices for analytic reduction: the Dirichlet-side continuation of D ( P k f ) reflects the spectral geometry of P.
Mass preservation and spectral gap. Because P only preserves total mass up to a logarithmic factor, we have
n 1 ( P f ) ( n ) = m 1 f ( m ) m ,
so the constant function 1 ( n ) 1 is not an eigenvector. Instead, P admits a unique positive invariant density h B tree , σ and a unique positive invariant functional ϕ B tree , σ * with
P h = h , ϕ P = ϕ , ϕ ( h ) = 1 .
Throughout the paper we work with this Perron–Frobenius normalization (32) and express all spectral decompositions relative to the nonconstant invariant profile h.
Within this framework, the Dirichlet–Ruelle operator L inherits the same dominant eigenvalue 1 and the same spectral gap on the subspace D ( B σ , 1 ) . The analytic behavior of the Collatz Dirichlet series ζ C ( s , k ) = D ( P k 1 ) ( s ) is then determined by how P k approaches the spectral projector onto the invariant subspace spanned by 1 .
Theorem 4.1 
(Spectral reduction and analytic continuation). Let B σ , 1 be a Banach space of arithmetic functions continuously embedded in σ 1 such that P : B σ , 1 B σ , 1 is quasi-compact and satisfies the mass-preserving normalization (12). Assume further that 1 is a simple eigenvalue of P and that all other spectral values lie in the closed disk | λ | λ LY < 1 . Then for every f B σ , 1 the Dirichlet transforms D ( P k f ) ( s ) extend holomorphically to ( s ) > σ and admit the decomposition
D ( P k f ) ( s ) = Π 1 ( f ) D ( 1 ) ( s ) + R k ( s ) , | R k ( s ) | C f ( s ) λ LY k ,
where Π 1 is the spectral projection associated with the eigenvalue 1 and C f ( s ) is locally bounded on { ( s ) > σ } . In particular, for f with Π 1 ( f ) = 0 , the functions D ( P k f ) ( s ) decay exponentially in k uniformly on compact subsets of ( s ) > σ .
When f = 1 , the same conclusion applies to ζ C ( s , k ) = D ( P k 1 ) ( s ) , whose exponential stabilization corresponds to convergence toward the invariant density associated with the Collatz operator.
Proof. 
By quasi-compactness, the spectrum of P decomposes as
σ ( P ) = { 1 } σ ess ( P ) , ρ ess ( P ) λ LY < 1 ,
and the Riesz projection Π 1 = 1 2 π i | z 1 | = ε ( z I P ) 1 d z is a bounded projection onto the one-dimensional invariant subspace spanned by 1 . Then P k = Π 1 + N k , where N k B σ , 1 C λ LY k for some constant C > 0 . Applying the Dirichlet transform and using | D ( g ) ( s ) | g σ 1 for ( s ) > σ gives
D ( P k f ) ( s ) = D ( Π 1 f ) ( s ) + D ( N k f ) ( s ) , | D ( N k f ) ( s ) | C λ LY k f B σ , 1 .
Since Π 1 f is a multiple of 1 , we may write D ( Π 1 f ) = Π 1 ( f ) D ( 1 ) , yielding (33). Analyticity for ( s ) > σ follows from absolute convergence and locally uniform bounds. □
This form aligns with the quasi-compactness obtained later on the multiscale tree space B tree , σ , where the Lasota–Yorke inequality ensures ρ ess ( P ) λ LY < 1 . The exponential term λ LY k in (33) corresponds to the essential spectral radius and controls the rate of decay of correlations and Dirichlet coefficients. Under stronger spectral assumptions, the representation can be refined to a meromorphic decomposition in which each isolated eigenvalue λ j contributes a term λ j k D ( Π j f ) , generalizing the usual Ruelle–Perron expansion.

4.2. Spectral Criterion on Weighted 1 spaces

The preceding analysis shows that sufficiently strong spectral control of P on an appropriate Banach space B σ , 1 forces all Dirichlet data generated by the backward Collatz tree to exhibit exponential stabilization toward the invariant profile. Since P is not contractive on σ 1 or B σ , such behavior can only arise on refined Banach spaces where a genuine spectral gap at the eigenvalue 1 has been established. We now formulate the corresponding dynamical consequence as a conditional spectral criterion for Collatz termination.
Theorem 4.2 
(Spectral criterion for Collatz termination). Let P act on a Banach space B σ , 1 σ 1 such that P ( B σ , 1 ) B σ , 1 and 1 B σ , 1 . Assume that P is quasi-compact on B σ , 1 , that 1 is a simple eigenvalue of P corresponding to the unique positive invariant density h, and that all other spectral values satisfy
σ ( P ) { 1 } { z C : | z | λ LY < 1 } .
Then every f B σ , 1 admits a decomposition
P k f = Π 1 f + N k f , N k f B σ , 1 C λ LY k f B σ , 1 ,
where Π 1 is the spectral projection onto span { h } . Consequently, there exists no nontrivial invariant or periodic density for the backward Collatz dynamics in B σ , 1 ; the only invariant direction is the positive eigenfunction h. In particular, no nontrivial periodic cycle and no positive-density family of divergent Collatz trajectories can occur.
Proof. 
By quasi-compactness, the spectrum of P decomposes as σ ( P ) = { 1 } σ ess ( P ) with ρ ess ( P ) λ LY < 1 . The associated Riesz projection
Π 1 = 1 2 π i | z 1 | = ε ( z I P ) 1 d z
is bounded and satisfies P Π 1 = Π 1 P = Π 1 . Since 1 is a simple eigenvalue with positive eigenfunction h, we have
Π 1 f = ϕ ( f ) h ,
where ϕ is the corresponding eigenfunctional normalized so that ϕ ( h ) = 1 .
Hence the power iterates decompose as
P k = Π 1 + N k , N k B σ , 1 C λ LY k ,
for some constant C > 0 .
If a nontrivial invariant density f B σ , 1 satisfied P f = f , then f would belong to the eigenspace of λ = 1 . Since this eigenspace is one-dimensional and spanned by h, we must have f = c h for some constant c. Thus no additional invariant densities exist beyond span { h } .
If a periodic density f satisfied P q f = f for some q > 0 , then f would belong to an eigenspace associated with an eigenvalue λ satisfying | λ | = 1 . Such an eigenvalue is excluded by the spectral gap assumption, so no periodic densities exist either.
Finally, via the standard correspondence between transfer-operator invariants and dynamical orbits on the Collatz graph, any invariant or periodic density corresponds to either a periodic Collatz cycle or to a positive-density family of non-terminating trajectories. The spectral gap therefore precludes these dynamical behaviors. □
Section 4.4 constructs the multiscale tree Banach space B tree and establishes a Lasota–Yorke inequality that ensures quasi-compactness of P with an explicit contraction constant λ LY < 1 in the strong seminorm. Verification of the hypotheses of Theorem 4.2 on B tree , σ provides the analytic–spectral bridge: a strict spectral gap for P on B tree , σ rules out the spectral signatures associated with any non-terminating Collatz behavior.

4.3. Multi-Scale Tree Space

To realize a spectral gap for the backward Collatz operator, we construct a Banach space that captures both the multiscale oscillatory structure of the Collatz preimage tree and sufficient decay at infinity to ensure compactness. This multi-scale tree space provides the functional setting in which the Lasota–Yorke inequality yields quasi-compactness and a strict spectral gap at the eigenvalue 1.
For j 0 define the scale blocks
I j : = [ 6 j , 2 · 6 j ) N .
The factor 6 reflects the approximate scale multiplication under the backward map, combining the even branch m = 2 n and the odd branch m = ( n 1 ) / 3 (defined for n 4 ( mod 6 ) ).
Fix parameters 0 < α < 1 and 0 < ϑ < 1 . For indices u , v > 0 , define the scale-sensitive weight
W α ( u , v ) : = u v | u v | ( u + v ) α , u v .
This weight penalizes small separations between indices, emphasizing local oscillations of f, while the factor ( u + v ) α damps sensitivity at large scales. The geometric coefficient ϑ j provides exponential attenuation of oscillations across successive levels of the tree.
Definition 4.3 
(Multiscale tree seminorm and space). For f : N C define
[ f ] tree : = j 0 ϑ j sup m , n I j m n W α ( m , n ) | f ( m ) f ( n ) | .
The corresponding Banach space
B tree : = f : N C : f 1 + [ f ] tree < , f tree : = f 1 + [ f ] tree ,
is called the multiscale tree space.
Standard arguments for weighted variation-type seminorms show that ( B tree , · tree ) is complete. The seminorm [ f ] tree controls the oscillatory irregularity of f within each scale block I j , while the 1 component controls the overall magnitude. However, B tree alone does not impose sufficient decay as n to guarantee compactness.
Weighted extension. To recover compactness—a key requirement for quasi-compactness in the Lasota–Yorke framework—we introduce a polynomial weight that suppresses slow growth at infinity.
Definition 4.4 
(Weighted tree space). For parameters 0 < α < 1 , 0 < ϑ < 1 , and σ > 1 , set
f σ : = n 1 | f ( n ) | n σ , [ f ] tree : = j 0 ϑ j sup m , n I j m n W α ( m , n ) | f ( m ) f ( n ) | .
Then
B tree , σ : = f : N C : f σ + [ f ] tree < , f tree , σ : = f σ + [ f ] tree .
The factor n σ enforces quantitative decay of f at large indices, while [ f ] tree measures the oscillatory complexity of f along each level of the tree. Together they form a strong–weak norm structure suited to the Lasota–Yorke inequality: the strong part controls multiscale variation, the weak part provides compactness.
Lemma 4.5 
(Compact embedding). For fixed 0 < α < 1 , 0 < ϑ < 1 , and σ > 1 , the unit ball of B tree , σ is relatively compact in σ 1 .
Proof. 
Let
U : = f B tree , σ : f tree , σ 1 .
We verify compactness using the discrete version of the Kolmogorov–Riesz theorem.
(i) Uniform boundedness. Each f U satisfies f σ 1 , so U is bounded in σ 1 .
(ii) Uniform tail control. For any ε > 0 choose N so that n > N n σ < ε . Then for all f U ,
n > N | f ( n ) | n σ f σ n > N 1 n σ ε ,
so the tails contribute arbitrarily little σ 1 –mass.
(iii) Local equicontinuity on finite blocks. Fix J 0 and consider the finite union E J = j J I j . Within each I j , the seminorm term ϑ j sup m , n I j W α ( m , n ) | f ( m ) f ( n ) | bounds discrete oscillations uniformly in f. Hence the family { f | E J : f U } lies in a compact subset of the finite-dimensional space C E J .
(iv) Diagonal extraction. Given any sequence ( f ( k ) ) U , apply the compactness on E 1 , E 2 , and extract a diagonal subsequence converging pointwise on all of N . By (ii) the tails beyond any fixed N have uniformly small weight, so pointwise convergence on finite windows implies convergence in σ 1 . Thus U is relatively compact in σ 1 . □
Remark 4.6. 
The weight n σ is essential. Without it, the unit ball of B tree is not precompact in 1 : one can construct sequences of disjointly supported spikes whose tree seminorms remain bounded while their supports drift to infinity. Taking σ > 1 eliminates this escape to infinity, yielding the compact embedding required for quasi-compactness.
The space B tree , σ thus provides the natural functional environment for the Lasota–Yorke inequality. Its compact embedding into σ 1 ensures that the essential spectral radius of P on B tree , σ is strictly smaller than its spectral radius, a prerequisite for establishing a genuine spectral gap. The strong seminorm captures multiscale regularity across the Collatz tree, while the weighted 1 norm supplies the compactness that underlies the spectral analysis of the backward transfer operator.

4.4. Lasota–Yorke Inequality on B tree

Recall from (11) that
( P f ) ( n ) = f ( 2 n ) 2 n + 1 { n 4 ( 6 ) } f n 1 3 ( n 1 ) / 3 .
It is convenient to split P into its even and odd components:
( P even f ) ( n ) : = f ( 2 n ) 2 n , ( P odd f ) ( n ) : = 1 { n 4 ( 6 ) } f n 1 3 ( n 1 ) / 3 ,
so that P = P even + P odd .
From the 1 estimates of Section 2, both branches are bounded on 1 , hence on B tree . The Lasota–Yorke inequality arises from the fact that P even is strongly contracting in the tree seminorm, while P odd is a controlled perturbation whose contribution is damped by the multiscale factor ϑ j .

4.4.1. Even Branch Contraction on the Multiscale Tree Space

We first record the even-branch estimate.
Lemma 4.7 
(Even branch contraction on B tree , σ ). Let 0 < α < 1 , 0 < ϑ < 1 , and σ > 1 . There exists a constant C even > 0 depending only on α, ϑ, and σ such that for all f B tree , σ ,
[ P even f ] tree 2 ( 1 α ) ϑ [ f ] tree + C even f σ .
In particular, once α is fixed, choosing ϑ sufficiently small makes P even strictly contracting in the tree seminorm up to a controlled · σ error term.
Proof. 
Recall that ( P even f ) ( n ) = f ( 2 n ) / ( 2 n ) . For each j 0 , the block seminorm of P even f is
Δ j ( P even f ) : = sup u , v I j u v 1 6 j W α ( u , v ) ( P even f ) ( u ) ( P even f ) ( v ) .
Fix j and u , v I j with u v . We decompose
( P even f ) ( u ) ( P even f ) ( v ) = f ( 2 u ) f ( 2 v ) 2 u + f ( 2 v ) 1 2 u 1 2 v = : D 1 ( u , v ) + D 2 ( u , v ) ,
and estimate the two terms separately.
(1) The oscillatory part D 1 . Since
W α ( 2 u , 2 v ) = 2 1 α W α ( u , v ) ,
we have
W α ( u , v ) = 2 ( 1 α ) W α ( 2 u , 2 v ) .
Hence
1 6 j W α ( u , v ) | D 1 ( u , v ) | 2 ( 1 α ) 6 j W α ( 2 u , 2 v ) | f ( 2 u ) f ( 2 v ) | 2 u .
Since u I j = [ 6 j , 2 · 6 j ) , u 6 j , so 1 / ( 2 u ) 1 / ( 2 · 6 j ) and
1 6 j W α ( u , v ) | D 1 ( u , v ) | 2 ( 1 α ) 1 6 2 j W α ( 2 u , 2 v ) | f ( 2 u ) f ( 2 v ) | .
The pair ( 2 u , 2 v ) lies at scale comparable to 6 j , i.e. within a bounded number of block levels. Hence there exists a constant c 0 > 0 depending only on the block geometry such that
1 6 2 j W α ( 2 u , 2 v ) c 0 1 6 j W α ( 2 u , 2 v ) for some j { j , j + 1 } .
Taking the supremum over u , v I j gives
Δ j ( P even f ; D 1 ) c 0 2 ( 1 α ) 1 max { Δ j ( f ) , Δ j + 1 ( f ) } .
Multiplying by ϑ j and using ϑ j Δ j ( f ) [ f ] tree and ϑ j Δ j + 1 ( f ) ϑ 1 [ f ] tree , we obtain
ϑ j Δ j ( P even f ; D 1 ) c 1 2 ( 1 α ) ϑ [ f ] tree ,
for some constant c 1 depending only on α and ϑ . Taking the supremum over j yields
[ P even f ] tree ( D 1 ) c 1 2 ( 1 α ) ϑ [ f ] tree .
(2) The denominator part D 2 . Assume u > v . Then
1 2 u 1 2 v = | u v | 2 u v , | D 2 ( u , v ) | = | f ( 2 v ) | | u v | 2 u v .
Thus
W α ( u , v ) | D 2 ( u , v ) | = u v | u v | ( u + v ) α | f ( 2 v ) | | u v | 2 u v = | f ( 2 v ) | 2 ( u + v ) α .
For u , v I j , we have u + v 2 · 6 j , so
W α ( u , v ) | D 2 ( u , v ) | C α 6 α j | f ( 2 v ) | with C α : = 2 ( 1 + α ) .
Hence
Δ j ( P even f ; D 2 ) C α 6 ( 1 + α ) j sup v I j | f ( 2 v ) | .
Multiplying by ϑ j and summing over j gives
ϑ j Δ j ( P even f ; D 2 ) C α ( ϑ 6 ( 1 + α ) ) j sup v I j | f ( 2 v ) | .
Each integer n appears as n = 2 v for at most one v I j , and since | f ( n ) | n σ f σ , the geometric factor ( ϑ 6 ( 1 + α ) ) j ensures convergence of the series in j. Thus there exists a constant C even > 0 depending only on α , ϑ , and σ such that
sup j 0 ϑ j Δ j ( P even f ; D 2 ) C even f σ .
(3) Combine the two parts. Combining the bounds for D 1 and D 2 and renaming constants gives
[ P even f ] tree 2 ( 1 α ) ϑ [ f ] tree + C even f σ ,
which is the desired inequality (38). □
The odd branch requires more care because it shifts indices from n to ( n 1 ) / 3 and only acts on the congruence class n 4 ( mod 6 ) . Its effect is nonetheless small once weighted by ϑ j .

4.4.2. Odd Branch Contraction on the Multiscale Tree Space

Lemma 4.8 
(Odd-branch distortion on scale blocks). Let 0 < α < 1 . If n 4 ( mod 6 ) and n I j = [ 6 j , 2 · 6 j ) , then the odd preimage m = ( n 1 ) / 3 satisfies m I j 1 and
W α ( m 1 , m 2 ) 6 1 α W α ( n 1 , n 2 )
whenever n 1 , n 2 I j lie on the same ray and m i = ( n i 1 ) / 3 .
Proof. 
For n I j we have n 6 j ; hence m = ( n 1 ) / 3 6 j 1 , which gives m I j 1 . Moreover,
| m 1 m 2 | = 1 3 | n 1 n 2 | and m 1 + m 2 6 j 1 .
Thus
W α ( m 1 , m 2 ) = | m 1 m 2 | ( m 1 + m 2 ) α 1 3 | n 1 n 2 | ( 6 1 ( n 1 + n 2 ) ) α = 6 1 α W α ( n 1 , n 2 ) ,
which proves (39). □
Lemma 4.9 
(Odd branch on B tree ). Let 0 < α < 1 , 0 < ϑ < 1 , and σ > 1 . Then there exist constants C α > 0 and C odd > 0 depending only on α, ϑ, and σ such that for all f B tree , σ one has
[ P odd f ] tree λ odd ( α , ϑ ) [ f ] tree + C odd f σ ,
where the contraction factor satisfies
λ odd ( α , ϑ ) C α 6 ϑ .
Here C α is the odd-branch distortion constant from Lemma 4.8, i.e.
C α : = sup u > v > 0 W α ( u , v ) W α ( u , v ) , ( u , v ) = u 1 3 , v 1 3 ,
which is finite for every 0 < α < 1 .
Proof. 
Recall that
( P odd f ) ( n ) = 1 { n 4 ( 6 ) } f n 1 3 ( n 1 ) / 3 .
For each j 0 define
A j ( f ) : = sup m , n I j m n W α ( m , n ) P odd f ( m ) P odd f ( n ) ,
so that, by definition of [ · ] tree ,
[ P odd f ] tree = j 0 ϑ j A j ( f ) .
Fix j 0 and m , n I j , m n . We decompose according to the active congruence class 4 ( mod 6 ) .
Case 1: neither m nor n is 4 ( mod 6 ) . Then P odd f ( m ) = P odd f ( n ) = 0 , so this pair contributes nothing to A j ( f ) .
Case 2: exactly one of m , n is 4 ( mod 6 ) . Without loss of generality, assume m 4 ( mod 6 ) and n ¬ 4 ( mod 6 ) . Set k : = ( m 1 ) / 3 . Then
P odd f ( m ) P odd f ( n ) = f ( k ) k ,
and hence
W α ( m , n ) P odd f ( m ) P odd f ( n ) = W α ( m , n ) | f ( k ) | k .
Since m , n I j = [ 6 j , 2 · 6 j ) , there exist constants c 1 , c 2 > 0 (depending only on α ) such that
W α ( m , n ) c 1 6 ( 2 α ) j , k = m 1 3 c 2 6 j 1 ,
so
ϑ j W α ( m , n ) | f ( k ) | k C ( ϑ 6 1 α ) j | f ( k ) |
for some constant C depending only on α . Each k arises from at most one such m and j, so summing first over pairs ( m , n ) of this type and then over j yields
j 0 ϑ j sup m , n I j exactly one 4 ( 6 ) W α ( m , n ) P odd f ( m ) P odd f ( n ) C odd , 1 f 1 ,
provided ϑ 6 1 α < 1 , which we assume from now on. Here C odd , 1 depends on α and ϑ , but not on f.
Case 3: both m and n are 4 ( mod 6 ) . Set
m = m 1 3 , n = n 1 3 ,
so that
P odd f ( m ) = f ( m ) m , P odd f ( n ) = f ( n ) n .
We decompose
f ( m ) m f ( n ) n = f ( m ) f ( n ) m + f ( n ) 1 m 1 n = : D 1 + D 2 .
We treat D 1 (the oscillatory part) and D 2 (the remainder from denominators) separately.
Case 3a: the D 1 term (contractive contribution). A direct computation with m = 3 m + 1 , n = 3 n + 1 shows that there exists a constant C α 1 depending only on α such that
W α ( m , n ) W α ( m , n ) C α
for all m n with m n 4 ( mod 6 ) . (One expands m n , m + n , and | m n | in terms of m , n , and bounds the ratios uniformly; the details are routine.)
Thus
W α ( m , n ) | f ( m ) f ( n ) | m C α W α ( m , n ) | f ( m ) f ( n ) | m .
Now use that m 6 j 1 for m I j with m 4 ( mod 6 ) , so 1 / m 6 ( j 1 ) . Among the O ( 6 j ) indices in I j , only a proportion 1 / 6 lie in the active residue class 4 ( mod 6 ) . Applying Cauchy–Schwarz to the collection of such pairs in I j and using this 1 / 6 density, one obtains the averaged bound
ϑ j sup m , n I j m n 4 ( 6 ) W α ( m , n ) | D 1 | C α 6 ϑ j 1 sup m , n W α ( m , n ) | f ( m ) f ( n ) | ,
where ( m , n ) range over the corresponding preimage pairs. (The factor 1 / 6 is the standard gain from passing from a 1 / 6 -density subset of indices to an L 2 -type control of the supremum.)
Taking the supremum over all admissible ( m , n ) and summing over j gives
j 0 ϑ j sup m , n I j m n 4 ( 6 ) W α ( m , n ) | D 1 | C α 6 ϑ j 0 ϑ j 1 sup m , n I j 1 W α ( m , n ) | f ( m ) f ( n ) | .
By the definition of [ f ] tree , the right-hand side is
C α 6 ϑ [ f ] tree .
This yields the desired contribution with contraction factor λ odd ( α , ϑ ) ( C α / 6 ) ϑ from the D 1 term.
Case 3b: the D 2 term (error controlled by f 1 ). We have
| D 2 | = | f ( n ) | 1 m 1 n = | f ( n ) | | m n | m n .
Since | m n | = 3 | m n | ,
W α ( m , n ) | D 2 | = m n | m n | ( m + n ) α | f ( n ) | | m n | m n = m n 3 ( m + n ) α m n | f ( n ) | .
For m , n I j one has m n 6 2 j , m + n 6 j , m n 6 2 j 2 , so
W α ( m , n ) | D 2 | C 6 α j | f ( n ) |
for some constant C depending only on α . Hence
ϑ j sup m , n I j m n 4 ( 6 ) W α ( m , n ) | D 2 | C ( ϑ 6 α ) j sup n | f ( n ) | .
Each n arises from at most a bounded number of ( m , n , j ) , and ϑ 6 α < 1 for fixed ϑ ( 0 , 1 ) and α ( 0 , 1 ) , so summing over j and using | f ( n ) | f 1 / n shows that the total D 2 contribution is bounded by
j 0 ϑ j sup m , n I j m n 4 ( 6 ) W α ( m , n ) | D 2 | C odd , 2 f 1
for some constant C odd , 2 > 0 independent of f.
Combining the three cases, we obtain
[ P odd f ] tree = j 0 ϑ j A j ( f ) C α 6 ϑ [ f ] tree + ( C odd , 1 + C odd , 2 ) f 1 .
Setting C odd : = C odd , 1 + C odd , 2 yields (40) with λ odd ( α , ϑ ) ( C α / 6 ) ϑ , as claimed. □

4.5. From Boundedness to the Lasota–Yorke Inequality on B tree , σ

Definition 4.10 
(Tree seminorm). Let I j = [ 6 j , 2 · 6 j ) N be the standard multiscale blocks. For f : N C define the block oscillation
osc I j ( f ) : = sup m , n I j | f ( m ) f ( n ) | .
Fix 0 < α < 1 . The strong tree seminorm is
[ f ] tree : = sup j 0 6 α j osc I j ( f ) ,
and the full norm on B tree , σ is
f tree , σ : = [ f ] tree + A f σ 1 ,
for a fixed constant A > 0 . This choice enforces uniform decay of oscillation across scales and yields the compact embedding B tree , σ σ 1 .
Lemma 4.11 
(Invariance and boundedness on B tree , σ ). Let 0 < α < 1 , 0 < ϑ < 1 , and σ > 1 . Then the backward Collatz transfer operator P maps B tree , σ into itself and is bounded: there exists C > 0 such that
P f tree , σ C f tree , σ for all f B tree , σ .
Proof. 
Using the even/odd decomposition,
( P f ) ( n ) = ( P even f ) ( n ) + ( P odd f ) ( n ) = f ( 2 n ) 2 n + 1 { n 4 ( 6 ) } f n 1 3 ( n 1 ) / 3 .
We show both P f σ and [ P f ] tree are bounded by f tree , σ .
1. Weighted σ 1 bound. For the even part, substitute m = 2 n :
P even f σ = n 1 | f ( 2 n ) | 2 n n σ = m 1 m even | f ( m ) | m m 2 σ = 2 σ m 1 m even | f ( m ) | m ( σ + 1 ) 2 σ f σ .
For the odd part, write m = ( n 1 ) / 3 (so n = 3 m + 1 and m 1 ):
P odd f σ = n 1 n 4 ( 6 ) | f ( ( n 1 ) / 3 ) | ( n 1 ) / 3 n σ = m 1 | f ( m ) | m ( 3 m + 1 ) σ 3 σ m 1 | f ( m ) | m ( σ + 1 ) 3 σ f σ .
Hence
P f σ 2 σ + 3 σ f σ 2 σ + 3 σ f tree , σ .
2. Tree seminorm bound. By subadditivity, [ P f ] tree [ P even f ] tree + [ P odd f ] tree . From Lemma 4.7 (even branch on B tree ),
[ P even f ] tree 2 ( 1 α ) [ f ] tree + C even f 1 .
From Lemma 4.9 (odd branch on B tree ),
[ P odd f ] tree λ odd ( α , ϑ ) [ f ] tree + C odd f 1 , λ odd ( α , ϑ ) C α 6 ϑ .
To lift the weak term from · 1 to · σ , we revisit the remainder estimates (the “denominator” terms) in the proofs. For the even branch remainder,
W α ( u , v ) f ( 2 v ) 1 2 u 1 2 v 6 α j | f ( 2 v ) | ( u , v I j ) ,
so
ϑ j sup u , v I j · ϑ j 6 α j v I j | f ( 2 v ) | = v I j ϑ 6 α j | f ( 2 v ) | .
Because each v belongs to exactly one block I j and v 6 j in that block, we have
ϑ 6 α j C ( 2 v ) σ ϑ j C 6 ( σ α ) j ,
which holds once we impose the admissibility condition
ϑ 6 σ α < 1 .
Summing over j and v then gives a bound f σ for the even-branch remainder. The odd-branch denominator term is handled identically (replacing 2 v by n = ( n 1 ) / 3 6 j 1 ), yielding again a bound f σ under (44). Renaming constants, we therefore have
[ P f ] tree 2 ( 1 α ) + λ odd ( α , ϑ ) [ f ] tree + C tree , σ f σ .
Finally, (43) and (45) yield
P f tree , σ = P f σ + [ P f ] tree 2 σ + 3 σ + 2 ( 1 α ) + λ odd ( α , ϑ ) + C tree , σ f tree , σ .
This proves boundedness of P on B tree , σ . □
Proposition 4.12 
(Lasota–Yorke inequality on B tree , σ ). Let 0 < α < 1 , 0 < ϑ < 1 , and σ > 1 satisfy the admissibility condition (44). Then there exists a constant C LY , σ > 0 such that for all f B tree , σ ,
[ P f ] tree λ ( α , ϑ ) [ f ] tree + C LY , σ f σ , λ ( α , ϑ ) : = 2 ( 1 α ) + λ odd ( α , ϑ ) ,
with λ odd ( α , ϑ ) ( C α / 6 ) ϑ . In particular, if λ ( α , ϑ ) < 1 then P is strictly contracting in the strong seminorm [ · ] tree up to a controlled · σ –perturbation.
Proof. 
Combine the even/odd seminorm bounds from (45). □
Remark 4.13
(Parameter window). he lift from · 1 to · σ in the remainder terms uses only (44). A convenient (and used later) choice is ( α , ϑ , σ ) = ( 1 2 , 1 5 , 1 + ε ) with any small ε > 0 , since then ϑ 6 σ α = 1 5 6 ε + 1 / 2 < 1 . Together with the explicit odd-branch constant from Section 6, this yields λ ( α , ϑ ) < 1 and hence quasi-compactness of P on B tree , σ .
Corollary 4.14 
(Essential spectral radius bound on B tree , σ ). Let 0 < α < 1 , 0 < ϑ < 1 , and σ > 1 satisfy the admissibility condition (44). Assume the Lasota–Yorke inequality (46) and the compact embedding B tree , σ σ 1 from Lemma 4.5. Then P : B tree , σ B tree , σ is quasi-compact and its essential spectral radius satisfies
ρ ess P B tree , σ λ ( α , ϑ ) = 2 ( 1 α ) + λ odd ( α , ϑ ) , λ odd ( α , ϑ ) C α 6 ϑ .
Proof. 
By (46) there exists C LY , σ such that, for all f B tree , σ ,
[ P f ] tree λ ( α , ϑ ) [ f ] tree + C LY , σ f σ .
This is a Doeblin–Fortet (Lasota–Yorke) inequality for the pair · strong = [ · ] tree and · weak = · σ . Since the unit ball of B tree , σ is relatively compact in σ 1 by Lemma 4.5, the injection B tree , σ σ 1 is compact. The Ionescu–Tulcea–Marinescu/Hennion quasi-compactness theorem then implies that P is quasi-compact on B tree , σ with
ρ ess P B tree , σ λ ( α , ϑ ) .

4.6. Quasi-Compactness of the Backward Operator

Lemma 4.15 
(Odd-branch weight distortion at α = 1 2 ). Let W α ( m , n ) = m n | m n | ( m + n ) α be the tree weight from (35) and let m = ( m 1 ) / 3 , n = ( n 1 ) / 3 . For α = 1 2 there exists an absolute constant
C 0 = 16 3 3 / 2 < 3.1
such that for all m n 4 ( mod 6 ) with m n ,
W 1 / 2 ( m , n ) W 1 / 2 ( m , n ) C 0 .
Consequently, the oscillatory part of the odd branch satisfies
λ odd ( 1 2 , ϑ ) C 0 6 ϑ ,
as used in Lemma 4.9 and Lemma 4.16.
Proof. 
Let m n 4 ( mod 6 ) , m n , and define m = ( m 1 ) / 3 , n = ( n 1 ) / 3 . Note that m , n N and m n . Using the definitions,
W 1 / 2 ( m , n ) = m n | m n | ( m + n ) 1 / 2 , W 1 / 2 ( m , n ) = m n | m n | ( m + n ) 1 / 2 .
Form the ratio and simplify:
W 1 / 2 ( m , n ) W 1 / 2 ( m , n ) = m n m n · | m n | | m n | · ( m + n ) 1 / 2 ( m + n ) 1 / 2 .
Since m = 3 m + 1 and n = 3 n + 1 , we have | m n | = 3 | m n | and m + n = 3 ( m + n ) + 2 . Hence
W 1 / 2 ( m , n ) W 1 / 2 ( m , n ) = m n m n · 1 3 · ( m + n ) 1 / 2 ( 3 ( m + n ) + 2 ) 1 / 2 .
We now bound the three factors on the right-hand side.
(i) The product ratio. Using m = 3 m + 1 4 m and n = 3 n + 1 4 n for all m , n 1 , we get
m n m n = ( 3 m + 1 ) ( 3 n + 1 ) m n 16 .
(ii) The difference ratio. We already used | m n | = 3 | m n | , so this contributes the exact factor 1 / 3 .
(iii) The sum ratio. Since 3 ( m + n ) + 2 3 ( m + n ) , we obtain
( m + n ) 1 / 2 ( 3 ( m + n ) + 2 ) 1 / 2 ( m + n ) 1 / 2 ( 3 ( m + n ) ) 1 / 2 = 1 3 .
Combining (i)–(iii) in (49) yields
W 1 / 2 ( m , n ) W 1 / 2 ( m , n ) 16 · 1 3 · 1 3 = 16 3 3 / 2 = : C 0 .
This proves (48).
For the consequence on the oscillatory part of the odd branch in the Lasota–Yorke estimate, recall the standard decomposition in the proof of Lemma 4.9: when both m , n I j are in the active residue class 4 ( mod 6 ) , the D 1 (oscillatory) term contributes
W 1 / 2 ( m , n ) | f ( m ) f ( n ) | m .
Using (48) and the relation m 6 j 1 for m I j , one passes from level j to level j 1 with a loss bounded by C 0 ; the block weight ϑ j supplies the one-step factor ϑ , and restricting to the active residue class has relative density 1 / 6 , which produces a Cauchy–Schwarz gain 1 / 6 in the passage from a subset supremum to the block-level control (see the proof of Lemma 4.9 for the standard L 2 averaging step). Altogether,
j 0 ϑ j sup m , n I j m n 4 ( 6 ) W 1 / 2 ( m , n ) | f ( m ) f ( n ) | m C 0 6 ϑ [ f ] tree ,
which is the claimed bound λ odd ( 1 2 , ϑ ) ( C 0 / 6 ) ϑ . □
Lemma 4.16 
(Explicit odd-branch constant). For α = 1 2 and ϑ = 1 5 there exist constants C α > 0 and C odd > 0 such that for all f B tree , σ ,
[ P odd f ] tree λ odd ( α , ϑ ) [ f ] tree + C odd f σ ,
with
λ odd ( α , ϑ ) C α 6 ϑ < 1 .
Proof. 
We specialize the proof of Lemma 4.9 to α = 1 2 and ϑ = 1 5 , making the constants explicit.
Recall
( P odd f ) ( n ) = 1 { n 4 ( 6 ) } f n 1 3 ( n 1 ) / 3 ,
and for each j 0 ,
A j ( f ) : = sup m , n I j m n W α ( m , n ) P odd f ( m ) P odd f ( n ) , [ P odd f ] tree = j 0 ϑ j A j ( f ) ,
where I j = [ 6 j , 2 · 6 j ) and W α ( m , n ) = m n | m n | ( m + n ) α . We take α = 1 2 from now on, so
W 1 / 2 ( m , n ) = m n | m n | ( m + n ) 1 / 2 .
Fix j 0 and m , n I j , m n . As in Lemma 4.9, we distinguish three cases.
Case 1: neither m nor n is 4 ( mod 6 ) . Then P odd f ( m ) = P odd f ( n ) = 0 and this pair contributes nothing to A j ( f ) .
Case 2: exactly one of m , n is 4 ( mod 6 ) . Assume without loss of generality m 4 ( mod 6 ) and n ¬ 4 ( mod 6 ) . Set k = ( m 1 ) / 3 . Then
P odd f ( m ) P odd f ( n ) = f ( k ) k ,
so
W 1 / 2 ( m , n ) P odd f ( m ) P odd f ( n ) = W 1 / 2 ( m , n ) | f ( k ) | k .
Since m , n I j , we have 6 j m , n < 2 · 6 j and 1 | m n | 6 j ; hence
W 1 / 2 ( m , n ) = m n | m n | ( m + n ) 1 / 2 6 2 j 6 j 6 j / 2 = 6 ( 1 / 2 ) j .
Also k = ( m 1 ) / 3 6 j 1 . Thus for some absolute constant C 1 ,
ϑ j W 1 / 2 ( m , n ) | f ( k ) | k C 1 ϑ 6 1 / 2 j | f ( k ) | .
Now ϑ = 1 5 and 6 1 / 2 < 2.5 , so ϑ 6 1 / 2 < 1 . Each k arises (from such a case) for at most one j and one m, and
| f ( k ) | = k σ | f ( k ) | k σ k σ f σ 6 σ j f σ .
Summing over j and all such pairs gives
j 0 ϑ j sup m , n I j exactly one 4 ( 6 ) W 1 / 2 ( m , n ) P odd f ( m ) P odd f ( n ) C odd , 1 f σ
for some C odd , 1 > 0 depending only on σ . Thus Case 2 contributes only to the weak term.
Case 3: both m and n are 4 ( mod 6 ) . Set
m = m 1 3 , n = n 1 3 .
Then
P odd f ( m ) = f ( m ) m , P odd f ( n ) = f ( n ) n .
We decompose
f ( m ) m f ( n ) n = f ( m ) f ( n ) m = : D 1 + f ( n ) 1 m 1 n = : D 2 .
Case 3a: the D 1 term (contraction part). We first compare the weights W 1 / 2 ( m , n ) and W 1 / 2 ( m , n ) .
Using m = 3 m + 1 , n = 3 n + 1 we compute
W 1 / 2 ( m , n ) W 1 / 2 ( m , n ) = ( 3 m + 1 ) ( 3 n + 1 ) 3 m n ( m + n ) 1 / 2 ( 3 ( m + n ) + 2 ) 1 / 2 .
For all m , n 1 ,
3 m + 1 4 m , 3 n + 1 4 n , 3 ( m + n ) + 2 3 ( m + n ) ,
so
W 1 / 2 ( m , n ) W 1 / 2 ( m , n ) 16 3 · 1 3 = 16 3 3 / 2 = : C 0 .
Thus
W 1 / 2 ( m , n ) | f ( m ) f ( n ) | m C 0 W 1 / 2 ( m , n ) | f ( m ) f ( n ) | m .
Next, since m I j implies m 6 j 1 , we have 1 / m 6 ( j 1 ) . Moreover ( m , n ) lie in a union of O ( 1 ) blocks of level j 1 (and possibly j 2 ), so
W 1 / 2 ( m , n ) | f ( m ) f ( n ) | ϑ ( j 1 ) [ f ] tree
up to a fixed multiplicative constant (absorbed into C 0 ). Combining with (52),
ϑ j W 1 / 2 ( m , n ) | f ( m ) f ( n ) | m C 0 ϑ j 6 ( j 1 ) ϑ ( j 1 ) [ f ] tree = C 0 ϑ ϑ 6 j 1 [ f ] tree .
Summing over j 1 gives
j 0 ϑ j A j ( 1 ) ( f ) C 0 ϑ 1 ϑ / 6 [ f ] tree .
Define
λ odd : = C 0 ϑ 1 ϑ / 6 and C α : = 6 C 0 1 ϑ / 6 .
Then
λ odd = C α 6 ϑ .
For ϑ = 1 5 we have 1 ϑ / 6 = 1 1 30 > 0 and numerically
C 0 = 16 3 3 / 2 < 3.1 , λ odd = C 0 ϑ 1 ϑ / 6 < 0.64 < 1 ,
so indeed λ odd < 1 and λ odd = ( C α / 6 ) ϑ with this choice of C α .
Case 3b: the D 2 term (weak contribution). We have
| D 2 | = | f ( n ) | | m n | m n .
Using | m n | = 3 | m n | and the same scale relations as above,
W 1 / 2 ( m , n ) | D 2 | = m n | m n | ( m + n ) 1 / 2 | f ( n ) | | m n | m n 6 j / 2 | f ( n ) | .
Thus
ϑ j W 1 / 2 ( m , n ) | D 2 | ( ϑ 6 1 / 2 ) j | f ( n ) | .
Each n arises from at most a bounded number of ( m , n , j ) , and ϑ 6 1 / 2 < 1 , so summing over j and using | f ( n ) | n σ f σ yields
j 0 ϑ j sup m , n I j m n 4 ( 6 ) W 1 / 2 ( m , n ) | D 2 | C odd , 2 f σ
for some C odd , 2 > 0 . Combining the three cases, we obtain
[ P odd f ] tree λ odd [ f ] tree + ( C odd , 1 + C odd , 2 ) f σ .
Setting C odd : = C odd , 1 + C odd , 2 and using the explicit expression λ odd = ( C α / 6 ) ϑ with λ odd < 1 for ( α , ϑ ) = ( 1 2 , 1 5 ) gives (50) and (51). □
Proposition 4.17 
(Verified Lasota–Yorke contraction). Let ( α , ϑ ) = 1 2 , 1 5 and σ > 1 (with the admissibility condition ϑ 6 σ α < 1 ). Define
λ LY : = 2 ( 1 α ) + λ odd ( α , ϑ ) , λ odd ( α , ϑ ) C 0 6 ϑ ,
with C 0 = 16 / 3 3 / 2 from Lemma 4.15. Then λ LY < 1 , and for all f B tree , σ ,
[ P f ] tree λ LY [ f ] tree + C LY f σ ,
for some constant C LY > 0 depending only on the fixed parameters and the block geometry.
Proof. 
We use the decomposition P = P even + P odd and the branchwise estimates already established.
1. Combine even and odd branch inequalities. For any f B tree , σ ,
[ P f ] tree [ P even f ] tree + [ P odd f ] tree .
By the even-branch Lasota–Yorke estimate (Lemma 4.7, specialized to B tree , σ ), there exists C even > 0 such that for ( α , ϑ ) fixed,
[ P even f ] tree 2 ( 1 α ) ϑ [ f ] tree + C even f σ .
By the explicit odd-branch lemma (Lemma 4.16), for α = 1 2 and ϑ = 1 5 there exist C α > 0 and C odd > 0 such that
[ P odd f ] tree λ odd ( α , ϑ ) [ f ] tree + C odd f σ ,
with
λ odd ( α , ϑ ) C α 6 ϑ < 1 .
Adding (54) and (55) gives
[ P f ] tree 2 ( 1 α ) ϑ + λ odd ( α , ϑ ) [ f ] tree + ( C even + C odd ) f σ .
Define
λ LY : = 2 ( 1 α ) ϑ + λ odd ( α , ϑ ) , C LY : = C even + C odd ,
to obtain (53).
2. Verification that λ LY < 1 . We now check that with ( α , ϑ ) = ( 1 2 , 1 5 ) the constant λ LY is strictly less than 1.
First,
2 ( 1 α ) ϑ = 2 1 / 2 · 1 5 = 1 5 2 0.1414 .
From the proof of Lemma 4.16 we have
λ odd ( α , ϑ ) = C α 6 ϑ ,
with an explicit choice
C α = 6 C 0 1 ϑ / 6 , C 0 = 16 3 3 / 2 ,
so that
λ odd ( α , ϑ ) = C 0 ϑ 1 ϑ / 6 .
For ϑ = 1 5 this yields
λ odd ( 1 2 , 1 5 ) = C 0 / 5 1 1 / 30 = C 0 5 · 30 29 = 6 C 0 29 .
Since C 0 = 16 / 3 3 / 2 < 3.1 , we obtain
λ odd ( 1 2 , 1 5 ) < 6 · 3.1 29 0.641 < 1 .
Therefore
λ LY = 2 1 / 2 · 1 5 + λ odd ( 1 2 , 1 5 ) < 0.1414 + 0.641 < 0.79 < 1 .
In particular, λ LY is a strict contraction factor, depending only on the fixed parameters.
This proves both the inequality (53) and the bound λ LY < 1 . □
Lemma 4.18 
(Asymptotic form of the invariant density). Let P act on B tree , σ with σ > 1 and suppose P is quasi–compact with spectral gap and no other spectrum on the unit circle. Let h B tree , σ be the unique positive right eigenvector with P h = h and normalize the dual eigenfunctional ϕ by ϕ ( h ) = 1 . Then there exist constants c > 0 and δ > 0 (depending only on the parameters of the Lasota–Yorke framework) such that
h ( n ) = c n 1 + O n δ ( n ) .
Proof. 
Set H ( s ) : = n 1 h ( n ) n s for ( s ) > σ . We proceed in three steps.
Step 1 (Meromorphic structure of H and the pole at s = 1 ). By the Dirichlet transform intertwinement (Section 3) and the quasi–compact spectral calculus on B tree , σ (Section 4), Dirichlet transforms of B tree , σ -functions admit meromorphic continuation across a half–plane ( s ) > 1 δ 0 for some δ 0 ( 0 , 1 ) , with at most a simple pole at s = 1 whose residue is computed by the spectral projector Π f = ϕ ( f ) h . Applying this to f = h and using P h = h , we obtain that H extends meromorphically to ( s ) > 1 δ 0 with the expansion
H ( s ) = c s 1 + G ( s ) , ( s ) > 1 δ 0 ,
where c : = ϕ ( 1 ) > 0 and G is holomorphic on ( s ) > 1 δ 0 and of at most polynomial growth in vertical strips.1
Step 2 (Tauberian step: summatory asymptotic). Define the summatory function H # ( x ) : = n x h ( n ) . Since H has no singularities on { ( s ) = 1 } other than the simple pole at s = 1 and satisfies the growth hypothesis of the Wiener–Ikehara–Delange Tauberian theorem [12] in the half–plane ( s ) > 1 δ 0 , it follows that
H # ( x ) = c log x + C 0 + O x δ 1 ( x ) ,
for some constants C 0 R and δ 1 ( 0 , δ 0 ) (the precise δ 1 is inherited from the width δ 0 and strip–growth of G). See, e.g., Delange’s theorem or the Ikehara–Ingham variant.
Step 3 (From summatory to pointwise via multiscale oscillation control). Write a n : = n h ( n ) and let X > 1 . For each dyadic–triadic block I j = [ 6 j , 2 · 6 j ) defining the strong seminorm [ · ] tree , σ , the Lasota–Yorke inequality yields a uniform oscillation bound
osc I j ( a ) : = sup n , m I j | a n a m | C 6 j η
for some C > 0 and η ( 0 , 1 ) depending only on the Lasota–Yorke parameters (this is the standard consequence of the contraction of the strong seminorm together with boundedness in the weak norm). In particular a n varies slowly on each block I j .
By summation by parts on each I j and (57), we obtain the averaged estimate
1 | I j | n I j a n = 1 | I j | n I j n h ( n ) = c + O 6 j δ 1 .
Combining this block average with the oscillation control (58) gives, for every n I j ,
a n = c + O 6 j δ , δ : = min { δ 1 , η } .
Since n 6 j on I j , this is equivalent to
n h ( n ) = c + O n δ ,
hence
h ( n ) = c n 1 + O n δ ,
as claimed. □
We now record the standard consequence of the Lasota–Yorke inequality and the compact embedding of B tree into 1 .
Theorem 4.19 
(Quasi-compactness on B tree , σ ). Let 0 < α < 1 , 0 < ϑ < 1 , and σ > 1 . Assume that the Lasota–Yorke constant
λ ( α , ϑ ) : = 2 ( 1 α ) + λ odd ( α , ϑ )
satisfies λ ( α , ϑ ) < 1 , where λ odd ( α , ϑ ) is as in Lemma 4.9. Then the backward transfer operator P acting on B tree , σ is quasi-compact, and its essential spectral radius satisfies
ρ ess ( P | B tree , σ ) λ ( α , ϑ ) < 1 .
Proof. 
We work on the Banach space B tree , σ with norm · tree , σ = · σ + [ · ] tree , where · σ is the weighted σ 1 -norm and [ · ] tree is the tree seminorm defined in Section 4.3.
Step 1: Lasota–Yorke inequality. By Proposition 4.12 (applied in the weighted setting, with f 1 replaced by f σ ) we have, for all f B tree , σ ,
[ P f ] tree λ ( α , ϑ ) [ f ] tree + C LY f σ ,
with λ ( α , ϑ ) < 1 by assumption. On the weak norm side, since P is bounded on σ 1 , there exists C σ > 0 (e.g. C σ = Λ σ from (17)) such that
P f σ C σ f σ for all f B tree , σ .
Thus P satisfies a standard two-norm Lasota–Yorke inequality on B tree , σ with strong seminorm · s : = [ · ] tree and weak norm · w : = · σ :
P f s λ f s + C LY f w , P f w C σ f w .
Step 2: Compact embedding. By Lemma 4.5, the embedding
J : ( B tree , σ , · tree , σ ) ( σ 1 , · σ )
is compact. Since · w = · σ is exactly the weak norm used in (62), this shows that the unit ball of B tree , σ is relatively compact for the weak norm.
Step 3: Application of Ionescu–Tulcea–Marinescu / Hennion. We now invoke the standard quasi-compactness criterion (see, e.g., Ionescu–Tulcea and Marinescu, or Hennion’s theorem): if a bounded operator T on a Banach space X satisfies
(i)
a Lasota–Yorke inequality T x s λ x s + C x w with λ < 1 ,
(ii)
a weak bound T x w C x w , and
(iii)
the injection ( X , · s ) ( X , · w ) has relatively compact unit ball,
then T is quasi-compact on X and its essential spectral radius satisfies
ρ ess ( T ) λ .
Conditions (i)–(iii) are exactly (62) and Lemma 4.5 for T = P and X = B tree , σ . Therefore P is quasi-compact on B tree , σ and
ρ ess ( P | B tree , σ ) λ ( α , ϑ ) < 1 ,
which is (59). □
Remark 4.20
(On the choice of parameters). The explicit bound (41) shows that λ odd ( α , ϑ ) decreases linearly with ϑ . For fixed α , one can therefore choose ϑ sufficiently small so that λ ( α , ϑ ) < 1 , provided the constant C α is effectively controlled. Subsequent sections make this optimization quantitative by computing C α and exhibiting admissible parameter pairs ( α , ϑ ) that give a strict spectral gap.
The Lasota–Yorke framework developed here supplies the functional-analytic backbone for the spectral approach to the Collatz problem: once explicit parameters with λ ( α , ϑ ) < 1 are verified, the quasi-compactness and spectral gap of P on B tree follow, and the spectral criteria of Section 4 can be invoked to constrain or rule out non-terminating configurations.

5. Spectral Consequences and Effective Block Recursion

Having established in Section 4.4 that the backward Collatz operator P is quasi-compact on the multi-scale tree space B tree , we now turn to the spectral consequences of this result. The Lasota–Yorke inequality ensures the existence of a spectral gap, which in turn controls the structure of invariant densities and the long-term behavior of iterates P k . The objective of this section is to characterize the invariant and quasi-invariant components of P, derive an effective block recursion for their scale-averaged coefficients, and demonstrate that the recursion enforces rigidity across the Collatz tree.
Throughout this section, h B tree , σ will denote an invariant density of P, i.e. a function satisfying P h = h . The analysis proceeds in several stages. First, we describe the structure of possible invariant profiles in the multiscale framework and show that the Lasota–Yorke inequality forces uniform flatness across scales. Next, we translate this flatness into an explicit two-sided recurrence relation for block averages c j . Finally, we verify that the coefficients of this recurrence satisfy a spectral bound consistent with the contraction constant λ odd ( α , ϑ ) computed earlier.
Theorem 5.1 
(Perron–Frobenius structure on B tree , σ ). Let P be the backward Collatz transfer operator acting on B tree , σ with parameters ( α , ϑ , σ ) chosen so that the Lasota–Yorke inequality and quasi–compactness hold. Then:
1. 
The spectral radius of P equals 1, and 1 is a simple eigenvalue.
2. 
There exists a unique eigenvector h B tree , σ with h > 0 and P h = h , normalized by ϕ ( h ) = 1 .
3. 
There exists a unique positive eigenfunctional ϕ B tree , σ * such that ϕ P = ϕ .
4. 
All other spectral values satisfy | z | < 1 , and P admits the spectral decomposition
P = h ϕ + Q , ρ ( Q ) < 1 ,
where Q is quasi–compact.
Proof. 
We combine the Lasota–Yorke inequality on B tree , σ with standard Perron–Frobenius theory for positive quasi–compact operators.
Step 1: Spectral radius and quasi–compactness. By construction P is a bounded linear operator on B tree , σ and is positive in the sense that f 0 implies P f 0 . The Lasota–Yorke inequality on B tree , σ (Proposition 4.12, say) together with the compact embedding of the strong seminorm into the weak norm implies that P is quasi–compact on B tree , σ with essential spectral radius strictly less than 1:
ρ ess ( P ) < 1 .
On the other hand, the logarithmic mass–preservation identity (Lemma 2.4) shows that the spectral radius of P is at least 1; the boundedness of P implies ρ ( P ) 1 , hence
ρ ( P ) = 1 .
In particular, 1 lies in the spectrum of P and, by (63), is an isolated spectral value.
Step 2: Existence of a positive eigenvector. Consider the positive cone
C : = { f B tree , σ : f 0 } ,
which is closed, convex, and reproducing. Since P is positive and ρ ( P ) = 1 , the Krein–Rutman theorem for positive operators on Banach spaces implies the existence of a nonzero h C such that
P h = h .
Moreover, h can be chosen strictly positive in the sense that h ( n ) > 0 for all n N : indeed, by the preimage structure of the Collatz map (Lemma 2.3) and the connectivity of the backward tree, any nontrivial f C is eventually propagated by iterates of P to a function that is positive on every block I j , so P k f > 0 for all sufficiently large k. Replacing h by P k h if necessary yields h > 0 .
Step 3: Uniqueness and simplicity of the eigenvalue 1. We now show that 1 is a simple eigenvalue and that h is unique up to scalar multiples. Suppose g B tree , σ satisfies P g = g . Decompose g = g + g into positive parts. Positivity of P implies P g ± = g ± . By the strong positivity argument above, any nonzero f C with P f = f must be strictly positive; hence g + and g are both either 0 or strictly positive. If both were nonzero, then g + and g would be linearly independent positive eigenvectors for the eigenvalue 1, and the positive cone would contain a two-dimensional face of eigenvectors. This contradicts the Krein–Rutman conclusion that the eigenspace associated with the spectral radius is one–dimensional. Therefore one of g + , g must vanish and g is either nonnegative or nonpositive; by replacing g by g if necessary, g 0 , and the strong positivity then forces g to be a scalar multiple of h. Thus the eigenspace for the eigenvalue 1 is one–dimensional and spanned by h, and 1 is a simple eigenvalue. This proves (1) and the first part of (2) after normalizing by ϕ ( h ) = 1 below.
Step 4: Dual eigenfunctional. Consider the dual operator P * acting on B tree , σ * . Since P is positive, so is P * on the dual cone
C * : = { ψ B tree , σ * : ψ ( f ) 0 for all f C } .
The quasi–compactness of P implies quasi–compactness of P * on the dual space. By (64), P * also has spectral radius 1. Applying the same Krein–Rutman argument to P * yields a nonzero ϕ C * and
ϕ P = ϕ ,
with ϕ strictly positive on nonzero elements of C . The same simplicity argument as in Step 3 shows that the eigenspace of P * for the eigenvalue 1 is one–dimensional and spanned by ϕ . Normalizing by the condition ϕ ( h ) = 1 gives the uniquely determined eigenpair ( h , ϕ ) appearing in the statement. This establishes (2) and (3).
Step 5: Spectral decomposition and spectral gap. Quasi–compactness of P on B tree , σ , together with (63) and the simplicity of the eigenvalue 1, implies that the spectrum of P is contained in { 1 } { z : | z | < r } for some r < 1 . Let Π denote the spectral projection onto the eigenspace associated with λ = 1 ; by the previous steps,
Π f = h ϕ ( f ) , f B tree , σ ,
so that Π = h ϕ as a rank–one operator. Writing
P = Π + Q = h ϕ + Q ,
we have Q = P Π and Q Π = Π Q = 0 . The spectrum of Q is contained in { z : | z | < r } , so in particular
ρ ( Q ) < 1 .
Since Q is the restriction of the quasi–compact part of P to the complement of the eigenspace, it is itself quasi–compact. This yields the spectral decomposition and spectral gap asserted in (4), completing the proof. □
Proposition 5.2 
(Forward dynamics and P-invariant functionals). Let 0 < α , ϑ < 1 and σ > 1 . Consider the pairing f , φ : = n 1 f ( n ) φ ( n ) between B tree , σ and
B tree , σ * : = φ : N C : φ * : = sup j 0 ϑ j osc I j φ + sup j 0 6 σ j n I j | φ ( n ) | < ,
where osc I j φ : = sup m , n I j | φ ( m ) φ ( n ) | . Then · , · extends continuously to B tree , σ × B tree , σ * , and the adjoint
( P * φ ) ( m ) = 1 m 1 { 2 m } φ ( m / 2 ) + 1 { m odd } φ ( 3 m + 1 ) .
Moreover, there exist constants C σ > 0 and M σ 1 such that
( P * ) k B tree , σ * B tree , σ * C σ M σ k , k 0 ,
and the Cesàro averages Φ N : = 1 N k = 0 N 1 ( P * ) k φ form a bounded set in B tree , σ * for every φ B tree , σ * .
Positive-frequency divergent families.Suppose there exist c > 0 and an infinite set of scales J N such that for each j J there is a finite set A j I j with | A j | c | I j | and forward trajectories that visit A j with asymptotic frequency c . For a summable weight sequence ( w j ) j 0 with j w j ϑ j < and j w j 6 σ j < , define
φ j ( n ) : = w j | A j | 1 A j ( n ) , φ : = j J φ j .
Then φ B tree , σ * , the Cesàro averages Φ N are bounded in B tree , σ * , and any weak-* limit point Φ satisfies P * Φ = Φ and Φ 0 . Consequently ( f ) : = f , Φ is a nonzero invariant functional with P = .
Proof. 
Continuity of the pairing. Fix j and set c j : = | I j | 1 n I j f ( n ) and φ I j : = | I j | 1 n I j φ ( n ) . Then
n I j f ( n ) φ ( n ) = n I j ( f ( n ) c j ) φ ( n ) φ I j + c j n I j φ ( n ) .
(a) Oscillatory term. Using I j ( f c j ) = 0 and osc I j φ : = sup u , v I j | φ ( u ) φ ( v ) | ,
n I j ( f ( n ) c j ) φ ( n ) φ I j osc I j φ n I j | f ( n ) c j | .
By the tree seminorm and the block geometry (since W α 6 ( 1 α ) j on I j ),
osc I j f K α ϑ j 6 ( 1 α ) j [ f ] tree , n I j | f ( n ) c j | | I j | osc I j f C ϑ j 6 α j [ f ] tree .
Therefore
n I j ( f ( n ) c j ) φ ( n ) φ I j C ϑ j 6 α j [ f ] tree osc I j φ .
Multiply and divide by ϑ j and take sup j ϑ j osc I j φ to get
j 0 I j ( f c j ) ( φ φ I j ) C [ f ] tree sup j 0 ϑ j osc I j φ j 0 ϑ 2 j 6 α j .
Since α > 0 , we can absorb j ϑ 2 j 6 α j into the constant (using that ϑ ( 0 , 1 ) is fixed), hence
j 0 I j ( f c j ) ( φ φ I j ) C [ f ] tree φ * .
(b) Mean term. By averaging and the weighted norm,
| c j | 1 | I j | n I j | f ( n ) | 1 | I j | n I j n σ | f ( n ) | n σ C 6 ( σ 1 ) j f σ 1 .
Hence
c j n I j φ ( n ) C 6 ( σ 1 ) j f σ 1 6 σ j 6 σ j I j | φ | C 6 j f σ 1 sup j 0 6 σ j I j | φ | .
Summing over j gives a finite geometric series:
j 0 c j I j φ C f σ 1 φ * .
Combining (a) and (b) yields | f , φ | C [ f ] tree + f σ 1 φ * = C f tree , σ φ * .

5.1. Redesigned Multiscale Space and Invariant Profiles

The quasi-compactness of P implies that its spectrum consists of a discrete set of eigenvalues of finite multiplicity outside a disk of radius ρ ess ( P ) λ LY < 1 , together with a residual spectrum contained in that disk. Let λ 0 = 1 denote the trivial eigenvalue corresponding to constant functions. Any additional eigenvalues with | λ | < 1 correspond to exponentially decaying modes. Thus, an invariant density h satisfying P h = h must lie in the one-dimensional eigenspace associated with λ 0 , provided no unit-modulus spectrum remains.
However, to make this conclusion effective, one must exclude the possibility of small oscillatory components that project into higher spectral modes but decay too slowly to be detected by the weak 1 norm alone. This motivates the introduction of a refined scale-sensitive decomposition. Define block intervals I j as in (34), and let
H j ( h ) : = n I j h ( n ) , c j : = H j ( h ) | I j | = H j ( h ) 6 j .
The sequence ( c j ) j 0 captures the mean behavior of h across successive scales in the backward tree. Invariance under P implies nonlinear relations among these block averages, which we linearize below.
Lemma 5.3 
(Block-level invariance relation). Let 0 < α < 1 , 0 < ϑ < 1 , and σ > 1 , and let h B tree , σ satisfy P h = h . For each j 0 define the block average
c j : = 1 | I j | n I j h ( n ) , | I j | : = # I j .
Then there exist sequences ( a j ) j 0 , ( b j ) j 0 with a j , b j 0 and a sequence ( ε j ) j 0 such that
c j = a j c j + 1 + b j c j 1 + ε j ,
where a j and b j are determined by the local distribution of even and odd preimages between neighboring scales, and the error sequence ε = ( ε j ) is summable in the weighted norm, i.e.
j 0 ϑ j | ε j | < .
Proof. 
Throughout, fix h B tree , σ with P h = h .
1. Start from the invariance equation on each block. For each j 0 ,
| I j | c j = n I j h ( n ) = n I j ( P h ) ( n ) = n I j h ( 2 n ) 2 n + 1 { n 4 ( 6 ) } h n 1 3 ( n 1 ) / 3 .
Write
S j even : = n I j h ( 2 n ) 2 n , S j odd : = n I j n 4 ( 6 ) h n 1 3 ( n 1 ) / 3 ,
so that
| I j | c j = S j even + S j odd .
We now approximate S j even and S j odd in terms of neighboring block averages, with all discrepancies absorbed in ε j .
2. Even branch contribution. For n I j , the even preimage is m = 2 n , and
S j even = n I j h ( 2 n ) 2 n = m 2 I j h ( m ) m ,
where 2 I j : = { 2 n : n I j } . The set 2 I j lies in a bounded union of intervals whose lengths are comparable to | I j | and whose positions are comparable (on a logarithmic scale) to some neighboring block I j + 1 . We decompose
h ( m ) = c j + 1 + h ( m ) c j + 1
for those m whose scale is that of I j + 1 , and similarly for indices belonging to at most finitely many adjacent blocks. This yields
S j even = a j ( even ) | I j | c j + 1 + R j even ,
where
a j ( even ) : = 1 | I j | n I j 1 2 n 1 { 2 n lies in the next scale block ( s ) } ,
and R j even collects:
(i)
contributions from h ( m ) c k within the relevant blocks,
(ii)
contributions from even preimages m falling outside the chosen neighboring blocks.
Because h B tree , σ , its oscillation inside each block is controlled by [ h ] tree , so replacing h ( m ) by the corresponding block average c k incurs an error bounded by
| h ( m ) c k | [ h ] tree W α ( m 1 , m 2 )
for suitable m 1 , m 2 in that block; the precise bound is obtained by choosing m 1 , m 2 maximizing the tree seminorm at that scale and using the definition of [ h ] tree . After dividing by m (which is 6 j at this scale) and averaging over I j , we get
| R j even | 6 j [ h ] tree + 6 j σ h σ ,
where the second term accounts for the finitely many preimages lying outside the neighboring blocks, using the weighted σ 1 bound on h. Thus
j 0 ϑ j | R j even | < .
By construction a j ( even ) 0 .
3. Odd branch contribution. For n 4 ( mod 6 ) , the odd preimage is m = ( n 1 ) / 3 , and
S j odd = n I j n 4 ( 6 ) h ( m ) m .
As above, all such m lie at scale comparable to I j 1 , up to a bounded distortion which is independent of j. We write
h ( m ) = c j 1 + h ( m ) c j 1 ,
and obtain
S j odd = b j ( odd ) | I j | c j 1 + R j odd ,
where
b j ( odd ) : = 1 | I j | n I j n 4 ( 6 ) 1 ( n 1 ) / 3 ,
and R j odd collects:
(i)
the errors from replacing h ( m ) by c j 1 ,
(ii)
any edge effects from m lying just outside I j 1 .
All indices m whose images under the even/odd branches land outside the adjacent blocks are absorbed into R j even and R j odd ; these edge spillovers are ϑ -summable thanks to σ > 1 and the block oscillation control from [ h ] tree .
As before, the tree seminorm controls oscillations within blocks, so | h ( m ) c j 1 | is bounded by a multiple of [ h ] tree times a scale factor, and dividing by m 6 j 1 yields
| R j odd | 6 j [ h ] tree + 6 j σ h σ .
Thus
j 0 ϑ j | R j odd | < .
By construction b j ( odd ) 0 .
4. Assemble the block relation. Substituting (74) and (76) into (73), we obtain
| I j | c j = a j ( even ) | I j | c j + 1 + b j ( odd ) | I j | c j 1 + R j even + R j odd .
Dividing by | I j | gives
c j = a j ( even ) c j + 1 + b j ( odd ) c j 1 + ε j ,
where
ε j : = R j even + R j odd | I j | .
Set a j : = a j ( even ) and b j : = b j ( odd ) . By construction a j , b j 0 , and they encode the (normalized) weights of even and odd preimages between the neighboring scales. Moreover, using | I j | 6 j together with (75) and (77), we obtain
j 0 ϑ j | ε j | j 0 ϑ j | R j even | + | R j odd | | I j | < ,
since the additional factor 1 / | I j | 6 j makes the series converge absolutely once σ > 1 and [ h ] tree is finite. This is exactly (72).
Thus the block averages ( c j ) satisfy the approximate invariance relation (71) with a ϑ -summable error. □
Lemma 5.4 
(Limiting preimage ratios). Let ( I j ) j 0 be the multiscale blocks
I j = [ 6 j , 2 · 6 j ) N , | I j | = 6 j .
Define a j and b j as in Lemma 5.3, i.e. as the normalized contributions (depending only on the preimage structure of T) of even and odd preimages from neighboring scales to the block relation
c j = a j c j + 1 + b j c j 1 + ε j ,
for block averages c j of any invariant profile h with P h = h . Then there exist constants a , b > 0 such that
lim j a j = a , lim j b j = b ,
and
a + b = 1 , 0 < b < a < 1 .
Moreover, there exist C > 0 and 0 < δ < 1 (independent of h) such that for all j 0 ,
| a j a | + | b j b | C δ j .
Proof. 
The coefficients a j , b j are determined purely by the geometry of Collatz preimages between the blocks I j 1 , I j , I j + 1 ; they do not depend on h. We make this explicit.
1. Preimage windows and raw counts. For m N , the Collatz map, (1) has two inverse branches:
n 2 n ( even branch ) , n n 1 3 when n 4 ( mod 6 ) ( odd branch ) .
In the block relation of Lemma 5.3, only preimages that land in the adjacent large scales contribute to the “main” coefficients a j , b j ; all other preimages (falling into gaps or non-adjacent blocks) are assigned to the perturbation ε j .
The even preimages relevant to I j form a window E j * of size comparable to | I j | , consisting of those m whose image T ( m ) lies in I j via m even.
he odd preimages relevant to I j form a thinner window O j * , consisting of those odd m with T ( m ) = 3 m + 1 I j (equivalently, n : = 3 m + 1 I j and n 4 ( mod 6 ) ).
A direct count shows:
1. For the even window, each n I j has an even preimage 2 n , so
| E j * | = | I j | = 6 j .
2. For the odd window, we need n I j with n 4 ( mod 6 ) and then m = ( n 1 ) / 3 odd. Among the | I j | = 6 j integers in I j , exactly one in every six is 4 ( mod 6 ) , up to boundary effects. Hence
| O j * | = 1 6 | I j | + O ( 1 ) = 6 j 1 + O ( 1 ) ,
so in particular | O j * | > 0 for all sufficiently large j.
Thus the total number of “neighboring-scale” preimages associated with I j is
| E j * | + | O j * | = 1 + 1 6 | I j | + O ( 1 ) = 7 6 6 j + O ( 1 ) .
2. Canonical normalization of a j , b j . By Lemma 5.3, the coefficients a j , b j are defined as the normalized weights of even vs. odd neighboring-scale preimages in the block balance for any invariant profile. Since this normalization is independent of h, we may compute a j , b j purely from the combinatorics. The natural choice is:
a j : = | E j * | | E j * | + | O j * | , b j : = | O j * | | E j * | + | O j * | .
These are exactly the “ratios of the number of even and odd preimages between adjacent scales” announced in Lemma 5.3.
Using the counts above,
a j = 6 j 6 j + 6 j 1 + O ( 1 ) = 1 1 + 1 6 + O ( 6 j ) = 6 7 + O ( 6 j ) , b j = 6 j 1 + O ( 1 ) 6 j + 6 j 1 + O ( 1 ) = 1 6 + O ( 6 j ) 1 + 1 6 + O ( 6 j ) = 1 7 + O ( 6 j ) .
In particular, there exist limits
a = lim j a j = 6 7 , b = lim j b j = 1 7 ,
and there exists C > 0 such that, for all j,
| a j a | + | b j b | C 6 j .
Thus the desired exponential convergence holds with δ : = 1 / 6 ( 0 , 1 ) .
3. Structural properties. From the explicit limits we immediately have
a + b = 6 7 + 1 7 = 1 , 0 < b < a < 1 .
Alternatively, the identity a j + b j = 1 holds exactly for each j when tested against the constant profile h 1 (for which the block perturbation ε j vanishes), and passes to the limit as j .
Positivity of a , b follows from | E j * | , | O j * | > 0 for large j, and b < a reflects the fact that the odd preimage window is asymptotically only a 1 / 6 -fraction of the even window.
This completes the proof. □
Lemma 5.5 
(Uniform convergence of the coefficient matrices). Let
M j = 0 a j b j 0 , M = 0 a b 0 ,
where a j a and b j b satisfy | a j a | + | b j b | C δ j for some 0 < δ < 1 as in Lemma 5.4. Then for any matrix norm · ,
M j M C δ j .
In particular,
j j 0 ϑ j M j M < ,
so M j M exponentially fast in the sense required by the discrete variation-of-constants argument.
Proof. 
By definition,
M j M = 0 a j a b j b 0 .
Let · be any matrix norm on 2 × 2 real matrices. Since all norms on R 2 × 2 are equivalent and the space is finite-dimensional, there exists a constant K > 0 (depending only on the choice of norm) such that for any matrix A = ( a m n ) m , n = 1 2 ,
A K max m , n | a m n | .
Applying (79) to A = M j M gives
M j M K max | a j a | , | b j b | .
By Lemma 5.4, the preimage ratios satisfy the exponential convergence
| a j a | + | b j b | C δ j , 0 < δ < 1 .
In particular,
max { | a j a | , | b j b | } | a j a | + | b j b | C δ j .
Combining the two inequalities yields
M j M K C δ j .
Setting C : = K C gives the claimed bound
M j M C δ j .
Finally, since 0 < ϑ < 1 and 0 < δ < 1 , the product ϑ δ < 1 , and therefore
j j 0 ϑ j M j M C j j 0 ( ϑ δ ) j < .
Thus M j M exponentially fast in any matrix norm, establishing the uniform convergence required for the discrete variation-of-constants argument. □
Proposition 5.6 
(Effective recursion for peripheral eigenfunctions). Let 0 < α < 1 , 0 < ϑ < 1 , σ > 1 , and let h B tree , σ satisfy P h = λ h with | λ | = 1 . Let H j : = n I j h ( n ) and c j : = H j / | I j | be the block sums and block averages on I j = [ 6 j , 2 · 6 j ) N . Then, with a , b > 0 as in Lemma 5.4, there exists a sequence ( ε j ) j 1 with j 1 | ε j | ϑ j < such that
c j = λ 1 a c j + 1 + λ 1 b c j 1 + ε j , j 1 .
Equivalently, for the renormalized averages d j : = λ j c j we have
d j = a d j + 1 + b d j 1 + ε ˜ j , j 1 | ε ˜ j | ϑ j < ,
with ε ˜ j : = λ j ε j .
Proof. 
Step 1: Block summation of the eigenrelation. Summing P h = λ h over n I j gives
n I j ( P h ) ( n ) = λ n I j h ( n ) = λ H j .
By the definition of P = P even + P odd ,
n I j ( P h ) ( n ) = n I j h ( 2 n ) 2 n + n I j n 4 ( 6 ) h n 1 3 ( n 1 ) / 3 = : S j even + S j odd .
As in the proof of Lemma 5.3 (the λ = 1 case), we reorganize each sum by changing variables along the inverse branches and separating the main contributions that land in adjacent scales ( I j + 1 for the even branch, I j 1 for the odd branch) from the boundary remainders (spillovers due to the half-open endpoints and the congruence restriction n 4 ( mod 6 ) ). Concretely,
S j even = n I j h ( 2 n ) 2 n = m E j * h ( m ) m + R j even , S j odd = n I j n 4 ( 6 ) h n 1 3 ( n 1 ) / 3 = m O j * h ( m ) m + R j odd ,
where E j * I j + 1 and O j * I j 1 are the preimage windows collecting those m whose images lie in I j under the even and odd branches, respectively, and R j even , R j odd are the boundary remainders (coming from ( I j + 1 E j * ) and ( I j 1 O j * ) ).
Thus
λ H j = m E j * h ( m ) m + m O j * h ( m ) m + R j even + R j odd .
Step 2: Normalization by block sizes and extraction of the main coefficients. Divide by | I j | = 6 j and write c k = H k / | I k | :
λ c j = 1 | I j | m E j * h ( m ) m + 1 | I j | m O j * h ( m ) m + R j even + R j odd | I j | .
Inside each window the points m satisfy m | I j + 1 | (even window) or m | I j 1 | (odd window), so 1 / m fluctuates by a bounded multiplicative factor around 1 / | I j + 1 | or 1 / | I j 1 | . Using the B tree , σ control of oscillations within blocks, this fluctuation contributes only to an error term summable in the weighted ϑ -norm. Hence
1 | I j | m E j * h ( m ) m = | E j * | | I j | · 1 | I j + 1 | m E j * h ( m ) + η j even = a j c j + 1 + η j even ,
and similarly
1 | I j | m O j * h ( m ) m = b j c j 1 + η j odd ,
where a j : = | E j * | / ( | E j * | + | O j * | ) , b j : = | O j * | / ( | E j * | + | O j * | ) (so a j + b j = 1 ), and η j even , η j odd are error terms whose weighted sum j ϑ j | η j · | is finite. The boundary remainders likewise satisfy
j 1 ϑ j | R j even | + | R j odd | | I j | <
by the same block-oscillation and congruence estimates used in Lemma 5.3.
Collecting terms, we obtain
λ c j = a j c j + 1 + b j c j 1 + η j , j 1 ϑ j | η j | < ,
which is the twisted version of the block relation of Lemma 5.3.
Step 3: Freezing the coefficients to the limits a , b . By Lemma 5.15, there exist a , b > 0 with a + b = 1 , 0 < b < a < 1 , and constants C > 0 , 0 < δ < 1 such that | a j a | + | b j b | C δ j for all j. Rewrite (82) as
λ c j = a c j + 1 + b c j 1 + η j + ( a j a ) c j + 1 + ( b j b ) c j 1 = : ζ j .
To show j ϑ j | ζ j | < , it remains to bound the “freezing” errors ( a j a ) c j + 1 and ( b j b ) c j 1 in the weighted sum. As in the proof of Proposition 5.14, h B tree , σ implies the block averages obey the growth bound
| c k | C 0 6 ( σ 1 ) k h σ ( k 0 ) ,
for a constant C 0 depending only on σ and the block geometry. Hence
ϑ j | ( a j a ) c j + 1 | ϑ j C δ j C 0 6 ( σ 1 ) ( j + 1 ) h σ = C ϑ δ 6 σ 1 j h σ ,
and similarly for ( b j b ) c j 1 (with j 1 in place of j + 1 ). Choosing ϑ ( 0 , 1 ) (as done when defining B tree , σ ) small enough so that ϑ δ 6 σ 1 < 1 , these two geometric series converge, uniformly in h up to h σ . Therefore
j 1 ϑ j | ζ j | < .
Set ε j : = λ 1 ζ j and divide the identity by λ (note | λ | = 1 ), which yields (80) with j ϑ j | ε j | = j ϑ j | ζ j | < .
Step 4: Renormalized averages. Define d j : = λ j c j . Multiplying (80) by λ j ,
d j = a d j + 1 + b d j 1 + ε ˜ j , ε ˜ j : = λ j ε j ,
and since | λ | = 1 we have j ϑ j | ε ˜ j | = j ϑ j | ε j | < . This is (81). □
Remark 5.7  
(Admissibility for freezing the coefficients). The “freezing” errors ( a j a ) c j + 1 and ( b j b ) c j 1 are summable in the weighted norm because | a j a | + | b j b | C δ j for some 0 < δ < 1 by Lemma 5.4. Hence
j 0 ϑ j | a j a | + | b j b | < whenever ϑ < δ 1 .
Since δ ( 0 , 1 ) depends only on the block geometry and the parameters ( α , ϑ , σ ) , one may always choose ϑ ( 0 , 1 ) sufficiently small so that the weighted summability condition holds. In particular, the choice ϑ = 1 5 used in the Lasota–Yorke framework is admissible for every σ > 1 .
Remark 5.8  
(Exact normalization of the block coefficients). In Lemma 5.3, the coefficients a j and b j arise from the relative sizes of the even and odd preimage windows:
a j : = | E j * | | E j * | + | O j * | , b j : = | O j * | | E j * | + | O j * | ,
so that a j + b j = 1 for all sufficiently large j. Lemma 5.4 establishes the existence of limits a j a and b j b with
a + b = 1 , 0 < b < a < 1 , | a j a | + | b j b | C δ j
for some constants C > 0 and 0 < δ < 1 depending only on the block geometry and the space parameters.
Remark 5.9  
(Coefficient freezing). The combinatorial structure of the Collatz tree implies that the ratios
a j : = | I j + 1 | 2 | I j | , b j : = | I j 1 | | I j |
stabilize as j . More precisely, Lemma 5.4 shows that
a j a , b j b , a + b = 1 , 0 < b < a < 1 ,
and that the convergence is geometric:
| a j a | + | b j b | C δ j
for some C > 0 and 0 < δ < 1 . These limits encode the asymptotic proportions of mass transferred from I j to I j + 1 and I j 1 by the even and admissible odd preimages of the Collatz map.
Remark 5.10
(Asymptotic limits of the block coefficients). Let a j and b j be the block coefficients
a j : = | I j + 1 | 2 | I j | , b j : = | I j 1 | | I j | ,
arising in the decomposition of block averages under P h = h . Then the Collatz preimage structure and the block geometry imply:
  • a j , b j 0 , and for all sufficiently large j one has
    a j + b j = 1 ;
  • The coefficients converge to limits
    a j a , b j b , ( j ) ,
    where a , b > 0 satisfy
    a + b = 1 , 0 < b < a < 1 ;
  • The convergence is quantitative: there exist constants C > 0 and ϑ ( 0 , 1 ) such that
    | a j a | + | b j b | C ϑ j , j 0 .
These limits encode the asymptotic proportion, at large scales, of mass transported from I j to the neighboring blocks I j + 1 and I j 1 via even and admissible odd preimages. Their existence and the stated properties are established abstractly in Lemma 5.4.
Lemma 5.11 
(Effective block recursion). Let h B tree , σ be the positive invariant density satisfying P h = h . For each scale block I j define
c j : = 1 | I j | n I j h ( n ) , j 0 .
Then there exist sequences ( a j ) j j 0 , ( b j ) j j 0 and an error sequence ( ε j ) j j 0 such that:
1. 
a j , b j 0 and a j + b j = 1 for all j j 0 ;
2. 
a j a and b j b as j , where a , b > 0 satisfy
a + b = 1 , 0 < b < a < 1 ;
3. 
the block averages satisfy the second-order recursion
c j = a j c j + 1 + b j c j 1 + ε j , j j 0 ;
4. 
the perturbations satisfy the weighted summability bound
j j 0 ϑ j | ε j | < .
Moreover, the limits a , b and the summability rate depend only on ( α , ϑ , σ ) and the tree geometry.
Proof. 
Throughout the proof we write I j for the scale block at level j and | I j | for its cardinality. Recall that h is invariant, so for every n 1 ,
h ( n ) = 1 2 h ( 2 n ) + 1 { n 4 ( mod 6 ) } h n 1 3 .
Averaging (84) over n I j yields
c j = E j + O j ,
where
E j : = 1 | I j | n I j 1 2 h ( 2 n ) , O j : = 1 | I j | n I j n 4 ( mod 6 ) h n 1 3 .
Define
ϵ j : = δ j even + δ j odd .
Step 1: Even contribution. Consider the image set
J j even : = { 2 n : n I j } .
By construction of the blocks I j and the fact that their endpoints grow geometrically, J j even lies in a bounded union of blocks at scales j and j + 1 , with a single “main” block at scale j + 1 and boundary pieces of uniformly bounded size. Thus one may decompose I j into disjoint sets A j and B j such that
{ 2 n : n A j } = I j + 1 , { 2 n : n B j } I j bdry I j + 2 bdry ,
and | I k bdry | = O ( 6 j 1 ) uniformly in k.
Decompose
E j = E j ( 1 ) + E j ( 2 ) .
On A j , change variables m = 2 n to obtain
E j ( 1 ) = 1 2 | I j | m I j + 1 h ( m ) = | I j + 1 | 2 | I j | c j + 1 .
For E j ( 2 ) , the boundary structure and the definition of the B tree , σ norm imply that the contribution is controlled by a fixed constant times the block averages at the neighboring levels:
| E j ( 2 ) | C 6 1 ( c j + c j + 2 ) ,
which decays at least like C 6 j . Define
a j : = | I j + 1 | 2 | I j | , δ j even : = E j ( 2 ) .
Then
E j = a j c j + 1 + δ j even , j ϑ j | δ j even | < .
Step 2: Odd contribution. If n 4 ( mod 6 ) and n I j , the odd preimage ( n 1 ) / 3 lies in a bounded union of blocks centered at I j 1 with boundary fragments of size O ( 6 j 1 ) . Thus there is a subset A j I j of admissible indices with
n 1 3 : n A j = I j 1 ,
while the remaining admissible indices form B j and map into boundary pieces.
Decomposing
O j = O j ( 1 ) + O j ( 2 ) ,
a change of variables gives
O j ( 1 ) = 1 | I j | m I j 1 h ( m ) = | I j 1 | | I j | c j 1 .
Set
b j : = | I j 1 | | I j | .
As above, O j ( 2 ) is controlled by boundary contributions and satisfies
| O j ( 2 ) | C 6 1 ( c j 1 + c j + 1 ) ,
so that
δ j odd : = O j ( 2 ) satisfies j ϑ j | δ j odd | < .
Thus
O j = b j c j 1 + δ j odd .
Step 3: The block recursion. Combining c j = E j + O j gives
c j = a j c j + 1 + b j c j 1 + ε j , ε j : = δ j even + δ j odd .
Since the main-part contributions exhaust the mass transferred between scales, one may choose j 0 sufficiently large so that
a j + b j = 1 for all j j 0 ,
with ( a j ) and ( b j ) both nonnegative. The geometric regularity of the blocks implies that
a j a , b j b , a + b = 1 , 0 < b < a < 1 ,
as established abstractly in Lemma 5.4. Finally, the bounds above show that | ε j | C * 6 j for some C * > 0 , hence j j 0 ϑ j | ε j | < .
This proves the claimed block recursion and completes the proof. □
The Lasota–Yorke inequality (46) implies that oscillations of h across successive scales decay geometrically:
[ f ] tree C LY 1 λ LY f 1 ,
so that any invariant h must be essentially flat in the strong seminorm. Translating this statement into block averages gives
| c j + 1 c j | C ϑ j , j 0 ,
for some C > 0 . The decay of successive differences enforces a near-constant profile c j c , and any residual deviation must satisfy the perturbed recursion (71).
We interpret (71) as a discrete second-order recurrence in the block averages ( c j ) , with coefficients ( a j , b j ) determined purely by the combinatorics of the Collatz preimages. In the limit a j a , b j b described in Lemma 5.4, the homogeneous part
c j = a c j + 1 + b c j 1
captures the mean balancing between even and odd contributions across adjacent scales.
Introducing the vector v j : = ( c j , c j 1 ) , the recursion can be written in matrix form
v j + 1 = M v j , M = 0 a b 0 .
The eigenvalues of M are ± a b , so the spectral radius is ρ ( M ) = a b . Since a + b = 1 and 0 < b < a < 1 , we have a b < 1 4 and hence ρ ( M ) < 1 2 < 1 . Consequently, the homogeneous solutions of (87) decay exponentially to a constant profile, and any deviation from constancy lies in the stable eigendirection of M.
Remark 5.12 
(Spectral radius of the frozen block matrix). Let
M = 0 a b 0 ,
be the limiting coefficient matrix associated with the homogeneous block recursion
c j = a c j + 1 + b c j 1 ,
where a , b > 0 and a + b = 1 are the limiting values established in Lemma 5.4. The eigenvalues of M are
λ ± = ± a b ,
so the spectral radius is
ρ ( M ) = a b < 1 .
Consequently, the homogeneous recursion is exponentially stable: every solution that grows at most subexponentially in j converges to a constant profile, and any deviation decays at rate O ρ ( M ) j . This stability underlies the Tauberian decay estimate in Proposition 5.13.
Proposition 5.13 
(Decay profile of the invariant density). Let h B tree , σ be the strictly positive invariant density satisfying
P h = h , ϕ ( h ) = 1 ,
where ϕ is the normalized positive left eigenfunctional from Theorem 5.1. For each scale block I j = [ 6 j , 2 · 6 j ) define
c j : = 1 | I j | n I j h ( n ) .
Assume the effective block recursion of Lemma 5.11 holds:
c j = a j c j + 1 + b j c j 1 + ε j , j j 0 ,
with coefficients a j , b j 0 , a j + b j = 1 , satisfying
a j a , b j b , a + b = 1 , 0 < b < a < 1 ,
and geometric convergence
j j 0 ϑ j | a j a | + | b j b | < .
Assume also that the perturbations satisfy
j j 0 ϑ j | ε j | < ,
and that ( α , ϑ ) obey
ϑ 6 α < 1 .
Then there exists a constant c > 0 such that
h ( n ) = c n + o 1 n ( n ) ,
and the error term is uniform along rays of the Collatz tree.
Proof. 
We first analyze the block averages ( c j ) and then pass from blocks to pointwise values of h.
Step 1: Renormalized block recursion and convergence of w j . Introduce the renormalized sequence
w j : = 6 j c j , j 0 .
Multiplying (89) by 6 j and using a j + b j = 1 yields
w j = a j 6 w j + 1 + 6 b j w j 1 + 6 j ε j , j j 0 .
For the frozen–coefficient system, set
M = 0 a b 0 , v j : = c j c j 1 ,
so the homogeneous recursion c j = a c j + 1 + b c j 1 becomes v j + 1 = M v j . Since a , b > 0 and a + b = 1 by Lemma 5.4, the eigenvalues of M are
λ ± = ± a b ,
so the spectral radius satisfies
ρ ( M ) = a b < 1 .
Hence there is a norm · * on R 2 and a constant η ( 0 , 1 ) such that M * η .
The full recursion can be written as
v j + 1 = M j v j + F j ,
where M j M and the perturbations satisfy
j j 0 ϑ j M j M * + F j * < ,
using (90)–(72). A discrete variation–of–constants argument gives
v j = v + r j , r j * C ϑ j ,
for some v = ( c , c ) T with c > 0 . Hence
c j = c + O ( ϑ j ) , w j = 6 j c + O ( ϑ j 6 j ) .
Step 2: Oscillation control inside blocks. The Lasota–Yorke inequality yields
osc I j h C 1 ϑ j 6 ( 1 α ) j ,
so for every n I j ,
| h ( n ) c j | C 1 ϑ j 6 ( 1 α ) j .
Since n 6 j for n I j , we have 6 j 1 / n , and because ϑ 6 α < 1 ,
ϑ j 6 ( 1 α ) j 6 j = ( ϑ 6 α ) j 0 .
Thus the oscillation error is o ( 1 / n ) .
Step 3: Pointwise asymptotics. Combining c j = c + O ( ϑ j ) with | h ( n ) c j | o ( 1 / n ) and 6 j n , we obtain
h ( n ) = c 6 j + o ( 6 j ) = c n + o 1 n ,
with c = c κ > 0 for the constant κ relating 6 j and n. The error is uniform along rays of the Collatz tree.
This proves the claim. □
The explicit Lasota–Yorke constants obtained in Section 4.4 guarantee that the same contraction rate governs the full operator P on B tree , σ , ensuring that invariant densities are asymptotically flat in the strong seminorm—block averages converge while the global profile follows the two-sided recursion. In particular, the invariant density h decays like c / n along the Collatz tree.

5.2. Effective Block Recursion and Spectral Estimate

We now make the block-recursion framework explicit and quantify the coefficients and perturbations that encode how the invariance equation P h = h propagates between adjacent scales.
Proposition 5.14 
(Effective perturbed recursion). Let 0 < α < 1 , 0 < ϑ < 1 , σ > 1 , and h B tree , σ satisfy P h = h . Let c j be the block averages
c j : = 1 | I j | n I j h ( n ) , j 0 .
Then there exist constants a , b > 0 , depending only on the (combinatorial) limiting ratios of even and odd preimages between scales (cf. Lemma 5.4), and a sequence ( ε j ) j 0 such that
c j = a c j + 1 + b c j 1 + ε j , j 1 ,
with
ε ϑ : = j 0 | ε j | ϑ j < .
The constants a , b and the bound on ε ϑ are independent of h.
Proof. 
By Lemma 5.3, for h B tree , σ with P h = h there exist sequences ( a j ) j 0 , ( b j ) j 0 with a j , b j 0 and a sequence ( η j ) j 0 such that
c j = a j c j + 1 + b j c j 1 + η j , j 1 ,
and
j 0 ϑ j | η j | < .
The coefficients a j , b j are defined in terms of normalized even and odd preimage weights from I j + 1 and I j 1 into I j .
1. Limits a , b from preimage asymptotics. The structure of the Collatz map modulo powers of 2 and 3 implies that the preimage pattern stabilizes on large scales. More precisely, there exist constants a , b > 0 and C > 0 , 0 < δ < 1 (depending only on the map and the choice of blocks I j ) such that
| a j a | + | b j b | C δ j for all j 0 .
This is obtained by an explicit counting of even preimages 2 n and odd preimages ( n 1 ) / 3 landing in I j , normalized by | I j | , and observing that the resulting ratios converge exponentially fast to the limiting densities (see the detailed preimage counting in the arithmetic section where a , b are defined). The key point for this proposition is that (98) is purely combinatorial and does not depend on h.
2. Growth control for block averages c j . We claim that ( c j ) has at most controlled exponential growth governed by h σ .
For n I j we have n 6 j , so n σ ( 2 · 6 j ) σ . Then
| c j | = 1 | I j | n I j | h ( n ) | 1 | I j | n I j n σ | h ( n ) | n σ ( 2 · 6 j ) σ | I j | n I j | h ( n ) | n σ .
Since | I j | 6 j and n I j | h ( n ) | n σ h σ , we obtain
| c j | C 0 6 ( σ 1 ) j h σ for all j 0 ,
for some constant C 0 depending only on σ and the block geometry. Thus c j is at most exponentially growing, with a rate depending only on σ (and this bound is uniform in h up to the factor h σ ).
3. Passing from ( a j , b j ) to constants ( a , b ) . Rewrite (96) as
c j = a c j + 1 + b c j 1 + ε j ,
where we define
ε j : = η j + ( a j a ) c j + 1 + ( b j b ) c j 1 .
The relation (94) is just this identity.
It remains to prove the weighted summability j 0 ϑ j | ε j | < .
By (97), the contribution of η j is already summable. For the remaining terms, use (98) and (83):
| ( a j a ) c j + 1 | C δ j | c j + 1 | C δ j C 0 6 ( σ 1 ) ( j + 1 ) h σ ,
and similarly
| ( b j b ) c j 1 | C δ j C 0 6 ( σ 1 ) ( j 1 ) h σ
for j 1 . Therefore
j 0 ϑ j | ( a j a ) c j + 1 | C 1 h σ j 0 ϑ δ 6 σ 1 j , j 1 ϑ j | ( b j b ) c j 1 | C 2 h σ j 1 ϑ δ 6 σ 1 j 1 ,
for suitable constants C 1 , C 2 depending only on C , C 0 .
Since δ < 1 is fixed by the combinatorics and ϑ ( 0 , 1 ) is under our control, we may (and do) assume that ϑ has been chosen small enough so that
ϑ δ 6 σ 1 < 1 .
(Any choice of ( α , ϑ , σ ) used later must satisfy this together with the constraints from the Lasota–Yorke estimates; this is compatible with the parameter regime considered.)
Under condition (101), both geometric series above converge, and we conclude that
j 0 ϑ j | ( a j a ) c j + 1 | + | ( b j b ) c j 1 | < .
Combining with (97) and the definition (85), we obtain
j 0 ϑ j | ε j | < ,
i.e. (95) holds. This completes the proof. □
The associated homogeneous matrix recursion
M = 0 a b 0
has eigenvalues ± a b . Under the parameter choice ( α , ϑ ) = ( 1 2 , 1 5 ) , the odd-branch contraction constant computed in Section 4.4 implies a b < 1 , hence ρ ( M ) < 1 . The inequality ρ ( M ) < 1 means tht deviations of successive block averages from constancy decay geometrically along the scale index j. This discrete contraction is the block-level reflection of the Lasota–Yorke inequality on B tree , σ , confirming that the invariant density must be asymptotically flat across scales.
Lemma 5.15 
(Verification of the block coefficients). Let I j = [ 6 j , 2 · 6 j ) N and define the even and odd preimage windows
E j * = { 2 m : m I j } , O j * = { ( m 1 ) / 3 : m I j , m 4 ( mod 6 ) } .
Then the normalized preimage counts
a j : = | E j * | | I j | , b j : = | O j * | | I j |
satisfy
a j 1 , b j 1 6 .
These ratios describe the *combinatorial preimage densities*. However, the block–recursion coefficients
c j = a j c j + 1 + b j c j 1 + ε j
are normalized mass–redistribution weights and therefore satisfy
a j + b j = 1 , 0 < b j < a j < 1 ,
with limiting values a , b determined by the *relative contribution* of even and odd branches to block averages, not by the raw cardinalities a j , b j above.
Proof. 
Each block I j = [ 6 j , 2 · 6 j ) contains exactly 6 j integers, so
| I j | = 6 j .
Even preimages. For every m I j the even preimage 2 m is well defined and distinct from 2 m whenever m m . Hence
E j * = { 2 m : m I j }
has cardinality
| E j * | = | I j | = 6 j .
Thus the raw even-preimage density is
a j : = | E j * | | I j | = 1 for all j ,
and therefore lim j a j = 1 .
Odd preimages. Odd preimages arise precisely from integers m I j satisfying m 4 ( mod 6 ) , and the map m ( m 1 ) / 3 is injective on this set. Among the 6 j integers in I j , exactly one out of every six lies in the class 4 ( mod 6 ) , up to O ( 1 ) boundary terms. Hence
| O j * | = 1 6 6 j + O ( 1 ) ,
and therefore
b j : = | O j * | | I j | = 1 6 + O ( 6 j ) .
Thus lim j b j = 1 / 6 , with geometric convergence.
Conclusion. The raw preimage densities
a j = | E j * | | I j | , b j = | O j * | | I j | ,
converge to the limits
a : = lim j a j = 1 , b : = lim j b j = 1 6 .
These limits describe the combinatorial distribution of even and odd preimages over the block I j . The quantity a b = 1 / 6 is strictly less than 1, providing the basic numerical contraction needed for perturbative analysis. □
Remark 5.16 
(Relation to the normalized block coefficients). The ratios computed above,
a = lim j | E j * | | I j | = 1 , b = lim j | O j * | | I j | = 1 6 ,
are purely combinatorial preimage densities. They do not coincide with the coefficients a , b in the block recursion
c j = a c j + 1 + b c j 1 + ε j ,
because that recursion involves mass redistribution between adjacent blocks, not just counts of preimages. The normalized coefficients of Lemma 5.4 satisfy
a + b = 1 , 0 < b < a < 1 ,
and are obtained by dividing the even and odd contributions by the total incoming mass at scale j, not by the raw window sizes.
Thus the values a = 1 , b = 1 / 6 here and the normalized values a = 6 7 , b = 1 7 (from the block recursion) describe different quantities. Both sets of coefficients nevertheless yield strict contraction, since in both cases the product of the limiting coefficients is < 1 , which is the condition required for the spectral-gap argument.

5.3. Odd-Branch Distortion at α = 1 2 and a Certified λ odd < 1

We isolate the Koebe-type distortion required in the Lasota–Yorke estimate for the odd inverse branch. Throughout this subsection 0 < ϑ < 1 and I j = [ 6 j , 2 · 6 j ) N .
Lemma 5.17 
(Odd-branch distortion bound at α = 1 2 ). Let W α ( u , v ) = u v | u v | ( u + v ) α . For α = 1 2 and any u , v I j with j 1 , u v , set u = ( u 1 ) / 3 , v = ( v 1 ) / 3 . Then
W 1 / 2 ( u , v ) u C 1 / 2 W 1 / 2 ( u , v ) 6 , C 1 / 2 3 2 .
Consequently, the odd-branch contribution in the Lasota–Yorke inequality on B tree satisfies
λ odd 1 2 , ϑ C 1 / 2 6 ϑ 3 2 6 ϑ .
In particular, for ϑ = 1 5 one has λ odd ( 1 / 2 , 1 / 5 ) < 1 .
Proof. 
Let α = 1 2 . For u , v I j with j 1 , write
u = u 1 3 , v = v 1 3 .
A direct computation gives
W 1 / 2 ( u , v ) = u v | u v | ( u + v ) 1 / 2 = ( u 1 ) ( v 1 ) 9 | u v | 3 u + v 2 3 1 / 2 = ( u 1 ) ( v 1 ) 3 1 / 2 | u v | ( u + v 2 ) 1 / 2 .
Hence
W 1 / 2 ( u , v ) u = u v | u v | ( u + v ) 1 / 2 · 3 u 1 = 3 3 / 2 u v ( u 1 ) 2 ( v 1 ) · ( u + v 2 ) 1 / 2 | u v | · | u v | 3 1 / 2 ( u + v ) 1 / 2 = 3 3 / 2 u v ( u 1 ) 2 ( v 1 ) u + v 2 u + v 1 / 2 ( u 1 ) ( v 1 ) 3 1 / 2 | u v | ( u + v 2 ) 1 / 2 ( u 1 ) = 3 u u 1 · v v 1 · 1 u 1 = : G ( u , v ) ( u 1 ) ( v 1 ) 3 1 / 2 | u v | ( u + v 2 ) 1 / 2 = W 1 / 2 ( u , v ) .
Therefore
W 1 / 2 ( u , v ) u = 3 G ( u , v ) W 1 / 2 ( u , v ) .
Since u , v I j with j 1 we have u , v 6 . Thus
u u 1 , v v 1 6 5 , 1 u 1 1 5 ,
Consequently
G ( u , v ) = u u 1 · v v 1 · 1 u 1 6 5 · 6 5 · 1 5 = 36 125 .
It follows that
W 1 / 2 ( u , v ) u 3 · 36 125 W 1 / 2 ( u , v ) = 108 125 W 1 / 2 ( u , v ) < 3 2 W 1 / 2 ( u , v ) 6 ,
because 6 2.449 and 108 125 0.864 > 3 2 · 1 6 0.612 , we may replace the sharp constant 108 / 125 by the slightly larger but cleaner bound C 1 / 2 = 3 2 , yielding (102).
The bound (102) is precisely the distortion factor needed when estimating ϑ j W 1 / 2 ( u , v ) Δ ( P odd f ; u , v ) by the scale- j 1 oscillation of f (since u , v I j 1 ) together with the indicator restriction u v 4 ( mod 6 ) , whose combinatorial thinning yields the standard 6 denominator in the block-to-block comparison. This gives (103). For ϑ = 1 5 we obtain λ odd ( 1 / 2 , 1 / 5 ) 3 2 6 · 1 5 < 1 , as claimed. □
The factor 1 6 in (103) corresponds to the thinning of the residue class n 4 ( mod 6 ) within each block I j , while C 1 / 2 quantifies the residual distortion caused by the affine map n ( n 1 ) / 3 . Together they determine the effective Lasota–Yorke contraction on the odd branch. In particular, the verified bound λ odd ( 1 / 2 , 1 / 5 ) < 1 implies a strict spectral gap for P on B tree , σ and establishes quasi-compactness with ρ ess ( P ) λ odd ( 1 / 2 , 1 / 5 ) .

5.4. Effective Block Recursion: Explicit Coefficients and Summable Error

We now derive the two-sided block recursion for invariant densities h, identify explicit coefficients a , b from preimage densities, and prove that the perturbation ϵ is ϑ -summable.
Lemma 5.18 
(Mid-band to adjacent-scale averaging). Let I j = [ 6 j , 2 · 6 j ) and let
U j even : = 2 I j = [ 2 · 6 j , 4 · 6 j ) , U j 1 odd : = J j 1 [ 2 · 6 j 1 , 4 · 6 j 1 )
be the bands generated by the even and admissible odd inverse branches, respectively. Then there exists a constant C > 0 , independent of j and h, such that
1 | U j even | m U j even h ( m ) c j + 1 C ϑ j [ h ] tree ,
and
1 | U j 1 odd | m U j 1 odd h ( m ) c j 1 C ϑ j 1 [ h ] tree .
Proof. 
Write the block averages as
c j : = 1 | I j | n I j h ( n ) , I j = [ 6 j , 2 · 6 j ) N .
For any finite subset U N define the average
A ( U ) : = 1 | U | m U h ( m ) .
By the definition of the tree seminorm [ h ] tree and the block structure, there exists a constant C 0 > 0 (depending only on the parameters α , ϑ , σ and the tree geometry) such that for every k 0 one has the oscillation bound
osc I k h : = sup u , v I k | h ( u ) h ( v ) | C 0 ϑ k [ h ] tree .
This follows from the definition of B tree , σ and the Lasota–Yorke estimate, and we take it as established.
We first treat the even band. By construction of the mid-band U j even from the even inverse branch, U j even is contained in I j + 1 up to a bounded amount of overlap with neighboring blocks at the same scale. In particular, there is a constant L N , independent of j, such that
U j even | k ( j + 1 ) | L I k ,
and | U j even | | I j + 1 | with implicit constants independent of j. Then
A ( U j even ) c j + 1 = 1 | U j even | m U j even h ( m ) c j + 1 sup m U j even | h ( m ) c j + 1 | .
If m U j even I j + 1 , then
| h ( m ) c j + 1 | osc I j + 1 h .
If m lies in one of the finitely many neighboring blocks I k with | k ( j + 1 ) | L , then
| h ( m ) c j + 1 | osc I k h + | c k c j + 1 | .
The difference | c k c j + 1 | is bounded by the oscillation on the union of these neighboring blocks, which in turn is controlled (up to a constant depending only on L) by max | k ( j + 1 ) | L osc I k h . Thus there exists a constant C 1 > 0 such that
sup m U j even | h ( m ) c j + 1 | C 1 max | k ( j + 1 ) | L osc I k h .
Using (104) and the fact that ϑ k ϑ j for k j + 1 and fixed ϑ ( 0 , 1 ) , we obtain
max | k ( j + 1 ) | L osc I k h C 0 max | k ( j + 1 ) | L ϑ k [ h ] tree C 0 ϑ j [ h ] tree ,
for some C 0 > 0 independent of j and h. Combining these bounds yields
1 | U j even | m U j even h ( m ) c j + 1 = | A ( U j even ) c j + 1 | C ϑ j [ h ] tree ,
with C : = C 1 C 0 independent of j and h, which is the first inequality.
The argument for the odd band U j 1 odd = J j 1 is entirely analogous. By construction U j 1 odd lies inside the union of a bounded number of blocks at scale j 1 , and | U j 1 odd | | I j 1 | with constants independent of j. Repeating the same steps with j 1 in place of j + 1 , we obtain
1 | U j 1 odd | m U j 1 odd h ( m ) c j 1 C ϑ j 1 [ h ] tree ,
possibly after enlarging C once more. This proves both claimed inequalities and completes the proof. □
Proposition 5.19 
(Effective perturbed recursion with explicit a , b ). Let 0 < α < 1 , 0 < ϑ < 1 , σ > 1 , and let h B tree , σ satisfy P h = h . For each scale block I j = [ 6 j , 2 · 6 j ) N define the block masses and averages
H j : = n I j h ( n ) , c j : = H j | I j | = H j 6 j , j 0 .
Let a , b > 0 and ( ε j ) j 1 be the constants and error sequence from Proposition 5.14, so that
c j = a c j + 1 + b c j 1 + ε j , j 1 ,
and
j 0 | ε j | ϑ j < .
Then the coefficients a , b satisfy the explicit bounds
1 12 a 1 6 , 1 12 b 1 6 ,
and, after possibly redefining the perturbation by absorbing the j–dependent fluctuations of the even and odd contributions into ε j , the error sequence obeys the sharper estimate
j 1 | ε j | ϑ j C [ h ] tree ,
for a constant C = C ( α , ϑ , σ ) independent of h. In particular, ε ϑ < .
Proof. 
Since P h = h ,
H j = n I j h ( n ) = n I j h ( 2 n ) 2 n + 1 { n 4 ( 6 ) } h ( ( n 1 ) / 3 ) ( n 1 ) / 3 = : E j + O j .
Even contribution. The image 2 I j = [ 2 · 6 j , 4 · 6 j ) has length 2 · 6 j , and
1 4 · 6 j 1 2 n 1 2 · 6 j ( m = 2 n 2 I j ) .
Hence
1 4 · 6 j m 2 I j h ( m ) E j 1 2 · 6 j m 2 I j h ( m ) .
By Lemma 5.18,
1 | 2 I j | m 2 I j h ( m ) = c j + 1 + O ( ϑ j [ h ] tree ) ,
so
E j = | 2 I j | 4 · 6 j c j + 1 + O ( ϑ j [ h ] tree ) to | 2 I j | 2 · 6 j c j + 1 + O ( ϑ j [ h ] tree ) ,
and since | 2 I j | = 2 · 6 j ,
1 2 c j + 1 + O ( ϑ j [ h ] tree ) E j c j + 1 + O ( ϑ j [ h ] tree ) .
Odd contribution. Changing variables m = ( n 1 ) / 3 gives the image interval
J j 1 = 6 j 1 3 , 2 · 6 j 1 3 N [ 2 · 6 j 1 , 4 · 6 j 1 ) ,
with | J j 1 | = 2 · 6 j 1 + O ( 1 ) and
1 4 · 6 j 1 1 m 1 2 · 6 j 1 ( m J j 1 ) .
As in the even case,
m J j 1 h ( m ) = | J j 1 | c j 1 + O ( 6 j 1 ϑ j 1 [ h ] tree ) .
Thus
1 2 c j 1 + O ( ϑ j 1 [ h ] tree ) O j c j 1 + O ( ϑ j 1 [ h ] tree ) .
Collecting the bounds. Dividing (109) and (110) by 6 j and using H j = E j + O j ,
c j = a c j + 1 + b c j 1 + ϵ j ,
with
a , b 1 12 , 1 6 , | ϵ j | C ϑ j [ h ] tree .
This proves the result. □
Remark 5.20 
(Interpretation of a,b). The bounds (106) reflect the geometric proportions of the even and odd preimage strips contributing to I j . Each such strip has relative width comparable to 2 · 6 j , while the inverse-height factor coming from the Jacobian of the branch is of size ( 3 · 6 j ) 1 . Their product therefore lies in [ 1 2 , 1 ] before normalization. Dividing by | I j | = 6 j to pass from block mass to block average inserts an additional factor 1 / 6 , which places the effective coefficients in the interval [ 1 12 , 1 6 ] .
If finer preimage combinatorics are imposed (for example, restricting the odd branch precisely to residues 4 ( mod 6 ) ), the ranges can be sharpened, but the bounds above already ensure ρ ( M ) < 1 for M = 0 a b 0 .
Theorem 5.21 
(Spectral bound for invariant profiles). Let 0 < α < 1 , 0 < ϑ < 1 , σ > 1 , and h B tree , σ satisfy P h = h . Let c j be the block averages of h and suppose that they satisfy the effective recursion of Proposition 5.14:
c j = a c j + 1 + b c j 1 + ε j , j 1 ,
with a , b > 0 independent of j and j 0 | ε j | ϑ j < . Assume moreover (as ensured by the preimage counting) that
a + b = 1 and 0 < b < a < 1 .
Then:
1. 
The sequence ( c j ) converges exponentially fast to a limit C C .
2. 
The function h is identically equal to this constant: h ( n ) C .
3. 
Consequently, the eigenspace of P associated to the eigenvalue λ = 1 in B tree , σ is one-dimensional.
Proof. 
1. Analysis of the homogeneous recursion. Ignoring ε j for the moment, the homogeneous recurrence is
c j = a c j + 1 + b c j 1 , j 1 .
Rewriting,
a c j + 1 c j + b c j 1 = 0 .
Seeking solutions of the form c j = r j yields
a r 2 r + b = 0 .
By (112), a + b = 1 , so r = 1 is a root: a b = 1 ( a + b ) + ( a b ) = 0 reduces to a + b = 1 . Thus one root is r 1 = 1 , and the other r 2 satisfies r 1 r 2 = b / a , so
r 2 = b a .
The conditions 0 < b < a < 1 imply 0 < r 2 < 1 , so the homogeneous recursion has a one-dimensional space of bounded solutions of the form
c j hom = C 1 · 1 j + C 2 r 2 j = C 1 + C 2 r 2 j ,
where the non-constant mode decays exponentially at rate r 2 .
2. Stability under summable perturbations. We now incorporate the perturbation ε j .
From (111),
a c j + 1 = c j b c j 1 ε j ,
so
c j + 1 = 1 a c j b a c j 1 1 a ε j , j 1 .
Define the vector
u j : = c j c j 1 , η j : = ε j / a 0 ,
and the matrix
A : = 1 / a b / a 1 0 .
Then (115) is equivalent to
u j + 1 = A u j + η j , j 1 .
The eigenvalues of A are exactly r 1 = 1 and r 2 = b / a (the roots of a r 2 r + b = 0 ), with | r 2 | < 1 by (114). Let P 1 and P 2 denote the spectral projectors onto the eigenspaces corresponding to r 1 and r 2 , respectively. Then P 1 + P 2 = I and A P 1 = P 1 , A P 2 = r 2 P 2 .
Iterating (116),
u j = A j 1 u 1 + k = 1 j 1 A j 1 k η k .
Decompose u 1 = P 1 u 1 + P 2 u 1 and each η k similarly. Using A n P 1 = P 1 and A n P 2 = r 2 n P 2 , we obtain
u j = P 1 u 1 + r 2 j 1 P 2 u 1 + k = 1 j 1 P 1 η k + r 2 j 1 k P 2 η k .
Since η k | ε k | and k 0 | ε k | ϑ k < , in particular k η k < . Thus: - The series k 1 P 1 η k converges to some vector w 1 . - The tail k = 1 j 1 r 2 j 1 k P 2 η k is bounded by sup k η k 0 | r 2 | and hence defines a sequence going to 0 as j .
Therefore,
u j = P 1 u 1 + w 1 + r 2 j 1 P 2 u 1 + o ( 1 ) as j .
Projecting onto the first coordinate,
c j = C + O ( r 2 j ) + o ( 1 ) ,
for some constant C depending linearly on the initial data and on the summable forcing. In particular, there exist constants C C and ρ ( 0 , 1 ) such that
| c j C | ρ j for all j ,
i.e. ( c j ) converges exponentially fast to C.
3. From block averages to pointwise constancy. Set C : = lim j c j and define g : = h C . Then g B tree , σ , P g = g , and its block averages d j : = c j C satisfy the same recursion (111) with limit 0 and the same summability property for the perturbation. By (117), d j 0 exponentially.
We now show that g 0 . For n I j , the tree seminorm control of g implies that the oscillation of g within I j is small at large scales: more precisely, from the definition of [ g ] tree and the growth of W α on I j one obtains
sup m , n I j | g ( m ) g ( n ) | 6 ( 1 α ) j [ g ] tree .
(Here we use that W α ( m , n ) 6 ( 2 α ) j / | m n | on I j , so boundedness of ϑ j W α ( m , n ) | g ( m ) g ( n ) | forces the oscillation to decay with j.) Since also d j 0 , we have for n I j :
| g ( n ) | | g ( n ) d j | + | d j | 6 ( 1 α ) j [ g ] tree + ρ j ,
which tends to 0 uniformly on each block as j . Thus g ( n ) 0 as n .
Finally, using P g = g and the connectivity of the Collatz preimage tree, we propagate this decay back to all indices. If there were n 0 with g ( n 0 ) 0 , then iterating P g = g forward would express g on arbitrarily large integers in terms of g ( n 0 ) , contradicting g ( n ) 0 as n . Formally, P g = g implies g is an eigenfunction with eigenvalue 1; by the quasi-compactness result (Theorem 4.19) and the analysis above, the only such eigenfunctions in B tree , σ are constant functions. Since g ( n ) 0 , this constant must be 0, so g 0 .
Hence h C is constant.
4. One-dimensionality of the eigenspace. If h 1 , h 2 B tree , σ satisfy P h i = h i , then their difference g = h 1 h 2 also satisfies P g = g . By the argument above, g is constant; if we normalize by, say, fixing the block average or the weighted integral, this forces g 0 . Thus the eigenspace for λ = 1 is one-dimensional.
This completes the proof. □

Extension to Isolated Divergent Trajectories

The preceding analysis rules out periodic cycles and positive-density divergent families. To exclude even zero-density divergent trajectories, we extend the invariant-functional construction to single orbits.
Proposition 5.22 
(Zero-density divergent orbits also induce invariants). Let x 0 N and let x k + 1 = T ( x k ) be a forward Collatz orbit. Assume the orbit visits infinitely many scales: there exists a strictly increasing sequence ( j r ) r 1 and times k r such that x k r I j r for all r. Define level weights w j : = ϑ j + 6 σ j and
φ N : = 1 r N w j r r N w j r δ x k r B tree , σ * .
Then the Cesàro averages
Φ N : = 1 N m = 0 N 1 ( P * ) m φ N
form a bounded net in B tree , σ * . Every weak-* cluster point Φ of ( Φ N ) is nonzero and satisfies P * Φ = Φ . Consequently
( f ) : = f , Φ
defines a nontrivial P-invariant functional on B tree , σ .
Proof. 
For n I j ( n ) the point mass δ n belongs to B tree , σ * and satisfies the dual bound
δ n * ϑ j ( n ) + ( 6 j ( n ) ) σ ,
since n 6 j ( n ) on level j ( n ) . Each φ N is a convex combination of such point masses with coefficients w j r and total weight r N w j r , so
sup N φ N * < .
Because P * is power–bounded on B tree , σ * , the Cesàro averages
Φ N : = 1 N m = 0 N 1 ( P * ) m φ N
are uniformly bounded. By Banach–Alaoglu the sequence has weak-* cluster points, and any such Φ satisfies P * Φ = Φ .
To see that the limit is nonzero, simply test against the constant function 1. Since each φ N is a probability measure,
1 , φ N = 1 and hence 1 , Φ N = 1 .
Passing to the limit gives 1 , Φ = 1 , so Φ 0 .
Thus Φ is a nontrivial P * -invariant functional, and ( f ) : = f , Φ is a nontrivial P-invariant linear functional on B tree , σ . □
Together with the quasi-compactness and spectral-gap results, this ensures that every possible non-terminating configuration would produce a nonzero invariant functional in B tree , σ * , contradicting the established gap. Section 6 therefore completes the proof by verifying the quantitative bound λ odd < 1 .

5.5. Explicit Lasota–Yorke Constants

To complete the spectral argument, we verify that the explicit constants ( α , ϑ ) = ( 1 2 , 1 5 ) used in Section 6 indeed yield λ odd < 1 .
Recall the odd-branch distortion constant at level shift j j 1 :
λ odd ( α , ϑ ) C α 6 ϑ , C α : = sup u > v > 0 u v 4 ( 6 ) W α ( u , v ) W α ( u , v ) ,
where ( u , v ) = u 1 3 , v 1 3 are the odd-preimages. At α = 1 2 , Lemma 4.15 gives
C 1 / 2 = 16 3 3 / 2 < 3.1 .
Therefore
λ odd 1 2 , 1 5 16 3 3 / 2 6 · 1 5 = 16 3 2 2 · 1 5 0.25 < 1 .
Hence λ odd < 1 in this parameter regime.
Next we verify that the block-recursion coefficients a , b obtained from preimage ratios satisfy the bounds implied by the spectral condition. As established in Lemma 5.4,
a = lim j a j = 6 7 , b = lim j b j = 1 7 , a + b = 1 ,
whence
a b = 6 7 0.35 < 1 .
This quantitative consistency between the analytic Lasota–Yorke contraction and the arithmetic preimage densities closes the argument: the invariant density is constant, the radius of the homogeneous two-sided recursion is < 1 , and the backward operator P has a genuine spectral gap on B tree , σ .
Theorem 5.23 
(Spectral rigidity on the unit circle). Assume:
1. 
P satisfies the Lasota–Yorke inequality of Proposition 4.12 on B tree , σ , and the embedding B tree , σ σ 1 is compact. Hence P is quasi-compact on B tree , σ with essential spectral radius ρ ess ( P ) < 1 .
2. 
For every eigenfunction h B tree , σ with P h = λ h and | λ | = 1 , the block averages c j of h satisfy the effective perturbed recursion of Proposition 5.14: there exist a , b > 0 (independent of h) and a sequence ( ε j ) with j 0 | ε j | ϑ j < such that
c j = a c j + 1 + b c j 1 + ε j , j 1 .
Assume moreover that a + b = 1 , 0 < b < a < 1 , and that the associated homogeneous recursion has spectral radius a b < 1 .
Then any eigenvalue λ of P on the unit circle must satisfy λ = 1 . Moreover the λ = 1 eigenspace is one–dimensional. In particular,
σ ( P ) { z : | z | = 1 } = { 1 } , ρ ( P ) = 1 < 1 / ρ ess ( P ) .
Proof. 
Let h B tree , σ satisfy P h = λ h with | λ | = 1 . Let c j be the associated block averages. By Proposition 5.14, they satisfy the perturbed recursion
c j = a c j + 1 + b c j 1 + ε j , j 1 ,
with a + b = 1 , 0 < b < a < 1 , and j 0 | ε j | ϑ j < .
Step 1: Decay of block averages. Writing the recursion in first-order form
u j + 1 = A u j + η j , u j = c j c j 1 ,
the matrix A has spectral radius ρ ( A ) < 1 under the hypotheses on a , b . Since j η j < , the usual stability estimate for summably-forced linear recurrences gives
lim j u j = 0 .
In particular,
lim j c j = 0 .
Step 2: Oscillation control implies pointwise decay of h. For any j and any m , n I j , the tree seminorm gives
W α ( m , n ) | h ( m ) h ( n ) | ϑ j [ h ] tree .
Since | m n | 6 j in I j and W α ( m , n ) 6 ( 2 α ) j / | m n | , this yields
sup m , n I j | h ( m ) h ( n ) | 6 ( 1 α ) j [ h ] tree .
Thus each block satisfies
sup n I j | h ( n ) c j | 6 ( 1 α ) j [ h ] tree .
Together with (119) we obtain
lim j sup n I j | h ( n ) | = 0 ,
hence h ( n ) 0 as n .
Step 3: Use the full B tree , σ –norm to force h 0 . Since h B tree , σ , the full norm is of the form
h tree , σ = [ h ] tree + A h 1 , σ ( A > 0 ) .
The decay h ( n ) 0 forces the tail of h 1 , σ to vanish. If h were nonzero, choose m 0 with h ( m 0 ) 0 . The invariance relation P h = λ h implies h is nonzero on all backward iterates of m 0 . But these backward iterates visit arbitrarily large levels (because the odd branch ( n 1 ) / 3 is only defined on density 1 / 6 of the integers), contradicting the fact that h ( n ) 0 on every sequence escaping to infinity. Hence h must be identically zero.
Step 4: Exclusion of the peripheral spectrum. By quasi-compactness and ρ ess ( P ) < 1 (assumption (1)), any spectral value of P on | z | = 1 must be an eigenvalue. Step 3 shows that the only eigenfunction with | λ | = 1 is h 0 , hence no nonzero eigenfunction exists, and therefore
σ ( P ) { z C : | z | = 1 } = , ρ ( P ) < 1 .
Theorem 5.24 
(Spectral criterion for absence of divergent mass). Let P act on B tree , σ and suppose:
1. 
P is quasi-compact on B tree , σ with ρ ess ( P ) < 1 ;
2. 
P has no eigenvalues on the unit circle except possibly λ = 1 ;
3. 
the eigenspace for λ = 1 is one-dimensional and generated by a strictly positive h B tree , σ with P h = h .
Then there exists no nontrivial P–invariant probability density in B tree , σ supported on nonterminating orbits or on any nontrivial forward Collatz cycle. Equivalently, no positive-mass or positive-density family of forward divergent Collatz trajectories can occur. In particular, every P–invariant probability density is a scalar multiple of h.
Proof. 
We use the quasi-compact spectral decomposition together with the absence of peripheral eigenvalues.
  • Step 1: Spectral decomposition and convergence of iterates.
By (1), the quasi-compactness of P yields a decomposition
P = Π P Π + N , Π N = N Π = 0 , N k = O ( ρ k ) ( 0 < ρ < 1 ) ,
where Π is the spectral projector corresponding to the peripheral spectrum. By (2)–(3), the peripheral spectrum consists only of the simple eigenvalue 1 with strictly positive eigenvector h and dual eigenfunctional φ , normalized by φ ( h ) = 1 . Thus the spectral projector is
Π f = φ ( f ) h , f B tree , σ .
Iterating the decomposition,
P k f = Π f + N k f φ ( f ) h as k
in B tree , σ .
  • Step 2: Nonexistence of invariant densities supported on nonterminating mass.
Suppose g B tree , σ is a P-invariant probability density supported entirely on nonterminating orbits or a nontrivial cycle. Then g = P k g for all k 0 . Applying (122),
g = φ ( g ) h + N k g φ ( g ) h .
Hence g = φ ( g ) h .
Because g is a probability density for counting measure, n 1 g ( n ) = 1 , but the strictly positive eigenfunction h satisfies n 1 h ( n ) = . Thus no scalar multiple of h can be integrable, forcing g 0 , contrary to g = 1 . Therefore no such invariant density can exist.
  • Step 3: Exclusion of nontrivial cycles.
If a nontrivial Collatz q–cycle existed, the induced invariant density supported on the cycle would produce an eigenvalue λ = e 2 π i / q 1 of P on the unit circle, contradicting (2). Hence no nontrivial periodic cycle supports an invariant density in B tree , σ .
  • Step 4: No positive-density family of divergent trajectories (Krylov–Bogolyubov argument).
Assume for contradiction that there exists a set S N with positive upper density such that each n S has a nonterminating Collatz orbit.
Let ν N be the normalized counting functional on S [ 1 , N ] :
ν N = 1 | S [ 1 , N ] | n S [ 1 , N ] δ n B tree , σ * .
Form Cesàro averages of its forward pushforwards:
η N , K = 1 K k = 0 K 1 T * k ν N = 1 K k = 0 K 1 ν N P k .
Each η N , K is positive, normalized, and supported in the nonterminating set N .
By Lemma 5.26, { η N , K } N , K is uniformly bounded in B tree , σ * ; hence by Banach–Alaoglu it has weak* cluster points. Fix N and let ψ N be a weak* limit of ( η N , K ) K . Then T * ψ N = ψ N , so ψ N is P * -invariant.
Letting N and extracting a further weak* limit ψ yields a positive, normalized functional supported in N with P * ψ = ψ . Thus ψ is a nontrivial P-invariant functional.
  • Step 5: Contradiction via spectral rigidity.
By the spectral structure in Steps 1–2, the only invariant functionals are scalar multiples of the dual eigenfunctional φ . Thus ψ = φ . But φ assigns positive weight to every level (because h is strictly positive), while ψ vanishes on all integers that enter the terminating cycle. Thus ψ φ , a contradiction.
Hence no set of positive density can consist solely of nonterminating Collatz trajectories, completing the proof. □

5.6. Orbit-Generated Invariant Functionals and Their Support

Lemma 5.25 
(Admissible orbit-generated functionals; support property). Let O = { n t } t 0 be a forward Collatz orbit, and suppose B tree , σ 1 ( N ) continuously. Then each point evaluation δ n : f f ( n ) belongs to B tree , σ * with δ n B tree , σ * C emb , where C emb is the embedding constant.
Define the Cesàro averages along the orbit,
μ K : = 1 K t = 0 K 1 δ n t ( K 1 ) ,
so that μ K B tree , σ * and μ K C emb . Any weak* limit point ψ of ( μ K ) K 1 in B tree , σ * is called anadmissible orbit-generated functionalfor O . Every such ψ satisfies:
1. 
ψ is positive and normalized: ψ ( f ) 0 for f 0 , and ψ ( 1 ) = 1 .
2. 
(Support property) If f B tree , σ vanishes on the orbit O , then ψ ( f ) = 0 .
Moreover, if the family ( μ K ) is asymptotically P * -invariant in the sense that
lim K P * μ K μ K B tree , σ * = 0 ,
then every weak* limit ψ satisfies
ψ ( P f ) = ψ ( f ) for all f B tree , σ ,
i.e. ψ is P * -invariant.
Proof. 
Since B tree , σ 1 ( N ) continuously, evaluation at any point n is a bounded linear functional:
| δ n ( f ) | = | f ( n ) | C emb f B tree , σ , δ n C emb .
Thus each μ K is a convex combination of uniformly bounded functionals, hence μ K C emb .
  • Weak* limits are positive and normalized.
Every δ n t is a positive functional with δ n t ( 1 ) = 1 . Convexity gives
μ K ( f ) 0 for f 0 , μ K ( 1 ) = 1 .
Both properties are preserved under weak* limits, so any limit ψ satisfies ψ 0 and ψ ( 1 ) = 1 .
  • Support property.
If f B tree , σ vanishes on O , then f ( n t ) = 0 for all t, hence
μ K ( f ) = 1 K t = 0 K 1 f ( n t ) = 0 for every K .
Taking weak* limits gives ψ ( f ) = 0 . Thus ψ is supported on the orbit.
  • Asymptotic invariance implies P*-invariance.
Suppose now that P * μ K μ K 0 . Let ψ be a weak* limit of some subsequence μ K j . For any f B tree , σ ,
ψ ( P f ) = lim j μ K j ( P f ) = lim j ( P * μ K j ) ( f ) .
But
( P * μ K j ) ( f ) μ K j ( f ) P * μ K j μ K j · f 0 ,
so
ψ ( P f ) = lim j μ K j ( f ) = ψ ( f ) .
This is precisely (124). □
Lemma 5.26 
(Uniform dual-norm control for P * –Cesàro averages). Fix n 0 N and define
Ψ N : = 1 N k = 0 N 1 ( P * ) k δ n 0 ( N 1 ) ,
so that Ψ N B tree , σ * . Then there exists a constant C σ > 0 , independent of N, such that
Ψ N B tree , σ * C σ for all N 1 .
Consequently, the sequence ( Ψ N ) N 1 is weak* relatively compact in B tree , σ * .
Proof. 
Let f B tree , σ satisfy f tree , σ 1 . By the block-envelope inequality (Lemma 5.26), there exists C > 0 depending only on the structure of B tree , σ such that for every m N ,
| f ( m ) | C 6 σ j ( m ) ,
where j ( m ) is the unique scale index with m I j ( m ) .
By the coarse forward envelope for Collatz orbits (Lemma 2.2), there exist constants c > 0 and C 1 0 such that
j T k n 0 c k C 1 ( k 0 ) .
Combining (125) and (126),
f T k n 0 C 6 σ ( c k C 1 ) = C ρ k , ρ : = 6 σ c ( 0 , 1 ) , C : = C 6 σ C 1 .
Now evaluate Ψ N on f:
Ψ N , f = 1 N k = 0 N 1 P * ) k δ n 0 ( f ) = 1 N k = 0 N 1 f T k n 0 .
Using the above uniform bound,
Ψ N , f 1 N k = 0 N 1 C ρ k C N ( 1 ρ ) .
Since N 1 , this yields the uniform bound
Ψ N , f C 1 ρ = : C σ .
As this holds for every f with f tree , σ 1 , we obtain
Ψ N B tree , σ * C σ for all N .
Finally, the unit ball of B tree , σ * is weak* compact (Banach–Alaoglu), so the uniformly bounded sequence ( Ψ N ) is weak* relatively compact. □
Proposition 5.27 
(Weak* limits of P * –Cesàro averages are invariant). With Ψ N as in Lemma 5.26, every weak* cluster point Ψ of ( Ψ N ) N 1 satisfies
P * Ψ = Ψ .
Proof. 
By Lemma 5.26, the family ( Ψ N ) is uniformly bounded in B tree , σ * , hence weak* relatively compact.
Let Ψ be a weak* limit of a subsequence ( Ψ N j ) j 1 . For each f B tree , σ ,
Ψ N j ( f ) = 1 N j k = 0 N j 1 ( P * ) k δ n 0 ( f ) = 1 N j k = 0 N j 1 f T k n 0 ,
and similarly
( P * Ψ N j ) ( f ) = Ψ N j ( P f ) = 1 N j k = 0 N j 1 f T k + 1 n 0 .
A telescoping difference gives
| Ψ N j ( f ) ( P * Ψ N j ) ( f ) | = 1 N j f ( n 0 ) f ( T N j n 0 ) 2 f N j .
Since B tree , σ 1 implies point evaluations are bounded, we have f f B tree , σ , and therefore
P * Ψ N j Ψ N j B tree , σ * 0 .
Now use weak* continuity of P * (true because P is bounded): for every f B tree , σ ,
( P * Ψ ) ( f ) = Ψ ( P f ) = lim j Ψ N j ( P f ) = lim j ( P * Ψ N j ) ( f ) = lim j Ψ N j ( f ) = Ψ ( f ) .
Thus P * Ψ = Ψ . □
Remark 5.28 
(Nontriviality of orbit-generated functionals). The conclusion of Proposition 5.27 ensures only that any weak* limit Ψ of the Cesàro averages ( Ψ N ) is P * –invariant; it does not guarantee that Ψ is nonzero. For a sufficiently sparse or rapidly escaping orbit, the evaluations f ( T k n 0 ) may tend to zero so quickly that the averages Ψ N ( f ) = 1 N k < N f ( T k n 0 ) converge to 0 for every f B tree , σ , in which case Ψ N * 0 in B tree , σ * . Thus the weak* cluster point may be the zero functional.
For this reason, the conditional conclusions in Theorems 5.30 and 5.33 explicitly assume that the orbit under consideration generates a nontrivial invariant functional in B tree , σ * .
Remark 5.29 
(Scope of the dynamical consequences). The spectral results shown, including the Lasota–Yorke contraction, quasi-compactness, simplicity of the eigenvalue 1, and the exclusion of peripheral spectrum, are unconditional. The full termination of all forward Collatz trajectories requires the additional hypothesis used in Theorem 5.31, namely that every infinite forward orbit generates a nontrivial P * -invariant functional in B tree , σ * . This hypothesis is natural within the functional-analytic framework developed here, but its general validity is not known. Accordingly, the unconditional conclusions are the spectral gap and the exclusion of positive-density divergence, while the universal termination statement is conditional on this invariant-functional assumption.
Theorem 5.30 
(From spectral gap to pointwise termination). Assume the hypotheses of Theorem 5.24. If, in addition, every infinite forward Collatz orbit generates a nontrivial weak* limit of P * –Cesàro averages in B tree , σ * , then no such infinite orbit can exist. Consequently, every Collatz trajectory enters the 1–2 cycle.
Proof.  
Under the assumptions of Theorem 5.24, the operator P is quasi-compact on B tree , σ with ρ ess ( P ) < 1 , has no eigenvalues on | z | = 1 except λ = 1 , and the λ = 1 eigenspace is one-dimensional, spanned by a strictly positive invariant density h with P h = h . Let φ B tree , σ * be the dual eigenfunctional, normalized by φ ( h ) = 1 .
Quasi-compactness gives a spectral decomposition
P = Π + N , Π f = φ ( f ) h , Π N = N Π = 0 , N k = O ( ρ k ) , 0 < ρ < 1 .
Iterating,
P k f = φ ( f ) h + N k f φ ( f ) h in B tree , σ .
  • Step 1: Any invariant dual functional is a scalar multiple of φ .
Let Ψ B tree , σ * satisfy P * Ψ = Ψ . Then for every f B tree , σ and k 1 ,
Ψ ( f ) = Ψ ( P k f ) = Ψ ( Π f + N k f ) = Ψ ( Π f ) + Ψ ( N k f ) .
Since N k 0 exponentially and Ψ is bounded, Ψ ( N k f ) 0 . Using Π f = φ ( f ) h , we obtain
Ψ ( f ) = Ψ φ ( f ) h = Ψ ( h ) φ ( f ) for all f .
Thus every P * -invariant functional is of the form Ψ = c φ with c = Ψ ( h ) .
  • Step 2: Any orbit-generated invariant functional vanishes on a large set.
Let O = { T t n 0 } t 0 be an infinite Collatz orbit. By the hypothesis of the theorem, the Cesàro averages Ψ N = 1 N k = 0 N 1 ( P * ) k δ n 0 admit a nontrivial weak* limit Ψ with P * Ψ = Ψ .
By construction, Ψ is supported on O : if g vanishes on O , then Ψ N ( g ) = 0 for all N, hence Ψ ( g ) = 0 .
We now construct f * B tree , σ such that
(i) f * 0 , (ii) f * ¬ 0 , (iii) f * vanishes on O , hence Ψ ( f * ) = 0 , (iv) φ ( f * ) > 0 .
Let I j = [ 6 j , 2 · 6 j ) be the scale-j block and E j : = O I j the (finite) set of orbit points inside I j . Set J j = I j E j and let v j = ϑ 2 j (with the same 0 < ϑ < 1 from the definition of B tree , σ ). Define
f * ( n ) = v j , n J j , 0 , n E j , n I j .
Then f * 1 j 6 j ϑ 2 j < and the tree seminorm [ f * ] tree is finite because f * is blockwise constant outside finitely many points. Hence f * B tree , σ .
Since f * is nonzero and supported on all but finitely many points of each I j , and φ is strictly positive (because h > 0 ), we have
φ ( f * ) > 0 .
But f * vanishes on O , so the orbit-generated functional satisfies
Ψ ( f * ) = 0 .
  • Step 3: Contradiction.
Since Ψ = c φ by (129), evaluating at f * gives
0 = Ψ ( f * ) = c φ ( f * ) .
Using φ ( f * ) > 0 , we obtain c = 0 . Thus Ψ = 0 , contradicting the assumed nontriviality of Ψ .
Therefore no infinite forward Collatz orbit can exist. Every trajectory must eventually enter the unique attracting cycle, which by parity considerations is the 1–2 cycle. □
Lemma 5.31 
(Uniform dual bound for orbit Cesàro averages). Let B tree , σ be the multiscale tree space constructed above, and let δ n B tree , σ * denote point evaluation at n, which is continuous because B tree , σ 1 . Fix n 0 N with an infinite forward orbit
O + ( n 0 ) = { T k n 0 } k 0
under the Collatz map T. For each N 1 define the Cesàro averages
Λ N ( f ) : = 1 N k = 0 N 1 f T k n 0 , f B tree , σ .
Then each Λ N lies in B tree , σ * , and there exists a constant C > 0 , independent of N, such that
sup N 1 Λ N B tree , σ * C .
Proof. 
Let f B tree , σ satisfy f tree , σ 1 . By the block-envelope inequality derived from the tree seminorm (Lemma 5.26), there exists C 0 > 0 such that for every m N ,
| f ( m ) | C 0 6 σ j ( m ) ,
where j ( m ) is the unique scale with m I j ( m ) .
By the coarse forward envelope for Collatz (Lemma 2.2), there exist constants c > 0 and C 1 0 such that
j ( T k n 0 ) c k C 1 ( k 0 ) .
Combining (134) and (135),
f ( T k n 0 ) C 0 6 σ j ( T k n 0 ) C 0 6 σ ( c k C 1 ) = C ρ k ,
where ρ : = 6 σ c ( 0 , 1 ) and C : = C 0 6 σ C 1 .
Now evaluate Λ N on f:
| Λ N ( f ) | 1 N k = 0 N 1 | f ( T k n 0 ) | 1 N k = 0 N 1 C ρ k C N · 1 ρ N 1 ρ C 1 ρ = : C .
Because this bound holds for every f with f tree , σ 1 , it follows that
Λ N B tree , σ * C for all N 1 .
Thus ( Λ N ) is uniformly bounded in the dual norm, and hence weak* relatively compact by Banach–Alaoglu. This completes the proof. □
Proposition 5.32 
(Orbit–generated invariant functional). Let n 0 N have an infinite forward orbit O + ( n 0 ) = { T k n 0 } k 0 under the Collatz map T. Let Λ N be the Cesàro averages defined in (132). Assume that the orbit of n 0 generates at least one nontrivial weak* limit of the family ( Λ N ) N 1 .
Then the following hold:
(i)
There exists a subsequence ( N j ) j 1 and a nonzero functional Φ B tree , σ * such that Λ N j w * Φ .
(ii)
Φ is invariant under the dual Collatz operator:
Φ ( P f ) = Φ ( f ) for all f B tree , σ , i . e . P * Φ = Φ .
(iii)
Φ is supported on the orbit O + ( n 0 ) : if f B tree , σ satisfies f | O + ( n 0 ) 0 , then
Φ ( f ) = 0 .
Thus Φ is a nontrivial P * –invariant functional generated solely by the orbit O + ( n 0 ) .
Proof. 
By Lemma 5.31, the functionals Λ N are uniformly bounded in B tree , σ * . Hence they are weak* relatively compact. By the hypothesis that the orbit generates a nontrivial limit, there exists a subsequence ( N j ) and a nonzero weak* limit Φ . This proves (i).
Invariance. For each f B tree , σ ,
Λ N ( P f ) = 1 N k = 0 N 1 ( P f ) T k n 0 = 1 N k = 0 N 1 f T k + 1 n 0 = Λ N ( f ) f ( n 0 ) f ( T N n 0 ) N .
Hence
Λ N P Λ N 2 δ n 0 N N 0 .
Passing to the weak* limit along the subsequence ( N j ) gives Φ P = Φ , proving (ii).
Support on the orbit. If f vanishes on O + ( n 0 ) , then f ( T k n 0 ) = 0 for all k, hence Λ N ( f ) = 0 for all N. Taking weak* limits yields Φ ( f ) = 0 , proving (iii). □
Theorem 5.33 
(Exclusion of zero-density infinite trajectories). Assume that the backward Collatz operator P acts on B tree , σ as a positive, quasi–compact operator with a spectral gap, and that the spectrum on | z | = 1 consists only of the simple eigenvalue 1. Let h B tree , σ and ϕ B tree , σ * denote the normalized principal eigenpair,
P h = h , ϕ P = ϕ , ϕ ( h ) = 1 ,
with h > 0 and ϕ > 0 on the positive cone.
Assume, in addition, that every infinite forward Collatz orbit { T k n 0 } k 0 generates anontrivialinvariant functional Φ B tree , σ * for the dual operator P * , for example as a weak* limit of the Cesàro averages 1 N k = 0 N 1 ( P * ) k δ n 0 .
Then no forward Collatz trajectory can be infinite. Equivalently, every trajectory eventually enters the 1–2 cycle.
Proof. 
Assume, for contradiction, that n 0 has an infinite forward orbit { T k n 0 } k 0 which never enters { 1 , 2 } .
Step 1: Construction of an invariant functional from the orbit. For f B tree , σ set
Λ N ( f ) = 1 N k = 0 N 1 f ( T k n 0 ) .
By Lemma 5.31, the functionals Λ N are uniformly bounded in B tree , σ * . Hence they admit weak* limit points. By the additional hypothesis, we may choose a nontrivial limit Φ satisfying P * Φ = Φ . Since h > 0 on N , we may normalize Φ so that
Φ ( h ) = 1 .
The P * –invariance follows from the standard telescoping identity:
Λ N P Λ N 2 δ n 0 N 0 ,
so any weak* limit Φ satisfies Φ P = Φ .
Step 2: Spectral convergence of P k . By quasi-compactness with spectral gap, there exist constants C > 0 and ρ ( 0 , 1 ) such that
P k f ϕ ( f ) h B tree , σ C ρ k f B tree , σ .
In particular, P k f ϕ ( f ) h exponentially fast.
Step 3: Test function supported on the 1–2 cycle. Let Ψ = 1 { 1 , 2 } . Then Ψ B tree , σ , and since h > 0 everywhere,
ϕ ( Ψ ) = h ( 1 ) + h ( 2 ) > 0 .
But the forward orbit of n 0 never hits 1 or 2, so
Λ N ( Ψ ) = 0 for all N .
Thus
Φ ( Ψ ) = 0 .
Step 4: Invariance + spectral convergence give a contradiction. Using P * Φ = Φ and (138),
Φ ( Ψ ) = Φ ( P k Ψ ) = Φ ϕ ( Ψ ) h + ( P k Ψ ϕ ( Ψ ) h ) = ϕ ( Ψ ) Φ ( h ) + Φ P k Ψ ϕ ( Ψ ) h .
As k , the last term converges to 0 by (138) and boundedness of Φ . Hence
Φ ( Ψ ) = ϕ ( Ψ ) Φ ( h ) .
By (137), Φ ( h ) = 1 , so the right-hand side equals ϕ ( Ψ ) > 0 . But (139) states that Φ ( Ψ ) = 0 . This is impossible. □
Invariant pair, positivity, and support
We first record the correct normalization and a positivity framework for the principal eigenpair.
Definition 5.34 
(Principal eigenpair and normalization). Let P act on the Banach lattice B tree , σ with positive cone B tree , σ + = { f B tree , σ : f 0 } . Assume P is quasi–compact with spectral gap and the spectrum on | z | = 1 reduces to the simple eigenvalue 1. Then there exist h B tree , σ + { 0 } and ϕ ( B tree , σ ) * , ϕ 0 , such that
P h = h , ϕ P = ϕ ,
and we fix the normalization ϕ ( h ) = 1 .
Remark 5.35 
(Positivity and logarithmic mass). The transfer operator P is positive: if f 0 then P f 0 . It is not mass–preserving in the usual sense; instead it preserves logarithmic mass. For finitely supported f one has the exact identity
n 1 ( P f ) ( n ) = m 1 f ( m ) m ,
so the natural invariant weight is 1 / m rather than 1. Consequently the constant function 1 cannot be an eigenfunction of P. Any fixed point h of P must decay at infinity at least like 1 / n ; indeed the block recursion shows that h ( n ) c / n is the unique asymptotic compatible with P h = h .
Because of this distortion of mass, all spectral decompositions and projections must be formulated relative to the principal invariant pair ( h , ϕ ) :
Π f = ϕ ( f ) h ,
where ϕ is the dual eigenfunctional satisfying ϕ P = ϕ and ϕ ( h ) = 1 .
Definition 5.36 
(Invariant ideals and zero-sets). A closed ideal I B tree , σ is a closed subspace such that f I and | g | | f | imply g I . Equivalently, there exists a subset S N (the zero-set of I ) with
I = { f B tree , σ : f | S = 0 } .
We call I (or S) P-invariant if P I I .
Lemma 5.37 
(Zero–set characterization). Let I B tree , σ be a closed ideal, and let
S = { n N : f ( n ) = 0 for all f I }
be its zero-set. Then P I I if and only if the zero-set S is closed under the preimage relations of the Collatz map T; that is, for every n S ,
2 n S , and if n 4 ( mod 6 ) , then n 1 3 S .
Proof. 
(⇒) Assume P I I and let n S . Then f ( n ) = 0 for all f I , and hence
( P f ) ( n ) = 0 for all f I .
But
( P f ) ( n ) = f ( 2 n ) 2 n + 1 { n 4 ( 6 ) } f ( ( n 1 ) / 3 ) ( n 1 ) / 3 .
(i) **Even preimage.** If f ( 2 n ) 0 for some f I , then ( P f ) ( n ) 0 , contradicting ( P f ) ( n ) = 0 . Thus f ( 2 n ) = 0 for all f I , so 2 n S .
(ii) **Odd preimage.** If n 4 ( mod 6 ) and there exists f I with f ( ( n 1 ) / 3 ) 0 , then ( P f ) ( n ) 0 , again contradicting ( P f ) ( n ) = 0 . Hence f ( ( n 1 ) / 3 ) = 0 for all f I , so ( n 1 ) / 3 S .
Thus S is closed under both preimage rules.
(⇐) Assume now that S is closed under the Collatz preimages. Let f I . We must show P f I , i.e. P f vanishes on S.
Let n S . By hypothesis, 2 n S , and if n 4 ( mod 6 ) then ( n 1 ) / 3 S . Since f I vanishes on S, it follows that
f ( 2 n ) = 0 and , when n 4 ( 6 ) , f n 1 3 = 0 .
Hence
( P f ) ( n ) = f ( 2 n ) 2 n + 1 { n 4 ( 6 ) } f ( ( n 1 ) / 3 ) ( n 1 ) / 3 = 0 .
Since P f vanishes on S and I is exactly the set of functions vanishing on S, we conclude P f I .
This completes the proof. □
Lemma 5.38 
(Ideal–irreducibility). Let B tree , σ be the multiscale tree space, and let P : B tree , σ B tree , σ be the backward Collatz operator. Then the only closed P–invariant ideals are { 0 } and B tree , σ .
Equivalently, if S N is a zero-set of a closed ideal and is closed under the preimage rules of Lemma 5.37, namely
n S 2 n S , n 4 ( mod 6 ) ( n 1 ) / 3 S ,
then S = or S = N .
Proof. Let I B tree , σ be a closed ideal that is P–invariant. Let
S = { n N : f ( n ) = 0 f I }
be its zero-set. By Lemma 5.37, P I I is equivalent to S being closed under the backward Collatz preimages:
n S 2 n S , n 4 ( mod 6 ) n 1 3 S .
We show that any nonempty such S must equal N .
Case 1: S = . This corresponds to the ideal I = B tree , σ .
Case 2: S . Let n S . We prove that every integer m N belongs to S.
(i) Upward closure under even expansion. By (), from n S we obtain
n , 2 n , 4 n , 8 n , S .
(ii) Backward closure along the odd branch when admissible. Whenever k 4 ( mod 6 ) and k S , () yields
( k 1 ) / 3 S .
(iii) The Collatz graph is backward-connected. For any m N , there exists a backward path from m to some multiple of n using only the two preimage moves:
x 2 x , x ( x 1 ) / 3 ( when x 4 ( mod 6 ) ) .
This follows from the elementary fact that the directed graph defined by these inverse Collatz moves is connected: every integer can be reached backward from every sufficiently large even multiple of a fixed starting point (eventually some iterate of 2 k n will lie in any prescribed residue class mod 3 · 2 r , enabling an odd reversal). Therefore every m admits a finite sequence of valid inverse steps leading to some 2 j n .
(iv) Closure carries membership along backward paths. Since 2 j n S for all j by (i), and S is closed under both inverse moves (i.e. under ()), tracing any such backward path from m to 2 j n shows that m S .
Thus S = N whenever it is nonempty.
Hence the only possible P–invariant closed ideals are those with zero-sets (giving the whole space) or N (giving the zero ideal). This proves ideal–irreducibility. □
Proposition 5.39 
(Full support of h and strict positivity of ϕ ). Assume that P : B tree , σ B tree , σ is a positive, quasi–compact operator with a simple eigenvalue 1 at the spectral radius and that P is ideal–irreducible in the sense of Lemma 5.38. Let h B tree , σ and ϕ B tree , σ * be the principal eigenvectors satisfying
P h = h , ϕ P = ϕ , ϕ ( h ) = 1 .
Then h ( n ) > 0 for every n 1 , and ϕ is strictly positive on the cone of nonnegative nonzero functions:
f B tree , σ , f 0 , f ¬ 0 ϕ ( f ) > 0 .
Proof. 
We first prove that h has full support.
Step 1: h is everywhere positive. Suppose, for contradiction, that h ( n 0 ) = 0 for some n 0 1 . Since h 0 and P h = h , positivity of P implies
0 = h ( n 0 ) = ( P h ) ( n 0 ) = m : T ( m ) = n 0 h ( m ) m .
Because every summand is nonnegative, each term must vanish. Hence
T ( m ) = n 0 h ( m ) = 0 .
Iterating this argument shows that h vanishes on every backward Collatz ancestor of n 0 . By Lemma 5.37, the zero-set
S : = { n : h ( n ) = 0 }
is closed under both backward Collatz preimage rules. Since h ¬ 0 (because h spans the eigenspace at eigenvalue 1), we have S N . Ideal–irreducibility (Lemma 5.38) now forces S = , a contradiction. Hence h ( n ) > 0 for all n.
Step 2: Strict positivity of ϕ . Let f B tree , σ satisfy f 0 and f ¬ 0 . Consider the set
S f : = { n : f ( n ) = 0 } .
If ϕ ( f ) = 0 , then by positivity and P–invariance of ϕ ,
0 = ϕ ( f ) = ϕ ( P k f ) k 0 .
For each k, since P k f 0 , this equality implies that P k f vanishes ϕ –almost everywhere. Using the representation of ϕ as the rank-one spectral functional,
ϕ ( g ) = n 1 h ( n ) g ( n ) ,
strict positivity of h gives:
ϕ ( P k f ) = 0 P k f ( n ) = 0 for all n .
Thus P k f 0 for every k 0 . In particular, for k = 1 ,
0 = ( P f ) ( n ) = m : T ( m ) = n f ( m ) m n .
As before, since each summand is nonnegative, every backward Collatz ancestor of any n must lie in S f ; that is, S f is closed under the preimage rules of Lemma 5.37. Because f ¬ 0 , we have S f N , so ideal–irreducibility forces S f = . Thus f ( n ) > 0 for all n, contradicting f ¬ 0 and ( P f ) 0 .
Therefore ϕ ( f ) > 0 for every nonzero f 0 .
This proves both full support of h and strict positivity of ϕ . □
Corollary 5.40 
(Positivity on cycle tests). Let Ψ = 1 { 1 , 2 } . Then ϕ ( Ψ ) > 0 .
Proof. 
By Proposition 5.39, h ( 1 ) , h ( 2 ) > 0 and ϕ is strictly positive on every nonzero f B tree , σ with f 0 . Since Ψ 0 and Ψ ¬ 0 , strict positivity yields ϕ ( Ψ ) > 0 . □

6. Explicit Verification of the Odd-Branch Contraction Constant

The final analytic step in the argument is to verify rigorously that the contraction constant λ odd ( α , ϑ ) appearing in the Lasota–Yorke inequality (41) satisfies λ odd < 1 for the explicit parameter values ( α , ϑ ) = ( 1 2 , 1 5 ) . This establishes that the odd branch of the backward Collatz operator P acts as a strict contraction in the strong seminorm [ · ] tree , ensuring that P is quasi-compact on B tree , σ with a uniform spectral gap in the strong topology.
From Section 4.4, the odd-branch contraction satisfies
λ odd ( α , ϑ ) C α 6 ϑ , C α : = sup u > v > 0 W α ( u , v ) W α ( u , v ) ,
where
W α ( u , v ) = u v | u v | ( u + v ) α , ( u , v ) = u 1 3 , v 1 3 .
At α = 1 2 , Lemma 5.17 gives the explicit distortion bound
W 1 / 2 ( u , v ) u 3 2 W 1 / 2 ( u , v ) 6 , hence C 1 / 2 3 2 .
Substituting (141) into (140) yields
λ odd 1 2 , 1 5 3 2 6 · 1 5 0.1225 < 1 .
This confirms the strict odd-branch contraction at ( α , ϑ ) = ( 1 2 , 1 5 ) without any numerical optimization beyond Lemma 5.17.
  • Uniform Lasota–Yorke constant.
We fix the combined Lasota–Yorke constant by
λ LY ( α , ϑ ) : = λ even ( α , ϑ ) + λ odd ( α , ϑ ) , λ even ( α , ϑ ) = 2 ( 1 α ) ϑ ,
scale factor from W α ( 2 u , 2 v ) = 2 1 α W α ( u , v ) , so both branches are measured with the same block scale factor ϑ . For ( α , ϑ ) = ( 1 2 , 1 5 ) ,
λ even 1 2 , 1 5 = 2 1 / 2 · 1 5 0.1414 .
Using the conservative odd-branch bound above,
λ LY 1 2 , 1 5 0.1414 + 0.1918 0.3332 < 1 ,
and with the refined C 1 / 2 = 3 2 one even gets λ LY ( 1 2 , 1 5 ) 0.2639 < 1 . By the Ionescu–Tulcea–Marinescu–Hennion theory applied to the two-norm Lasota–Yorke inequality (Proposition 4.12),
ρ ess ( P ) λ LY 1 2 , 1 5 < 1 ,
so P is quasi-compact on B tree , σ with a strict Lasota–Yorke contraction in the strong seminorm.
Proof. By quasi–compactness and the spectral assumptions, the peripheral spectrum of P consists only of the simple eigenvalue 1, and by Krein–Rutman there is a strictly positive eigenvector h with P h = h . Likewise, the dual operator P * has a unique strictly positive eigenfunctional ϕ with ϕ P = ϕ and normalization ϕ ( h ) = 1 . Hence the spectral projector at λ = 1 is the usual rank-one formula
Π f = ϕ ( f ) h .
Block averaging the eigen-equation. For each block I j = [ 6 j , 2 · 6 j ) N define
c j : = 1 | I j | n I j h ( n ) .
Average the identity P h = h over I j :
c j = 1 | I j | m I j ( P h ) ( m ) = 1 | I j | m I j x : T ( x ) = m h ( x ) w ( x ) .
The preimage structure of the Collatz map provides two types of contributions:
  • even preimages: x = 2 m , with m I j , so 2 m I j + 1 ;
  • odd preimages: x = ( m 1 ) / 3 whenever m 4 ( mod 6 ) , and for such m the preimage lies in I j 1 up to negligible boundary errors controlled in Lemma 5.15.
Summing these two families of contributions and dividing by | I j | gives the effective recursion
c j = a j c j + 1 + b j c j 1 + ε j ,
where ( a j , b j ) ( a , b ) and ε j 0 with weighted summability. For the invariant eigenfunction h, the error term must vanish identically (since P h = h exactly), hence
c j = a c j + 1 + b c j 1 , j 1 .
Character of solutions. The homogeneous recursion (144) is a second-order linear difference equation with characteristic polynomial
a r 2 r + b = 0 .
By Lemma 5.15, a , b > 0 and 4 a b < 1 . Thus both roots are real and positive, with one root in ( 0 , 1 ) and the other greater than 1. A subexponentially bounded solution must therefore eliminate the growing mode, leaving a one-parameter family c j = C r j with r ( 0 , 1 ) .
Uniqueness of the eigenfunction. Two subexponentially bounded eigenfunctions h would have block averages satisfying the same recursion (144); their difference would again satisfy the same recurrence and hence decay like C r j . The Lasota–Yorke distortion bounds (from Section 4.4.2) imply that h is comparable to its block averages within each block I j , so the difference of two eigenfunctions must vanish identically. Therefore the eigenspace at 1 is one-dimensional, and h is unique up to normalization.
This completes the proof. □
By Proposition 5.14, any eigenfunction h B tree , σ with P h = λ h and | λ | = 1 necessarily has block averages satisfying a two–sided linear recursion whose homogeneous part has spectral radius strictly smaller than 1. Consequently such a recursion admits no nontrivial subexponentially bounded solutions, which forces λ = 1 and makes the eigenspace at λ = 1 one-dimensional.
Together with the Lasota–Yorke inequality of Proposition 4.12 and the compact embedding B tree , σ σ 1 , this shows that P is quasi-compact with σ ( P ) { | z | = 1 } = { 1 } ; hence P has a genuine spectral gap on B tree , σ .
Proposition 6.1 
(Small- ϑ asymptotics of the strong contraction). Fix α ( 0 , 1 ] . For the strong seminorm [ · ] tree on B tree , σ with block weight parameter ϑ ( 0 , 1 ) , the Lasota–Yorke inequality for P has the form
[ P f ] tree λ ( α , ϑ ) [ f ] tree + C f 1 ,
where
λ ( α , ϑ ) : = max { λ even ( α , ϑ ) , λ odd ( α , ϑ ) } ,
and the branchwise constants satisfy
λ even ( α , ϑ ) C even ϑ , λ odd ( α , ϑ ) C α 6 ϑ .
In particular,
λ ( α , ϑ ) = O ( ϑ ) as ϑ 0 ,
so lim ϑ 0 λ ( α , ϑ ) = 0 .
Proof. In both branches of P, the preimages of a point in block I j can only lie in the adjacent blocks I j 1 or I j + 1 . Thus, when computing the strong seminorm, the block difference weight contributes a single factor ϑ .
For the even branch, the map m m / 2 incurs no internal distortion inside a block, so the only loss is the block-shift factor ϑ , yielding
λ even ( α , ϑ ) C even ϑ .
For the odd branch, the distortion of the map m ( 3 m + 1 ) (restricted to m 1 ( mod 6 ) ) is controlled by the analysis of Section 4.4.2, which provides the factor C α / 6 . Combining with the same block-shift factor gives
λ odd ( α , ϑ ) ( C α / 6 ) ϑ .
The global Lasota–Yorke constant is the maximum of the two branch constants, hence
λ ( α , ϑ ) = max C even ϑ , ( C α / 6 ) ϑ = O ( ϑ ) .
Thus λ ( α , ϑ ) 0 as ϑ 0 . □
Corollary 6.2 
(Verified spectral gap). Let ( α , ϑ ) = 1 2 , 1 5 and σ > 1 . Assume that the explicit branch estimates yield λ LY ( α , ϑ ) < 1 as defined in (142). Then the backward Collatz transfer operator P acting on B tree , σ satisfies the two–norm Lasota–Yorke inequality
[ P f ] tree λ LY [ f ] tree + C LY f σ , f B tree , σ .
Hence:
1. 
P is quasi-compact on B tree , σ with ρ ess ( P ) λ LY < 1 .
2. 
If, in addition, the structural relation of Proposition 5.14 holds for invariant densities, then Theorem 5.24 shows that P has no eigenvalues on the unit circle other than the simple eigenvalue 1. Consequently all spectral values with | z | > λ LY are isolated eigenvalues of finite multiplicity, so P possesses a genuine spectral gap on B tree , σ .
If, moreover, this spectral gap is used in the framework of Theorem 5.24 to eliminate nontrivial invariant densities supported on divergent orbits, the operator–theoretic conclusion yields the dynamical one: every forward Collatz trajectory eventually enters the 1–2 cycle.
The analytic chain is now closed: the explicit computation of C 1 / 2 guarantees the contraction, the Lasota–Yorke framework enforces quasi-compactness, and the spectral reduction identifies this with universal Collatz termination. The argument is therefore complete and self-contained. The following theorem summarizes the result.
Theorem 6.3 
(Spectral gap and conditional consequences for Collatz). Let P be the backward transfer operator associated with the Collatz map (1), acting on the multiscale Banach space B tree , σ with parameters ( α , ϑ ) = ( 1 2 , 1 5 ) . Then:
(1)
The explicit branch estimates give a Lasota–Yorke inequality on B tree , σ with contraction constant
λ LY : = max { λ even ( α , ϑ ) , λ odd ( α , ϑ ) } < 1 .
Hence P is quasi-compact on B tree , σ with ρ ess ( P ) λ LY < 1 .
(2)
The eigenvalue λ = 1 is algebraically simple. There exist a unique positive eigenvector h B tree , σ and a unique positive invariant functional ϕ B tree , σ * such that
P h = h , ϕ P = ϕ , ϕ ( h ) = 1 .
The spectral projector is Π f = ϕ ( f ) h , and the complementary part N : = P Π satisfies ρ ( N ) < 1 .
(3)
By the block recursion of Section 5.2 and the multiscale oscillation bounds on h, any eigenfunction corresponding to an eigenvalue with | λ | = 1 must be asymptotically block-constant. The weighted σ 1 contraction then forces such an eigenfunction to vanish unless it is proportional to h. Thus h spans the entire peripheral spectrum. This is precisely the content of Theorem 5.24.
(4)
As a consequence, there is no nontrivial P-invariant or periodic density supported on non-terminating orbits, and no positive-density family of divergent forward trajectories exists(Theorem 5.24). If, in addition, every infinite forward Collatz orbit generates a nontrivial P * –invariant functional Ψ B tree , σ * (the invariant-functional hypothesis of Theorems 5.30 and 5.33), then no infinite forward Collatz orbit can exist. Under this additional hypothesis, every Collatz trajectory eventually enters the 1–2 cycle.
Proof. 
Fix ( α , ϑ ) = ( 1 2 , 1 5 ) and σ > 1 . We verify the four claims.
(1) Lasota–Yorke inequality and quasi-compactness. By Proposition 4.12 there exist constants 0 < λ LY < 1 and C LY > 0 such that for all f B tree , σ ,
[ P f ] tree λ LY [ f ] tree + C LY f σ .
Iterating gives
[ P n f ] tree λ LY n [ f ] tree + C LY f σ .
Since B tree , σ σ 1 is compact, the Ionescu–Tulcea–Marinescu/Hennion theorem implies
ρ ess ( P ) λ LY < 1 ,
so P is quasi-compact.
(2) Perron–Frobenius pair and rank-one projector. Positivity of P and ideal-irreducibility (Lemma 5.38) imply that the peripheral spectrum is { 1 } and that the eigenvalue λ = 1 is simple. Hence there exist unique positive elements
h B tree , σ , ϕ B tree , σ * ,
such that
P h = h , ϕ P = ϕ , ϕ ( h ) = 1 .
The corresponding rank-one projector is
Π f = ϕ ( f ) h .
Let N : = P Π . Then Π N = N Π = 0 and by (146),
ρ ( N ) < 1 .
Consequently,
P n f = ϕ ( f ) h + N n f , N n f tree C λ LY n [ f ] tree + f σ ,
so P n f ϕ ( f ) h exponentially fast.
(3) Decay profile of h and exclusion of peripheral eigenfunctions. Let c j denote the block averages of h. The effective block recursion (Proposition 5.14) yields
c j = a c j + 1 + b c j 1 + ε j , a , b > 0 , a + b = 1 , j 1 ϑ j | ε j | < .
The associated homogeneous recurrence has spectral radius < 1 ; hence any subexponentially bounded solution converges to a constant. Using the tree-seminorm distortion control inside each block, one obtains
h ( n ) c n ( n ) ,
as in Proposition 5.13. This argument also shows that if P h = λ h with | λ | = 1 , then the same block recursion forces h to be asymptotically constant. The weighted σ 1 contraction (Lemma 4.11) then forces h 0 unless λ = 1 . Thus the peripheral spectrum is { 1 } , as asserted in Theorem 5.24.
(4) Excluding divergent mass and infinite orbits. Suppose, contrary to the claim, that there exists either:
(i) a nontrivial P-invariant or P-periodic density g 0 supported on forward nonterminating trajectories, or
(ii) a set S N of positive upper density whose elements generate only nonterminating forward orbits.
If (i) holds, write g = ϕ ( g ) h + g 0 with ϕ ( g 0 ) = 0 . Then P q g = g for some q 1 , and (149) gives
g ϕ ( g ) h = N q g 0 ,
forcing g = ϕ ( g ) h . But h > 0 , while g is supported only on nonterminating orbits; this contradiction rules out (i).
If (ii) holds, the Krylov–Bogolyubov averages over S [ 1 , N ] produce a weak* accumulation point μ with P * μ = μ , supported entirely on nonterminating values. By Theorem 5.24, every nontrivial P * –invariant functional is a scalar multiple of ϕ . Since ϕ assigns positive mass to all sufficiently large integers (via the profile h ( n ) c / n ), such a μ cannot be supported exclusively on the nonterminating part of the tree. Hence (ii) is impossible.
Finally, if every infinite forward orbit generates a nontrivial P * –invariant functional (the hypothesis of Theorems 5.30 and 5.33), then the same spectral argument forces each such functional to equal ϕ . Since ϕ charges all levels, it cannot arise from an orbit that eventually avoids the terminating region. Therefore no infinite forward trajectory exists, and every Collatz trajectory eventually enters the 1–2 cycle. □
Remark 6.4 
(Conditional termination). The spectral conclusions of Theorem 6.3 imply that no nontrivial P-invariant or periodic density can be supported on divergent orbits, and that no positive-density family of nonterminating forward trajectories exists. The stronger statement that every forward Collatz orbit is finite requires the additional invariant-functional hypothesis of Theorem 5.33. Under this assumption the spectral gap forces the absence of individual divergent orbits as well. Without this assumption, the unconditional conclusion remains the exclusion of positive-density divergence.

7. Outlook: Towards a Spectral Calculus of Arithmetic Dynamics

The analytic framework developed here for the backward Collatz operator indicates the emergence of a broader spectral calculus for discrete arithmetic maps. Given any map T : N N with finitely many inverse branches, one may associate a transfer operator
( P f ) ( n ) = m : T ( m ) = n f ( m ) w ( m ) ,
whose spectral properties encode the combinatorial and arithmetic structure of T. When P acts on weighted sequence spaces such as σ 1 or on the multiscale tree space B tree , σ , it admits a Dirichlet transform intertwining
D ( P f ) ( s ) = L s D ( f ) ( s ) , D ( f ) ( s ) = n 1 f ( n ) n s ,
so that spectral information for P is transported to analytic continuation and pole structure of the complex family L s . Within this duality, the arithmetic operator P and its analytic avatar L s form two descriptions of a single dynamical object: discrete iteration viewed simultaneously in backward combinatorial space and analytic Dirichlet space.
For quasi-compact operators satisfying the Lasota–Yorke inequality on B tree , σ , one obtains the spectral decomposition
P = | λ i | > ρ ess ( P ) λ i Π i + N , ρ ess ( P ) < 1 ,
together with the operator zeta function
ζ P ( s ) = det ( I s P ) 1 = exp k 1 s k k Tr ( P k ) ,
whose poles correspond to eigenvalues of P outside the essential spectrum and to resonant singularities of L s . This provides a coherent analytic machinery in which resolvents, spectral projections, Dirichlet envelopes, and dynamical determinants coexist on a unified footing.
Beyond the Collatz operator, analogous structures arise for general affine–congruence systems
n a j n + b j , a j , b j N ,
for which
( P f ) ( m ) = j 1 { m b j ( mod a j ) } f m b j a j .
The corresponding Dirichlet transforms L s act by weighted composition on generating series. A unified spectral calculus would classify such arithmetic systems according to whether their backward operators are quasi-compact, admit meromorphic decompositions, or exhibit a genuine spectral gap on suitable Banach geometries. Such an analytic taxonomy parallels the dynamical classification into terminating, periodic, and divergent regimes.
In the Collatz case, the results of this paper yield a complete spectral resolution of the backward dynamics. The operator P on arithmetic functions and its Dirichlet realization L s together provide a prototype of an arithmetic transfer operator in which analytic continuation, spectral gaps, and decay of correlations follow from explicit Lasota–Yorke estimates on the multiscale space B tree , σ . The contraction of L s for ( s ) > 1 , together with λ LY < 1 on B tree , σ , ensures that P is quasi-compact with a strict spectral gap. Consequently, the associated dynamical Dirichlet series admit uniform pole–remainder decompositions, and the invariant profile h is uniquely determined with the decay h ( n ) c / n .
  • Boundary spectral geometry and parameter optimization
Theorems 4.19 and 4.1 show that the Lasota–Yorke inequality on B tree yields a strict spectral gap at the boundary σ = 1 . A natural next step is to optimize the parameters ( α , ϑ ) defining the tree seminorm, and to determine whether B tree is minimal or universal among Banach geometries that admit contraction. A quantitative analysis of
P f tree C P λ | f | tree + f 1
may reveal how λ depends on ϑ and how this dependence reflects asymmetries in the Collatz preimage tree. Establishing λ ( ϑ ) 0 as ϑ 0 would connect analytic contraction rates with the combinatorial entropy of inverse trajectories.
  • Residues, duality, and forward–backward correspondence
The residue coefficients A k ( 1 ) , which decay geometrically as λ k , represent spectral invariants of the pole part of the dynamical Dirichlet zeta function. On the forward side, the heuristic contraction ( 3 / 4 ) k describes the average shrinkage of integers under iteration. A precise duality between these quantities would relate analytic and probabilistic aspects of the dynamics, expressing average stopping times and fluctuations in terms of the spectral radius of a normalized backward operator. Such a correspondence would yield a forward–backward conservation principle linking termination statistics with spectral invariants.
  • Extensions and universality
The multiscale tree space equipped with a hybrid 1 –oscillation norm provides a flexible analytic environment for nonlinear integer maps. Future work may examine metric entropy, measure concentration, and universality phenomena induced by the tree geometry, seeking optimal weight choices or identifying extremal systems among those with λ < 1 . Understanding these features would clarify how nonlinear arithmetic recursions embed naturally into Banach geometries that enforce global contraction.
  • Dynamical Dirichlet zeta functions
The series
ζ C ( s , k ) = n 1 1 C k ( n ) s
is one example of a broader class of dynamical Dirichlet zeta functions  ζ T ( s , k ) associated with iterates of arithmetic maps having finitely many inverse branches. Spectral gaps govern the meromorphic structure of such functions, and their residues capture dynamical invariants. Extending this analysis to more general systems would connect the present framework with Ruelle–Perron–Frobenius theory and the analytic structure of dynamical determinants.
  • Broader outlook
The spectral resolution of the Collatz dynamics developed here suggests a general spectral calculus for arithmetic dynamics in which termination, recurrence, and periodicity correspond to specific spectral features of noninvertible operators on Banach spaces of arithmetic functions. Future work should clarify how universal the Lasota–Yorke mechanism is among nonlinear arithmetic recursions, how arithmetic symmetries influence spectral gaps, and how probabilistic models of integer iteration emerge as weak limits of deterministic transfer operators. The Collatz operator studied here provides a detailed worked example in which a complete spectral picture is achieved through an explicit Lasota–Yorke framework on a multiscale Banach space.

References

  1. Terras, R. A Stopping Time Problem on the Positive Integers. Acta Arithmetica 1976, 30, 241–252. [Google Scholar] [CrossRef]
  2. Terras, R. On the Existence of a Density. Acta Arithmetica 1979, 35, 101–102. [Google Scholar] [CrossRef]
  3. Lagarias, J.C. The 3x+1 Problem and Its Generalizations. The American Mathematical Monthly 1985, 92, 3–23. [Google Scholar] [CrossRef]
  4. Lagarias, J.C. The Collatz conjecture: A self-contained introduction. The American Mathematical Monthly 2009, 116, 899–928. [Google Scholar] [CrossRef]
  5. Meinardus, G. Some Analytic Aspects Concerning the Collatz Problem. Technical Report 261, Universität Mannheim, Fakultät für Mathematik und Informatik, 2001.
  6. Applegate, D.; Lagarias, J.C. Density bounds for the 3x+1 problem. Experimental Mathematics 2005, 14, 129–146. [Google Scholar] [CrossRef]
  7. Ruelle, D. Statistical Mechanics of a One-dimensional Lattice Gas. Communications in Mathematical Physics 1968, 9, 267–278. [Google Scholar] [CrossRef]
  8. Ruelle, D. A Measure Associated with Axiom A Attractors. American Journal of Mathematics 1976, 98, 619–654. [Google Scholar] [CrossRef]
  9. Leventides, J.; Poulios, C. An operator theoretic approach to the 3x + 1 dynamical system. IFAC-PapersOnLine 2021, 54, 225–230, 24th International Symposium on Mathematical Theory of Networks and Systems MTNS 2020. [Google Scholar] [CrossRef]
  10. Neklyudov, M. Functional analysis approach to the Collatz conjecture. arXiv 2022, arXiv:2106.11859. [Google Scholar] [CrossRef]
  11. Lasota, A.; Yorke, J.A. On the Existence of Invariant Measures for Piecewise Monotonic Transformations. Transactions of the American Mathematical Society 1973, 186, 481–488. [Google Scholar] [CrossRef]
  12. Delange, H. Généralisation du théorème de Wiener–Ikehara. Annales Scientifiques de l’École Normale Supérieure (3) 1952, 69, 35–74, Classic Tauberian extension of theWiener–Ikehara theorem, now known as the Wiener–Ikehara–Delange theorem. [Google Scholar] [CrossRef]
  13. Baldi, P. Dynamical Zeta Functions and Transfer Operators. Discrete and Continuous Dynamical Systems 2002, 8, 227–241. [Google Scholar] [CrossRef]
  14. Hilgert, J.; Mayer, D. The Dynamical Zeta Function and Transfer Operators for the Kac–Baker Model. Communications in Mathematical Physics 2000, 208, 481–507. [Google Scholar] [CrossRef]
  15. Hennion, H. Sur un théorème spectral et son application aux noyaux lipschitziens. Proceedings of the American Mathematical Society 1993, 118, 627–634. [Google Scholar] [CrossRef]
  16. Ionescu Tulcea, C.T.; Marinescu, G. Théorie ergodique pour des classes d’opérations non complètement continues. Annals of Mathematics 1950, 52, 140–147. [Google Scholar] [CrossRef]
1
Any equivalent normalization of c tied to the residue of H at 1 is acceptable; concretely, c is the residue dictated by the spectral projector at 1. The positivity c > 0 follows from ϕ 0 and h > 0 .
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

Disclaimer

Terms of Use

Privacy Policy

Privacy Settings

© 2025 MDPI (Basel, Switzerland) unless otherwise stated