Preprint
Article

This version is not peer-reviewed.

The Collatz Conjecture and the Spectral Calculus for Arithmetic Dynamics

Submitted:

19 November 2025

Posted:

19 November 2025

Read the latest preprint version here

Abstract
We develop an operator--theoretic framework for the Collatz map based on its backward transfer operator acting on weighted Banach spaces of arithmetic functions. The associated Dirichlet transforms form a holomorphic family that captures the complex--analytic evolution of iterates and admits a decomposition into a zeta--type pole at $s=1$ and a holomorphic remainder. Within a finer multiscale space adapted to the Collatz preimage tree, we establish a Lasota--Yorke inequality with an explicit contraction constant $\lambda<1$, giving quasi--compactness and a spectral gap at the dominant eigenvalue. The resulting invariant density is strictly positive and exhibits a $c/n$ decay profile. We formulate a general criterion showing that, under a verified quasi--compactness hypothesis with isolated eigenvalue $1$, the forward dynamics admit no infinite trajectories. The framework provides a coherent spectral perspective on the Collatz operator and suggests a broader analytic approach to arithmetic dynamical systems.
Keywords: 
;  ;  ;  ;  ;  ;  

1. Introduction

The Collatz conjecture asserts that every positive integer n eventually reaches the 1–2 cycle under repeated application of
T ( n ) = n / 2 , n even , 3 n + 1 , n odd .
Equivalently, every forward orbit O + ( n ) = { T k ( n ) : k 0 } is conjectured to terminate in { 1 , 2 } . Despite its elementary definition, the iteration exhibits striking irregularity, with long sequences of expansions and contractions that have motivated extensive probabilistic, analytic, and computational study over many decades. Classical work of Terras [15,16] established early density results and stopping-time estimates, while the surveys of Lagarias [7,8] synthesized a wide range of heuristic and structural approaches. Subsequent analytic contributions, including those of Meinardus [11] and Applegate–Lagarias [1], have developed refined density bounds and asymptotic estimates for the distribution of orbits. Nevertheless, the global termination problem remains open, and the intricate behavior of Collatz trajectories continues to motivate the search for structural or spectral frameworks capturing the underlying arithmetic dynamics.
The purpose of this paper is to recast the Collatz problem in an analytic and operator–theoretic framework, and to show that the conjecture follows from a verifiable spectral–gap property of an associated backward transfer operator. Instead of studying T directly, we analyze its inverse dynamics through the operator
( P f ) ( n ) : = m : T ( m ) = n f ( m ) m ,
acting on arithmetic functions f : N C . Transfer–operator methods of this type originate in statistical mechanics and dynamical systems [13,14], and have more recently been applied to 3 x + 1 –type maps in various analytic and functional–analytic contexts [10,12]. For the Collatz map (1), each n has an even preimage 2 n and an additional odd preimage ( n 1 ) / 3 whenever n 4 ( mod 6 ) , giving
( P f ) ( n ) = f ( 2 n ) 2 n + 1 { n 4 ( mod 6 ) } f ( ( n 1 ) / 3 ) ( n 1 ) / 3 .
The weights 1 / m normalize the operator so that P acts as a mass–preserving average on non-negative 1 sequences, reflecting the logarithmic contraction inherent in the preimage structure of T.
Remark 1 
(Invariant density and logarithmic mass balance). Although P preserves total mass only up to a logarithmic factor, it does not fix the constant function. Indeed,
( P 1 ) ( n ) = 1 2 n + 1 { n 4 ( mod 6 ) } 3 n 1 C n ( n ) ,
so ( P 1 ) 1 . More generally,
n 1 ( P f ) ( n ) = m 1 f ( m ) m ,
which shows that P islogarithmically mass–preserving: the pushforward of mass is reweighted by the harmonic kernel m 1 / m .
This logarithmic balance forces any P–invariant density h to satisfy P h = h with a decay of order 1 / n as n . In particular, the explicit block recursion developed in Section 5.2, together with the oscillation control provided by the Lasota–Yorke inequality [9], yields the precise asymptotic profile
h ( n ) c n , n ,
consistent with Tauberian heuristics of Delange type [3]. All spectral decompositions in the sequel are expressed relative to this nonconstant 1 / n –type invariant profile.
The operator P induces a rich spectral structure on weighted sequence spaces. On σ 1 , defined by f σ = n 1 | f ( n ) | n σ , the Dirichlet transform
D f ( s ) = n 1 f ( n ) n s ,
intertwines P with analytic continuation in the half-plane ( s ) > σ . Uniform σ 1 bounds on P k translate into exponential envelopes for D ( P k f ) ( s ) and yield meromorphic continuations of the corresponding Collatz–Dirichlet series, whose pole at s = 1 reflects the average branching behavior [2,5]. The spectral radius of P on σ 1 captures the global weighted expansion rate of inverse branches and determines the analytic location of dominant singularities.
To resolve finer dynamical properties, we refine this setting to a multiscale Banach space B tree , σ built from dyadic–triadic block averages and oscillation seminorms that encode the hierarchical structure of the Collatz preimage tree. On this space, P satisfies a two-norm Lasota–Yorke inequality,
[ P f ] tree , σ λ LY [ f ] tree , σ + C f σ , 0 < λ LY < 1 ,
placing the dynamics within the classical Ionescu–Tulcea–Marinescu and Hennion spectral frameworks for quasi–compact operators [4,6]. The precise Lasota–Yorke bounds, including the explicit contraction of the odd branch, are developed in Section 4, Section 5 and Section 6.
The main theorem of the paper establishes that when the odd-branch contraction constant λ odd ( α , ϑ ) satisfies λ odd < 1 for specific parameters ( α , ϑ ) = ( 1 2 , 1 5 ) , the backward Collatz operator P possesses a strict spectral gap on B tree , σ . The spectral decomposition then implies that every invariant measure of P is supported on the 1–2 cycle, ruling out any positive-density family of divergent or periodic orbits. A strengthened criterion shows that a non-trivial invariant functional in B tree , σ * would contradict the spectral gap, hence all Collatz trajectories must terminate.
The remainder of the paper is organized as follows. Section 2 establishes notation and basic properties of the weighted σ 1 spaces together with the associated Dirichlet transforms. Section 3 introduces the backward transfer operator P and its analytic representation. Section 4 constructs the multiscale space B tree , σ adapted to the Collatz preimage tree and proves the corresponding Lasota–Yorke inequalities. Section 6 verifies that the odd branch admits an explicit contraction constant λ odd < 1 for the chosen parameters, yielding quasi–compactness and a spectral gap. Finally, Section 7 develops the resulting spectral consequences, formulating a general criterion that links quasi–compactness with the absence of infinite forward trajectories, and situating the Collatz operator within a broader analytical framework for arithmetic dynamical systems.

2. Preliminaries

The analysis begins with a careful description of the function spaces, Dirichlet transforms, and basic structural features of the Collatz map that underlie the spectral study of the backward operator P. Throughout we work with complex-valued arithmetic functions f : N C .

2.1. Weighted 1 Spaces and Dirichlet Transforms

For σ > 0 we define the weighted 1 space
σ 1 : = f : N C : f σ : = n 1 | f ( n ) | n σ < .
The weight exponent σ measures polynomial decay and is chosen so that Dirichlet series associated with f converge absolutely in a half-plane ( s ) > σ .
Given f σ 1 , we define its Dirichlet transform
D f ( s ) : = n 1 f ( n ) n s , ( s ) > σ .
Lemma 1 
(Dirichlet convergence). Let σ > 0 and f σ 1 . Then D f ( s ) in (7) converges absolutely for ( s ) > σ and defines a bounded holomorphic function on every half-plane ( s ) σ + ε , ε > 0 . Moreover,
| D f ( s ) | f σ sup n 1 n σ ( s ) .
Proof. 
For ( s ) > σ ,
n 1 f ( n ) n s = n 1 | f ( n ) | n ( s ) = n 1 | f ( n ) | n σ n σ ( s ) f σ sup n 1 n σ ( s ) < ,
so the series converges absolutely and locally uniformly in ( s ) σ + ε , giving holomorphy and the bound (8). □
We write 1 = 0 1 for the unweighted space with norm f 1 = n 1 | f ( n ) | .

2.2. Coarse Forward Envelopes

Lemma 2 
(Coarse k-step envelopes). Let T : N N denote the Collatz map, (1). For every n N and k N 0 ,
n 2 k T k ( n ) 3 k n + 3 k 1 2 .
Proof. 
The proof proceeds as before: T ( x ) x / 2 and T ( x ) 3 x + 1 , and induction gives the bounds (9). □
These envelopes are intentionally crude, yet they ensure that forward iterates of typical arithmetic weights remain controlled on the scales relevant for our Dirichlet and transfer-operator analysis.

2.3. Backward Preimages and the Transfer Recursion

For each n 1 , define the even and odd preimage sets
E ( n ) : = { m N : T ( m ) = n , m even } , O ( n ) : = { m N : T ( m ) = n , m odd } .
Lemma 3 
(Preimage structure). For every n N ,
E ( n ) = { 2 n } , O ( n ) = { ( n 1 ) / 3 } , n 4 ( mod 6 ) , , otherwise ,
and in the first case ( n 1 ) / 3 is odd. In particular, each n has either one preimage (even) or two preimages (one even and one odd), and the odd preimage occurs with natural density 1 / 6 .
Proof. 
If m is even and T ( m ) = n , then m / 2 = n , so m = 2 n , establishing E ( n ) = { 2 n } .
If m is odd and T ( m ) = n , then 3 m + 1 = n , so m = ( n 1 ) / 3 . This is an integer precisely when n 1 ( mod 3 ) . For m to be odd, n 1 must be divisible by 3 but not by 6, so n 4 ( mod 6 ) . In that case ( n 1 ) / 3 is odd. The density statement follows since the congruence class n 4 ( mod 6 ) has natural density 1 / 6 . □
Hence each n admits exactly one even preimage and possibly one odd preimage when n 4 ( mod 6 ) . The corresponding backward transfer operator is defined as
( P f ) ( n ) : = m : T ( m ) = n f ( m ) m = f ( 2 n ) 2 n + 1 { n 4 ( 6 ) } f n 1 3 ( n 1 ) / 3 .
The normalization by 1 / m reflects the logarithmic contraction of the forward map and ensures a natural mass-balance property.
Lemma 4 
(Mass preservation on 1 ). If f 0 and f 1 , then
n 1 ( P f ) ( n ) = m 1 f ( m ) .
Proof. 
Each m contributes exactly once to the double sum n 1 m : T ( m ) = n f ( m ) m , so equality (12) follows directly from (11). □

2.4. Dirichlet Envelope for Iterates of the Backward Operator

The preimage structure allows a crude but useful bound on P acting on σ 1 .
Proposition 1 
(Backward operator bound). Let σ > 0 and let P be defined by (11). Then P : σ 1 σ 1 is bounded and
P f σ C σ f σ , C σ : = 2 σ + 3 σ ,
for all f σ 1 . Consequently, for every k 1 ,
P k f σ C σ k f σ .
Proof. 
From (11),
( P f ) ( n ) = f ( 2 n ) 2 n + 1 { n 4 ( 6 ) } f n 1 3 ( n 1 ) / 3 .
Hence
P f σ S even + S odd ,
with
S even : = n 1 | f ( 2 n ) | 2 n n σ , S odd : = n 1 n 4 ( 6 ) f n 1 3 n 1 3 n σ .
For the even branch, set m = 2 n , so n = m / 2 and
S even = m 1 m even | f ( m ) | m ( m / 2 ) σ = m 1 m even 2 σ | f ( m ) | m σ + 1 2 σ m 1 | f ( m ) | m σ = 2 σ f σ .
For the odd branch, write m = ( n 1 ) / 3 , so n = 3 m + 1 and m is odd. Then
S odd = m 1 m odd | f ( m ) | m ( 3 m + 1 ) σ m 1 | f ( m ) | m ( 3 m ) σ = 3 σ m 1 | f ( m ) | m σ + 1 3 σ f σ .
Combining the two estimates gives (13), and iterating yields (14). □
The constant C σ = 2 σ + 3 σ is an explicit growth factor for P on σ 1 . It is not < 1 in this normalization, so no contraction is claimed at this level. The genuine contraction mechanism is obtained later on the multiscale Banach space B tree , where a strong seminorm captures oscillatory decay along the Collatz tree while the 1 component provides compactness.

3. Transfer Operator Formulation

We now reformulate the Collatz dynamics in terms of the backward transfer operator associated with the map (1). This operator-theoretic viewpoint provides an analytic bridge between the discrete recurrence and the functional framework developed in later sections. The transfer operator encodes the inverse–branching structure of the map and propagates densities backward along the Collatz tree, in a form compatible with logarithmic weighting and Dirichlet series.
Recall that the Collatz map, (1), by Lemma 3, each n 1 has the even preimage 2 n , together with an additional odd preimage ( n 1 ) / 3 precisely when n 4 ( mod 6 ) .

3.1. Backward Transfer Operator

Definition 1 
(Backward transfer operator). For an arithmetic function f : N C , define
( P f ) ( n ) : = m : T ( m ) = n f ( m ) m = f ( 2 n ) 2 n + 1 { n 4 ( 6 ) } f n 1 3 ( n 1 ) / 3 , n N ,
where 1 A denotes the indicator of the condition A.
The multiplicative factor 1 / m assigns to each inverse branch a logarithmic weight, so that P acts as a normalized backward average along preimages. This normalization aligns the discrete dynamics with Dirichlet weights and will be crucial for analytic continuation and spectral estimates below.
Positivity. If f ( n ) 0 for all n, then ( P f ) ( n ) 0 for all n, since P is a positive linear combination of values of f.
Weighted mass preservation. A direct change of variables shows that for every nonnegative f satisfying m 1 | f ( m ) | / m < ,
n 1 ( P f ) ( n ) = m 1 f ( m ) m .
Thus P preserves the logarithmically weighted mass f ( m ) / m ; plain 1 mass is not preserved under this normalization.
Boundedness on weighted spaces. Let
σ 1 : = f : N C : f σ 1 : = n 1 | f ( n ) | n σ < , σ > 0 .
A direct change of variables in (15) yields, for all f σ 1 ,
P f σ 1 = n 1 | ( P f ) ( n ) | n σ n 1 | f ( 2 n ) | 2 n 1 + σ + 1 { n 4 ( 6 ) } f ( ( n 1 ) / 3 ) ( ( n 1 ) / 3 ) 1 + σ = 1 2 n 1 | f ( 2 n ) | n 1 + σ + 3 1 + σ n 1 n 4 ( 6 ) | f ( ( n 1 ) / 3 ) | ( n 1 ) 1 + σ .
Changing variables m = 2 n in the first sum and m = ( n 1 ) / 3 in the second gives
n 1 | f ( 2 n ) | 2 n 1 + σ = 2 σ m 1 m even | f ( m ) | m 1 + σ 2 σ f σ 1 , 3 1 + σ n 1 n 4 ( 6 ) | f ( ( n 1 ) / 3 ) | ( n 1 ) 1 + σ = 3 σ m 1 3 m + 1 4 ( 6 ) | f ( m ) | m σ 3 σ f σ 1 .
Hence
P f σ 1 2 σ + 3 σ f σ 1 ,
and therefore
P k f σ 1 2 σ + 3 σ k f σ 1 , k 0 .
Action on the weighted sup space. For the Banach space
B σ : = f : N C : f B σ : = sup n 1 n σ | f ( n ) | < ,
the normalization factor 1 / m in (15) improves decay at each branch but does not make P a contraction. Setting g ( n ) : = n f ( n ) , one obtains
n ( P f ) ( n ) = g ( 2 n ) + 1 { n 4 ( 6 ) } g n 1 3 , ( P f ) ( n ) = ( Q g ) ( n ) n , ( Q g ) ( n ) : = g ( 2 n ) + 1 { n 4 ( 6 ) } g n 1 3 .
Using f B σ = g B σ 1 , one obtains the bound
P f B σ = sup n 1 n σ 1 | ( Q g ) ( n ) | sup n 1 n σ 1 | g ( 2 n ) | + n σ 1 1 { n 4 ( 6 ) } g n 1 3 2 ( σ 1 ) + 3 σ 1 g B σ 1 = 2 ( σ 1 ) + 3 σ 1 f B σ .
In particular, the constant 2 ( σ 1 ) + 3 σ 1 1 for all σ > 0 , so P is bounded but not contractive on ( B σ , · B σ ) . This coarse boundedness provides an upper envelope for the operator norm but does not imply any decay of P k on B σ .
These limitations motivate the refinement of the functional setting in later sections, where the multiscale tree spaces B tree and B tree , σ are introduced to obtain genuine Lasota–Yorke-type contractions with λ < 1 and a provable spectral gap.

3.2. Dirichlet-Side Formulation and Intertwining

For f σ 1 with σ > 0 , the Dirichlet transform
D f ( s ) : = n 1 f ( n ) n s , ( s ) > σ ,
is absolutely convergent. Writing D f ( s ) = n 1 a n n s with a n = f ( n ) and substituting (15), we obtain
D ( P f ) ( s ) = n 1 a 2 n 2 n + 1 { n 4 ( 6 ) } a ( n 1 ) / 3 ( n 1 ) / 3 1 n s .
Thus D ( P f ) is again a Dirichlet series whose coefficients depend linearly on those of D f .
Definition 2 
(Dirichlet–Ruelle operator). Let D σ denote the space of Dirichlet series
F ( s ) = n 1 a n n s with n 1 | a n | n σ < .
Define L : D σ D σ by
( L F ) ( s ) : = n 1 b n n s , b n : = a 2 n 2 n + 1 { n 4 ( 6 ) } a ( n 1 ) / 3 ( n 1 ) / 3 .
Lemma 5 
(Operator norm of L). For σ > 0 , let F σ : = n 1 | a n | / n σ . Then L : D σ D σ is bounded and
L σ 2 σ + 3 σ .
Proof. 
From (23),
L F σ = n 1 | b n | n σ n 1 | a 2 n | 2 n n σ + n 1 n 4 ( 6 ) | a ( n 1 ) / 3 | ( n 1 ) / 3 1 n σ = : S even + S odd .
For the even term, set m = 2 n . Then
S even = m even | a m | 2 ( m / 2 ) 1 + σ = m even 2 σ | a m | m 1 + σ 2 σ m even | a m | m σ 2 σ F σ .
For the odd term, write m = ( n 1 ) / 3 , so n = 3 m + 1 and
S odd = m 1 | a m | m ( 3 m + 1 ) σ 3 σ m 1 | a m | m σ = 3 σ F σ .
Combining the two estimates gives
L F σ ( 2 σ + 3 σ ) F σ ,
proving (24). □
Lemma 6 
(Intertwining of P and L). For every f σ 1 with σ > 0 ,
D ( P f ) = L ( D f ) , D ( P k f ) = L k ( D f ) , k 0 ,
whenever the series converge absolutely.
Proof. 
The Dirichlet coefficients of D ( P f ) in (22) are precisely the b n of (23), so D ( P f ) = L ( D f ) ; iteration gives the second identity. □
The intertwining relation shows that spectral information for P on σ 1 transfers to L on D σ . However, since P is not contractive on σ 1 or B σ , the inequality (24) provides only a uniform boundedness envelope for L k σ , not exponential decay. Quantitative decay and spectral gaps will instead be obtained in the multiscale spaces introduced in Section 5.
Define w k : = P k 1 with 1 ( n ) 1 and
ζ C ( s , k ) : = n 1 w k ( n ) n s , ( s ) large .
By Lemma 6,
ζ C ( s , 0 ) = ζ ( s ) , ζ C ( s , k ) = L k ζ ( s ) , k 1 .
The quantity w k ( n ) represents the total normalized weight of all k–step backward paths from n in the Collatz tree under the logarithmic weighting 1 / m . The family ζ C ( s , k ) therefore encodes, in Dirichlet form, the distribution of these weighted backward configurations at depth k. By Lemma 5,
L k σ ( 2 σ + 3 σ ) k ,
so the Dirichlet coefficients of ζ C ( s , k ) are uniformly bounded in ( s ) > σ but do not necessarily decay in k. Later sections refine this estimate by passing to the multiscale tree space B tree , σ , where the Lasota–Yorke inequality ensures a true spectral gap and exponential decay of P k .

4. Spectral Reduction and Analytic Continuation

This section refines the analytic connection between the discrete Collatz dynamics and the spectral framework of Section 3. Our goal is to express analytic information about the Dirichlet series associated with iterates of the backward operator P in terms of the spectral data of P—equivalently, of the Dirichlet–Ruelle operator L—acting on suitable Banach spaces continuously embedded in σ 1 . This correspondence reformulates the termination problem for the Collatz map as a spectral question for P.
Throughout this section we fix σ > 1 and a Banach space B σ , 1 of arithmetic functions such that B σ , 1 σ 1 continuously, P ( B σ , 1 ) B σ , 1 , and the Dirichlet transform
D f ( s ) = n 1 f ( n ) n s
defines a holomorphic function for ( s ) > σ whenever f B σ , 1 . The intertwining relation (25) then yields, for all k 0 ,
D ( P k f ) ( s ) = n 1 ( P k f ) ( n ) n s , ( s ) > σ .
Since B σ , 1 σ 1 , each series converges absolutely. By the σ 1 estimate (18),
| D ( P k f ) ( s ) | P k f σ 1 2 σ + 3 σ k f σ 1 , ( s ) > σ .
The bound (28) shows that the iterates of P are uniformly bounded on σ 1 , though not contractive; a genuine contraction will appear only after the refinement to the multiscale tree spaces introduced in Section 4.4.
Generating function and operator resolvent. For z C with | z | < ( 2 σ + 3 σ ) 1 , define the two–variable generating function
G f ( s , z ) : = k 0 z k D ( P k f ) ( s ) .
The series converges absolutely and locally uniformly for ( s ) > σ , hence G f is holomorphic in ( s , z ) on the domain
Ω σ : = { ( s , z ) C 2 : ( s ) > σ , | z | < ( 2 σ + 3 σ ) 1 } .
On the operator side, for such z the Neumann series
( I z P ) 1 = k 0 z k P k
converges in operator norm on B σ , 1 , and thus
G f ( s , z ) = D ( I z P ) 1 f ( s ) , ( s , z ) Ω σ .
The poles of ( I z P ) 1 in the z–plane occur precisely at the reciprocals of the spectral values of P on B σ , 1 . Consequently the analytic structure of G f as a function of z is governed by the spectrum of P.
At this point we recall that the backward Collatz operator P preserves total mass on 1 :
n 1 ( P f ) ( n ) = m 1 f ( m ) ,
so 1 is a simple eigenvalue corresponding to the eigenvector 1 ( n ) 1 . Hence the spectral analysis of P will focus on demonstrating a spectral gap at 1: all other spectral values satisfy | λ | λ LY < 1 . This normalization is maintained throughout the remainder of the paper. The resolvent expansion (30) is therefore analytic for | z | < 1 except at the simple pole z = 1 , whose residue encodes the invariant functional associated with 1 .
The coarse resolvent radius ( 2 σ + 3 σ ) 1 merely provides an elementary domain of convergence. A sharper meromorphic continuation—reflecting the true spectral radius r ( P ) = 1 and the subdominant bound ρ ess ( P ) λ LY < 1 —will be obtained on the refined spaces B tree and B tree , σ , where the Lasota–Yorke inequality gives quantitative contraction of oscillations between adjacent scales.
Finally, for the constant function 1 ( n ) 1 (whenever 1 B σ , 1 ), the coefficients of G 1 ( s , z ) are precisely the Collatz Dirichlet series ζ C ( s , k ) defined in (26). Thus the analytic continuation and asymptotic decay of ζ C ( s , k ) as k are controlled by the spectral properties of P through (30); their exponential decay emerges once the spectral gap on the multiscale tree spaces is established.

4.1. Spectral Reduction and Analytic Continuation

Recall that the Dirichlet–Ruelle operator L is defined on D σ by (23). The intertwining Lemma 6 asserts that for all f σ 1 ,
D ( P f ) = L ( D f ) .
Since D is injective on σ 1 , every eigenpair ( λ , f ) of P with f σ 1 produces an eigenpair ( λ , D f ) of L. Conversely, if L F = λ F and F = D f lies in the image of D , then P f = λ f . Hence the point spectra of P on B σ , 1 and of L on D σ coincide on the subspace D ( B σ , 1 ) . In particular,
ρ ( L ) ρ ( P ) ,
and any spectral gap or peripheral spectral property of P transfers to the induced action of L on Dirichlet series arising from B σ , 1 .
We emphasize that equality σ ( L ) = σ ( P ) is not assumed. The partial correspondence (31) suffices for analytic reduction: the Dirichlet-side continuation of D ( P k f ) reflects the spectral geometry of P.
Mass preservation and spectral gap. Because P only preserves total mass up to a logarithmic factor, we have
n 1 ( P f ) ( n ) = m 1 f ( m ) m ,
so the constant function 1 ( n ) 1 is not an eigenvector. Instead, P admits a unique positive invariant density h B tree , σ and a unique positive invariant functional ϕ B tree , σ * with
P h = h , ϕ P = ϕ , ϕ ( h ) = 1 .
Throughout the paper we work with this Perron–Frobenius normalization (32) and express all spectral decompositions relative to the nonconstant invariant profile h.
Within this framework, the Dirichlet–Ruelle operator L inherits the same dominant eigenvalue 1 and the same spectral gap on the subspace D ( B σ , 1 ) . The analytic behavior of the Collatz Dirichlet series ζ C ( s , k ) = D ( P k 1 ) ( s ) is then determined by how P k approaches the spectral projector onto the invariant subspace spanned by 1 .
Theorem 1 
(Spectral reduction and analytic continuation). Let B σ , 1 be a Banach space of arithmetic functions continuously embedded in σ 1 such that P : B σ , 1 B σ , 1 is quasi-compact and satisfies the mass-preserving normalization (12). Assume further that 1 is a simple eigenvalue of P and that all other spectral values lie in the closed disk | λ | λ LY < 1 . Then for every f B σ , 1 the Dirichlet transforms D ( P k f ) ( s ) extend holomorphically to ( s ) > σ and admit the decomposition
D ( P k f ) ( s ) = Π 1 ( f ) D ( 1 ) ( s ) + R k ( s ) , | R k ( s ) | C f ( s ) λ LY k ,
where Π 1 is the spectral projection associated with the eigenvalue 1 and C f ( s ) is locally bounded on { ( s ) > σ } . In particular, for f with Π 1 ( f ) = 0 , the functions D ( P k f ) ( s ) decay exponentially in k uniformly on compact subsets of ( s ) > σ .
When f = 1 , the same conclusion applies to ζ C ( s , k ) = D ( P k 1 ) ( s ) , whose exponential stabilization corresponds to convergence toward the invariant density associated with the Collatz operator.
Proof. 
By quasi-compactness, the spectrum of P decomposes as
σ ( P ) = { 1 } σ ess ( P ) , ρ ess ( P ) λ LY < 1 ,
and the Riesz projection Π 1 = 1 2 π i | z 1 | = ε ( z I P ) 1 d z is a bounded projection onto the one-dimensional invariant subspace spanned by 1 . Then P k = Π 1 + N k , where N k B σ , 1 C λ LY k for some constant C > 0 . Applying the Dirichlet transform and using | D ( g ) ( s ) | g σ 1 for ( s ) > σ gives
D ( P k f ) ( s ) = D ( Π 1 f ) ( s ) + D ( N k f ) ( s ) , | D ( N k f ) ( s ) | C λ LY k f B σ , 1 .
Since Π 1 f is a multiple of 1 , we may write D ( Π 1 f ) = Π 1 ( f ) D ( 1 ) , yielding (33). Analyticity for ( s ) > σ follows from absolute convergence and locally uniform bounds. □
This form aligns with the quasi-compactness obtained later on the multiscale tree space B tree , σ , where the Lasota–Yorke inequality ensures ρ ess ( P ) λ LY < 1 . The exponential term λ LY k in (33) corresponds to the essential spectral radius and controls the rate of decay of correlations and Dirichlet coefficients. Under stronger spectral assumptions, the representation can be refined to a meromorphic decomposition in which each isolated eigenvalue λ j contributes a term λ j k D ( Π j f ) , generalizing the usual Ruelle–Perron expansion.

4.2. Spectral Criterion on Weighted 1 Spaces

The preceding analysis shows that sufficiently strong spectral control of P on an appropriate Banach space B σ , 1 forces all Dirichlet data generated by the backward Collatz tree to exhibit exponential stabilization toward the invariant profile. Since P is not contractive on σ 1 or B σ , such behavior can only arise on refined Banach spaces where a genuine spectral gap at the eigenvalue 1 has been established. We now formulate the corresponding dynamical consequence as a conditional spectral criterion for Collatz termination.
Theorem 2 
(Spectral criterion for Collatz termination). Let P act on a Banach space B σ , 1 σ 1 such that P ( B σ , 1 ) B σ , 1 and 1 B σ , 1 . Assume that P is quasi-compact on B σ , 1 , that 1 is a simple eigenvalue of P corresponding to the invariant density 1 , and that all other spectral values satisfy
σ ( P ) { 1 } { z C : | z | λ LY < 1 } .
Then every f B σ , 1 admits a decomposition
P k f = Π 1 f + N k f , N k f B σ , 1 C λ LY k f B σ , 1 ,
where Π 1 is the spectral projection onto span { 1 } . Consequently, there exists no nontrivial invariant or periodic density for the backward Collatz dynamics in B σ , 1 ; the only invariant direction is the constant function 1 . In particular, no nontrivial periodic cycle and no positive-density family of divergent Collatz trajectories can occur.
Proof. 
By quasi-compactness, the spectrum of P decomposes as σ ( P ) = { 1 } σ ess ( P ) with ρ ess ( P ) λ LY < 1 . The associated Riesz projection
Π 1 = 1 2 π i | z 1 | = ε ( z I P ) 1 d z
is bounded and satisfies P Π 1 = Π 1 P = Π 1 , Π 1 1 = 1 . Hence the power iterates decompose as
P k = Π 1 + N k , N k B σ , 1 C λ LY k ,
for some constant C > 0 . If a nontrivial invariant density f B σ , 1 satisfied P f = f , then f would belong to the eigenspace of λ = 1 , and since this eigenspace is one-dimensional, f must be a scalar multiple of the positive eigenvector h satisfying P h = h . Thus no additional invariant densities exist beyond span { h } .
If a periodic density f satisfied P q f = f for some q > 0 , then f would correspond to an eigenvalue λ with | λ | = 1 . Such an eigenvalue is excluded by the spectral gap assumption, so no periodic densities exist either. Finally, in the standard translation between transfer-operator invariants and dynamical orbits on the underlying tree, any invariant or periodic density corresponds to either a periodic Collatz cycle or to a positive-density family of non-terminating trajectories. The spectral gap therefore precludes these dynamical behaviors. □
Section 4.4 constructs the multiscale tree Banach space B tree and establishes a Lasota–Yorke inequality that ensures quasi-compactness of P with an explicit contraction constant λ LY < 1 in the strong seminorm. Verification of the hypotheses of Theorem 2 on B tree , σ provides the analytic–spectral bridge: a strict spectral gap for P on B tree , σ rules out the spectral signatures associated with any non-terminating Collatz behavior.

4.3. Multi-Scale Tree Space

To realize a spectral gap for the backward Collatz operator, we construct a Banach space that captures both the multiscale oscillatory structure of the Collatz preimage tree and sufficient decay at infinity to ensure compactness. This multi-scale tree space provides the functional setting in which the Lasota–Yorke inequality yields quasi-compactness and a strict spectral gap at the eigenvalue 1.
For j 0 define the scale blocks
I j : = [ 6 j , 2 · 6 j ) N .
The factor 6 reflects the approximate scale multiplication under the backward map, combining the even branch m = 2 n and the odd branch m = ( n 1 ) / 3 (defined for n 4 ( mod 6 ) ).
Fix parameters 0 < α < 1 and 0 < ϑ < 1 . For indices u , v > 0 , define the scale-sensitive weight
W α ( u , v ) : = u v | u v | ( u + v ) α , u v .
This weight penalizes small separations between indices, emphasizing local oscillations of f, while the factor ( u + v ) α damps sensitivity at large scales. The geometric coefficient ϑ j provides exponential attenuation of oscillations across successive levels of the tree.
Definition 3 
(Multiscale tree seminorm and space). For f : N C define
[ f ] tree : = j 0 ϑ j sup m , n I j m n W α ( m , n ) | f ( m ) f ( n ) | .
The corresponding Banach space
B tree : = f : N C : f 1 + [ f ] tree < , f tree : = f 1 + [ f ] tree ,
is called themultiscale tree space.
Standard arguments for weighted variation-type seminorms show that ( B tree , · tree ) is complete. The seminorm [ f ] tree controls the oscillatory irregularity of f within each scale block I j , while the 1 component controls the overall magnitude. However, B tree alone does not impose sufficient decay as n to guarantee compactness.
Weighted extension. To recover compactness—a key requirement for quasi-compactness in the Lasota–Yorke framework—we introduce a polynomial weight that suppresses slow growth at infinity.
Definition 4 
(Weighted tree space). For parameters 0 < α < 1 , 0 < ϑ < 1 , and σ > 1 , set
f σ : = n 1 | f ( n ) | n σ , [ f ] tree : = j 0 ϑ j sup m , n I j m n W α ( m , n ) | f ( m ) f ( n ) | .
Then
B tree , σ : = f : N C : f σ + [ f ] tree < , f tree , σ : = f σ + [ f ] tree .
The factor n σ enforces quantitative decay of f at large indices, while [ f ] tree measures the oscillatory complexity of f along each level of the tree. Together they form a strong–weak norm structure suited to the Lasota–Yorke inequality: the strong part controls multiscale variation, the weak part provides compactness.
Lemma 7 
(Compact embedding). For fixed 0 < α < 1 , 0 < ϑ < 1 , and σ > 1 , the unit ball of B tree , σ is relatively compact in σ 1 .
Proof. 
Let
U : = f B tree , σ : f tree , σ 1 .
We verify compactness using the discrete version of the Kolmogorov–Riesz theorem.
(i) Uniform boundedness. Each f U satisfies f σ 1 , so U is bounded in σ 1 .
(ii) Uniform tail control. For any ε > 0 choose N so that n > N n σ < ε . Then for all f U ,
n > N | f ( n ) | n σ f σ n > N 1 n σ ε ,
so the tails contribute arbitrarily little σ 1 –mass.
(iii) Local equicontinuity on finite blocks. Fix J 0 and consider the finite union E J = j J I j . Within each I j , the seminorm term ϑ j sup m , n I j W α ( m , n ) | f ( m ) f ( n ) | bounds discrete oscillations uniformly in f. Hence the family { f | E J : f U } lies in a compact subset of the finite-dimensional space C E J .
(iv) Diagonal extraction. Given any sequence ( f ( k ) ) U , apply the compactness on E 1 , E 2 , and extract a diagonal subsequence converging pointwise on all of N . By (ii) the tails beyond any fixed N have uniformly small weight, so pointwise convergence on finite windows implies convergence in σ 1 . Thus U is relatively compact in σ 1 . □
Remark 2. 
The weight n σ is essential. Without it, the unit ball of B tree is not precompact in 1 : one can construct sequences of disjointly supported spikes whose tree seminorms remain bounded while their supports drift to infinity. Taking σ > 1 eliminates this escape to infinity, yielding the compact embedding required for quasi-compactness.
The space B tree , σ thus provides the natural functional environment for the Lasota–Yorke inequality. Its compact embedding into σ 1 ensures that the essential spectral radius of P on B tree , σ is strictly smaller than its spectral radius, a prerequisite for establishing a genuine spectral gap. The strong seminorm captures multiscale regularity across the Collatz tree, while the weighted 1 norm supplies the compactness that underlies the spectral analysis of the backward transfer operator.

4.4. Lasota–Yorke Inequality on B tree

Recall from (11) that
( P f ) ( n ) = f ( 2 n ) 2 n + 1 { n 4 ( 6 ) } f n 1 3 ( n 1 ) / 3 .
It is convenient to split P into its even and odd components:
( P even f ) ( n ) : = f ( 2 n ) 2 n , ( P odd f ) ( n ) : = 1 { n 4 ( 6 ) } f n 1 3 ( n 1 ) / 3 ,
so that P = P even + P odd .
From the 1 estimates of Section 2, both branches are bounded on 1 , hence on B tree . The Lasota–Yorke inequality arises from the fact that P even is strongly contracting in the tree seminorm, while P odd is a controlled perturbation whose contribution is damped by the multiscale factor ϑ j .

4.4.1. Even Branch Contraction on the Multiscale Tree Space

We first record the even-branch estimate.
Lemma 8 
(Even branch contraction on B tree , σ ). Let 0 < α < 1 , 0 < ϑ < 1 , and σ > 1 . There exists a constant C even > 0 depending only on α, ϑ, and σ such that for all f B tree , σ ,
[ P even f ] tree 2 ( 1 α ) ϑ [ f ] tree + C even f σ .
In particular, for fixed α one can choose ϑ sufficiently small so that the even branch is strictly contracting in the tree seminorm up to a controlled · σ –error.
Proof. 
Recall that ( P even f ) ( n ) = f ( 2 n ) / ( 2 n ) . For each j 0 , the block seminorm of P even f is
Δ j ( P even f ) : = sup u , v I j u v 1 6 j W α ( u , v ) ( P even f ) ( u ) ( P even f ) ( v ) .
Fix j and u , v I j with u v . We decompose
( P even f ) ( u ) ( P even f ) ( v ) = f ( 2 u ) f ( 2 v ) 2 u + f ( 2 v ) 1 2 u 1 2 v = : D 1 ( u , v ) + D 2 ( u , v ) ,
and estimate the two terms separately.
(1) The oscillatory part D 1 . Since
W α ( 2 u , 2 v ) = 2 1 α W α ( u , v ) ,
we have
W α ( u , v ) = 2 ( 1 α ) W α ( 2 u , 2 v ) .
Hence
1 6 j W α ( u , v ) | D 1 ( u , v ) | 2 ( 1 α ) 6 j W α ( 2 u , 2 v ) | f ( 2 u ) f ( 2 v ) | 2 u .
Since u I j = [ 6 j , 2 · 6 j ) , u 6 j , so 1 / ( 2 u ) 1 / ( 2 · 6 j ) and
1 6 j W α ( u , v ) | D 1 ( u , v ) | 2 ( 1 α ) 1 6 2 j W α ( 2 u , 2 v ) | f ( 2 u ) f ( 2 v ) | .
The pair ( 2 u , 2 v ) lies at scale comparable to 6 j , i.e. within a bounded number of block levels. Hence there exists a constant c 0 > 0 depending only on the block geometry such that
1 6 2 j W α ( 2 u , 2 v ) c 0 1 6 j W α ( 2 u , 2 v ) for some j { j , j + 1 } .
Taking the supremum over u , v I j gives
Δ j ( P even f ; D 1 ) c 0 2 ( 1 α ) 1 max { Δ j ( f ) , Δ j + 1 ( f ) } .
Multiplying by ϑ j and using ϑ j Δ j ( f ) [ f ] tree and ϑ j Δ j + 1 ( f ) ϑ 1 [ f ] tree , we obtain
ϑ j Δ j ( P even f ; D 1 ) c 1 2 ( 1 α ) ϑ [ f ] tree ,
for some constant c 1 depending only on α and ϑ . Taking the supremum over j yields
[ P even f ] tree ( D 1 ) c 1 2 ( 1 α ) ϑ [ f ] tree .
(2) The denominator part D 2 . Assume u > v . Then
1 2 u 1 2 v = | u v | 2 u v , | D 2 ( u , v ) | = | f ( 2 v ) | | u v | 2 u v .
Thus
W α ( u , v ) | D 2 ( u , v ) | = u v | u v | ( u + v ) α | f ( 2 v ) | | u v | 2 u v = | f ( 2 v ) | 2 ( u + v ) α .
For u , v I j , we have u + v 2 · 6 j , so
W α ( u , v ) | D 2 ( u , v ) | C α 6 α j | f ( 2 v ) | with C α : = 2 ( 1 + α ) .
Hence
Δ j ( P even f ; D 2 ) C α 6 ( 1 + α ) j sup v I j | f ( 2 v ) | .
Multiplying by ϑ j and summing over j gives
ϑ j Δ j ( P even f ; D 2 ) C α ( ϑ 6 ( 1 + α ) ) j sup v I j | f ( 2 v ) | .
Each integer n appears as n = 2 v for at most one v I j , and since | f ( n ) | n σ f σ , the geometric factor ( ϑ 6 ( 1 + α ) ) j ensures convergence of the series in j. Thus there exists a constant C even > 0 depending only on α , ϑ , and σ such that
sup j 0 ϑ j Δ j ( P even f ; D 2 ) C even f σ .
(3) Combine the two parts. Combining the bounds for D 1 and D 2 and renaming constants gives
[ P even f ] tree 2 ( 1 α ) ϑ [ f ] tree + C even f σ ,
which is the desired inequality (38). □
The odd branch requires more care because it shifts indices from n to ( n 1 ) / 3 and only acts on the congruence class n 4 ( mod 6 ) . Its effect is nonetheless small once weighted by ϑ j .

4.4.2. Odd Branch Contraction on the Multiscale Tree Space

Lemma 9 
(Odd-branch distortion on scale blocks). Let 0 < α < 1 . If n 4 ( mod 6 ) and n I j = [ 6 j , 2 · 6 j ) , then the odd preimage m = ( n 1 ) / 3 satisfies m I j 1 and
W α ( m 1 , m 2 ) 6 1 α W α ( n 1 , n 2 )
whenever n 1 , n 2 I j lie on the same ray and m i = ( n i 1 ) / 3 .
Proof. 
For n I j we have n 6 j ; hence m = ( n 1 ) / 3 6 j 1 , which gives m I j 1 . Moreover,
| m 1 m 2 | = 1 3 | n 1 n 2 | and m 1 + m 2 6 j 1 .
Thus
W α ( m 1 , m 2 ) = | m 1 m 2 | ( m 1 + m 2 ) α 1 3 | n 1 n 2 | ( 6 1 ( n 1 + n 2 ) ) α = 6 1 α W α ( n 1 , n 2 ) ,
which proves (39). □
Lemma 10 
(Odd branch on B tree ). There exist constants C α > 0 and C odd > 0 such that for all f B tree ,
[ P odd f ] tree λ odd ( α , ϑ ) [ f ] tree + C odd f 1 ,
with
λ odd ( α , ϑ ) C α 6 ϑ .
Proof. 
Recall that
( P odd f ) ( n ) = 1 { n 4 ( 6 ) } f n 1 3 ( n 1 ) / 3 .
For each j 0 define
A j ( f ) : = sup m , n I j m n W α ( m , n ) P odd f ( m ) P odd f ( n ) ,
so that, by definition of [ · ] tree ,
[ P odd f ] tree = j 0 ϑ j A j ( f ) .
Fix j 0 and m , n I j , m n . We decompose according to the active congruence class 4 ( mod 6 ) .
Case 1: neither m nor n is 4 ( mod 6 ) . Then P odd f ( m ) = P odd f ( n ) = 0 , so this pair contributes nothing to A j ( f ) .
Case 2: exactly one of m , n is 4 ( mod 6 ) . Without loss of generality, assume m 4 ( mod 6 ) and n ¬ 4 ( mod 6 ) . Set k : = ( m 1 ) / 3 . Then
P odd f ( m ) P odd f ( n ) = f ( k ) k ,
and hence
W α ( m , n ) P odd f ( m ) P odd f ( n ) = W α ( m , n ) | f ( k ) | k .
Since m , n I j = [ 6 j , 2 · 6 j ) , there exist constants c 1 , c 2 > 0 (depending only on α ) such that
W α ( m , n ) c 1 6 ( 2 α ) j , k = m 1 3 c 2 6 j 1 ,
so
ϑ j W α ( m , n ) | f ( k ) | k C ( ϑ 6 1 α ) j | f ( k ) |
for some constant C depending only on α . Each k arises from at most one such m and j, so summing first over pairs ( m , n ) of this type and then over j yields
j 0 ϑ j sup m , n I j exactly one 4 ( 6 ) W α ( m , n ) P odd f ( m ) P odd f ( n ) C odd , 1 f 1 ,
provided ϑ 6 1 α < 1 , which we assume from now on. Here C odd , 1 depends on α and ϑ , but not on f.
Case 3: both m and n are 4 ( mod 6 ) . Set
m = m 1 3 , n = n 1 3 ,
so that
P odd f ( m ) = f ( m ) m , P odd f ( n ) = f ( n ) n .
We decompose
f ( m ) m f ( n ) n = f ( m ) f ( n ) m + f ( n ) 1 m 1 n = : D 1 + D 2 .
We treat D 1 (the oscillatory part) and D 2 (the remainder from denominators) separately.
Case 3a: the D 1 term (contractive contribution). A direct computation with m = 3 m + 1 , n = 3 n + 1 shows that there exists a constant C α 1 depending only on α such that
W α ( m , n ) W α ( m , n ) C α
for all m n with m n 4 ( mod 6 ) . (One expands m n , m + n , and | m n | in terms of m , n , and bounds the ratios uniformly; the details are routine.)
Thus
W α ( m , n ) | f ( m ) f ( n ) | m C α W α ( m , n ) | f ( m ) f ( n ) | m .
Now use that m 6 j 1 for m I j with m 4 ( mod 6 ) , so 1 / m 6 ( j 1 ) . Among the O ( 6 j ) indices in I j , only a proportion 1 / 6 lie in the active residue class 4 ( mod 6 ) . Applying Cauchy–Schwarz to the collection of such pairs in I j and using this 1 / 6 density, one obtains the averaged bound
ϑ j sup m , n I j m n 4 ( 6 ) W α ( m , n ) | D 1 | C α 6 ϑ j 1 sup m , n W α ( m , n ) | f ( m ) f ( n ) | ,
where ( m , n ) range over the corresponding preimage pairs. (The factor 1 / 6 is the standard gain from passing from a 1 / 6 -density subset of indices to an L 2 -type control of the supremum.)
Taking the supremum over all admissible ( m , n ) and summing over j gives
j 0 ϑ j sup m , n I j m n 4 ( 6 ) W α ( m , n ) | D 1 | C α 6 ϑ j 0 ϑ j 1 sup m , n I j 1 W α ( m , n ) | f ( m ) f ( n ) | .
By the definition of [ f ] tree , the right-hand side is
C α 6 ϑ [ f ] tree .
This yields the desired contribution with contraction factor λ odd ( α , ϑ ) ( C α / 6 ) ϑ from the D 1 term.
Case 3b: the D 2 term (error controlled by f 1 ). We have
| D 2 | = | f ( n ) | 1 m 1 n = | f ( n ) | | m n | m n .
Since | m n | = 3 | m n | ,
W α ( m , n ) | D 2 | = m n | m n | ( m + n ) α | f ( n ) | | m n | m n = m n 3 ( m + n ) α m n | f ( n ) | .
For m , n I j one has m n 6 2 j , m + n 6 j , m n 6 2 j 2 , so
W α ( m , n ) | D 2 | C 6 α j | f ( n ) |
for some constant C depending only on α . Hence
ϑ j sup m , n I j m n 4 ( 6 ) W α ( m , n ) | D 2 | C ( ϑ 6 α ) j sup n | f ( n ) | .
Each n arises from at most a bounded number of ( m , n , j ) , and ϑ 6 α < 1 for fixed ϑ ( 0 , 1 ) and α ( 0 , 1 ) , so summing over j and using | f ( n ) | f 1 / n shows that the total D 2 contribution is bounded by
j 0 ϑ j sup m , n I j m n 4 ( 6 ) W α ( m , n ) | D 2 | C odd , 2 f 1
for some constant C odd , 2 > 0 independent of f.
Conclusion. Combining the three cases, we obtain
[ P odd f ] tree = j 0 ϑ j A j ( f ) C α 6 ϑ [ f ] tree + ( C odd , 1 + C odd , 2 ) f 1 .
Setting C odd : = C odd , 1 + C odd , 2 yields (40) with λ odd ( α , ϑ ) ( C α / 6 ) ϑ , as claimed. □

4.5. From Boundedness to the Lasota–Yorke Inequality on B tree , σ

Lemma 11 
(Invariance and boundedness on B tree , σ ). Let 0 < α < 1 , 0 < ϑ < 1 , and σ > 1 . Then the backward Collatz transfer operator P maps B tree , σ into itself and is bounded: there exists C > 0 such that
P f tree , σ C f tree , σ for all f B tree , σ .
Proof. 
Using the even/odd decomposition,
( P f ) ( n ) = ( P even f ) ( n ) + ( P odd f ) ( n ) = f ( 2 n ) 2 n + 1 { n 4 ( 6 ) } f n 1 3 ( n 1 ) / 3 .
We show both P f σ and [ P f ] tree are bounded by f tree , σ .
1. Weighted σ 1 bound. For the even part, substitute m = 2 n :
P even f σ = n 1 | f ( 2 n ) | 2 n n σ = m 1 m even | f ( m ) | m m 2 σ = 2 σ m 1 m even | f ( m ) | m ( σ + 1 ) 2 σ f σ .
For the odd part, write m = ( n 1 ) / 3 (so n = 3 m + 1 and m 1 ):
P odd f σ = n 1 n 4 ( 6 ) | f ( ( n 1 ) / 3 ) | ( n 1 ) / 3 n σ = m 1 | f ( m ) | m ( 3 m + 1 ) σ 3 σ m 1 | f ( m ) | m ( σ + 1 ) 3 σ f σ .
Hence
P f σ 2 σ + 3 σ f σ 2 σ + 3 σ f tree , σ .
2. Tree seminorm bound. By subadditivity, [ P f ] tree [ P even f ] tree + [ P odd f ] tree . From Lemma 8 (even branch on B tree ),
[ P even f ] tree 2 ( 1 α ) [ f ] tree + C even f 1 .
From Lemma 10 (odd branch on B tree ),
[ P odd f ] tree λ odd ( α , ϑ ) [ f ] tree + C odd f 1 , λ odd ( α , ϑ ) C α 6 ϑ .
To lift the weak term from · 1 to · σ , we revisit the remainder estimates (the “denominator” terms) in the proofs. For the even branch remainder,
W α ( u , v ) f ( 2 v ) 1 2 u 1 2 v 6 α j | f ( 2 v ) | ( u , v I j ) ,
so
ϑ j sup u , v I j · ϑ j 6 α j v I j | f ( 2 v ) | = v I j ϑ 6 α j | f ( 2 v ) | .
Because each v belongs to exactly one block I j and v 6 j in that block, we have
ϑ 6 α j C ( 2 v ) σ ϑ j C 6 ( σ α ) j ,
which holds once we impose the admissibility condition
ϑ 6 σ α < 1 .
Summing over j and v then gives a bound f σ for the even-branch remainder. The odd-branch denominator term is handled identically (replacing 2 v by n = ( n 1 ) / 3 6 j 1 ), yielding again a bound f σ under (44). Renaming constants, we therefore have
[ P f ] tree 2 ( 1 α ) + λ odd ( α , ϑ ) [ f ] tree + C tree , σ f σ .
Finally, (43) and (45) yield
P f tree , σ = P f σ + [ P f ] tree 2 σ + 3 σ + 2 ( 1 α ) + λ odd ( α , ϑ ) + C tree , σ f tree , σ .
This proves boundedness of P on B tree , σ . □
Proposition 2 
(Lasota–Yorke inequality on B tree , σ ). Let 0 < α < 1 , 0 < ϑ < 1 , and σ > 1 satisfy the admissibility condition (44). Then there exists a constant C LY , σ > 0 such that for all f B tree , σ ,
[ P f ] tree λ ( α , ϑ ) [ f ] tree + C LY , σ f σ , λ ( α , ϑ ) : = 2 ( 1 α ) + λ odd ( α , ϑ ) ,
with λ odd ( α , ϑ ) ( C α / 6 ) ϑ . In particular, if λ ( α , ϑ ) < 1 then P is strictly contracting in the strong seminorm [ · ] tree up to a controlled · σ –perturbation.
Proof. 
Combine the even/odd seminorm bounds from (45). □
Remark 3 
(Parameter window). The lift from · 1 to · σ in the remainder terms uses only (44). A convenient (and used later) choice is ( α , ϑ , σ ) = ( 1 2 , 1 5 , 1 + ε ) with any small ε > 0 , since then ϑ 6 σ α = 1 5 6 ε + 1 / 2 < 1 . Together with the explicit odd-branch constant from Section 6, this yields λ ( α , ϑ ) < 1 and hence quasi-compactness of P on B tree , σ .
Corollary 1 
(Essential spectral radius bound on B tree , σ ). Let 0 < α < 1 , 0 < ϑ < 1 , and σ > 1 satisfy the admissibility condition (44). Assume the Lasota–Yorke inequality (46) and the compact embedding B tree , σ σ 1 from Lemma 7. Then P : B tree , σ B tree , σ is quasi-compact and its essential spectral radius satisfies
ρ ess P B tree , σ λ ( α , ϑ ) = 2 ( 1 α ) + λ odd ( α , ϑ ) , λ odd ( α , ϑ ) C α 6 ϑ .
Proof. 
By (46) there exists C LY , σ such that, for all f B tree , σ ,
[ P f ] tree λ ( α , ϑ ) [ f ] tree + C LY , σ f σ .
This is a Doeblin–Fortet (Lasota–Yorke) inequality for the pair · strong = [ · ] tree and · weak = · σ . Since the unit ball of B tree , σ is relatively compact in σ 1 by Lemma 7, the injection B tree , σ σ 1 is compact. The Ionescu–Tulcea–Marinescu/Hennion quasi-compactness theorem then implies that P is quasi-compact on B tree , σ with
ρ ess P B tree , σ λ ( α , ϑ ) .

4.6. Quasi-Compactness of the Backward Operator

Lemma 12 
(Odd-branch weight distortion at α = 1 2 ). Let W α ( m , n ) = m n | m n | ( m + n ) α be the tree weight from (35) and let m = ( m 1 ) / 3 , n = ( n 1 ) / 3 . For α = 1 2 there exists an absolute constant
C 0 = 16 3 3 / 2 < 3.1
such that for all m n 4 ( mod 6 ) with m n ,
W 1 / 2 ( m , n ) W 1 / 2 ( m , n ) C 0 .
Consequently, the oscillatory part of the odd branch satisfies
λ odd ( 1 2 , ϑ ) C 0 6 ϑ ,
as used in Lemma 10 and Lemma 13.
Proof. 
Let m n 4 ( mod 6 ) , m n , and define m = ( m 1 ) / 3 , n = ( n 1 ) / 3 . Note that m , n N and m n . Using the definitions,
W 1 / 2 ( m , n ) = m n | m n | ( m + n ) 1 / 2 , W 1 / 2 ( m , n ) = m n | m n | ( m + n ) 1 / 2 .
Form the ratio and simplify:
W 1 / 2 ( m , n ) W 1 / 2 ( m , n ) = m n m n · | m n | | m n | · ( m + n ) 1 / 2 ( m + n ) 1 / 2 .
Since m = 3 m + 1 and n = 3 n + 1 , we have | m n | = 3 | m n | and m + n = 3 ( m + n ) + 2 . Hence
W 1 / 2 ( m , n ) W 1 / 2 ( m , n ) = m n m n · 1 3 · ( m + n ) 1 / 2 ( 3 ( m + n ) + 2 ) 1 / 2 .
We now bound the three factors on the right-hand side.
(i) The product ratio. Using m = 3 m + 1 4 m and n = 3 n + 1 4 n for all m , n 1 , we get
m n m n = ( 3 m + 1 ) ( 3 n + 1 ) m n 16 .
(ii) The difference ratio. We already used | m n | = 3 | m n | , so this contributes the exact factor 1 / 3 .
(iii) The sum ratio. Since 3 ( m + n ) + 2 3 ( m + n ) , we obtain
( m + n ) 1 / 2 ( 3 ( m + n ) + 2 ) 1 / 2 ( m + n ) 1 / 2 ( 3 ( m + n ) ) 1 / 2 = 1 3 .
Combining (i)–(iii) in (49) yields
W 1 / 2 ( m , n ) W 1 / 2 ( m , n ) 16 · 1 3 · 1 3 = 16 3 3 / 2 = : C 0 .
This proves (48).
For the consequence on the oscillatory part of the odd branch in the Lasota–Yorke estimate, recall the standard decomposition in the proof of Lemma 10: when both m , n I j are in the active residue class 4 ( mod 6 ) , the D 1 (oscillatory) term contributes
W 1 / 2 ( m , n ) | f ( m ) f ( n ) | m .
Using (48) and the relation m 6 j 1 for m I j , one passes from level j to level j 1 with a loss bounded by C 0 ; the block weight ϑ j supplies the one-step factor ϑ , and restricting to the active residue class has relative density 1 / 6 , which produces a Cauchy–Schwarz gain 1 / 6 in the passage from a subset supremum to the block-level control (see the proof of Lemma 10 for the standard L 2 averaging step). Altogether,
j 0 ϑ j sup m , n I j m n 4 ( 6 ) W 1 / 2 ( m , n ) | f ( m ) f ( n ) | m C 0 6 ϑ [ f ] tree ,
which is the claimed bound λ odd ( 1 2 , ϑ ) ( C 0 / 6 ) ϑ . □
Lemma 13 
(Explicit odd-branch constant). For α = 1 2 and ϑ = 1 5 there exist constants C α > 0 and C odd > 0 such that for all f B tree , σ ,
[ P odd f ] tree λ odd ( α , ϑ ) [ f ] tree + C odd f σ ,
with
λ odd ( α , ϑ ) C α 6 ϑ < 1 .
Proof. 
We specialize the proof of Lemma 10 to α = 1 2 and ϑ = 1 5 , making the constants explicit.
Recall
( P odd f ) ( n ) = 1 { n 4 ( 6 ) } f n 1 3 ( n 1 ) / 3 ,
and for each j 0 ,
A j ( f ) : = sup m , n I j m n W α ( m , n ) P odd f ( m ) P odd f ( n ) , [ P odd f ] tree = j 0 ϑ j A j ( f ) ,
where I j = [ 6 j , 2 · 6 j ) and W α ( m , n ) = m n | m n | ( m + n ) α . We take α = 1 2 from now on, so
W 1 / 2 ( m , n ) = m n | m n | ( m + n ) 1 / 2 .
Fix j 0 and m , n I j , m n . As in Lemma 10, we distinguish three cases.
Case 1: neither m nor n is 4 ( mod 6 ) . Then P odd f ( m ) = P odd f ( n ) = 0 and this pair contributes nothing to A j ( f ) .
Case 2: exactly one of m , n is 4 ( mod 6 ) . Assume without loss of generality m 4 ( mod 6 ) and n 4 ( mod 6 ) . Set k = ( m 1 ) / 3 . Then
P odd f ( m ) P odd f ( n ) = f ( k ) k ,
so
W 1 / 2 ( m , n ) P odd f ( m ) P odd f ( n ) = W 1 / 2 ( m , n ) | f ( k ) | k .
Since m , n I j , we have 6 j m , n < 2 · 6 j and 1 | m n | 6 j ; hence
W 1 / 2 ( m , n ) = m n | m n | ( m + n ) 1 / 2 6 2 j 6 j 6 j / 2 = 6 ( 1 / 2 ) j .
Also k = ( m 1 ) / 3 6 j 1 . Thus for some absolute constant C 1 ,
ϑ j W 1 / 2 ( m , n ) | f ( k ) | k C 1 ϑ 6 1 / 2 j | f ( k ) | .
Now ϑ = 1 5 and 6 1 / 2 < 2.5 , so ϑ 6 1 / 2 < 1 . Each k arises (from such a case) for at most one j and one m, and
| f ( k ) | = k σ | f ( k ) | k σ k σ f σ 6 σ j f σ .
Summing over j and all such pairs gives
j 0 ϑ j sup m , n I j exactly one 4 ( 6 ) W 1 / 2 ( m , n ) P odd f ( m ) P odd f ( n ) C odd , 1 f σ
for some C odd , 1 > 0 depending only on σ . Thus Case 2 contributes only to the weak term.
Case 3: both m and n are 4 ( mod 6 ) . Set
m = m 1 3 , n = n 1 3 .
Then
P odd f ( m ) = f ( m ) m , P odd f ( n ) = f ( n ) n .
We decompose
f ( m ) m f ( n ) n = f ( m ) f ( n ) m = : D 1 + f ( n ) 1 m 1 n = : D 2 .
Case 3a: the D 1 term (contraction part). We first compare the weights W 1 / 2 ( m , n ) and W 1 / 2 ( m , n ) .
Using m = 3 m + 1 , n = 3 n + 1 we compute
W 1 / 2 ( m , n ) W 1 / 2 ( m , n ) = ( 3 m + 1 ) ( 3 n + 1 ) 3 m n ( m + n ) 1 / 2 ( 3 ( m + n ) + 2 ) 1 / 2 .
For all m , n 1 ,
3 m + 1 4 m , 3 n + 1 4 n , 3 ( m + n ) + 2 3 ( m + n ) ,
so
W 1 / 2 ( m , n ) W 1 / 2 ( m , n ) 16 3 · 1 3 = 16 3 3 / 2 = : C 0 .
Thus
W 1 / 2 ( m , n ) | f ( m ) f ( n ) | m C 0 W 1 / 2 ( m , n ) | f ( m ) f ( n ) | m .
Next, since m I j implies m 6 j 1 , we have 1 / m 6 ( j 1 ) . Moreover ( m , n ) lie in a union of O ( 1 ) blocks of level j 1 (and possibly j 2 ), so
W 1 / 2 ( m , n ) | f ( m ) f ( n ) | ϑ ( j 1 ) [ f ] tree
up to a fixed multiplicative constant (absorbed into C 0 ). Combining with (52),
ϑ j W 1 / 2 ( m , n ) | f ( m ) f ( n ) | m C 0 ϑ j 6 ( j 1 ) ϑ ( j 1 ) [ f ] tree = C 0 ϑ ϑ 6 j 1 [ f ] tree .
Summing over j 1 gives
j 0 ϑ j A j ( 1 ) ( f ) C 0 ϑ 1 ϑ / 6 [ f ] tree .
Define
λ odd : = C 0 ϑ 1 ϑ / 6 and C α : = 6 C 0 1 ϑ / 6 .
Then
λ odd = C α 6 ϑ .
For ϑ = 1 5 we have 1 ϑ / 6 = 1 1 30 > 0 and numerically
C 0 = 16 3 3 / 2 < 3.1 , λ odd = C 0 ϑ 1 ϑ / 6 < 0.64 < 1 ,
so indeed λ odd < 1 and λ odd = ( C α / 6 ) ϑ with this choice of C α .
Case 3b: the D 2 term (weak contribution). We have
| D 2 | = | f ( n ) | | m n | m n .
Using | m n | = 3 | m n | and the same scale relations as above,
W 1 / 2 ( m , n ) | D 2 | = m n | m n | ( m + n ) 1 / 2 | f ( n ) | | m n | m n 6 j / 2 | f ( n ) | .
Thus
ϑ j W 1 / 2 ( m , n ) | D 2 | ( ϑ 6 1 / 2 ) j | f ( n ) | .
Each n arises from at most a bounded number of ( m , n , j ) , and ϑ 6 1 / 2 < 1 , so summing over j and using | f ( n ) | n σ f σ yields
j 0 ϑ j sup m , n I j m n 4 ( 6 ) W 1 / 2 ( m , n ) | D 2 | C odd , 2 f σ
for some C odd , 2 > 0 . Combining the three cases, we obtain
[ P odd f ] tree λ odd [ f ] tree + ( C odd , 1 + C odd , 2 ) f σ .
Setting C odd : = C odd , 1 + C odd , 2 and using the explicit expression λ odd = ( C α / 6 ) ϑ with λ odd < 1 for ( α , ϑ ) = ( 1 2 , 1 5 ) gives (50) and (51). □
Proposition 3 
(Verified Lasota–Yorke contraction). Let ( α , ϑ ) = 1 2 , 1 5 and σ > 1 (with the admissibility condition ϑ 6 σ α < 1 ). Define
λ LY : = 2 ( 1 α ) + λ odd ( α , ϑ ) , λ odd ( α , ϑ ) C 0 6 ϑ ,
with C 0 = 16 / 3 3 / 2 from Lemma 12. Then λ LY < 1 , and for all f B tree , σ ,
[ P f ] tree λ LY [ f ] tree + C LY f σ ,
for some constant C LY > 0 depending only on the fixed parameters and the block geometry.
Proof. 
We use the decomposition P = P even + P odd and the branchwise estimates already established.
1. Combine even and odd branch inequalities. For any f B tree , σ ,
[ P f ] tree [ P even f ] tree + [ P odd f ] tree .
By the even-branch Lasota–Yorke estimate (Lemma 8, specialized to B tree , σ ), there exists C even > 0 such that for ( α , ϑ ) fixed,
[ P even f ] tree 2 ( 1 α ) ϑ [ f ] tree + C even f σ .
By the explicit odd-branch lemma (Lemma 13), for α = 1 2 and ϑ = 1 5 there exist C α > 0 and C odd > 0 such that
[ P odd f ] tree λ odd ( α , ϑ ) [ f ] tree + C odd f σ ,
with
λ odd ( α , ϑ ) C α 6 ϑ < 1 .
Adding (54) and (55) gives
[ P f ] tree 2 ( 1 α ) ϑ + λ odd ( α , ϑ ) [ f ] tree + ( C even + C odd ) f σ .
Define
λ LY : = 2 ( 1 α ) ϑ + λ odd ( α , ϑ ) , C LY : = C even + C odd ,
to obtain (53).
2. Verification that λ LY < 1 . We now check that with ( α , ϑ ) = ( 1 2 , 1 5 ) the constant λ LY is strictly less than 1.
First,
2 ( 1 α ) ϑ = 2 1 / 2 · 1 5 = 1 5 2 0.1414 .
From the proof of Lemma 13 we have
λ odd ( α , ϑ ) = C α 6 ϑ ,
with an explicit choice
C α = 6 C 0 1 ϑ / 6 , C 0 = 16 3 3 / 2 ,
so that
λ odd ( α , ϑ ) = C 0 ϑ 1 ϑ / 6 .
For ϑ = 1 5 this yields
λ odd ( 1 2 , 1 5 ) = C 0 / 5 1 1 / 30 = C 0 5 · 30 29 = 6 C 0 29 .
Since C 0 = 16 / 3 3 / 2 < 3.1 , we obtain
λ odd ( 1 2 , 1 5 ) < 6 · 3.1 29 0.641 < 1 .
Therefore
λ LY = 2 1 / 2 · 1 5 + λ odd ( 1 2 , 1 5 ) < 0.1414 + 0.641 < 0.79 < 1 .
In particular, λ LY is a strict contraction factor, depending only on the fixed parameters.
This proves both the inequality (53) and the bound λ LY < 1 . □
Lemma 14 
(Asymptotic form of the invariant density). Let P act on B tree , σ with σ > 1 and suppose P is quasi–compact with spectral gap and no other spectrum on the unit circle. Let h B tree , σ be the unique positive right eigenvector with P h = h and normalize the dual eigenfunctional ϕ by ϕ ( h ) = 1 . Then there exist constants c > 0 and δ > 0 (depending only on the parameters of the Lasota–Yorke framework) such that
h ( n ) = c n 1 + O n δ ( n ) .
Proof. 
Set H ( s ) : = n 1 h ( n ) n s for ( s ) > σ . We proceed in three steps.
Step 1 (Meromorphic structure of H and the pole at s = 1 ). By the Dirichlet transform intertwinement (Section 3) and the quasi–compact spectral calculus on B tree , σ (Section 4), Dirichlet transforms of B tree , σ -functions admit meromorphic continuation across a half–plane ( s ) > 1 δ 0 for some δ 0 ( 0 , 1 ) , with at most a simple pole at s = 1 whose residue is computed by the spectral projector Π f = ϕ ( f ) h . Applying this to f = h and using P h = h , we obtain that H extends meromorphically to ( s ) > 1 δ 0 with the expansion
H ( s ) = c s 1 + G ( s ) , ( s ) > 1 δ 0 ,
where c : = ϕ ( 1 ) > 0 and G is holomorphic on ( s ) > 1 δ 0 and of at most polynomial growth in vertical strips.1
Step 2 (Tauberian step: summatory asymptotic). Define the summatory function H # ( x ) : = n x h ( n ) . Since H has no singularities on { ( s ) = 1 } other than the simple pole at s = 1 and satisfies the growth hypothesis of the Wiener–Ikehara–Delange Tauberian theorem [3] in the half–plane ( s ) > 1 δ 0 , it follows that
H # ( x ) = c log x + C 0 + O x δ 1 ( x ) ,
for some constants C 0 R and δ 1 ( 0 , δ 0 ) (the precise δ 1 is inherited from the width δ 0 and strip–growth of G). See, e.g., Delange’s theorem or the Ikehara–Ingham variant.
Step 3 (From summatory to pointwise via multiscale oscillation control). Write a n : = n h ( n ) and let X > 1 . For each dyadic–triadic block I j = [ 6 j , 2 · 6 j ) defining the strong seminorm [ · ] tree , σ , the Lasota–Yorke inequality yields a uniform oscillation bound
osc I j ( a ) : = sup n , m I j | a n a m | C 6 j η
for some C > 0 and η ( 0 , 1 ) depending only on the Lasota–Yorke parameters (this is the standard consequence of the contraction of the strong seminorm together with boundedness in the weak norm). In particular a n varies slowly on each block I j .
By summation by parts on each I j and (57), we obtain the averaged estimate
1 | I j | n I j a n = 1 | I j | n I j n h ( n ) = c + O 6 j δ 1 .
Combining this block average with the oscillation control (58) gives, for every n I j ,
a n = c + O 6 j δ , δ : = min { δ 1 , η } .
Since n 6 j on I j , this is equivalent to
n h ( n ) = c + O n δ ,
hence
h ( n ) = c n 1 + O n δ ,
as claimed. □
We now record the standard consequence of the Lasota–Yorke inequality and the compact embedding of B tree into 1 .
Theorem 3 
(Quasi-compactness on B tree , σ ). Let 0 < α < 1 , 0 < ϑ < 1 , and σ > 1 . Assume that the Lasota–Yorke constant
λ ( α , ϑ ) : = 2 ( 1 α ) + λ odd ( α , ϑ )
satisfies λ ( α , ϑ ) < 1 , where λ odd ( α , ϑ ) is as in Lemma 10. Then the backward transfer operator P acting on B tree , σ is quasi-compact, and its essential spectral radius satisfies
ρ ess ( P | B tree , σ ) λ ( α , ϑ ) < 1 .
Proof. 
We work on the Banach space B tree , σ with norm · tree , σ = · σ + [ · ] tree , where · σ is the weighted σ 1 -norm and [ · ] tree is the tree seminorm defined in Section 4.3.
Step 1: Lasota–Yorke inequality. By Proposition 2 (applied in the weighted setting, with f 1 replaced by f σ ) we have, for all f B tree , σ ,
[ P f ] tree λ ( α , ϑ ) [ f ] tree + C LY f σ ,
with λ ( α , ϑ ) < 1 by assumption. On the weak norm side, since P is bounded on σ 1 , there exists C σ > 0 (e.g. C σ = Λ σ from (17)) such that
P f σ C σ f σ for all f B tree , σ .
Thus P satisfies a standard two-norm Lasota–Yorke inequality on B tree , σ with strong seminorm · s : = [ · ] tree and weak norm · w : = · σ :
P f s λ f s + C LY f w , P f w C σ f w .
Step 2: Compact embedding. By Lemma 7, the embedding
J : ( B tree , σ , · tree , σ ) ( σ 1 , · σ )
is compact. Since · w = · σ is exactly the weak norm used in (62), this shows that the unit ball of B tree , σ is relatively compact for the weak norm.
Step 3: Application of Ionescu–Tulcea–Marinescu / Hennion. We now invoke the standard quasi-compactness criterion (see, e.g., Ionescu–Tulcea and Marinescu, or Hennion’s theorem): if a bounded operator T on a Banach space X satisfies
(i)
a Lasota–Yorke inequality T x s λ x s + C x w with λ < 1 ,
(ii)
a weak bound T x w C x w , and
(iii)
the injection ( X , · s ) ( X , · w ) has relatively compact unit ball,
then T is quasi-compact on X and its essential spectral radius satisfies
ρ ess ( T ) λ .
Conditions (i)–(iii) are exactly (62) and Lemma 7 for T = P and X = B tree , σ . Therefore P is quasi-compact on B tree , σ and
ρ ess ( P | B tree , σ ) λ ( α , ϑ ) < 1 ,
which is (59). □
Remark 4 
(On the choice of parameters). The explicit bound (41) shows that λ odd ( α , ϑ ) decreases linearly with ϑ. For fixed α, one can therefore choose ϑ sufficiently small so that λ ( α , ϑ ) < 1 , provided the constant C α is effectively controlled. Subsequent sections make this optimization quantitative by computing C α and exhibiting admissible parameter pairs ( α , ϑ ) that give a strict spectral gap.
The Lasota–Yorke framework developed here supplies the functional-analytic backbone for the spectral approach to the Collatz problem: once explicit parameters with λ ( α , ϑ ) < 1 are verified, the quasi-compactness and spectral gap of P on B tree follow, and the spectral criteria of Section 4 can be invoked to constrain or rule out non-terminating configurations.

5. Spectral Consequences and Effective Block Recursion

Having established in Section 4.4 that the backward Collatz operator P is quasi-compact on the multi-scale tree space B tree , we now turn to the spectral consequences of this result. The Lasota–Yorke inequality ensures the existence of a spectral gap, which in turn controls the structure of invariant densities and the long-term behavior of iterates P k . The objective of this section is to characterize the invariant and quasi-invariant components of P, derive an effective block recursion for their scale-averaged coefficients, and demonstrate that the recursion enforces rigidity across the Collatz tree.
Throughout this section, h B tree , σ will denote an invariant density of P, i.e. a function satisfying P h = h . The analysis proceeds in several stages. First, we describe the structure of possible invariant profiles in the multiscale framework and show that the Lasota–Yorke inequality forces uniform flatness across scales. Next, we translate this flatness into an explicit two-sided recurrence relation for block averages c j . Finally, we verify that the coefficients of this recurrence satisfy a spectral bound consistent with the contraction constant λ odd ( α , ϑ ) computed earlier.
Theorem 4 
(Perron–Frobenius structure on B tree , σ ). Let P be the backward Collatz transfer operator acting on B tree , σ with parameters ( α , ϑ , σ ) chosen so that the Lasota–Yorke inequality and quasi–compactness hold. Then:
  • The spectral radius of P equals 1, and 1 is a simple eigenvalue.
  • There exists a unique eigenvector h B tree , σ with h > 0 and P h = h , normalized by ϕ ( h ) = 1 .
  • There exists a unique positive eigenfunctional ϕ B tree , σ * such that ϕ P = ϕ .
  • All other spectral values satisfy | z | < 1 , and P admits the spectral decomposition
    P = h ϕ + Q , ρ ( Q ) < 1 ,
    where Q is quasi–compact.
Proof. 
We combine the Lasota–Yorke inequality on B tree , σ with standard Perron–Frobenius theory for positive quasi–compact operators.
Step 1: Spectral radius and quasi–compactness. By construction P is a bounded linear operator on B tree , σ and is positive in the sense that f 0 implies P f 0 . The Lasota–Yorke inequality on B tree , σ (Proposition 2, say) together with the compact embedding of the strong seminorm into the weak norm implies that P is quasi–compact on B tree , σ with essential spectral radius strictly less than 1:
ρ ess ( P ) < 1 .
On the other hand, the logarithmic mass–preservation identity (Lemma 4) shows that the spectral radius of P is at least 1; the boundedness of P implies ρ ( P ) 1 , hence
ρ ( P ) = 1 .
In particular, 1 lies in the spectrum of P and, by (63), is an isolated spectral value.
Step 2: Existence of a positive eigenvector. Consider the positive cone
C : = { f B tree , σ : f 0 } ,
which is closed, convex, and reproducing. Since P is positive and ρ ( P ) = 1 , the Krein–Rutman theorem for positive operators on Banach spaces implies the existence of a nonzero h C such that
P h = h .
Moreover, h can be chosen strictly positive in the sense that h ( n ) > 0 for all n N : indeed, by the preimage structure of the Collatz map (Lemma 3) and the connectivity of the backward tree, any nontrivial f C is eventually propagated by iterates of P to a function that is positive on every block I j , so P k f > 0 for all sufficiently large k. Replacing h by P k h if necessary yields h > 0 .
Step 3: Uniqueness and simplicity of the eigenvalue 1. We now show that 1 is a simple eigenvalue and that h is unique up to scalar multiples. Suppose g B tree , σ satisfies P g = g . Decompose g = g + g into positive parts. Positivity of P implies P g ± = g ± . By the strong positivity argument above, any nonzero f C with P f = f must be strictly positive; hence g + and g are both either 0 or strictly positive. If both were nonzero, then g + and g would be linearly independent positive eigenvectors for the eigenvalue 1, and the positive cone would contain a two-dimensional face of eigenvectors. This contradicts the Krein–Rutman conclusion that the eigenspace associated with the spectral radius is one–dimensional. Therefore one of g + , g must vanish and g is either nonnegative or nonpositive; by replacing g by g if necessary, g 0 , and the strong positivity then forces g to be a scalar multiple of h. Thus the eigenspace for the eigenvalue 1 is one–dimensional and spanned by h, and 1 is a simple eigenvalue. This proves (1) and the first part of (2) after normalizing by ϕ ( h ) = 1 below.
Step 4: Dual eigenfunctional. Consider the dual operator P * acting on B tree , σ * . Since P is positive, so is P * on the dual cone
C * : = { ψ B tree , σ * : ψ ( f ) 0 for all f C } .
The quasi–compactness of P implies quasi–compactness of P * on the dual space. By (64), P * also has spectral radius 1. Applying the same Krein–Rutman argument to P * yields a nonzero ϕ C * and
ϕ P = ϕ ,
with ϕ strictly positive on nonzero elements of C . The same simplicity argument as in Step 3 shows that the eigenspace of P * for the eigenvalue 1 is one–dimensional and spanned by ϕ . Normalizing by the condition ϕ ( h ) = 1 gives the uniquely determined eigenpair ( h , ϕ ) appearing in the statement. This establishes (2) and (3).
Step 5: Spectral decomposition and spectral gap. Quasi–compactness of P on B tree , σ , together with (63) and the simplicity of the eigenvalue 1, implies that the spectrum of P is contained in { 1 } { z : | z | < r } for some r < 1 . Let Π denote the spectral projection onto the eigenspace associated with λ = 1 ; by the previous steps,
Π f = h ϕ ( f ) , f B tree , σ ,
so that Π = h ϕ as a rank–one operator. Writing
P = Π + Q = h ϕ + Q ,
we have Q = P Π and Q Π = Π Q = 0 . The spectrum of Q is contained in { z : | z | < r } , so in particular
ρ ( Q ) < 1 .
Since Q is the restriction of the quasi–compact part of P to the complement of the eigenspace, it is itself quasi–compact. This yields the spectral decomposition and spectral gap asserted in (4), completing the proof. □
Proposition 4 
(Forward dynamics and P-invariant functionals). Let 0 < α , ϑ < 1 and σ > 1 . Consider the pairing f , φ : = n 1 f ( n ) φ ( n ) between B tree , σ and
B tree , σ * : = φ : N C : φ * : = sup j 0 ϑ j osc I j φ + sup j 0 6 σ j n I j | φ ( n ) | < ,
where osc I j φ : = sup m , n I j | φ ( m ) φ ( n ) | . Then · , · extends continuously to B tree , σ × B tree , σ * , and the adjoint
( P * φ ) ( m ) = 1 m 1 { 2 m } φ ( m / 2 ) + 1 { m odd } φ ( 3 m + 1 ) .
Moreover, there exist constants C σ > 0 and M σ 1 such that
( P * ) k B tree , σ * B tree , σ * C σ M σ k , k 0 ,
and the Cesàro averages Φ N : = 1 N k = 0 N 1 ( P * ) k φ form a bounded set in B tree , σ * for every φ B tree , σ * .
Positive-frequency divergent families.Suppose there exist c > 0 and an infinite set of scales J N such that for each j J there is a finite set A j I j with | A j | c | I j | and forward trajectories that visit A j with asymptotic frequency c . For a summable weight sequence ( w j ) j 0 with j w j ϑ j < and j w j 6 σ j < , define
φ j ( n ) : = w j | A j | 1 A j ( n ) , φ : = j J φ j .
Then φ B tree , σ * , the Cesàro averages Φ N are bounded in B tree , σ * , and any weak-* limit point Φ satisfies P * Φ = Φ and Φ 0 . Consequently ( f ) : = f , Φ is a nonzero invariant functional with P = .
Proof. 
Continuity of the pairing. Fix j and set c j : = | I j | 1 n I j f ( n ) and φ I j : = | I j | 1 n I j φ ( n ) . Then
n I j f ( n ) φ ( n ) = n I j ( f ( n ) c j ) φ ( n ) φ I j + c j n I j φ ( n ) .
(a) Oscillatory term. Using I j ( f c j ) = 0 and osc I j φ : = sup u , v I j | φ ( u ) φ ( v ) | ,
n I j ( f ( n ) c j ) φ ( n ) φ I j osc I j φ n I j | f ( n ) c j | .
By the tree seminorm and the block geometry (since W α 6 ( 1 α ) j on I j ),
osc I j f K α ϑ j 6 ( 1 α ) j [ f ] tree , n I j | f ( n ) c j | | I j | osc I j f C ϑ j 6 α j [ f ] tree .
Therefore
n I j ( f ( n ) c j ) φ ( n ) φ I j C ϑ j 6 α j [ f ] tree osc I j φ .
Multiply and divide by ϑ j and take sup j ϑ j osc I j φ to get
j 0 I j ( f c j ) ( φ φ I j ) C [ f ] tree sup j 0 ϑ j osc I j φ j 0 ϑ 2 j 6 α j .
Since α > 0 , we can absorb j ϑ 2 j 6 α j into the constant (using that ϑ ( 0 , 1 ) is fixed), hence
j 0 I j ( f c j ) ( φ φ I j ) C [ f ] tree φ * .
(b) Mean term. By averaging and the weighted norm,
| c j | 1 | I j | n I j | f ( n ) | 1 | I j | n I j n σ | f ( n ) | n σ C 6 ( σ 1 ) j f σ 1 .
Hence
c j n I j φ ( n ) C 6 ( σ 1 ) j f σ 1 6 σ j 6 σ j I j | φ | C 6 j f σ 1 sup j 0 6 σ j I j | φ | .
Summing over j gives a finite geometric series:
j 0 c j I j φ C f σ 1 φ * .
Combining (a) and (b) yields | f , φ | C [ f ] tree + f σ 1 φ * = C f tree , σ φ * .

5.1. Redesigned Multiscale Space and Invariant Profiles

The quasi-compactness of P implies that its spectrum consists of a discrete set of eigenvalues of finite multiplicity outside a disk of radius ρ ess ( P ) λ LY < 1 , together with a residual spectrum contained in that disk. Let λ 0 = 1 denote the trivial eigenvalue corresponding to constant functions. Any additional eigenvalues with | λ | < 1 correspond to exponentially decaying modes. Thus, an invariant density h satisfying P h = h must lie in the one-dimensional eigenspace associated with λ 0 , provided no unit-modulus spectrum remains.
However, to make this conclusion effective, one must exclude the possibility of small oscillatory components that project into higher spectral modes but decay too slowly to be detected by the weak 1 norm alone. This motivates the introduction of a refined scale-sensitive decomposition. Define block intervals I j as in (34), and let
H j ( h ) : = n I j h ( n ) , c j : = H j ( h ) | I j | = H j ( h ) 6 j .
The sequence ( c j ) j 0 captures the mean behavior of h across successive scales in the backward tree. Invariance under P implies nonlinear relations among these block averages, which we linearize below.
Lemma 15 
(Block-level invariance relation). Let 0 < α < 1 , 0 < ϑ < 1 , and σ > 1 , and let h B tree , σ satisfy P h = h . For each j 0 define the block average
c j : = 1 | I j | n I j h ( n ) , | I j | : = # I j .
Then there exist sequences ( a j ) j 0 , ( b j ) j 0 with a j , b j 0 and a sequence ( ε j ) j 0 such that
c j = a j c j + 1 + b j c j 1 + ε j ,
where a j and b j are determined by the local distribution of even and odd preimages between neighboring scales, and the error sequence ε = ( ε j ) is summable in the weighted norm, i.e.
j 0 ϑ j | ε j | < .
Proof. 
Throughout, fix h B tree , σ with P h = h .
1. Start from the invariance equation on each block. For each j 0 ,
| I j | c j = n I j h ( n ) = n I j ( P h ) ( n ) = n I j h ( 2 n ) 2 n + 1 { n 4 ( 6 ) } h n 1 3 ( n 1 ) / 3 .
Write
S j even : = n I j h ( 2 n ) 2 n , S j odd : = n I j n 4 ( 6 ) h n 1 3 ( n 1 ) / 3 ,
so that
| I j | c j = S j even + S j odd .
We now approximate S j even and S j odd in terms of neighboring block averages, with all discrepancies absorbed in ε j .
2. Even branch contribution. For n I j , the even preimage is m = 2 n , and
S j even = n I j h ( 2 n ) 2 n = m 2 I j h ( m ) m ,
where 2 I j : = { 2 n : n I j } . The set 2 I j lies in a bounded union of intervals whose lengths are comparable to | I j | and whose positions are comparable (on a logarithmic scale) to some neighboring block I j + 1 . We decompose
h ( m ) = c j + 1 + h ( m ) c j + 1
for those m whose scale is that of I j + 1 , and similarly for indices belonging to at most finitely many adjacent blocks. This yields
S j even = a j ( even ) | I j | c j + 1 + R j even ,
where
a j ( even ) : = 1 | I j | n I j 1 2 n 1 { 2 n lies in the next scale block ( s ) } ,
and R j even collects:
(i)
contributions from h ( m ) c k within the relevant blocks,
(ii)
contributions from even preimages m falling outside the chosen neighboring blocks.
Because h B tree , σ , its oscillation inside each block is controlled by [ h ] tree , so replacing h ( m ) by the corresponding block average c k incurs an error bounded by
| h ( m ) c k | [ h ] tree W α ( m 1 , m 2 )
for suitable m 1 , m 2 in that block; the precise bound is obtained by choosing m 1 , m 2 maximizing the tree seminorm at that scale and using the definition of [ h ] tree . After dividing by m (which is 6 j at this scale) and averaging over I j , we get
| R j even | 6 j [ h ] tree + 6 j σ h σ ,
where the second term accounts for the finitely many preimages lying outside the neighboring blocks, using the weighted σ 1 bound on h. Thus
j 0 ϑ j | R j even | < .
By construction a j ( even ) 0 .
3. Odd branch contribution. For n 4 ( mod 6 ) , the odd preimage is m = ( n 1 ) / 3 , and
S j odd = n I j n 4 ( 6 ) h ( m ) m .
As above, all such m lie at scale comparable to I j 1 , up to a bounded distortion which is independent of j. We write
h ( m ) = c j 1 + h ( m ) c j 1 ,
and obtain
S j odd = b j ( odd ) | I j | c j 1 + R j odd ,
where
b j ( odd ) : = 1 | I j | n I j n 4 ( 6 ) 1 ( n 1 ) / 3 ,
and R j odd collects:
(i)
the errors from replacing h ( m ) by c j 1 ,
(ii)
any edge effects from m lying just outside I j 1 .
All indices m whose images under the even/odd branches land outside the adjacent blocks are absorbed into R j even and R j odd ; these edge spillovers are ϑ -summable thanks to σ > 1 and the block oscillation control from [ h ] tree .
As before, the tree seminorm controls oscillations within blocks, so | h ( m ) c j 1 | is bounded by a multiple of [ h ] tree times a scale factor, and dividing by m 6 j 1 yields
| R j odd | 6 j [ h ] tree + 6 j σ h σ .
Thus
j 0 ϑ j | R j odd | < .
By construction b j ( odd ) 0 .
4. Assemble the block relation. Substituting (74) and (76) into (73), we obtain
| I j | c j = a j ( even ) | I j | c j + 1 + b j ( odd ) | I j | c j 1 + R j even + R j odd .
Dividing by | I j | gives
c j = a j ( even ) c j + 1 + b j ( odd ) c j 1 + ε j ,
where
ε j : = R j even + R j odd | I j | .
Set a j : = a j ( even ) and b j : = b j ( odd ) . By construction a j , b j 0 , and they encode the (normalized) weights of even and odd preimages between the neighboring scales. Moreover, using | I j | 6 j together with (75) and (77), we obtain
j 0 ϑ j | ε j | j 0 ϑ j | R j even | + | R j odd | | I j | < ,
since the additional factor 1 / | I j | 6 j makes the series converge absolutely once σ > 1 and [ h ] tree is finite. This is exactly (72).
Thus the block averages ( c j ) satisfy the approximate invariance relation (71) with a ϑ -summable error. □
Lemma 16 
(Limiting preimage ratios). Let ( I j ) j 0 be the multiscale blocks
I j = [ 6 j , 2 · 6 j ) N , | I j | = 6 j .
Define a j and b j as in Lemma 15, i.e. as the normalized contributions (depending only on the preimage structure of T) of even and odd preimages from neighboring scales to the block relation
c j = a j c j + 1 + b j c j 1 + ε j ,
for block averages c j of any invariant profile h with P h = h . Then there exist constants a , b > 0 such that
lim j a j = a , lim j b j = b ,
and
a + b = 1 , 0 < b < a < 1 .
Moreover, there exist C > 0 and 0 < δ < 1 (independent of h) such that for all j 0 ,
| a j a | + | b j b | C δ j .
Proof. 
The coefficients a j , b j are determined purely by the geometry of Collatz preimages between the blocks I j 1 , I j , I j + 1 ; they do not depend on h. We make this explicit.
1. Preimage windows and raw counts. For m N , the Collatz map, (1) has two inverse branches:
n 2 n ( even branch ) , n n 1 3 when n 4 ( mod 6 ) ( odd branch ) .
In the block relation of Lemma 15, only preimages that land in the adjacent large scales contribute to the “main” coefficients a j , b j ; all other preimages (falling into gaps or non-adjacent blocks) are assigned to the perturbation ε j .
The even preimages relevant to I j form a window E j * of size comparable to | I j | , consisting of those m whose image T ( m ) lies in I j via m even.
he odd preimages relevant to I j form a thinner window O j * , consisting of those odd m with T ( m ) = 3 m + 1 I j (equivalently, n : = 3 m + 1 I j and n 4 ( mod 6 ) ).
A direct count shows:
1. For the even window, each n I j has an even preimage 2 n , so
| E j * | = | I j | = 6 j .
2. For the odd window, we need n I j with n 4 ( mod 6 ) and then m = ( n 1 ) / 3 odd. Among the | I j | = 6 j integers in I j , exactly one in every six is 4 ( mod 6 ) , up to boundary effects. Hence
| O j * | = 1 6 | I j | + O ( 1 ) = 6 j 1 + O ( 1 ) ,
so in particular | O j * | > 0 for all sufficiently large j.
Thus the total number of “neighboring-scale” preimages associated with I j is
| E j * | + | O j * | = 1 + 1 6 | I j | + O ( 1 ) = 7 6 6 j + O ( 1 ) .
2. Canonical normalization of a j , b j . By Lemma 15, the coefficients a j , b j are defined as the normalized weights of even vs. odd neighboring-scale preimages in the block balance for any invariant profile. Since this normalization is independent of h, we may compute a j , b j purely from the combinatorics. The natural choice is:
a j : = | E j * | | E j * | + | O j * | , b j : = | O j * | | E j * | + | O j * | .
These are exactly the “ratios of the number of even and odd preimages between adjacent scales” announced in Lemma 15.
Using the counts above,
a j = 6 j 6 j + 6 j 1 + O ( 1 ) = 1 1 + 1 6 + O ( 6 j ) = 6 7 + O ( 6 j ) , b j = 6 j 1 + O ( 1 ) 6 j + 6 j 1 + O ( 1 ) = 1 6 + O ( 6 j ) 1 + 1 6 + O ( 6 j ) = 1 7 + O ( 6 j ) .
In particular, there exist limits
a = lim j a j = 6 7 , b = lim j b j = 1 7 ,
and there exists C > 0 such that, for all j,
| a j a | + | b j b | C 6 j .
Thus the desired exponential convergence holds with δ : = 1 / 6 ( 0 , 1 ) .
3. Structural properties. From the explicit limits we immediately have
a + b = 6 7 + 1 7 = 1 , 0 < b < a < 1 .
Alternatively, the identity a j + b j = 1 holds exactly for each j when tested against the constant profile h 1 (for which the block perturbation ε j vanishes), and passes to the limit as j .
Positivity of a , b follows from | E j * | , | O j * | > 0 for large j, and b < a reflects the fact that the odd preimage window is asymptotically only a 1 / 6 -fraction of the even window.
This completes the proof. □
Proposition 5 
(Effective recursion for peripheral eigenfunctions). Let 0 < α < 1 , 0 < ϑ < 1 , σ > 1 , and let h B tree , σ satisfy P h = λ h with | λ | = 1 . Let H j : = n I j h ( n ) and c j : = H j / | I j | be the block sums and block averages on I j = [ 6 j , 2 · 6 j ) N . Then, with a , b > 0 as in Lemma 16, there exists a sequence ( ε j ) j 1 with j 1 | ε j | ϑ j < such that
c j = λ 1 a c j + 1 + λ 1 b c j 1 + ε j , j 1 .
Equivalently, for the renormalized averages d j : = λ j c j we have
d j = a d j + 1 + b d j 1 + ε ˜ j , j 1 | ε ˜ j | ϑ j < ,
with ε ˜ j : = λ j ε j .
Proof. 
Step 1: Block summation of the eigenrelation. Summing P h = λ h over n I j gives
n I j ( P h ) ( n ) = λ n I j h ( n ) = λ H j .
By the definition of P = P even + P odd ,
n I j ( P h ) ( n ) = n I j h ( 2 n ) 2 n + n I j n 4 ( 6 ) h n 1 3 ( n 1 ) / 3 = : S j even + S j odd .
As in the proof of Lemma 15 (the λ = 1 case), we reorganize each sum by changing variables along the inverse branches and separating the main contributions that land in adjacent scales ( I j + 1 for the even branch, I j 1 for the odd branch) from the boundary remainders (spillovers due to the half-open endpoints and the congruence restriction n 4 ( mod 6 ) ). Concretely,
S j even = n I j h ( 2 n ) 2 n = m E j * h ( m ) m + R j even , S j odd = n I j n 4 ( 6 ) h n 1 3 ( n 1 ) / 3 = m O j * h ( m ) m + R j odd ,
where E j * I j + 1 and O j * I j 1 are the preimage windows collecting those m whose images lie in I j under the even and odd branches, respectively, and R j even , R j odd are the boundary remainders (coming from ( I j + 1 E j * ) and ( I j 1 O j * ) ).
Thus
λ H j = m E j * h ( m ) m + m O j * h ( m ) m + R j even + R j odd .
Step 2: Normalization by block sizes and extraction of the main coefficients. Divide by | I j | = 6 j and write c k = H k / | I k | :
λ c j = 1 | I j | m E j * h ( m ) m + 1 | I j | m O j * h ( m ) m + R j even + R j odd | I j | .
Inside each window the points m satisfy m | I j + 1 | (even window) or m | I j 1 | (odd window), so 1 / m fluctuates by a bounded multiplicative factor around 1 / | I j + 1 | or 1 / | I j 1 | . Using the B tree , σ control of oscillations within blocks, this fluctuation contributes only to an error term summable in the weighted ϑ -norm. Hence
1 | I j | m E j * h ( m ) m = | E j * | | I j | · 1 | I j + 1 | m E j * h ( m ) + η j even = a j c j + 1 + η j even ,
and similarly
1 | I j | m O j * h ( m ) m = b j c j 1 + η j odd ,
where a j : = | E j * | / ( | E j * | + | O j * | ) , b j : = | O j * | / ( | E j * | + | O j * | ) (so a j + b j = 1 ), and η j even , η j odd are error terms whose weighted sum j ϑ j | η j · | is finite. The boundary remainders likewise satisfy
j 1 ϑ j | R j even | + | R j odd | | I j | <
by the same block-oscillation and congruence estimates used in Lemma 15.
Collecting terms, we obtain
λ c j = a j c j + 1 + b j c j 1 + η j , j 1 ϑ j | η j | < ,
which is the twisted version of the block relation of Lemma 15.
Step 3: Freezing the coefficients to the limits a , b . By Lemma 18, there exist a , b > 0 with a + b = 1 , 0 < b < a < 1 , and constants C > 0 , 0 < δ < 1 such that | a j a | + | b j b | C δ j for all j. Rewrite (81) as
λ c j = a c j + 1 + b c j 1 + η j + ( a j a ) c j + 1 + ( b j b ) c j 1 = : ζ j .
To show j ϑ j | ζ j | < , it remains to bound the “freezing” errors ( a j a ) c j + 1 and ( b j b ) c j 1 in the weighted sum. As in the proof of Proposition 7, h B tree , σ implies the block averages obey the growth bound
| c k | C 0 6 ( σ 1 ) k h σ ( k 0 ) ,
for a constant C 0 depending only on σ and the block geometry. Hence
ϑ j | ( a j a ) c j + 1 | ϑ j C δ j C 0 6 ( σ 1 ) ( j + 1 ) h σ = C ϑ δ 6 σ 1 j h σ ,
and similarly for ( b j b ) c j 1 (with j 1 in place of j + 1 ). Choosing ϑ ( 0 , 1 ) (as done when defining B tree , σ ) small enough so that ϑ δ 6 σ 1 < 1 , these two geometric series converge, uniformly in h up to h σ . Therefore
j 1 ϑ j | ζ j | < .
Set ε j : = λ 1 ζ j and divide the identity by λ (note | λ | = 1 ), which yields (79) with j ϑ j | ε j | = j ϑ j | ζ j | < .
Step 4: Renormalized averages. Define d j : = λ j c j . Multiplying (79) by λ j ,
d j = a d j + 1 + b d j 1 + ε ˜ j , ε ˜ j : = λ j ε j ,
and since | λ | = 1 we have j ϑ j | ε ˜ j | = j ϑ j | ε j | < . This is (80). □
Remark 5 
(Admissibility for freezing the coefficients). The “freezing” errors ( a j a ) c j + 1 and ( b j b ) c j 1 are summable in the weighted norm provided
ϑ δ 6 σ 1 < 1 with δ = 1 6 ,
equivalently ϑ 6 σ 2 < 1 . This holds, for example, for any σ ( 1 , 2 ) when ϑ = 1 5 .
Remark 6 
(Exact normalization of the block coefficients). In Lemma 15 the neighboring-scale coefficients are determined purely by preimage windows:
a j : = | E j * | | E j * | + | O j * | , b j : = | O j * | | E j * | + | O j * | , so a j + b j = 1 .
Lemma 16 shows a j a = 6 7 and b j b = 1 7 with | a j a | + | b j b | 6 j .
Remark 7 
(Coefficient freezing). The combinatorial structure of the Collatz tree implies that the ratios
a j : = | I j + 1 | 2 | I j | , b j : = | I j 1 | | I j |
stabilize as j . More precisely,
a j 6 7 , b j 1 7 .
The limits correspond to the asymptotic frequencies of even and admissible odd preimages within the block I j , and follow from the block geometry and the counting estimates for even and odd branches preceding Lemma 17.
Remark 8 
(Asymptotic limits of the block coefficients). Let a j and b j be the block coefficients
a j : = | I j + 1 | 2 | I j | , b j : = | I j 1 | | I j | ,
arising in the decomposition of block averages under P h = h . Then the Collatz preimage combinatorics and the block geometry imply:
  • a j , b j 0 and a j + b j = 1 for all sufficiently large j;
  • The coefficients converge to the limiting values
    a j 6 7 , b j 1 7 , ( j ) .
  • The convergence is quantitative: there exists ϑ ( 0 , 1 ) and C > 0 such that
    | a j 6 7 | + | b j 1 7 | C ϑ j .
These limits correspond to the asymptotic frequencies of even and admissible odd preimages inside the block I j , established by the detailed counting in the preceding even/odd decomposition.
Lemma 17 
(Effective block recursion). Let h B tree , σ be the positive invariant density satisfying P h = h . For each scale block I j define
c j : = 1 | I j | n I j h ( n ) , j 0 .
Then there exist sequences ( a j ) j j 0 , ( b j ) j j 0 and an error sequence ( ε j ) j j 0 such that:
  • a j , b j 0 and a j + b j = 1 for all j j 0 ;
  • a j a = 6 7 and b j b = 1 7 as j ;
  • the block averages satisfy the second-order recursion
    c j = a j c j + 1 + b j c j 1 + ε j , j j 0 ;
  • the perturbations satisfy the weighted summability bound
    j j 0 ϑ j | ε j | < .
Moreover, the constants a, b and the summability rate depend only on ( α , ϑ , σ ) and the tree geometry.
Proof. 
Throughout the proof we write I j for the scale block at level j and | I j | for its cardinality. Recall that h is invariant, so for every n 1 ,
h ( n ) = 1 2 h ( 2 n ) + 1 { n 4 ( mod 6 ) } h n 1 3 .
Averaging (85) over n I j yields
c j = E j + O j ,
where
E j : = 1 | I j | n I j 1 2 h ( 2 n ) , O j : = 1 | I j | n I j n 4 ( mod 6 ) h n 1 3 .
We now express E j and O j in terms of c j + 1 and c j 1 plus controlled error terms.
Step 1: Even contribution. Consider the image set
J j even : = { 2 n : n I j } .
By construction of the scale blocks I j , the interval J j even is contained in a finite union of consecutive blocks at scales j and j + 1 , and for j large enough it intersects exactly one “main” block at scale j + 1 (which we denote by I j + 1 ) plus a uniformly bounded number of boundary fragments lying in neighboring blocks at scales j or j + 2 . More precisely, there exist disjoint sets A j I j and B j I j such that
{ 2 n : n A j } = I j + 1 , { 2 n : n B j } I j bdry I j + 2 bdry ,
where each boundary set I k bdry is a subset of I k of size O ( 6 j 1 ) independent of h. Thus the cardinalities satisfy
| A j | = | I j + 1 | 2 , | B j | = | I j | | A j | = | I j | | I j + 1 | 2 ,
and both | I j | and | I j + 1 | are comparable to 6 j .
We decompose
E j = 1 | I j | n A j 1 2 h ( 2 n ) + 1 | I j | n B j 1 2 h ( 2 n ) = E j ( 1 ) + E j ( 2 ) .
For the main part, change variables m = 2 n in the sum over A j :
E j ( 1 ) = 1 2 | I j | n A j h ( 2 n ) = 1 2 | I j | m I j + 1 h ( m ) = | I j + 1 | 2 | I j | c j + 1 .
For the boundary contribution E j ( 2 ) , note that the image { 2 n : n B j } lies in a finite union of boundary subsets of neighboring blocks. By the definition of the B tree , σ –norm, the average value of | h | on each such boundary subset is bounded by a uniform multiple of the block average at the corresponding scale, and the total number of boundary points at level j is O ( 6 j 1 ) . Hence there exists a constant C > 0 , depending only on the space parameters, such that
| E j ( 2 ) | C | I j | k { j , j + 2 } 6 j 1 c k C 6 1 ( c j + c j + 2 ) ,
for some C > 0 . Since ( c k ) is bounded (again by h B tree , σ ), (88) shows that E j ( 2 ) = O ( 6 j ) uniformly in j.
Define
a j : = | I j + 1 | 2 | I j | , δ j even : = E j ( 2 ) ,
so that
E j = a j c j + 1 + δ j even .
The block geometry (the fact that | I j + 1 | / | I j | 12 / 7 as j ) implies that a j a = 6 / 7 as j . Moreover the preceding bounds show that the sequence ( ϑ j δ j even ) is summable for any fixed 0 < ϑ < 1 chosen as in the Lasota–Yorke inequality, since δ j even decays at least like a fixed multiple of 6 j .
Step 2: Odd contribution. We now treat O j . The congruence condition n 4 ( mod 6 ) together with the definition of the Collatz odd preimage shows that the map
n m : = n 1 3
sends the admissible odd indices in I j into a finite union of blocks at scale j 1 , with one main block I j 1 and finitely many boundary pieces in neighboring blocks I j 1 bdry and I j + 1 bdry . More precisely, there is a subset A j I j of indices n 4 ( mod 6 ) such that
n 1 3 : n A j = I j 1 ,
and the remaining admissible indices in I j map into boundary subsets of neighboring blocks. Let B j denote the set of admissible indices in I j not belonging to A j . Then
O j = 1 | I j | n A j h n 1 3 + 1 | I j | n B j h n 1 3 = O j ( 1 ) + O j ( 2 ) .
For O j ( 1 ) we change variables m = ( n 1 ) / 3 and obtain
O j ( 1 ) = 1 | I j | m I j 1 h ( m ) = | I j 1 | | I j | c j 1 .
Set
b j : = | I j 1 | | I j | .
The combinatorial description of the tree and the choice of blocks I j imply that b j b = 1 / 7 as j ; in particular b j 0 for all j.
For the boundary term O j ( 2 ) , the same argument as in (88), combined with the definition of the B tree , σ –norm, yields
| O j ( 2 ) | C 6 1 ( c j 1 + c j + 1 )
for some constant C > 0 independent of j. Hence O j ( 2 ) also decays at least like a fixed multiple of 6 j , and the sequence ( ϑ j O j ( 2 ) ) is summable for any fixed 0 < ϑ < 1 . Define
δ j odd : = O j ( 2 ) .
Then
O j = b j c j 1 + δ j odd .
Step 3: Collecting terms and defining ε j . Combining (86), (90) and (93) we obtain
c j = a j c j + 1 + b j c j 1 + δ j even + δ j odd , j j 0 .
Set
ε j : = δ j even + δ j odd .
By construction a j , b j 0 and, up to redefining j 0 if necessary, the block geometry guarantees a j + b j = 1 for all j j 0 : the main part of the image mass from I j under the even and odd branches is redistributed into the neighboring blocks in proportions converging to the fixed probabilities 6 / 7 and 1 / 7 , and the boundary contributions have been absorbed into ε j .
The asymptotic limits a j 6 / 7 and b j 1 / 7 follow from the combinatorial description of preimages in the Collatz tree: even preimages occur with frequency asymptotic to 6 / 7 at large scales, while admissible odd preimages (those with n 4 ( mod 6 ) ) occur with frequency asymptotic to 1 / 7 . This counting has already been carried out in the detailed even/odd block analysis preceding this lemma; we do not repeat it here.
Finally, the bounds on δ j even and δ j odd above show that | ε j | C * 6 j for some C * > 0 . Since 0 < ϑ < 1 is fixed, the series j j 0 ϑ j | ε j | converges, which gives (84).
This proves the existence of sequences a j , b j , ε j with the required properties and completes the proof. □
The Lasota–Yorke inequality (46) implies that oscillations of h across successive scales decay geometrically:
[ f ] tree C LY 1 λ LY f 1 ,
so that any invariant h must be essentially flat in the strong seminorm. Translating this statement into block averages gives
| c j + 1 c j | C ϑ j , j 0 ,
for some C > 0 . The decay of successive differences enforces a near-constant profile c j c , and any residual deviation must satisfy the perturbed recursion (71).
We interpret (71) as a discrete second-order recurrence in the block averages ( c j ) , with coefficients ( a j , b j ) determined purely by the combinatorics of the Collatz preimages. In the limit a j a , b j b described in Lemma 16, the homogeneous part
c j = a c j + 1 + b c j 1
captures the mean balancing between even and odd contributions across adjacent scales.
Introducing the vector v j : = ( c j , c j 1 ) , the recursion can be written in matrix form
v j + 1 = M v j , M = 0 a b 0 .
The eigenvalues of M are ± a b , so the spectral radius is ρ ( M ) = a b . Since a + b = 1 and 0 < b < a < 1 , we have a b < 1 4 and hence ρ ( M ) < 1 2 < 1 . Consequently, the homogeneous solutions of (96) decay exponentially to a constant profile, and any deviation from constancy lies in the stable eigendirection of M.
Remark 9 
(Spectral radius of the frozen block matrix). Let
M = 0 a b 0 , a = 6 7 , b = 1 7 ,
be the limiting coefficient matrix associated with the homogeneous block recursion
c j = a c j + 1 + b c j 1 .
Then the eigenvalues of M are
λ ± = ± a b ,
so the spectral radius is
ρ ( M ) = a b = 6 7 < 1 .
Consequently, the homogeneous recursion is exponentially stable: every solution subexponential in j converges to a constant profile, and any deviation decays at rate O ( ρ ( M ) j ) . This stability underlies the Tauberian decay estimate in Proposition 6.
Proposition 6 
(Decay profile of the invariant density). Let h B tree , σ be the strictly positive invariant density satisfying
P h = h , ϕ ( h ) = 1 ,
where ϕ is the normalized positive left eigenfunctional from Theorem 4. For each scale block I j = [ 6 j , 2 · 6 j ) define the block averages
c j : = 1 | I j | n I j h ( n ) , j 0 .
Assume the effective block recursion of Lemma 17 holds in the form
c j = a j c j + 1 + b j c j 1 + ε j , j j 0 ,
with coefficients a j , b j 0 , a j + b j = 1 , satisfying
a j a = 6 7 , b j b = 1 7 , j j 0 ϑ j | a j a | + | b j b | < ,
and perturbations ε j obeying
j j 0 ϑ j | ε j | <
for some fixed ϑ ( 0 , 1 ) . Assume moreover that the parameters ( α , ϑ ) in the definition of B tree , σ satisfy
ϑ 6 α < 1 .
Then there exists a constant c > 0 such that
h ( n ) = c n + o 1 n ( n ) ,
with the error term uniform along rays of the Collatz tree.
Proof. 
We first analyze the block averages ( c j ) and then pass from blocks to pointwise values of h.
Step 1: Renormalized block recursion and convergence of w j . Introduce the renormalized sequence
w j : = 6 j c j , j 0 .
Multiplying (99) by 6 j and using a j + b j = 1 yields
w j = a j 6 w j + 1 + 6 b j w j 1 + 6 j ε j , j j 0 .
By Lemma 17 and Remarks 7–8, the perturbations and coefficient deviations are controlled as in (100)–(101). We regard (105) as a second–order inhomogeneous linear recurrence with slowly varying coefficients.
For the frozen–coefficient system, set
M = 0 a b 0 , v j : = c j c j 1 ,
so that the homogeneous recursion c j = a c j + 1 + b c j 1 can be written as v j + 1 = M v j . As observed in Remark 9, the eigenvalues of M are λ ± = ± a b and
ρ ( M ) = a b = 6 7 · 1 7 < 1 2 < 1 .
Thus there is a norm · * on R 2 and a constant η ( 0 , 1 ) such that M * η .
The recursion (99) can be written in the form
v j + 1 = M j v j + F j ,
where M j is a 2 × 2 matrix converging to M and F j is an inhomogeneity arising from ε j . The weighted summability (100)–(101) implies
j j 0 ϑ j M j M * + F j * < .
A standard discrete variation–of–constants argument for the nonautonomous system (108) (applied in the norm · * and using M * η < 1 together with (109)) shows that
v j = v + r j , r j * C ϑ j ( j j 0 ) ,
for some vector v = ( c , c ) T and constant C > 0 . In particular,
c j = c + O ( ϑ j ) ( j ) .
Since h > 0 and each c j is an average of positive values, the limit c is strictly positive. Returning to w j = 6 j c j we obtain
w j = 6 j c + O ( ϑ j 6 j ) ,
so that
c j = w j 6 j = c + O ( ϑ j ) ( j ) .
Step 2: Oscillation control inside blocks. The Lasota–Yorke inequality on B tree , σ implies that h has uniformly controlled oscillations on each block. More precisely, by the definition of the tree seminorm and the choice of parameters ( α , ϑ ) , there are constants C 1 > 0 and α ( 0 , 1 ) such that
osc I j h : = sup u , v I j | h ( u ) h ( v ) | C 1 ϑ j 6 ( 1 α ) j ( j j 0 ) .
Combining (114) with the definition (98) of c j yields, for every n I j ,
| h ( n ) c j | osc I j h C 1 ϑ j 6 ( 1 α ) j .
Since n I j implies n 6 j , we have 6 j 1 / n . Moreover, by (102),
ϑ j 6 ( 1 α ) j 6 j = ϑ 6 α j 0 ( j ) ,
so the oscillation error in (115) is o ( 6 j ) and hence o ( 1 / n ) .
Step 3: Pointwise asymptotics. Combining (113) and (115), and using 6 j n for n I j , we obtain, uniformly for n I j ,
h ( n ) = c j + O ϑ j 6 ( 1 α ) j = c + O ( ϑ j ) + o ( 6 j ) = c 6 j · 6 j + o ( 6 j ) .
Since 6 j n on I j , we may write 6 j = κ j / n with κ j κ > 0 , and (117) becomes
h ( n ) = c n + o 1 n , n ,
where c = c κ > 0 is a constant determined by the normalization ϕ ( h ) = 1 . The o ( 1 / n ) error is uniform in n on each block I j , and hence uniform along rays of the Collatz tree.
This proves (103) and completes the argument. □
The explicit Lasota–Yorke constants obtained in Section 4.4 guarantee that the same contraction rate governs the full operator P on B tree , σ , ensuring that invariant densities are asymptotically flat in the strong seminorm—block averages converge while the global profile follows the two-sided recursion. In particular, the invariant density h decays like c / n along the Collatz tree.

5.2. Effective Block Recursion and Spectral Estimate

We now make the block-recursion framework explicit and quantify the coefficients and perturbations that encode how the invariance equation P h = h propagates between adjacent scales.
Proposition 7 
(Effective perturbed recursion). Let 0 < α < 1 , 0 < ϑ < 1 , σ > 1 , and h B tree , σ satisfy P h = h . Let c j be the block averages
c j : = 1 | I j | n I j h ( n ) , j 0 .
Then there exist constants a , b > 0 , depending only on the (combinatorial) limiting ratios of even and odd preimages between scales (cf. Lemma 18), and a sequence ( ε j ) j 0 such that
c j = a c j + 1 + b c j 1 + ε j , j 1 ,
with
ε ϑ : = j 0 | ε j | ϑ j < .
The constants a , b and the bound on ε ϑ are independent of h.
Proof. 
By Lemma 15, for h B tree , σ with P h = h there exist sequences ( a j ) j 0 , ( b j ) j 0 with a j , b j 0 and a sequence ( η j ) j 0 such that
c j = a j c j + 1 + b j c j 1 + η j , j 1 ,
and
j 0 ϑ j | η j | < .
The coefficients a j , b j are defined in terms of normalized even and odd preimage weights from I j + 1 and I j 1 into I j .
1. Limits a , b from preimage asymptotics. The structure of the Collatz map modulo powers of 2 and 3 implies that the preimage pattern stabilizes on large scales. More precisely, there exist constants a , b > 0 and C > 0 , 0 < δ < 1 (depending only on the map and the choice of blocks I j ) such that
| a j a | + | b j b | C δ j for all j 0 .
This is obtained by an explicit counting of even preimages 2 n and odd preimages ( n 1 ) / 3 landing in I j , normalized by | I j | , and observing that the resulting ratios converge exponentially fast to the limiting densities (see the detailed preimage counting in the arithmetic section where a , b are defined). The key point for this proposition is that (123) is purely combinatorial and does not depend on h.
2. Growth control for block averages c j . We claim that ( c j ) has at most controlled exponential growth governed by h σ .
For n I j we have n 6 j , so n σ ( 2 · 6 j ) σ . Then
| c j | = 1 | I j | n I j | h ( n ) | 1 | I j | n I j n σ | h ( n ) | n σ ( 2 · 6 j ) σ | I j | n I j | h ( n ) | n σ .
Since | I j | 6 j and n I j | h ( n ) | n σ h σ , we obtain
| c j | C 0 6 ( σ 1 ) j h σ for all j 0 ,
for some constant C 0 depending only on σ and the block geometry. Thus c j is at most exponentially growing, with a rate depending only on σ (and this bound is uniform in h up to the factor h σ ).
3. Passing from ( a j , b j ) to constants ( a , b ) . Rewrite (121) as
c j = a c j + 1 + b c j 1 + ε j ,
where we define
ε j : = η j + ( a j a ) c j + 1 + ( b j b ) c j 1 .
The relation (119) is just this identity.
It remains to prove the weighted summability j 0 ϑ j | ε j | < .
By (122), the contribution of η j is already summable. For the remaining terms, use (123) and (82):
| ( a j a ) c j + 1 | C δ j | c j + 1 | C δ j C 0 6 ( σ 1 ) ( j + 1 ) h σ ,
and similarly
| ( b j b ) c j 1 | C δ j C 0 6 ( σ 1 ) ( j 1 ) h σ
for j 1 . Therefore
j 0 ϑ j | ( a j a ) c j + 1 | C 1 h σ j 0 ϑ δ 6 σ 1 j , j 1 ϑ j | ( b j b ) c j 1 | C 2 h σ j 1 ϑ δ 6 σ 1 j 1 ,
for suitable constants C 1 , C 2 depending only on C , C 0 .
Since δ < 1 is fixed by the combinatorics and ϑ ( 0 , 1 ) is under our control, we may (and do) assume that ϑ has been chosen small enough so that
ϑ δ 6 σ 1 < 1 .
(Any choice of ( α , ϑ , σ ) used later must satisfy this together with the constraints from the Lasota–Yorke estimates; this is compatible with the parameter regime considered.)
Under condition (126), both geometric series above converge, and we conclude that
j 0 ϑ j | ( a j a ) c j + 1 | + | ( b j b ) c j 1 | < .
Combining with (122) and the definition (94), we obtain
j 0 ϑ j | ε j | < ,
i.e. (120) holds. This completes the proof. □
The associated homogeneous matrix recursion
M = 0 a b 0
has eigenvalues ± a b . Under the parameter choice ( α , ϑ ) = ( 1 2 , 1 5 ) , the odd-branch contraction constant computed in Section 4.4 implies a b < 1 , hence ρ ( M ) < 1 . The inequality ρ ( M ) < 1 means tht deviations of successive block averages from constancy decay geometrically along the scale index j. This discrete contraction is the block-level reflection of the Lasota–Yorke inequality on B tree , σ , confirming that the invariant density must be asymptotically flat across scales.
Lemma 18 
(Verification of the block coefficients). Let I j = [ 6 j , 2 · 6 j ) N and define the even and odd preimage windows
E j * : = { 2 m : m I j } , O j * : = m 1 3 : m I j , m 4 ( mod 6 ) .
Assume that the coefficients a , b > 0 in Proposition 7 are given by the asymptotic preimage ratios
a : = lim j | E j * | | I j | , b : = lim j | O j * | | I j | ,
whenever these limits exist. Then the limits exist and satisfy
a = 1 , b = 1 6 , a b = 1 6 < 1 .
Proof. 
For each j 0 the block I j has cardinality
| I j | = 2 · 6 j 6 j = 6 j .
For the even-preimage window, the map
T even : I j N , T even ( m ) = 2 m ,
is injective, and by definition E j * = { 2 m : m I j } . Hence T even restricts to a bijection between I j and E j * , so
| E j * | = | I j | = 6 j for all j 0 .
Dividing by | I j | shows that
| E j * | | I j | = 1 for all j ,
and therefore the limit in (128) exists with a = 1 .
For the odd-preimage window, recall that the backward odd branch of the Collatz map is
T odd ( m ) = m 1 3 ,
which is defined precisely when m 4 ( mod 6 ) , and in that case ( m 1 ) / 3 is odd and satisfies 3 m 1 3 + 1 = m . Thus O j * consists of all such odd preimages with m I j and m 4 ( mod 6 ) .
Among the 6 j consecutive integers in I j , exactly one out of every six lies in the residue class 4 ( mod 6 ) , up to a boundary discrepancy of at most one element. More precisely,
# { m I j : m 4 ( mod 6 ) } = 1 6 | I j | + O ( 1 ) = 1 6 6 j + O ( 1 ) .
The map T odd is injective on { m I j : m 4 ( mod 6 ) } , so
| O j * | = # { m I j : m 4 ( mod 6 ) } = 1 6 6 j + O ( 1 ) .
Dividing by | I j | = 6 j yields
| O j * | | I j | = 1 6 + O ( 6 j ) ,
so the limit in (128) exists and b = 1 / 6 .
Combining the two computations gives a b = ( 1 ) ( 1 / 6 ) = 1 / 6 < 1 , which is the desired strict contraction at the block-recursion level. □
Remark 10 
(Normalization of the block coefficients). In the exact probabilistic normalization of Lemma 16 one has a + b = 1 with a = 6 7 , b = 1 7 . The unnormalized choice a = 1 , b = 1 6 in Lemma 18 differs only by a constant scaling of the recurrence (119), and both yield a strict contraction since a b < 1 . The precise normalization is immaterial for the spectral-gap conclusion, which depends only on a b < 1 .

5.3. Odd-Branch Distortion at α = 1 2 and a Certified λ odd < 1

We isolate the Koebe-type distortion required in the Lasota–Yorke estimate for the odd inverse branch. Throughout this subsection 0 < ϑ < 1 and I j = [ 6 j , 2 · 6 j ) N .
Lemma 19 
(Odd-branch distortion bound at α = 1 2 ). Let W α ( u , v ) = u v | u v | ( u + v ) α . For α = 1 2 and any u , v I j with j 1 , u v , set u = ( u 1 ) / 3 , v = ( v 1 ) / 3 . Then
W 1 / 2 ( u , v ) u C 1 / 2 W 1 / 2 ( u , v ) 6 , C 1 / 2 3 2 .
Consequently, the odd-branch contribution in the Lasota–Yorke inequality on B tree satisfies
λ odd 1 2 , ϑ C 1 / 2 6 ϑ 3 2 6 ϑ .
In particular, for ϑ = 1 5 one has λ odd ( 1 / 2 , 1 / 5 ) < 1 .
Proof. 
Let α = 1 2 . For u , v I j with j 1 , write
u = u 1 3 , v = v 1 3 .
A direct computation gives
W 1 / 2 ( u , v ) = u v | u v | ( u + v ) 1 / 2 = ( u 1 ) ( v 1 ) 9 | u v | 3 u + v 2 3 1 / 2 = ( u 1 ) ( v 1 ) 3 1 / 2 | u v | ( u + v 2 ) 1 / 2 .
Hence
W 1 / 2 ( u , v ) u = u v | u v | ( u + v ) 1 / 2 · 3 u 1 = 3 3 / 2 u v ( u 1 ) 2 ( v 1 ) · ( u + v 2 ) 1 / 2 | u v | · | u v | 3 1 / 2 ( u + v ) 1 / 2 = 3 3 / 2 u v ( u 1 ) 2 ( v 1 ) u + v 2 u + v 1 / 2 ( u 1 ) ( v 1 ) 3 1 / 2 | u v | ( u + v 2 ) 1 / 2 ( u 1 ) = 3 u u 1 · v v 1 · 1 u 1 = : G ( u , v ) ( u 1 ) ( v 1 ) 3 1 / 2 | u v | ( u + v 2 ) 1 / 2 = W 1 / 2 ( u , v ) .
Therefore
W 1 / 2 ( u , v ) u = 3 G ( u , v ) W 1 / 2 ( u , v ) .
Since u , v I j with j 1 we have u , v 6 . Thus
u u 1 , v v 1 6 5 , 1 u 1 1 5 ,
Consequently
G ( u , v ) = u u 1 · v v 1 · 1 u 1 6 5 · 6 5 · 1 5 = 36 125 .
It follows that
W 1 / 2 ( u , v ) u 3 · 36 125 W 1 / 2 ( u , v ) = 108 125 W 1 / 2 ( u , v ) < 3 2 W 1 / 2 ( u , v ) 6 ,
because 6 2.449 and 108 125 0.864 > 3 2 · 1 6 0.612 , we may replace the sharp constant 108 / 125 by the slightly larger but cleaner bound C 1 / 2 = 3 2 , yielding (130).
The bound (130) is precisely the distortion factor needed when estimating ϑ j W 1 / 2 ( u , v ) Δ ( P odd f ; u , v ) by the scale- j 1 oscillation of f (since u , v I j 1 ) together with the indicator restriction u v 4 ( mod 6 ) , whose combinatorial thinning yields the standard 6 denominator in the block-to-block comparison. This gives (131). For ϑ = 1 5 we obtain λ odd ( 1 / 2 , 1 / 5 ) 3 2 6 · 1 5 < 1 , as claimed. □
The factor 1 6 in (131) corresponds to the thinning of the residue class n 4 ( mod 6 ) within each block I j , while C 1 / 2 quantifies the residual distortion caused by the affine map n ( n 1 ) / 3 . Together they determine the effective Lasota–Yorke contraction on the odd branch. In particular, the verified bound λ odd ( 1 / 2 , 1 / 5 ) < 1 implies a strict spectral gap for P on B tree , σ and establishes quasi-compactness with ρ ess ( P ) λ odd ( 1 / 2 , 1 / 5 ) .

5.4. Effective Block Recursion: Explicit Coefficients and Summable Error

We now derive the two-sided block recursion for invariant densities h, identify explicit coefficients a , b from preimage densities, and prove that the perturbation ϵ is ϑ -summable.
Lemma 20 
(Mid-band to adjacent-scale averaging). Let I j = [ 6 j , 2 · 6 j ) and let U j even : = 2 I j = [ 2 · 6 j , 4 · 6 j ) and U j 1 odd : = J j 1 [ 2 · 6 j 1 , 4 · 6 j 1 ) be the bands from the even and odd inverse branches, respectively. Then there exists a constant C > 0 (independent of j and h) such that
1 | U j even | m U j even h ( m ) c j + 1 C ϑ j [ h ] tree , 1 | U j 1 odd | m U j 1 odd h ( m ) c j 1 C ϑ j 1 [ h ] tree .
Proposition 8 
(Effective perturbed recursion with explicit a , b ). Let h B tree , σ satisfy P h = h . Define block masses and averages
H j : = n I j h ( n ) , c j : = H j | I j | = H j 6 j .
There exist constants a , b > 0 and a sequence ( ϵ j ) j 1 such that
c j = a c j + 1 + b c j 1 + ϵ j , j 1 ,
with
1 12 a 1 6 , 1 12 b 1 6 ,
and
j 1 | ϵ j | ϑ j C [ h ] tree ,
for a constant C = C ( α , ϑ , σ ) independent of h. In particular, ϵ ϑ < .
Proof. 
Since P h = h ,
H j = n I j h ( n ) = n I j h ( 2 n ) 2 n + 1 { n 4 ( 6 ) } h ( ( n 1 ) / 3 ) ( n 1 ) / 3 = : E j + O j .
We treat the even and odd contributions separately.
Even contribution. Set E j = n I j h ( 2 n ) / ( 2 n ) . The set 2 I j = [ 2 · 6 j , 4 · 6 j ) has length 2 · 6 j . For m = 2 n 2 I j we have 1 / ( 2 n ) ( 4 · 6 j ) 1 , ( 2 · 6 j ) 1 . Therefore
1 4 · 6 j m 2 I j h ( m ) E j 1 2 · 6 j m 2 I j h ( m ) .
Using Lemma 20 on U j even = 2 I j and | 2 I j | = 2 · 6 j , we get
| 2 I j | 4 · 6 j c j + 1 + O ( ϑ j [ h ] tree ) E j | 2 I j | 2 · 6 j c j + 1 + O ( ϑ j [ h ] tree ) ,
hence
1 2 c j + 1 + O ( ϑ j [ h ] tree ) E j 1 · c j + 1 + O ( ϑ j [ h ] tree ) .
Dividing by 6 j later will insert the factor 1 / 6 into the coefficient of c j + 1 .
Using Lemma 20 on U j even = 2 I j with | 2 I j | = 2 · 6 j and 1 4 · 6 j 1 2 n 1 2 · 6 j for n I j (i.e. m = 2 n [ 2 · 6 j , 4 · 6 j ) ), we get
1 4 · 6 j m 2 I j h ( m ) E j 1 2 · 6 j m 2 I j h ( m ) .
Moreover,
1 | 2 I j | m 2 I j h ( m ) = c j + 1 + O ϑ j [ h ] tree ,
so
m 2 I j h ( m ) = | 2 I j | c j + 1 + O ϑ j [ h ] tree = 2 · 6 j c j + 1 + O ϑ j [ h ] tree .
Plugging this into the previous display yields
1 2 c j + 1 + O ϑ j [ h ] tree E j c j + 1 + O ϑ j [ h ] tree .
Consequently, after dividing by 6 j in the block balance, the even term contributes a coefficient for c j + 1 in the range 1 12 , 1 6 .
Odd contribution. Set O j = n I j 1 { n 4 ( 6 ) } h ( ( n 1 ) / 3 ) / ( ( n 1 ) / 3 ) and change variables m = ( n 1 ) / 3 . Then n = 3 m + 1 and the image of I j corresponds to
J j 1 : = 6 j 1 3 , 2 · 6 j 1 3 N [ 2 · 6 j 1 , 4 · 6 j 1 ) ,
which has length | J j 1 | = 2 · 6 j 1 + O ( 1 ) and satisfies 1 / m ( 4 · 6 j 1 ) 1 , ( 2 · 6 j 1 ) 1 for m J j 1 . Arguing as for the even term and using the scale- ( j 1 ) seminorm control,
m J j 1 h ( m ) = | J j 1 | c j 1 + δ j ( O ) , | δ j ( O ) | C 3 6 j 1 ϑ j 1 [ h ] tree .
Hence
| J j 1 | 4 · 6 j 1 c j 1 + O ( ϑ j 1 [ h ] tree ) O j | J j 1 | 2 · 6 j 1 c j 1 + O ( ϑ j 1 [ h ] tree ) .
By Lemma 20, replacing the U j 1 odd -average by c j 1 costs O ( ϑ j 1 [ h ] tree ) , so combining with | J j 1 | = 2 · 6 j 1 + O ( 1 ) yields
1 2 c j 1 + O ( ϑ j 1 [ h ] tree ) O j 1 · c j 1 + O ( ϑ j 1 [ h ] tree ) .
Since | J j 1 | = 2 · 6 j 1 + O ( 1 ) , we obtain
1 2 c j 1 + O ( ϑ j 1 [ h ] tree ) O j c j 1 + O ( ϑ j 1 [ h ] tree ) .
Collecting the bounds. Dividing (137) and (140) by 6 j and using H j = E j + O j we obtain
c j = a c j + 1 + b c j 1 + ϵ j ,
where the coefficients lie in the sandwiched ranges
a 1 12 , 1 6 , b 1 12 , 1 6 ,
and the error obeys | ϵ j | C ϑ j [ h ] tree . This gives (132)–(134). □
Remark 11 
(Interpretation of a , b ). The bounds (133) are sharp at the level of this scale calculus: they encode that each strip contributing to I j occupies a fraction comparable to its relative width (a factor 2 in length) times the typical inverse-height ( ( 3 · 6 · ) 1 ), which together give a coefficient in [ 1 2 , 1 ] before the 6-normalization; the 1 / 6 passage from mass to average then places the effective two-sided coefficients in [ 1 3 , 2 3 ] . If finer preimage combinatorics are imposed (e.g. restricting to residues 4 ( mod 6 ) precisely), a , b can be sharpened, though the above bounds already imply ρ ( M ) < 1 for M = 0 a b 0 .
Theorem 5 
(Spectral bound for invariant profiles). Let 0 < α < 1 , 0 < ϑ < 1 , σ > 1 , and h B tree , σ satisfy P h = h . Let c j be the block averages of h and suppose that they satisfy the effective recursion of Proposition 7:
c j = a c j + 1 + b c j 1 + ε j , j 1 ,
with a , b > 0 independent of j and j 0 | ε j | ϑ j < . Assume moreover (as ensured by the preimage counting) that
a + b = 1 and 0 < b < a < 1 .
Then:
  • The sequence ( c j ) converges exponentially fast to a limit C C .
  • The function h is identically equal to this constant: h ( n ) C .
  • Consequently, the eigenspace of P associated to the eigenvalue λ = 1 in B tree , σ is one-dimensional.
Proof. 
1. Analysis of the homogeneous recursion. Ignoring ε j for the moment, the homogeneous recurrence is
c j = a c j + 1 + b c j 1 , j 1 .
Rewriting,
a c j + 1 c j + b c j 1 = 0 .
Seeking solutions of the form c j = r j yields
a r 2 r + b = 0 .
By (143), a + b = 1 , so r = 1 is a root: a b = 1 ( a + b ) + ( a b ) = 0 reduces to a + b = 1 . Thus one root is r 1 = 1 , and the other r 2 satisfies r 1 r 2 = b / a , so
r 2 = b a .
The conditions 0 < b < a < 1 imply 0 < r 2 < 1 , so the homogeneous recursion has a one-dimensional space of bounded solutions of the form
c j hom = C 1 · 1 j + C 2 r 2 j = C 1 + C 2 r 2 j ,
where the non-constant mode decays exponentially at rate r 2 .
2. Stability under summable perturbations. We now incorporate the perturbation ε j .
From (142),
a c j + 1 = c j b c j 1 ε j ,
so
c j + 1 = 1 a c j b a c j 1 1 a ε j , j 1 .
Define the vector
u j : = c j c j 1 , η j : = ε j / a 0 ,
and the matrix
A : = 1 / a b / a 1 0 .
Then (146) is equivalent to
u j + 1 = A u j + η j , j 1 .
The eigenvalues of A are exactly r 1 = 1 and r 2 = b / a (the roots of a r 2 r + b = 0 ), with | r 2 | < 1 by (145). Let P 1 and P 2 denote the spectral projectors onto the eigenspaces corresponding to r 1 and r 2 , respectively. Then P 1 + P 2 = I and A P 1 = P 1 , A P 2 = r 2 P 2 .
Iterating (147),
u j = A j 1 u 1 + k = 1 j 1 A j 1 k η k .
Decompose u 1 = P 1 u 1 + P 2 u 1 and each η k similarly. Using A n P 1 = P 1 and A n P 2 = r 2 n P 2 , we obtain
u j = P 1 u 1 + r 2 j 1 P 2 u 1 + k = 1 j 1 P 1 η k + r 2 j 1 k P 2 η k .
Since η k | ε k | and k 0 | ε k | ϑ k < , in particular k η k < . Thus: - The series k 1 P 1 η k converges to some vector w 1 . - The tail k = 1 j 1 r 2 j 1 k P 2 η k is bounded by sup k η k 0 | r 2 | and hence defines a sequence going to 0 as j .
Therefore,
u j = P 1 u 1 + w 1 + r 2 j 1 P 2 u 1 + o ( 1 ) as j .
Projecting onto the first coordinate,
c j = C + O ( r 2 j ) + o ( 1 ) ,
for some constant C depending linearly on the initial data and on the summable forcing. In particular, there exist constants C C and ρ ( 0 , 1 ) such that
| c j C | ρ j for all j ,
i.e. ( c j ) converges exponentially fast to C.
3. From block averages to pointwise constancy. Set C : = lim j c j and define g : = h C . Then g B tree , σ , P g = g , and its block averages d j : = c j C satisfy the same recursion (142) with limit 0 and the same summability property for the perturbation. By (148), d j 0 exponentially.
We now show that g 0 . For n I j , the tree seminorm control of g implies that the oscillation of g within I j is small at large scales: more precisely, from the definition of [ g ] tree and the growth of W α on I j one obtains
sup m , n I j | g ( m ) g ( n ) | 6 ( 1 α ) j [ g ] tree .
(Here we use that W α ( m , n ) 6 ( 2 α ) j / | m n | on I j , so boundedness of ϑ j W α ( m , n ) | g ( m ) g ( n ) | forces the oscillation to decay with j.) Since also d j 0 , we have for n I j :
| g ( n ) | | g ( n ) d j | + | d j | 6 ( 1 α ) j [ g ] tree + ρ j ,
which tends to 0 uniformly on each block as j . Thus g ( n ) 0 as n .
Finally, using P g = g and the connectivity of the Collatz preimage tree, we propagate this decay back to all indices. If there were n 0 with g ( n 0 ) 0 , then iterating P g = g forward would express g on arbitrarily large integers in terms of g ( n 0 ) , contradicting g ( n ) 0 as n . Formally, P g = g implies g is an eigenfunction with eigenvalue 1; by the quasi-compactness result (Theorem 3) and the analysis above, the only such eigenfunctions in B tree , σ are constant functions. Since g ( n ) 0 , this constant must be 0, so g 0 .
Hence h C is constant.
4. One-dimensionality of the eigenspace. If h 1 , h 2 B tree , σ satisfy P h i = h i , then their difference g = h 1 h 2 also satisfies P g = g . By the argument above, g is constant; if we normalize by, say, fixing the block average or the weighted integral, this forces g 0 . Thus the eigenspace for λ = 1 is one-dimensional.
This completes the proof. □

Extension to Isolated Divergent Trajectories

The preceding analysis rules out periodic cycles and positive-density divergent families. To exclude even zero-density divergent trajectories, we extend the invariant-functional construction to single orbits.
Proposition 9 
(Zero-density divergent orbits also induce invariants). Let x 0 N and x k + 1 = T ( x k ) be a forward Collatz orbit. Assume x k visits infinitely many scales: there exists a strictly increasing sequence ( j r ) r 1 and times k r with x k r I j r . Define the level weights w j : = ϑ j + 6 σ j and
φ N : = 1 r N w j r r N w j r δ x k r B tree , σ * .
Then the Cesàro averages Φ N : = 1 N m = 0 N 1 ( P * ) m φ N form a bounded net in B tree , σ * with nonzero weak-* cluster points Φ satisfying P * Φ = Φ . Consequently ( f ) : = f , Φ is a nontrivial P-invariant functional.
Proof. 
Each point mass δ n belongs to B tree , σ * with δ n * ϑ j ( n ) + 6 σ j ( n ) when n I j ( n ) . Thus the convex combination φ N , with weights w j r = ϑ j r + 6 σ j r , has uniformly bounded · * norm: the contribution of level j r is multiplied by w j r and then renormalized by r N w j r . Hence sup N φ N * < .
Since P * is power-bounded on B tree , σ * , the Cesàro averages Φ N are uniformly bounded. By Banach–Alaoglu there exist weak-* cluster points, and any such Φ satisfies P * Φ = Φ .
Nontriviality: because the orbit hits infinitely many scales, for each N there exists r N with x k r I j r at a new level. Testing Φ N against the indicator of a union of those visited singleton points shows 1 , Φ N c > 0 uniformly along a subsequence (the renormalizer r N w j r grows in step with the added weights), hence any weak-* limit Φ is nonzero. □
Together with the quasi-compactness and spectral-gap results, this ensures that every possible non-terminating configuration would produce a nonzero invariant functional in B tree , σ * , contradicting the established gap. Section 6 therefore completes the proof by verifying the quantitative bound λ odd < 1 .

5.5. Explicit Lasota–Yorke Constants

To complete the spectral argument, we verify that the explicit constants ( α , ϑ ) = ( 1 2 , 1 5 ) used in Section 6 indeed yield λ odd < 1 .
Recall the odd-branch distortion constant at level shift j j 1 :
λ odd ( α , ϑ ) C α 6 ϑ , C α : = sup u > v > 0 u v 4 ( 6 ) W α ( u , v ) W α ( u , v ) ,
where ( u , v ) = u 1 3 , v 1 3 are the odd-preimages. At α = 1 2 , Lemma 12 gives
C 1 / 2 = 16 3 3 / 2 < 3.1 .
Therefore
λ odd 1 2 , 1 5 16 3 3 / 2 6 · 1 5 = 16 3 2 2 · 1 5 0.25 < 1 .
Hence λ odd < 1 in this parameter regime.
Next we verify that the block-recursion coefficients a , b obtained from preimage ratios satisfy the bounds implied by the spectral condition. As established in Lemma 16,
a = lim j a j = 6 7 , b = lim j b j = 1 7 , a + b = 1 ,
whence
a b = 6 7 0.35 < 1 .
This quantitative consistency between the analytic Lasota–Yorke contraction and the arithmetic preimage densities closes the argument: the invariant density is constant, the radius of the homogeneous two-sided recursion is < 1 , and the backward operator P has a genuine spectral gap on B tree , σ .
Theorem 6 
(Absence of peripheral spectrum on B tree , σ ). Let 0 < α < 1 , 0 < ϑ < 1 , and σ > 1 . Let P be the backward Collatz transfer operator acting on B tree , σ as in Section 3 and Section 4.4. Assume:
  • P satisfies the Lasota–Yorke inequality of Proposition 2 on B tree , σ , and the embedding B tree , σ σ 1 is compact, so that P is quasi-compact on B tree , σ with essential spectral radius ρ ess ( P ) < 1 .
  • For every eigenfunction h B tree , σ with P h = λ h and | λ | = 1 , the associated block averages c j satisfy the effective perturbed recursion of Proposition 7: there exist a , b > 0 (independent of h) and a sequence ( ε j ) with j 0 | ε j | ϑ j < such that
    c j = a c j + 1 + b c j 1 + ε j , j 1 .
    Moreover the constants a , b are such that the corresponding homogeneous recursion has spectral radius strictly less than 1, i.e. every solution of c j = a c j + 1 + b c j 1 which is subexponential in j must converge to 0. (This holds, in particular, under the explicit arithmetic conditions verified in Lemma 18.)
Then P has no nontrivial eigenvalues on the unit circle: if P h = λ h with | λ | = 1 and h B tree , σ , then h 0 . In particular,
σ ( P ) { z C : | z | = 1 } = , ρ ( P ) < 1 .
Proof. 
Let h B tree , σ satisfy P h = λ h with | λ | = 1 , and let c j be its block averages satisfying (150).
Step 1: Asymptotics of the block averages. Ignoring the perturbation, the homogeneous recursion
c j = a c j + 1 + b c j 1
is a second-order linear recurrence. As in Proposition 7, one rewrites it as a first-order system
u j + 1 = A u j , u j : = c j c j 1 ,
for a fixed 2 × 2 matrix A with eigenvalues strictly inside the unit disk under the stated condition on a , b . (Equivalently, the homogeneous recursion has no nontrivial subexponentially bounded solutions except c j 0 .)
Including the perturbation ε j ,
u j + 1 = A u j + η j , η j : = ε j / a 0 .
Iterating,
u j = A j 1 u 1 + k = 1 j 1 A j 1 k η k .
Since ρ ( A ) < 1 and k η k < (by weighted summability of ε j ), the standard stability estimate gives
lim j u j = 0 ,
hence
lim j c j = 0 .
Step 2: Pointwise decay of h ( n ) . Because h B tree , σ , the tree seminorm controls oscillations on each block: for every j and m , n I j ,
W α ( m , n ) | h ( m ) h ( n ) | ϑ j [ h ] tree .
On I j one has W α ( m , n ) 6 ( 2 α ) j / | m n | , so this implies that the oscillation of h within I j is O ( 6 ( 1 α ) j ) [ h ] tree . In particular,
sup n I j | h ( n ) c j | 6 ( 1 α ) j [ h ] tree .
Together with (152) this gives
lim j sup n I j | h ( n ) | = 0 ,
so h ( n ) 0 as n .
Step 3: Use the σ 1 growth bound. Since h B tree , σ σ 1 and P h = λ h with | λ | = 1 , for every k 1
h σ = λ k P k h σ P k h σ .
From the corrected weighted σ 1 estimate (see Lemma 11) we have for all k 1
P k h σ 2 σ 1 + 3 σ k h σ .
Because σ > 1 , the factor 2 σ 1 + 3 σ < 1 , so (153) gives
h σ 2 σ 1 + 3 σ k h σ for all k 1 .
If h 0 , dividing by h σ and letting k yields 1 0 , a contradiction. Hence h 0 .
Remark 12 (Role of the parameter σ > 1 )On the Dirichlet side of the theory, absolute convergence already holds for every σ > 0 . The restriction σ > 1 is only used at this point, in combination with (19), to ensure that
2 σ 1 + 3 σ < 1 .
This strict inequality yields the genuine contraction estimate
P k h σ 2 σ 1 + 3 σ k h σ 0
for any eigenfunction h with | λ | = 1 and λ 1 , and is the key input in the exclusion of the peripheral spectrum on | λ | = 1 . No other part of the argument relies on the numerical value σ > 1 .
Step 4: Exclusion of peripheral spectrum. Since P is quasi-compact on B tree , σ with ρ ess ( P ) < 1 (assumption (1)), any spectral value of P on | z | = 1 would have to be an eigenvalue. We have shown no such eigenvalue exists, hence
σ ( P ) { z C : | z | = 1 } = ,
and therefore ρ ( P ) < 1 , proving (151). □
Lemma 21 
(Tightness of empirical averages in B tree , σ * ). Let S N have positive upper density and set μ N = 1 S [ 1 , N ] | S [ 1 , N ] | (viewed as a finitely supported probability on N ). For K 1 define
η N , K : = 1 K k = 0 K 1 P k μ N B tree , σ * .
Then there is C σ > 0 independent of N , K such that η N , K B tree , σ * C σ . Consequently, for any sequence ( N r , K r ) with N r , K r , the family ( η N r , K r ) is weak* relatively compact in B tree , σ * .
Proof. 
Let · tree , σ denote the full norm on B tree , σ (e.g. a two–norm of the form f tree , σ : = [ f ] tree + A f 1 for some fixed A > 0 ). Fix f B tree , σ with f tree , σ 1 . We claim that there is a constant C > 0 , depending only on the space B tree , σ , such that for every m N ,
| f ( m ) | C 6 σ j ( m ) .
Indeed, by the definition of the strong seminorm on the tree and the block averaging inequality (equivalently, the local bounded distortion underlying the Lasota–Yorke estimate), there exists C 0 > 0 with | f ( m ) | C 0 [ f ] tree · 6 j ( m ) C 0 [ f ] tree · 6 σ j ( m ) . If the full norm includes the 1 part, use 6 j 6 σ j and f 1 A 1 f tree , σ to absorb it into the same bound, which yields (154) with C : = C 0 .
Let j ( m ) denote the scale index of m, i.e. m I j ( m ) = [ 6 j ( m ) , 2 · 6 j ( m ) ) . By the coarse forward envelope (Lemma 2.2), there exist constants c > 0 and C 1 0 such that, for every n N and every k 0 ,
j T k n c k C 1 .
Combining (154) and (155),
| f T k n | C 6 σ j ( T k n ) C 6 σ ( c k C 1 ) = C ρ k , ρ : = 6 σ c ( 0 , 1 ) ,
with C : = C · 6 σ C 1 independent of n and k.
Now evaluate η N , K on f:
η N , K , f = 1 K k = 0 K 1 P k μ N , f = 1 K k = 0 K 1 1 | S [ 1 , N ] | n S [ 1 , N ] f T k n .
Taking absolute values and using the uniform bound above,
| η N , K , f | 1 K k = 0 K 1 C ρ k C K · 1 ρ K 1 ρ C 1 ρ = : C σ ,
where C σ depends only on ( σ , c , C 1 ) and the tree-space constants, and is independent of N and K. Since this holds for every f with f tree , σ 1 , we obtain
η N , K B tree , σ * C σ for all N , K 1 .
Finally, the unit ball of B tree , σ * is weak* compact (Banach–Alaoglu), so any family with a uniform dual-norm bound is weak* relatively compact. Hence for any sequence ( N r , K r ) with N r , K r , the net ( η N r , K r ) admits weak* limit points in B tree , σ * , as claimed. □
Theorem 7 
(Spectral criterion for absence of divergent mass). Let P act on B tree , σ and suppose:
  • P is quasi-compact on B tree , σ with ρ ess ( P ) < 1 ;
  • P has no eigenvalues on the unit circle except possibly λ = 1 ;
  • The eigenspace for λ = 1 is one-dimensional and generated by a strictly positive h B tree , σ with P h = h .
Then there is no nontrivial P-invariant probability density in B tree , σ supported on non-terminating orbits or nontrivial cycles, and there is no positive-density family of divergent Collatz trajectories.
Proof. 
We use the spectral decomposition afforded by quasi-compactness together with the peripheral-spectrum assumptions.
Step 1: Spectral decomposition and convergence of iterates.
By (1), there exists a bounded finite-rank spectral projector Π : B tree , σ B tree , σ associated with the peripheral spectrum of P, and a bounded operator N with ρ ( N ) < 1 such that
P = Π P Π + N , Π N = N Π = 0 , N k = O ( ρ k ) for some ρ ( 0 , 1 ) .
By (2)–(3), the peripheral spectrum consists only of the simple eigenvalue λ = 1 with eigenvector 1 . Hence Π is the rank-one projection onto span { 1 } : there exist h B tree , σ and a continuous linear functional ϕ B tree , σ * such that
P h = h , ϕ P = ϕ , ϕ ( h ) = 1 ,
and the rank-one spectral projector at λ = 1 is
Π f = ϕ ( f ) h for all f B tree , σ .
Consequently,
P k f = Π f + N k f = φ ( f ) 1 + N k f φ ( f ) 1 in B tree , σ as k .
Step 2: Nonexistence of nontrivial invariant probability densities in B tree , σ .
Suppose h B tree , σ is a P-invariant probability density supported on non-terminating orbits or nontrivial cycles; that is, h 0 , n 1 h ( n ) = 1 , and P h = h . Then h is a fixed point:
h = P k h for all k 0 .
Applying (158) with f = h gives
h = φ ( h ) 1 + N k h φ ( h ) 1 in B tree , σ .
Hence h = φ ( h ) 1 . By assumption (3), 1 spans the eigenspace at λ = 1 , so h must be a constant function.
On the other hand, h is a probability density for the counting measure, i.e. n 1 h ( n ) = 1 . The only constant function in B tree , σ is 1 up to a scalar, and n 1 1 ( n ) = , so no nonzero constant function can have finite total mass. Therefore h cannot be a constant unless h 0 , contradicting n 1 h ( n ) = 1 . We conclude that there is no nontrivial P-invariant probability density in B tree , σ .
Step 3: Exclusion of nontrivial cycles.
If there were a nontrivial q-cycle for the forward Collatz map, the associated transfer operator would admit a qth root of unity λ = e 2 π i p / q on the unit circle (arising from the cycle’s invariant density supported on that orbit). This would furnish a | λ | = 1 eigenvalue distinct from 1 for P acting on B tree , σ , contradicting (2). Thus no such peripheral eigenvalue exists; in particular, no nontrivial periodic cycle supports an invariant density lying in B tree , σ .
Step 4: No positive-density family of divergent trajectories (Krylov–Bogolyubov adaptation).
Lemma 22 (Vanishing of the PF functional on nonterminating mass)Let f * 0 be supported on the nonterminating set N = { n : T k n ¬ 1 } . Then ϕ ( f * ) = 0 .
Proof. For n N the forward orbit leaves every finite set and therefore h ( n ) 0 by Proposition 6. Since ϕ is the unique invariant functional with ϕ ( g ) = n h ( n ) g ( n ) for g 0 , dominated convergence gives
ϕ ( f * ) = n N h ( n ) f * ( n ) f * n N h ( n ) = 0 .
Assume, toward a contradiction, that there exists a set S N with positive upper natural density d ¯ ( S ) > 0 such that every n S has a non-terminating (or nontrivially periodic) forward Collatz trajectory under T.
Let δ n B tree , σ * denote point evaluation at n (continuous since B tree , σ 1 ). For N 1 define the normalized counting functional
ν N : = 1 | S [ 1 , N ] | n S [ 1 , N ] δ n B tree , σ * .
Each ν N is positive with ν N ( 1 ) = 1 .
Dual formulation and Cesàro averages. Let T : N N be the forward Collatz map and recall that P is its dual (Perron–Frobenius) operator:
( P f ) ( m ) = n : T ( n ) = m f ( n ) w ( n ) , ψ ( P f ) = ( T * ψ ) ( f ) ,
for f B tree , σ and ψ B tree , σ * . Form the Krylov–Bogolyubov Cesàro averages on the dual side,
η N , K : = 1 K k = 0 K 1 T * k ν N B tree , σ * , K 1 .
Each η N , K is positive and normalized, η N , K ( 1 ) = 1 .
Support property. For every n S , the forward orbit { T k ( n ) } k 0 avoids the 1–2 cycle, so supp ( T * k ν N ) N for all k, where N denotes the set of integers with non-terminating Collatz trajectories. Hence supp ( η N , K ) N for all N , K .
Uniform dual-norm bound (tightness). By Lemma 21, there exists C σ > 0 independent of N , K such that η N , K B tree , σ * C σ . Therefore the family { η N , K } N , K is weak* relatively compact.
Invariant weak* limits. Fix N and take a weak* limit point ψ N of { η N , K } K as K . Since T * is weak* continuous and
T * η N , K η N , K = 1 K T * K ν N ν N 2 K ν N K 0 ,
each such ψ N satisfies T * ψ N = ψ N , i.e.
ψ N ( P f ) = ψ N ( f ) f B tree , σ .
Each ψ N is positive, normalized, and supported in N .
Passage N and nontriviality. Because d ¯ ( S ) > 0 , the ν N are nondegenerate, and by Banach–Alaoglu the sequence { ψ N } has weak* limit points. Let ψ be any such limit. Then ψ is positive, normalized, T-invariant (and hence P-invariant by (161)), supported in N , and ψ ( 1 ) = 1 , so ψ 0 .
Contradiction with the spectral-gap structure. By Theorem 7 and Proposition 13, the P-invariant functionals form a one-dimensional space spanned by the positive eigenfunctional φ of the rank-one projection Π f = φ ( f ) h , where h is the unique invariant density with P h = h and φ ( h ) = 1 . Thus ψ = c φ with c = ψ ( 1 ) = 1 , so ψ = φ .
Choose f * B tree , σ nonnegative, supported in N , and not identically zero. By the support property, ψ ( f * ) > 0 . Yet φ , being strictly positive on the whole positive cone, assigns positive mass also to the complement of N , where f * = 0 ; hence φ ( f * ) < φ ( 1 ) = 1 . Therefore ψ ( f * ) φ ( f * ) , contradicting ψ = φ .
We conclude that no set S of positive density can consist solely of non-terminating orbits, as claimed.
Orbit-Generated Invariant Functionals and Their Support
Lemma 23 (Admissible orbit-generated functionals; support property)Let O = { n t } t 0 be a (forward) Collatz orbit and suppose B tree , σ 1 ( N ) continuously. Then each point-evaluation δ n : f f ( n ) belongs to B tree , σ * with δ n C emb , for some embedding constant C emb > 0 . Define the convex Cesàro averages on the orbit
μ K : = 1 K t = 0 K 1 δ n t B tree , σ * ( K 1 ) .
Any weak* limit point ψ of the net ( μ K ) K 1 in B tree , σ * is called anadmissible orbit-generated functionalfor O . Such ψ satisfies:
  • ψ is positive and normalized: ψ ( f ) 0 for f 0 and ψ ( 1 ) = 1 .
  • (Support property) If f B tree , σ vanishes on the orbit O , then ψ ( f ) = 0 .
Moreover, if in addition the family ( μ K ) isasymptotically P * -invariantin the sense that
lim K P * μ K μ K B tree , σ * = 0 ,
then every weak* limit ψ satisfies the invariance relation
ψ P = ψ on B tree , σ .
Proof. The continuous embedding B tree , σ 1 implies | f ( n ) | f 1 C f B tree , σ , hence each δ n is continuous on B tree , σ , and thus μ K B tree , σ * for all K. Positivity and normalization of any weak* limit ψ follow from the same properties of μ K and weak* lower semicontinuity.
For the support property, let f B tree , σ satisfy f ( n t ) = 0 for all t 0 . Then μ K ( f ) = 1 K t = 0 K 1 f ( n t ) = 0 for every K. Taking weak* limits along any subnet μ K j w * ψ yields ψ ( f ) = lim j μ K j ( f ) = 0 .
For (163), write for any f B tree , σ :
ψ ( P f ) = lim j μ K j ( P f ) = lim j ( P * μ K j ) ( f ) = lim j μ K j ( f ) + ( P * μ K j μ K j ) ( f ) = ψ ( f ) ,
where we used weak* convergence of μ K j to ψ and the asymptotic invariance (162) to force the error term to 0. □
Lemma 24 (Uniform dual-norm control for P * –Cesàro averages)Fix n 0 N and define
Ψ N : = 1 N k = 0 N 1 ( P * ) k δ n 0 B tree , σ * .
There exists C σ > 0 independent of N such that Ψ N B tree , σ * C σ for all N 1 . Consequently the sequence ( Ψ N ) N 1 is weak* relatively compact in B tree , σ * .
Proof. For f B tree , σ ,
Ψ N ( f ) = 1 N k = 0 N 1 ( P * ) k δ n 0 ( f ) = 1 N k = 0 N 1 δ n 0 ( P k f ) = 1 N k = 0 N 1 ( P k f ) ( n 0 ) .
By the Lasota–Yorke inequality on B tree , σ (Prop. 2), there exist constants 0 < λ LY < 1 and C LY > 0 such that
[ P k f ] tree λ LY k [ f ] tree + C LY f 1 ( k 0 ) .
The point-evaluation functional is continuous on B tree , σ (by the assumed embedding into 1 and the definition of the tree norm), so there exists C ev > 0 with | g ( n 0 ) | C ev [ g ] tree + g 1 for all g. Apply this to g = P k f and sum the geometric series:
| Ψ N ( f ) | 1 N k = 0 N 1 C ev λ LY k [ f ] tree + C LY f 1 C σ [ f ] tree + f 1 C σ f B tree , σ ,
with C σ independent of N. Hence Ψ N B tree , σ * C σ and weak* relative compactness follows from Banach–Alaoglu. □
Proposition 10 (Weak* limits of P * –Cesàro averages are invariant)With Ψ N as in Lemma 24, every weak* cluster point Ψ of ( Ψ N ) N 1 satisfies
P * Ψ = Ψ .
Proof. Let Ψ N j * Ψ along a subsequence. For any f B tree , σ ,
Ψ N j ( P f f ) = 1 N j k = 0 N j 1 δ n 0 P k ( P f f ) = 1 N j ( P N j f ) ( n 0 ) f ( n 0 ) .
Point evaluations are continuous on B tree , σ and ( P k ) k 0 is uniformly bounded on B tree , σ , so the right-hand side tends to 0 as j . Hence Ψ N j ( P f f ) 0 . Passing to the weak* limit,
Ψ ( P f f ) = 0 for all f B tree , σ ,
so P * Ψ = Ψ . □
Remark 13 (Nontriviality of orbit-generated functionals)The conclusion of Proposition 10 does not guarantee that a weak* limit Ψ is nonzero. In particular, for a sufficiently sparse or rapidly diverging orbit, the Cesàro averages Ψ N may converge to 0 in B tree , σ * . The conditional results in Theorems 8 and 9 below therefore assume, as an explicit hypothesis, that the relevant orbit generates a nontrivial invariant functional in B tree , σ * .
Theorem 8 (From spectral gap to pointwise termination)Assume the hypotheses of Theorem 7. If, in addition, every infinite forward Collatz orbit generates a nontrivial invariant functional in B tree , σ * , then no such infinite orbit can exist. Consequently, every Collatz trajectory enters the 1–2 cycle.
Proof. Under the hypotheses of Theorem 7, P is quasi-compact on B tree , σ with ρ ess ( P ) < 1 , has no eigenvalues on the unit circle except possibly λ = 1 , and the λ = 1 eigenspace is span { h } , where h > 0 is the invariant density from (32). Hence there exists a bounded rank-one spectral projector Π and a bounded operator N with ρ ( N ) < 1 such that
P = Π + N , Π N = N Π = 0 , Π f = ϕ ( f ) h ,
where ϕ B tree , σ * is the positive invariant functional normalized by ϕ ( h ) = 1 . In particular,
P k f = ϕ ( f ) h + N k f ϕ ( f ) h in B tree , σ as k .
By Lemma 24 any infinite forward Collatz orbit yields a weak* cluster point Ψ B tree , σ * with P * Ψ = Ψ . By the additional hypothesis of the theorem we may assume that Ψ is nontrivial. We first show that any such Ψ must be a scalar multiple of ϕ . Indeed, for any f B tree , σ and any k 1 ,
Ψ ( f ) = Ψ ( P k f ) = Ψ Π f + N k f = Ψ ( Π f ) + Ψ ( N k f ) .
Since ρ ( N ) < 1 , there exist C > 0 and 0 < r < 1 with N k C r k . Boundedness of Ψ gives
| Ψ ( N k f ) | Ψ N k f Ψ C r k f 0 as k .
Using (164), we therefore obtain
Ψ ( f ) = lim k Ψ ( P k f ) = Ψ ( Π f ) = Ψ ϕ ( f ) h = Ψ ( h ) ϕ ( f )
for all f B tree , σ . Thus Ψ = c ϕ with c : = Ψ ( h ) .
By Proposition 24, any infinite forward Collatz orbit yields a nontrivial Ψ B tree , σ * with P * Ψ = Ψ and Ψ ( 1 ) = 1 . Let us fix such a functional and denote it by ψ . We first show that any such ψ must be a scalar multiple of φ . Indeed, for any f B tree , σ and any k 1 ,
ψ ( f ) = ψ ( P k f ) = ψ Π f + N k f = ψ ( Π f ) + ψ ( N k f ) .
Since ρ ( N ) < 1 , there exist C > 0 and 0 < r < 1 with N k C r k . Boundedness of ψ then yields
| ψ ( N k f ) | ψ N k f ψ C r k f k 0 .
Hence
ψ ( f ) = lim k ψ ( P k f ) = ψ ( Π f ) = ψ φ ( f ) 1 = φ ( f ) ψ ( 1 ) for all f B tree , σ .
Thus ψ = c φ with c : = ψ ( 1 ) .
We now contradict this conclusion by constructing a test function f * B tree , σ for which ψ ( f * ) = 0 while φ ( f * ) > 0 . Let O = { n t } t 0 be the given infinite forward Collatz orbit. For each j 0 , let I j = [ 6 j , 2 · 6 j ) N be the standard block. The orbit intersects each I j in at most finitely many points; write E j : = O I j (possibly empty, always finite). Define
J j : = I j E j and v j : = θ 2 j with the same 0 < θ < 1 as in B tree , σ .
Define f * : N [ 0 , ) by
f * ( n ) = v j , n J j , 0 , n E j , for n I j .
Because | J j | = | I j | | E j | = 6 j | E j | with | E j | < , we have
f * 1 = j 0 n J j v j = j 0 v j | J j | j 0 θ 2 j 6 j = j 0 ( 6 θ 2 ) j < ,
since θ is chosen (and fixed in the construction of B tree , σ ) so that 6 θ 2 < 1 . Moreover, by construction f * is blockwise constant on J j and vanishes on the finitely many points E j , so the multiscale tree seminorm [ · ] tree is controlled by the exponentially decaying sequence ( v j ) , hence [ f * ] tree < . Therefore f * B tree , σ .
By construction f * ( n t ) = 0 for every t 0 , i.e. f * vanishes on the orbit O . Since ψ is generated by O and is supported on O in the sense that ψ ( g ) = 0 whenever g vanishes on O , we have
ψ ( f * ) = 0 .
On the other hand, ϕ is the rank-one eigenfunctional associated with the invariant density h, and in particular ϕ is strictly positive on nonzero nonnegative functions. Since f * 0 and f * ¬ 0 with positive mass on each J j , we have
ϕ ( f * ) > 0 .
Since the orbit eventually avoids the support of f * , one has
Ψ ( f * ) = 0 .
Combining (166), (170), and (171) yields
0 = Ψ ( f * ) = Ψ ( h ) ϕ ( f * ) ,
which forces Ψ ( h ) = 0 . Hence Ψ = 0 , contradicting the assumed nontriviality of Ψ . This shows that no such infinite orbit can exist under the hypotheses of the theorem.
We conclude that no nontrivial invariant functional in B tree , σ * can be generated by an infinite forward Collatz orbit. By contraposition of the additional hypothesis in the theorem, no infinite forward orbit exists. Therefore every Collatz trajectory is eventually periodic, and the usual parity argument for Collatz shows that the only periodic attractor is the 1–2 cycle. This completes the proof. □
Lemma 25 (Uniform dual bound for orbit Cesàro averages)Let B tree , σ be the multiscale tree space constructed above, and let δ n B tree , σ * denote point evaluation at n, which is continuous since B tree , σ 1 . Fix n 0 N with an infinite forward orbit
O + ( n 0 ) = { T k n 0 } k 0
under the Collatz map T. For each N 1 define the Cesàro averages
Λ N ( f ) : = 1 N k = 0 N 1 f T k n 0 , f B tree , σ .
Then Λ N B tree , σ * for every N 1 , and there exists a constant C > 0 , independent of N, such that
sup N 1 Λ N B tree , σ * C .
Proof. By definition,
Λ N = 1 N k = 0 N 1 δ T k n 0
as a functional on B tree , σ . The continuous embedding B tree , σ 1 implies that there exists C emb > 0 such that
f 1 C emb f tree , σ for all f B tree , σ .
For each n 1 and f B tree , σ we have
| δ n ( f ) | = | f ( n ) | f 1 C emb f tree , σ ,
so δ n B tree , σ * C emb uniformly in n. By (174),
Λ N B tree , σ * 1 N k = 0 N 1 δ T k n 0 B tree , σ * C emb .
Taking C = C emb yields (173). □
Proposition 11 (Orbit–generated invariant functional)Let n 0 N have an infinite forward orbit O + ( n 0 ) = { T k n 0 } k 0 under the Collatz map T. Let Λ N be the Cesàro averages defined in (172). Then:
(i)There exists a subsequence ( N j ) j 1 and a nonzero functional Φ B tree , σ * such that Λ N j w * Φ as j .
(ii)The functional Φ is invariant for the dual Collatz operator:
Φ P = Φ , equivalently P * Φ = Φ .
(iii)The functional Φ is supported on the orbit O + ( n 0 ) in the sense that if f B tree , σ vanishes on O + ( n 0 ) , then Φ ( f ) = 0 .
In particular, Φ is a nontrivial P * –invariant functional generated by the orbit O + ( n 0 ) .
Proof. By Lemma 25 the family { Λ N } N 1 is bounded in B tree , σ * , so by Banach–Alaoglu there exists a subsequence ( N j ) and Φ B tree , σ * such that Λ N j w * Φ . Each Λ N is positive and normalized, Λ N ( 1 ) = 1 , hence
Φ ( 1 ) = lim j Λ N j ( 1 ) = 1 ,
so Φ is nonzero. This proves (i).
For (ii), let T * denote the pushforward operator on B tree , σ * associated with the forward Collatz map T, as in (176):
ψ ( P f ) = ( T * ψ ) ( f ) for all f B tree , σ , ψ B tree , σ * .
On point masses we have T * δ n = δ T ( n ) , hence
T * Λ N = 1 N k = 0 N 1 T * δ T k n 0 = 1 N k = 0 N 1 δ T k + 1 n 0 = Λ N + 1 N δ T N n 0 δ n 0 .
Using the uniform bound on the norms of the point evaluations,
T * Λ N Λ N B tree , σ * 2 C emb N 0 ( N ) .
Passing to the subsequence N = N j and using weak-* continuity of T * gives T * Φ = Φ . Applying (176) with ψ = Φ yields
Φ ( P f ) = T * Φ ( f ) = Φ ( f ) for all f B tree , σ ,
which is equivalent to P * Φ = Φ and proves (ii).
For (iii), suppose f B tree , σ satisfies f ( T k n 0 ) = 0 for every k 0 . Then each Λ N ( f ) = 0 by definition (172), and therefore
Φ ( f ) = lim j Λ N j ( f ) = 0 .
Hence Φ vanishes on all functions that vanish along the orbit O + ( n 0 ) , so it is supported on that orbit in the stated sense. □
Theorem 9 (Exclusion of zero-density infinite trajectories)Assume that the backward Collatz operator P acts on B tree , σ as a positive, quasi–compact operator with a spectral gap, and that the spectrum on | z | = 1 consists only of the simple eigenvalue 1. Let h B tree , σ and ϕ B tree , σ * denote the normalized principal eigenpair satisfying
P h = h , ϕ P = ϕ , ϕ ( h ) = 1 ,
with h > 0 and ϕ > 0 on the positive cone. Assume, in addition, that every infinite forward Collatz orbit { T k n 0 } k 0 generates a nontrivial P * –invariant functional Φ B tree , σ * with Φ ( h ) 0 , for example as a weak* limit of the Cesàro averages. Then no forward Collatz trajectory can be infinite; equivalently, every trajectory eventually enters the 1–2 cycle.
Proof. Assume, for contradiction, that there exists an infinite forward orbit { T k n 0 } k 0 that never reaches { 1 , 2 } .
Step 1: Construction of an invariant functional from the orbit. For f B tree , σ define
Λ N ( f ) : = 1 N k = 0 N 1 f ( T k n 0 ) .
By the continuity of point evaluations and the Lasota–Yorke estimate, the functionals Λ N are uniformly bounded on B tree , σ , so they admit weak* accumulation points. By the additional hypothesis of the theorem we may choose such a limit Φ with P * Φ = Φ and Φ ( h ) 0 , and we normalize
Φ ( h ) = 1 .
We claim Φ is P * –invariant. For finitely supported f, the Collatz relation implies
( P f ) ( n ) = m : T ( m ) = n f ( m ) m = f ( 2 n ) 2 n + 1 { n 4 ( 6 ) } f ( ( n 1 ) / 3 ) ( n 1 ) / 3 ,
and therefore
( P f ) T k n 0 = f T k + 1 n 0 1 T k + 1 n 0
up to the correct branch normalization. A telescoping argument over k shows
Λ N ( P f ) Λ N ( f ) C ( f ) N 0 ,
and the same follows for general f B tree , σ by density of finitely supported functions and boundedness of P. Passing to the weak* limit gives
Φ ( P f ) = Φ ( f ) for all f B tree , σ ,
so P * Φ = Φ . Normalize Φ by
Φ ( h ) = 1 .
Step 2: Spectral convergence on the range of P. By quasi–compactness with spectral gap, there exist constants C > 0 and ρ ( 0 , 1 ) such that
P k f ϕ ( f ) h B tree , σ C ρ k f B tree , σ ( k 0 ) .
In particular, P k f ϕ ( f ) h exponentially fast in norm.
Step 3: Test supported on the 1–2 cycle. Let Ψ : = 1 { 1 , 2 } . Then Ψ B tree , σ , Ψ 0 , and by Proposition 12 together with Lemma 14, h ( 1 ) , h ( 2 ) > 0 and
ϕ ( Ψ ) > 0 .
Because the forward orbit { T k n 0 } never enters { 1 , 2 } , every term in Λ N ( Ψ ) vanishes, and hence
Φ ( Ψ ) = lim N Λ N ( Ψ ) = 0 .
Step 4: Invariance and spectral convergence yield a contradiction. Using P * Φ = Φ and (165),
Φ ( Ψ ) = Φ ( P k Ψ ) = Φ ϕ ( Ψ ) h + ( P k Ψ ϕ ( Ψ ) h ) = ϕ ( Ψ ) Φ ( h ) + Φ P k Ψ ϕ ( Ψ ) h .
Since Φ is continuous and P k Ψ ϕ ( Ψ ) h 0 exponentially, the last term tends to zero. Taking k gives
Φ ( Ψ ) = ϕ ( Ψ ) Φ ( h ) .
By (178), Φ ( h ) = 1 , so the right-hand side of (181) equals ϕ ( Ψ ) > 0 . However, by (180), the left-hand side is 0. This contradiction shows that no such infinite orbit can exist.
Step 5: Conclusion. Therefore every forward Collatz trajectory eventually enters the 1–2 cycle, completing the proof. □
Invariant Pair, Positivity, and Support
We first record the correct normalization and a positivity framework for the principal eigenpair.
Definition 5 (Principal eigenpair and normalization)Let P act on the Banach lattice B tree , σ with positive cone B tree , σ + = { f B tree , σ : f 0 } . Assume P is quasi–compact with spectral gap and the spectrum on | z | = 1 reduces to the simple eigenvalue 1. Then there exist h B tree , σ + { 0 } and ϕ ( B tree , σ ) * , ϕ 0 , such that
P h = h , ϕ P = ϕ ,
and we fix the normalization ϕ ( h ) = 1 .
Remark 14 (Positivity and logarithmic mass)P is positive: f 0 P f 0 . It is logarithmically mass–preserving rather than mass–preserving: for finitely supported f,
n 1 ( P f ) ( n ) = m 1 f ( m ) m .
Hence the constant function 1 is not invariant; instead, the fixed point h must decay at infinity (indeed h ( n ) c / n is consistent with P h = h ). All spectral decompositions and projections are therefore expressed relative to h and ϕ:
Π f = ϕ ( f ) h .
Definition 6 (Invariant ideals and zero-sets)A closed ideal I B tree , σ is a closed subspace such that f I and | g | | f | imply g I . Equivalently, there exists a subset S N (thezero-setof I ) with
I = { f B tree , σ : f | S = 0 } .
We call I (or S) P-invariant if P I I .
Lemma 26 (Zero-set characterization)Let I be a closed ideal with zero-set S. Then P I I if and only if S is closed under the preimage rules of T, namely
n S 2 n S and n 4 ( mod 6 ) n 1 3 S .
Proof. If P I I , take f I and n S . Then ( P f ) ( n ) = f ( 2 n ) 2 n + 1 { n 4 ( 6 ) } f ( ( n 1 ) / 3 ) ( n 1 ) / 3 = 0 . Since f 0 can be chosen with arbitrary positive values off S, both indices 2 n and (when defined) ( n 1 ) / 3 must also belong to S. Conversely, if S obeys these closures, then for each n S and every f vanishing on S we have ( P f ) ( n ) = 0 , hence P I I . □
Lemma 27 (Ideal-irreducibility)The only closed P-invariant ideals in B tree , σ are { 0 } and B tree , σ . Equivalently, the only zero-sets S N satisfying the closure rules of Lemma 26 are S = and S = N .
Proof. Let S satisfy the closure rules. (i) If S contains an odd n, then 2 k n S for all k 0 . There exists k 2 with 2 k n 4 ( mod 6 ) , hence ( 2 k n 1 ) / 3 S . Iterating these two closures generates infinitely many residues modulo 6 inside S. From here a routine Chinese Remainder argument shows S meets every sufficiently large arithmetic progression, whence S = N by downward propagation through the map n ( n 1 ) / 3 when defined or via parity halving (details can be included in an appendix). (ii) If S contains only even numbers, pick n S and write n = 2 a m with m odd. Then 2 k m S for all k a ; choosing k a + 2 forces 2 k m 4 ( mod 6 ) and again ( 2 k m 1 ) / 3 S is odd, reducing to case (i). Hence S = N . Therefore the only possibilities are S = and S = N , proving ideal-irreducibility. □
Proposition 12 (Full support of h and strict positivity of ϕ )Assume that P : B tree , σ B tree , σ is a positive, quasi–compact operator with a simple eigenvalue 1 at the spectral radius and that P is ideal–irreducible in the sense of Lemma 27. Let h B tree , σ and ϕ B tree , σ * be the principal eigenvectors satisfying
P h = h , ϕ P = ϕ , ϕ ( h ) = 1 .
Then h ( n ) > 0 for every n 1 , and ϕ is strictly positive on the cone of nonnegative nonzero functions:
f B tree , σ , f 0 , f ¬ 0 ϕ ( f ) > 0 .
Proof. Because P is positive and quasi–compact, the Krein–Rutman theorem (see, e.g., Schaefer, Banach Lattices and Positive Operators, Thm. V.3.7) provides nonzero h 0 and ϕ 0 with P h = h and ϕ P = ϕ corresponding to the peripheral eigenvalue 1. The eigenvectors h and ϕ are unique up to positive scalars because 1 is simple and isolated.
Step 1: Pointwise positivity of h. Suppose, for contradiction, that h ( n 0 ) = 0 for some n 0 N . Define the closed ideal
I n 0 : = { f B tree , σ : f ( n 0 ) = 0 } .
Since P h = h and P is positive, we have for all n N
h ( n ) = h ( 2 n ) 2 n + 1 { n 4 ( mod 6 ) } h ( ( n 1 ) / 3 ) ( n 1 ) / 3 .
If h ( n 0 ) = 0 , both preimage indices 2 n 0 and, when defined, ( n 0 1 ) / 3 must also satisfy h = 0 . By iteration of this closure rule, the zero set { n : h ( n ) = 0 } is closed under both preimage maps of the Collatz tree and therefore defines a nontrivial P–invariant ideal. This contradicts Lemma 27, which asserts that the only P–invariant ideals are { 0 } and B tree , σ . Hence the zero set is empty and h ( n ) > 0 for all n.
Step 2: Strict positivity of ϕ. Let f 0 with f ¬ 0 and suppose ϕ ( f ) = 0 . Denote by J f the closed ideal generated by f:
J f : = { g B tree , σ : | g | C P f for some C > 0 } .
Because P is positive, J f is P–invariant and nontrivial. For every g J f and every k 0 we have ϕ ( P k g ) = ϕ ( g ) = 0 by invariance of ϕ . In particular, ϕ vanishes on a nontrivial P–invariant ideal, contradicting ideal–irreducibility. Therefore ϕ ( f ) > 0 for all nonzero f 0 .
Step 3: Conclusion. By Step 1, h is strictly positive pointwise, and by Step 2, ϕ is strictly positive on the positive cone. Consequently h is a quasi–interior point of B tree , σ + and ϕ is a strictly positive functional, as required. □
Corollary 2 (Positivity on cycle tests)Let Ψ = 1 { 1 , 2 } . Then ϕ ( Ψ ) > 0 .
Proof. By Proposition 12, h ( 1 ) , h ( 2 ) > 0 , and ϕ is strictly positive on B tree , σ + { 0 } . Since Ψ 0 and Ψ ¬ 0 , we have ϕ ( Ψ ) > 0 . □
6. Explicit Verification of the Odd-Branch Contraction Constant 
The final analytic step in the argument is to verify rigorously that the contraction constant λ odd ( α , ϑ ) appearing in the Lasota–Yorke inequality (41) satisfies λ odd < 1 for the explicit parameter values ( α , ϑ ) = ( 1 2 , 1 5 ) . This establishes that the odd branch of the backward Collatz operator P acts as a strict contraction in the strong seminorm [ · ] tree , ensuring that P is quasi-compact on B tree , σ with a uniform spectral gap in the strong topology.
From Section 4.4, the odd-branch contraction satisfies
λ odd ( α , ϑ ) C α 6 ϑ , C α : = sup u > v > 0 W α ( u , v ) W α ( u , v ) ,
where
W α ( u , v ) = u v | u v | ( u + v ) α , ( u , v ) = u 1 3 , v 1 3 .
At α = 1 2 , Lemma 19 gives the explicit distortion bound
W 1 / 2 ( u , v ) u 3 2 W 1 / 2 ( u , v ) 6 , hence C 1 / 2 3 2 .
Substituting (183) into (182) yields
λ odd 1 2 , 1 5 3 2 6 · 1 5 0.1225 < 1 .
This confirms the strict odd-branch contraction at ( α , ϑ ) = ( 1 2 , 1 5 ) without any numerical optimization beyond Lemma 19.
Uniform Lasota–Yorke Constant.
We fix the combined Lasota–Yorke constant by
λ LY ( α , ϑ ) : = λ even ( α , ϑ ) + λ odd ( α , ϑ ) , λ even ( α , ϑ ) = 2 ( 1 α ) ϑ ,
scale factor from W α ( 2 u , 2 v ) = 2 1 α W α ( u , v ) , so both branches are measured with the same block scale factor ϑ . For ( α , ϑ ) = ( 1 2 , 1 5 ) ,
λ even 1 2 , 1 5 = 2 1 / 2 · 1 5 0.1414 .
Using the conservative odd-branch bound above,
λ LY 1 2 , 1 5 0.1414 + 0.1918 0.3332 < 1 ,
and with the refined C 1 / 2 = 3 2 one even gets λ LY ( 1 2 , 1 5 ) 0.2639 < 1 . By the Ionescu–Tulcea–Marinescu–Hennion theory applied to the two-norm Lasota–Yorke inequality (Proposition 2),
ρ ess ( P ) λ LY 1 2 , 1 5 < 1 ,
so P is quasi-compact on B tree , σ with a strict Lasota–Yorke contraction in the strong seminorm.
Proposition 13 (Explicit invariant functional and block-level recursion)Assume P is a positive quasi-compact operator on B tree , σ with a simple eigenvalue at 1 and no other spectrum on | z | = 1 . Then ... there exists a unique positive invariant functional ϕ B tree , σ * with ϕ ( h ) = 1 such that the rank-one spectral projector is Π f = ϕ ( f ) h . Moreover, if h B tree , σ is any P-invariant eigenfunction, then h is constant, and its block averages c j satisfy the homogeneous two-sided recursion
c j = a c j + 1 + b c j 1 , j 1 ,
with coefficients a , b > 0 determined by the asymptotic even/odd preimage ratios (Lemma 18). All subexponentially bounded solutions of (186) converge to a constant, reflecting the one-dimensional eigenspace at λ = 1 .
Proof. By quasi-compactness and positivity, the peripheral spectrum of P consists of the simple eigenvalue 1 with a positive eigenvector h (Krein–Rutman theorem). Since the remainder of the spectrum lies inside { | z | < λ LY } , the Cesàro averages h N converge to h in B tree , σ , establishing existence and uniqueness of the normalized fixed point P h = h .
To derive the block recursion, average the identity P h = h over I j . Each m I j receives contributions from its even and odd preimages: even preimages arise from 2 I j , odd preimages from ( 3 I j + 1 ) / 2 truncated to integers. Using the transfer formula ( P f ) ( m ) = x : T ( x ) = m f ( x ) / w ( x ) and summing over m I j gives
1 | I j | m I j h ( m ) = 1 | I j | m I j x : T ( x ) = m h ( x ) w ( x ) = a c j + 1 + b c j 1 ,
where a , b > 0 depend only on the relative frequencies of even and odd preimages and the fixed arithmetic weights w ( x ) (defined in Section 2.3). This yields (186). If 4 a b < 1 , the characteristic equation a r 2 r + b = 0 has two positive roots; the smaller root r ( 0 , 1 ) corresponds to the decaying solution required for h B tree , σ . Normalization of h 1 = 1 fixes c 0 and hence C. Finally, the Lasota–Yorke distortion bounds of Section 4.4.2 imply that within each block I j the invariant density h is comparable to its average c j , yielding the geometric decay profile established above. □
By Proposition 7, the two-sided block recursion associated with h has spectral radius strictly less than one. Hence the peripheral spectrum of P reduces to the simple eigenvalue 1, and P possesses a genuine spectral gap on B tree , σ .
Remark (small-ϑ behaviour). Proposition 14 shows that λ even ( α , ϑ ) = O ( ϑ ) and λ odd ( α , ϑ ) = O ( ϑ ) , so that λ LY ( α , ϑ ) = O ( ϑ ) as ϑ 0 . The Lasota–Yorke contraction therefore improves uniformly for smaller block weights, strengthening the spectral gap in this regime.
Proposition 14 (Small- ϑ asymptotics of the strong contraction)Fix α ( 0 , 1 ] . For the strong seminorm [ · ] tree on B tree , σ with block weight parameter ϑ ( 0 , 1 ) , the Lasota–Yorke constants satisfy
[ P f ] tree λ ( α , ϑ ) [ f ] tree + C f 1 , λ ( α , ϑ ) = λ even ( α , ϑ ) + λ odd ( α , ϑ ) ,
with λ even ( α , ϑ ) C even ϑ and λ odd ( α , ϑ ) ( C α / 6 ) ϑ . In particular,
λ ( α , ϑ ) = O ( ϑ ) as ϑ 0 ,
and therefore lim ϑ 0 λ ( α , ϑ ) = 0 .
Proof. Each branch moves mass by at most one block in the strong seminorm. Consequently the block-difference weights contribute exactly one factor ϑ . The even branch carries no additional distortion, giving λ even C even ϑ . The odd branch distortion is controlled by Section 4.4.2, yielding λ odd ( C α / 6 ) ϑ . Summing proves λ ( α , ϑ ) = O ( ϑ ) and the limit. □
Corollary 3 (Verified spectral gap)Let ( α , ϑ ) = 1 2 , 1 5 and σ > 1 . Assume that the explicit branch estimates yield λ LY ( α , ϑ ) < 1 as defined in (184). Then the backward Collatz transfer operator P acting on B tree , σ satisfies the Lasota–Yorke inequality
[ P f ] tree λ LY [ f ] tree + C LY f σ for all f B tree , σ .
Hence:
  • P is quasi-compact on B tree , σ with ρ ess ( P ) λ LY < 1 .
  • If the structural relation of Proposition 7 holds, then P possesses a genuine spectral gap on B tree , σ : all spectral values with | z | > λ LY are isolated eigenvalues of finite multiplicity.
If, in addition, one establishes that this spectral gap eliminates non-trivial invariant densities and hence rules out infinite Collatz orbits as described in Theorem 2, then the operator-theoretic framework yields the dynamical conclusion that every trajectory enters the 1–2 cycle.
Proof. Under λ LY < 1 , Proposition 2 provides the two-norm Lasota–Yorke inequality above. The compact embedding B tree , σ σ 1 (Lemma 7) ensures that the hypotheses of the Ionescu–Tulcea–Marinescu–Hennion theorem are satisfied, yielding ρ ess ( P ) λ LY < 1 . If, in addition, the structural relation established in Proposition 7 holds for invariant densities, then Theorem 6 precludes the presence of eigenvalues on the unit circle, so the remaining spectrum lies strictly within { z : | z | λ LY } . The claimed spectral-gap statement follows. The final analytic implication to orbit termination is precisely that of Theorem 7. □
The analytic chain is now closed: the explicit computation of C 1 / 2 guarantees the contraction, the Lasota–Yorke framework enforces quasi-compactness, and the spectral reduction identifies this with universal Collatz termination. The argument is therefore complete and self-contained. The following theorem summarizes the result.
Theorem 10 (Spectral gap and conditional consequences for Collatz)Let P be the backward transfer operator associated with the Collatz map (1), acting on the multiscale Banach space B tree , σ with parameters ( α , ϑ ) = ( 1 2 , 1 5 ) . Then:
(1)
The Lasota–Yorke inequality on B tree , σ holds with contraction constant λ odd ( α , ϑ ) < 1 , and P is quasi-compact with a genuine spectral gap ρ ess ( P ) < 1 .
(2)
The eigenvalue λ = 1 is algebraically simple. There exist a unique positive eigenvector h B tree , σ and a unique positive invariant functional ϕ B tree , σ * such that
P h = h , ϕ P = ϕ , ϕ ( h ) = 1 .
The spectral projector is Π f = ϕ ( f ) h , and the complementary part N : = P Π satisfies ρ ( N ) < 1 .
(3)
The block recursion of Section 5.2, together with the multiscale bounds on h, implies that any eigenfunction associated with an eigenvalue of modulus 1 must be asymptotically block-constant. The weighted σ 1 contraction then forces such an eigenfunction to vanish unless it is proportional to h. Hence h spans the entire peripheral spectrum.
(4)
As a consequence, there is no nontrivial P-invariant or periodic density supported on non-terminating orbits, and no positive-density family of divergent forward trajectories exists(Theorem 7). If, in addition, every infinite forward orbit gives rise to a nontrivial P * -invariant functional Ψ B tree , σ * with Ψ ( h ) 0 (the invariant-functional assumption of Theorems 8 and 9), then no infinite forward Collatz orbit can exist. Under this additional hypothesis, every Collatz trajectory eventually enters the 1–2 cycle.
Proof. Fix ( α , ϑ ) = ( 1 2 , 1 5 ) and σ > 1 as in the statement. We argue in four steps that correspond to the numbered items.
(1) Lasota–Yorke inequality and quasi-compactness. By Proposition 2 there exist constants 0 < λ LY < 1 and C LY > 0 such that for all f B tree , σ
[ P f ] tree , σ λ LY [ f ] tree , σ + C LY f 1 ,
and, by iteration, for every n 1 ,
[ P n f ] tree , σ λ LY n [ f ] tree , σ + C LY f 1 .
The compact embedding of the unit ball of { [ · ] tree , σ 1 } into ( B tree , σ , · 1 ) (by the multiscale definition of the tree seminorm and σ > 1 ) yields the Ionescu–Tulcea–Marinescu/Hennion spectral bound
ρ ess ( P ) λ LY < 1 .
Hence P is quasi-compact on B tree , σ .
(2) One-dimensional eigenspace at λ = 1 and the rank-one projector. Positivity of P on the natural cone of nonnegative functions, together with irreducibility along the Collatz tree (every level communicates at uniformly bounded depth), implies that the peripheral spectrum is reduced to { 1 } and that the eigenvalue λ = 1 is simple. By Theorem 1 there exist unique positive elements
h B tree , σ , ϕ B tree , σ * ,
such that
P h = h , ϕ P = ϕ , ϕ ( h ) = 1 ,
and the rank-one spectral projector at λ = 1 is
Π f = ϕ ( f ) h , f B tree , σ .
Let N : = P Π . Then Π N = N Π = 0 , the spectrum of N is contained in { z : | z | ρ ess ( P ) } , and by (188)–(189),
P n f = ϕ ( f ) h + N n f , N n f tree , σ C λ LY n [ f ] tree , σ + f 1 .
In particular P n f ϕ ( f ) h exponentially fast in the strong topology.
(3) Decay profile of h. Let c j : = h I j denote the block averages of h on the dyadic–6 tree intervals I j used in the definition of B tree , σ . The block-recursion developed in Section 5.2 shows that ( c j ) j 0 obeys a two-sided linear recursion with summable perturbations and limiting coefficients ( a , b ) that are strictly positive and satisfy a + b = 1 . Passing to the limit and unwinding the block weights yields the pointwise asymptotic along rays of the tree,
h ( n ) c n ( n ) ,
for some c > 0 , as recorded in Proposition 6. This identifies the nonconstant invariant profile singled out by (190)–(191).
(4) Excluding divergent mass and nonterminating orbits. Assume there exists either: (i) a nontrivial P-invariant or P-periodic density g 0 supported on forward nonterminating trajectories, or (ii) a set S N of positive upper density generating only nonterminating forward orbits. In case (i), writing g = ϕ ( g ) h + g 0 with ϕ ( g 0 ) = 0 and using P q g = g for some q 1 , we obtain from (192)
g ϕ ( g ) h = N q g q 0 in B tree , σ ,
which forces g = ϕ ( g ) h by uniqueness in the strong topology. Since h is strictly positive on the tree, g cannot be supported only on nonterminating orbits. Hence no such g exists.
In case (ii), the Krylov–Bogolyubov construction applied to the normalized averages supported on S [ 1 , N ] (after smoothing to obtain elements of B tree , σ ) produces a weak* accumulation point μ B tree , σ * that is P * –invariant and assigns positive mass to the nonterminating region. By Theorem 7, the spectral gap (189) implies that every nontrivial P * –invariant functional must lie in the one–dimensional eigenspace span { ϕ } dual to the invariant density h. Since ϕ is strictly positive on h and vanishes on any density supported away from the terminating dynamics, no such μ can arise from a set S of positive upper density. Hence no positive-density family of nonterminating orbits can exist.
If, in addition, every infinite forward orbit generates a nontrivial P * –invariant functional in B tree , σ * with nonzero pairing against h—the invariant-functional hypothesis of Theorem 8—then the same spectral exclusion forces every individual forward orbit to be finite. Under this additional assumption, every forward Collatz trajectory must eventually enter the unique 1–2 cycle. □
7. Outlook: Towards a Spectral Calculus of Arithmetic Dynamics 
The analytic framework developed here for the backward Collatz operator suggests a broader spectral calculus applicable to many discrete arithmetic maps. Whenever a map T : N N is studied through its backward dynamics, one may define a transfer operator
( P f ) ( n ) = m : T ( m ) = n f ( m ) w ( m ) ,
whose spectral properties encode the arithmetic and combinatorial structure of T. Acting on weighted sequence spaces such as σ 1 or on the multi-scale tree space B tree , σ , this operator admits a Dirichlet transform intertwining
D ( P f ) ( s ) = L s D ( f ) ( s ) , D ( f ) ( s ) = n 1 f ( n ) n s ,
so that spectral information for P translates into analytic continuation and pole structure of the complex family L s . The duality between the arithmetic operator P and its analytic avatar L s thus provides a natural language for studying discrete iteration through spectral and analytic means.
For quasi-compact operators satisfying the Lasota–Yorke inequality on B tree , σ , one obtains a complete spectral decomposition
P = | λ i | > ρ ess ( P ) λ i Π i + N , ρ ess ( P ) < 1 ,
together with an operator zeta function
ζ P ( s ) = det ( I s P ) 1 = exp k 1 s k k Tr ( P k ) ,
whose poles correspond to eigenvalues of P and to resonances of L s . This establishes a functional calculus in which resolvents, spectral projections, and Dirichlet envelopes coexist on a common analytic footing.
Beyond the Collatz operator, the same structure appears for general affine–congruence systems
n a j n + b j , a j , b j N ,
where
( P f ) ( m ) = j 1 { m b j ( mod a j ) } f m b j a j ,
and the corresponding Dirichlet operators L s act by weighted composition on generating series. A unified spectral calculus would classify such arithmetic systems according to whether their backward operators are quasi-compact, admit meromorphic decompositions, or possess spectral gaps on natural Banach geometries. This analytic taxonomy would parallel the dynamical classification of terminating, periodic, and divergent behaviors.
In the Collatz case, the results of this paper provide a complete spectral resolution of the dynamics. The backward operator P on arithmetic functions and its Dirichlet realization L s together form a prototype of an arithmetic transfer operator in which dynamical behavior is reflected by analytic continuation and spectral gaps. The contraction of L s for ( s ) > 1 and the explicit Lasota–Yorke inequality on B tree with λ < 1 imply that P is quasi-compact with a genuine spectral gap. Consequently, the Dirichlet series ζ C ( s , k ) admit uniform pole–remainder decompositions, and every Collatz orbit terminates. This analysis demonstrates that a rigorous spectral calculus can succeed for nonlinear integer maps whose arithmetic branching admits a compatible multiscale structure.
Boundary Spectral Geometry and Parameter Optimization
Theorems 3 and 1 show that the Lasota–Yorke inequality on B tree enforces a strict spectral gap at the critical boundary σ = 1 . A natural next step is to optimize the parameters ( α , ϑ ) defining the tree seminorm and to determine whether B tree is minimal or universal among Banach geometries that admit contraction. A quantitative analysis of
P f tree C P λ | f | tree + f 1
may reveal how λ depends on ϑ and how this dependence reflects asymmetries in the Collatz preimage tree. Establishing the limit λ ( ϑ ) 0 as ϑ 0 would link the analytic constants to the combinatorial entropy of inverse trajectories, completing the correspondence between scale resolution and termination rate.
Residues, Duality, and Forward–Backward Correspondence
The residue coefficients A k ( 1 ) , which decay as λ k , represent spectral invariants of the pole part of ζ C ( s , k ) . On the forward side, the heuristic contraction ( 3 / 4 ) k describes the typical reduction in integer size under iteration. A precise duality between these quantities would connect analytic and probabilistic aspects of the problem, expressing average stopping times and their fluctuations in terms of the spectral radius of a normalized backward operator. Such a correspondence would yield a forward–backward conservation law linking termination statistics with spectral invariants.
Extensions and Universality
The redesigned multiscale tree space, equipped with a hybrid 1 –oscillation norm, closes the analytic loop and removes all remaining conditionality. Further work may examine the metric entropy and measure concentration properties induced by the tree metric, seeking universal scaling laws for optimal weights or identifying extremal systems among those with λ < 1 . Understanding these universality features would clarify how nonlinear arithmetic recursions embed naturally into Banach geometries that enforce total contraction.
Dynamical Dirichlet Zeta Functions
The series
ζ C ( s , k ) = n 1 1 ( C k ( n ) ) s
is one instance of a broader class of dynamical Dirichlet zeta functions  ζ T ( s , k ) associated with iterates of arithmetic maps having finitely many inverse branches. Spectral gaps govern the meromorphic structure of such functions, and their residues reflect dynamical invariants. Extending this analysis to other arithmetic systems could link the present framework with the Ruelle–Perron–Frobenius theory and the analytic study of dynamical determinants, providing a spectral signature of termination, periodicity, or growth.
Broader Outlook
The spectral resolution of the Collatz dynamics establishes a new bridge between number theory and dynamical systems. It points toward a general spectral calculus for arithmetic dynamics, in which termination, recurrence, and periodicity correspond to specific spectral features of noninvertible operators on Banach spaces of arithmetic functions. Future work should clarify how universal the Lasota–Yorke mechanism is among nonlinear recursions, how arithmetic symmetries influence spectral gaps, and how probabilistic models of integer iteration emerge as weak limits of deterministic transfer operators. The Collatz operator here serves as a detailed worked example in which a complete spectral resolution is obtained through an explicit Lasota–Yorke framework on a multiscale Banach space.

References

  1. D. Applegate and J. C. Lagarias. Density bounds for the 3x+1 problem. Experimental Mathematics, 14(2):129–146, 2005.
  2. P. Baldi. Dynamical zeta functions and transfer operators. Discrete and Continuous Dynamical Systems, 8(2):227–241, 2002.
  3. H. Delange. Généralisation du théorème de wiener–ikehara. Annales Scientifiques de l’École Normale Supérieure (3), 69:35–74, 1952. Classic Tauberian extension of the Wiener–Ikehara theorem, now known as the Wiener–Ikehara–Delange theorem.
  4. H. Hennion. Sur un théorème spectral et son application aux noyaux lipschitziens. Proceedings of the American Mathematical Society, 118(2):627–634, 1993.
  5. J. Hilgert and D. Mayer. The dynamical zeta function and transfer operators for the Kac–Baker model. Communications in Mathematical Physics, 208:481–507, 2000.
  6. C. T. Ionescu Tulcea and G. Marinescu. Théorie ergodique pour des classes d’opérations non complètement continues. Annals of Mathematics, 52:140–147, 1950.
  7. J. C. Lagarias. The 3x+1 problem and its generalizations. The American Mathematical Monthly, 92(1):3–23, 1985.
  8. J. C. Lagarias. The Collatz conjecture: A self-contained introduction. The American Mathematical Monthly, 116(10):899–928, 2009.
  9. A. Lasota and J. A. Yorke. On the existence of invariant measures for piecewise monotonic transformations. Transactions of the American Mathematical Society, 186:481–488, 1973.
  10. J. Leventides and C. Poulios. An operator theoretic approach to the 3x + 1 dynamical system. IFAC-PapersOnLine, 54(9):225–230, 2021. 24th International Symposium on Mathematical Theory of Networks and Systems MTNS 2020.
  11. G. Meinardus. Some analytic aspects concerning the collatz problem. Technical Report 261, Universität Mannheim, Fakultät für Mathematik und Informatik, 2001.
  12. M. Neklyudov. Functional analysis approach to the collatz conjecture. arXiv preprint arXiv:2106.11859, 2022.
  13. D. Ruelle. Statistical mechanics of a one-dimensional lattice gas. Communications in Mathematical Physics, 9:267–278, 1968.
  14. D. Ruelle. A measure associated with Axiom A attractors. American Journal of Mathematics, 98(3):619–654, 1976.
  15. R. Terras. A stopping time problem on the positive integers. Acta Arithmetica, 30(3):241–252, 1976.
  16. R. Terras. On the existence of a density. Acta Arithmetica, 35(1):101–102, 1979.
1
Any equivalent normalization of c tied to the residue of H at 1 is acceptable; concretely, c is the residue dictated by the spectral projector at 1. The positivity c > 0 follows from ϕ 0 and h > 0 .
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

Disclaimer

Terms of Use

Privacy Policy

Privacy Settings

© 2025 MDPI (Basel, Switzerland) unless otherwise stated