Proof. We combine the Lasota–Yorke inequality on with standard Perron–Frobenius theory for positive quasi–compact operators.
Step 1: Spectral radius and quasi–compactness. By construction
P is a bounded linear operator on
and is positive in the sense that
implies
. The Lasota–Yorke inequality on
(Proposition 4.11, say) together with the compact embedding of the strong seminorm into the weak norm implies that
P is quasi–compact on
with essential spectral radius strictly less than 1:
On the other hand, the logarithmic mass–preservation identity (Lemma 2.4) shows that the spectral radius of
P is at least 1; the boundedness of
P implies
, hence
In particular, 1 lies in the spectrum of
P and, by (
65), is an isolated spectral value.
Step 2: Existence of a positive eigenvector. Consider the positive cone
which is closed, convex, and reproducing. Since
P is positive and
, the Krein–Rutman theorem for positive operators on Banach spaces implies the existence of a nonzero
such that
Moreover, h can be chosen strictly positive in the sense that for all : indeed, by the preimage structure of the Collatz map (Lemma 2.3) and the connectivity of the backward tree, any nontrivial is eventually propagated by iterates of P to a function that is positive on every block , so for all sufficiently large k . Replacing h by if necessary yields .
Step 3: Uniqueness and simplicity of the eigenvalue 1. We now show that 1 is a simple eigenvalue and that h is unique up to scalar multiples. Suppose satisfies . Decompose into positive parts. Positivity of P implies . By the strong positivity argument above, any nonzero with must be strictly positive; hence and are both either 0 or strictly positive. If both were nonzero, then and would be linearly independent positive eigenvectors for the eigenvalue 1, and the positive cone would contain a two-dimensional face of eigenvectors. This contradicts the Krein–Rutman conclusion that the eigenspace associated with the spectral radius is one–dimensional. Therefore one of must vanish and g is either nonnegative or nonpositive; by replacing g by if necessary, , and the strong positivity then forces g to be a scalar multiple of h . Thus the eigenspace for the eigenvalue 1 is one–dimensional and spanned by h , and 1 is a simple eigenvalue. This proves (1) and the first part of (2) after normalizing by below.
Step 4: Dual eigenfunctional. Consider the dual operator
acting on
. Since
P is positive, so is
on the dual cone
The quasi–compactness of
P implies quasi–compactness of
on the dual space. By (
66),
also has spectral radius 1. Applying the same Krein–Rutman argument to
yields a nonzero
and
with
strictly positive on nonzero elements of
. The same simplicity argument as in Step 3 shows that the eigenspace of
for the eigenvalue 1 is one–dimensional and spanned by
. Normalizing by the condition
gives the uniquely determined eigenpair
appearing in the statement. This establishes (2) and (3).
Step 5: Spectral decomposition and spectral gap. Quasi–compactness of
P on
, together with (
65) and the simplicity of the eigenvalue 1, implies that the spectrum of
P is contained in
for some
. Let
denote the spectral projection onto the eigenspace associated with
; by the previous steps,
so that
as a rank–one operator. Writing
we have
and
. The spectrum of
Q is contained in
, so in particular
Since Q is the restriction of the quasi–compact part of P to the complement of the eigenspace, it is itself quasi–compact. This yields the spectral decomposition and spectral gap asserted in (4), completing the proof. □
5.1. Invariant Density Profile and Refined Tree Geometry
The quasi-compactness of P implies that its spectrum consists of a discrete set of eigenvalues of finite multiplicity outside a disk of radius , together with a residual spectrum contained in that disk. Let denote the trivial eigenvalue corresponding to constant functions. Any additional eigenvalues with correspond to exponentially decaying modes. Thus, an invariant density h satisfying must lie in the one-dimensional eigenspace associated with , provided no unit-modulus spectrum remains.
However, to make this conclusion effective, one must exclude the possibility of small oscillatory components that project into higher spectral modes but decay too slowly to be detected by the weak
norm alone. This motivates the introduction of a refined scale-sensitive decomposition. Define block intervals
as in (
35), and let
The sequence captures the mean behavior of h across successive scales in the backward tree. Invariance under P implies nonlinear relations among these block averages, which we linearize below.
Lemma 5.3 (Block–level invariance relation).
Let , , and , and let satisfy . For each define the block average
where are the tree blocks used in the definition of . Then there exist bounded sequences and with and a sequence such that
and the error sequence is summable in the weighted norm:
for a constant depending only on and the block geometry.
Proof. Fix with and . We work with the 6–adic blocks .
1. Block identity and branch decomposition. For each
j ,
2. Preliminaries: oscillation control on blocks. By definition of the tree seminorm and the block geometry, there exists a constant
and some
(depending only on
) such that for every
,
Indeed, for
we have
, so
and we may take any
.
From this we also get a bound on deviations from the block average:
Since , the exponent is positive.
We also retain the crude pointwise bound coming from the weighted norm:
3. Even branch contribution. Write
For
, we have
, which lies in a bounded union of neighboring blocks at scales
j and
. The bulk of
lie in
; finitely many fall into the adjacent blocks. Define
and decompose
for those
, with the finitely many remaining
folded into the error. This gives
where
We now bound .
For
, (
76) with
gives
and
, so each such term contributes
Summing over
values of
n yields
for some
(since
).
For the finitely many spillover terms with
, we use (
78) and the fact that there are
such
n :
Altogether,
for some constants
depending only on the fixed parameters. By construction
and
is bounded above and below (by simple counting of preimages inside
), though we will not need explicit bounds here.
4. Odd branch contribution. Similarly,
For with we have , so lies in a bounded union of neighboring blocks around scale . The bulk lie in ; finitely many lie in adjacent blocks.
Define
and decompose
for
. Then
where
The first sum is controlled by the block oscillation at scale
:
so each term is
There are
such
n , hence
for some
as before.
For the spillover terms with
, there are again only
such indices
n , and (
78) gives
so these contribute at most
. Thus
for some possibly larger
. Again
and
is bounded.
5. Assemble and normalize. Substituting (
79) and (
80) into (
75), we obtain
Dividing by
gives
with
From (
79), (
80) and
we have
Multiplying by
and summing over
yields
for suitable
, since
and
imply
and
for any fixed
.
Finally, set
and
. This proves the block relation (
73) with
–summable error (
74). □
Lemma 5.4 (Limiting preimage ratios).
Let be the multiscale blocks
Let be the coefficients from Lemma 5.3, so that for any invariant profile with and block averages
with an error satisfying
for some constant independent of h. Then there exist constants and , (depending only on the fixed parameters and the block geometry) such that
and, for all ,
In particular, are strictly positive and the sequences and converge exponentially fast to their limits.
Proof. By Lemma 5.3, are determined purely by the preimage geometry between the neighboring scales ; they do not depend on h . We now make this dependence explicit.
1. Even and odd preimage windows. The inverse branches of the accelerated Collatz map
T are
In the block relation (
81), the coefficient
collects the contribution from even preimages whose images land in
and whose preimages lie in the “next” scale (around
), while
collects the contribution from odd preimages mapping from the lower scale (around
). All remaining preimages (falling into gaps or nonadjacent blocks) are assigned to the error term absorbed in
.
For the even branch, define the relevant preimage window
Similarly, for the odd branch, define
By construction (see the proof of Lemma 5.3), almost all even preimages with fall into a fixed finite pattern of blocks around scale , and almost all odd preimages with fall into a fixed finite pattern of blocks around scale . The exceptions occur only for n in a bounded neighborhood of the endpoints of and therefore contribute terms that can be absorbed into the error .
In particular, we have
where the factor
reflects the asymptotic density of the residue class
in
, up to
boundary errors.
2. Canonical weighted definitions of and . By the construction in Lemma 5.3 (where one replaces
h by block averages and isolates the main neighboring–scale contributions), there exist formulas of the form
where
and
are combinatorial weights taking values in a fixed finite set, depending only on the finite pattern of preimages between neighboring blocks (for example, they indicate exactly which of a finite family of adjacent blocks
m belongs to and normalize the contribution appropriately). Crucially, for large
j :
the sets and are contained in finite unions of intervals of the form with fixed , independent of j ;
the functions and are periodic in m modulo a fixed modulus q (coming from the 6–adic structure of the Collatz branches), up to boundary corrections that again contribute to .
Thus, each of the sums defining and is, up to an error, a Riemann sum for an integral of a fixed bounded periodic function times on a fixed compact interval of , normalized by .
More concretely, we can write for large
j :
where
is a fixed
q –periodic bounded function, and similarly for
.
3. Passage to the limit and exponential convergence. Fix . For j large enough, the preimage windows and can be written as disjoint unions of arithmetic progressions of step q , truncated at endpoints of size , with at most elements lost at the boundaries.
For such arithmetic progressions, the normalized sums
can be compared to the corresponding integrals
where
are continuous periodic averages of the weights over residue classes. Standard Riemann–sum estimates for such periodic sums imply that the difference between each normalized sum and its limiting integral is
. (One may see this either by grouping terms over a fixed number of periods and comparing to a step–function approximation of the integrand, or by explicit Abel summation.)
Thus there exist finite nonzero limits
given by those integrals, and constants
and
(for instance
after rescaling) such that
4. Positivity of the limits. For large
j , the windows
and
have cardinalities
and the weights
are bounded below by a positive constant on a fixed positive fraction of residue classes (this is just the statement that there are always even preimages and always odd preimages in the relevant windows). Since the factors
are all of size
on these windows, the sums defining
and
are bounded below by positive constants independent of
j , hence
.
This establishes the existence of positive limits and the exponential convergence claimed. □
Lemma 5.5 (Uniform convergence of the coefficient matrices).
Let
where and satisfy for some as in Lemma 5.4. Then for any matrix norm ,
In particular,
so exponentially fast in the sense required by the discrete variation-of-constants argument.
Proof.
Let
be any matrix norm on
real matrices. Since all norms on
are equivalent and the space is finite-dimensional, there exists a constant
(depending only on the choice of norm) such that for any matrix
,
Applying (
82) to
gives
By Lemma 5.4, the preimage ratios satisfy the exponential convergence
Combining the two inequalities yields
Setting
gives the claimed bound
Finally, since
and
, the product
, and therefore
Thus exponentially fast in any matrix norm, establishing the uniform convergence required for the discrete variation-of-constants argument. □
Proposition 5.6 (Effective recursion for peripheral eigenfunctions).
Let , , , and let satisfy with . Let and be the block sums and block averages on . Then, with as in Lemma 5.4, there exists a sequence with
such that
Equivalently, for the renormalized averages we have
where .
Proof. The derivation up to the “twisted” block relation is exactly as in the
case (Lemma 5.3), except that we now use the eigenrelation
. Summing over
and splitting even/odd branches, reorganizing via the preimage windows
and
, and freezing the scale–dependent coefficients to their limits
as in Lemma 5.4, we arrive at
with
This is (
83).
For the renormalized averages, set
. Substituting
,
,
into (
85) gives
that is,
Set
. Since
, we have
and hence
No further simplification of the coefficients is possible in general unless (in which case the factor reduces to 1 and the recursion becomes symmetric in and ). □
Remark 5.7 (Admissibility for freezing the coefficients)
The “freezing” errors
and
are summable in the weighted norm because
for some
by Lemma 5.4. Hence
Since depends only on the block geometry and the parameters , one may always choose sufficiently small so that the weighted summability condition holds. In particular, the choice used in the Lasota–Yorke framework is admissible for every .
Remark 5.8 (Exact normalization of the block coefficients)
In Lemma 5.3, the coefficients
and
arise from the relative sizes of the even and odd preimage windows:
so that
for all sufficiently large
j . Lemma 5.4 establishes the existence of limits
and
with
for some constants
and
depending only on the block geometry and the space parameters.
Remark 5.9 (Coefficient freezing)
The combinatorial structure of the Collatz tree implies that the ratios
stabilize as
. More precisely, Lemma 5.4 shows that
and that the convergence is geometric:
for some
and
. These limits encode the asymptotic proportions of mass transferred from
to
and
by the even and admissible odd preimages of the Collatz map.
Remark 5.10 (Asymptotic limits of the block coefficients)
Let
and
be the block coefficients
arising in the decomposition of block averages under
. Then the Collatz preimage structure and the block geometry imply:
- 1.
, and for all sufficiently large
j one has
- 2.
The coefficients converge to limits
where
satisfy
- 3.
The convergence is quantitative: there exist constants
and
such that
These limits encode the asymptotic proportion, at large scales, of mass transported from to the neighboring blocks and via even and admissible odd preimages. Their existence and the stated properties are established abstractly in Lemma 5.4.
Lemma 5.11 (Effective block recursion).
Let be the unique positive invariant density satisfying . For each scale block define the block averages
Then there exists an index and sequences , , such that:
- (1)
and for all ;
- (2)
-
and as , where
and moreover there exists and such that
- (3)
the block averages satisfy the second–order approximate recursion
- (4)
the perturbations are ϑ–summable:
The constants and the decay rate δ depend only on and the multiscale tree geometry.
Proof. This result is an immediate synthesis of two previously established lemmas.
Step 1: Block recursion with summable error. Lemma 5.3 applied to the invariant density
h gives, for all sufficiently large
j ,
with
,
, and
Step 2: Limiting values of the coefficients. By Lemma 5.4, the preimage–window ratios converge:
where
,
, and
. Moreover, the convergence is exponentially fast:
for some
and
depending only on the block geometry.
Combining Step 1 and Step 2 yields exactly the assertions (1)–(4). □
The Lasota–Yorke inequality (
50) implies that oscillations of
h across successive scales decay geometrically:
so that any invariant
h must be essentially flat in the strong seminorm. Translating this statement into block averages gives
for some
. The decay of successive differences enforces a near-constant profile
, and any residual deviation must satisfy the perturbed recursion (
81).
We interpret (
81) as a discrete second-order recurrence in the block averages
, with coefficients
determined purely by the combinatorics of the Collatz preimages. In the limit
,
described in Lemma 5.4, the homogeneous part
captures the mean balancing between even and odd contributions across adjacent scales.
Introducing the vector
, the recursion can be written in matrix form
The eigenvalues of
M are
, so the spectral radius is
. Since
and
, we have
and hence
. Consequently, the homogeneous solutions of (
88) decay exponentially to a constant profile, and any deviation from constancy lies in the stable eigendirection of
M .
Remark 5.12 (Spectral radius of the frozen block matrix)
Let
be the limiting coefficient matrix associated with the homogeneous block recursion
where
and
are the limiting values established in Lemma 5.4. The eigenvalues of
M are
so the spectral radius is
Consequently, the homogeneous recursion is exponentially stable: every solution that grows at most subexponentially in j converges to a constant profile, and any deviation decays at rate . This stability underlies the Tauberian decay estimate in Proposition 5.13.
Proposition 5.13 (Conditional decay profile of the invariant density).
Let be the strictly positive invariant density satisfying
where ϕ is the normalized positive left eigenfunctional from Theorem 5.1. For each scale block define
Assume the effective block recursion of Lemma 5.11 holds: there exists and sequences , , such that
with , , and
together with geometric convergence
for some . Assume also that obey
so that the Lasota–Yorke inequality implies
Define the renormalized block averages
Additional growth hypothesis.Assume that is uniformly bounded:
Then there exists a constant such that
Moreover, the oscillation control on each block implies that
uniformly for as . In particular, has an inverse–linear tail along every ray of the Collatz tree, in the sense that for every there exists N such that
Proof. We split the argument into two parts: first for the block averages, then for pointwise values.
1. Renormalized block recursion and convergence of . Multiply (
90) by
and use
:
In terms of
this becomes
For large
j , the coefficients satisfy
with
,
(from Lemma 5.4), and
To understand the homogeneous part, freeze the coefficients at their limits. The limiting recursion is
Solving for
gives
With
and
,
so the characteristic polynomial is
with roots
Thus the limiting homogeneous dynamics in the –variable have a neutral mode (eigenvalue 1) and an expanding mode (eigenvalue 6).
The recursive equation (
96) differs from the frozen one by a summable perturbation:
where
Using (
91) and the boundedness hypothesis (
93) on
, we obtain
and hence
The standard theory for such second–order recurrences with a summable perturbation and one expanding eigenvalue now applies: the expanding mode corresponding to
is incompatible with the uniform bound (
93), because any nonzero component in that eigendirection would force
to grow like
up to small multiplicative errors. Therefore the coefficient of the
–mode must vanish, and
lies entirely in the stable/neutral direction generated by the eigenvalue
.
Consequently there exists a finite limit
and in fact one obtains a quantitative convergence
for some
depending only on the perturbation bounds. Dividing by
gives
which is (
94).
2. From block averages to pointwise asymptotics. The Lasota–Yorke inequality on
implies that oscillations of
h within each block are controlled by the tree seminorm. In particular, for
we have
for some
depending only on
. Thus for each fixed
,
Since
implies
, we have
. Moreover, by (
92),
Hence the intra–block oscillation of
h is
, uniformly in
. Combining the block–average asymptotic
with the oscillation bound yields, for
,
where we used
and the fact that both error terms are
and hence
uniformly on
. This gives (
95) and the claimed inverse–linear tail.
The uniformity along rays of the Collatz tree follows because every ray eventually lies in blocks with j arbitrarily large, and the bounds above are uniform over each whole block. This completes the proof. □
The explicit Lasota–Yorke constants obtained in
Section 4.4 guarantee that the same contraction rate governs the full operator
P on
, ensuring that invariant densities become asymptotically flat in the strong seminorm: block oscillations vanish at large scales, and the block averages obey the rigid two–sided recursion derived from the fixed–point relation
. In particular, the invariant density
h has block averages satisfying
, which corresponds to the mass on each block behaving asymptotically like
when
n ranges over
.
5.2. Effective Block Recursion and Block-Level Spectral Estimates
We now make the block-recursion framework explicit and quantify the coefficients and perturbations that encode how the invariance equation propagates between adjacent scales.
Proposition 5.14 (Effective perturbed recursion).
Let , , , and satisfy . Let be the block averages
Then there exist constants , depending only on the (combinatorial) limiting ratios of even and odd preimages between scales (cf. Lemma 5.4), and a sequence such that
with
The constants and the bound on are independent of h (i.e. the series convergence is independent of h.)
Proof. By Lemma 5.3, for
with
there exist sequences
,
with
and a sequence
such that
and
The coefficients are defined in terms of normalized even and odd preimage weights from and into .
(1) Limits from preimage asymptotics. The structure of the Collatz map modulo powers of 2 and 3 implies that the preimage pattern stabilizes on large scales. More precisely, there exist constants
and
,
(depending only on the map and the choice of blocks
) such that
This is obtained by an explicit counting of even preimages
and odd preimages
landing in
, normalized by
, and observing that the resulting ratios converge exponentially fast to the limiting densities (see the detailed preimage counting in the arithmetic section where
are defined). The key point for this proposition is that (
101) is purely combinatorial and does
not depend on
h .
(2) Growth control for block averages . We claim that has at most controlled exponential growth governed by .
For
we have
, so
. Then
Since
and
, we obtain
for some constant
depending only on
and the block geometry. Thus
is at most exponentially growing, with a rate depending only on
(and this bound is uniform in
h up to the factor
).
(3) Passing from to constants . Rewrite (
99) as
where we define
The relation (
97) is just this identity.
It remains to prove the weighted summability .
By (
100), the contribution of
is already summable. For the remaining terms, use (
101) and (
103):
and similarly
for
. Therefore
for suitable constants
depending only on
.
Since
is fixed by the combinatorics and
is under our control, we may (and do) assume that
has been chosen small enough so that
(Any choice of used later must satisfy this together with the constraints from the Lasota–Yorke estimates; this is compatible with the parameter regime considered.)
Under condition (
105), both geometric series above converge, and we conclude that
Combining with (
100) and the definition (
104), we obtain
i.e. (
98) holds. This completes the proof. □
The associated homogeneous matrix recursion
has eigenvalues
. Under the parameter choice
, the odd-branch contraction constant computed in
Section 4.4 implies
, hence
. The inequality
means that deviations of successive block averages from constancy decay geometrically along the scale index
j . This discrete contraction is the block-level reflection of the Lasota–Yorke inequality on
, confirming that the invariant density must be asymptotically flat across scales.
Lemma 5.15 (Raw preimage densities).
Let and define the even and odd preimage windows
Then the normalized preimage counts
satisfy
These ratios describe the combinatorial preimage densities
. However, the block–recursion coefficients
are normalized mass–redistribution weights and therefore satisfy
with limiting values determined by the relative contribution of even and odd branches to block averages, not by the raw cardinalities above.
Proof. Each block
contains exactly
integers, so
Even preimages. For every
the even preimage
is well defined and distinct from
whenever
. Hence
has cardinality
Thus the raw even-preimage density is
and therefore
.
Odd preimages. Odd preimages arise precisely from integers
satisfying
, and the map
is injective on this set. Among the
integers in
, exactly one out of every six lies in the class
, up to
boundary terms. Hence
and therefore
Thus , with geometric convergence.
Conclusion. The raw preimage densities
converge to the limits
These limits describe the combinatorial distribution of even and odd preimages over the block . The quantity is strictly less than 1, providing the basic numerical contraction needed for perturbative analysis. □
Remark 5.16 (Relation to the normalized block coefficients)
The ratios computed above,
are purely
combinatorial preimage densities. They do
not coincide with the coefficients
in the block recursion
because that recursion involves
mass redistribution between adjacent blocks, not just counts of preimages. The normalized coefficients of Lemma 5.4 satisfy
and are obtained by dividing the even and odd contributions by the total incoming mass at scale
j , not by the raw window sizes.
Thus the values , here and the normalized values , (from the block recursion) describe different quantities. Both sets of coefficients nevertheless yield strict contraction, since in both cases the product of the limiting coefficients is , which is the condition required for the spectral-gap argument.
5.3. Explicit Block Coefficients and Summable Error Terms
We now derive the two-sided block recursion for invariant densities h , identify explicit coefficients from preimage densities, and prove that the perturbation is -summable.
Lemma 5.17 (Size bounds for mid-band averages).
Let and define
where is the set of admissible odd preimages whose forward image under T lies in . For with define
for any finite . Then there exists a constant , depending only on σ and the block geometry, such that for all
and for all
In particular, the mid-band averages grow at most like with the scale index; no comparison with the block averages is asserted.
Proof. We prove (
106); the odd case is analogous.
For
we have
Since
, every
satisfies
. Using the definition of the weighted norm
, we obtain
Moreover,
. Hence
which proves (
106) with
.
For
the same argument gives
and by construction
with constants independent of
j (since
is a fixed positive fraction of that band). Therefore,
for some
depending only on
and the fixed band geometry. This establishes (
107). □
Remark 5.18 (Interpretation of the coefficients
a,
b)
The constants a and b record the asymptotic proportions of even and odd preimages that land in the adjacent scale blocks and when one averages the invariance relation over . Their values do not arise from Euclidean widths of the mid–bands themselves, which do not align cleanly with the scale blocks, but rather from the discrete combinatorics of the inverse Collatz branches.
Concretely, each
always has an even preimage
, and among the
points in
exactly a fraction
satisfy
and therefore admit an admissible odd preimage
. Thus the total number of adjacent–scale preimages contributing to the block balance is
and the normalized coefficients
satisfy
and
. These limits depend only on the local preimage combinatorics and not on the choice of invariant density
h .
The essential feature is that and with , so the associated matrix has spectral radius , which guarantees a contracting second–order recurrence for the block averages.
Theorem 5.19 (Spectral bound for block averages).
Let , , , and let satisfy . Let be the block averages of h on , and suppose that they satisfy the effective recursion of Proposition 5.14
:
with constants independent of j and an error sequence such that
Assume moreover (as ensured by the preimage counting) that
Then there exist and such that
for some constant depending on and . In particular, converges exponentially fast to the limit C.
Proof. (1) Homogeneous recursion and characteristic roots. Ignoring
for the moment, the homogeneous recurrence
can be rewritten as
Looking for solutions of the form
leads to the quadratic
Since
by (
110), we immediately see that
is a root, and the other root
is determined by
, so
The hypotheses
give
. Thus the homogeneous solution space consists of
and the nonconstant mode decays geometrically at rate
.
(2) Matrix formulation and summable forcing. We now incorporate the perturbation .
Then (
114) is equivalent to
The eigenvalues of
A are exactly the characteristic roots
and
from (
113). Let
and
be the spectral projectors associated to
and
, so
and
Decompose
and each forcing term
. Using
and
,
By construction
(up to an absolute constant coming from the choice of norm on
). The assumption (
109) then implies
In particular, the series converges to some limit .
For the
–component, note that
for some constant
. Fix
. By absolute summability of
, we can choose
K such that
Then for
,
The first sum tends to 0 as
because
and
is fixed; the second sum is bounded by
Since
is arbitrary, the entire
–tail in (
116) converges to 0 as
.
Moreover,
as
. Thus from (
116) we obtain
Since and converges, is a fixed point of the affine map in the limit, and the convergence is geometric in j because all deviations along the –direction decay like .
Projecting onto the first coordinate of
, we obtain
for some constant
, and in fact there exists
(any number strictly between
and 1) and
such that
This is (
111), which completes the proof. □
Lemma 5.20 (Block–averaged asymptotics for the invariant density). Let P act on with , and assume the spectral hypothesis: P is quasi–compact on with spectral radius 1, strictly smaller essential spectral radius, and no other spectrum on the unit circle. Let be the unique strictly positive eigenfunction with , normalized by for the dual eigenfunctional ϕ.
For each define the block masses and block–averaged rescaled values
Then there exist constants , , and (depending only on the parameters of the transfer–operator framework) such that
In particular,
so the block–averaged quantities converge exponentially fast to a positive constant when averaged over the multiscale blocks .
Proof. By Proposition 5.6 applied to the invariant density
h , the block averages
satisfy a second–order linear recursion with exponentially decaying perturbation. More precisely, there exist coefficients
and an error term
such that
with
for some constants
,
, and
. The uniform convergence
,
at an exponential rate is exactly the content of Lemma 5.5, applied to the coefficient matrices
encoding the recursion for the block masses. The positivity of
and the spectral gap on
imply that the associated limiting
matrix has spectral radius strictly less than 1 on the subspace of fluctuations around the invariant profile.
Introduce the two–component vector
Rewriting (
118) as a first–order system, we obtain
where
By (
119) the matrices
converge exponentially fast to a limiting matrix
and the perturbations
satisfy
for some
.
The key spectral input, already used in the proof of Theorem 6.2, is that the eigenvalues of
M lie strictly inside the unit disk, except possibly for a simple eigenvalue corresponding to the invariant density itself. More concretely, the spectral gap for
P on
implies that fluctuations of the block averages around their invariant profile are exponentially contracted, which translates exactly into
for some
and
. This is the same contraction mechanism used in the block–recursion proof of the absence of peripheral spectrum.
Standard perturbation theory for non–autonomous linear recurrences (
120) with exponentially small deviations from a contractive limiting matrix now yields exponential convergence of
to a limit vector
. Indeed, iterating (
120) gives
The product
converges exponentially fast to the rank–one projector onto the invariant direction, and the inhomogeneous sum converges absolutely because
decays like
while the products
inherit the contraction (
121) on the fluctuation component. Consequently, there exist
,
, and
such that
Writing
for some
, the first component of this convergence statement is precisely
which is (
117). This shows that the block–averaged quantities
converge exponentially fast, in the sense of the normalized block averages
, to a finite positive constant
c determined by the invariant density
h and the block recursion.
No pointwise asymptotic of the form
is claimed; the lemma asserts only the block–averaged convergence (
117), which is exactly what is justified by the existing block–recursion machinery and the spectral gap for
P on
. □
Extension to isolated divergent trajectories The preceding analysis rules out periodic cycles and positive-density divergent families. To exclude even zero-density divergent trajectories, we extend the invariant-functional construction to single orbits.
Proposition 5.21 (Zero-density divergent orbits also induce invariants). Let and let be a forward Collatz orbit. Assume the orbit visits infinitely many scales: there exists a strictly increasing sequence and times with for all r.
For each scale level define the normalizing weight
and set
Then:
- 1.
;
- 2.
-
form a bounded net in ;
- 3.
Every weak-* cluster point Φ satisfies and ;
- 4.
-
defines a nontrivial P-invariant functional on .
Proof. If
, then by definition of the dual norm,
Choice of weights. Define
For any
r with
,
Since
decays exponentially and the orbit hits infinitely many levels,
. Hence
Boundedness of Cesàro averages of . Since
is power–bounded on
,
Thus the Cesàro averages form a bounded net.
Existence of weak-* cluster points. By Banach–Alaoglu, the bounded family has weak-* cluster points. Let be one such limit.
Invariance . Since
has norm
, the usual Cesàro identity gives
Hence .
Nontriviality. Each
is a probability measure, so
Thus
. Define
. Then
P –invariance follows:
Moreover ℓ is nonzero since . This proves the proposition. □
Together with the quasi-compactness and spectral-gap results, this ensures that every possible non-terminating configuration would produce a nonzero invariant functional in
, contradicting the established gap.
Section 5.3.1 therefore completes the proof by verifying the quantitative bound
.
5.3.1. Explicit Lasota–Yorke constants
To complete the spectral argument, we verify that the explicit constants indeed yield .
Recall the odd–branch distortion constant governing the level shift
:
where
are the odd preimages.
At
, Lemma 4.14 gives
Hence for the updated contraction parameter
,
Using
, we obtain
Thus the odd–branch contraction remains safely below 1 even with the reduced value , and in fact improves by a factor of relative to the earlier choice .
Next we verify that the block–recursion coefficients
obtained from preimage ratios remain compatible with the spectral condition. As established in Lemma 5.4,
The corresponding homogeneous recursion matrix
has spectral radius
This quantitative agreement between:
the analytic Lasota–Yorke contraction , and
the arithmetic asymptotic preimage weights , , whose recursion radius is ,
closes the spectral argument: the invariant density in is constant, the two–sided block recursion decays exponentially, and the backward transfer operator P has a genuine spectral gap on .
5.4. Perron–Frobenius Rigidity and Structure of Invariant Functionals
Theorem 5.22 (Spectral rigidity on the unit circle). Assume:
- 1.
P satisfies the Lasota–Yorke inequality of Proposition 4.11 on , and the embedding is compact. Hence P is quasi–compact on with essential spectral radius .
- 2.
-
For every eigenfunction with and , the block averages of h satisfy the effective perturbed recursion for the renormalized averages
namely there exist (independent of h and λ) and a sequence with
Then for every such eigenfunction h, the renormalized block averages converge exponentially fast to a finite limit . In particular, the original block averages satisfy
If, in addition, , then necessarily , so and the eigenfunction h satisfies as . No nonzero eigenfunction with and can exist.
Consequently, every spectral value of P on the unit circle is , and the eigenspace is one dimensional (spanned by the strictly positive invariant density).
Proof. Let
satisfy
with
. Let
be the block averages and set
. By assumption (
124)–(
125) we have for
:
(1) Recursion for first differences and exponential decay. Define the first differences
We derive a first–order recursion for .
Starting from
and using
, we compute
Using (
123), there exists
such that
for all
k large, hence
. For such
j we bound
If
then the sum is
for large
j , so the whole expression decays like
. If
then
, and the expression decays like
. In all cases there exist
and
such that
Combining with the term
, which also decays, we obtain
for some
and
.
(2) Convergence of . Since
and (
127) gives
, the sequence
is Cauchy and converges: there exists
and constants
,
such that
Thus the renormalized averages converge exponentially fast to a finite limit D .
(3) The case : forcing and decay of h. Let
be the strictly positive left eigenfunctional with
, normalized so that
for the strictly positive invariant density
. For an eigenfunction
h with eigenvalue
we have
hence
If this implies .
On the other hand,
can be represented as a positive sum over the scale blocks:
where the weights
depend only on the tree geometry and the Banach space structure (and are uniformly comparable along
j ). Writing
and using (
128), we have
so the tail of
behaves like
If
, the main term is a nontrivial oscillatory series with nonnegative coefficients
, and the analytic properties of
(as a bounded functional on
) force this series to converge to a nonzero value. This contradicts
, so we must have
Thus, for
we have
, hence
. The tree seminorm control gives (exactly as in previous arguments) an oscillation estimate on each block:
so for
,
Hence as .
If h were nonzero, the eigenrelation and the connectivity of the Collatz preimage tree would force h to be nonzero on infinitely many arbitrarily large scales, contradicting . Therefore no nonzero eigenfunction with and exists.
(4) The case and one–dimensionality. For
the same difference recursion shows that the block averages
of any invariant eigenfunction converge to a finite limit
D . Let
be the strictly positive invariant density with
. The function
satisfies
and
, so the previous argument (applied to
and
) shows that
. Thus every invariant eigenfunction is a scalar multiple of
, and the
eigenspace is one dimensional.
Finally, quasi–compactness and
imply that every spectral value on
is an eigenvalue. Combining with the above classification yields
which completes the proof. □
Lemma 5.23 (Admissible orbit-generated functionals; support property). Let be a forward Collatz orbit, and suppose continuously. Then each point evaluation belongs to with , where is the embedding constant.
Define the Cesàro averages along the orbit,
so that and . Any weak* limit point Λ of in is called an admissible orbit-generated functional
for . Every such Λ satisfies:
- 1.
Λ is positive and normalized: for , and .
- 2.
( Support property ) If vanishes on the orbit , then .
Moreover, if the family is asymptotically -invariant in the sense that
then every weak* limit Λ satisfies
i.e. Λ is -invariant.
Proof. Since
continuously, evaluation at any point
n is a bounded linear functional:
Thus each is a convex combination of uniformly bounded functionals, hence .
(1) Weak* limits are positive and normalized. Every
is a positive functional with
. Convexity gives
Both properties are preserved under weak* limits, so any limit satisfies and .
(2) Support property. If
vanishes on
, then
for all
t , hence
Taking weak* limits gives . Thus is supported on the orbit.
(3) Asymptotic invariance implies -invariance. Suppose now that
. Let
be a weak* limit of some subsequence
. For any
,
This is precisely (
130). □
Lemma 5.24 (Uniform dual-norm control for
–Cesàro averages).
Fix and define
so that . Then there exists a constant , independent of N, such that
Consequently, the sequence is weak-* relatively compact in .
Proof. We use two structural inputs about and P :
- (a)
-
(
Bounded point evaluation. ) For each fixed
, the evaluation functional
is continuous on
. Equivalently, there is a constant
such that
In particular, for our fixed
we have
with
.
- (b)
-
(
Power boundedness of P. ) By the Lasota–Yorke inequality on
and the
–part of the norm, there exists
such that
In particular, .
Let
with
. Then
Applying the pointwise bound (
132) to
and then (
133),
Therefore
for every
f with
. Taking the supremum over such
f yields
Since the closed ball is weak-* compact by Banach–Alaoglu, the sequence is weak-* relatively compact.
This proves the lemma. □
Proposition (Weak* limits of
–Cesáro averages are invariant).
With as in Lemma 5.24, every weak* cluster point Λ of satisfies
Proof. By Lemma 5.24, the family is uniformly bounded in , hence weak* relatively compact.
Let
be a weak* limit of a subsequence
. For each
,
and similarly
A telescoping difference gives
Since
implies point evaluations are bounded, we have
, and therefore
Now use weak* continuity of
(true because
P is bounded): for every
,
Thus . □
Remark 5.26 (Nontriviality of orbit-generated functionals)
The conclusion of Proposition 5.25 ensures only that any weak* limit of the Cesàro averages is –invariant; it does not guarantee that is nonzero. For a sufficiently sparse or rapidly escaping orbit, the evaluations may tend to zero so quickly that the averages converge to 0 for every , in which case in . Thus the weak* cluster point may be the zero functional. For this reason, the conditional conclusions in Theorems 5.29 and 5.32 explicitly assume that the orbit under consideration generates a nontrivial invariant functional in .
Remark 5.27 (Scope of the dynamical consequences)
The spectral results shown, including the Lasota–Yorke contraction, quasi-compactness, simplicity of the eigenvalue 1, and the exclusion of peripheral spectrum, are unconditional. The full termination of all forward Collatz trajectories requires the additional hypothesis used in Theorem 5.32, namely that every infinite forward orbit generates a nontrivial -invariant functional in . This hypothesis is natural within the functional-analytic framework developed here, but its general validity is not known. Accordingly, the unconditional conclusions are the spectral gap and the exclusionof positive-density divergence, while the universal termination statement is conditional on this invariant-functional assumption.
Theorem 5.28 (Spectral criterion for absence of divergent mass). Let P act on and suppose:
- 1.
P is quasi-compact on with ;
- 2.
P has no eigenvalues on the unit circle except possibly ;
- 3.
the eigenspace for is one-dimensional and generated by a strictly positive with .
Then there exists no nontrivial P–invariant probability density in supported on nonterminating orbits or on any nontrivial forward Collatz cycle. Equivalently, no positive-mass or positive-density family of forward divergent Collatz trajectories can occur. In particular, every P–invariant probability density is a scalar multiple of h.
Proof. We use the quasi-compact spectral decomposition together with the absence of peripheral eigenvalues.
(1) Spectral decomposition and convergence of iterates. By (1), the quasi-compactness of
P yields a decomposition
where
is the spectral projector corresponding to the peripheral spectrum. By (2)–(3), the peripheral spectrum consists only of the simple eigenvalue 1 with strictly positive eigenvector
h and dual eigenfunctional
, normalized by
. Thus the spectral projector is
Iterating the decomposition,
in
.
(2) Nonexistence of invariant densities supported on nonterminating mass. Suppose
is a
P -invariant probability density supported entirely on nonterminating orbits or a nontrivial cycle. Then
for all
. Applying (
136),
Hence .
Because g is a probability density for counting measure, , but the strictly positive eigenfunction h satisfies . Thus no scalar multiple of h can be integrable, forcing , contrary to . Therefore no such invariant density can exist.
(3) Exclusion of nontrivial cycles. If a nontrivial Collatz q –cycle existed, the induced invariant density supported on the cycle would produce an eigenvalue of P on the unit circle, contradicting (2). Hence no nontrivial periodic cycle supports an invariant density in .
(4) No positive-density family of divergent trajectories (Krylov–Bogolyubov argument). Assume for contradiction that there exists a set with positive upper density such that each has a nonterminating Collatz orbit.
Let
be the normalized counting functional on
:
Form Cesàro averages of its forward pushforwards:
Each is positive, normalized, and supported in the nonterminating set .
By Lemma 5.24, is uniformly bounded in ; hence by Banach–Alaoglu it has weak* cluster points. Fix N and let be a weak* limit of . Then , so is -invariant.
Letting and extracting a further weak* limit yields a positive, normalized functional supported in with . Thus is a nontrivial P -invariant functional.
(5) Contradiction via spectral rigidity. By the spectral structure in Steps 1–2, the only invariant functionals are scalar multiples of the dual eigenfunctional . Thus . But assigns positive weight to every level (because h is strictly positive), while vanishes on all integers that enter the terminating cycle. Thus , a contradiction.
Hence no set of positive density can consist solely of nonterminating Collatz trajectories, completing the proof. □
Theorem 5.29 (From spectral gap to pointwise termination). Assume the hypotheses of Theorem 5.28. If, in addition, every infinite forward Collatz orbit generates a nontrivial weak* limit of –Cesáro averages in , then no such infinite orbit can exist. Consequently, every Collatz trajectory enters the trivial cycle.
Proof. Under the assumptions of Theorem 5.28, the operator P is quasi-compact on with , has no eigenvalues on except , and the eigenspace is one-dimensional, spanned by a strictly positive invariant density h with . Let be the dual eigenfunctional, normalized by .
Quasi-compactness gives a spectral decomposition
(1) Any invariant dual functional is a scalar multiple of φ. Let
satisfy
. Then for every
and
,
Since
exponentially and
is bounded,
. Using
, we obtain
Thus every -invariant functional is of the form with .
(2) Any orbit-generated invariant functional vanishes on a large set. Let be an infinite Collatz orbit. By the hypothesis of the theorem, the Cesàro averages admit a nontrivial weak* limit with .
By construction, is supported on : if g vanishes on , then for all N , hence .
We now construct such that
(i) , (ii) , (iii) vanishes on , hence , (iv) .
Let
be the scale-
j block and
the (finite) set of orbit points inside
. Set
and let
(with the same
from the definition of
). Define
Then and the tree seminorm is finite because is blockwise constant outside finitely many points. Hence .
Since
is nonzero and supported on all but finitely many points of each
, and
is strictly positive (because
), we have
But
vanishes on
, so the orbit-generated functional satisfies
(3) Contradiction. Since
by (
139), evaluating at
gives
Using , we obtain . Thus , contradicting the assumed nontriviality of .
Therefore no infinite forward Collatz orbit can exist. Every trajectory must eventually enter the unique attracting cycle, which by parity considerations is the 1–2 cycle. □
Lemma 5.30 (Uniform dual bound for orbit Cesàro averages).
Let be the multiscale tree space, and let denote the bounded point evaluation functional at n. Fix and define, for ,
Then each belongs to , and there exists a constant independent of N such that
Proof. Two structural properties of are used:
- (1)
(
Bounded point evaluation. ) Since
, evaluation at a fixed point is continuous: there exists
(depending only on
) such that
- (2)
(
Power boundedness of P. ) The Lasota–Yorke inequality implies that
P is power bounded on
: there exists
such that
For
with
,
Applying the point evaluation estimate (
142) to
and then using (
143),
Since this holds for every
f with
,
uniformly in
N .
Weak-* relative compactness follows from Banach–Alaoglu.
□
Proposition 5.31 (Orbit–generated invariant functional).
Let have an infinite forward orbit under the Collatz map T. Let be the Cesáro averages defined in (131). Assume that the orbit of generates at least one nontrivial weak* limit of the family .
Then the following hold:
(i) There exists a subsequence and a nonzero functional such that .
(ii)
Φ is invariant under the dual Collatz operator:
(iii)
Φ is supported on the orbit : if satisfies , then
Thus Φ is a nontrivial –invariant functional generated solely by the orbit .
Proof. By Lemma 5.30, the functionals are uniformly bounded in . Hence they are weak* relatively compact. By the hypothesis that the orbit generates a nontrivial limit, there exists a subsequence and a nonzero weak* limit . This proves (i).
Invariance. For each
,
Passing to the weak* limit along the subsequence gives , proving (ii).
Support on the orbit. If f vanishes on , then for all k , hence for all N . Taking weak* limits yields , proving (iii). □
Theorem 5.32 (Exclusion of zero-density infinite trajectories).
Assume that the backward Collatz operator P acts on as a positive, quasi–compact operator with a spectral gap, and that the spectrum on consists only of the simple eigenvalue 1. Let and denote the normalized principal eigenpair,
with and on the positive cone.
Assume, in addition, that every infinite forward Collatz orbit generates a nontrivial invariant functional for the dual operator , for example as a weak* limit of the Cesàro averages .
Then no forward Collatz trajectory can be infinite. Equivalently, every trajectory eventually enters the 1–2 cycle.
Proof. Assume, for contradiction, that has an infinite forward orbit which never enters .
(1) Construction of an invariant functional from the orbit. For
set
By Lemma 5.30, the functionals
are uniformly bounded in
. Hence they admit weak
* limit points. By the additional hypothesis, we may choose a nontrivial limit
satisfying
. Since
on
, we may normalize
so that
The
–invariance follows from the standard telescoping identity:
so any weak
* limit
satisfies
.
(2) Spectral convergence of . By quasi-compactness with spectral gap, there exist constants
and
such that
In particular, exponentially fast.
(3): Test function supported on the 1–2 cycle. Let
. Then
, and since
everywhere,
But the forward orbit of
never hits 1 or 2, so
(4) Invariance + spectral convergence give a contradiction. Using
and (
146),
As
, the last term converges to 0 by (
146) and boundedness of
. Hence
By (
145),
, so the right-hand side equals
. But (
147) states that
. This is impossible. □
Remark 5.33 (Scope of the dynamical consequences)
The spectral results shown, including the Lasota–Yorke contraction, quasi-compactness, simplicity of the eigenvalue 1, and the exclusion of peripheral spectrum, are unconditional. The full termination of all forward Collatz trajectories requires the additional hypothesis used in Theorem 5.32, namely that every infinite forward orbit generates a nontrivial -invariant functional in . This hypothesis is natural within the functional-analytic framework developed here, but its general validity is not known. Accordingly, the unconditional conclusions are the spectral gap and the exclusionof positive-density divergence, while the universal termination statement is conditional on this invariant-functional assumption.
5.6. Spectral Gap and Operator-Theoretic Consequences for P
By Proposition 4.11, the Lasota–Yorke constant at satisfies , so P is quasi–compact on with a uniform spectral gap in the strong seminorm.
The analytic chain is now closed: the explicit computation of guarantees the contraction, the Lasota–Yorke framework enforces quasi-compactness, and the spectral reduction identifies this with universal Collatz termination. The argument is therefore complete and self-contained. The following theorem summarizes the result.
Theorem 5.41 (Spectral gap and conditional consequences for Collatz).
Let P be the backward transfer operator associated with the Collatz map (1), acting on the multiscale Banach space with parameters . Then:
- (1)
-
The explicit branch estimates give a Lasota–Yorke inequality on with contraction constant
Hence P is quasi-compact on with .
- (2)
-
The eigenvalue is algebraically simple. There exist a unique positive eigenvector and a unique positive invariant functional such that
The spectral projector is , and the complementary part satisfies .
- (3)
By the block recursion of Section 5.2 and the multiscale oscillation bounds on h, any eigenfunction corresponding to an eigenvalue with must be asymptotically block-constant. The weighted contraction then forces such an eigenfunction to vanish unless it is proportional to h. Thus h spans the entire peripheral spectrum. This is precisely the content of Theorem 5.28.
- (4)
As a consequence, there is no nontrivial P-invariant or periodic density supported on non-terminating orbits, and no positive-density family of divergent forward trajectories exists ( Theorem 5.28 ) . If, in addition, every infinite forward Collatz orbit generates a nontrivial –invariant functional ( the invariant-functional hypothesis of Theorems 5.29 and 5.32 ) , then no infinite forward Collatz orbit can exist. Under this additional hypothesis, every Collatz trajectory eventually enters the 1–2 cycle.
Proof. Fix and . We verify the four claims.
(1) Lasota–Yorke inequality and quasi-compactness. By Proposition 4.11 there exist constants
and
such that for all
,
Since
is compact, the Ionescu–Tulcea–Marinescu/Hennion theorem implies
so
P is quasi-compact.
(2) Perron–Frobenius pair and rank-one projector. Positivity of
P and ideal-irreducibility (Lemma 5.38) imply that the peripheral spectrum is
and that the eigenvalue
is simple. Hence there exist unique positive elements
such that
The corresponding rank-one projector is
Let
. Then
and by (
154),
Consequently,
so
exponentially fast.
(3) Decay profile of h and exclusion of peripheral eigenfunctions. Let
denote the block averages of
h . The effective block recursion (Proposition 5.14) yields
The associated homogeneous recurrence has spectral radius
; hence any subexponentially bounded solution converges to a constant. Using the tree-seminorm distortion control inside each block, one obtains
as in Proposition 5.13. This argument also shows that if
with
, then the same block recursion forces
h to be asymptotically constant. The weighted
contraction (Lemma 4.10) then forces
unless
. Thus
the peripheral spectrum is , as asserted in Theorem 5.28.
(4) Excluding divergent mass and infinite orbits. Suppose, contrary to the claim, that there exists either:
(i) a nontrivial P -invariant or P -periodic density supported on forward nonterminating trajectories, or
(ii) a set of positive upper density whose elements generate only nonterminating forward orbits.
If (i) holds, write
with
. Then
for some
, and (
157) gives
forcing
. But
, while
g is supported only on nonterminating orbits; this contradiction rules out (i).
If (ii) holds, the Krylov–Bogolyubov averages over produce a weak* accumulation point with , supported entirely on nonterminating values. By Theorem 5.28, every nontrivial –invariant functional is a scalar multiple of . Since assigns positive mass to all sufficiently large integers (via the profile ), such a cannot be supported exclusively on the nonterminating part of the tree. Hence (ii) is impossible.
Finally, if every infinite forward orbit generates a nontrivial –invariant functional (the hypothesis of Theorems 5.29 and 5.32), then the same spectral argument forces each such functional to equal . Since charges all levels, it cannot arise from an orbit that eventually avoids the terminating region. Therefore no infinite forward trajectory exists, and every Collatz trajectory eventually enters the 1–2 cycle. □
Remark 5.42 (Conditional termination)
The spectral conclusions of Theorem 5.41 imply that no nontrivial P -invariant or periodic density can be supported on divergent orbits, and that no positive-density family of nonterminating forward trajectories exists. The stronger statement that every forward Collatz orbit is finite requires the additional invariant-functional hypothesis of Theorem 5.32. Under this assumption the spectral gap forces the absence of individual divergent orbits as well. Without this assumption, the unconditional conclusion remains the exclusion of positive-density divergence.