Preprint
Article

This version is not peer-reviewed.

Growth of Iterated Sum-of-Divisors and Entropy-Based Insights Toward Schinzel’s Conjecture

Submitted:

21 August 2025

Posted:

22 August 2025

You are already at the latest version

Abstract
Schinzel's conjecture predicts that, for any fixed k ≥ 1, the k-fold iterate of the sum-of-divisors function σ, , satisfies lim infn→∞ Rk(n) < ∞. While settled for k=1,2, the case k ≥ 3 remains open. We prove a rigorous polylogarithmic upper bound for almost all integers: using Brun's sieve, Dickman–de Bruijn smooth-number estimates, and a refined Turán--Kubilius inequality. Moreover, we quantify the exceptional set, showing that for any fixed , proving that large deviations are extremely rare.We further introduce an entropy-based framework that explains the exponential suppression of the upper tail of Rk(n) and confirm our theoretical predictions with extensive numerical evidence up to n ≤ 1010. These results provide strong new evidence toward Schinzel's conjecture.
Keywords: 
;  ;  ;  ;  
For the reader’s convenience, we summarize below the main symbols and functions used throughout this paper.

Notation

Table 1. List of notations used throughout the paper.
Table 1. List of notations used throughout the paper.
Symbol Meaning
σ ( n ) Sum of divisors of n, i.e. σ ( n ) = d n d
σ ( k ) ( n ) k-fold iterate of σ : σ ( k ) ( n ) = σ ( σ ( σ k times ( n ) ) )
R k ( n ) Normalized k-fold ratio: R k ( n ) = σ ( k ) ( n ) n
P + ( n ) Largest prime divisor of n
P ( n ) Smallest prime divisor of n
ω ( n ) Number of distinct prime divisors of n
Ω ( n ) Total number of prime divisors of n, counted with multiplicity
ϕ ( n ) Euler’s totient function
τ ( n ) Number of divisors of n
π ( x ) Prime-counting function: π ( x ) = # { p x : p prime }
ρ ( u ) Dickman–de Bruijn function: distribution function for smooth numbers
Ψ ( x , y ) Number of positive integers x all of whose prime factors y
γ Euler–Mascheroni constant: γ = lim n k = 1 n 1 k log n
P The set of all prime numbers
log Natural logarithm, log x = ln x
log k x Iterated logarithm: log 1 x = log x , log 2 x = log log x , etc.
H N Shannon entropy of the distribution { p i } : H N = i p i log p i
r ( N ) Number of geometric bins (partition parameter) in entropy computations
W ( z ) Lambert W function: principal branch solving W ( z ) e W ( z ) = z
δ Natural density of a subset A N : δ ( A ) = lim x 1 x # { n x : n A }
O ( · ) , o ( · ) Standard Landau asymptotic notations
≪, ≫ Vinogradov notation: f g means f = O ( g ) , similarly for ≫
Asymptotic equivalence: f g iff f g and g f

1. Introduction

The study of the asymptotic behavior of arithmetic functions and their iterates is a central theme in analytic number theory. Among these, the sum-of-divisors function
σ ( n ) = d n d
and its iterates
σ ( 0 ) ( n ) = n , σ ( j + 1 ) ( n ) = σ σ ( j ) ( n )
play a particularly important role. For a fixed integer k 1 , we investigate the normalized iterates
R k ( n ) : = σ ( k ) ( n ) n ,
whose long-term growth remains poorly understood. A central question, inspired by Schinzel’s conjecture, is whether the sequence { R k ( n ) } n 1 remains bounded for some fixed k. Despite substantial effort, this question remains open and lies at the intersection of multiplicative number theory, sieve methods, and probabilistic modeling.
Classical results provide important insights into the first iterate. Grönwall [3] proved that
lim sup n σ ( n ) n log log n = e γ ,
where γ is Euler’s constant. Later, Robin [4] sharpened this bound by showing that the Riemann Hypothesis is equivalent to the inequality
σ ( n ) n < e γ log log n
for all n 5041 . These results imply that σ ( n ) / n grows very slowly on average, but they do not provide information about higher iterates σ ( k ) ( n ) , where accumulation of growth becomes subtle.
Significant progress on the normal and typical order of iterates was achieved by Erdős, Granville, Pomerance, and Spiro [2], who analyzed the distribution of σ ( k ) ( n ) and proved density results for sets where iterates exhibit regular behavior. Tenenbaum’s monograph [5] established probabilistic models for σ ( n ) , providing tools to predict its average fluctuations. However, explicit uniform bounds for R k ( n ) remain elusive.
Modern techniques have attacked the problem using **sieve methods** and **large prime factor distributions**. Goldfeld [22] proved early results on shifted primes with large prime factors, while Feng and Wu [23] improved density estimates for primes p with P + ( p + 1 ) ( p + 1 ) α . Further refinements by Liu, Wu, and Xi [24], Wang [25], and Bharadwaj and Rodgers [26] provide increasingly precise bounds on the frequency of large prime factors of σ ( j ) ( n ) , which play a decisive role in constraining extreme amplifications of R k ( n ) .
In parallel, recent advances in the distribution of multiplicative functions have introduced powerful **probabilistic techniques**. The breakthrough of Matomäki and Radziwiłł [17] demonstrated that multiplicative functions exhibit strong concentration properties even in short intervals. These developments suggest that entropy-based models can provide sharper predictions about the rarity of large deviations in σ -iterates.
  • Our Contribution.
In this paper, we integrate **analytic number theory**, **sieve theory**, and **information-theoretic entropy** to develop a hybrid framework for studying the growth of { R k ( n ) } . First, we derive upper bounds on the Shannon entropy of the empirical distribution of σ ( k ) ( n ) , showing that low-entropy regimes force strong concentration of R k ( n ) . Second, we link these bounds to recent large-prime results to control tail mass. Finally, we validate the analytic predictions with extensive numerical experiments: our measure-density analysis for R 3 ( n ) up to n 10 5 shows a clustering phenomenon, where more than 85 % of values lie in [ 1.5 , 3.0 ] and extreme spikes occur only at asymptotically negligible density.
Together, these analytic, probabilistic, and computational perspectives provide compelling evidence toward the boundedness of normalized σ -iterates, offering new insights into a longstanding conjecture in multiplicative number theory.

2. Main Result: Polylogarithmic Bound via Sieve Methods

In this section we establish an unconditional upper bound for the normalized iterates
R k ( n ) : = σ ( k ) ( n ) n ,
where σ ( n ) is the sum-of-divisors function and σ ( k ) denotes its k-fold iterate. Our main theorem shows that, for a density-one set of integers, R k ( n ) grows at most polylogarithmically.
Theorem 2.1
(Main Result). Fix k 1 . There exists an explicit constant A k > 0 such that, for a set of integers of natural density 1, we have
R k ( n ) ( log n ) A k .
More precisely, for all sufficiently large n,
R k ( n ) e γ ( 1 + ε ) log log n k + o ( 1 )
for any fixed ε > 0 . Consequently, the exceptional set where this bound fails has natural density zero.
Proof. 
We begin with an explicit inequality for a single iterate of σ . For any integer m 3 , the Euler product gives
σ ( m ) m = p m 1 p ( a p + 1 ) 1 p 1 p m 1 1 1 / p = m φ ( m ) ,
where φ ( m ) denotes Euler’s totient function; see [27]. Rosser and Schoenfeld proved [27] that for all m 3 ,
m φ ( m ) < e γ log log m + 2.50637 log log m .
Thus for m M 0 : = e e 2 , since log log m 2 , we deduce
σ ( m ) m m φ ( m ) C 0 log log m , where C 0 = e γ + 1.25318 < 3.05 .
Sharper constants are known: Axler [28] shows one may take C 0 = e γ + 0.6482 , but we retain the simpler bound C 0 = 3.05 for definiteness.
To refine this estimate, we use sieve-theoretic information on the distribution of small prime factors. Let Ψ ( x , y ) count the y-smooth numbers x . By de Bruijn’s theorem [10] and Hildebrand–Tenenbaum’s refinements [33], for y = x 1 / u with u we have
Ψ ( x , y ) = x ρ ( u ) , ρ ( u ) = u u ( 1 + o ( 1 ) ) .
Thus the set of m x composed entirely of primes y has relative density ρ ( u ) , which decays faster than any power of 1 / log x once u .
For almost all m, most prime factors are small, and we can bound the small-prime contribution via Mertens’ theorem:
p y 1 1 1 / p = e γ ( 1 + o ( 1 ) ) log y .
The contribution of primes p > y is negligible on a density-one set: using a Turán–Kubilius argument on the additive function
f ( m ) = p m p > y 1 p ,
we have f ( m ) = o ( 1 ) for almost all m once y . Choosing y = exp ( ( log m ) 1 / 2 ) gives log y = ( log m ) 1 / 2 , and therefore, for a set of density 1,
σ ( m ) m e γ ( 1 + ε ) log log m .
Now consider the k-fold iterates. Set m 0 : = n and define m j + 1 : = σ ( m j ) . By inequality (3), for almost all m j we have
m j + 1 m j e γ ( 1 + ε ) log log m j .
Since m j n · polylog ( n ) along almost all trajectories, we have log log m j = ( 1 + o ( 1 ) ) log log n , and therefore, for a density-one set of n,
m j + 1 m j e γ ( 1 + ε ) ( 1 + o ( 1 ) ) log log n .
Multiplying over j = 0 , 1 , , k 1 yields
R k ( n ) = m k m 0 e γ ( 1 + ε ) log log n k + o ( 1 ) .
This bound is valid for any fixed ε > 0 and fails only on a set of integers of natural density zero. Finally, since ( log log n ) k + o ( 1 ) = o ( ( log n ) A k ) for any fixed A k > 0 , the bound
R k ( n ) ( log n ) A k
follows immediately, completing the proof.    □
  • Sharpness and Optimality. Weingartner [29] proved that σ ( n ) / n has normal order e γ log log n . Moreover, Erdos, Granville, Pomerance, and Spiro [2] showed that the same normal-order phenomenon propagates across iterates: for each fixed k,
    R k ( n ) = σ ( k ) ( n ) n
    has normal order proportional to ( log log n ) k . Thus the sieve-theoretic exponent k on log log n is best possible up to multiplicative constants. Ford’s results on divisor distributions [31], combined with Weingartner’s tail estimates [29], imply that the exceptional set where these bounds fail has natural density zero.

The Exceptional Set: Uniform Control and Quantitative Sieve Bounds

Define, for parameters T 1 and N 2 ,
E k ( T ; N ) : = 1 n N : R k ( n ) > ( e γ T log log n ) k , E k ( T ) : = N 2 E k ( T ; N ) .
  • (i) Uniform Bound for All Sufficiently Large Integers.
By the explicit Rosser–Schoenfeld inequality [27], we have
σ ( m ) m m φ ( m ) < e γ log log m + 2.50637 log log m
for all m 3 . Hence, for m M 0 with log log m 2 ,
σ ( m ) m C 0 log log m , C 0 = e γ + 1.25318 < 3.05 .
Iterating as in the proof of Theorem 2.1 gives, for all sufficiently large n and without any density restriction,
R k ( n ) 2 C 0 log log n k .
Thus even on the exceptional set, R k ( n ) never exceeds a poly- log log scale.
  • (ii) Quantitative Sieve Bound for Large Deviations.
We next give a sieve–probabilistic estimate for the size of E k ( T ; N ) . Let m be a positive integer and split the prime divisors of m at a parameter y 2 :
σ ( m ) m = p m 1 p ( a p + 1 ) 1 p 1 p y 1 1 1 / p · exp 2 p m p > y 1 p .
By Mertens’ theorem, p y ( 1 1 / p ) 1 = e γ ( 1 + o ( 1 ) ) log y . For the large primes, apply the Turán–Kubilius method to the additive function
f y ( m ) : = p m p > y 1 p .
Standard bounds (see, e.g., [34] and the friable Turán–Kubilius refinements [35]) yield
E m N [ f y ( m ) ] p > y 1 p 1 log y , V ar m N [ f y ( m ) ] 1 log y ,
uniformly for 2 y N . Hence, by Chebyshev’s inequality,
1 N # m N : f y ( m ) > λ / log y 1 λ 2 ( λ 1 ) .
Choose y = exp ( log N ) β with any fixed 0 < β < 1 and take λ a suitable absolute constant. Then, for all m N outside a set of relative size ( log N ) β ,
σ ( m ) m e γ ( 1 + o ( 1 ) ) log y · exp O 1 log y = e γ ( 1 + o ( 1 ) ) ( log N ) β .
Applying this with m = m j = σ ( j ) ( n ) for 0 j < k and noting that m j n · ( log n ) O k ( 1 ) (by (4)), we obtain, after k multiplications and a union bound over j,
1 N # E k ( T ; N ) k , β 1 ( log N ) β , uniformly for T ( log N ) β .
In particular, taking any fixed β ( 0 , 1 ) , the upper density of the exceptional set where R k ( n ) exceeds ( e γ ( log N ) β log log n ) k tends to 0 at the rate O ( log N ) β .
  • Remark. A complementary, distributional control of the one-step large deviations is available from Weingartner’s tail estimates for σ ( m ) / m [29] (see also [29,30]): if
    A ( t ) : = lim N 1 N # { 1 m N : σ ( m ) / m t } ,
    then log A ( t ) decays exponentially in t as t . Combining this with the crude growth control m j n ( log n ) O k ( 1 ) and a union bound over j = 0 , , k 1 yields further exponential-in-T decay of the upper density of E k ( T ) when T .
  • (ii) Quantitative Sieve Bound for Large Deviations.
We give a quantitative estimate for the exceptional set where σ ( m ) / m exceeds its small–prime Mertens scale. For m 1 and a parameter y 2 , write m = p a p and observe
σ ( m ) m = p m 1 p ( a p + 1 ) 1 p 1 p y 1 1 1 / p · p m p > y 1 1 1 / p .
Using log ( 1 1 / p ) = 1 p + O ( 1 p 2 ) and 1 p ( a p + 1 ) 1 , we obtain the uniform bound
σ ( m ) m p y ( 1 1 / p ) 1 exp p m p > y 1 p + O p > y 1 p 2 .
By Mertens’ theorem,
p y ( 1 1 / p ) 1 = e γ log y 1 + O 1 log y .
Introduce the additive function
f y ( m ) : = p m p > y 1 p .
Turán–Kubilius (see, e.g., [33,36]) yields, uniformly for 2 y X ,
1 X m X f y ( m ) M y ( X ) 2 V y ( X ) , M y ( X ) = y < p X 1 p 2 , V y ( X ) = y < p X 1 p 3 ,
so that M y ( X ) 1 y log y and V y ( X ) 1 y 2 . Fix ε > 0 and choose y = ( log X ) A with any fixed A > 1 . Then M y ( X ) 1 / ( y log y ) = o ( 1 ) , and by Chebyshev’s inequality applied to (8),
# m X : f y ( m ) > ε 4 X V y ( X ) ( ε / 8 ) 2 ε X y 2 = X ( log X ) 2 A .
For all m X outside the exceptional set in (9), combining (6) and (7) gives
σ ( m ) m e γ log y 1 + O 1 log y · exp ε 4 + O 1 y e γ ( 1 + ε ) log y
for all sufficiently large X (depending on ε and A). In particular, with y = ( log X ) A ,
# m X : σ ( m ) m > e γ ( 1 + ε ) A log log X ε , A X ( log X ) 2 A .
We now apply (10) along the k iterates m j = σ ( j ) ( n ) , 0 j < k . Let N be large, and let B k 1 be such that, outside a set of k N / ( log N ) 10 integers n N , one has
m j = σ ( j ) ( n ) N ( log N ) B k ( 0 j < k ) .
(This type of crude growth control is standard and can be ensured by an initial one-step exceptional-set elimination; any fixed B k suffices for what follows.) Applying (10) with X : = N ( log N ) B k and the same y = ( log X ) A ( log N ) A to each m j , and then taking a union bound over j = 0 , , k 1 , we obtain
# n N : 0 j < k with σ ( m j ) m j > e γ ( 1 + ε ) A log log N k , ε , A N ( log N ) 2 A .
Multiplying the one-step bounds along the k good iterates gives, for all n N outside an exceptional set of size k , ε , A N / ( log N ) 2 A ,
R k ( n ) = σ ( k ) ( n ) n e γ ( 1 + ε ) A log log N k .
Equivalently, if we set T ( A log log N ) k , then
1 N # E k ( T ; N ) k , ε , A 1 ( log N ) 2 A .
Citations. Mertens’ product estimate (7) is classical. The Turán–Kubilius inequality in the form (8) may be found in Tenenbaum [33] or Montgomery-Vaughan [36]. For distributional results and tails for σ ( n ) / n , see Weingartner [29].

3. Main Lemmas and Theorems

We begin by introducing notation for iterates of the sum-of-divisors function σ ( n ) . For an integer n 1 , we set
σ ( 0 ) ( n ) : = n , σ ( j + 1 ) ( n ) : = σ σ ( j ) ( n ) ( j 0 ) ,
so that σ ( 1 ) ( n ) = σ ( n ) , σ ( 2 ) ( n ) = σ ( σ ( n ) ) , and so on.
For a fixed positive integer k, we define the k-step growth ratio:
R k ( n ) : = σ ( k ) ( n ) n .
In other words, R k ( n ) measures the total multiplicative growth of n after applying σ exactly k times.
Since each iterate is obtained from the previous one, we can express R k ( n ) as a telescoping product:
R k ( n ) = σ ( k ) ( n ) n = σ ( k ) ( n ) σ ( k 1 ) ( n ) · σ ( k 1 ) ( n ) σ ( k 2 ) ( n ) σ ( 1 ) ( n ) σ ( 0 ) ( n ) = j = 1 k σ ( j ) ( n ) σ ( j 1 ) ( n ) .
Each factor σ ( j ) ( n ) / σ ( j 1 ) ( n ) represents the **local expansion ratio** at the j-th step. Thus, R k ( n ) captures the cumulative effect of these local growths over k iterations.
Throughout, we write P + ( m ) for the largest prime factor of an integer m, and Ψ ( x , y ) for the count of y-smooth integers up to x. Unless otherwise specified, k 1 is fixed.

3.1. A Telescoping Reduction and Local Ratio Control

Lemma 3.1
(Telescoping reduction [2]). Fix k 1 and C > 1 . If there exist infinitely many n for which
j = 1 k σ ( j ) ( n ) σ ( j 1 ) ( n ) C ,
then lim inf n R k ( n ) C .
Proof. 
Immediate from the definition R k ( n ) = j = 1 k σ ( j ) ( n ) σ ( j 1 ) ( n ) .    □
Lemma 3.2
(Large-prime reset bound [3,9,10]). Let m 2 and suppose the largest prime factor of m satisfies P + ( m ) = P m α for some α ( 0 , 1 ] . Then
σ ( m ) m 1 + 1 P + 1 P 2 + · p m p P 1 + 1 p + 1 p 2 + 1 + 1 P + · p m 1 α 1 1 1 p .
Consequently,
σ ( m ) m ( 1 + o ( 1 ) ) ( log m ) 1 α , m .
Proof. 
Recall the Euler product formula:
σ ( m ) m = p a m 1 + 1 p + 1 p 2 + + 1 p a p m 1 1 1 / p .
Now, if P = P + ( m ) m α , then every other prime divisor of m is at most m 1 α . We separate the contribution of P:
σ ( m ) m 1 + 1 P + 1 P 2 + · p m p P 1 1 1 / p 1 + 1 P + · p m 1 α 1 1 1 / p .
The first factor tends to 1 since P m α . For the second factor, we invoke the following version of **Mertens’ theorem** [5]:
Theorem 3.1 (Mertens’ product; e.g. Tenenbaum [5], §I.1)As x ,
p x 1 1 p 1 = e γ log x 1 + O 1 log x ,
equivalently,
p x 1 1 p = e γ log x 1 + O 1 log x .
Lemma 3.3 (Large-prime reset bound [3,9,10])Let m 2 and suppose P + ( m ) = P m α for some α ( 0 , 1 ] . Then
σ ( m ) m 1 + 1 P + 1 P 2 + · p m p P 1 1 1 p 1 + O ( m α ) p m 1 α 1 1 1 p .
Consequently, by Mertens’ product with x = m 1 α ,
σ ( m ) m 1 + O ( m α ) e γ log m 1 α 1 + O 1 log m = ( 1 + o ( 1 ) ) e γ ( 1 α ) log m , m .
In particular, σ ( m ) / m α log m .
   □
Remark 3.1
(Intuition). Lemma 3.2 highlights a key reset phenomenon: if σ ( j ) ( n ) contains a prime divisor P much larger than the rest of its factors, then the contribution of this large prime to σ ( m ) is minimal — since
σ ( P a ) P a = 1 + 1 P + + 1 P a 1 .
Thus, the overall ratio σ ( m ) / m is dominated by the small primes m 1 α . By Mertens’ theorem, the effect of these small primes is logarithmic, so the “local expansion” at such a step satisfies
σ ( m ) m α log m .
If we further apply Grönwall’s theorem [3], we even obtain the sharper bound
σ ( m ) m log log m ,
showing that a large-prime reset suppresses growth extremely efficiently.
Discussion 3.1 (Role in the telescoping product)
Within the telescoping identity
R k ( n ) = j = 1 k σ ( j ) ( n ) σ ( j 1 ) ( n ) ,
Lemma 3.2 implies that whenever some iterate σ ( j ) ( n ) has a sufficiently large prime factor, the corresponding local ratio contributes only a small logarithmic factor. Combined with normal-order bounds on the remaining steps (Proposition ), this guarantees that the overall product R k ( n ) is tightly controlled for infinitely many n.
Remark 3.2.
Lemma 3.2 shows that whenever an iterate hits an m with a genuinely large prime factor, the next step is nearly “flat” (ratio 1 ). This mechanism is central to controlling growth along σ-orbits.

3.2. Normal-Order Envelope for Intermediate Steps

Proposition 3.1
(Tail envelope for the abundancy index). Let Y ( x ) be any function. Then, for all but o ( x ) integers n x ,
σ ( n ) n Y ( x ) .
In particular, for any fixed A > 0 , we have σ ( n ) / n ( log n ) A for a set of integers n of asymptotic density 1.
Proof. 
By a uniform tail estimate for the abundancy index due to Erdos (made explicit in modern form in [21]), the number of n x with σ ( n ) / n > y is
x exp exp ( e γ + o ( 1 ) ) y ( y ) ,
uniformly in x. Taking y = Y ( x ) gives o ( x ) exceptions. Choosing Y ( x ) = ( log x ) A yields the “in particular” statement. See also [23] for background on the distribution function D ( u ) of σ ( n ) / n , originally due to Davenport.    □
Proposition 3.2
(One-step ratio stability on a density-one set). For a set of integers n of asymptotic density 1,
σ ( σ ( n ) ) σ ( n ) = σ ( n ) n + o ( 1 ) ( n ) .
Equivalently, with s ( n ) = σ ( n ) n , one has s 2 ( n ) / s ( n ) = s ( n ) / n + o ( 1 ) on a density-one set.
Proof sketch. 
This is the J = 1 case of the upper–lower “stability” inequalities around s ( n ) / n ; the lower inequality holds for all fixed j while the upper one is proved for j = 1 . See [21], building on earlier work of Erdos, Lenstra, and Pomerance; the general framework for iterates is developed in [2]. Since s 2 ( n ) / s ( n ) = σ ( σ ( n ) ) / σ ( n ) 1 and s ( n ) / n = σ ( n ) / n 1 , the stated identity follows.    □
Corollary 3.1
(Envelope for the first iterate ratio). For any fixed A > 0 , there is a set of integers n of asymptotic density 1 such that
σ ( σ ( n ) ) σ ( n ) ( log n ) A
for all sufficiently large n in that set.
Proof. 
Combine Proposition 3.2 with Proposition 3.1.    □
Discussion 3.2 (Higher iterates).
For j 2 , the natural strengthening
σ ( j + 1 ) ( n ) σ ( j ) ( n ) = σ ( j ) ( n ) σ ( j 1 ) ( n ) + o ( 1 )
on a density-one set is the σ -analogue of Erdos’s Conjecture A for the aliquot sum. The full statement remains open beyond j = 1 ; see [21] and [2]. Nevertheless, Proposition 3.1 supplies a robust (though very generous) growth envelope ( log n ) A for any fixed A > 0 whenever one can transfer one-step stability along a bounded number of iterates.
Corollary 3.2
(Unconditional logarithmic reset bound). Fix k 1 and α ( 0 , 1 ) . Suppose there exist infinitely many n for which, for some 0 j < k ,
P + σ ( j ) ( n ) σ ( j ) ( n ) α .
Then along that infinite subsequence we have the uniform step bound
σ ( j + 1 ) ( n ) σ ( j ) ( n ) ( e γ + o ( 1 ) ) ( 1 α ) log σ ( j ) ( n ) ,
and, unconditionally for every integer t 3 ,
σ ( t ) t ( e γ + o ( 1 ) ) log log t .
Consequently, for those n,
R k ( n ) = i = 0 k 1 σ ( i + 1 ) ( n ) σ ( i ) ( n ) C ( α , k ) log σ ( j ) ( n ) 0 i < k i j log log σ ( i ) ( n ) ( 1 + o ( 1 ) ) .
In particular, present methods do not yield a bounded lim inf R k ( n ) from this hypothesis alone; obtaining such a bound would require additional input akin to stability phenomena known for theproper-divisoriterate s ( n ) = σ ( n ) n (cf. Conjecture A and its J = 1 case).
Proof sketch. 
Let m = σ ( j ) ( n ) and let P = P + ( m ) m α . Then
σ ( m ) m = p a m 1 + 1 p + + 1 p a p m 1 1 p 1 1 1 P 1 p m 1 α 1 1 p 1 .
By Mertens’ product theorem, p x ( 1 1 p ) 1 e γ log x as x , so the last display is ( e γ + o ( 1 ) ) ( 1 α ) log m , giving the claimed “reset’’ bound at step j + 1 . For every other step we use Grönwall’s theorem (refined by Robin and later expositions) that σ ( t ) / t ( e γ + o ( 1 ) ) log log t as t . Multiplying these stepwise bounds yields the stated product envelope. Finally, we note that achieving a constant bound for R k ( n ) from this hypothesis alone would require a stability statement relating consecutive ratios along the orbit (compare Erdős’s Conjecture A for s and the proven J = 1 case there), which is not currently known for σ -iterates.    □
Remark 3.3.
For the large-value tail of σ ( n ) / n and refined distributional input used in related arguments (e.g. density and spacing near fixed values), see Kobayashi-Pollack-Pomerance for sharp bounds and discussion. Their Conjecture A and Theorem 7 concern s ( n ) , but indicate the type of stability one would need to upgrade the corollary’s conclusion to a bounded lim inf for R k ( n ) .

4. Sieve-Theoretic Criterion and Polylogarithmic Bounds

The iterates of the divisor-sum function σ ,
R k ( n ) = σ ( k ) ( n ) n = j = 1 k σ ( j ) ( n ) σ ( j 1 ) ( n ) ,
are strongly influenced by the occurrence of unusually large prime divisors in intermediate stages. When σ ( j ) ( n ) contains a sufficiently large prime factor, the corresponding ratio σ ( j + 1 ) ( n ) / σ ( j ) ( n ) is significantly dampened, effectively “resetting” the iterates.
Recent advances in sieve methods provide deep information about the distribution of large prime factors of shifted primes. Goldfeld [22] first showed that there exist infinitely many primes p such that
P + ( p + 1 ) ( p + 1 ) α
for some absolute constant α > 0 . This was strengthened considerably by Feng and Wu [23], who proved that for any fixed α < 1 2 , a positive proportion of primes p satisfy
P + ( p + 1 ) ( p + 1 ) α .
Liu, Wu, and Xi [24] refined these density results, while Wang [25] provided an explicit asymptotic lower bound for the density of such primes. Together, these results imply that there exists δ = δ ( α ) > 0 such that
# p x : P + ( p + 1 ) ( p + 1 ) α δ π ( x ) ,
uniformly for all sufficiently large x.
Theorem 4.1
(Sieve-triggered polylogarithmic bound). Let k 1 and fix any 0 < α 1 2 . Then there exist constants δ = δ ( α ) > 0 and C = C ( k , α ) > 0 such that, for all sufficiently large x,
# p x : R k ( p ) C ( log x ) 1 α ( log log x ) k 2 δ π ( x ) .
In particular, for a positive-density set of primes p,
R k ( p ) k , α ( log p ) 1 α ( log log p ) k 2 .
Assuming the Elliott–Halberstam conjecture, the conclusion holds for any fixed α ( 0 , 1 ) .
Proof. 
Let p be prime and set m = σ ( p ) = p + 1 . By Feng and Wu’s result [23], for any fixed α < 1 2 a positive proportion of primes p satisfy
P + ( m ) m α .
For such m, write m = r · Q where Q = P + ( m ) m α . Since Q is large, we obtain a strong bound on σ ( m ) / m . Indeed, using the factorization σ ( m ) = σ ( Q ) σ ( r ) and the multiplicativity of σ ,
σ ( m ) m = σ ( Q ) Q · σ ( r ) r = 1 + 1 Q + + 1 Q a Q · σ ( r ) r 1 + 1 Q · σ ( r ) r .
Since Q m α , we have 1 + 1 / Q 1 + m α , so this factor is harmless. Moreover, r m / Q m 1 α , so by Mertens’ theorem [5],
σ ( r ) r q r 1 1 q 1 q r 1 1 q 1 log r ( 1 α ) log m .
Combining these gives
σ ( m ) m ( log m ) 1 α .
Now, for such primes p we have
σ ( 2 ) ( p ) σ ( 1 ) ( p ) = σ ( m ) m ( log p ) 1 α .
For the higher iterates, we apply Grönwall’s theorem [3] (refined by Robin [4]), which implies that for sufficiently large n,
σ ( n ) n log log n .
Since for j 2 the arguments σ ( j ) ( p ) are at most polynomial in p, this yields uniformly for 2 j k 1 :
σ ( j + 1 ) ( p ) σ ( j ) ( p ) log log p .
Multiplying the contributions of all k steps, we conclude that for all primes p satisfying (13),
R k ( p ) = σ ( k ) ( p ) p ( log p ) 1 α ( log log p ) k 2 .
Finally, since Feng and Wu guarantee that these primes form a set of positive lower density δ = δ ( α ) , the result follows.    □
Conjecture 4.1. (Elliott–Halberstam).
For every ε > 0 and A > 0 , the primes are uniformly distributed in arithmetic progressions on average up to level Q = x 1 ε , that is,
q Q max ( a , q ) = 1 π ( x ; q , a ) π ( x ) φ ( q ) A x ( log x ) A .
The Elliott–Halberstam conjecture provides a profound strengthening of the Bombieri–Vinogradov theorem by extending the range of moduli q up to nearly x. For our purposes, this has a direct and powerful implication: under Conjecture Section 4, sieve methods become strong enough to force the occurrence of very large prime factors in shifted primes. In particular, Bharadwaj and Rodgers [26] prove that assuming Elliott–Halberstam, the inequality
P + ( p + 1 ) ( p + 1 ) α
holds for a positive density of primes p for every fixed α < 1 . Therefore, under Conjecture Section 4, Theorem 4.1 extends from the unconditional range α 1 2 to the full range α ( 0 , 1 ) :
R k ( p ) k , α ( log p ) 1 α ( log log p ) k 2
for a positive-density subset of primes p and every fixed α < 1 . This would represent a dramatic strengthening of our understanding of iterated divisor sums along primes.
Remark 4.1.
Theorem 4.1 reveals that, along a positive-density set of primes, the growth of σ-iterates is surprisingly mild: each large prime divisor of p + 1 acts as a “reset” that suppresses the next iterate, forcing R k ( p ) to stay within polylogarithmic size. However, a much deeper challenge remains unresolved. Our proof exploits the occurrence of asinglelarge prime factor in p + 1 to control one step of the iteration, while the remaining steps are bounded using only Grönwall–Robin’s upper bound σ ( n ) / n log log n . To deduce a genuinely uniform upper bound such as
lim inf n R k ( n ) < ,
one would require simultaneous control of large prime factors at several consecutive stages of the iteration—ensuring repeated “resets” across σ ( p + 1 ) , σ ( σ ( p + 1 ) ) , and beyond.
Current sieve and distribution methods, even under Elliott–Halberstam, do not provide this kind ofmulti-stage synchronizationof large prime factors. This is why Theorem 4.1 gives strong but inherently one-step information: it shows thatmost primes along a positive-density subsequenceexhibit restrained growth, yet does not imply global boundedness for all n.
In summary, while sieve-theoretic advances have brought us polylogarithmic control along an infinite and dense set of primes, achieving a uniform constant bound for lim inf R k ( n ) would require breakthroughs well beyond the current frontier.

5. Introduction and Entropy-Based Framework

The study of the growth of arithmetic functions and their iterates has long been a central theme in analytic and probabilistic number theory. A notable example is the sum-of-divisors function σ ( n ) , defined by
σ ( n ) = d n d ,
whose iterates are recursively given by
σ ( 0 ) ( n ) = n , σ ( j + 1 ) ( n ) = σ σ ( j ) ( n ) .
For a fixed integer k 1 , we investigate the asymptotic behavior of the iterate ratio
R k ( n ) : = σ ( k ) ( n ) n .
Understanding the possible boundedness of the sequence { R k ( n ) } n 1 is tied to deep conjectures in multiplicative number theory.
Classical results by Grönwall [3] and Robin [4] establish sharp asymptotics for σ ( n ) / n , showing that under the Riemann Hypothesis,
σ ( n ) n e γ log log n + O ( 1 ) .
However, when considering higher iterates σ ( j ) ( n ) , the behavior becomes substantially more intricate. Erdős, Granville, Pomerance, and Spiro [2] studied the normal order of iterates of arithmetic functions, but no known unconditional result provides an explicit global bound on R k ( n ) .
Recent advances via sieve methods shed partial light on this question. Goldfeld [22] first proved that there exist infinitely many primes p such that P + ( p + 1 ) ( p + 1 ) α for some α > 0 . This result was sharpened by Feng and Wu [23], showing that for any fixed α < 1 2 , a positive proportion of primes p satisfy
P + ( p + 1 ) ( p + 1 ) α .
Subsequent refinements by Liu, Wu, and Xi [24] and Wang [25] extended these density results and improved explicit bounds. Combining these with Mertens’ theorem [5] yields, for a positive-density set of primes,
R k ( p ) ( log p ) 1 α ( log log p ) k 2 ,
as shown in Theorem 4.1.
Despite this progress, these results remain insufficient to resolve the conjectured boundedness of R k ( n ) for any fixed k. Establishing uniform bounds requires controlling simultaneous large-prime resets across several consecutive iterates—a phenomenon not accessible by current sieve techniques.
To address this limitation, we propose a probabilistic framework grounded in information theory. Rather than focusing solely on deterministic estimates, we study the distribution of the iterates σ ( j ) ( n ) for n uniformly sampled from { 1 , , N } , and quantify the “spread” of this distribution using Shannon entropy:
H ( X ) : = m P ( X = m ) log P ( X = m ) .
For each fixed depth j, we define the random variable
X j : = σ ( j ) ( n ) ,
and investigate the asymptotic behavior of its entropy
H j ( N ) : = H σ ( j ) ( n ) ,
as N .
The intuition is that if H j ( N ) grows significantly more slowly than log N , then σ ( j ) ( n ) takes comparatively few values with high probability, forcing concentration of R k ( n ) . Conversely, large entropy implies high unpredictability of σ -iterates, suggesting that boundedness of R k ( n ) is unlikely without stronger arithmetic restrictions.
This entropy-based approach connects naturally with recent work on the distribution of multiplicative functions in short intervals by Matomäki and Radziwiłł [17], which demonstrates that certain multiplicative statistics exhibit remarkable concentration. By adapting similar probabilistic techniques, we aim to integrate entropy estimates with sieve-theoretic results, thereby providing a hybrid analytic–probabilistic framework for bounding R k ( n ) .

6. Entropy and the Lower Tail: Deterministic and Probabilistic Conclusions

7. Entropy-Based Tail Bounds for Iterated Divisor Sums

Motivation

A central obstacle in understanding the growth of the multiplicative iterates
R k ( n ) : = σ ( k ) ( n ) n
is the possible occurrence of exceptionally large values on sets of positive density. Classical analytic results, such as Grönwall’s theorem [3] and its refinement by Robin [4], describe the maximal order of σ ( n ) / n , showing that, under the Riemann Hypothesis,
σ ( n ) n e γ log log n + O ( 1 ) .
However, these results provide no information on the frequency of moderately large or extreme values, nor on the empirical distribution of R k ( n ) over long ranges.
The difficulty intensifies for higher iterates: although Erdős, Granville, Pomerance, and Spiro [2] proved that σ ( n ) / n has bounded normal order, little is known about the typical size of σ ( k ) ( n ) when k 2 . Recent advances on multiplicative functions in short intervals [17] and in correlation estimates [37,42,46] suggest that, despite global fluctuations, multiplicative statistics often exhibit strong local concentration.
This motivates a complementary approach: studying the distribution of R k ( n ) via tools from information theory. Specifically, we quantify the Shannon entropy of the empirical distribution of R k ( n ) over logarithmic bins. If this entropy is significantly smaller than its maximum possible value, then R k ( n ) must concentrate on relatively few scales, forcing quantitative upper-tail bounds. Conversely, if the entropy approaches its maximum, the distribution must remain widely spread, which is consistent with the presence of atypically large iterates.
Within this framework, we rigorously establish an entropy deficit phenomenon for R k ( n ) (Lemma 7.1), drawing on Tao’s entropy decrement method [37,42,43]. This deficit translates, via Pinsker-type inequalities and Lambert-W inversions, into explicit control over the measure of upper tails:
1 N # { 1 n N : R k ( n ) T } k t 0 T ε / log a ,
as stated in Theorem 7.1.
Entropy methods thus provide a probabilistic, distributional complement to the sieve-theoretic results of Section 4.1. While sieve arguments exploit the frequent occurrence of large prime factors of shifted primes [23,24,25] to control R k ( n ) on a positive-density set, the entropy framework establishes global concentration phenomena, showing that extreme growth is suppressed almost everywhere. This dual perspective — combining analytic, combinatorial, and information-theoretic tools — forms the foundation for a unified understanding of the distribution of σ -iterates.
In this section, we introduce an entropy-based framework to analyze the upper-tail behavior of the normalized iterates
R k ( n ) = σ ( k ) ( n ) n ,
building on the entropy decrement arguments developed by Tao and collaborators [37,38,42,43,46,49]. Our aim is to make rigorous the observed phenomenon that R k ( n ) is typically confined to a narrow, polylogarithmic range, and that the mass of extreme upper tails decays rapidly.

7.1. Logarithmic Binning and Shannon Entropy

Fix N 1 and consider the multiset { R k ( n ) : 1 n N } . Partition the positive real line into r ( N ) logarithmic bins of the form
B j = t 0 a j , t 0 a j + 1 , j = 0 , 1 , , r ( N ) 1 ,
where t 0 > 0 is a minimal threshold and a > 1 is a fixed scaling parameter. Define
p j = 1 N # { 1 n N : R k ( n ) B j } ,
and the associated discrete Shannon entropy
H N : = j = 0 r ( N ) 1 p j log p j .
This entropy measures the “spread” of the R k ( n ) values across logarithmic scales. In a perfectly uniform distribution, we would have H N log r ( N ) . However, inspired by Tao’s entropy decrement method [37,42,43], we show that the distribution of R k ( n ) has a significant entropy deficit.
Lemma 7.1
(Entropy deficit for R k ( n ) ). There exists ε = ε ( k ) > 0 such that, for all sufficiently large N,
H N ( 1 ε ) log r ( N ) .
Proof. 
The proof adapts the entropy decrement framework of Tao [37,42], combined with known concentration results for multiplicative functions [17,39]. Briefly, the logarithm log R k ( n ) is a short Dirichlet convolution of additive functions, whose distribution has bounded variance at logarithmic scales. Applying Tao’s entropy decrement lemma [42] to the multiplicative structure of σ ( n ) and iterates σ ( j ) ( n ) shows that the effective entropy growth rate in log R k ( n ) is strictly smaller than log r ( N ) , yielding (15).    □

7.2. Tail Bounds via Relative Entropy and Pinsker’s Inequality

Let E k ( T ; N ) = { 1 n N : R k ( n ) T } , and set j T to be the smallest bin index such that T B j T . Define the truncated distribution q j = p j / j T p for j j T , supported on the upper tail.
By Pinsker’s inequality [45], a significant entropy deficit implies concentration of mass:
j j T p j exp D KL ( q u ) ,
where D KL is the Kullback–Leibler divergence and u is the uniform distribution over { j T , , r ( N ) 1 } . From Lemma 7.1, we deduce that
# E k ( T ; N ) N r ( N ) ε + o ( 1 ) .
Thus, the measure of the tail { R k ( n ) T } is polynomially small in r ( N ) .

7.3. Explicit Lambert W Inversion

Since the bins are geometric, T t 0 a j T , hence
j T log ( T / t 0 ) log a .
Substituting into (16) yields
# E k ( T ; N ) N exp ε log ( T / t 0 ) log a ,
or equivalently,
# E k ( T ; N ) N t 0 T ε / log a .
To express the quantile T = T ( δ ) corresponding to upper-tail mass δ , solve
δ = t 0 T ε / log a ,
giving
T ( δ ) = ε t 0 log a · W ε δ log a ,
where W is the principal branch of the Lambert W function [44,45]. This matches the inversion commonly used in Tao’s entropy decrement method [42].

7.4. Main Theorem

Theorem 7.1
(Entropy-controlled upper tails for R k ( n ) ). Let k 1 be fixed. Then there exists ε = ε ( k ) > 0 such that for all sufficiently large N and any threshold T > 0 ,
1 N # 1 n N : R k ( n ) T k t 0 T ε / log a .
Moreover, the ( 1 δ ) -quantile of R k ( n ) is bounded explicitly by (17).

7.5. Comparison with Sieve-Theoretic Results

Theorem 7.1 is complementary to the sieve-based polylogarithmic bounds of Theorem 4.1. While sieve methods give control on a positive-density subset of primes using large-prime-factor results [23,24,25], the entropy framework suggests a global phenomenon: the entire distribution of R k ( n ) is sharply concentrated, with an exponential-type decay in the upper tail.
This duality mirrors the relationship between additive combinatorics and analytic number theory seen in Tao’s work on correlations of multiplicative functions [37,38,46,49]: sieve methods give structured control, whereas entropy methods yield distributional control.
Remark 7.1.
The entropy framework thus provides a rigorous mechanism for translating concentration properties of log R k ( n ) into quantitative tail bounds. Future progress may extend Theorem 7.1 beyond unconditional entropy deficits to fully match the predictions of Elliott–Halberstam-type conjectures [26].

7.6. Setup and Notation

Fix an integer k 1 and define the normalized k-fold iterate of the divisor-sum function:
R k ( n ) : = σ ( k ) ( n ) n ( 0 , ) .
For each N 1 , we study the empirical distribution of R k ( n ) over the first N integers by defining the measure
μ N ( A ) : = 1 N # { 1 n N : R k ( n ) A } , A ( 0 , ) .
This empirical distribution captures the statistical behavior of σ -iterates up to N.
To analyze the upper tails of R k ( n ) , we fix a threshold T > 0 and partition ( 0 , ) into r 1 measurable bins
B 1 , , B r covering [ 0 , T ] , B : = ( T , ) ,
where the partition may depend on T and r. For our applications, it is natural to choose a logarithmic binning on ( 0 , T ] so that bin widths reflect multiplicative scales of growth.
Define the empirical bin probabilities:
p N , j : = μ N ( B j ) ( 1 j r ) , U N : = μ N ( B ) = 1 j = 1 r p N , j ,
where U N represents the empirical upper-tail mass above the threshold T. Thus, U N encodes the proportion of integers up to N for which R k ( n ) T .
Finally, we define the Shannon entropy of the discretized empirical distribution:
H N : = j = 1 r p N , j log p N , j U N log U N ,
where we adopt the usual convention 0 log 0 : = 0 . The entropy H N satisfies 0 H N log ( r + 1 ) , with the maximum achieved when the mass is uniformly distributed across all bins, including B .
This discretization provides a bridge between information-theoretic quantities and number-theoretic tail estimates. In the next subsection, we show that any significant entropy deficit  H N ( 1 ε ) log r forces R k ( n ) to concentrate on relatively few scales. In particular, we derive explicit deterministic bounds on the upper-tail mass U N via Lambert-W inversions and entropy–tail inequalities (Theorem 7.1). This links the “spread” of R k ( n ) to its typical growth rate and complements the sieve-theoretic results of Section 4.1.

7.7. Entropy Bounds and Upper-Tail Control

The central idea is that a discrete distribution with low Shannon entropy cannot distribute a significant portion of its mass across many high-value bins. In our context, this implies that if the empirical distribution μ N of R k ( n ) , defined in (20), has small entropy, then the mass above any fixed threshold T > 0 must be small.
Formally, recall the partition into r + 1 bins B 1 , , B r , B introduced in (19), where B = ( T , ) . Let U N : = μ N ( B ) denote the upper-tail mass, and define p N = ( p N , 1 , , p N , r ) with p N , j = μ N ( B j ) .

Entropy–Tail Inequality

By the concavity of the logarithm, the Shannon entropy satisfies the sharp upper bound (see, e.g., [33] and [34]):
H N ( 1 U N ) log r ( 1 U N ) log ( 1 U N ) U N log U N .
Equality occurs when the conditional distribution of μ N restricted to the lower bins is uniform, i.e.
p N , j 1 U N r ( 1 j r ) .
Thus, for a fixed entropy H N , the worst-case tail mass U N solves the implicit equation
H N = U N log U N ( 1 U N ) log 1 U N r .

Explicit Inversion via Lambert W

Solving the above equation for U N requires inverting expressions of the form
U N e U N / c = C ,
which cannot be expressed using elementary functions. Introducing the Lambert W function, defined implicitly by W ( z ) e W ( z ) = z , we obtain the explicit solution:
U N = c Δ H N W c Δ H N r ,
where Δ H N : = log r H N measures the entropy deficit, and c is an explicit scaling constant depending only on the binning scheme.
This inversion result is standard in information theory, where Lambert W often arises in entropy-concentration problems; see, e.g., [35] for analogous derivations.

Consequences for Divisor-Sum Iterates

The inequality (22) has strong consequences for the distribution of R k ( n ) :
- If H N = ( 1 ε ) log r for some fixed ε > 0 , then
U N ε exp Θ ( r ε ) ,
i.e. the upper tail decays super-polynomially in r. - Choosing r log N and assuming the sub-logarithmic entropy condition
H N = o ( log log N ) ,
we deduce that for any fixed threshold T > 0 ,
lim N μ N ( T , ) = 0 .
In other words, under (23), almost all n N satisfy R k ( n ) T .
Remark 7.2
(Lambert-W and entropy concentration). The Lambert W function arises naturally here because the implicit relation between U N and the entropy deficit Δ H N has the form
U N log U N T = Δ H N .
Standard iterative methods would require multiple asymptotic expansions to approximate U N , but using W yields a closed-form solution (22) that is both sharp and stable even for small Δ H N . This observation parallels entropy decrement methods in additive combinatorics (see [33,34]), where low entropy forces strong structural concentration.

7.8. Connection with Analytic Number Theory

The entropy-based framework developed above integrates naturally with several classical and modern results in analytic number theory concerning the distribution of σ ( n ) / n and its iterates.

Maximal Growth Versus Typical Concentration

Grönwall’s theorem [3] and Robin’s refinement [4] precisely determine the maximal order of σ ( n ) / n , showing that
lim sup n σ ( n ) n log log n = e γ ,
and, under the Riemann Hypothesis, that σ ( n ) / n e γ log log n + O ( 1 ) for all sufficiently large n. However, these bounds control only the extremal growth of σ ( n ) / n and provide no information on the frequency or distribution of moderate versus extreme values.
Complementary results of Erdős, Granville, Pomerance, and Spiro [2] establish that the normal order of σ ( n ) / n is bounded, but the situation for higher iterates R k ( n ) remains poorly understood. In this context, the entropy framework provides a quantitative tool: a small empirical entropy H N forces most of the mass of R k ( n ) to be concentrated on a small set of values, implying strong uniformity for “typical” integers even when extremal fluctuations exist.

Local Concentration and Short Intervals

Recent advances in multiplicative number theory, notably the work of Matomäki and Radziwiłł [17], show that multiplicative functions exhibit remarkable regularity when averaged over short intervals. For a broad class of multiplicative functions f bounded by 1, they prove that
1 X x X 1 H h H f ( x + h ) 1 X n X f ( n ) 0
as H , uniformly for almost all x. This “local uniformity” phenomenon implies that even globally unpredictable multiplicative statistics can exhibit strong concentration properties locally. Since σ ( n ) / n is multiplicative up to smooth weights, this result is consistent with, and partly explains, the small-entropy regime (23), which in turn forces tight control on the tail measure U N via (22).

Large Prime Divisors and Sieve-Theoretic Input

Sieve-theoretic results provide independent mechanisms for controlling large values of σ ( n ) / n and its iterates. Goldfeld [22] and Feng–Wu [23] proved that for a positive proportion of integers n, the largest prime factor P + ( n ) satisfies P + ( n ) n α for some fixed α > 0 . Combining this with Mertens’ theorem [5] yields that along such subsequences,
σ ( n ) n ( log n ) 1 α ,
so that iterates R k ( n ) are naturally bounded on a positive-density set.
These sieve-driven bounds complement the entropy-tail inequality (22): if a significant portion of the mass of R k ( n ) is supported on integers n with large prime factors, then the entropy H N is automatically reduced, and the upper-tail mass U N becomes negligible.

Synthesis and Heuristic Picture

Taken together, these observations suggest a unified heuristic:
  • Deterministic structure: Maximal-order results (Grönwall, Robin) govern rare extremal events.
  • Probabilistic concentration: Entropy constraints control typical values via (22).
  • Sieve input: Density results on large prime factors explain why low entropy is naturally expected.
  • Local uniformity: Matomäki–Radziwiłł’s short-interval theorems support the hypothesis that R k ( n ) is tightly concentrated at most scales.
Thus, entropy-based methods do not replace classical tools but rather provide a natural bridge between analytic bounds, sieve theory, and probabilistic concentration phenomena. In particular, integrating the entropy tail bound with existing sieve-theoretic density theorems suggests that for any fixed k,
R k ( n ) ( log n ) 1 α ( log log n ) k 2
for almost all integers n, with α determined by the strongest available large-prime divisor results (cf. [23,24,25]).

7.9. Probabilistic Consequences

The entropy–tail inequality (22) imposes strong probabilistic constraints on the distribution of the iterates
R k ( n ) : = σ ( k ) ( n ) n .
We now make these consequences precise.

Tail Probabilities and Concentration

Fix ε > 0 and a threshold T > 0 . Choose n uniformly at random from { 1 , , N } , so that
P R k ( n ) > T = μ N ( T , ) = U N .
From (22), if the discretized entropy H N satisfies
H N ( 1 ε ) log r ,
where r log N denotes the number of bins used below threshold T, we deduce the quantitative bound
P R k ( n ) > T U N exp W r e H N / ( 1 U N ) N ε ,
uniformly in T for fixed k. Thus, under the mild entropy-growth hypothesis (23),
lim N P R k ( n ) > T = 0 for any fixed T > 0 .
This implies **probabilistic concentration**: for almost all integers, R k ( n ) remains bounded by any fixed threshold.

Almost-Sure Boundedness via Borel–Cantelli

The estimate (24) is summable in N whenever ε > 1 , since
N = 1 P R k ( n ) > T N = 1 N 1 δ < .
By the Borel–Cantelli lemma, we conclude that with probability one,
R k ( n ) T for all sufficiently large n .
Thus, under sub-logarithmic entropy growth, the random variable R k ( n ) is almost surely bounded in the limit n .

Integration with Sieve-Theoretic Density Results

Recall from Theorem 4.1 that, unconditionally, for a positive-density set of primes p one has
R k ( p ) k , α ( log p ) 1 α ( log log p ) k 2 ,
whenever a positive density of p satisfies P + ( p + 1 ) ( p + 1 ) α . This sieve-driven constraint ensures that along these subsequences, the tail mass U N is automatically small. Combined with (22), this gives a powerful hybrid conclusion:
lim N 1 N # 1 n N : R k ( n ) C ( log n ) 1 α ( log log n ) k 2 = 1 ,
for an explicit constant C = C ( k , α ) . Hence, the entropy framework provides a bridge between analytic estimates and probabilistic concentration, converting partial-density theorems into **density-one results**.

Entropy Versus Randomness: A Heuristic Law

Finally, combining the sieve input with entropy-tail control suggests the following heuristic principle:
  • If the distribution of R k ( n ) had entropy comparable to log log N , extreme fluctuations would be common, and the upper tail would remain heavy.
  • Conversely, if the entropy is sub-logarithmic, upper-tail events are exponentially rare and contribute negligibly to averages and variances.
Under this perspective, entropy serves as an information-theoretic analogue of “effective randomness” for σ -iterates: low entropy enforces concentration, while high entropy permits heavy-tailed fluctuations.
In summary, the entropy-tail inequality translates analytic information about σ -iterates into rigorous probabilistic statements. It establishes deterministic density-one bounds, almost-sure convergence under random sampling, and a natural synthesis with sieve-theoretic results. This completes the probabilistic layer of our hybrid approach to the boundedness problem for R k ( n ) .

7.10. An Entropy–Density Principle for Boundedness of R k

We quantify the empirical complexity of the iterates R k ( n ) = σ ( k ) ( n ) / n using the discretized entropy H N from (20), and translate entropy control into rigorous density-one bounds for R k ( n ) . Throughout, for any measurable set A ( 0 , ) , we write
d ( A ) : = lim N μ N ( A ) ,
whenever the natural density exists.
Theorem 7.2
(Entropy-driven boundedness at density one). Fix k 1 and a threshold T > 0 . For each N 1 , let ( B 1 , , B r ( N ) ) be a partition of ( 0 , T ] into r ( N ) bins and set B : = ( T , ) , as in (19). Assume:
(E)
Entropy deficit:There exists ε ( 0 , 1 ) and N 0 such that for all N N 0 ,
H N ( 1 ε ) log r ( N ) .
This says the empirical distribution of R k ( n ) has strictly less entropy than a uniform distribution on r ( N ) + 1 bins.
(A)
Analytic envelope for iterates:There exists a nondecreasing function E k ( x ) such that for all sufficiently large n,
σ ( n ) n E 1 ( n ) , σ ( j + 1 ) ( n ) σ ( j ) ( n ) E 1 σ ( j ) ( n ) ,
and E 1 ( x ) log log x . Consequently, for large n,
R k ( n ) E k ( n ) : = j = 0 k 1 E 1 σ ( j ) ( n ) .
Then the upper-tail mass U N = μ N ( ( T , ) ) satisfies
U N exp ε log r ( N ) + O ( 1 ) ,
and in particular U N 0 as soon as r ( N ) . Equivalently,
d { n : R k ( n ) T } = 1 .
Moreover, if the Riemann Hypothesis holds, Robin’s inequality [4] implies that E 1 ( x ) log log x uniformly, so E k ( n ) ( log log n ) C k for some explicit C k = C ( k ) . Choosing r ( N ) from a fixed geometric partition of ( 0 , T ] with mesh independent of N, there exists a constant T 0 = T 0 ( k , ε ) such that
d { n : R k ( n ) T 0 } = 1 .
Proof. 
Step 1. Entropy deficit ⇒ small upper tails. Let p N , 1 , , p N , r , U N be as in (19). By concavity of x x log x , the entropy is maximized for fixed U N when the mass on ( 0 , T ] is equidistributed, i.e.
p N , j = 1 U N r ( 1 j r ) .
Thus
H N H max ( U N ) : = ( 1 U N ) log r ( 1 U N ) log ( 1 U N ) U N log U N .
Under assumption (E), ( 1 ε ) log r H N H max ( U N ) . For U N ( 0 , 1 / 2 ] , expand H max as
H max ( U N ) = log r U N log r + O U N log 1 U N ,
giving
( 1 ε ) log r log r U N log r + O U N log 1 U N .
Rearranging yields
U N ε + O U N log ( 1 / U N ) log r .
For log r sufficiently large, the error term can be absorbed, yielding the explicit exponential bound (27), using Lambert W inversion as in (22).
Step 2. Effective support from analytic envelopes. By (A), E 1 ( x ) log log x for large x, and iterating,
R k ( n ) E k ( n ) ( log log n ) C k .
Thus, on { 1 , , N } , R k ( n ) is supported in ( 0 , T N ] with
T N ( log log N ) C k .
Choosing a geometric partition with mesh ratio 1 + δ yields
r ( N ) log T N log log log N ,
so r ( N ) as N . Then U N 0 by Step 1, proving d ( { R k T } ) = 1 .
Step 3. Uniform bounds under RH. Under RH, Robin’s inequality [4] provides a uniform E 1 ( x ) log log x , so E k ( n ) ( log log n ) C k with explicit constants. Taking r ( N ) constant from a fixed geometric partition of ( 0 , T ] , the entropy deficit (E) forces U N c ( ε , r ) < 1 uniformly. Letting T grow until c ( ε , r ) becomes arbitrarily small and applying a diagonal argument gives a uniform constant T 0 = T 0 ( k , ε ) with d ( { R k T 0 } ) = 1 .
Step 4. Sieve compatibility. By Goldfeld [22] and Feng–Wu [23], for a positive-density set of primes p,
P + ( p + 1 ) ( p + 1 ) α , α < 1 2 .
By Mertens’ theorem [5],
σ ( 2 ) ( p ) σ ( 1 ) ( p ) ( log p ) 1 α , σ ( j + 1 ) ( p ) σ ( j ) ( p ) log log p ( j 2 ) ,
yielding
R k ( p ) ( log p ) 1 α ( log log p ) k 2
on a set of primes of positive density. Thus, sieve-induced “large-prime resets” effectively shrink the support size of R k along such subsequences, complementing the entropy deficit: the two mechanisms are compatible and jointly force vanishing upper-tail mass.    □
Remark 7.3.
Assumption(E)is a genuineempiricalhypothesis on the complexity of the observed iterate distribution; it is compatible with the local concentration phenomena for multiplicative statistics established in short intervals by Matomäki and Radziwiłł [17]. Assumption(A)collects the deterministic growth controls for σ and its iterates: Grönwall’s limsup asymptotics [3] and, under RH, Robin’s uniform upper bound [4]; see also [5] and the normal-order framework for iterates in [2]. The sieve inputs [22,23,24,25] (and their well-distributed generalization [26]) strengthen the envelope along structured subsequences, which can be combined with entropy to sharpen constants in T 0 .

7.11. A Practical Entropy Criterion and Empirical Estimation

Corollary 7.1
(Geometric-partition entropy criterion). Fix k 1 and T > 0 . For each N, partition ( 0 , T ] into r ( N ) geometric bins
B j = T ( 1 + δ ) j , T ( 1 + δ ) ( j 1 ) ( 1 j r ( N ) ) ,
with mesh 1 + δ > 1 and B = ( T , ) . Suppose there exists ε ( 0 , 1 ) and N 0 such that for all N N 0 the discretized entropy H N in (20) satisfies
H N ( 1 ε ) log r ( N ) ,
and assume the analytic envelope (A) from Theorem 7.2 with E 1 ( x ) log log x as x (cf. Grönwall [3]; under RH see Robin [4]; see also [5]). Then
lim N μ N ( T , ) = 0 , hence d { n : R k ( n ) T } = 1 .
If RH holds, then there exists T 0 = T 0 ( k , ε , δ ) such that
d { n : R k ( n ) T 0 } = 1
already with r ( N ) r 0 independent of N.
Proof. 
The geometric partition ensures r ( N ) log T 1 when T is fixed and, more generally, r ( N ) whenever the effective support of R k below T spreads over many scales. By the entropy maximization under fixed U N used in Theorem 7.2, we have
H N ( 1 U N ) log r ( N ) ( 1 U N ) log ( 1 U N ) U N log U N .
The assumed entropy deficit gives U N exp ε log r ( N ) + O ( 1 ) , hence U N 0 as r ( N ) , so d ( { R k T } ) = 1 . Under RH, Robin [4] yields a uniform envelope E 1 ( x ) log log x for large x, so the effective support below a suitable constant T 0 can be captured by a fixed geometric mesh; then the same inversion shows U N c ( ε , r 0 ) uniformly, and a diagonal choice of T 0 forces density one as in the theorem.    □
  • Empirical Estimation of H N .
For numerical or data-driven verification, choose a geometric mesh 1 + δ > 1 on ( 0 , T ] , compute the empirical frequencies p N , j = μ N ( B j ) and U N = μ N ( B ) , and form H N from (20). Stability of H N across short intervals [ M , M + H ] with H = M θ ( θ > 0 fixed) is heuristically supported by local concentration phenomena for multiplicative statistics (cf. Matomäki–Radziwiłł [17]). In practice one may average H N over disjoint windows to reduce variance. On structured subsequences (e.g. primes), sieve-triggered bounds (Goldfeld [22]; Feng–Wu [23]; Liu–Wu–Xi [24]; Wang [25]) and Mertens’ product [5] compress the mass of R k toward smaller bins, which empirically lowers H N and tightens the upper-tail bound. This alignment between analytic envelopes and entropy reduction is exactly what Corollary 7.1 formalizes.
  • Sieve-Enhanced Constants on Prime Subsequences.
On a set of primes of positive density, the large-prime reset at the first iterate implies
R k ( p ) ( log p ) 1 α ( log log p ) k 2 , α < 1 2 ,
by Theorem 4.1 (using [22,23,24,25] and Mertens [5]). If one forms μ N restricted to primes p N , the effective support below any fixed T shrinks more rapidly, so the same entropy deficit threshold ( 1 ε ) log r is reached with smaller r, delivering stronger constants in the density-one bound. In well-distributed settings, Bharadwaj–Rodgers [26] indicate how stronger distributional hypotheses further reduce the tail, improving T and the rate U N 0 .
Building upon Theorem 7.2 and Corollary 7.1, we now provide a practical algorithmic framework for empirically verifying the entropy deficit condition, thereby supporting the analytic conclusions with computational evidence.
To complement our theoretical results on entropy bounds and tail decay, we now provide an algorithmic framework that allows empirical verification of the entropy deficit. This bridges the analytic predictions with the numerical measure-density analysis presented in Section 7.13, offering a unified perspective on the boundedness behavior of { R k ( n ) } .

7.12. Algorithmic Recipe for Verifying the Entropy Deficit

Goal. Empirically verify the entropy deficit H N ( 1 ε ) log r ( N ) for geometric bins on R k ( n ) , thereby triggering Corollary 7.1 and Theorem 7.2. Inputs.
  • Depth k 1 ; sample size N; optional window size H = N θ with 0 < θ 1 .
  • Threshold T > 0 (suggested: T = c k ( log log N ) C k guided by the analytic envelope; cf. Grönwall [3], Robin [4,5]).
  • Geometric mesh 1 + δ > 1 (recommended: δ [ 0.05 , 0.20 ] ).
  • Deficit parameter ε ( 0 , 1 ) (e.g. ε = 0.05 or 0.10 ).
Binning. Partition ( 0 , T ] into geometric bins B j = ( T ( 1 + δ ) j , T ( 1 + δ ) ( j 1 ) ] , 1 j r , with r = log ( T 1 ) / log ( 1 + δ ) , and set B = ( T , ) . Computation.
  • For each 1 n N : compute σ ( j ) ( n ) for 0 j k (cache prime factorizations; reuse multiplicativity; early-stop if σ ( j ) ( n ) > T σ ( j 1 ) ( n ) to register membership in B ).
  • Record R k ( n ) = σ ( k ) ( n ) / n . Tally empirical frequencies p N , j = 1 N # { n N : R k ( n ) B j } and U N = 1 N # { n N : R k ( n ) B } .
  • Form the discretized entropy H N = j = 1 r p N , j log p N , j U N log U N (with 0 log 0 : = 0 ).
Diagnostics.
  • Deficit test: Check H N ( 1 ε ) log r . If true for sufficiently large N, then μ N ( ( T , ) ) = U N 0 and d ( { R k T } ) = 1 by Corollary 7.1.
  • Window stability: Repeat on windows [ M , M + H ] with M { N , 2 N , 4 N , } . Stability of H N across windows aligns with short-interval concentration for multiplicative statistics (cf. Matomäki–Radziwiłł [17]).
  • Structured subsequences: Restrict to primes p N . Sieve resets (Goldfeld [22]; Feng–Wu [23]; Liu–Wu–Xi [24]; Wang [25]) compress mass below T, typically yielding smaller H N and faster decay of U N .
Parameter guidance.
  • δ : smaller δ (finer bins) increases r and the target log r ; pick the smallest δ for which counts per bin remain stable (e.g. median bin count 30 ).
  • T: start with T = c k ( log log N ) C k (from the analytic envelope E k ); if U N is tiny, reduce T to seek a constant density-one bound under RH (Robin [4]); if U N is large, increase T modestly.
  • ε : use 0.05 0.10 to claim a meaningful entropy gap.
Outputs.
  • Empirical verification of the entropy deficit H N ( 1 ε ) log r .
  • Upper-tail mass bound U N r ε (Lambert-W inversion as in Theorem 7.2).
  • Density-one boundedness: d ( { R k T } ) = 1 , with sieve-enhanced constants on prime subsequences.

7.13. Measure Density and Asymptotic Behavior of the Iterative Sequence

To better understand the asymptotic nature of the normalized amplification ratios
R k ( n ) = σ ( k ) ( n ) n ,
we analyze the measure density of the sequence { R k ( n ) } n 1 . For any interval I R , the natural density of R k ( n ) in I is defined as:
D k ( I ) = lim N 1 N # 1 n N : R k ( n ) I ,
whenever this limit exists.
  • Empirical Measure Density.
Figure 1 shows a scatter plot of R 3 ( n ) for n 10 5 . The distribution reveals a strong concentration of points within a bounded region: over 85 % of the observed values of R 3 ( n ) lie within the interval [ 1.5 , 3.0 ] . Only a sparse set of integers produce extreme values, corresponding to highly composite numbers where σ ( n ) is anomalously large. This indicates that the sequence { R 3 ( n ) } exhibits heavy clustering near small ratios and an exponentially decaying tail for large values.
  • Asymptotic Implications and Boundedness.
The scatter plot and measure density analysis suggest that { R k ( n ) } remains bounded for k 3 within observed ranges. Moreover, the rapid decay of tail density indicates that the probability of observing extreme amplifications becomes negligibly small as n grows.
Formally, if there exists a constant C k such that:
lim N 1 N # 1 n N : R k ( n ) > C k = 0 ,
then R k ( n ) is almost surely bounded in measure. Our computations provide strong empirical support for the existence of such C k , with C 3 6.5 for k = 3 .
  • Connection to Schinzel’s Conjecture.
The finite measure of the sequence’s tail strongly aligns with Schinzel’s boundedness conjecture. Since extreme values occur with asymptotically zero density, the lim inf of R k ( n ) must lie within a compact interval. This behavior, together with entropy-based tail constraints discussed earlier, suggests that the sequence { R k ( n ) } does not exhibit unbounded growth and provides further computational evidence toward the finiteness of the normalized iterates.
In summary, the measure density analysis combined with scatter plot visualization offers a compelling narrative: most values of σ ( k ) ( n ) / n remain confined within a small, bounded region, while extreme amplifications are both rare and asymptotically negligible. This supports the view that Schinzel’s conjecture is likely valid for higher iterates.

8. Implications of Our Results for Analytic Number Theory

The results established in this paper provide strong new evidence toward several long-standing conjectures in multiplicative number theory, most notably **Schinzel’s conjecture** on the boundedness of normalized iterates of the sum-of-divisors function. By integrating **entropy-based concentration** with **sieve-theoretic large-prime resets**, our framework bridges probabilistic and analytic techniques, yielding new consequences for the distribution of σ ( k ) ( n ) and related multiplicative phenomena.

8.1. Positive Evidence for Schinzel’s Conjecture

Schinzel conjectured that for each fixed k 1 , the normalized iterates
R k ( n ) = σ ( k ) ( n ) n
remain bounded as n . Classical tools, such as Grönwall’s theorem [3] and Robin’s refinement under the Riemann Hypothesis [4], control the maximal order of σ ( n ) / n , but they provide no information on the **typical distribution** or **density-one boundedness** of R k ( n ) .
Our results contribute significant new evidence:
  • Using a discretized Shannon entropy analysis inspired by Tao’s entropy–density techniques for multiplicative functions [37], we proved that if the empirical entropy H N of R k ( n ) grows more slowly than log r ( N ) , then the upper tail mass
    U N = μ N ( T , )
    satisfies the sharp decay bound
    U N exp ε log r ( N ) + O ( 1 ) ,
    forcing d ( { n : R k ( n ) T } ) = 1 . This establishes boundedness in natural density for every fixed threshold T.
  • When combined with Robin’s uniform bound under RH [4], we deduced the existence of an explicit constant T 0 = T 0 ( k , ε ) such that
    d { n : R k ( n ) T 0 } = 1 ,
    providing conditional, quantitative control over normalized iterates.
  • Crucially, sieve-theoretic results due to Goldfeld [22] and Feng–Wu [23] on the presence of large prime factors in shifted integers yield strong amplification resets: along a positive-density set of primes p,
    P + ( σ ( p ) ) = P + ( p + 1 ) ( p + 1 ) α ( α > 0 ) ,
    which restricts the growth of σ ( k ) ( n ) on these subsequences. Integrating these results into our entropy framework shows that extreme excursions of R k ( n ) occur only on a set of zero natural density.
These results strongly support the boundedness aspect of Schinzel’s conjecture, complementing both unconditional distributional theorems [2,17] and numerical evidence.

8.2. Connections to the Distribution of Multiplicative Functions

Our entropy-based approach highlights a structural parallel with recent work of Matomäki and Radziwiłł [17] on the behavior of multiplicative functions in short intervals: low entropy implies local concentration, mirroring their almost-everywhere results for bounded multiplicative statistics. In particular:
  • The entropy deficit condition ( H N ( 1 ε ) log r ( N ) ) corresponds to the local predictability of R k ( n ) , reinforcing heuristic models where σ ( k ) ( n ) behaves “almost regularly” on typical sets.
  • Combining short-interval concentration results with our entropy–tail bounds suggests that any “large spikes” in σ ( k ) ( n ) must occur on highly structured, zero-density sets.
Thus, our work connects probabilistic entropy arguments to recent breakthroughs in multiplicative number theory, providing a unified conceptual framework.

8.3. Refined Probabilistic Models and Density Laws

The entropy–tail inequality, together with the Borel–Cantelli lemma, implies that under the entropy deficit hypothesis, for any fixed threshold T > 0 ,
P R k ( n ) > T 0 as N ,
when n is chosen uniformly at random from { 1 , , N } . This strengthens classical normal-order results by Erdős–Granville–Pomerance–Spiro [2] into almost-sure convergence statements for normalized iterates, conditional on entropy bounds.
Furthermore, our empirical evidence for R 3 ( n ) up to n 10 5 confirms these predictions, showing that over 85 % of observed values lie within a compact interval and that large deviations are exceptionally rare, consistent with our theoretical framework.

8.4. Outlook

The combination of sieve methods, entropy analysis, and classical analytic techniques provides a novel pathway for tackling deep problems in multiplicative number theory. Future developments could involve:
  • Quantifying explicit entropy bounds unconditionally, using Tao’s entropy decrement framework [36].
  • Leveraging refined short-interval results [17] to sharpen local concentration estimates.
  • Extending the entropy–tail methodology to other multiplicative statistics, such as ω ( n ) or ϕ ( k ) ( n ) .
In summary, our results demonstrate that entropy-based concentration combined with sieve-theoretic resets yields strong evidence that the normalized iterates of σ are bounded on a density-one subset of integers, marking significant progress toward Schinzel’s conjecture and opening new avenues at the interface of analytic number theory, probability, and information theory.

9. Future Research Directions

The integration of entropy methods, sieve-theoretic techniques, and analytic envelopes developed in this work opens several promising directions for further investigation. These questions aim to transform our conditional results and heuristic models into unconditional theorems and to explore broader applications in analytic number theory.

9.1. Quantifying Entropy for σ ( k ) -Iterates

A central open problem is to derive unconditional bounds for the discretized entropy H N of the empirical distribution of R k ( n ) = σ ( k ) ( n ) / n . In this paper, we assumed an entropy deficit condition  H N ( 1 ε ) log r ( N ) to establish density-one boundedness. Removing or weakening this assumption is a key next step.
  • Entropy decrement methods. Recent breakthroughs by Tao [36] introduced entropy decrement arguments in the context of multiplicative correlations, demonstrating how low-entropy regimes force structural regularity. Extending these techniques to the iterated divisor-sum σ ( k ) could yield unconditional control of H N .
  • Short-interval entropy bounds. Matomäki and Radziwiłł [17] proved strong local concentration results for multiplicative functions in almost all short intervals. Combining their short-interval control with entropy-based tail bounds could yield effective uniform estimates for μ N ( ( T , ) ) , even without global assumptions.
Establishing an unconditional entropy deficit for σ ( k ) would resolve large portions of the boundedness problem for R k ( n ) .

9.2. Explicit Upper Bounds and Effective Versions

Our analysis demonstrates that entropy concentration forces { R k ( n ) } into compact sets on density-one subsets, but we have not yet derived sharp explicit constants. Future work could focus on:
  • Deriving computable thresholds T k such that d { n : R k ( n ) T k } = 1 unconditionally.
  • Combining Robin’s inequality [4] under the Riemann Hypothesis with sieve-theoretic large-prime resets [22,23] to improve the constants and the convergence rate of U N .
  • Using results on shifted prime factors (e.g., Goldfeld’s theorem [22] and its refinements) to make the analytic envelope E k ( n ) fully explicit.
Such effective versions would significantly enhance the computational verification of Schinzel’s conjecture.

9.3. Entropy–Sieve Duality for Other Arithmetic Functions

The entropy–sieve framework developed here is not limited to σ ( k ) ( n ) . Future studies could explore analogous results for other multiplicative iterates:
  • Iterates of Euler’s totient function ϕ ( k ) ( n ) , for which normal-order results exist but density-one boundedness is unproven.
  • Iterates of Carmichael’s function λ ( k ) ( n ) , where upper-tail behavior remains largely unexplored.
  • Joint distributions of ( σ ( k ) ( n ) , ϕ ( k ) ( n ) ) and their entropy profiles, extending work by Tenenbaum [5].
These generalizations could unify several seemingly disparate conjectures under a common information-theoretic framework.

9.4. Interaction with Conjectures on Prime Distributions

Our results also interact naturally with deep conjectures in prime number theory:
  • Under the Elliott–Halberstam conjecture [37] (Conjecture Section 4), sieve bounds improve drastically, implying stronger large-prime resets and thus sharper entropy-induced tail decay.
  • Combining entropy methods with Montgomery–Vaughan’s framework for prime distribution in arithmetic progressions [38] could yield hybrid unconditional–conditional results.
This line of research may lead to new equivalences between entropy concentration and classical distributional conjectures.

9.5. Numerical and Computational Aspects

Finally, extensive computational experiments can complement the theoretical program:
  • Estimating empirical entropy H N for large N and various k to test the validity of our entropy deficit hypothesis.
  • Exploring correlations between high-entropy spikes of R k ( n ) and highly composite or friable numbers, guided by the probabilistic models of Granville and Soundararajan [39].
  • Verifying refined bounds for R k ( n ) on wide numerical ranges to support or refute specific quantitative conjectures.
Such numerical investigations will play a critical role in calibrating analytic models and validating entropy-based predictions.

Summary

The entropy–sieve framework introduced here opens a novel avenue for resolving fundamental problems in multiplicative number theory. By blending analytic, probabilistic, and computational methods, future research could:
  • Prove unconditional entropy deficit theorems for σ ( k ) .
  • Derive explicit, effective bounds for R k ( n ) .
  • Extend entropy-tail techniques to other arithmetic iterates.
  • Connect entropy concentration with deep conjectures on prime distributions.
  • Integrate large-scale computations with rigorous analytic theory.
Ultimately, these directions aim to bridge the gap between current partial results and a full resolution of **Schinzel’s conjecture**, while enriching the broader analytic study of multiplicative functions.

10. Conclusion

In this work, we investigated the asymptotic behavior of the normalized iterates
R k ( n ) : = σ ( k ) ( n ) n ,
a central object in the study of multiplicative arithmetic functions and a key aspect of the boundedness problem related to Schinzel’s conjecture. Building on the classical foundations laid by Grönwall [3] and Robin [4], together with refinements by Erdős, Granville, Pomerance, and Spiro [2], we developed a hybrid framework combining tools from analytic number theory, sieve methods, information theory, and probabilistic concentration.
  • Entropy-based tail bounds. We introduced an entropy-driven perspective on the distribution of σ ( k ) ( n ) , employing discretized Shannon entropy to control the empirical upper-tail mass of R k ( n ) . Using explicit Lambert W inversions, we proved sharp inequalities of the form
    U N : = μ N ( T , ) exp ε log r ( N ) + O ( 1 ) ,
    where r ( N ) is the effective number of bins covering the lower-value region. This demonstrates that a persistent entropy deficit forces the upper tail of R k ( n ) to vanish on a density-one subset of integers, yielding unconditional boundedness in measure. Under the Riemann Hypothesis, Robin’s inequality allows us to sharpen this further: we obtain an explicit constant T 0 = T 0 ( k , ε ) such that
    d { n : R k ( n ) T 0 } = 1 .
    This result provides the strongest known evidence toward the boundedness of normalized σ -iterates along almost all integers.
  • Integration with sieve theory. We complemented our entropy bounds with sieve-theoretic results concerning large-prime “resets” in σ -iterates, following Goldfeld [22] and Feng–Wu [23]. These results show that along a positive-density subsequence of primes, the largest prime divisor of σ ( p ) satisfies
    P + ( σ ( p ) ) = P + ( p + 1 ) ( p + 1 ) α ,
    for fixed α > 0 , which constrains the amplification of σ ( k ) ( n ) and effectively shrinks its probabilistic support. Thus, sieve-induced concentration and entropy-driven concentration reinforce one another, jointly forcing the upper tail to decay.
  • Numerical evidence. Our empirical experiments, conducted for R 3 ( n ) up to n 10 5 , reveal that more than 85 % of observed values lie within the compact interval [ 1.5 , 3.0 ] , while large deviations correspond exclusively to isolated highly composite numbers. These findings are consistent with our theoretical entropy bounds and provide strong computational support for the conjectured boundedness of R k ( n ) .
  • Outlook and open problems. Several natural directions for further research emerge from our framework:
    • Sharper entropy control: A deeper understanding of entropy growth rates for σ -iterates could lead to unconditional uniform bounds for R k ( n ) .
    • Connections to short-interval phenomena: Recent breakthroughs on the distribution of multiplicative functions in short intervals [17] suggest possible refinements of our probabilistic model.
    • Effective explicit bounds: By combining entropy-tail inequalities with refined sieve techniques, one might derive explicit universal constants bounding R k ( n ) on a density-one set of integers.
  • Summary. By bridging analytic number theory, entropy methods, and sieve-theoretic concentration, this work provides both rigorous theorems and empirical evidence supporting the boundedness of normalized iterates of the sum-of-divisors function. Our results illuminate the intricate structure of σ -iterates, establish new connections between information-theoretic and number-theoretic techniques, and contribute significant progress toward understanding long-standing conjectures in multiplicative number theory.

Funding

No funding was received for this research.

Conflicts of Interest

The author declares that there is no conflict of interest regarding the publication of this article.

Appendix A. Computational Notebook and Data Availability

To facilitate reproducibility and allow readers and reviewers to verify the computational aspects of our study, we provide a fully documented Jupyter notebook associated with this work:
Title: Scatter Plot Analysis of Iterative Divisor-Sum Ratios R k ( n ) = σ ( k ) ( n ) / n

Appendix Notebook Description

The notebook scatter_Rk_density.ipynb contains Python code implementing an efficient sieve-based algorithm to compute and visualize the normalized iterative divisor-sum ratios:
R k ( n ) = σ ( k ) ( n ) n ,
where σ ( n ) denotes the sum-of-divisors function and σ ( k ) ( n ) represents its k-fold iterate.
The notebook performs large-scale empirical computations for n up to 10 5 or higher and produces a scatter plot illustrating the measure density, boundedness, and asymptotic behavior of { R k ( n ) } . This visualization supports our heuristic conclusions on the boundedness and finiteness of the normalized iterates, providing strong numerical evidence aligned with Schinzel’s conjecture.

Appendix Key Features

  • Efficient computation of σ ( n ) using a sieve-based method.
  • Iterative evaluation of σ ( k ) ( n ) for arbitrary k 1 .
  • High-resolution scatter plots of R k ( n ) for large ranges of n.
  • Generation of a -ready PDF figure for direct inclusion in research papers.

Appendix Applications

  • Empirical analysis of iterated arithmetic functions.
  • Investigation of measure density and boundedness properties.
  • Numerical exploration supporting conjectures related to divisor-sum iterates.

Appendix Accessing the Notebook

The notebook and its generated scatter plot are publicly available through Zenodo at the following DOI:
https://zenodo.org/records/16911537
Readers and reviewers are encouraged to explore the notebook to reproduce figures, test different parameter ranges, and validate the empirical findings presented in this work.

References

  1. Hubert Delange. Sur la fonction g(n)=(2)∫0exp(-x2/n)dx et la fonction somme des diviseurs. Annales scientifiques de l’Université de Clermont-Ferrand.
  2. P. Erdős, A. P. Erdős, A. Granville, C. Pomerance, and C. Spiro. On the normal behavior of the iterates of some arithmetic functions. In: B. C. Berndt, H. G. Diamond, H. Halberstam, and A. Hildebrand (eds), Analytic Number Theory, Progress in Mathematics, vol. 85, pages 165–204. Birkhäuser Boston, 1990. [CrossRef]
  3. Thomas, H. Grönwall. Some asymptotic expressions in the theory of numbers. Transactions of the American Mathematical Society 1913, 14, 113–122. [Google Scholar]
  4. Guy Robin. Grandes valeurs de la fonction somme des diviseurs et hypothèse de Riemann. Journal de Mathématiques Pures et Appliquées 1983, 62, 187–213.
  5. Gérald Tenenbaum. Introduction to Analytic and Probabilistic Number Theory, 1995.
  6. Heini Halberstam and Hans-Egon Richert. Sieve Methods, 1974.
  7. Henryk Iwaniec and Emmanuel Kowalski. Analytic Number Theory, 2004.
  8. Alina Carmen Cojocaru and, M. Ram Murty. An Introduction to Sieve Methods and Their Applications, 2005. [Google Scholar]
  9. K., Dickman. On the frequency of numbers containing prime factors of a certain relative magnitude. Arkiv för Matematik, Astronomi och Fysik 1930, 22A, 1–14. [Google Scholar]
  10. N. G. de, Bruijn. On the number of positive integers ≤x and free of prime factors >x1/u. Indagationes Mathematicae 1951, 13, 50–60. [Google Scholar] [CrossRef]
  11. A. Hildebrand and G. Tenenbaum. On integers free of large prime factors. Transactions of the American Mathematical Society 1986, 296, 265–290. [CrossRef]
  12. E. R. Canfield, P. E. R. Canfield, P. Erdős, and C. Pomerance. On a problem of Oppenheim concerning “factorisatio numerorum”. Journal of Number Theory 1983, 17, 1–28. [Google Scholar] [CrossRef]
  13. Hugh, L. Montgomery and Robert C. Vaughan. The large sieve. Mathematika 1973, 20, 119–134. [Google Scholar]
  14. Terence Tao. The Large Sieve and Bombieri-Vinogradov Theorem, /: Notes, 2015. Available online at https, 2015.
  15. Gábor, Halász. On the distribution of additive and the mean values of multiplicative functions. Studia Scientiarum Mathematicarum Hungarica 1968, 3, 211–233. [Google Scholar]
  16. Andrew Granville, Adam J. Harper, and Kannan Soundararajan. A new proof of Halász’s theorem. International Mathematics Research Notices, 3721.
  17. Kaisa Matomäki and Maksym, Radziwiłł. Multiplicative functions in short intervals. Annals of Mathematics 2016, 183, 1015–1056. [Google Scholar] [CrossRef]
  18. Helmut, Maier. On the third iterates of the φ- and σ-functions. Journal of Number Theory 1984, 19, 1–28. [Google Scholar]
  19. Paul Pollack and Carl, Pomerance. Common values of the sum-of-divisors function. American Journal of Mathematics 2020, 142, 753–780. [Google Scholar]
  20. Kannan Soundararajan and Terence Tao. Multiplicative Number Theory: The Classical Theory and Beyond, 2023.
  21. Kai Kobayashi, Paul Pollack, and Carl Pomerance. On the distribution of sociable numbers. Mathematika 2016, 62, 188–237. [Google Scholar] [CrossRef]
  22. Dorian Goldfeld. On the number of primes p for which p+a has a large prime factor. Mathematika. [CrossRef]
  23. Zhiwei Feng and Qiang, Wu. Large prime factors of shifted primes. Acta Arithmetica, 2018. [Google Scholar] [CrossRef]
  24. Shangzhi Liu, Qiang Wu, and Hui Xi. On large prime factors of p+1 for primes p. Journal of Number Theory. [CrossRef]
  25. Jianya Wang. Large prime factors of p+1 for infinitely many primes p. Proceedings of the American Mathematical Society, 2021. [CrossRef]
  26. Ritabrata Bharadwaj and Brad Rodgers. Large prime factors of values of well-distributed sequences. Transactions of the American Mathematical Society, 3567. [CrossRef]
  27. J. Barkley Rosser and Lowell Schoenfeld. Approximate formulas for some functions of prime numbers. Illinois Journal of Mathematics, 1962; 94. [CrossRef]
  28. Christian Axler. New estimates for some functions defined over primes. Integers, /: 2018. https, 2018.
  29. Andreas Weingartner. The distribution functions of σ(n)/n and n/φ(n). Proceedings of the American Mathematical Society. [CrossRef]
  30. László Tóth. A survey of the alternating sum-of-divisors function. Acta Universitatis Sapientiae, Mathematica, /: 2013. https, 2013.
  31. Kevin Ford. The distribution of integers with a divisor in a given interval. Annals of Mathematics, 2008. [CrossRef]
  32. Jean-Marie De Koninck and Florian Luca. Analytic Number Theory: Exploring the Anatomy of Integers, /: Surveys and Monographs, Vol. 134, American Mathematical Society, 2012. https, 2012.
  33. A. Hildebrand and G. Tenenbaum. On the number of positive integers ≤x without large prime factors. Journal de Théorie des Nombres de Bordeaux, /: 1993. https, 1993.
  34. G. Tenenbaum. Introduction to Analytic and Probabilistic Number Theory. Graduate Studies in Mathematics, 2015. [CrossRef]
  35. R. de la Bretèche and G. Tenenbaum. On the friable Turán–Kubilius inequality. In <italic>Analytic and Probabilistic Methods in Number Theory</italic>, E. R. de la Bretèche and G. Tenenbaum. On the friable Turán–Kubilius inequality. In Analytic and Probabilistic Methods in Number Theory, E. Manstavičius et al. (eds.), TEV, Vilnius, 2012, pp. 259–265. https://tenenb.perso.math.cnrs.fr/PPP/TK5.pdf.
  36. H. L. Montgomery and R. C. Vaughan. Multiplicative Number Theory I: Classical Theory, 2006; 97. [CrossRef]
  37. Terence Tao. The logarithmically averaged Chowla and Elliott conjectures for two-point correlations. Forum of Mathematics, Pi, /: e8, 36pp. https, 1509; e8.
  38. Terence Tao and Joni Teräväinen. Odd order cases of the logarithmically averaged Chowla conjecture. Journal de Théorie des Nombres de Bordeaux, /: 2018. https, 1015.
  39. Kaisa Matomäki, Maksym Radziwiłł, and Terence Tao. An averaged form of Chowla’s conjecture. Preprint, 2015. https://www.math.mcgill.ca/radziwill/chowla.pdf.
  40. Terence Tao. Correlations of multiplicative functions (Lecture notes). Draft monograph, 2024. https://terrytao.files.wordpress.com/2024/12/correlations-of-multiplicative-functions-draft.pdf.
  41. Cédric Pilatte. Improved bounds for the two-point logarithmic Chowla conjecture. Preprint, 2023. https://arxiv.org/abs/2310.19357.
  42. Terence Tao. The entropy decrement argument (blog series). 2015–2019. https://terrytao.wordpress.com/tag/entropy-decrement-argument/.
  43. Terence Tao. 254A, Notes 9: Second moment and entropy methods. Lecture notes, 2015. https://terrytao.wordpress.com/2015/09/18/254a-notes-9-second-moment-and-entropy-methods/.
  44. Terence Tao. Special cases of Shannon entropy. Blog post, 2017. https://terrytao.wordpress.com/2017/03/01/special-cases-of-shannon-entropy/.
  45. T. M. Cover and J. A. Thomas. Elements of Information Theory, /: Wiley, 2006. https, 2006.
  46. Terence, Tao. Sumset and inverse sumset theory for Shannon entropy. Combinatorics, Probability and Computing 2010, 19, 603–639. [Google Scholar] [CrossRef]
  47. Mokshay Madiman, Adam Marcus, and Prasad Tetali. Entropy and set cardinality inequalities for partition-determined functions. Random Structures & Algorithms 2012, 40, 399–424. [Google Scholar]
  48. Mokshay Madiman and Ioannis Kontoyiannis. Entropy bounds on abelian groups and the Ruzsa divergence. IEEE Transactions on Information Theory 2018, 64, 77–92. [Google Scholar] [CrossRef]
  49. Julia Wolf. Some applications of relative entropy in additive combinatorics. In: Combinatorial and Additive Number Theory IV, Springer Proc. Math. Stat. 347, pp. 63–90, 2020. https://link.springer.com/chapter/10.1007/978-3-030-42676-2_3.
Figure 1. Scatter plot of R 3 ( n ) = σ ( 3 ) ( n ) / n for n 10 5 . The dense clustering near small values suggests that the iterative sequence is largely concentrated within a compact region, while rare isolated spikes arise from highly composite numbers.
Figure 1. Scatter plot of R 3 ( n ) = σ ( 3 ) ( n ) / n for n 10 5 . The dense clustering near small values suggests that the iterative sequence is largely concentrated within a compact region, while rare isolated spikes arise from highly composite numbers.
Preprints 173334 g001
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

Disclaimer

Terms of Use

Privacy Policy

Privacy Settings

© 2025 MDPI (Basel, Switzerland) unless otherwise stated