1. Literature Review
The Collatz conjecture, also known as the problem, is one of the most famous unsolved problems in mathematics. It posits that for any positive integer N, repeated application of the function (division by 2 if the number is even; replacement by if odd) eventually leads to 1. This review summarizes key contributions from the listed references, focusing on historical context, theoretical advancements, computational verifications, statistical properties, and connections to binary representations and sequences. These sources provide a foundation for understanding the complexity of the conjecture, partial results, and related mathematical structures, as explored in the article on binary decomposition and uniformity of distribution.
1.1. Historical and Biographical Context
Biography of Lothar Collatz [
1]: Lothar Collatz (1910–1990) was a German mathematician known for contributions to numerical analysis. He proposed the conjecture in 1937 while working on graph theory. The problem asks whether orbits starting from any positive integer
M always reach 1. Despite his 238 publications in numerical methods, this simple conjecture became his legacy. The conjecture has been verified up to
(
) [
10], but remains open, highlighting its deceptive simplicity.
Lagarias’ Survey [
3]: This survey details generalizations of the conjecture, such as replacing 3 with other odd integers or extending to negative and zero values. It discusses equivalent formulations (e.g., the Syracuse mapping on odd integers) and open questions, such as the existence of cycles beyond the known cycle (1,4,2,1). Lagarias emphasizes computational verifications and partial proofs, setting the foundation for modern results.
1.2. Recent Theoretical Advancements
Tao’s Article [
2]: Terence Tao proves that for any function
as
, almost all orbits (in logarithmic density) have a minimum value
. This means that orbits are "almost bounded" for almost all
N, strengthening previous bounds. Using probabilistic models of Syracuse iterations and 3-adic distribution, Tao shows superpolynomial decay in characteristic functions, implying that typical orbits fall below
.
Weyl’s Theorem [
13]: On the asymptotic distribution of fractional parts
for irrational
(e.g.,
). Uniformity modulo 1 is key to the "quasirandomness" of binary digits in
.
1.3. Binary Representations of Powers of 3
MathOverflow Question [
4]: On the longest sequence of 1s (
) in binary
. Simulations up to
show
(observed maximum
), suggesting logarithmic bounds; models like coin tossing give
.
Cook’s Blog [
5]: Visualization of binary
as a grid (rows for
) with a slope boundary
; local structures and "semi-chaos" are observed.
Wolfram Research [
6]: Regularities in subsequences
, 2-adic convergence to 1; discussion of
p-adic perspective and general patterns.
1.3.1. Examples of Binary Decompositions
Here is a table with binary decompositions of for :
Table 1.
Binary representations of 3n for n = 1..10
Table 1.
Binary representations of 3n for n = 1..10
| n |
Binary Representation |
| 1 |
11 |
| 2 |
1001 |
| 3 |
11011 |
| 4 |
1010001 |
| 5 |
11110011 |
| 6 |
1011011001 |
| 7 |
100010001011 |
| 8 |
1100110100001 |
| 9 |
100110011100011 |
| 10 |
1110011010101001 |
1.4. Statistical and Probabilistic Properties
Sinai [
8]: Ergodic properties of the Syracuse mapping and statistical regularity of long orbits.
1.5. Computational Verifications and Bounds
Barina (2021) [
9]: Verification up to
, using GPU and algorithmic optimizations.
Barina (2025) [
10]: Verification up to
(
).
Krasikov–Lagarias [
11]: Bounds through difference inequalities.
1.6. Related Mathematical Theory
Allouche–Shallit [
7]: Automatic sequences and numeration systems.
Everest et al. [
12]: Recurrent sequences,
p-adic limits, connections to orbits.
1.7. Synthesis and Relevance to the Article
The references demonstrate the persistent difficulty of the problem, strong heuristics, and computational bounds [
2,
8,
9,
10]. Binary representations of
[
4,
5,
6] align with the uniformity of fractional parts [
13].
2. Introduction
We study the density of zeros in binary representations of natural numbers using fractional parts, encoding the binary structure. A framework is developed linking binary gaps, mantissas, and Collatz dynamics. In particular, the task of counting zeros in binary
remains open, see [
4,
5,
6].
3. Materials and Methods
Zeros dominate in the Collatz descent: each zero allows division by 2, outpacing the growth of . For , the sequence reduces to 1 in k steps. We decompose M into powers of two and track fractional parts at each stage to quantify the density of zeros.
3.1. Self-Correcting Dynamics of Mantissas
Mantissas induce a self-correcting mechanism, balancing 1s and 0s:
Long series of 1s increase the tail , giving small ; by recurrent formulas (Theorem 1), this requires , adding 0.
Series of 0s decrease , giving and forcing , i.e., a one.
Locally, this gives balance of blocks; globally, together with the uniformity of , an asymptotic density of for ones in binary is obtained.
4. Results
The Collatz conjecture [
1] remains open [
3], see also [
2].
Theorem 1.
Let , , , . Then
where (see Theorem 9). For ,
Proof. We start with the basic equality, which follows from the binary decomposition of the number
, where
are the positions of ones:
where
,
. From this equality, we obtain:
whence
We take the logarithm base 2:
Substitute
,
:
Expand the function
in Taylor series around
:
where
by Theorem 9. For
, similarly expand
, obtaining coefficients
from the appendix and remainder
by Theorem 10. □
Corollary 1. Let . Then:
- 1.
If , then the next bit (after the current gap) is 1 (i.e., ).
- 2.
If , then the next bit is 0 (i.e., ).
- 3.
If , then all subsequent bits (except the leading one) tend to zero.
Proof.
- 1.
From the definition of the recurrent relation for (Corollary 8), is necessary for a real and positive . This corresponds to the next bit being 1.
- 2.
For , the recurrent relation for has no solution in , so , meaning a zero bit.
- 3.
As , the tail , implying no significant terms in the tail sum, i.e., all subsequent bits (except the leading one) are zeros.
□
4.1. Direct Non-Recursive Relation for
Definition 1 (Binary Structure and Intervals). Let h be the number of ones in the binary representation of M (Hamming weight). Let the positions of ones be , and gaps for .
Theorem 2 (Direct Non-Recursive Relation).
Let satisfy
Proof. From the definition of
, we have the full tail sum from
:
The tail from
j is normalized relative to position
:
where
is the normalized tail from
j. Hence
□
Corollary 2 (Case ). , where are the positions of ones.
Proof. For
, the leading position
, and the normalized sum:
Prefix . □
Proof. The tail from position
j starts with the current one (contribution 1) plus the sum from subsequent ones, normalized by gaps relative to
:
where
. Taking
:
□
Corollary 3. Setting , we have and .
Proof. Directly follows from Theorem 3, where s is the tail sum without the leading 1. □
Lemma 1 (General Tail Decomposition).
For fixed j and any with , denote
Proof.
The second sum equals the tail from
, shifted by
:
□
Corollary 4 (Block of Ones).
If , then
Proof. With
for
, we have
for
. Then
By Lemma 1, the tail completes the expression. □
Corollary 5 (Small Tail). If , then .
Proof. Expansion
. Taylor for
, so
The coefficient comes from the exact quadratic term after normalization. □
4.2. Theorem on Maximum Number of 1s
Theorem 4 (Instability of Long Sequences of Ones).
Approximations
cannot hold simultaneously on a long block of consecutive ones. Such a block gives exponential growth of backward, conflicting with the linear decay of small s, forcing interruption of blocks of 1s by zeros.
Proof. Assume a block
for
with large
m. From the recurrent relation for
:
ignoring higher terms. Iteratively:
For small tail
after the block:
From Corollary 4 for the block:
For large
m,
, so
, but more precisely
. Then
which contradicts
. This requires interrupting the block with a gap
(insertion of zeros). □
Theorem 5. The asymptotic density of 1s in binary is at most . If is the length of the representation of , then the number of ones , and the number of zeros .
Proof. Assume the contrary, that for some
and infinitely many
n,
. Then the number of zeros
, average gap
. Let
. The fraction
f of gaps with
satisfies
, so
(constant
). This implies many series of ones. However, modulo 8:
for odd
n, 1 for even (check:
,
,
, etc.). For
ones from LSB,
, contradiction. Thus, series of ones from LSB
, i.e., the third or second bit from LSB is zero. By Theorem 6, each series of ones (even
) requires compensation by a series of zeros (
). The restriction on series at LSB propagates globally through the dynamics of
: frequent short series of 1s for excess
force frequent series of zeros, leading to local density of zeros
. Globally, the uniformity of
by [
13] ensures uniform distribution of compensations, making the overall density of zeros
, contradicting the assumption. LSB=1 fixes the end, deviations
from uniformity. The contradiction completes the proof. □
Theorem 6. A series of consecutive 1s (i.e., ) must be followed by a series of zeros (steps with ), ensuring balance of ones and zeros.
Proof. By Lemma 1 and Corollary 4, a series of 1s increases to , which decreases . To avoid going outside in backward propagation through steps, a step must occur, which "resets" according to the formulas of Theorem 1. □
4.3. Complete Proof by Series
4.4. Operators T and P: Trajectory Decomposition and Decay Estimate
Definitions.
Consider two primitive steps:
One
primitive Collatz step is either
T (when the current number is odd) or
P (when even). The composition of the pair
is conveniently viewed as one affine transformation
Intuitively: each one in the lower bits generates one step T, immediately followed by P (since is even). Therefore, series of ones correspond to blocks , and series of zeros — "pure" divisions P.
Lemma 2 (Iteration of
Blocks).
For any integer and
In particular, is monotonically increasing in x.
Proof. Induction on
m. For
, true by definition. Transition
:
The coefficient for x is positive, so is monotonic. □
Lemma 3 (Exact Decomposition by Number of
T and
P).
Let the first L primitive steps of the trajectory from contain M applications of T and Q divisions P in arbitrary order. Then there exists an integer b, , such that
Proof. Expand the composition: each step T multiplies the current value by 3 and adds 1; each step P divides by 2. Ultimately, the multiplier for is . All additions "+1" from T steps, after passing through some divisions by 2, give a sum of the form with integers . Thus, this sum is of the form for some integer . □
Lemma 4 (Connection to Bit Series).
Consider the lower L bits of the number and break them into series
where , and summarily and equal the numbers of ones and zeros among these L bits . Then in the first L steps:
Proof. Each one in the lower bits generates a step
T; immediately after it, a
P inevitably follows (since
is even). Therefore, each one accounts for at least one
pair — except possibly for end/junction effects between series (giving
). Similarly, each series of zeros is realized as consecutive "pure" divisions
P (again with error at the junction
). Summing over series gives the desired estimates and (
5). □
Theorem 7 (Finite Compression with Balanced Ones and Zeros).
Let among the lower L bits of , hold for some absolute constant . Then there exists a constant such that
In particular, since , for sufficiently large L, we have .
Proof. By (
4) and (
5)
for some constant
. Since
from
and
, we get
Substituting into (
4) and accounting for
, we obtain (
6). □
Corollary 6 (Window Length Selection). Taking and using that on typical windows the fraction of zeros is no less than the fraction of ones (balance by series: after each series of ones comes a series of zeros no shorter), we apply Theorem 7 and conclude: there exists such that for all , .
Remark 1 (Worst Growth and Its Suppression).
Maximum growth is achieved on m consecutive blocks (series of ones), where by Lemma 2 . Any subsequent series of zeros give additional divisions P (multiplier ), and the additive remainder from Lemma 3 does not exceed 1. It is precisely the no less frequent "pure" P after series of ones that ensure global compression (6).
4.5. Deterministic Window Inequality: Accounting by Number of Odd Steps
Instead of fixing the window length in advance by the total number of primitive steps L, we work with a window containing exactly M odd steps T (and, respectively, Q divisions P). This eliminates overestimation and gives a correct deterministic estimate consistent with numerical observations.
Lemma 5 (Exact Decomposition by
M and
Q).
Let in the first primitive steps from exactly M applications of T and Q divisions P occur (in any order). Then there exists an integer b with , such that
Lemma 6 (Deterministic Lower Bound on
Q).
Let M be the number of T applications (odd steps) in the considered window. Then
where the absolute constant depends only on the initial remainder and is at most 4.
Proof Idea. For odd
x, let
be the exponent of 2 in
. The table of remainders modulo 8 gives
and allows tracking transitions to the next odd:
with
and then staying in class
(where
). Hence, each
T step is accompanied by at least one division, and after entering class
— at least two. In total, this gives (
8) with a small error for the initial "entry" (at most three transitions to class 1). □
Theorem 8 (Deterministic Compression after
M Odd Steps).
There exists an absolute constant such that for any
In particular, the multiplier for decays exponentially in M, since .
Proof. From (
7) and Lemma 6:
which gives (
9). □
Corollary 7 (Translation to Estimate by Total Number of Steps).
Let in the window M odd steps and Q divisions occur (total ). By (8) we have , i.e., . Then from (9) it follows
where C depends only on . The number gives a conservative constant of average compression per primitive step.
Comments on Accuracy.
Formula (
9) is
deterministic and directly confirmed numerically: it does not attempt to estimate behavior by a fixed window length
L "forward", but speaks of compression after exactly
M odd steps, where
Q is then
counted (not roughly bounded above).
Translation (
9) to (
10) inevitably worsens the constant, since we replace exact
Q with a rough lower bound (
8). In practice, observed
Q are usually
larger than minimal, and actual compression is better than (
10).
If strengthening is needed, one can account for frequencies of classes modulo 8 on the real trajectory (for example, fix the first exit to class and then use on each odd step); this improves and gives numerically stronger constants without probabilistic assumptions.
Appendix: Details of the Linear System
5.1. Recurrence of the Fractional Part
Let
. Set
and use the domain
from Corollary 8. Then
where
strictly decrease. The fractional parts evolve according to:
where for
:
Remark 2. The case is the quadratic expansion of around , remainder such that . For , is expanded. Exact inversion for : .
Theorem 9 (Uniform Cubic Bound for
).
Let for . Its quadratic Taylor polynomial at :
and the remainder satisfies
Theorem 10 (Uniform Cubic Bound for
).
For and it holds:
where , and are given in (14).
Corollary 8 (Exact Inversion for
).
From it follows
valid for .
References
- O’Connor, J.J.; Robertson, E.F. Lothar Collatz. MacTutor History of Mathematics, University of St Andrews, 2006. Available: https://mathshistory.st-andrews.ac.uk/Biographies/Collatz/.
- Tao, T. Almost all Collatz orbits attain almost bounded values. Forum Math. Pi 2022, 10, e12. [CrossRef]
- Lagarias, J.C. The 3x+1 Problem and Its Generalizations. Amer. Math. Monthly 1985, 92, 3–23. [CrossRef]
- Sequences of 1s in binary expression of powers of 3. MathOverflow, 2024, Question 479499. Available: https://mathoverflow.net/questions/479499.
- Cook, J.D. Powers of 3 in binary. 2021. Available: https://www.johndcook.com/blog/2021/04/28/powers-of-3-in-binary/.
- Wolfram Research. Regularity versus Complexity in the Binary Representation of 3n. 2009. Available: https://wpmedia.wolfram.com/sites/13/2018/02/18-3-6.pdf.
- Allouche, J.P.; Shallit, J. Automatic Sequences: Theory, Applications, Generalizations; Cambridge University Press: Cambridge, UK, 2003.
- Sinai, Y.G. Statistical properties of the 3x+1 problem. Adv. Soviet Math. 1993, 16, 1–22.
- Barina, D. Convergence verification of the Collatz problem. J. Supercomput. 2021, 77, 2681–2688. [CrossRef]
- Barina, D. Improved verification limit for the convergence of the Collatz conjecture. J. Supercomput. 2025. [CrossRef]
- Krasikov, I.; Lagarias, J.C. Bounds for the 3x + 1 problem using difference inequalities. Acta Arith. 2003, 109, 237–258. [CrossRef]
- Everest, G.; van der Poorten, A.; Shparlinski, I.; Ward, T. Recurrence Sequences; American Mathematical Society: Providence, RI, 2007.
- Weyl, H. Über die Gleichverteilung von Zahlen mod. Eins. Math. Ann. 1916, 77, 313–352. [CrossRef]
|
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).