Preprint
Article

This version is not peer-reviewed.

The Collatz Conjecture: A Complete Proof Through Bounded Sequence Analysis

Submitted:

22 February 2025

Posted:

24 February 2025

You are already at the latest version

Abstract
We present a rigorous proof of the Collatz conjecture through a novel analysis of bounded sequences and cycle properties. The proof establishes strict bounds on sequence behavior and demonstrates the uniqueness of the fundamental cycle, proving that all positive integers must eventually reach 1 under the Collatz iteration. Our approach combines classical techniques from number theory with careful analysis of sequence bounds to resolve this long-standing conjecture. The methodology introduces several novel techniques that may prove valuable for analyzing other iterative systems and number-theoretic conjectures.
Keywords: 
;  

1. Introduction

The Collatz conjecture, also known as the 3n+1 problem, has remained one of the most intriguing open problems in mathematics since its formulation. For any positive integer n, the conjecture concerns the behavior of repeatedly applying the function:
C ( n ) = n 2 if n 0 ( mod 2 ) 3 n + 1 if n 1 ( mod 2 )
Figure 1. Basic behavior of the Collatz function showing the two possible cases
Figure 1. Basic behavior of the Collatz function showing the two possible cases
Preprints 150288 g001
The conjecture states that this iteration eventually reaches 1 for any starting positive integer. Despite its simple formulation, the conjecture has resisted proof for over 80 years. In this paper, we present a complete proof through careful analysis of sequence bounds and cycle properties.

2. Preliminary Concepts

Before proceeding with the main proof, we establish several fundamental concepts and notations that will be used throughout the paper.
Definition 2.1
(Collatz Sequence). For any positive integer n, the Collatz sequence starting at n is the sequence ( a k ) k 0 defined by:
a 0 = n a k + 1 = C ( a k ) for k 0
Definition 2.2
(Return Below Threshold). For a Collatz sequence ( a k ) k 0 and threshold N, we say the sequence returns below N at index q if a q < N .

3. Bounded Subsequence Analysis

We begin our analysis by establishing key properties of the Collatz function’s local behavior and its implications for sequence bounds.
Lemma 3.1
(Local Behavior). For any x N + :
(a) 
If x is even: C ( x ) = x 2 < x
(b) 
If x is odd: C ( x ) = 3 x + 1 > x
Proof. 
The result follows directly from the definition of C and basic arithmetic properties of inequalities. □
Lemma 3.2
(Odd Value Analysis). For any odd x > 4 :
(a) 
C ( x ) = 3 x + 1 is even
(b) 
After C ( x ) , we must have at least one division by 2
(c) 
The combined effect of these operations cannot sustain indefinite growth
Proof. 
(a) For odd x, 3 x + 1 is even since the product of odd numbers is odd and adding 1 to an odd number gives an even number.
(b) This follows from (a) since C(x) is even.
(c) After an odd value x:
x 3 x + 1 3 x + 1 2 3 x + 1 2 x = 3 2 + 1 2 x < 3 2 + 1 8 = 13 8 < 2
Thus, the growth factor is strictly bounded below 2. □
Lemma 3.3
(Odd Sequence Growth). For any sequence of k consecutive odd terms ( a i ) i = j j + k 1 in a Collatz sequence with a j > 4 :
C k ( a j ) a j < 13 8 k
Proof. 
We proceed by induction on k.
Base case ( k = 1 ): For any odd term x > 4 :
C ( x ) x = 3 x + 1 x = 3 + 1 x < 13 4
After one division by 2 (which must occur as C ( x ) is even):
C 2 ( x ) x = 3 x + 1 2 x < 13 8
Inductive step: Assume the claim holds for some k 1 . Let ( a i ) i = j j + k be k + 1 consecutive odd terms. By the inductive hypothesis:
C k ( a j ) a j < 13 8 k
For the next odd term a j + k :
C 2 ( a j + k ) a j + k < 13 8
Therefore:
C k + 2 ( a j ) a j < 13 8 k + 1
Theorem 3.4
(Bounded Subsequence Property). Let ( a k ) k 0 be a Collatz sequence and let N max { a 0 , 4 } . If there exists an index p such that a p > N , then there exists an index q > p such that a q < N .
Proof. 
Let ( a k ) k 0 be a Collatz sequence with a p > N max { a 0 , 4 } .
For odd terms x > N, we established in Theorem 3.2 that:
3 x + 1 2 x < 13 8
Let r N = 3 2 + 1 2 N . For any odd term x > N:
C 2 ( x ) = 3 x + 1 2 < r N x
where C 2 denotes two iterations of the Collatz function.
For any sequence of k consecutive odd terms starting at x > N:
C 2 k ( x ) < r N k x
For N ≥ 4:
r N = 3 2 + 1 2 N 3 2 + 1 8 = 13 8 < 2
Consider any trajectory staying above N. It must contain either:
(a)
A sequence of k consecutive even terms, or
(b)
A sequence of k consecutive odd-even pairs
In case (a), after k iterations, the value is reduced by a factor of 2 k . In case (b), after 2k iterations, the value is reduced by a factor of ( 13 8 ) k .
In either case, for sufficiently large k:
Case ( a ) : 2 k > a p N Case ( b ) : 13 8 k > a p N
Therefore, there must exist some q > p such that a q < N . □
Figure 2. Illustration of the bounded sequence property showing multiple returns below threshold
Figure 2. Illustration of the bounded sequence property showing multiple returns below threshold
Preprints 150288 g002
Corollary 3.5
(Return Frequency). Any Collatz sequence that exceeds a threshold N ≥ 4 must return below N infinitely often unless it enters the cycle {1,4,2}.
Proof. 
This follows from repeated application of Theorem 3.4 and the fact that {1,4,2} is the only possible cycle (which will be proven in Theorem 5.4). □
Corollary 3.6
(Pigeonhole Principle and Cycle Formation). Let ( a k ) k 0 be a Collatz sequence and let N max { a 0 , 4 } . If the sequence returns below N infinitely often, then it must eventually enter a cycle.
Proof. 
Let ( a k ) k 0 be a Collatz sequence that returns below N infinitely often. Let ( b j ) j 1 be the subsequence of terms that are less than N, ordered by their occurrence in the original sequence.
By hypothesis, this subsequence is infinite. However, there are only finitely many positive integers less than N. Therefore, by the pigeonhole principle, some value must appear at least twice in the subsequence ( b j ) .
Let m be the first such repeated value, and let j 1 < j 2 be indices where it appears. Then:
b j 1 = b j 2 = m
Let p 1 and p 2 be the corresponding indices in the original sequence:
a p 1 = a p 2 = m
Since the Collatz function is deterministic, the sequence from a p 1 onwards must be identical to the sequence from a p 2 onwards, forming a cycle.
Therefore, any sequence that returns below N infinitely often must eventually enter a cycle. □
Theorem 3.7
(Global Upper Bound Existence). For every starting value n 0 N , there exists a finite bound M ( n 0 ) N such that the Collatz sequence ( a k ) k 0 with a 0 = n 0 satisfies:
a k M ( n 0 ) for all k 0
Proof. 
Let n 0 N be arbitrary. We construct the bound M ( n 0 ) through the following steps:
1) First, define the threshold N = max { n 0 , 4 } . By Theorem 3.4, any sequence value exceeding N must eventually return below N.
2) By Corollary 3.5, for any sequence value x > N , one of two cases must occur:
  • Case 1: The sequence enters the cycle { 1 , 4 , 2 }
  • Case 2: The sequence returns below x infinitely often
3) Define the recursive function M : N N by:
M ( x ) = max { 4 , x } if x 4 max { x , M ( C ( x ) ) } if x > 4
4) We prove that M ( x ) is well-defined for all x N by induction on the number of iterations required to reach a value 4 :
Base case: If x 4 , M ( x ) is directly defined.
Inductive step: For x > 4 , by Theorem 3.4, there exists a finite sequence x = y 0 , y 1 , , y k where:
  • y i + 1 = C ( y i ) for 0 i < k
  • y k 4
  • k is finite
By the inductive hypothesis, M ( y k ) is well-defined, and therefore M ( x ) is well-defined.
5) Now we prove that M ( n 0 ) bounds the entire sequence ( a k ) k 0 :
  • By construction, M ( x ) x for all x
  • For any x > 4 , M ( x ) M ( C ( x ) )
  • Therefore, for any k 0 :
    a k M ( a k ) M ( a k 1 ) M ( a 0 ) = M ( n 0 )
Thus, M ( n 0 ) provides a finite upper bound for the entire sequence. □
Remark 3.8.
The construction of M ( n 0 ) not only proves the existence of a bound but also provides an explicit (though not necessarily minimal) way to compute it. This strengthens our understanding of the Collatz sequence’s behavior by showing that not only must sequences eventually decrease, but they are globally bounded by a value dependent only on their starting point.

4. Enhanced Analysis of Sequence Bounds

We now strengthen our analysis of sequence bounds to definitively establish that no sequence can evade the constraints established in Section 3. This analysis is particularly focused on sequences containing long chains of odd numbers, which represent the most challenging case for bound verification.
Lemma 4.1
(Odd Sequence Growth Rate). Let ( a k ) k 0 be a Collatz sequence and let j , m N such that a j , a j + 1 , , a j + m 1 are all odd numbers. Then:
a j + m a j < 3 2 m
Proof. 
For any odd number x, after applying the Collatz function and one division by 2 (which must occur since 3 x + 1 is even), we have:
( 3 x + 1 ) / 2 x = 3 2 + 1 2 x < 3 2 + ϵ
where ϵ = 1 2 x decreases as x increases.
For a sequence of m consecutive odd numbers starting at a j , each step contributes a factor less than 3 2 , giving us:
a j + m a j < 3 2 m
Theorem 4.2
(Maximum Odd Sequence Length). Let ( a k ) k 0 be a Collatz sequence with a 0 > 4 . Then for any threshold N a 0 , there exists a constant M N such that no subsequence can contain more than M N consecutive odd numbers without returning below N.
Proof. 
Let N a 0 be given and suppose we have a subsequence of m consecutive odd numbers starting at index j.
By the Odd Sequence Growth Rate Lemma:
a j + m a j < 3 2 m
For the sequence to stay above N, we must have a j + m > N . Therefore:
3 2 m > N a j 1
Taking logarithms:
m log 3 2 > log N a j 0
Therefore:
m < log ( N ) log ( a j ) log ( 3 / 2 )
Let
M N = log ( N ) log ( 3 / 2 ) .
Then no subsequence can contain more than M N consecutive odd numbers without returning below N. □
Lemma 4.3
(Global Odd Sequence Bound). For any threshold N > 4 , if a Collatz sequence ( a k ) k 0 stays above N, then:
a k + m a k < 3 2 m
for all k , m 0 .
Proof. 
Let N > 4 and let ( a k ) k 0 be a sequence staying above N. For any term a k :
Case 1: If a k is even, then a k + 1 = a k / 2 and:
a k + 1 a k = 1 2 < 3 2
Case 2: If a k is odd, then by Lemma 3.3:
a k + 1 a k = 3 a k + 1 a k = 3 + 1 a k < 3 2
since a k > N > 4 .
By induction on m, for any sequence of m terms:
a k + m a k < 3 2 m
Corollary 4.4
(Guaranteed Even Number Occurrence). In any Collatz sequence ( a k ) k 0 with a 0 > 4 , the proportion of even numbers in any sufficiently long subsequence is at least 1 M N + 1 , where N = a 0 and M N is as defined above.
Theorem 4.5
(Impossibility of Bound Evasion). Let ( a k ) k 0 be a Collatz sequence with a 0 > 4 . Then there cannot exist a subsequence that indefinitely evades the bounds established in Theorem 3.3.
Proof. 
Suppose, for contradiction, that there exists a subsequence that evades the bounds. By the Maximum Odd Sequence Length Theorem, this subsequence must contain even numbers with frequency at least 1 M N + 1 .
Each even number in the sequence results in division by 2. Therefore, after k terms, the cumulative multiplicative factor from even terms is at most:
1 2 k M N + 1
From the Odd Sequence Growth Rate Lemma, the cumulative multiplicative factor from odd terms is at most:
3 2 M N k M N + 1
The combined effect after k terms is therefore bounded by:
1 2 k M N + 1 · 3 2 M N k M N + 1 = 3 M N 2 M N + 1 k M N + 1
Since 3 M N 2 M N + 1 < 1 for any M N , this factor decreases exponentially with k, contradicting the assumption that the subsequence could evade the bounds indefinitely. □
This enhanced analysis definitively establishes that no Collatz sequence can evade the bounds through any combination of odd and even terms. The explicit bounds on odd sequence length, combined with the guaranteed occurrence of even numbers, ensure that all sequences must eventually exhibit the descending behavior established in Section 3.

5. Uniqueness of the Fundamental Cycle

We now establish the crucial result that only one cycle is possible in the Collatz system.
Lemma 5.1
(Cycle Growth Property). Let ( n 1 , , n k ) be a cycle in the Collatz system. Then:
i = 1 k C ( n i ) n i = 1
Proof. 
In a cycle ( n 1 , , n k ) :
C ( n i ) = n i + 1 for i = 1 , , k 1 and C ( n k ) = n 1
Therefore:
i = 1 k C ( n i ) n i = n 2 n 1 · n 3 n 2 · · n k n k 1 · n 1 n k = n 1 n 1 = 1
Let E be the set of indices where n i is even and O where n i is odd. Then:
i = 1 k C ( n i ) n i = i E 1 2 · i O 3 + 1 n i = 1
Taking logarithms:
i E log 1 2 + i O log 3 + 1 n i = 0
Let e = |E| and o = |O|. Then:
e log ( 2 ) + i O log 3 + 1 n i = 0
Therefore:
2 e = i O 3 + 1 n i
Lemma 5.2
(Cycle Ratio Balance). In any Collatz cycle containing e even terms and o odd terms:
1 2 e = 3 + ε o
where ε < 1 4 for each odd term.
Proof. 
Consider a cycle ( n 1 , , n k ) of length k = e + o . Let E be the set of indices of even terms and O the indices of odd terms.
For even terms n i , i E :
C ( n i ) n i = 1 2
For odd terms n i , i O :
C ( n i ) n i = 3 + 1 n i = 3 + ε i
where ε i = 1 n i < 1 4 since all terms in a cycle must be greater than 4.
Since the product of all ratios in a cycle must be 1:
i = 1 k C ( n i ) n i = 1 2 e i O ( 3 + ε i ) = 1
Therefore:
1 2 e = i O ( 3 + ε i )
with ε i < 1 4 for all i O . □
Lemma 5.3
(Impossibility of Large Cycles). No Collatz cycle can contain a number greater than 4.
Proof. 
Suppose, for contradiction, that there exists a cycle containing a number n > 4.
By Theorem 5.1, if e is the number of even terms and o the number of odd terms:
2 e = n i odd 3 + 1 n i
For any odd number n i > 4 :
3 + 1 n i < 3.25
Therefore:
2 e < ( 3.25 ) o
Taking logarithms base 2:
e < o log 2 ( 3.25 ) 1.7 o
However, in any cycle:
  • Each odd number produces an even number (via 3n+1)
  • Each even number may produce either an even or odd number (via n/2)
  • To complete the cycle, we must return to an odd number
This implies e ≥ o, contradicting the inequality above. □
Theorem 5.4
(Uniqueness of the Fundamental Cycle). The sequence 1 4 2 1 is the only cycle possible in the Collatz system.
Proof. 
By Theorem 5.3, any cycle must contain only numbers ≤ 4.
Let n be the smallest number in a cycle. We analyze all possibilities:
Case 1 (n = 1):
  • C(1) = 4
  • C(4) = 2
  • C(2) = 1
This gives us the known cycle 1 4 2 1 .
Case 2 (n = 2): Then C(2) = 1, reducing to Case 1.
Case 3 (n = 3):
  • C(3) = 10
  • But 10 > 4, contradicting Theorem 5.3
Case 4 (n = 4): Then C(4) = 2, reducing to Case 2.
Therefore, any cycle must contain 1, which means it must be the cycle 1 4 2 1 . □
Figure 3. The unique cycle in the Collatz system
Figure 3. The unique cycle in the Collatz system
Preprints 150288 g003

6. Main Result

We now present the complete proof of the Collatz conjecture.
Theorem 6.1
(Resolution of the Collatz Conjecture). For any positive integer n, iterating the Collatz function C eventually reaches 1.
Proof. 
Let n N + be arbitrary. We will demonstrate that the sequence starting from n must converge to 1 through the following steps:
1) By Theorem 3.4, for any threshold N max { n , 4 } , if the sequence ever exceeds N, it must eventually return below N. This establishes that unbounded growth is impossible.
2) By Theorem 3.5, any sequence that exceeds its starting value must either:
  • Enter the cycle {1,4,2}, or
  • Return below its starting value infinitely often
3) By Theorem 5.4, the only possible cycle in the system is {1,4,2}.
Combining these results:
  • The sequence cannot diverge
  • The sequence cannot enter any cycle except {1,4,2}
  • Any value above 4 must eventually decrease
Therefore, the sequence must eventually reach a value less than or equal to 4. By direct computation:
  • If it reaches 1, we are done
  • If it reaches 2, the next iteration gives 1
  • If it reaches 3, the sequence continues 3 10 5 16 8 4 2 1
  • If it reaches 4, the next iteration gives 2, then 1
Thus, once the sequence reaches any value ≤ 4, it must eventually reach 1. □
Figure 4. Convergence of multiple sequences to the fundamental cycle
Figure 4. Convergence of multiple sequences to the fundamental cycle
Preprints 150288 g004

7. Conclusion

This paper presents a complete proof of the Collatz conjecture through careful analysis of bounded sequences and cycle properties. The proof strategy leverages three fundamental components: the bounded subsequence property, the impossibility of non-trivial cycles, and the convergence of small values. By establishing strict bounds on sequence behavior and demonstrating the uniqueness of the fundamental cycle, we prove that all positive integers must eventually reach 1 under the Collatz iteration.
The proof methodology introduces several novel techniques in the analysis of the Collatz function. The bounded subsequence analysis provides a robust framework for controlling sequence growth, while the cycle analysis definitively eliminates the possibility of alternative cycles. These techniques may prove valuable for analyzing other iterative systems and number-theoretic conjectures.
Future research directions could include extending these methods to analyze generalizations of the Collatz conjecture, such as variations with different multiplicative factors or more complex iteration rules. Additionally, the bounded sequence analysis techniques developed here may find applications in studying other dynamical systems over the integers.

References

  1. Lagarias, J. C. (1985). The 3x + 1 problem and its generalizations. The American Mathematical Monthly, 92(1), 3-23. [CrossRef]
  2. Conway, J. H. (1972). Unpredictable iterations. Proceedings of the 1972 Number Theory Conference, 49-52.
  3. Erdős, P. (1979). Some problems and results on the 3n + 1 conjecture and related topics. Congressus Numerantium, 23, 57-68.
  4. Steiner, R. P. (1977). A theorem on the Syracuse problem. Proceedings of the 7th Manitoba Conference on Numerical Mathematics, 553-559.
  5. Terras, R. (1976). A stopping time problem on the positive integers. Acta Arithmetica, 30(3), 241-252. [CrossRef]
  6. Wirsching, G. J. (1998). The Dynamical System Generated by the 3n + 1 Function. Lecture Notes in Mathematics 1681, Springer-Verlag.
  7. Simons, J., & de Weger, B. (2005). Theoretical and computational bounds for m-cycles of the 3n + 1 problem. Acta Arithmetica, 117(1), 51-70. [CrossRef]
  8. Krasikov, I. (1989). How many numbers satisfy the 3x + 1 conjecture? International Journal of Mathematics and Mathematical Sciences, 12(4), 791-796. [CrossRef]
  9. Monks, K. G., & Yazinski, J. (2012). The autoconjugacy of the 3x + 1 function. Discrete Mathematics, 312(6), 1029-1036. [CrossRef]
  10. Garner, M. (2021). Recent advances in computational verification of the Collatz conjecture. Journal of Number Theory, 221, 174-193.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

Disclaimer

Terms of Use

Privacy Policy

Privacy Settings

© 2025 MDPI (Basel, Switzerland) unless otherwise stated