Preprint
Article

This version is not peer-reviewed.

Numerical Approach on Collatz Conjecture

Submitted:

23 April 2024

Posted:

25 April 2024

Read the latest preprint version here

Abstract
We can analyze the properties of the Collatz Conjecture by using Terras’ Theorem, and computational graphical calculations. To do this, we introduce the concept of “tree" which shows the relationship between the input and output positive integers of the Collatz process. This approach offers a novel way for determining the non-existence of loops and diverge trees through computer simulations.
Keywords: 
;  ;  

1. Introduction

Collatz Conjecture is if one considers the following process on an arbitrary positive integer: [3,4]
f ( n ) = n / 2 if n 0 ( mod 2 ) , ( 3 n + 1 ) if n 1 ( mod 2 ) .
In the end, the final value will eventually reach the number 1, regardless of the initial positive integer chosen. For convenience, let us refer to applying Eq. (1) on any arbitrary positive integer as applying Collatz process.

2. Theory

For the first step, let us parametrize the Eq. (1) with x = ln ( n ) , y = ln ( f ( n ) )
y = ln ( f ( n ) ) = ln ( e x / 2 ) if n 0 ( mod 2 ) , ln ( 3 e x + 1 ) if n 1 ( mod 2 ) .
y = x ln ( 2 ) if n 0 ( mod 2 ) , x + ln ( 3 ) + ln 1 + 1 3 e x if n 1 ( mod 2 ) .
There is a chain of a Collatz process if we start from an odd number,
o d d ( = a ) e v e n ( = b ) e v e n ( = c ) o d d ( = c )
If we using Eq. (3), we can rewrite Eq. (4) as,
ln ( c ) = ln ( a ) + ln ( 3 / 2 ) + ln 1 + 1 3 a
So if we call ln ( a ) as x i as ln ( c ) as x i + 1 then,
x i + 1 x i = ln ( 3 / 2 ) + ln 1 + 1 3 e x i
We can apply the exponential function to both sides of Eq. (6),
e x i + 1 e x i = 3 2 + 1 2 e x i
e x i + 1 = 3 2 e x i + 1 2
If there is a chain of Collatz processes for a list of odd numbers from n i to n j , then by using x i = ln ( n i ) ,
n j = 3 2 j i 1 n i + 1 1
Let us define the function ν 2 ( n ) as exponents of 2 after factorization of n.
n = 2 a · 3 b · . . . , ν 2 ( n ) = a
We can define m = ν 2 ( n i + 1 ) = j i 1 , and k = ν 2 ( 3 2 m n i + 1 1 ) . Then, by applying multiple Collatz processes to Eq.(9), we can obtain,
C o l 1 ( n i ) = n j + k = 3 2 m n i + 1 1 2 k
Lemma 1.
If we prove the Collatz Conjecture for all odd positive integers, it is equivalent to proving it for all positive integers.
Proof. 
Let us assume that all odd positive integers converge to 1 after a finite number of steps in the Collatz process. Then, if we have an even number x n = 2 n N for some n, let k = ν 2 ( x n ) . The number y n = x n / 2 k is odd. Therefore, x n also converges to 1 after k more steps than y n . □
According to Lemma II.1, to investigate the Collatz conjecture, we only need to examine the convergence properties of odd positive integers.
Definition 1.
The set of odd positive integers G k is defined as follows:
G k = { x n | F k 1 ( x n ) = 1 , x n = 2 n 1 }
where n and k are integers greater than 1, and G 1 = { 1 } . The properties of G k is:
x G k + 1 , y G k , g ( x ) = y
G k = { F ( x n ) | x n G k + 1 }
for all k 1
According to Definition II.2, we can categorize the set of odd numbers as follows:
Now, let us make a function that maps each odd number to its order number.
f ( n ) = n + 1 2
Here n is the odd number, and f ( n ) represents its order number. For example, f ( 5 ) = 3 , indicating that 5 is the third odd number (1, 3, 5, 7...). We can define this function using the definition in Eq. (10).
m = ν 2 ( n ) , k = ν 2 3 3 2 m n 1
C o l 2 ( n ) = 1 2 3 ( 3 2 ) m n 1 2 k + 1
If we denote n as X and C o l 2 ( n ) as g ( X ) , then we can rewrite Eq. (14) as:
g ( X ) = 2 k 1 2 k + 1 + 3 2 k + 1 3 2 m X
When we associate Table 1 with Eq. (12), we can get Table 2
If we represent Table 2 as a diagram, we can obtain Figure 1.
Let us define this tree-like diagram as a tree with a converging value x.
Definition 2.
If we link two integers with an arrow from the input to the output value using Eq. (15) for all integers, it generates the diagram, which we call a “tree", denoted as ( T , < T ) . Here, < T represents the partial ordering based on Eq. (15),
k N , F k ( a ) = b b < T a
Then tree ( T , < T ) has a several properties [5]:
(1)
< T is a well-founded partial ordering on T.
(2)
t T , the set { u T | u < T t } is linearly ordered by < T .
(3)
t , u T , there exists v T such that v T t and v T u . Here, t T t essentially means t < T t or t = t .
Definition 3.
For arbitrary tree ( T , < T ) , if x T A T x then A is called the converge value of the tree, and the tree is called “converge tree". However, if there is no such A then the tree is called “diverge tree".

3. Lemma and Theorem

3.1. Proof of Uniqueness of the Converge Tree

According to Definition II.2, when we apply the function g ( x ) to all positive odd integers, after k iterations, some values converge to 1. We can categorize these values into groups, denoted as G k . If we recursively apply g ( x ) multiple times and there exist some values of x that satisfy g k ( x ) x = 0 , then those values remain fixed throughout this recursive process.
x G k , g ( x ) = x g k ( x ) = x
Lemma 2.
For some positive integer x, to satisfy g ( x ) = x , the corresponding k values in Eq. (15) lie within a length-1 interval,
ln ( 3 / 2 ) ln 2 ( m + 1 ) < k < ln ( 3 / 2 ) ln 2 ( m + 1 ) + 1
Proof. 
When g ( x ) = x , then x should satisfy Eq. (18) according to Eq. (15).
X = 2 k 1 2 k + 1 + 3 2 k + 1 3 2 m X
X = 2 m ( 2 k 1 ) 2 k + m + 1 3 m + 1 = ( 2 k 1 ) 2 ( 2 k ( 3 / 2 ) m + 1 )
Therefore, the condition to make X as a positive integer is:
2 k > ( 3 / 2 ) m + 1 k > ln ( 3 / 2 ) ln 2 ( m + 1 )
If we want to make X as an integer, then the numerator of Eq. (19) should be divisible by the denominator of Eq. (19). Therefore, the numerator must be greater than the denominator.
2 k 1 2 ( 2 k ( 3 / 2 ) m + 1 )
k ln ( 2 ( 3 / 2 ) m + 1 1 ) ln 2 < ln ( 2 ( 3 / 2 ) m + 1 ) ln 2
k < ln ( 3 / 2 ) ln 2 ( m + 1 ) + 1
And since we know that the integer multiplication of ln ( 3 / 2 ) / ln 2 can not be an integer, k should exist within the range of two transcendental numbers Q and Q + 1 .
Q = ln ( 3 / 2 ) ln 2 ( m + 1 ) Q < k < Q + 1
So, this proves the lemma. □
Next Eq. (19) can be expressed as,
X = e v e n · o d d o d d
If we want to make X as an integer, the odd number in the numerator should be divisible by the odd number in the denominator. Therefore the odd number of the numerator must be greater than that of the denominator.
2 k 1 2 m + 1 ( 2 k ( 3 / 2 ) m + 1 )
Therefore, the available values of k should lie within the region:
k 1 ln 2 ln 3 m + 1 1 2 m + 1 1 = K
Then, we want to show that K < R in Eq. (27) and (28).
R = Q + 1 = ln ( 3 / 2 ) ln 2 ( m + 1 ) + 1
To achieve this, first let us utilizing the lemmas.
Lemma 3.
1 sin ( k A ) k d k k = 1 sin ( k A ) k
Here, A = 2 π ln ( 3 / 2 ) ln 2 L
Proof. 
First, let us define:
k k + 1 sin ( A t ) t d t = Si ( A ( k + 1 ) ) Si ( A k )
k = 1 k k + 1 sin ( A t ) t d t = Si ( ) Si ( A )
By using Taylor Series expansion, [2]
ln ( 1 z ) = k = 1 z k k
Hence, we can rewrite the right-hand side of Eq. (29) as:
k = 1 sin ( A k ) k = 1 2 i k = 1 e i A k e i A k k = 1 2 i ln ( e i ( π A ) ) = 1 2 ( π A ) 2 π π A 2 π
Therefore we want to show:
1 2 ( π A ) 2 π π A 2 π π 2 Si ( A )
Si ( A ) π A 2 π + 1 2 A 2 π
Here, A = 2 π ln ( 3 / 2 ) ln 2 L = 2 π u , L N
l ( u ) = Si ( 2 π u ) π u + 1 2 u = r ( u )
Then, for every u, there exists an x such that:
x = u , Si ( 2 π x ) π = 1 2
Then, for some N N , we have x = N + ϵ , where 0 < ϵ < 1 . Considering the periodicity properties of sin ( 2 π x ) / x , there are two available values of x for each u. Let us define that when 0 < ϵ 1 / 2 , then ϵ = α , and if 1 / 2 < ϵ < 1 , then ϵ = β .
We can categorize the region of u into three regions.
(1)
N u N + α
(2)
N + α u N + 1 2 l ( u ) 1 / 2 > r ( u )
(3)
N + 1 2 u N + 1 l ( u ) > 0 r ( u )
Figure 2. Graphical representation of regions (1) to (3) and functions l ( u ) , y ( u ) , r ( u ) . Here, the x-axis represents u when u = N .
Figure 2. Graphical representation of regions (1) to (3) and functions l ( u ) , y ( u ) , r ( u ) . Here, the x-axis represents u when u = N .
Preprints 104620 g002
In regions (2) and (3), Eq. (36) is valid. In the case of region (1), let us calculate the tangential line of l ( u ) at N + α .
y ( u ) = sin ( 2 π α ) π ( N + α ) ( u N α ) + 1 2
Therefore, when u = N , the line passes through the point,
y ( N ) = 1 2 α sin ( 2 π α ) π ( N + α ) 1 2 α π ( N + α ) > 1 2 1 2 π > 0
because 0 < α < 1 / 2 . Furthermore, Figure 3 shows that based on computer simulation, when N 1 , y ( N ) > α .
Therefore, in region (1), l ( u ) > y ( u ) > r ( u ) . Hence, Eq. (36) holds for all u, and Eq. (34) also holds. If we define:
s ( N , L ) = k = 1 N sin ( k A ) k Si ( A ( k + 1 ) ) + Si ( A k )
then we can conclude that for any L N ,
lim N s ( N , L ) 0
We can use computer simulation to double-check that s ( N , L ) converges as we increase the values of N.
Figure 5. The summation of the right-hand side of Eq. (39), which converges to positive real values, is calculated for N ranging from 1 to 300, and L = 6 .
Figure 5. The summation of the right-hand side of Eq. (39), which converges to positive real values, is calculated for N ranging from 1 to 300, and L = 6 .
Preprints 104620 g005
Figure 6. The summation of the right-hand side of Eq. (39), which converges to positive real values, is calculated for N ranging from 1 to 300, and L = 10 .
Figure 6. The summation of the right-hand side of Eq. (39), which converges to positive real values, is calculated for N ranging from 1 to 300, and L = 10 .
Preprints 104620 g006
Based on computer simulation, we can double-check that s ( N , L ) converges as N increases, as shown in Figure 7. Therefore we can get,
k = 1 sin ( k A ) k Si ( A ( k + 1 ) ) + Si ( A k ) 0
k = 1 ( Si ( A ( k + 1 ) ) Si ( A k ) ) k = 1 sin ( k A ) k
So if we utilize Eq. (30), (31), (35), and (42),
k = 1 k k + 1 sin ( A t ) t d t = 1 sin ( A t ) t d t k = 1 sin ( k A ) k
which concludes the proof. □
Lemma 4.
1 ln 2 ln 3 m + 1 1 2 m + 1 1 < ln ( 3 / 2 ) ln 2 ( m + 1 ) + 1
Proof. 
Let us set L = m + 1 , then we can simplify Eq. (44) as
1 ln 2 ln 3 L 1 2 L 1 < ln ( 3 / 2 ) ln 2 L + 1
and if we use the floor function and fractional part function notation x = x { x } [6],
ln ( 3 / 2 ) ln 2 L < 1 1 ln 2 ln 1 1 / 3 L 1 1 / 2 L
Here, we can utilize the Fourier series expansion of the fractional part function [7]:
{ x } = 1 2 1 π k = 1 sin ( 2 π k x ) k
Therefore, Eq. (46) is:
ln 1 2 1 1 / 3 L 1 1 / 2 L < ln 2 π k = 1 sin 2 π k ln ( 3 / 2 ) ln 2 L k
By using Lemma III.2, Eq. (48) holds when Eq. (49) is valid (with A = 2 π ln ( 3 / 2 ) ln 2 L ).
ln 1 2 1 1 / 3 L 1 1 / 2 L < ln 2 π 1 sin ( k A ) k d k
ln 1 2 1 1 / 3 L 1 1 / 2 L < ln 2 π π 2 Si ( A )
ln 1 1 / 3 L 1 1 / 2 L < ln 2 ln 2 π Si 2 π ln ( 3 / 2 ) ln 2 L
And we can get the range:
ln 1 1 / 3 L 1 1 / 2 L ln ( 4 / 3 )
Let us find the region that satisfies:
ln ( 4 / 3 ) < ln 2 ln 2 π Si 2 π ln ( 3 / 2 ) ln 2 L
Si 2 π ln ( 3 / 2 ) ln 2 L < ln ( 3 / 2 ) π ln 2
The available range for L is:
0 < L < 0.7757 , 0.939 < L
And L = m + 1 1 , which satisfying the condition. □
Theorem 1.
There is no positive integer point x that satisfies g ( x ) x = 0 , except for x = 1 .
Proof. 
According to Lemma III.1, III.2, and III.3, we can conclude that the available region for k is given by:
Q < k < K < R < Q + 1
which implies that X and k in Eq. (18) cannot be integer. Consequently, no available value of k exists to satisfy Eq. (18). □
Theorem 2.
There is only one converge tree, and all elements in the converge tree converge to 1.
Proof. 
Let us assume there is another converge tree with converging values a 1 . Then, since a is a converging value, we have g ( a ) = a . However, according to Lemma III.4 only 1 goes to itself, so g ( a ) is not an integer value other than a. This contradicts the assumption, thus confirming that there is only one converge tree in an integer system.
Therefore, there is only one converge tree, and if any integer is present in the converge tree, it will eventually converge to 1. □

3.2. Disproving the Existence of a Diverge Tree

Theorem 3.
There is no diverge tree.
Proof. 
The part of the proof refers to [1] (1976) by R. Terras. Let us call A 1 as the set of all available converge trees. A 2 is the set of all available sets of diverge trees.
Let us assume A 2 . Then we can select the minimum element N A 2 . For some k, if N is not a power of 2, then there always exists 2 k < N < 2 k + 1 . Thus, we can express N = 2 k + b ( b < N ). Since N is the minimum element of A 2 , we know that x < N , x A 1 . According to Theorem III.5, x A 1 converges to 1, and based on Definition 1.5 in [1], this implies that all x have a stop time except when x = 1 . Here, stop time refers to the point at which, after a finite number of steps of the Collatz process, the value goes smaller than its initial value. For example, 3 8 4 2 1 , the stop time is 3.
According to periodicity Theorem 1.2 in [1], if x b (mod 2 k ) then x has the same encoding vector as b.
(1)
When b 1 , N also has the same stop time as b.
(2)
When b = 1 , N = 2 k + 1 . If we apply the Collatz process, N N = 3 · 2 k 2 + 1 . If k 2 , N = 3 · 2 k 2 + 1 < 2 k + 1 = N . When k = 1 , N = 3 which also converges to 1 < N .
(3)
When N = 2 k , applying the Collatz process results in convergence to 1, which is 1 < N .
Therefore, in all cases, N also has a stop time. However, N A 2 , so it should not have a stop time. This leads to a contradiction, so A 2 = . □

4. Conclusion and Outlook

Theorem 4.
All positive integers converge to 1 after a finite number of steps of the Collatz process.
Proof. 
According to Theorem III.5 and III.6, in a positive integer system, there is no diverge tree and there is only one converge tree, with the converging value 1. □
In this research, we prove that all positive integers converge to 1 after a finite number of steps of the Collatz process.
As an outlook, we can investigate the continuous generative function of Eq. (15) and the periodicity of the encoding vector of the numbers in Table 2. Additionally, there is another way of proving Lemma III.3 from Eq. (48), which is equivalent to show:
L · log 2 ( 3 ) < 1 log 2 1 1 / 3 L 1 1 / 2 L
We know that log 2 ( 3 ) is a transcendental number with the value, [8,9]
log 2 ( 3 ) = 1.584962500721156181 . . .
Then, with particular values of L, we can obtain:
{ 79335 · log 2 ( 3 ) } = 0.999994712 . . . { 190537 · log 2 ( 3 ) } = 0.999999906 . . . { 64497107 · log 2 ( 3 ) } = 0.99999998 . . .
Thus, to show the validity of Eq. (55) is the same as to show how quickly its right-hand side converges to 1 compared to { L · log 2 ( 3 ) } . Moreover, besides this approach, various other methods such as cycle analysis [10] or bounding functions [11,12] are employed in the investigation of the Collatz conjecture. Exploring potential relations between these methods and the current result could be a relevant next step.

References

  1. Riho Terras, “A stopping time problem on the positive integers", ACTA ARITIMETICA (1976). http://matwbn.icm.edu.pl/ksiazki/aa/aa30/aa3034.pdf.
  2. Eric Roosendaal, “The Terras Theorem", http://www.ericr.nl/wondrous/terras.html.
  3. Jeffrey C. Lagarias, “The 3x + 1 problem and its generalizations". The American Mathematical Monthly. 92 (1): 3–23, (1985). [CrossRef]
  4. Jeffrey C. Lagarias, “The 3x+1 Problem: An Overview", 2021, https://arxiv.org/abs/2111.02635.
  5. Spencer Unger, “Trees in set Theory", https://www.math.toronto.edu/sunger/cmu/TreesTalk.pdf.
  6. R. L. Graham, D. E. Knuth, O. Patashnik, “Concrete Mathematics", Addison-Wesley (1994), p. 70.
  7. E. C. Titchmarsh, D. R. Heath-Brown, “The Theory of the Riemann Zeta-function" (2nd ed.), Oxford U. P. (1986), p. 15, Eq. 2.1.7.
  8. OEIS, “Decimal expansion of log_2(3)", https://oeis.org/A020857.
  9. Simon Plouffe, “log(3)/log(2) to 10000 digits", https://www.plouffe.fr/simon/constants/log3log2.txt.
  10. Shalom Eliahou, “Le problème 3n+1: y a-t-il des cycles non triviaux? (III)", Images des Maths, CNRS (2011).
  11. Terence Tao, “Almost all orbits of the Collatz map attain almost bounded values.", Forum of Mathematics (2022). [CrossRef]
  12. Ilia Krasikov, Jeffrey C. Lagarias “Bounds for the 3x+1 problem using difference inequalities". Acta Arith., 109, 237–258. (2003). [CrossRef]
Figure 1. A diagram expression of Table 2 with directed arrows.
Figure 1. A diagram expression of Table 2 with directed arrows.
Preprints 104620 g001
Figure 3. The results of y ( N ) α in the region where 0 < α < 1 / 2 . Except for the case N = 0 , all other graphs yield positive results across all α regions.
Figure 3. The results of y ( N ) α in the region where 0 < α < 1 / 2 . Except for the case N = 0 , all other graphs yield positive results across all α regions.
Preprints 104620 g003
Figure 4. The summation of the right-hand side of Eq. (39), which converges to positive real values, is calculated for N ranging from 1 to 300, and L = 1 .
Figure 4. The summation of the right-hand side of Eq. (39), which converges to positive real values, is calculated for N ranging from 1 to 300, and L = 1 .
Preprints 104620 g004
Figure 7. The summation of the right-hand side of Eq. (39), which converges to positive real values, is calculated for L ranging from 1 to 1000, and N = 2000 .
Figure 7. The summation of the right-hand side of Eq. (39), which converges to positive real values, is calculated for L ranging from 1 to 1000, and N = 2000 .
Preprints 104620 g007
Table 1. The categorization of odd numbers, G k (eg. G 1 to G 23 ).
Table 1. The categorization of odd numbers, G k (eg. G 1 to G 23 ).
G 1 1
G 2 3, 5, 21, 85, 151, 227, 341...
G 3 13, 15, 23, 35, 53, 75, 113...
G 4 7, 11, 17, 61, 69, 93, 141...
G 5 9, 19, 29, 37, 45, 81, 117...
G 6 25, 49, 51, 67, 77, 87, 101...
G 7 33, 39, 43, 59, 65, 79, 89...
G 8 57, 105, 115, 157, 173, 177, 211...
G 9 127, 135, 139, 153, 159, 187, 191...
G 10 123, 169, 185, 247, 249, 283, 339...
G 11 111, 167, 219, 223, 225, 251, 295, 329...
G 12 297, 351, 393, 445, 527, 585, 595...
G 13 175, 263, 395, 519, 593, 615, 699...
G 14 103, 155, 207, 233, 311, 467, 701...
G 15 91, 137, 183, 275, 413, 549, 621...
G 16 63, 95, 121, 143, 215, 243, 323...
G 17 31, 47, 71, 107, 161, 253, 381...
G 18 27, 41, 55, 83, 125, 165, 189...
G 19 73, 109, 147, 195, 199, 221, 291...
G 20 97, 145, 259, 265, 387, 389, 471...
G 21 129, 193, 235, 240, 343, 345, 353...
G 22 171, 240, 257, 313, 415, 427, 457...
G 23 379, 417, 427, 543, 553, 569, 609...
Table 2. Part of the reparameterizing of Table 1 based on Eq.(12) (eg. from G 1 to G 10 ).
Table 2. Part of the reparameterizing of Table 1 based on Eq.(12) (eg. from G 1 to G 10 ).
G 1 1
G 2 2, 3, 11, 43, 76, 114, 171...
G 3 7, 8, 12, 18, 27, 38, 57...
G 4 4, 6, 9, 31, 35, 47, 72...
G 5 5, 10, 15, 19, 23, 41, 59...
G 6 13, 25, 26, 34, 39, 44, 51...
G 7 17, 20, 22, 30, 33, 40, 45...
G 8 29, 53, 58, 79, 87, 89, 106...
G 9 64, 68, 70, 77, 80, 94, 96...
G 10 62, 85, 93, 124, 125, 142, 170...
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

Disclaimer

Terms of Use

Privacy Policy

Privacy Settings

© 2025 MDPI (Basel, Switzerland) unless otherwise stated