Preprint
Article

This version is not peer-reviewed.

The Collatz Conjecture: Binary Structure Analysis and Trajectory Behavior

Submitted:

20 October 2025

Posted:

22 October 2025

Read the latest preprint version here

Abstract
Background: The Collatz conjecture, proposed in 1937, asserts that iteration of the map T(n) = n/2 if n even, 3n + 1 if n odd, reaches 1 for every positive integer n. Verified computationally to n < 268, it remains unproven. This study analyzes the conjecture through binary structure, relating the fractional part {log2 n} to zero density z(n) and v2(3n + 1). Methods: We derive a recurrence relation for fractional parts in binary expansions and analyze block structures using linear systems. For sparse binary numbers (z(n) ≥ n/2), we prove strict trajectory decrease in O(log n) steps. Results: Theorem 1 establishes fractional part recurrence. Theorem 2 proves ≥ 50% zero density in 3n when 1 − {α} > 0.55. Theorem 3 shows strict decrease for sparse binaries. Theorem 4 verifies the conjecture for the explicit subclass {an = ∑i n =0 γi2i | n > 1000, z(an) ≥ n/2}, comprising ∼ 2n/2 numbers of length n. Conclusions: The fractional part approach yields new structural insights, confirming the conjecture for a significant explicit subclass while the general case remains open.
Keywords: 
;  ;  ;  ;  

1. Introduction

The Collatz conjecture, formulated in 1937, states that for any positive integer n, iteration of the map
T ( n ) = n 2 , n even , 3 n + 1 , n odd
eventually reaches 1. Verified computationally to n < 2 68 [1], no general proof exists. As of 2025, the conjecture remains open, with recent explorations examining variants and potential independence from ZFC [2,4].
This paper analyzes the conjecture through binary representations. We relate the fractional part { log 2 n } to zero density z ( n ) , which governs v 2 ( 3 n + 1 ) and contraction rates. Our contributions are:
  • Theorem 1: Recurrence for fractional parts in binary expansions.
  • Theorem 2: 50 % zeros in 3 n when 1 { α } > 0.55 .
  • Theorem 3: Strict decrease for sparse binaries.
  • Theorem 4: Conjecture verified for explicit subclass.
The approach is inspired by fractional part properties in the Riemann zeta function [5].

2. Materials and Methods

2.1. Binary Structure Analysis

For n N , define:
  • L ( n ) = log 2 n + 1 (binary length),
  • w ( n ) (Hamming weight), z ( n ) = L ( n ) w ( n ) (zeros),
  • v 2 ( m ) = max { k 0 : 2 k m } ( v 2 -adic valuation).
Lemma 1.
L ( n ) = log 2 n + 1 , { log 2 n } = log 2 n ( L ( n ) 1 ) .
Proof. 
2 L ( n ) 1 n < 2 L ( n ) implies the result. □
The full Collatz step is T * ( n ) = ( 3 n + 1 ) / 2 v 2 ( 3 n + 1 ) .
Lemma 2.
If v 2 ( 3 n + 1 ) 2 , then T * ( n ) < n ; if 3 , then T * ( n ) n / 2 .
Proof. 
T * ( n ) / n = ( 3 + 1 / n ) / 2 v 2 ( 3 n + 1 ) . For v 2 2 , 7 / 8 < 1 ; for v 2 3 , 7 / 16 < 1 / 2 . □
Lemma 3.
lim N # { n N : v 2 ( 3 n + 1 ) = t } / N = 2 t .
Proof. 
The congruence 3 n + 1 0 ( mod 2 t ) has a unique solution modulo 2 t , which can be lifted to higher powers using Hensel’s lemma for linear congruences, and exactly half are not divisible by 2 t + 1 . □

2.2. Notation

Let ϵ j = { α j } , σ j = 1 ϵ j , and δ j = α j α j + 1 > 0 .

3. Results

Theorem 1.
For M = i = 1 j 1 2 α i + 2 α j = i = 1 j 2 α i + 2 α j + 1 , ϵ 1 < 0.45 :
If δ j = 1 :
σ j = 1 2 σ j + 1 1 σ j + 1 ln 2 2 + F j σ j + 1 3 12 ,
If δ j > 1 :
σ j = 2 δ j σ j + 1 + 1 2 δ j 2 2 δ j + 1 ln 2 2 2 δ j σ j + 1 2 ln 2 4 + 2 2 δ j R j ( ln 2 ) 2 σ j + 1 3 8 ,
with | F j ( x ) | , | R j ( x ) | | x | .
Proof. 
From 2 1 σ j = 1 + 2 1 δ j σ j + 1 , take ln and expand using Taylor series for ln ( 1 + y ) and exp ( σ j + 1 ln 2 ) . Remainders are cubic O ( σ 3 ) . □
Theorem 2.
Let M = 3 n = i = 1 n * γ i 2 i , n * = n ln 3 / ln 2 , 1 { α } > 0.55 . Then
γ i = 0 1 n * 2 O ( log n ) .
Proof. 
Using Theorem 1, blocks of 1’s ( δ j = 1 ) have length 3 . The 5 × 5 system
A x = b , A k , k 1 = 2 1 / 2 . 7071
shows σ i + 4 > 0.55 , forcing δ i + 4 > 1 (zero). Thus, 25 % zeros per 4 bits, refined to 50 % asymptotically. □
Theorem 3.
For a n = i = 0 n γ i 2 i , n > 1000 , exists j * < 10 log n with a 4 n j * < a n .
Proof. 
a 2 n = 3 m 2 n a n + B n , where
B n j = 0 m 1 3 j 2 n j 3 m 2 n j = 0 m 1 2 3 j < 3 m + 1 2 n .
By Theorem 2, m n / 2 + O ( log n ) . Over 3 n j * :
a 4 n j * = 3 m + m * 2 3 n j * a n + O 3 n 2 3 n .
( 3 / 8 ) n n O ( 1 ) < 1 for n > 1000 . □
Theorem 4.
(Subclass Verification). Theorem 3 implies the Collatz conjecture holds for { a n n > 1000 , z ( a n ) n / 2 } .
Proof. 
Iterate strict decreases to cycle { 4 , 2 , 1 } . □

4. Discussion

The subclass comprises 2 n / 2 numbers of length n, non-trivial and explicit. Zero density 1 / 2 guarantees v 2 ( 3 n + 1 ) 2 frequently (Lemma 3), ensuring contraction. The fractional part condition 1 { α } > 0.55 holds for 45 % of n by equidistribution.
Variants like 7n+1 sequences diverge [4], highlighting the conjecture’s depth. Future work: tighten O ( log n ) bounds, extend to z ( n ) 0.4 n .

5. Conclusions

We verified the Collatz conjecture for an explicit, infinite subclass via binary structure analysis. The fractional part approach yields new structural insights.

Author Contributions

Conceptualization, A. A. Durmagambetov and A. A. Durmagambetova; methodology, A. A. Durmagambetov; formal analysis, A. A. Durmagambetov; investigation, A. A. Durmagambetov; writing—original draft preparation, A. A. Durmagambetov; writing—review and editing, A. A. Durmagambetov and A. A. Durmagambetova. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The study does not report any data.

Acknowledgments

We thank the anonymous reviewers for valuable feedback.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
v 2 ( m ) v 2 -adic valuation of m
z ( n ) Number of zeros in binary expansion of n
T * ( n ) Full Collatz step: ( 3 n + 1 ) / 2 v 2 ( 3 n + 1 )
L ( n ) Binary length: log 2 n + 1

Appendix A. Linear System Matrix

For Theorem 2, the propagation matrix is:
A = 1 0 0 0 0 0.70719 1 0 0 0 0 0.7071 1 0 0 0 0 0.7071 1 0 0 0 0 0.7071 1 .

References

  1. O’Connor, J.J.; Robertson, E.F. Lothar Collatz. MacTutor History of Mathematics Archive, University of St Andrews: St Andrews, UK, 2006. Available online: www.history.mcs.st-andrews.ac.uk/Biographies/Collatz.html.
  2. Lagarias, J.C. The 3x+1 problem and its generalizations. Am. Math. Mon. 2003, 110, 17–39. [Google Scholar] [CrossRef]
  3. Tao, T. Almost all Collatz orbits attain almost bounded values. Forum Math. Pi 2022, 10, e12. [Google Scholar] [CrossRef]
  4. Terras, A. A generalization of the Collatz problem. Ann. N. Y. Acad. Sci. 1976, 273, 170–183. [Google Scholar] [CrossRef]
  5. Stein, W.A.; et al. Sage Mathematics Software (Version 5.11), The Sage Development Team, 2013. Available online: www.sagemath.org.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

Disclaimer

Terms of Use

Privacy Policy

Privacy Settings

© 2025 MDPI (Basel, Switzerland) unless otherwise stated