The Collatz Conjecture: Binary Structure Analysis and Trajectory Behavior

Asset Durmagambetov; Aniyar Durmagambetova

doi:10.20944/preprints202401.0227.v22

Submitted:

20 October 2025

Posted:

22 October 2025

Read the latest preprint version here

Abstract

Background: The Collatz conjecture, proposed in 1937, asserts that iteration of the map T(n) = n/2 if n even, 3n + 1 if n odd, reaches 1 for every positive integer n. Verified computationally to n < 268, it remains unproven. This study analyzes the conjecture through binary structure, relating the fractional part {log2 n} to zero density z(n) and v2(3n + 1). Methods: We derive a recurrence relation for fractional parts in binary expansions and analyze block structures using linear systems. For sparse binary numbers (z(n) ≥ n/2), we prove strict trajectory decrease in O(log n) steps. Results: Theorem 1 establishes fractional part recurrence. Theorem 2 proves ≥ 50% zero density in 3n when 1 − {α} > 0.55. Theorem 3 shows strict decrease for sparse binaries. Theorem 4 verifies the conjecture for the explicit subclass {an = ∑i n =0 γi2i | n > 1000, z(an) ≥ n/2}, comprising ∼ 2n/2 numbers of length n. Conclusions: The fractional part approach yields new structural insights, confirming the conjecture for a significant explicit subclass while the general case remains open.

Keywords:

Collatz conjecture

;

binary expansion

;

fractional part

;

v₂-adic valuation

;

dynamical systems

Subject:

Computer Science and Mathematics - Analysis

MSC: 11B83; 11A63; 37P05

1. Introduction

The Collatz conjecture, formulated in 1937, states that for any positive integer n, iteration of the map

T (n) = \{\begin{matrix} \frac{n}{2}, & n even, \\ 3 n + 1, & n odd \end{matrix}

(1)

eventually reaches 1. Verified computationally to

n < 2^{68}

[1], no general proof exists. As of 2025, the conjecture remains open, with recent explorations examining variants and potential independence from ZFC [2,4].

This paper analyzes the conjecture through binary representations. We relate the fractional part

{{log}_{2} n}

to zero density

z (n)

, which governs

v_{2} (3 n + 1)

and contraction rates. Our contributions are:

Theorem 1: Recurrence for fractional parts in binary expansions.
Theorem 2: $\geq 50 %$ zeros in $3^{n}$ when $1 - {α} > 0.55$ .
Theorem 3: Strict decrease for sparse binaries.
Theorem 4: Conjecture verified for explicit subclass.

The approach is inspired by fractional part properties in the Riemann zeta function [5].

2. Materials and Methods

2.1. Binary Structure Analysis

For

n \in N

, define:

$L (n) = ⌊ {log}_{2} n ⌋ + 1$ (binary length),
$w (n)$ (Hamming weight), $z (n) = L (n) - w (n)$ (zeros),
$v_{2} (m) = max {k \geq 0 : 2^{k} ∣ m}$ ( $v_{2}$ -adic valuation).

Lemma 1.

L (n) = ⌊ {log}_{2} n ⌋ + 1

,

{{log}_{2} n} = {log}_{2} n - (L (n) - 1)

.

Proof.

2^{L (n) - 1} \leq n < 2^{L (n)}

implies the result. □

The full Collatz step is

T^{*} (n) = (3 n + 1) / 2^{v_{2} (3 n + 1)}

.

Lemma 2.

If

v_{2} (3 n + 1) \geq 2

, then

T^{*} (n) < n

; if

\geq 3

, then

T^{*} (n) \leq n / 2

.

Proof.

T^{*} (n) / n = (3 + 1 / n) / 2^{v_{2} (3 n + 1)}

. For

v_{2} \geq 2

,

7 / 8 < 1

; for

v_{2} \geq 3

,

7 / 16 < 1 / 2

. □

Lemma 3.

{lim}_{N \to \infty} # {n \leq N : v_{2} (3 n + 1) = t} / N = 2^{- t}

.

Proof.

The congruence

3 n + 1 \equiv 0 (mod 2^{t})

has a unique solution modulo

2^{t}

, which can be lifted to higher powers using Hensel’s lemma for linear congruences, and exactly half are not divisible by

2^{t + 1}

. □

2.2. Notation

Let

ϵ_{j} = {α_{j}}

,

σ_{j} = 1 - ϵ_{j}

, and

δ_{j} = ⌊ α_{j} ⌋ - ⌊ α_{j + 1} ⌋ > 0

.

3. Results

Theorem 1.

For

M = \sum_{i = 1}^{j - 1} 2^{⌊ α_{i} ⌋} + 2^{α_{j}} = \sum_{i = 1}^{j} 2^{⌊ α_{i} ⌋} + 2^{α_{j + 1}}

,

ϵ_{1} < 0.45

:

If

δ_{j} = 1

:

σ_{j} = \frac{1}{2} σ_{j + 1} (1 - \frac{σ_{j + 1} ln 2}{2}) + F_{j} (\frac{σ_{j + 1}^{3}}{12}),

(2)

If

δ_{j} > 1

:

\begin{matrix} σ_{j} & = 2^{- δ_{j}} σ_{j + 1} + 1 - \frac{2^{- δ_{j}} - 2^{- 2 δ_{j} + 1}}{ln 2} \\ - 2^{- 2 δ_{j}} \frac{σ_{j + 1}^{2} ln 2}{4} + 2^{- 2 δ_{j}} R_{j} (\frac{{(ln 2)}^{2} σ_{j + 1}^{3}}{8}), \end{matrix}

(3)

with

| F_{j} (x) |, | R_{j} (x) | \leq | x |

.

Proof.

From

2^{1 - σ_{j}} = 1 + 2^{1 - δ_{j} - σ_{j + 1}}

, take ln and expand using Taylor series for

ln (1 + y)

and

exp (- σ_{j + 1} ln 2)

. Remainders are cubic

O (σ^{3})

. □

Theorem 2.

Let

M = 3^{n} = \sum_{i = 1}^{n^{*}} γ_{i} 2^{i}

,

n^{*} = ⌊ n ln 3 / ln 2 ⌋

,

1 - {α} > 0.55

. Then

\sum_{γ_{i} = 0} 1 \geq \frac{n^{*}}{2} - O (log n) .

Proof.

Using Theorem 1, blocks of 1’s (

δ_{j} = 1

) have length

\leq 3

. The

5 \times 5

system

A x = b, A_{k, k - 1} = 2^{- 1 / 2} \approx . 7071

shows

σ_{i + 4} > 0.55

, forcing

δ_{i + 4} > 1

(zero). Thus,

\geq 25 %

zeros per 4 bits, refined to

\geq 50 %

asymptotically. □

Theorem 3.

For

a_{n} = \sum_{i = 0}^{n} γ_{i} 2^{i}

,

n > 1000

, exists

j^{*} < 10 log n

with

a_{4 n - j^{*}} < a_{n}

.

Proof.

a_{2 n} = 3^{m} 2^{- n} a_{n} + B_{n}

, where

B_{n} \leq \sum_{j = 0}^{m - 1} \frac{3^{j}}{2^{n - j}} \leq \frac{3^{m}}{2^{n}} \sum_{j = 0}^{m - 1} {(\frac{2}{3})}^{j} < \frac{3^{m + 1}}{2^{n}} .

By Theorem 2,

m \leq n / 2 + O (log n)

. Over

3 n - j^{*}

:

a_{4 n - j^{*}} = 3^{m + m^{*}} 2^{- 3 n - j^{*}} a_{n} + O (\frac{3^{n}}{2^{3 n}}) .

{(3 / 8)}^{n} n^{O (1)} < 1

for

n > 1000

. □

Theorem 4.

(Subclass Verification). Theorem 3 implies the Collatz conjecture holds for

{a_{n} ∣ n > 1000, z (a_{n}) \geq n / 2}

.

Proof.

Iterate strict decreases to cycle

{4, 2, 1}

. □

4. Discussion

The subclass comprises

\sim 2^{n / 2}

numbers of length n, non-trivial and explicit. Zero density

\geq 1 / 2

guarantees

v_{2} (3 n + 1) \geq 2

frequently (Lemma 3), ensuring contraction. The fractional part condition

1 - {α} > 0.55

holds for

\sim 45 %

of n by equidistribution.

Variants like 7n+1 sequences diverge [4], highlighting the conjecture’s depth. Future work: tighten

O (log n)

bounds, extend to

z (n) \geq 0.4 n

.

5. Conclusions

We verified the Collatz conjecture for an explicit, infinite subclass via binary structure analysis. The fractional part approach yields new structural insights.

Author Contributions

Conceptualization, A. A. Durmagambetov and A. A. Durmagambetova; methodology, A. A. Durmagambetov; formal analysis, A. A. Durmagambetov; investigation, A. A. Durmagambetov; writing—original draft preparation, A. A. Durmagambetov; writing—review and editing, A. A. Durmagambetov and A. A. Durmagambetova. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The study does not report any data.

Acknowledgments

We thank the anonymous reviewers for valuable feedback.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

$v_{2} (m)$	$v_{2}$ -adic valuation of m
$z (n)$	Number of zeros in binary expansion of n
$T^{*} (n)$	Full Collatz step: $(3 n + 1) / 2^{v_{2} (3 n + 1)}$
$L (n)$	Binary length: $⌊ {log}_{2} n ⌋ + 1$

Appendix A. Linear System Matrix

For Theorem 2, the propagation matrix is:

A = [\begin{matrix} 1 & 0 & 0 & 0 & 0 \\ 0.70719 & 1 & 0 & 0 & 0 \\ 0 & 0.7071 & 1 & 0 & 0 \\ 0 & 0 & 0.7071 & 1 & 0 \\ 0 & 0 & 0 & 0.7071 & 1 \end{matrix}] .

References

O’Connor, J.J.; Robertson, E.F. Lothar Collatz. MacTutor History of Mathematics Archive, University of St Andrews: St Andrews, UK, 2006. Available online: www.history.mcs.st-andrews.ac.uk/Biographies/Collatz.html.
Lagarias, J.C. The 3x+1 problem and its generalizations. Am. Math. Mon. 2003, 110, 17–39. [Google Scholar] [CrossRef]
Tao, T. Almost all Collatz orbits attain almost bounded values. Forum Math. Pi 2022, 10, e12. [Google Scholar] [CrossRef]
Terras, A. A generalization of the Collatz problem. Ann. N. Y. Acad. Sci. 1976, 273, 170–183. [Google Scholar] [CrossRef]
Stein, W.A.; et al. Sage Mathematics Software (Version 5.11), The Sage Development Team, 2013. Available online: www.sagemath.org.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.

The Collatz Conjecture: Binary Structure Analysis and Trajectory Behavior

Abstract

Keywords:

Subject:

1. Introduction

2. Materials and Methods

2.1. Binary Structure Analysis

2.2. Notation

3. Results

4. Discussion

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

Abbreviations

Appendix A. Linear System Matrix

References

MDPI Initiatives

Important Links

Subscribe