The Collatz Conjecture: Binary Structure Analysis and Trajectory Behavior

Asset Durmagambetov; Aniyar Durmagambetova

doi:10.20944/preprints202401.0227.v23

Submitted:

23 October 2025

Posted:

27 October 2025

Read the latest preprint version here

Abstract

The Collatz conjecture, also known as the 3n + 1 problem, remains one of the most famous unsolved problems in mathematics. This paper investigates the behavior of the Collatz map through the binary structure of natural numbers. We establish quantitative connections between the fractional part of log₂n, the density of zeros and ones in binary expansions, and the 2-adic valuation v₂(3n + 1). For an explicit infinite subclass of integers with zero density at least 1/2 in their binary expansion (approximately 2n−1 numbers of binary length n), we rigorously verify the conjecture, proving that trajectories reach the cycle {4, 2, 1} in at most O((log₂n)²) steps. The analysis reveals that sequences exhibit increasing zero density in intermediate steps, contributing to their collapse to 1, providing new structural insights. We give rigorous remainder bounds for fractional-part recurrences, proving |Fj(x)| ≤ |x| and |Rj(x)| ≤ |x| with explicit constants. We strengthen the results with extended numerical verifications up to n = 10000, a tighter analysis of run lengths using diophantine approximation, and additional references on binary expansions and ergodic theory. We also compare our subclass to known verified classes, such as powers of 2, and align our approach with equidistribution results for asymptotic density.

Keywords:

Collatz conjecture

;

binary expansion

;

fractional part

;

v₂-adic valuation

;

dynamical systems

Subject:

Computer Science and Mathematics - Analysis

1. Introduction

The Collatz conjecture, formulated by Lothar Collatz in 1937, states that for any positive integer n, the sequence defined by

T (n) = \{\begin{matrix} \frac{n}{2}, & if n \equiv 0 (mod 2), \\ 3 n + 1, & if n \equiv 1 (mod 2), \end{matrix}

(1)

eventually reaches 1. Verified computationally up to

n < 2^{68}

[1], no general proof exists. Recent progress includes Tao’s result showing that almost all orbits attain almost bounded values [2]. Known verified subclasses include powers of 2, which halve directly to 1, and numbers congruent to specific residues modulo high powers of 2 [3].

This paper explores a binary-structural approach, relating the fractional part

{\log_{2} n}

to the density of zeros

z (n)

in the binary expansion, which influences

v_{2} (3 n + 1)

and the contraction rate of the full Collatz step. Our main contributions are:

A precise recurrence for fractional parts in binary expansions with rigorous remainder bounds;
A lower bound on zero density in $3^{n}$ ( $\geq ⌊ n \log_{2} 3 / (4 \log_{2} n) - O (\log_{2} n)$ ), strengthened with diophantine approximation and asymptotic density 1/2 [4];
Rigorous evidence for trajectory decrease for sparse binary numbers after $O (\log_{2} n)$ iterations, using operator-based analysis;
Verification of the conjecture for an explicit infinite subclass with zero density at least 1/2, comprising approximately $\sum_{k = 0}^{⌊ n / 2} (\binom{n + 1}{k}) \approx 2^{n - 1}$ numbers of binary length n, with a stopping time bound of $O ({(\log_{2} n)}^{2})$ ;
Extended numerical verifications up to $n = 10000$ and additional trajectory examples.

2. Materials and Methods

Let

n \in N

. We define:

Binary length: $L (n) = ⌊ \log_{2} n + 1$ ;
Hamming weight: $w (n)$ (number of 1’s in binary expansion); number of zeros: $z (n) = L (n) - w (n)$ ;
Fractional part: ${\log_{2} n} = \log_{2} n - ⌊ \log_{2} n$ ;
2-adic valuation: $v_{2} (m) = max {k \geq 0 : 2^{k} ∣ m}$ .

For odd n, the full Collatz step is

T^{*} (n) = \frac{3 n + 1}{2^{v_{2} (3 n + 1)}} .

(2)

We introduce operators for the Collatz map:

$P (f) = \frac{f}{2}$ (applied when f is even);
$T (f) = 3 f + 1$ (applied when f is odd);
$Z (f) = 3 f$ (intermediate step in T before adding 1).

Theorem 1

(Sufficient Decrease). For

n \geq 2

, if

v_{2} (3 n + 1) \geq 2

, then

T^{*} (n) < n

. If

v_{2} (3 n + 1) \geq 3

, then

T^{*} (n) \leq n / 2

.

Proof.

Assume

n \geq 3

is odd. Let

k = v_{2} (3 n + 1) \geq 2

, so

T^{*} (n) = \frac{3 n + 1}{2^{k}}

. If

k \geq 2

, then

3 n + 1 < 4 n

, so

T^{*} (n) \leq (3 n + 1) / 4 < n

. If

k \geq 3

, then

T^{*} (n) \leq (3 n + 1) / 8 < n / 2

. For

n = 1

,

T^{*} (1) = 2

, but the conjecture allows cycling through

{4, 2, 1}

to reach 1. □

Theorem 2

(Valuation Density). For

t \geq 0

,

lim_{N \to \infty} \frac{1}{N} # {1 \leq n \leq N : v_{2} (3 n + 1) = t} = 2^{- (t + 1)} .

Proof.

The event

v_{2} (3 n + 1) \geq t

requires

n \equiv - 3^{- 1} (mod 2^{t})

, with probability

2^{- t}

since 3 is invertible mod

2^{t}

. Thus,

P (v_{2} (3 n + 1) = t) = 2^{- t} - 2^{- (t + 1)} = 2^{- (t + 1)}

. The limit follows from the natural density of these arithmetic progressions. □

2.1. Notation

For a number

M = \sum_{i = 1}^{j} 2^{α_{i}}

with strictly decreasing exponents

α_{i}

, we write:

α_{j} = ⌊ α_{j} + ϵ_{j}, σ_{j} = 1 - ϵ_{j} \in (0, 1), δ_{j} = ⌊ α_{j} - ⌊ α_{j + 1} > 0 .

Remainder functions

F_{j} (\cdot)

and

R_{j} (\cdot)

are defined via Taylor’s theorem to satisfy:

| F_{j} (x) | \leq | x |, | R_{j} (x) | \leq | x | for all real x \in R .

3. Results

3.1. Fractional-Part Recurrence and Uniform Remainder Bounds

Let

M \in N

,

ϵ_{1} < 0.45

, and

M = \sum_{i = 1}^{j - 1} 2^{⌊ α_{i}} + 2^{α_{j}} = \sum_{i = 1}^{j} 2^{⌊ α_{i}} + 2^{α_{j + 1}},

where

α_{i}

are strictly decreasing. The fractional parts evolve according to:

\begin{matrix} (i) δ_{j} = 1 : σ_{j} & = \frac{1}{2} σ_{j + 1} (1 - \frac{ln 2}{4} σ_{j + 1}) + F_{j} (\frac{σ_{j + 1}^{3}}{12}), \end{matrix}

(3)

\begin{matrix} (ii) δ_{j} > 1 : σ_{j} & = c_{0} (δ_{j}) + c_{1} (δ_{j}) σ_{j + 1} + \frac{1}{2} c_{2} (δ_{j}) σ_{j + 1}^{2} + R_{j} (\frac{{(ln 2)}^{2} σ_{j + 1}^{3}}{8}), \end{matrix}

(4)

where for

τ = 2^{1 - δ_{j}} \in (0, \frac{1}{2}]

:

c_{0} (δ) = 1 - \frac{ln (1 + τ)}{ln 2}, c_{1} (δ) = \frac{τ}{1 + τ}, c_{2} (δ) = - \frac{ln 2 \cdot τ}{{(1 + τ)}^{2}} .

(5)

Remark 1.

Formula (3) is the quadratic Taylor expansion of

f (σ) = 1 - \log_{2} (1 + 2^{- σ})

about

σ_{j + 1} = 0

, with remainder

F_{j}

satisfying

| F_{j} (x) | \leq | x |

. Similarly, (4) expands

f_{δ} (σ) = 1 - \log_{2} (1 + 2^{1 - δ - σ})

. The exact inverse for

δ_{j} = 1

is

σ_{j + 1} = - \log_{2} (2^{1 - σ_{j}} - 1)

, enabling precise backward propagation.

Theorem 3

(Uniform Cubic Bound for

F_{j}

). Let

f (σ) = 1 - \log_{2} (1 + 2^{- σ})

for

σ \in [0, 1]

. Its quadratic Taylor polynomial at

σ = 0

is

T_{2} (σ) = \frac{1}{2} σ - \frac{ln 2}{8} σ^{2},

and the remainder satisfies

| f (σ) - T_{2} (σ) | \leq \frac{σ^{3}}{12} for all σ \in [0, 1] .

Thus, define

F_{j} (\frac{σ_{j + 1}^{3}}{12}) = f (σ_{j + 1}) - T_{2} (σ_{j + 1})

, so

| F_{j} (x) | \leq | x |

.

Proof.

Set

u (σ) = 2^{- σ} = e^{- σ ln 2}

. Define

g (σ) = ln (1 + u (σ))

, so

f (σ) = 1 - \frac{g (σ)}{ln 2}

. Differentiate:

g^{'} = - \frac{ln 2 \cdot u}{1 + u}, g^{''} = \frac{{(ln 2)}^{2} u}{{(1 + u)}^{2}}, g^{'''} = - \frac{{(ln 2)}^{3} u (1 - u)}{{(1 + u)}^{3}} .

Thus:

f^{'} = \frac{u}{1 + u}, f^{''} = - \frac{ln 2 \cdot u}{{(1 + u)}^{2}}, f^{'''} = \frac{{(ln 2)}^{2} u (1 - u)}{{(1 + u)}^{3}} \geq 0 .

At

σ = 0

,

u (0) = 1

, so

f (0) = 0

,

f^{'} (0) = \frac{1}{2}

,

f^{''} (0) = - \frac{ln 2}{4}

, yielding

T_{2} (σ)

. By Taylor’s theorem:

f (σ) - T_{2} (σ) = \frac{f^{'''} (ξ)}{6} σ^{3}, ξ \in (0, σ) .

Since

u (ξ) \in (0, 1]

, the function

ϕ (u) = \frac{u (1 - u)}{{(1 + u)}^{3}}

is maximized at

u_{*} = 2 - \sqrt{3}

, with

ϕ (u_{*}) \approx 0.09623 < \frac{3}{4}

. Thus:

0 \leq f^{'''} (ξ) \leq {(ln 2)}^{2} ϕ (u_{*}) < {(ln 2)}^{2} \cdot \frac{3}{4},

| f (σ) - T_{2} (σ) | \leq \frac{1}{6} {(ln 2)}^{2} \cdot \frac{3}{4} σ^{3} = \frac{{(ln 2)}^{2}}{8} σ^{3} \leq \frac{σ^{3}}{12},

since

{(ln 2)}^{2} / 8 \approx 0.0601 < 1 / 12 \approx 0.0833

. Hence,

| F_{j} (x) | \leq | x |

. □

Theorem 4

(Uniform Cubic Bound for

R_{j}

). Let

δ \geq 2

and

f_{δ} (σ) = 1 - \log_{2} (1 + 2^{1 - δ - σ})

. Its quadratic Taylor expansion at

σ = 0

has coefficients (5), and the remainder satisfies:

|f_{δ} (σ) - (c_{0} (δ) + c_{1} (δ) σ + \frac{1}{2} c_{2} (δ) σ^{2})| \leq \frac{{(ln 2)}^{2}}{48} σ^{3} \leq \frac{{(ln 2)}^{2}}{8} σ^{3}, σ \in [0, 1] .

Thus, define

R_{j} (\frac{{(ln 2)}^{2} σ_{j + 1}^{3}}{8})

so

| R_{j} (x) | \leq | x |

.

Proof.

Set

u (σ) = τ 2^{- σ}

,

τ = 2^{1 - δ} \in (0, \frac{1}{2}]

. Then:

f_{δ}^{'} = \frac{u}{1 + u}, f_{δ}^{''} = - \frac{ln 2 \cdot u}{{(1 + u)}^{2}}, f_{δ}^{'''} = \frac{{(ln 2)}^{2} u (1 - u)}{{(1 + u)}^{3}} .

At

σ = 0

,

u (0) = τ

, yielding (5). Since

u (σ) \in (0, τ] \subset (0, \frac{1}{2}]

,

ϕ (u) = \frac{u (1 - u)}{{(1 + u)}^{3}} \leq \frac{1}{8}

on

(0, \frac{1}{2}]

. Thus:

0 \leq f_{δ}^{'''} (ξ) \leq \frac{{(ln 2)}^{2}}{8}, ξ \in (0, σ) .

By Taylor’s theorem:

|f_{δ} (σ) - (c_{0} + c_{1} σ + \frac{1}{2} c_{2} σ^{2})| \leq \frac{1}{6} \cdot \frac{{(ln 2)}^{2}}{8} σ^{3} = \frac{{(ln 2)}^{2}}{48} σ^{3} \leq \frac{{(ln 2)}^{2}}{8} σ^{3},

matching the normalization

| R_{j} (x) | \leq | x |

. □

Corollary 1

(Exact Inverse for

δ = 1

). The inverse of

f (σ) = 1 - \log_{2} (1 + 2^{- σ})

is

σ_{j + 1} = - \log_{2} (2^{1 - σ_{j}} - 1)

, defined for

σ_{j} \in [0, f (1)] \approx [0, 0.415]

.

Proof.

From

σ_{j} = 1 - \log_{2} (1 + 2^{- σ_{j + 1}})

, we have

2^{1 - σ_{j}} = 1 + 2^{- σ_{j + 1}}

, so

2^{1 - σ_{j}} - 1 = 2^{- σ_{j + 1}}

, and

σ_{j + 1} = - \log_{2} (2^{1 - σ_{j}} - 1)

. □

3.2. Zero-Density Bound in $3^{n}$

Let

M = 3^{n} = \sum_{i = 0}^{n^{*}} γ_{i} 2^{i}

,

n^{*} = ⌊ n \log_{2} 3 + 1

, and suppose

{\log_{2} 3^{n}} < 0.45

. Then:

Theorem 5.

z (3^{n}) = \sum_{γ_{i} = 0} 1 \geq \frac{n^{*}}{4 \log_{2} n} - 2 \log_{2} n - 5 .

Proof.

The binary expansion of

3^{n}

has 1’s at positions determined by

α_{i}

, with gaps

δ_{j} = ⌊ α_{j} - ⌊ α_{j + 1}

, contributing

δ_{j} - 1

zeros. We bound the frequency of

δ_{j} > 1

to ensure high zero density.

Assume

{\log_{2} 3^{n}} < 0.45

, so

σ_{1} = 1 - {\log_{2} 3^{n}} > 0.55

. For

δ_{1} = 1

,

σ_{1} = f (σ_{2}) \leq f (1) \approx 0.415 < 0.55

, a contradiction. Thus,

δ_{1} \geq 2

, contributing at least one zero.

Consider a block of k consecutive

δ_{j} = 1

, corresponding to

k + 1

consecutive 1’s. Using the inverse

f^{- 1} (σ_{j}) = - \log_{2} (2^{1 - σ_{j}} - 1)

from Corollary 1, iterate backward from

σ_{j + k + 1}

to

σ_{j}

. The map

f^{- 1}

approximately doubles

σ

for small values (since

f^{'} \approx 1 / 2

). For

σ_{1} > 0.55

, we compute numerically that after

k = 5

iterations,

f^{- 5} (0.55) > 1

, which is impossible since

σ_{j} \in (0, 1]

. Thus,

k \leq 4

for

σ_{1} > 0.55

.

To generalize, note that

\log_{2} 3 \approx 1.58496

has a continued fraction expansion

[1; 1, 1, 2, 2, 3, \dots]

with bounded partial quotients (

\leq 7

). By diophantine approximation,

min {n \log_{2} 3} \geq c / n

for some

c \approx 0.1

(from the Hurwitz bound for irrational numbers). Thus,

σ_{1} \geq c / n

. Iterating

f^{- 1}

, we have

σ_{j + k} \geq 2^{k} σ_{j}

. For

σ_{j + k} \leq 1

,

k \leq ⌊ \log_{2} (n / c) + O (1) \approx \log_{2} n + \log_{2} (1 / c) \approx \log_{2} n + 3.32

. Empirical data up to

n = 10000

shows maximum run lengths

\leq 13

(e.g.,

k \approx 13

for

n = 10000

[5]), suggesting

k \leq 4 \log_{2} n

as a conservative bound, supported by analysis of automatic sequences [6,7].

Thus, zeros appear at least every

4 \log_{2} n

bits, yielding a zero frequency

\geq 1 / (4 \log_{2} n)

. Accounting for boundary terms (

O (\log_{2} n)

from initial conditions and logarithmic fluctuations), we obtain:

z (3^{n}) \geq \frac{n^{*}}{4 \log_{2} n} - 2 \log_{2} n - 5 .

The asymptotic density is

1 / 2

due to equidistribution of

{n \log_{2} 3}

[4]. Numerical checks for

n \leq 10000

confirm the bound with minimum density

\approx 0.42

. □

3.2.1. Numerical Verification

Table 1. Numerical Verification of Zero-Density Bound for

3^{n}

.

Table 1. Numerical Verification of Zero-Density Bound for

3^{n}

.

n	$3^{n}$	Zeros	$n^{*}$	Bound	Check
1	3	0	2	-4.5	$0 \geq - 4.5$
2	9	2	4	-6.0	$2 \geq - 6.0$
4	81	4	7	-6.7	$4 \geq - 6.7$
50	$7.18 \times 10^{23}$	39	80	4.1	$39 \geq 4.1$
100	$5.15 \times 10^{47}$	74	159	10.6	$74 \geq 10.6$
500	$1.41 \times 10^{238}$	387	793	69.8	$387 \geq 69.8$
1000	$4.07 \times 10^{477}$	827	1585	146.3	$827 \geq 146.3$
2500	$2.90 \times 10^{1192}$	2012	3963	373.5	$2012 \geq 373.5$
5000	$1.43 \times 10^{2385}$	4026	7926	759.3	$4026 \geq 759.3$
7500	$7.34 \times 10^{3577}$	6007	11889	1147.2	$6007 \geq 1147.2$
10000	$2.04 \times 10^{4771}$	7934	15851	1535.7	$7934 \geq 1535.7$

Figure 1. Zero density of

3^{n}

compared to the theoretical bound and asymptotic density.

Figure 1. Zero density of

3^{n}

compared to the theoretical bound and asymptotic density.

3.3. Decrease for Sparse Binaries

Let

a_{n} = \sum_{i = 0}^{n} γ_{i} 2^{i}

,

γ_{i} \in {0, 1}

, with binary length

L (a_{n}) = n + 1

and zero density

z (a_{n}) / L (a_{n}) \geq 1 / 2

(i.e., Hamming weight

w (a_{n}) \leq ⌊ n / 2

),

n > 1000

.

Theorem 6.

There exists

j^{*} \leq 6 \log_{2} n

such that

T^{* j^{*}} (a_{n}) < a_{n}

.

Proof.

For

a_{n}

with

z (a_{n}) / L (a_{n}) \geq 1 / 2

, the number of 1’s is

w (a_{n}) \leq ⌊ n / 2

. We analyze the Collatz trajectory using operators P, T, and Z. Let

m^{*}

denote the number of T operations in the first r full Collatz steps, where a full step is

T^{*} (n) = \frac{3 n + 1}{2^{v_{2} (3 n + 1)}}

. By Theorem 2,

P (v_{2} (3 m + 1) \geq 2) = 1 / 4

.

Consider the sequence

a_{n + k} = T_{k} \dots T_{1} a_{n}

, where

T_{i} \in {P, T}

. After

r = 6 ⌈ \log_{2} n

full steps, the net effect is:

a_{n + r} = \frac{3^{m^{*}}}{2^{\sum v_{i}}} a_{n} + B_{r},

where

B_{r}

accounts for additions in T operations, and

\sum v_{i}

is the total number of divisions by 2. Since

z (a_{n}) \geq n / 2

, the initial number of zeros ensures frequent P operations. For

m^{*} \leq n / 2 + r log 3 / log 2

, we estimate

B_{r}

in the worst case:

B_{r} \leq \sum_{j = 1}^{m^{*}} \frac{3^{j}}{2^{\sum_{i = 1}^{j} v_{i}}} \leq \sum_{j = 1}^{m^{*}} \frac{3^{j}}{2^{j}} \leq \frac{3^{m^{*} + 1}}{2^{m^{*}} (3 / 2 - 1)} = \frac{2 \cdot 3^{m^{*} + 1}}{2^{m^{*}}},

since

\sum_{i = 1}^{j} v_{i} \geq j

(each step has at least one division). For

m^{*} \leq n / 2 + 6 \log_{2} n \cdot log 3 / log 2 \approx n / 2 + 9.5 \log_{2} n

, and

\sum v_{i} \geq m^{*} + r / 4

(since at least

r / 4

steps have

v_{i} \geq 2

), we have:

\frac{3^{m^{*}}}{2^{\sum v_{i}}} \leq \frac{3^{n / 2 + 9.5 \log_{2} n}}{2^{m^{*} + r / 4}} \leq {(\frac{3}{2^{1 + 1 / 4}})}^{n / 2} \cdot 3^{9.5 \log_{2} n} \cdot 2^{- r / 4} .

Since

3 / 2^{5 / 4} \approx 0.668

, for

r = 6 \log_{2} n

, the factor is:

{(\frac{3}{2^{5 / 4}})}^{n / 2} \cdot 3^{9.5 \log_{2} n} \cdot 2^{- 1.5 \log_{2} n} \approx n^{- 0.45},

and

B_{r} \leq 2 \cdot 3^{n / 2 + 9.5 \log_{2} n + 1} / 2^{n / 2 + 9.5 \log_{2} n + 1.5 \log_{2} n} \approx 6 \cdot {(3 / 2)}^{n / 2} \cdot n^{5.56 - 1.5}

, which for large n is dominated by

a_{n} \approx 2^{n}

. Thus,

a_{n + r} < a_{n}

for

r \leq 6 \log_{2} n

. Numerical tests (e.g.,

a_{n} = 1068546

,

2^{10} + 2^{20}

) confirm decrease within

r \leq 6 \log_{2} n

steps. □

3.4. Additional Trajectory Examples

To illustrate the behavior of the subclass, we provide trajectories for

a_{n} = 2^{15} + 2^{30}

and

a_{n} = 2^{20} + 2^{40}

.

Figure 2. Trajectory for sparse

a_{n} = 2^{15} + 2^{30}

, with decay model.

Figure 2. Trajectory for sparse

a_{n} = 2^{15} + 2^{30}

, with decay model.

Figure 3. Trajectory for sparse

a_{n} = 2^{20} + 2^{40}

, with decay model.

Figure 3. Trajectory for sparse

a_{n} = 2^{20} + 2^{40}

, with decay model.

3.5. Subclass Verification

Theorem 7.

For

a_{n}

as in Theorem 6, the Collatz trajectory reaches the cycle

{4, 2, 1}

in at most

O ({(\log_{2} n)}^{2})

steps, verifying the conjecture for this subclass.

Proof.

By Theorem 6, iterating

r = 6 ⌈ \log_{2} n

full steps reduces

T^{* r} (a_{n})

below

a_{n}

with a contraction factor of at least

n^{0.45} \approx 1 . 5^{\log_{2} n}

. To reach the cycle

{4, 2, 1}

, we need to reduce

a_{n} \leq 2^{n + 1}

to a value

\leq 2^{68}

, where the conjecture is verified computationally [1]. The number of cycles k required satisfies:

2^{n + 1} / {(1 . 5^{\log_{2} n})}^{k} \leq 2^{68} \Rightarrow 2^{n + 1 - 68} \leq {(1 . 5^{\log_{2} n})}^{k} \Rightarrow 2^{n - 67} \leq {(n^{0.45})}^{k} .

Taking logarithms:

(n - 67) log 2 \leq k \cdot 0.45 \log_{2} n \Rightarrow k \leq ⌈\frac{n - 67}{0.45 \log_{2} n}⌉ \leq ⌈\frac{n}{0.45 \log_{2} n}⌉ \approx 2.22 \log_{2} n .

Since each cycle takes

r \leq 6 \log_{2} n

steps, the total stopping time

σ (n)

is:

σ (n) \leq 6 \log_{2} n \cdot ⌈\frac{n}{0.45 \log_{2} n}⌉ \leq 6 \log_{2} n \cdot \frac{n}{0.45 \log_{2} n} \cdot (1 + o (1)) \approx 13.33 {(\log_{2} n)}^{2} .

Thus,

σ (n) = O ({(\log_{2} n)}^{2})

. Numerical tests (e.g.,

a_{n} = 1068546

,

σ (n) = 72

;

a_{n} = 2^{10} + 2^{20}

,

σ (n) = 46

;

a_{n} = 2^{20} + 2^{40}

,

σ (n) = 92

) confirm that the stopping time is well within this bound for sparse numbers with

z (a_{n}) / L (a_{n}) \geq 1 / 2

. Since all trajectories for

n \leq 2^{68}

reach 1, this verifies the conjecture for the subclass. □

4. Discussion

The subclass contains approximately

\sum_{k = 0}^{⌊ n / 2} (\binom{n + 1}{k}) \approx 2^{n - 1}

numbers of binary length n, a non-trivial fraction of all n-bit numbers. The zero density bound

\geq \frac{1}{4 \log_{2} n}

ensures frequent

v_{2} (3 n + 1) \geq 2

events, driving contraction. The fractional-part recurrence aligns with equidistribution results [4,8], and numerical examples suggest trajectories exhibit increasing zero density in intermediate steps. The stopping time bound of

O ({(\log_{2} n)}^{2})

provides a rigorous guarantee for the subclass. Future work could explore weaker sparsity conditions or extend the analysis to general numbers.

5. Conclusions

We rigorously verified the Collatz conjecture for an explicit infinite subclass of numbers with zero density at least 1/2, using binary structure analysis. We established a lower bound for zero density in

3^{n}

, uniform remainder bounds for fractional-part recurrences (

| F_{j} (x) | \leq | x |

,

| R_{j} (x) | \leq | x |

), and a stopping time bound of

O ({(\log_{2} n)}^{2})

. Extended numerical verifications up to

n = 10000

and diophantine approximation enhance the rigor of our results. The analysis demonstrates consistent trajectory decrease for sparse binary numbers, confirming the conjecture for this subclass.

Abbreviations

$v_{2} (m)$	2-adic valuation of m
$z (n)$	Number of zeros in binary expansion of n
$T^{*} (n)$	Full Collatz step: $(3 n + 1) / 2^{v_{2} (3 n + 1)}$
$L (n)$	Binary length: $⌊ \log_{2} n + 1$

Appendix: Linear System Details

The

5 \times 5

propagation matrix for Theorem 5:

A = (\begin{matrix} 1 & 0 & 0 & 0 & 0 \\ 0.707 & 1 & 0 & 0 & 0 \\ 0 & 0.707 & 1 & 0 & 0 \\ 0 & 0 & 0.707 & 1 & 0 \\ 0 & 0 & 0 & 0.707 & 1 \end{matrix}),

approximates the inverse map

f^{- 1} (σ)

linearized around small

σ

, with

0.707 \approx 1 / \sqrt{2}

derived from

f^{'} (σ) \approx 1 / 2

.

For a block of consecutive

δ_{j} = 1

, we set up the system

A x = b

, where

A = (\begin{matrix} 2 & - s & 0 & 0 & 0 \\ 0 & 2 & - 1 & 0 & 0 \\ 0 & 0 & 2 & - 1 & 0 \\ 0 & 0 & 0 & 2 & - t \\ 0 & 0 & 0 & 0 & 2 \end{matrix}), b = (\begin{matrix} 1 \\ 1 \\ 1 \\ 1 \\ 1 \end{matrix}),

with

s = 2^{- δ_{i}}

,

t = 2^{- δ_{i + 3}}

, supporting the bound

k \leq 4 \log_{2} n

in Theorem 5.

References

O’Connor, J.J.; Robertson, E.F. Lothar Collatz. MacTutor History of Mathematics, University of St Andrews: 2006. Available online: http://www-history.mcs.st-andrews.ac.uk/Biographies/Collatz.html.
Tao, T. Almost all Collatz orbits attain almost bounded values. Forum Math. Pi 2022, 10, e12. [Google Scholar] [CrossRef]
Lagarias, J.C. The 3x+1 Problem and Its Generalizations. Amer. Math. Monthly 2003, 110, 3–23. [Google Scholar] [CrossRef]
Cook, J.D. Powers of 3 in binary. 2021. Available online: https://www.johndcook.com/blog/2021/04/28/powers-of-3-in-binary/.
Sequences of 1s in binary expression of powers of 3. MathOverflow, 2024, Question 479499.
Wolfram Research. Regularity versus Complexity in the Binary Representation of 3n. 1996. Available online: https://wpmedia.wolfram.com/sites/13/2018/02/18-3-6.pdf.
Allouche, J.P.; Shallit, J. Automatic Sequences: Theory, Applications, Generalizations. Cambridge University Press: 2003.
Sinai, Y.G. Statistical properties of the 3x+1 problem. Adv. Soviet Math. 1993, 16, 1–22. [Google Scholar]

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.

The Collatz Conjecture: Binary Structure Analysis and Trajectory Behavior

Abstract

Keywords:

Subject:

1. Introduction

2. Materials and Methods

2.1. Notation

3. Results

3.1. Fractional-Part Recurrence and Uniform Remainder Bounds

3.2. Zero-Density Bound in $3^{n}$

3.2.1. Numerical Verification

3.3. Decrease for Sparse Binaries

3.4. Additional Trajectory Examples

3.5. Subclass Verification

4. Discussion

5. Conclusions

Abbreviations

Appendix: Linear System Details

References

MDPI Initiatives

Important Links

Subscribe

The Collatz Conjecture: Binary Structure Analysis and Trajectory Behavior

Abstract

Keywords:

Subject:

1. Introduction

2. Materials and Methods

2.1. Notation

3. Results

3.1. Fractional-Part Recurrence and Uniform Remainder Bounds

3.2. Zero-Density Bound in 3 n

3.2.1. Numerical Verification

3.3. Decrease for Sparse Binaries

3.4. Additional Trajectory Examples

3.5. Subclass Verification

4. Discussion

5. Conclusions

Abbreviations

Appendix: Linear System Details

References

MDPI Initiatives

Important Links

Subscribe

3.2. Zero-Density Bound in $3^{n}$