1. Introduction
1.1. Background and Context
The Collatz conjecture, proposed by Lothar Collatz in 1937, has fascinated mathematicians for decades due to its deceptively simple definition and yet unresolved status [
2,
4]. Also known as the
problem, it asserts that for any positive integer
x, repeated application of the function
will eventually reach the cycle
. Despite extensive computational verification and probabilistic arguments supporting the conjecture [
5,
8], a general proof has remained elusive, highlighting a profound disconnect between the conjecture’s elementary formulation and the complex dynamics it generates.
Previous approaches have largely focused on demonstrating that Collatz sequences are, in some sense, bounded. Probabilistic models suggest an average decreasing behavior [
4,
6], while computational efforts have verified convergence for astronomically large starting values [
1,
7]. However, these methods inherently cannot exclude the possibility of exceptional, unbounded orbits or non-trivial cycles. Even Tao’s significant result [
8], proving that
almost all orbits are bounded, does not establish boundedness for
every starting number.
1.2. Key Contributions of This Paper
In this paper, we present a deterministic and fully constructive proof of the Collatz Conjecture by modeling the Collatz dynamics as a finite state machine (FSM) defined over a novel modular classification of the positive integers. Our approach partitions into five disjoint and exhaustive sets based on parity and residue class modulo 9. This yields a 17-state FSM in which each state represents a unique structural class under the Collatz function.
The key idea is to analyze Collatz sequences as symbolic paths through this FSM, showing that every integer transitions through a finite set of transient states before entering a terminal subset. We prove that the only valid entry point into the known cycle is through the integer 8, and we rigorously eliminate the possibility of any alternative cycles or divergent trajectories.
Unlike prior methods, our framework offers a fully deterministic classification of the state space. We show that infinite escape from the terminal cycle is structurally impossible within the FSM. This provides a symbolic, modular, and complete proof of the conjecture. The correctness of our classification and transition model is further supported by empirical verification up to , aligning precisely with known computational results.
1.3. Structure of This Paper
This paper is organized as follows:
The logical flow of the proof is designed to be cumulative: each section builds upon the results of previous sections, culminating in the complete resolution of the conjecture in
Section 6. The finite state machine diagram (
Figure 1) provides a visual summary of the core structural framework.
2. Mathematical Framework and Definitions
To rigorously analyze the Collatz Conjecture, we begin by establishing the fundamental mathematical definitions, notation, and the core function at the heart of the problem.
Definition 1
(Collatz Function).
The Collatz function is defined as
Definition 2
(Collatz Sequence).
For a starting integer , the Collatz sequence is the sequence defined by
Definition 3
(Odd Iterate). Given a Collatz sequence , anodd iterateis a term that is odd. We often denote odd iterates by .
(or accelerated Collatz step)).
Definition 4 (Odd Iteration Anodd iteration(also called anaccelerated Collatz step) is the transformation that maps an odd integer o directly to the next odd integer in its Collatz sequence. It is given by
where denotes denotes the 2-adic valuation of m, i.e., the exponent of the largest power of 2 dividing m. This guarantees that is odd. In some residue class analyses (e.g., modulo 4 or 12) one considers the simplified version
when focusing on residue class transitions and boundedness arguments.
3. State Space Partitioning for Collatz Dynamics
To facilitate a structured analysis of the Collatz process, we begin by partitioning the set of positive integers () into a collection of mutually exclusive and collectively exhaustive sets. The partitioning is designed to capture key properties of numbers under the Collatz function, exposing deterministic relationships between a finite number of sets, facilitating the process of proving the conjecture.
3.1. Defining Fundamental Sets in Collatz Analysis
We begin by defining the key sets that will form the basis of our state space.
Definition 5
(Cycle Set).
The cycle set consists of the numbers known to form a repeating cycle:
Explanation of the cycle set: The cycle set
is fundamental to the Collatz conjecture. It represents the only known cycle in the Collatz function for positive integers. When a Collatz sequence reaches any of these numbers, it enters a loop that cycles as
A central part of the conjecture is to prove that all Collatz sequences eventually enter this cycle.
Definition 6
(ROM3 Set).
The ROM3 set comprises all odd positive multiples of 3:
Explanation of the ROM3 set: The ROM3 set (short for “root odd multiple of 3") consists of those positive integers that are odd multiples of 3. For example, belong to . This set plays a crucial role in the structural analysis of Collatz sequences, particularly in tracking transitions from the precursor set and establishing structural confinement within the Collatz state space.
Definition 7
(Precursor Set).
The precursor set consists of all even positive multiples of 3:
Explanation of the precursor set: The precursor set is defined as the set of positive integers that are even multiples of 3 (i.e., numbers satisfying ). For instance, belong to . The term “precursor" reflects that, under reverse Collatz iteration, numbers in serve as the origins that structurally precede the ROM3 set .
Definition 8
(Immediate Successor Set).
The immediate successor set is defined as
Explanation of the immediate successor set: The immediate successor set consists of numbers of the form with j odd. For example, are in . When the Collatz function is applied to a number in the ROM3 set, the very next number in the sequence falls into , marking the next step in the structural chain.
Definition 9
(Exclusion Set).
The exclusion set consists of numbers that do not belong to , , , or :
Explanation of the exclusion set: The exclusion set is defined by exclusion. consists precisely of positive integers that are not divisible by 3 and are not in or .
3.2. Completeness of Classification
For our state space to be a valid foundation for analysis, we must ensure that every positive integer belongs to exactly one of the defined sets. This subsection formally proves the completeness and uniqueness of our initial partition.
Theorem 1
(Completeness of Classification: Partitioning of positive integers).
The set of positive integers is completely and uniquely partitioned as follows:
That is, every positive integer belongs to exactly one, and only one, of these five sets.
Proof. Proof strategy: We prove completeness by first showing that every belongs to at least one of the five sets (exhaustiveness) and then proving that no x can belong to more than one set (mutual exclusivity).
Step 1: Exhaustiveness.
Let x be an arbitrary positive integer.
Thus, every x is assigned to at least one set.
Step 2: Mutual exclusivity.
We now verify that these sets are pairwise disjoint.
since (none of which are divisible by 3) while every element in is divisible by 3.
because contains only small numbers not divisible by 3 and consists of even multiples of 3.
and by definition.
The remaining intersections (, , , , , ) are similarly ruled out by the definitions and congruence conditions imposed on each set.
Conclusion: Since every positive integer belongs to exactly one of , , , , or , the classification is complete. □
4. Properties of the Collatz Function on the Defined Sets
Having established that the set is a unique, absorbing cycle within the Collatz process, we now proceed to map the properties of the Collatz function on all our sets (as defined in Section 3): , , , , and . This analysis reveals crucial properties that would enable us confine the Collatz process to a finite number of analytical states in Section 5.
4.1. Mapping Properties of the Precursor Set: Initial Transitions
We begin by analyzing the behavior of the precursor set () under the Collatz function, identifying the set to which its elements are mapped in the subsequent iteration.
Lemma 1
( mapping: Descending from the infinite, ordered past). Iterates from the precursor set follow a predictable descent, remaining within until their final transition to .
That is, if , then .
Proof. Proof overview: We express an arbitrary as and apply the Collatz function. Depending on whether j is odd or even, lands in or remains in , respectively.
Step 1: Express x in terms of .
Thus, for some positive integer j.
Step 2: Apply the Collatz function.
Step 3: Analyze based on the parity of j.
Conclusion: In both cases, . □
4.2. Finite Transition from Precursor to ROM3
We now establish a crucial property of the Precursor set (): that repeated application of the Collatz function to any element in will, in a finite number of steps, result in an element in the ROM3 set (). This property is essential for demonstrating the deterministic transition between the initial states of our finite state machine, as will be shown in Section 5.
Lemma 2
(Finite Transition from to ). For any , there exists a finite integer such that , where denotes the n-fold application of the Collatz function (with ).
Proof. By definition, if
, then
for some positive integer
k. We can write
k as
, where
is an integer and
b is an odd integer. Substituting this into the expression for
x, we get:
Now, consider the repeated application of the Collatz function. Since
x is even, we repeatedly divide by 2:
Since b is odd, is an odd multiple of 3. Therefore, . We have found a finite such that . □
4.3. Transition from ROM3 Set to Immediate Successor Set
Following the flow of sequences, we next examine the transformation of the ROM3 set () under the Collatz function, revealing its predictable successor set (). We will demonstrate later that, once a sequence crosses into , it can never return to or .
Lemma 3
(
mapping to immediate successor set
).
For every , we have
Proof. Proof overview: We express an element as (with j odd), apply the Collatz function, and show the resulting number fits the definition of .
Step 1: Express x in terms of .
If , then for some odd integer j.
Step 2: Apply the Collatz function.
Step 3: Verify membership in .
By Definition 8, numbers of the form (with j odd) belong to .
Conclusion: Hence, for every , we have . □
4.4. Descent from Immediate Successor Set into the Exclusion Set
Continuing our analysis of set transitions, we now investigate the immediate successor set () and its image under the Collatz function.
Lemma 4
(Mapping from
to exclusion).
If , then
Proof. Proof overview: We show that for , after applying the Collatz function, the resulting number satisfies the conditions for membership in ; that is, it does not belong to , , , or and the reverse Collatz operation is defined.
Step 1: By Definition 8, if
then
Step 2: Since
x is even, applying the Collatz function yields
Step 3: Verify that satisfies the conditions for :
because and .
: If for some odd k, then and , a contradiction.
or : Similar contradictions arise.
Conclusion: Thus, . □
4.5. Confinement of Sequences Within the Bounded State Space
A crucial step in our analysis is to demonstrate that once a Collatz sequence enters the exclusion set (), it remains confined to a specific subset of our state space, facilitating a more detailed examination of its long-term behavior.
Lemma 5
(Confinement).
If , then
Proof. Proof overview: We prove by contradiction that if , then cannot lie in or ; therefore, it must belong to , , or .
Case 1: Suppose .
Then for some odd j.
If x is even, then implies , so , contradicting .
If x is odd, then implies , which is impossible.
Case 2: Suppose .
Then for some .
If x is even, then implies , so , contradicting .
If x is odd, then implies , impossible.
Conclusion: Since
and
, it follows that
□
4.6. Invariance and Absorbing Nature of the Cycle Set
We now confirm that the known cycle set () has a critical property: once a Collatz sequence enters this set, it never leaves, establishing it as an absorbing set for the Collatz dynamic.
Lemma 6
(Cycle set invariance).
If , then
where .
Proof. Proof overview: We verify the invariance of the cycle set by checking that applying the Collatz function to each element in yields an element that remains in .
Conclusion: In every case, . Thus, the cycle set is invariant under the Collatz function. □
5. Finite State Analysis of Collatz Dynamics
Leveraging the integer partition (Section 3) and set transition properties (Section 4), this section constructs a 17-state finite state machine (FSM) that completely models Collatz dynamics. First, we define the FSM’s components based on Modulo 9 analysis: the initial states corresponding directly to the sets ; the 12 transient states within Stage derived from residue analysis of sets ; and the terminal cycle states () representing the elements of set . Then the core of this section conducts a detailed analysis of the deterministic transitions between all these states under the Collatz function. Specifically, we establish the finite and irreversible transition from the initial states into Stage . Furthermore, we prove that Stage forms a strongly connected component (SCC). Our analysis also identifies a unique gateway state () to the absorbing cycle . Finally, we prove that no Collatz sequence can survive an infinite walk in Stage , setting the stage for our convergence proof in Section 6.
5.1. Definitions - Stages and States
Definition 10
(Initial stage ). Stage corresponds to the union of sets and . This initial stage consists of all positive integers divisible by 3. We break this stage into two states:
The sets and are disjoint by definition (or by Theorem 1), ensuring these states are distinct.
Definition 11
(Transient stage ). Stage corresponds to the union of sets and . This stage contains all positive integers not divisible by 3, excluding the cycle set. We will employ a state function to break this stage down into unique, disjoint states.
Definition 12
(Terminal stage - Cycle States). Stage comprises the three states that represent the elements of cycle set . The cycle states are defined as follows:
: Represents the number 1. Formally, .
: Represents the number 2. Formally, .
: Represents the number 4. Formally, .
By Lemma 6, the transitions between these states follow the Collatz function (), and the cycle set is invariant, causing sequences entering this stage to cycle indefinitely.
Definition 13
(State function for stage
).
The state of a positive integer is defined by the triplet
Remark 1
(Why Modulo 9 is Optimal for Stage ). The choice of modulus 9 in the state function (Definition 13) is specifically tailored to analyzing the behavior of numbers that arenot divisible by 3—that is, numbers in the sets , which together form the entiretransient stage of our 17-state FSM. This choice is motivated by several key observations:
- 1.
-
Restriction to Residues Coprime to 3:Within , the residues of integers not divisible by 3 are:
These six residue classes correspond precisely to the admissible residues for elements in , making modulo 9 a natural framework for organizing this stage of the dynamics.
- 2.
-
Structured Behavior Under :For odd integers , the map induces predictable transformations modulo 9. For example:
These congruences govern how states evolve under the Collatz function and are central to defining deterministic transitions in the transient stage.
- 3.
Balanced Granularity:Modulo 9 is fine enough to distinguish the essential behavior classes for numbers not divisible by 3, yet coarse enough to avoid the greater complexity that might arise from moduli like 18 or 27 without necessarily resolving all state-transition branching.
- 4.
Exact Fit for State Classification:The state function using (residue mod 9, Set , parity) results in exactly 12 valid and disjoint states ( through ) that perfectly partition the transient stage , as demonstrated in Lemma 7.
- 5.
Identification of a Key Funnel State ():The explicit distinction of Set within the state definition uniquely identifies State as the sole entry point into the transient stage for all sequences originating from the infinite sets and . This follows from the deterministic transition (established in Lemma 9, which relies on Lemma 3). Identifying as this "funnel" state captures a crucial aspect of the sequence dynamics and may prove valuable for future investigations into sequence merging after exiting the multiples-of-3 stages.
Thus, the use of modulo 9, combined with the set distinction and parity, provides a well-justified and structurally informative framework required to fully classify Collatz behavior in the transient stage, enabling the deterministic analysis within our 17-state finite state machine.
5.2. Partitioning of Stage
Using the defined state function, we enumerate the resulting finite set of 12 disjoint states that partition the transient stage .
Lemma 7
(12-State Partition of ). The state function in Definition 13 defines a partition of stage into 12 disjoint states: . That is, for every there exists a unique index i with such that , and for any distinct indices , the sets of numbers that map to and are disjoint.
Proof. We prove the lemma in two parts: (1) that for every there exists a unique state with (exhaustiveness), and (2) that these states are pairwise disjoint (mutual exclusivity).
(1) Uniqueness of the state assignment: By definition, the state function assigns to each x a triplet consisting of:
The residue . For x in , the allowed residues are .
A secondary component
, where
which is well defined and disjoint.
The parity function , which is uniquely determined by whether x is even or odd.
Thus, each
is assigned a unique triplet, which by construction corresponds to exactly one of the following 12 states:
(2) Mutual exclusivity: Suppose for contradiction that there exist two distinct indices such that an element x satisfies and . Since the components of (i.e., the residue , the set indicator , and the parity ) are uniquely determined by x, it is impossible for two different triplets to be equal. Hence, the states and must be disjoint.
Conclusion: Every is assigned exactly one state , and the collection forms a partition of stage . □
Remark 2
(Structure of the Full FSM). It is important to emphasize that the 12 states defined by the state function in Definition 13 constitute only thetransient stage of the full 17-state finite state machine. The FSM as a whole also includes:
Theinitial stage, representing all integers divisible by 3.
Theterminal cycle stage, which captures the absorbing cycle .
Thus, while the transient stage handles the majority of the Collatz process, it operates as one of three structurally distinct phases in a unified finite-state framework.
5.3. Completeness of State Partition
Having successfully defined all our states, we now prove that every positive integer corresponds to exactly one of the partitioned states.
Lemma 8
(Completeness of State Assignment). Every positive integer n corresponds to exactly one state in the 17-state FSM defined by Definitions 10, 12, and 11 (or equivalent labels).
Proof. We need to show that for any positive integer n, there exists a unique state S in the set such that n maps to S.
By Theorem 1, the sets form a partition of the positive integers . Therefore, any given belongs to exactly one of these five sets.
Furthermore, every integer n has a unique residue modulo 9 and a unique parity (Even or Odd).
We examine the state definitions based on the unique set membership of n:
If , then by definition, n corresponds uniquely to state .
If , then by definition, n corresponds uniquely to state .
-
If , then n must be 1, 2, or 4.
If , it corresponds uniquely to state .
- -
If , it corresponds uniquely to state .
- -
If , it corresponds uniquely to state .
If , by definition of , for some odd j. This implies and n is always Even. The state function yields , which corresponds uniquely to state .
-
If , then by definition, . This means , so the possible residues modulo 9 are . We examine the combinations:
- -
If : By definition, all numbers in satisfy and are Even. Since contains all numbers where j is odd, and contains numbers not in , any with cannot be Even (otherwise it would be in ). Therefore, if and , nmust be Odd. This corresponds uniquely to state . The combination does not exist for any n.
- -
-
If : For each of these 5 residues, an integer can be either Even or Odd. This yields possible combinations. These are uniquely covered by the state definitions:
- ∗
Residue 2: ,
- ∗
Residue 4: ,
- ∗
Residue 5: ,
- ∗
Residue 7: ,
- ∗
Residue 8: ,
Thus, the state covers the only possible combination for with residue 1, and the states through cover the 10 possible combinations for with residues 2, 4, 5, 7, or 8. In total, the 11 states uniquely cover all possibilities for an integer .
Since every belongs to exactly one of the partitioning sets, and the state definitions uniquely determine a state based on this set membership combined with the unique residue mod 9 and parity (or the specific value for ), every positive integer n corresponds to exactly one state in the 17-state FSM. □
5.4. Deterministic and Finite Transition from Stage to Stage
We now demonstrate the deterministic and finite transition from the initial stage, (representing multiples of 3), to the transient stage, . This transition is irreversible; once a sequence enters , it cannot return to being a multiple of 3.
Lemma 9
(Stage to Stage Transition). The initial stage of the 17-state FSM, , has the following transitions:
- 1.
always transitions to in a finite number of steps.
- 2.
always transitions to in a single step.
Proof. We prove each transition separately:
- 1.
Transition from to (Finite): By definition, state corresponds to the set . Lemma 2 directly states that for any , there exists a finite integer such that . Since state corresponds to the set , this directly implies that any element in state transitions to state in a finite number of steps.
- 2.
Transition from to (Single Step): By definition, state corresponds to the set (Definition 10) and corresponds to (Lemma 7). Lemma 3 states that for all , . This directly implies that transitions to in a single step.
Therefore, any starting number, whether in or , is guaranteed to enter the 12-state stage in a finite number of steps. Furthermore, by Lemma 5, once a sequence enters stage , it can never return to , making this transition irreversible. □
5.5. State Transition Analysis for Transient Stage
We now meticulously analyze how the Collatz function causes transitions between the defined states in stage .
Lemma 10 (State Transition Analysis (12 States)) The transitions between the 12 states under the Collatz function are as follows:
From to (residue 5, , even) or (residue 5, , odd).
From to (residue 4, , even).
From to (residue 1, , even) or (residue 1, , odd).
From to (residue 7, , even).
From to (residue 2, , even) or (residue 2, , odd).
From to (residue 4, , even).
From to (residue 7, , even) or (residue 7, , odd).
From to (residue 7, , even).
From to (residue 8, , even) or (residue 8, , odd).
From to (residue 4, , even).
From to (residue 4, , even) or (residue 4, , odd) or (4, , even).
From to (residue 7, , even).
Proof. We analyze each transition case by case.
Case 1: or .
Setup: Let , so for some integer .
Collatz Step:.
Residue:.
Set Membership: (since ), and (contradiction modulo 9). Therefore, .
Parity: If k is even, is odd (). If k is odd, is even ().
Setup: Let , so for some positive integer m.
Collatz Step:.
Residue:.
Set Membership: (since , ) and (contradiction modulo 9). Therefore, .
Parity: is even.
Case 3: or .
Setup: Let , so for some positive integer m.
Collatz Step:.
Residue:.
Set Membership: (since , ). If m is odd, (). Otherwise, if m is even, then ().
Parity: see Set Membership.
Setup: Let , so for some integer .
Collatz Step:.
Residue:.
Set Membership: (since ) and (contradiction modulo 9). Thus, .
Parity: is even.
Case 5: or .
Setup: Let , so for some positive integer m.
Collatz Step:.
Residue:.
Set Membership: (since , ) and (contradiction modulo 9). Therefore
Parity: If m is even, is even (). If m is odd, is odd ().
Setup: Let , so for some integer .
Collatz Step:.
Residue:.
Set Membership: (since ) and (contradiction modulo 9). Thus, .
Parity: is even.
Case 7: or .
Setup: Let , so for some integer .
Collatz Step:.
Residue:.
Set Membership: (since ) and (contradiction modulo 9). Thus, .
Parity: If k is even, is odd (). If k is odd, is even ().
Setup: Let , so for some integer .
Collatz Step:.
Residue:.
Set Membership: (since ) and (contradiction modulo 9). Thus, .
Parity: is even.
Case 9: or .
Setup: Let , so for some integer .
Collatz Step:.
Residue:.
Set Membership: (since ) and (contradiction modulo 9). Thus, .
Parity: If k is even, is even (). If k is odd, is odd ().
Setup: Let , so for some integer .
Collatz Step:.
Residue:.
Set Membership: (since ) and (contradiction modulo 9). Thus, .
Parity: is even.
Case 11: or or .
Setup: Let , so for some integer .
Collatz Step:.
Residue:.
Set Membership: (contradiction modulo 9).
Cycle Entry (Gateway): If , then and , representing a transition into the cycle stage from stage . Otherwise, for , . Therefore .
Parity: If k is even, is even (). If k is odd, is odd ().
Setup: Let , so for some integer .
Collatz Step:.
Residue:.
Set Membership: (since ) and (contradiction modulo 9). Thus, .
Parity: is even.
These transitions fully define the behavior of the FSM within stage , and demonstrate the crucial property that the next state is uniquely determined by the current state. This includes the specific condition where the system transitions into the terminal cycle stage (). □
5.6. Determinism of FSM Evolution
Lemma 11
(Determinism of FSM Evolution). Let be the set of 17 states, and let `getState` be the state assignment function. The evolution of any Collatz sequence under this state assignment isdeterministic. That is, for any positive integer x, the state of its Collatz successor, , is uniquely determined by x. Consequently, the sequence of states is uniquely determined for any starting number .
Proof. We need to show that for any , the value is uniquely defined and belongs to .
By Lemma 8, every positive integer maps to exactly one state in . Since produces a unique positive integer for any , must map to exactly one state .
To be more explicit, we can examine the transitions based on the state :
- 1.
-
If :
If , . By Lemma 1, . Thus, is either or , both unique states in .
If , . By Lemma 3, . Since all elements of map uniquely to state , , a unique state in .
- 2.
-
If : Lemma 10 provides a case-by-case analysis based on . For each case, it determines the properties of (its residue mod 9, its parity, and whether it falls into or ).
For states like , the analysis shows that always maps to a single specific successor state ( respectively), regardless of the specific x within .
For states like , the analysis shows that maps to one of two or three possible successor states (, , , , , respectively). However, the specific successor state is uniquely determined by properties of x (like the parity of k or m in ). Since x is given, is unique, and therefore is also unique, landing in exactly one of those specified possible successor states.
In all sub-cases, results in a unique state within .
- 3.
If : The transitions ensure that is respectively, which are unique states in .
Since for any , is unique and maps to a unique state in , the evolution process defined by repeatedly applying C and then getState is deterministic for any starting number . □
5.7. State as the Unique Gateway
We establish that is the only state in the transient stage that can lead into the cycle stage .
Lemma 12
(S11 as the Unique Gateway State). Within the 17-state FSM, state is the unique gateway from Stage to Stage .
Proof. We proceed in three steps.
- 1.
-
List all preimages of the cycle elements.
(only preimage of 1 is 2);
(only preimage of 2 is 4);
and (preimages of 4 are 1 and 8).
- 2.
Identify the external preimage. The only number not already in that maps into it is 8, with .
- 3.
Locate this in the FSM. By definition , and Lemma 10 (Case 11) gives the transition . No other transient state maps directly to 1, 2, or 4.
Therefore is the unique gateway to the cycle. □
5.8. State Transition Diagram of the 17-State FSM
Figure 1.
State transition diagram for the 17-state finite state machine modeling Collatz dynamics. The three stages are shown: initial stage (, green), transient stage (, white), and terminal cycle stage (, red). The gateway state (orange) provides the unique transition into the terminal stage. All transitions are deterministic under the Collatz function.
Figure 1.
State transition diagram for the 17-state finite state machine modeling Collatz dynamics. The three stages are shown: initial stage (, green), transient stage (, white), and terminal cycle stage (, red). The gateway state (orange) provides the unique transition into the terminal stage. All transitions are deterministic under the Collatz function.
5.9. Strong Connectivity Within Stage and Reachability of the Gateway State
We now prove a crucial property for convergence: The transient stage forms a strongly connected component (SCC) and every state within it has a finite path leading to the unique gateway state .
Lemma 13
(Strong Connectivity and Recurrence within Stage ). Every state in the subsystem belongs to at least one recurrent cycle of state transitions that includes state .
Proof. We will demonstrate this by showing that every state has a path to (reachability), and that any path originating from will eventually return to a state that has a path to . This establishes the cyclical nature.
Part 1: Reachability of
Let be the set of states from which state can be reached in k steps or less. We define and . We will show, by induction, that , meaning all states in can reach in at most 4 steps.
If a state transitions to multiple states, it’s assigned to the corresponding to the shortest path to .
(Base Case)
-
- -
or (Lemma 10, Case 9). Since can transition directly to , it follows that .
-
- -
(Lemma 10, Case 4). Since , it follows that .
- -
or (Lemma 10, Case 7). Since , it follows that .
- -
(Lemma 10, Case 8). Since , it follows that .
- -
(Lemma 10, Case 12). Since , it follows that .
-
- -
or (Lemma 10, Case 1). Since and , it follows that .
- -
or (Lemma 10, Case 5). Since , it follows that .
-
- -
(Lemma 10, Case 2). Since , it follows that .
- -
or (Lemma 10, Case 3). Since , it follows that .
- -
(Lemma 10, Case 6). Since , it follows that
- -
(Lemma 10, Case 6). Since , it follows that .
Since , every state in the subsystem has a finite path to state .
Part 2: Cyclical Return from S11
From Lemma 10 (Case 11), transitions to or . From Part 1 above, and can reach in 3 and 4 steps respectively.
This shows all transitions from S11, no matter the path taken, will lead back to a state which can reach S11, hence forming a cycle of states.
Conclusion:
Since every state has a finite path to , and any sequence starting from ultimately returns to a state with a path to, every state in is part of a cycle of states that includes . □
5.10. Classification of Fundamental FSM Segments
To prove that no infinite trajectory can exist, we must first decompose all possible paths into a finite set of fundamental "building blocks" and analyze their algebraic properties. This first lemma performs that classification, demonstrating that the FSM is composed of contractile segments, with the exception of a single, unique edge case that requires special analysis.
Lemma 14
(Classification of Fundamental Segments and Isolation of the Expansive Edge Case). Of the 12 fundamental segments that compose all possible orbits in the transient FSM, 11 are unconditionally contractile. The FSM contains a single, unique segment—the self-loop 9B—which presents a potential edge case for sustained growth.
Proof. The proof is by exhaustive construction and analysis, using a rigorous "chainable" definition of a fundamental segment.
Definition 14 (Chainable Fundamental Segment)AChainable Fundamental Segmentis a deterministic path that begins with the exit operation from a branching state, , proceeds through any intermediate states, arrives at the next branching state, , and concludes with the exit operation from .
Part 1: The 11 Unconditionally Contractile Segments
These 11 segments connect a branching state to a different branching state. They are all proven to be contractile.
Sub-Class A: Direct Contraction (Ratio )
These six segments connect two adjacent branching states. The path involves two consecutive divisions by 2 (one to exit the origin state, one to exit the destination state). The transformation is , which is strongly contractile.
-
1A:
Path
-
3A:
Path
-
5A:
Path
-
7A:
Path
-
9A:
Path
-
11A:
Path
Sub-Class B: Composite Contraction (Ratio )
These five segments connect two branching states via an intermediate linear (odd) state. The path involves three operations: an initial division by 2, a step, and a final division by 2. The transformation is , which is contractile for .
-
1B:
Path
-
3B:
Path
-
5B:
Path
-
7B:
Path
-
11B:
Path
Part 2: Isolation of the Unique Edge Case
Our analysis reveals exactly one fundamental segment that is not unconditionally contractile.
The Edge Case: Segment 9B (The Self-Loop)
This segment is unique because its origin and destination are the same state: . The path is .
Let us analyze its transformation without immediately assuming an exit to a different segment. The atomic operation of one traversal of this loop, from entry at to re-entry at , involves one division by 2 and one operation.
If a sequence were to traverse this loop N times consecutively, the transformation would be approximately . This potential for repeated application of an expansive operator makes Segment 9B the critical edge case.
Conclusion
The transient stage of the FSM is composed of a system of 11 unconditionally contractile segments and a single, unique self-loop (9B) that is locally expansive. This self-loop constitutes the only structural mechanism that could possibly support sustained growth and must be resolved separately to prove convergence. □
5.11. Resolution of the Expansive Edge Case
The previous analysis isolated a single, unique segment (9B) with the potential for local expansion. This lemma now focuses exclusively on this edge case, proving by algebraic contradiction that it cannot be traversed indefinitely by any positive integer.
Lemma 15 (Resolution of the Expansive Edge Case)The unique, locally expansive segment (9B: ) cannot be traversed indefinitely by any positive integer. Its algebraic structure makes infinite traversal a mathematical impossibility.
Proof. The proof is by algebraic contradiction. We demonstrate that the assumption of infinite traversal requires the starting integer to be , which is not in the domain of the Collatz conjecture.
- 1.
The Atomic Transformation: As established in Lemma 14, the transformation for a single atomic traversal of the
loop is:
where
is the sequence of integers in state
at each iteration of the loop.
- 2.
-
The Algebraic Insight: Instead of analyzing the transformation on
, we analyze the transformation on the quantity
.
This reveals a pure geometric progression. Each traversal of the loop multiplies the quantity by a factor of .
- 3.
-
The Implication of Infinite Traversal: If a trajectory traverses this loop
n times, the relationship becomes:
We can rearrange this to solve for the integer
after
n loops:
- 4.
-
The Fatal Contradiction: By definition, every number in a Collatz sequence must be an integer. For to be an integer for all values of n, the term must resolve to an integer. This is only possible if the denominator, , perfectly divides the term . Since 3 and 2 are coprime, this simplifies to the condition that must perfectly divide .
For an
infinite traversal to be possible, this condition must hold for all possible values of
. The only integer that is divisible by
for all
n is zero.
The assumption of an infinite traversal leads to the necessary conclusion that the starting integer must be . However, the Collatz conjecture is defined exclusively over the set of positive integers.
Conclusion
No positive integer can satisfy the algebraic condition required for infinite traversal of Segment 9B. The loop is a self-terminating sieve that has no fixed points in the positive integers. The edge The edge case is resolved. □
5.12. Finitude of All Trajectories
Having classified all fundamental segments and proven that the single expansive edge case cannot be traversed indefinitely, we now synthesize these results to prove that all trajectories within the transient stage must be finite.
Lemma 16 (Impossibility of Infinite Trajectories in the Transient Stage)No Collatz trajectory can proceed indefinitely within the transient stage . Every trajectory must be finite.
Proof. The proof is by contradiction, synthesizing the results of Lemma 14 and Lemma 15.
- 1.
The Setup: Assume, for the sake of contradiction, that an infinite trajectory exists within the transient stage . This infinite path must be a concatenation of the fundamental segments defined in Lemma 14.
- 2.
The Role of the Edge Case: By Lemma 14, 11 of the 12 fundamental segments are unconditionally contractile. An infinite trajectory of positive integers composed solely of these segments is impossible, as it would form an infinite, strictly decreasing sequence, which violates the Well-Ordering Principle.
- 3.
Therefore, for an infinite trajectory to exist, it must traverse the single, locally expansive Segment 9B an infinite number of times to counteract the contractile nature of the rest of the system.
- 4.
The Contradiction: Lemma 15 proved by algebraic contradiction that no positive integer can traverse Segment 9B indefinitely. Any trajectory can only traverse this segment a finite number of times before it is structurally forced to exit.
- 5.
This creates a fatal contradiction. An infinite trajectory requires infinite traversals of Segment 9B, but the algebraic structure of Segment 9B forbids infinite traversal by any positive integer.
- 6.
Conclusion: The initial assumption of an infinite trajectory is false. Every trajectory must be finite.
□
6. Proof of the Collatz conjecture: Convergence to the Unique Cycle
In this section, we synthesize all our analysis in the preceding sections to prove our main result: that the Collatz Conjecture is true - every Collatz sequence, no matter the starting number, is ultimately drawn into the cycle .
Theorem 2 (The Collatz Conjecture)Every starting integer eventually reaches the cycle
under repeated application of the Collatz function .
Proof. We prove the conjecture by showing that, within the 17-state FSM framework (comprising stages , , and ), every trajectory starting from any initial state eventually reaches and remains within the cycle stage , which represents the unique Collatz cycle .
The proof proceeds by analyzing the flow through the FSM stages:
- 1.
Initial State Assignment: By Lemma 8, every positive integer x corresponds to exactly one initial state within the 17-state FSM.
- 2.
Transition from Stage : If x starts in Stage (states or ), Lemma 9 establishes that its trajectory transitions into Stage (specifically state ) in a finite number of steps. Lemma 5 ensures the sequence cannot return to Stage .
- 3.
-
Evolution within or into Stage : Any sequence not starting in Stage will thus eventually enter or already be in Stage . Its behaviour within this stage is governed by:
- (a)
All transitions within Stage and from Stage into Stage are deterministic (Lemma 10).
- (b)
is the unique gateway from Stage to Stage (Lemma 12).
- (c)
The unique gateway is reachable from any state in Stage in a finite number of steps (Lemma 13).
- (d)
-
Crucially, infinite survival within Stage is impossible. This is established by a comprehensive three-lemma argument:
Lemma 14 proves that all trajectories are composed of fundamental segments, of which 11 are unconditionally contractile, isolating a single, unique edge case (Segment 9B) with the potential for local expansion.
Lemma 15 proves by infinite descent on the controlling integer parameter k that this unique edge case cannot be exploited for sustained growth.
Lemma 16 synthesizes these results to prove by contradiction that no infinite trajectory can exist within the transient stage.
- 4.
-
Absorption in Stage : By (3a)–(3d), every trajectory in
must exit this stage in finite time. Since the only exit is through the unique gateway state
, the sequence must execute the transition:
Once the trajectory enters (the number 4), it is confined to the terminal cycle stage . By Lemma 6 (Cycle Set Invariance), the sequence remains permanently within the invariant cycle .
Therefore, any trajectory starting from corresponds to a path in the 17-state FSM that inevitably leads, in a finite number of steps, to the absorbing cycle stage . This demonstrates that every positive integer eventually reaches the cycle , completing the proof. □
7. Computational Verification
To empirically test the theoretical claims of our 17-state finite state machine (FSM) - including state assignments, transition rules, and the unique gateway mechanism - we implemented a computational verification over a large numerical range. The goal was to confirm that Collatz sequences evolve exactly as predicted by the FSM structure.
A Python script (verify_collatz_fsm.py) was written using the multiprocessing module (with 8 workers) to test all integers from 1 up to . For each starting value n, the script traced its Collatz sequence and performed the following checks at every step until reaching the cycle set :
Initial state classification: Confirmed that each n is correctly mapped to one of the 17 FSM states via the getState function.
Deterministic transition verification: Ensured that each observed transition conformed exactly to the FSM’s transition rules (Lemma 10).
Gateway consistency: Verified that any transition to 4 (i.e., to ) occurred only from either (in ) or (in ), as required by the FSM structure.
State coverage: Ensured that no number encountered during the sequence evaluation mapped to an undefined or invalid state.
Step count: Recorded the number of steps required for each sequence to reach 1.
A summary of the results is shown in
Table 1. All checks passed without error, and no violations were detected.
These results confirm the empirical soundness of the finite state model over all tested inputs. Every transition was deterministic, every number remained confined within the FSM structure, and the unique gateway mechanism through behaved exactly as predicted. Notably, the number achieved the maximum stopping time within this range, consistent with prior computational records.
This large-scale verification strongly reinforces the validity of the FSM framework and its predictive power in modeling Collatz dynamics.
8. Empirical Evidence from Large-Scale Collatz Computations
Over the decades, extensive computational searches have provided a substantial body of evidence regarding the behavior of Collatz sequences. Numerous studies have explored Collatz sequences for extremely large starting values - with some computations reaching up to
(Oliveira e Silva [
7]) - and ongoing distributed computing projects, such as the BOINC Collatz Conjecture project (BOINC [
1]), continue to expand this empirical base. These large-scale computations have consistently demonstrated that:
Boundedness: No starting number tested has produced a Collatz sequence that grows without bound; all sequences examined remain within finite limits.
Convergence to the 4-2-1 Cycle: Every Collatz sequence observed eventually enters the cycle (or the equivalent permutation ), regardless of the starting value.
No Other Cycles Found: Despite exhaustive searches, no cycles other than the trivial cycle (or its cyclic permutations) have ever been discovered.
This extensive empirical evidence is entirely consistent with and strongly supports the theoretical results established in this paper - specifically, the theorems that prove boundedness, the non-existence of non-trivial cycles, and the eventual convergence to the trivial cycle.
9. Comparison with Previous Approaches
The Collatz Conjecture has been extensively studied using diverse mathematical techniques [
4,
5,
6]. Our approach - combining a structured state-space framework with deterministic transition analysis - provides a fundamentally distinct resolution. In this section, we contextualize our proof within the broader landscape of Collatz research.
9.1. Limitations of Prior Methods
Most previous approaches, while yielding valuable insights, have fundamental limitations that prevented a complete resolution:
Probabilistic and Statistical Models [
4,
6] suggest that, on average, Collatz sequences tend to decrease. However, they cannot establish boundedness for
all initial values, leaving open the possibility of exceptional unbounded orbits.
Computational Verification [
1,
7] confirms the conjecture for extremely large numbers but cannot provide a proof for all integers.
Dynamical Systems and Ergodic Theory [
5,
6] yield statistical insights into typical trajectories but struggle with the discontinuous nature of the Collatz map.
Modulo Arithmetic and Congruence Class Methods demonstrate boundedness within specific residue classes but fail to extend these properties globally.
Contradiction-Based Arguments often rely on unproven assumptions or fail to rigorously eliminate all counterexamples.
Tao’s "Almost All" Result [
8] proves that most orbits are bounded but does not establish boundedness for every number.
9.2. Novelty and Strengths of the Presented Proof
Our proof resolves the Collatz Conjecture through a state-space approach that provides a complete classification of all trajectories, guaranteeing convergence to the unique cycle .
Key innovations include:
Complete State-Space Partition: We classify into five mutually exclusive sets——that exhaust all possible Collatz trajectories, enabling full structural analysis.
Finite State Machine Reformulation: A 17-state finite state machine (FSM) recasts the problem from unbounded numerical iteration into finite deterministic state evolution.
Unified Resolution: The FSM framework simultaneously excludes both non-trivial cycles and divergent trajectories via structural analysis (Lemma 16).
Algebraic Exclusion of Divergence: The only mechanism that could support divergence - an expansive loop - is proven algebraically impossible for all positive integers (Lemma 15).
Deterministic Convergence: All sequences follow finite, structured paths to absorption in , with all indefinite survival mechanisms rigorously excluded.
10. Conclusions
We have presented a complete, structurally grounded proof of the Collatz Conjecture, leveraging a novel framework that interprets Collatz sequences as deterministic trajectories within a structured state space. By partitioning the positive integers into five mutually exclusive sets - namely, the cycle set , ROM3 set , precursor set , immediate successor set , and exclusion set - we have developed a systematic classification that fully captures the behavior of Collatz iterations.
Our proof follows a two-stage approach:
- 1.
We establish that the only possible cycle is , by showing that any cycle must contain the number 1 and is therefore confined to the trivial cycle. The non-existence of other cycles is confirmed by established results in the literature.
- 2.
We prove that every Collatz sequence must reach in finite time, using a deterministic finite state machine (FSM) analysis. The FSM framework guarantees that all sequences undergo a systematic, finite progression. Critically, we prove that the only mechanism for divergence within the FSM - an expansive loop - is algebraically impossible for any positive integer.
With these results, we conclude that every positive integer is eventually drawn into the cycle, thereby resolving the Collatz Conjecture.
Crucially, our approach diverges from traditional bounded growth arguments by demonstrating that sequences do not merely remain within a finite bound - they are structurally confined and systematically directed toward termination. The deterministic nature of our finite state machine analysis ensures that all trajectories are forced into a terminal condition, rather than merely avoiding unbounded divergence. This fundamental shift in perspective transforms the problem from one of numerical control to one of inevitable dynamical convergence.
Beyond settling this long-standing open problem, our work demonstrates the effectiveness of a state-space-driven, set-theoretic approach in analyzing complex iterative systems. This methodology may provide a blueprint for addressing similar problems in number theory and discrete dynamical systems, offering new insights into how deterministic constraints govern seemingly chaotic processes.
11. Need for Verification and Future Directions
11.1. Need for Rigorous Verification
While the proof presented in this paper offers a distinct and potentially compelling approach to the Collatz Conjecture - particularly through the use of the product equation and prime factorization for cycle analysis - rigorous validation by the broader mathematical community is essential. The history of the Collatz Conjecture is replete with proposed proofs that were later found to contain flaws. Therefore, thorough and independent scrutiny of every step of this proof, especially the derivation and application of the product equation for cycle analysis, the partitioning of the state space, the construction and transition analysis of the 17-state FSM, and the proof of convergence via gateway state reachability, is paramount. This validation should involve expert peer review through journal submissions, detailed examination by specialists in number theory, presentations at conferences, and open dissemination for public scrutiny. Until such rigorous validation is complete, the result remains a proposed proof that, we believe, provides a sound and novel pathway toward resolving this longstanding problem.
11.2. Potential Avenues for Future Research
If validated, the proof presented here would not only resolve the Collatz Conjecture but also open new avenues for research in number theory and related fields. Potential directions for future work include:
Generalization of the Product Equation Technique: Investigate whether the product equation method introduced in this paper can be generalized or adapted to study cycle structures and dynamics in other iterative functions or number-theoretic problems.
Refinement and Simplification of the Proof: Explore alternative formulations of the arguments, particularly prime factorization and finite state analysis, to achieve greater clarity or elegance and potentially shorter proofs.
Alternative FSM Constructions: Explore the construction and analysis of finite state machines for the Collatz dynamics based on different moduli (e.g., modulo 12, modulo 36) or alternative state definition criteria. Compare the resulting state counts, the nature of state transitions (determinism vs. branching), the revealed structural features, and the complexity of proving convergence within these alternative FSM frameworks relative to the modulo 9 FSM presented here.
Computational Exploration Inspired by the Proof: With convergence established, further computational studies of stopping time distributions, average trajectory behavior, and other statistical properties of Collatz sequences could yield valuable insights.
Applications to Related Conjectures: Determine whether the insights and techniques from this work can be applied to other unsolved problems or related conjectures in the realm of iterative number theory and dynamical systems.
FSM Methodology for Other Dynamical Systems: Investigate whether the techniques used to construct and analyze the 17-state FSM (based on set partitioning, residue classes, and transition mapping) can be adapted to model and prove properties of other number-theoretic sequences or discrete dynamical systems.
Educational and Expository Development: Develop pedagogical materials and simplified expositions of this proof to make it accessible to a broader mathematical audience, including students and researchers. Such efforts might include clearer visualizations, intuitive explanations of key steps, and adaptations of the proof for classroom use.
12. Acknowledgments
The author acknowledges his wife, Ajifa Atuluku, for her steadfast encouragement throughout the process of drafting this proof. The author also acknowledges the use of AI-assisted tools (Google Gemini AI and ChatGPT) for formatting assistance and language clarity. All mathematical content and original ideas in this manuscript were developed independently by the author.
Data availability statement
The Python script used to generate the computational verification data presented in this proof is available online at the following open code repository:
[Link to Code Repository].
References
- BOINC, Collatz conjecture project, (archived version, accessed March 8, 2025). Retrieved from https://web.archive.org/web/20090915183543/http://boinc.thesonntags.com/collatz/.
- Collatz, L. , Aufgaben E., Mathematische Semesterberichte 1 (1950), 35.
- Conway, J. H. , Unpredictable iterations, in Proceedings of the 1972 Number Theory Conference (Boulder, CO: University of Colorado, 1972), 49–52.
- Lagarias, J. C. , The 3x+1 problem and its generalizations, American Mathematical Monthly 92 (1985), 3–23.
- Lagarias, J. C. , The 3x+1 problem: Annotated bibliography (1963–1999), in de Gruyter Series in Nonlinear Analysis and Applications 6 (Berlin: Walter de Gruyter, 2004), 189–299.
- Lagarias, J. C. , The Collatz conjecture, Chaos 20(4) (2010), 041102.
- Oliveira e Silva, T. , Empirical verification of the 3x+1 and related conjectures, in The ultimate challenge: The 3x+1 problem, edited by J. C. Lagarias, American Mathematical Society, Providence, Rhode Island, USA, 2010, pp. 189–207.
- Tao, T. , Almost all orbits of the Collatz conjecture are bounded, Journal of the American Mathematical Society 32(1) (2019), 1–89.
- Thwaites, B. , My conjecture, Bulletin of the Institute of Mathematics and Its Applications 15(2) (1979), 41.
|
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).