A Proof of the Infinitude of Prime Numbers

Md. Shouvik Iqbal

doi:10.20944/preprints202406.1296.v2

Submitted: 21 February 2025

Posted: 25 February 2025

Abstract

This paper presents a new proof of the infinitude of prime numbers, commonly known as "Euclid's theorem." Grounded in fundamental set theory, the proof uses the method of contradiction to demonstrate the absurdity of assuming that the set of all prime numbers is finite.

Keywords: Prime Numbers; Infinitude of Primes; Euclid's theorem; Proof of Euclid's theorem; Proof of Infinitude of Primes; Set Theory; Proof by Contradiction

Subject: Computer Science and Mathematics - Algebra and Number Theory

1. Introduction

Among the natural numbers, there exists a category of numbers so fundamental that they are best described as “prime numbers.” Their defining property is that they cannot be written as a product of smaller whole numbers; that is, their only positive divisors are 1 and themselves. Euclid,¹ in his collection of books known as the “Elements” (circa 300 BCE), appears to have provided the earliest documented definition of prime numbers. In Book IX, Proposition 20 of the Elements, Euclid proved that there are infinitely many prime numbers² [2,3]. The proof is of such consequence that it is still valued today [4]. Euclid’s proof established that there is no largest prime number. Since Euclid, many mathematicians have supported this proposition with proofs of their own; several such proofs can be found in [5,6,7,8]. In this paper, we give another proof of this proposition. The motivation becomes visible when the following sets are considered,

P = {the set of all prime numbers}

C = {the set of all composite numbers}

This implies $N^{*} = P \cup C$, where $N^{*} = \{n \in \mathbb{N} \mid n \geq 2\}$. However, if $|P|$ is assumed finite, a point that is easily overlooked is that $N^{*} = P \cup C \cup S$, where $S$ is the set of all natural numbers that are neither prime nor composite. Such a set is impossible, since every element in it would be greater than 1, and every natural number greater than 1 is either prime or composite. Thus, by contradiction, it is possible to prove that the assumption of a finite set containing all prime numbers is false.

Stated informally, without any formalism, the core idea of the motivation is as follows. Suppose there exists only a finite number of prime numbers, say only 2, 3, 5, and 7, and consider the set of all natural numbers greater than or equal to 2,

2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, \dots

Given the assumption that 2, 3, 5, and 7 are the only prime numbers, they are removed from the set, which leaves,

4, 6, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, \dots

Now, given that prime numbers are those numbers greater than 1 that are divisible only by 1 and themselves, it follows that every composite number is a multiple of a prime number. Therefore, every multiple of 2, 3, 5, and 7 is removed from the set, which leaves,

11, 13, 17, 19, 23, 29, 31, 37, \dots, \underbrace{11 \cdot 11}_{121}, \dots, \underbrace{11 \cdot 13}_{143}, \dots, \underbrace{11 \cdot 17}_{187}, \dots, \underbrace{11 \cdot 19}_{209}, \dots

Now, this is a set of numbers that are neither prime (as the only prime numbers are assumed to be 2, 3, 5, and 7) nor composite (as they are not multiples of any assumed prime number), and all of them are greater than 1. This is a contradiction!
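The informal removal procedure above can be sketched in a few lines of Python. This is only an illustration under the text's assumption that 2, 3, 5, and 7 are the only primes; the function name `survivors` and the cutoff values are hypothetical choices made here, not part of the original argument.

```python
def survivors(assumed_primes, limit):
    """Return the numbers in [2, limit] that are neither one of the
    assumed primes nor a multiple of any of them."""
    return [
        n
        for n in range(2, limit + 1)
        if n not in assumed_primes
        and all(n % p != 0 for p in assumed_primes)
    ]


# Assumption from the text: the only primes are 2, 3, 5, and 7.
leftover = survivors({2, 3, 5, 7}, 40)
print(leftover)  # [11, 13, 17, 19, 23, 29, 31, 37]
```

Every number printed is greater than 1 yet is neither an assumed prime nor a multiple of one, which is the situation the text describes. Raising the cutoff to 150 also lets products such as 11 · 11 = 121 and 11 · 13 = 143 survive.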

Having outlined the basic idea, we now give the complete proof of the infinitude of prime numbers as follows.

2. The Theorem and Its Proof

Theorem 1.

There are infinitely many prime numbers.

Proof.

Assume, for the sake of contradiction, that the set of all prime numbers is finite, and define the following sets,

N^{*} = \{N \in \mathbb{N} \mid N \geq 2\}

P = \{p_{1}, p_{2}, p_{3}, \dots, p_{k}\}

N^{*} ∖ P = \{N \in N^{*} \mid N \notin P\}

C = \{N \in (N^{*} ∖ P) \mid N = n p, \text{ where } p \in P, n \in N^{*}\}

Here, $N^{*} ∖ P \subset N^{*}$ and $C \subseteq N^{*} ∖ P$. To show that $C ⊊ N^{*} ∖ P$, notice that $C$ is the set of all natural numbers of the form $n p$, where $n \in N^{*}$ and $p \in \{p_{1}, p_{2}, p_{3}, \dots, p_{k}\}$, whereas $N^{*} ∖ P$ includes all natural numbers of the form $n k$, where $n, k \in N^{*}$. It will now be shown in three parts that, for any sufficiently large range $ε \in N^{*}$, the total number of natural numbers of the form $n k \leq ε$ is greater than the total number of natural numbers of the form $n p \leq ε$. It then follows that $|N^{*} ∖ P| \neq |C|$ up to any sufficiently large range $ε \in N^{*}$, and that the set $N^{*} ∖ P$ contains more elements than $C$ up to that range. This is then used to form a contradiction.

Part 1: In this part, we calculate the total number of natural numbers in $C$ up to a given range $ε$. Notice that $C$ is the set of all natural numbers of the form $n p$, where $n \in N^{*}$ and $p \in \{p_{1}, p_{2}, p_{3}, \dots, p_{k}\}$. Therefore, for the $i$th prime $p_{i} \in \{p_{1}, p_{2}, \dots, p_{i}, \dots, p_{k}\}$, the natural numbers of the form $n p_{i}$, where $n \in N^{*}$ (that is, the multiples of $p_{i}$), are as follows,

2 p_{i}, 3 p_{i}, 4 p_{i}, 5 p_{i}, \dots

Now, the natural numbers of the form $n p_{i}$, where $n \in N^{*}$, that are less than or equal to a given finite range $ε \in N^{*}$ are

2 p_{i}, 3 p_{i}, 4 p_{i}, \dots, c_{i} p_{i} \leq ε

where $c_{i}$ is the largest integer such that $c_{i} p_{i} \leq ε$. This gives $c_{i} \leq \frac{ε}{p_{i}}$. Since $n \geq 2$ in all natural numbers of the form $n p$, $(c_{i} - 1)$ is the total number of natural numbers of the form $n p_{i}$ less than or equal to the given range $ε$. Now, to count the total number of natural numbers of the general form $n p$ less than or equal to the given range $ε$, the following is done.

\begin{array}{lll}
\text{for } p_{1}: & 2 p_{1}, 3 p_{1}, 4 p_{1}, \dots, c_{1} p_{1} \leq ε \Rightarrow c_{1} \leq \frac{ε}{p_{1}} & ∴ (c_{1} - 1) < c_{1} \leq \frac{ε}{p_{1}} \\
\text{for } p_{2}: & 2 p_{2}, 3 p_{2}, 4 p_{2}, \dots, c_{2} p_{2} \leq ε \Rightarrow c_{2} \leq \frac{ε}{p_{2}} & ∴ (c_{2} - 1) < c_{2} \leq \frac{ε}{p_{2}} \\
\text{for } p_{3}: & 2 p_{3}, 3 p_{3}, 4 p_{3}, \dots, c_{3} p_{3} \leq ε \Rightarrow c_{3} \leq \frac{ε}{p_{3}} & ∴ (c_{3} - 1) < c_{3} \leq \frac{ε}{p_{3}} \\
\vdots & \vdots & \\
\text{for } p_{m}: & 2 p_{m}, 3 p_{m}, 4 p_{m}, \dots, c_{m} p_{m} \leq ε \Rightarrow c_{m} \leq \frac{ε}{p_{m}} & ∴ (c_{m} - 1) < c_{m} \leq \frac{ε}{p_{m}}
\end{array}

where $p_{m} \in \{p_{1}, p_{2}, \dots, p_{m}, \dots, p_{k}\}$ is the $m$th prime number such that $p_{m} < ε$.

This implies that the total numbers of natural numbers of the form $n p_{1}$, $n p_{2}$, $n p_{3}$, $\dots$, $n p_{m}$ less than or equal to the given range $ε$ are $(c_{1} - 1), (c_{2} - 1), (c_{3} - 1), \dots, (c_{m} - 1)$, respectively. Summing these quantities gives the total number of natural numbers of the general form $n p$ less than or equal to $ε$. Let this total be $α$.

Now, note that the abovementioned largest integer $c_{i}$ is always greater than 1, for all $i \in \{1, 2, \dots, m, \dots, k\}$, because each $n$ in a natural number of the form $n p$ is an element of $N^{*}$ (i.e., greater than 1). As a result, $α = (c_{1} - 1) + (c_{2} - 1) + (c_{3} - 1) + \dots + (c_{m} - 1) > 1$, as shown below.

\begin{array}{rl}
1 & < (c_{1} - 1) + (c_{2} - 1) + (c_{3} - 1) + \dots + (c_{m} - 1) = α \\
& < c_{1} + c_{2} + c_{3} + \dots + c_{m} \\
& \leq \frac{ε}{p_{1}} + \frac{ε}{p_{2}} + \frac{ε}{p_{3}} + \dots + \frac{ε}{p_{m}} \\
& = ε \left( \frac{1}{p_{1}} + \frac{1}{p_{2}} + \frac{1}{p_{3}} + \dots + \frac{1}{p_{m}} \right) \\
∴ 1 & < α < ε \left( \frac{1}{p_{1}} + \frac{1}{p_{2}} + \frac{1}{p_{3}} + \dots + \frac{1}{p_{m}} \right)
\end{array}

(1)

where $α$ represents the total number of natural numbers of the form $n p$, where $n \in N^{*}$ and $p \in P$, less than or equal to $ε \in N^{*}$.
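The count in Part 1 can be checked numerically with a minimal sketch. It assumes, purely for illustration, that the finite set of primes is 2, 3, 5, 7 and that the range is 100; the function names `alpha` and `alpha_upper_bound` are hypothetical. Because $c_{i}$ is the largest integer with $c_{i} p_{i} \leq ε$, it equals the integer quotient of $ε$ by $p_{i}$.

```python
from fractions import Fraction


def alpha(assumed_primes, eps):
    """Sum of (c_i - 1) over the assumed primes p_i < eps, where c_i is
    the largest integer with c_i * p_i <= eps."""
    total = 0
    for p in assumed_primes:
        if p < eps:
            c = eps // p  # largest integer c with c * p <= eps
            total += c - 1  # multiples n * p with 2 <= n <= c
    return total


def alpha_upper_bound(assumed_primes, eps):
    """Right-hand side of Equation (1): eps times the sum of 1/p_i."""
    return eps * sum(Fraction(1, p) for p in assumed_primes if p < eps)


primes = [2, 3, 5, 7]  # illustrative assumption, not a claim about P
print(alpha(primes, 100))                     # 113
print(float(alpha_upper_bound(primes, 100)))  # about 117.62
```

Note that the sum counts a number once for each assumed prime that divides it (for example, 6 is counted under both 2 and 3), which is why the total of 113 can exceed the 99 numbers between 2 and 100.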

Part 2: As in the part above, we calculate the total number of natural numbers in $N^{*} ∖ P$ up to a given range $ε$. Notice that the set $N^{*} ∖ P$ contains all natural numbers of the form $n k$, where $n, k \in N^{*}$. Therefore, for the $i$th natural number $k_{i} \in \{k_{1}, k_{2}, \dots, k_{i}, \dots\} = N^{*}$, the natural numbers of the form $n k_{i}$, where $n \in N^{*}$ (that is, the multiples of $k_{i}$), are as follows,

2 k_{i}, 3 k_{i}, 4 k_{i}, 5 k_{i}, \dots

Now, the natural numbers of the form $n k_{i}$ that are less than or equal to a given range $ε \in N^{*}$ are

2 k_{i}, 3 k_{i}, 4 k_{i}, \dots, d_{i} k_{i} \leq ε

where $d_{i}$ is the largest integer such that $d_{i} k_{i} \leq ε$. This gives $d_{i} \leq \frac{ε}{k_{i}}$. Since $n \geq 2$ in all natural numbers of the form $n k$, $(d_{i} - 1)$ is the number of natural numbers of the form $n k_{i}$ less than or equal to the given range $ε$. Now, to count the number of natural numbers of the general form $n k$ less than or equal to the given range $ε$, the following is done.

\begin{array}{lll}
\text{for } k_{1}: & 2 k_{1}, 3 k_{1}, 4 k_{1}, \dots, d_{1} k_{1} \leq ε \Rightarrow d_{1} \leq \frac{ε}{k_{1}} & ∴ (d_{1} - 1) < d_{1} \leq \frac{ε}{k_{1}} \\
\text{for } k_{2}: & 2 k_{2}, 3 k_{2}, 4 k_{2}, \dots, d_{2} k_{2} \leq ε \Rightarrow d_{2} \leq \frac{ε}{k_{2}} & ∴ (d_{2} - 1) < d_{2} \leq \frac{ε}{k_{2}} \\
\text{for } k_{3}: & 2 k_{3}, 3 k_{3}, 4 k_{3}, \dots, d_{3} k_{3} \leq ε \Rightarrow d_{3} \leq \frac{ε}{k_{3}} & ∴ (d_{3} - 1) < d_{3} \leq \frac{ε}{k_{3}} \\
\vdots & \vdots & \\
\text{for } k_{j}: & 2 k_{j}, 3 k_{j}, 4 k_{j}, \dots, d_{j} k_{j} \leq ε \Rightarrow d_{j} \leq \frac{ε}{k_{j}} & ∴ (d_{j} - 1) < d_{j} \leq \frac{ε}{k_{j}}
\end{array}

where $k_{j} \in \{k_{1}, k_{2}, \dots, k_{j}, \dots\}$ is the $j$th natural number in the set $N^{*}$ such that $k_{j} < ε$.

This implies that the total numbers of natural numbers of the form $n k_{1}, n k_{2}, n k_{3}, \dots, n k_{j}$ less than or equal to the given range $ε$ are $(d_{1} - 1), (d_{2} - 1), (d_{3} - 1), \dots, (d_{j} - 1)$, respectively. Summing these quantities gives the total number of natural numbers of the general form $n k$ less than or equal to $ε$. Let this total be $β$.

Now, note that the abovementioned largest integer $d_{i}$ is always greater than 1, for all $i \in \{1, 2, 3, \dots, j, \dots\}$, because each $n$ in a natural number of the form $n k$ is an element of $N^{*}$ (i.e., greater than 1). As a result, $β = (d_{1} - 1) + (d_{2} - 1) + (d_{3} - 1) + \dots + (d_{j} - 1) > 1$, as shown below.

\begin{array}{rl}
1 & < (d_{1} - 1) + (d_{2} - 1) + (d_{3} - 1) + \dots + (d_{j} - 1) = β \\
& < d_{1} + d_{2} + d_{3} + \dots + d_{j} \\
& \leq \frac{ε}{k_{1}} + \frac{ε}{k_{2}} + \frac{ε}{k_{3}} + \dots + \frac{ε}{k_{j}} \\
& = ε \left( \frac{1}{k_{1}} + \frac{1}{k_{2}} + \frac{1}{k_{3}} + \dots + \frac{1}{k_{j}} \right) \\
∴ 1 & < β < ε \left( \frac{1}{k_{1}} + \frac{1}{k_{2}} + \frac{1}{k_{3}} + \dots + \frac{1}{k_{j}} \right)
\end{array}

(2)

where $β$ represents the total number of natural numbers of the form $n k$, where $n, k \in N^{*}$, less than or equal to $ε \in N^{*}$.
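The quantity $β$ from Part 2 can be computed in the same way. The following is a minimal sketch; the function name `beta` and the sample ranges are assumptions made for illustration. Here $k$ runs over every element of $N^{*}$ below the range, and $d$ is the integer quotient of the range by $k$.

```python
def beta(eps):
    """Sum of (d_i - 1) over all k_i in N* with k_i < eps, where d_i is
    the largest integer with d_i * k_i <= eps."""
    total = 0
    for k in range(2, eps):  # k ranges over N* with k < eps
        d = eps // k  # largest integer d with d * k <= eps
        total += d - 1  # multiples n * k with 2 <= n <= d
    return total


print(beta(10))   # 8
print(beta(100))  # 283
```

For the range 10, the eight counted products are 2·2, 3·2, 4·2, 5·2, 2·3, 3·3, 2·4, and 2·5. As in Part 1, a number is counted once for each way it is written as $n k$, so 6 = 3·2 = 2·3 contributes twice.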

Part 3: In this part, we compare Equation (2) with Equation (1) to show that the total number of natural numbers in the set $N^{*} ∖ P$ (that is, $β$) up to the given range $ε$ is greater than that in the set $C$ (that is, $α$) up to the same range $ε$, provided that $ε$ is sufficiently large. That is,

\begin{array}{rl}
\frac{1}{1} & < \frac{β}{α} < \frac{ε \left( \frac{1}{k_{1}} + \frac{1}{k_{2}} + \frac{1}{k_{3}} + \dots + \frac{1}{k_{j}} \right)}{ε \left( \frac{1}{p_{1}} + \frac{1}{p_{2}} + \frac{1}{p_{3}} + \dots + \frac{1}{p_{m}} \right)} \\
1 & < \frac{β}{α} < \frac{\frac{1}{k_{1}} + \frac{1}{k_{2}} + \frac{1}{k_{3}} + \dots + \frac{1}{k_{j}}}{\frac{1}{p_{1}} + \frac{1}{p_{2}} + \frac{1}{p_{3}} + \dots + \frac{1}{p_{m}}}
\end{array}

(3)

Now, observe that since the set of all prime numbers $P$ is assumed to be finite, the sum $\frac{1}{p_{1}} + \frac{1}{p_{2}} + \frac{1}{p_{3}} + \dots + \frac{1}{p_{m}}$ in the denominator of Equation (3) can never exceed the maximum sum $\frac{1}{p_{1}} + \frac{1}{p_{2}} + \dots + \frac{1}{p_{m}} + \dots + \frac{1}{p_{k}}$, and so cannot diverge³ to infinity however large a range $ε$ is considered, due to the assumption that $p_{k} = \max\{P\}$. On the other hand, since $N^{*}$ is infinite, the sum $\frac{1}{k_{1}} + \frac{1}{k_{2}} + \frac{1}{k_{3}} + \dots + \frac{1}{k_{j}}$ in the numerator of Equation (3) can diverge⁴ to infinity if the range $ε$ is allowed to do so. Therefore, for any sufficiently large finite range $ε^{'} > ε$, the following condition is met,

\frac{1}{p_{1}} + \frac{1}{p_{2}} + \dots + \frac{1}{p_{m}} + \dots + \frac{1}{p_{k}} < \frac{1}{k_{1}} + \frac{1}{k_{2}} + \frac{1}{k_{3}} + \dots + \frac{1}{k_{j}}

The condition above holds for any sufficiently large $ε^{'}$ because the number of terms in the sums in both the numerator and the denominator of Equation (3) is determined by the given range.

This implies that, for any sufficiently large finite range $ε^{'}$,

\frac{\frac{1}{k_{1}} + \frac{1}{k_{2}} + \frac{1}{k_{3}} + \dots + \frac{1}{k_{j}}}{\frac{1}{p_{1}} + \frac{1}{p_{2}} + \frac{1}{p_{3}} + \dots + \frac{1}{p_{k}}} = L > 1

As a result, from Equation (3), we have the following.

\begin{array}{rl}
1 & < \frac{β}{α} < L \\
α & < β < α L \\
∴ α & < β
\end{array}
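The comparison of the two reciprocal sums in Part 3 can be illustrated with exact rational arithmetic. The sketch below assumes, for illustration only, that the finite set of primes is 2, 3, 5, 7; the function name `first_index_exceeding` is hypothetical. It adds the terms $\frac{1}{2}, \frac{1}{3}, \frac{1}{4}, \dots$ until the partial sum over $N^{*}$ first exceeds the fixed sum over the assumed primes, which must eventually happen because the harmonic series diverges.

```python
from fractions import Fraction


def first_index_exceeding(assumed_primes):
    """Return (k, partial, target), where target is the sum of 1/p over
    the assumed primes and k is the first element of N* at which
    1/2 + 1/3 + ... + 1/k exceeds target."""
    target = sum(Fraction(1, p) for p in assumed_primes)
    partial = Fraction(0)
    k = 1
    while partial <= target:
        k += 1  # next element of N* = {2, 3, 4, ...}
        partial += Fraction(1, k)
    return k, partial, target


k, partial, target = first_index_exceeding([2, 3, 5, 7])
print(k, partial, target)  # 5 77/60 247/210
```

With these assumed primes, the sum over $N^{*}$ already exceeds the sum over the primes once the term $\frac{1}{5}$ is included, so the ratio $L$ is greater than 1 for any range that admits $k = 5$.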

Conclusion: The analysis above suggests that for any sufficiently large range $ε^{'} \in N^{*}$, the total number of natural numbers of the form $n p \leq ε^{'}$ (i.e., $α$) is less than that of the form $n k \leq ε^{'}$ (i.e., $β$). In other words, the total number of natural numbers in $N^{*} ∖ P$ (i.e., $β$) is greater than the total number of natural numbers in $C$ (i.e., $α$) up to any sufficiently large range $ε^{'} \in N^{*}$, even though both sets contain infinitely many natural numbers when no range $ε$ is considered.

Contradiction: Based on the conclusion above, for any sufficiently large range $ε^{'}$, since $α < β$ for $ε^{'}$, $\exists δ \in N^{*} ∖ P$ such that $δ \notin C$. Since $δ \notin C$, $δ \neq n p$, $\forall p \in P$, $\forall n \in N^{*}$, where $p, n < δ$. However, if $δ \neq n p$, then, $\forall p \in P$, $\frac{δ}{p} \neq n$, where $n \in N^{*}$. This implies that $\forall p \in P$, $p ∤ δ$. Since every natural number greater than 1 has a prime divisor and $P$ is assumed to contain every prime, the only possible prime divisor of $δ$ is $δ$ itself, and therefore, by the definition of a prime number, $δ \in P$. However, this contradicts the premise that $δ \in N^{*} ∖ P$.

Should $δ$ be in $C$ for an even larger range $ε_{1}^{'} > ε^{'}$, then, since $α < β$ also holds for $ε_{1}^{'}$ (because $α < β$ for $ε^{'}$ and $ε_{1}^{'} > ε^{'}$), $\exists δ_{1} \in N^{*} ∖ P$ such that $δ_{1} \notin C$, and the rest follows by a similar contradiction.

As a result, the argument can be repeated endlessly for every newly chosen sufficiently large range $ε^{'} \in N^{*}$, proving the infinitude of the prime numbers each time by contradiction. This completes the proof. □

References

[1] Narkiewicz, W. The Development of Prime Number Theory: From Euclid to Hardy and Littlewood; Springer: Berlin/Heidelberg, Germany, 2000.
[2] Fitzpatrick, R.; Heiberg, J. Euclid’s Elements, 2007.
[3] Euclid, Johan Ludvig Heiberg, S.T.L.H. The Thirteen Books of Euclid’s Elements; The University Press: University of Minnesota, 1908.
[4] G. H. Hardy, C.P.S. In A Mathematician’s Apology; Cambridge University Press: Cambridge, UK, 1967.
[5] Martin Aigner, G.M.Z. Proofs from THE BOOK; Springer: Berlin/Heidelberg, Germany, 2018.
[6] Dickson, L.E. History of the Theory of Numbers: Divisibility and Primality; Number v. 1; Dover Publications: Washington, DC, USA, 2005.
[7] Meštrović, R. Euclid’s theorem on the infinitude of primes: a historical survey of its proofs (300 B.C.–2022) and another new proof. arXiv 2023, arXiv:1202.3670.
[8] Ribenboim, P. The Little Book of Bigger Primes, 2013.

1	An ancient Greek mathematician who lived in Alexandria, Egypt, circa 300 BCE.
2	This was done by showing that prime numbers are more than any assigned multitude of prime numbers. Note that the proof was carried out for only three prime numbers; Euclid did not consider an arbitrary finite set of prime numbers, as is commonly done nowadays [1].
3	Since the sum of the reciprocals of the primes diverges.
4	Since the sum of the reciprocals of the natural numbers diverges.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permits free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
