Preprint
Article

A proof of the infinitude of prime numbers

This version is not peer-reviewed.

Submitted:

18 June 2024

Posted:

19 June 2024

Read the latest preprint version here

Abstract
This paper presents a new proof of the infinite nature of prime numbers. Grounded in fundamental Set theory and Analysis, the proof uses the method of contradiction to demonstrate the absurdity of assuming the set of all prime numbers to be finite.
Keywords: 
;  ;  ;  ;  ;  ;  

1. Introduction

Consider a scenario with five kittens. Is it possible to group them in a way that each group has the same number of kittens1 other than one group with 5 kittens or 5 groups with one kitten?”—the author.
Among the natural numbers, there exists a prime category of numbers that are so prime that they may best be described as “prime numbers”. The defining property of such numbers is that they cannot be broken down into smaller whole parts other than 1 and themselves (as hinted in the question above). It was Euclid2, who appears to have been provided the earliest documented definition of prime numbers in his work as a collection of books called “Elements”, dating back to circa 300 BCE. In Book IX of Elements, Proposition 20, Euclid proved that there are an infinite number of such prime numbers3 [2, p.210], [3, p.412], [?]. It was quite the proof that stands for such a high consequence that it is still valued in modernity (see [4, p.92]). From that point onward, the nonexistence of the largest prime number became evident, as evidenced by Euclid’s proof. Post-Euclid, many mathematicians have supported this proposition through their own respective proofs. Several such proofs can be found in [5, ch.01], [6, ch.XVIII], [7, v.04], [8, ch.01]. In this paper, we shall also give another proof of this proposition. For that, the motivation is visible when the following is assumed,
P = { the set of all prime numbers }
C = { the set of all composite numbers }
This implies, N * = P C , where N * = { n N n 2 } . However, if | P | is assumed finite, then an interesting point may have been ignored by the masses that N * = P C S , where S is the set of all natural numbers that are neither prime nor composite, which is contradictorily impossible as every element in that set would be greater than 1. Thus, by contradiction, it is possible to prove that the assumption of a finite set containing all prime numbers is inherently false.
If the core idea of the motivation were to be explicitly stated without any formalism, it would assume that there exists a finite number of prime numbers—say only 2 , 3 , 5 and 7, then consider the set of all natural numbers greater than or equal to 2 as follows,
2 , 3 , 4 , 5 , 6 , 7 , 8 , 9 , 10 , 11 , 12 , 13 , 14 , 15 , 16 , 17 , 18 , 19 , 20 , 21 , 22 , 23 , 24 , 25 , 26 , 27 ,
Given the assumption that 2 , 3 , 5 , and 7 are only prime numbers, they are thereupon removed from the set as follows,
4 , 6 , 8 , 9 , 10 , 11 , 12 , 13 , 14 , 15 , 16 , 17 , 18 , 19 , 20 , 21 , 22 , 23 , 24 , 25 , 26 , 27 , 28 , 29 ,
Now, given that prime numbers are those numbers greater than 1 that are divisible only by 1 and themselves, it follows that every composite number is a multiple of a prime number. Therefore, every multiple of 2 , 3 , 5 , and 7 are removed from the set as follows,
11 , 13 , 17 , 19 , 23 , 29 , 31 , 37 , 41 , 43 , 47 , 53 , 59 , 61 , 67 , 71 , 73 , 79 , 83 , 89 , 97 ,
Now, this is a set of numbers that are neither prime (as the only prime numbers are assumed to be 2, 3, 5, and 7) nor composite (as they are not multiples of any assumed prime number), and all of them are greater than 1. This is a contradiction!
Having outlined the basic idea in this section, we shall give the complete proof in the next section. Following that, the paper concludes by discussing the formalized proof through a series of remarks in Section 3.

2. The Theorem and Its Proof

 Theorem 1.
There are infinitely many prime numbers.
 Proof. 
Assume—for the sake of contradiction—that the elements in the set of all prime numbers are finite, and thus define the following sets,
N * = { N N N 2 }
P = { p 1 , p 2 , p 3 , , p k }
N * P = { N N * N P }
C = { N ( N * P ) N = n p , where , p P , n N * }
Here, N * P N * and C N * P . To show that N * P C , notice that C is the set of all natural numbers of the form n p , where n N * and p { p 1 , p 2 , p 3 , , p k } ; whereas N * P includes all natural numbers of the form n k , where n , k N * . It will now be shown that for any sufficiently large range ε N * , the total amount of natural numbers of the form n k ε is greater than the total amount of natural numbers of the form n p ε . By doing so, it would be concludable that | N * P | | C | , for any sufficiently large range ε N * and thus the set N * P always contains more elements than C , up to that range ε N * . Which would then be used to form a contradiction.
Now, since C is the set of all natural numbers of the form n p , where n N * and p { p 1 , p 2 , p 3 , , p k } . Therefore, for the ith prime p i { p 1 , p 2 , p 3 , , p k } , the natural numbers of the form n p i , where n N * , are as follows,
2 p i , 3 p i , 4 p i , 5 p i
Now, the number of natural numbers of the form n p i , where n N * , less than or equal to a large range ε N * yields,
2 p i , 3 p i , 4 p i , , c i p i
where, c i p i ε or c i ε p i . Since n 2 in the expression n p i , therefore ( c i 1 ) in the expression c i p i represents a count of natural numbers of the form n p i less than or equal to the given range ε . Now, to count the number of natural numbers of the general form of n p less than or equal to the given range ε , the following is done.
for p 1 : 2 p 1 , 3 p 1 , 4 p 1 , , c 1 p 1 ε c 1 ε p 1 ( c 1 1 ) < c 1 ε p 1 for p 2 : 2 p 2 , 3 p 2 , 4 p 2 , , c 2 p 2 ε c 2 ε p 2 ( c 2 1 ) < c 2 ε p 2 for p 3 : 2 p 3 , 3 p 3 , 4 p 3 , , c 3 p 3 ε c 3 ε p 3 ( c 3 1 ) < c 3 ε p 3 for p n : 2 p n , 3 p n , 4 p n , , c n p n ε c n ε p n ( c n 1 ) < c n ε p n
Here, p n is such a prime number for which n p n ε and n p n + 1 ¬ ε , for all n N * . If ε is large enough, then n p n = n p k ε , where p k = max P . This implies that the amount of natural numbers of the form n p 1 , n p 2 , n p 3 , ⋯, n p n less than or equal to ε are ( c 1 1 ) , ( c 2 1 ) , ( c 3 1 ) , , ( c n 1 ) , respectively. Now, summing these quantities results in the total amount of natural numbers of the general form n p , less than or equal to ε . Let such a total amount be α . Now, note that c i in the expression c i p i for all i { 1 , 2 , 3 , , k } is greater than 1, as each n in the expression n p i is an element of N * (i.e., greater than 1). As a result, α = c 1 1 + c 2 1 + c 3 1 + + c n 1 > 1 , as shown below.
1 < c 1 1 + c 2 1 + c 3 1 + + c n 1 = α < c 1 + c 2 + c 3 + + c n ε p 1 + ε p 2 + ε p 3 + + ε p k = ε 1 p 1 + 1 p 2 + 1 p 3 + + 1 p n 1 < α ε 1 p 1 + 1 p 2 + 1 p 3 + + 1 p n
where, α represents the total amount of natural numbers of the form n p , where n N * and p { p 1 , p 2 , p 3 , , p k } , less than or equal to ε N * .
Now, on the other hand, since the set N * P contains all natural numbers of the form n k , where n , k N * . Therefore, for the ith natural number k i N * , the natural numbers of the form n k i , where n N * , are as follows,
2 k i , 3 k i , 4 k i , 5 k i
Now, the number of natural numbers of the form n k i less than or equal to a large range ε N * yields,
2 k i , 3 k i , 4 k i , , c i k i
where, c i k i ε or c i ε k i . Since n 2 in the expression n k i , therefore ( c i 1 ) in the expression c i k i represents a count of natural numbers of the form n k i less than or equal to the given range ε . Now, to count the amount of natural numbers of the general form n k less than or equal to the given range ε , the following is done.
for k 1 : 2 k 1 , 3 k 1 , 4 k 1 , , c 1 k 1 ε c 1 ε k 1 ( c 1 1 ) < c 1 ε k 1 for k 2 : 2 k 2 , 3 k 2 , 4 k 2 , , c 2 k 2 ε c 2 ε k 2 ( c 2 1 ) < c 2 ε k 2 for k 3 : 2 k 3 , 3 k 3 , 4 k 3 , , c 3 k 3 ε c 3 ε k 3 ( c 3 1 ) < c 3 ε k 3 for k n : 2 k n , 3 k n , 4 k n , , c n k n ε c n ε k n ( c n 1 ) < c n ε k n
Here, k n is such a natural number for which n k n ε and n k n + 1 ¬ ε , for all n N * . This implies that the amount of natural numbers of the form n k 1 , n k 2 , n k 3 , , n k n less than or equal to ε are ( c 1 1 ) , ( c 2 1 ) , ( c 3 1 ) , , ( c n 1 ) , respectively. Now, summing these quantities results in the total amount of natural numbers of the general form n k , less than or equal to ε . Let such a total quantity be β . Now, note that c i in the expression c i k i for all i { 1 , 2 , 3 , , n } is greater than 1, as each n in the expression n k is an element of N * (i.e., greater than 1). As a result, β = c 1 1 + c 2 1 + c 3 1 + + c n 1 > 1 , as shown below.
1 < c 1 1 + c 2 1 + c 3 1 + + c n 1 = β < c 1 + c 2 + c 3 + + c n ε k 1 + ε k 2 + ε k 3 + + ε k n = ε 1 k 1 + 1 k 2 + 1 k 3 + + 1 k n 1 < β ε 1 k 1 + 1 k 2 + 1 k 3 + + 1 k n
where, β represents the total amount of natural numbers of the form n k , where n , k N * , less than or equal to ε N *
Now, comparing Equation (1) and (2), we get,
1 1 < β α ε 1 k 1 + 1 k 2 + 1 k 3 + + 1 k n ε 1 p 1 + 1 p 2 + 1 p 3 + + 1 p n 1 < β α 1 k 1 + 1 k 2 + 1 k 3 + + 1 k n 1 p 1 + 1 p 2 + 1 p 3 + + 1 p n
Since, in the expression n k , k N * , therefore 1 k 1 + 1 k 2 + 1 k 3 + + 1 k n represents 1 2 + 1 3 + 1 4 + + 1 n . Which is the Harmonic number H n minus 1. Therefore, 1 k 1 + 1 k 2 + 1 k 3 + + 1 k n = k = 1 n 1 k 1 . Similarly, 1 p 1 + 1 p 2 + 1 p 3 + + 1 p n represents the sum of the reciprocals of the prime numbers. Therefore, 1 p 1 + 1 p 2 + 1 p 3 + + 1 p n = i = 1 n 1 p i . As a result, we have the following.
1 < β α k = 1 n 1 k 1 i = 1 n 1 p i
Now, since k = 1 n 1 k 1 ln n and i = 1 n 1 p i ln ln n [9] for sufficiently large values of the terms n obtained by choosing a sufficiently large range ε , and ln n > ln ln n ; therefore, k = 1 n 1 k 1 > i = 1 n 1 p i for sufficiently large values of n. As a result, k = 1 n 1 k 1 i = 1 n 1 p i = L > 1 , for sufficiently large n. This implies that
1 < β α L α < β α L α < β
This implies that for any sufficiently large range ε N * , the total amount of natural numbers of the form n p ε is always less than that of the form n k ε . In other words, the number of elements in N * P is always greater than the number of elements in C , up to any sufficiently large range ε N * . As a result, for any sufficiently large range ε , since α < β for ε , therefore δ = n k N * P such that δ C . Since δ C , therefore, δ n p , p P , n N * , where p , n < δ . However, if δ n p , then, p P , δ p n , where n N * . This implies that p P , p δ , therefore, by the definition of a prime number, δ P . However, this contradicts the premise that δ N * P .
Should δ be in C for even large range ε 1 > ε , then, since α < β also for ε 1 , therefore δ 1 = n k N * P such that δ 1 C and the rest follows a similar contradiction! Thus, the argument can be repeated endlessly with every new chosen large range ε N * , proving the infinitude of the prime numbers each time by contradiction. This completes the proof. □

3. Discussion and Remarks

This section is dedicated to gaining a comprehensive understanding of the proof of Theorem 2.1.
The proof of Theorem 2.1, makes use of `proof by contradiction’ method. Rooted in the principles of reductio ad absurdum, this method of mathematical proof involves assuming the negation of the proposition and then deriving a logical contradiction from that assumption. By showing that the assumption of the negation leads to a contradiction, one concludes that the original proposition must be true and cannot be denied otherwise (see [10, ch.06]). In this case,
Proposition:  There exists infinitely many prime numbers.
A few examples of the proof using the method of contradiction are provided below as remarks, where,
N * = { N N N 2 }
P = { the set of all prime numbers }
N * P = { N N * N P }
C = { N ( N * P ) N = n p , where , p P , n N * }
are considered.
Remark 01. (Numerical example of the proof): Assume, to the contrary, that the proposition is false and the only prime numbers are 2, 3, and 5. Then,
N * = { 2 , 3 , 4 , 5 , 6 , 7 , 8 , 9 , 10 , 11 , 12 , 13 , 14 , 15 , 16 , 17 , 18 , 19 , 20 , 21 , }
P = { 2 , 3 , 5 }
N * P = { 4 , 6 , 7 , 8 , 9 , 10 , 11 , 12 , 13 , 14 , 15 , 16 , 17 , 18 , 19 , 20 , 21 , }
C = { 4 , 6 , 8 , 9 , 10 , 12 , 14 , 15 , 16 , 18 , 20 , 21 , }
Now, since C N * P , therefore, there exists an element 7 N * P such that 7 C . Since 7 C , therefore, 7 n p , p { 2 , 3 , 5 } , n N * , where n , p < 7 . This implies that p { 2 , 3 , 5 } , p 7 , and thus, by definition, 7 P . However, this contradicts the premise that 7 N * P , and the assumption that the only prime numbers are 2, 3, and 5.
Remark 02. (Randomized example of the proof): Assume, to the contrary, that the proposition is false and the only prime numbers are 11, 13, 17, 5623, 7901, and 53982894593057. Then,
N * = { 2 , 3 , 4 , 5 , 6 , 7 , 8 , 9 , 10 , 11 , 12 , 13 , 14 , 15 , 16 , 17 , 18 , 19 , 20 , 21 , } P = { 11 , 13 , 17 , 5623 , 7901 , 53982894593057 } N * P = { 2 , 3 , 4 , 5 , 6 , 7 , 8 , 9 , 10 , 12 , 14 , 15 , 16 , 18 , 19 , 20 , 21 , } C = { 22 , 26 , 33 , 34 , 39 , 44 , 51 , 52 , 55 , 65 , 66 , 68 , 77 , 78 , 85 , }
Now, since C N * P , therefore, there exists an element 4 N * P such that 4 C . Since 4 C , therefore, 4 n p , p P , n N * , where n , p < 4 . This implies that p P , p 4 , and thus, by definition, 4 P . However, this contradicts the premise that 4 N * P , and the assumption that the only prime numbers are 11, 13, 17, 5623, 7901, and 53982894593057.
Remark 03. (Generalized example of the proof): Assume, to the contrary, that the proposition is false and the only prime numbers are 2 , 3 , 5 , 7 , , k ; where k N * . Then,
N * = { 2 , 3 , 4 , 5 , 6 , 7 , 8 , 9 , 10 , 11 , 12 , 13 , 14 , 15 , 16 , 17 , 18 , 19 , 20 , 21 , } P = { 2 , 3 , 5 , 7 , 11 , 13 , 17 , 19 , 23 , 27 , , k } N * P = { 4 , 6 , 8 , 9 , 10 , 12 , 14 , 15 , 16 , 18 , , k + 1 , k + 2 , k + 3 , } C = { N ( N * P ) N = n p , where , p P , n N * }
Now, since C N * P , therefore, there exists an element k + ε N * P such that k + ε C , where k , ε N * . Since k + ε C , therefore, k + ε n p , n N * , p P , n , p < k + ε , where k , ε N * . This implies that p P , p k + ε , and thus, by definition, k + ε P , where k , ε N * . However, this contradicts the premise that k + ε N * P , where k , ε N * , and the assumption that the only prime numbers are 2 , 3 , 5 , 7 , , k , where k N .

References

  1. Narkiewicz, W. The Development of Prime Number Theory: From Euclid to Hardy and Littlewood; Springer Monographs in Mathematics; Springer: Berlin Heidelberg, 2000. [Google Scholar]
  2. Fitzpatrick, R.; Heiberg, J. Euclid’s Elements. Richard Fitzpatrick, 2007. [Google Scholar]
  3. Heath, T.; Heiberg, J. The Thirteen Books of Euclid’s Elements; Number v. 2 in The Thirteen Books of Euclid’s Elements; The University Press: location, 1908. [Google Scholar]
  4. Hardy, G.; Snow, C. A Mathematician’s Apology; Canto Classics; Cambridge University Press: location, 2012. [Google Scholar]
  5. Aigner, M.; Hofmann, K.; Ziegler, G. Proofs from THE BOOK; Springer: Berlin Heidelberg, 2018. [Google Scholar]
  6. Dickson, L. History of the Theory of Numbers: Divisibility and Primality; Number v. 1; Dover Publications: location, 2012. [Google Scholar]
  7. Meštrović, R. Euclid’s theorem on the infinitude of primes: a historical survey of its proofs (300 B.C.–2022) and another new proof 2023. arXiv:math.HO/1202.3670].
  8. Ribenboim, P. The Little Book of Bigger Primes; Springer-Verlag: location, 1991. [Google Scholar]
  9. Gasarch, W.; Washington, L. ∑p≤n1/p=ln(lnn)+O(1): An Exposition. 2015; arXiv:math.HO/1511.01823]. [Google Scholar]
  10. Hammack, R. Book of Proof; Richard Hammack: location, 2019. [Google Scholar]
1
without sacrificing any of them to the real number system
2
an ancient Greek mathematician, who lived in Alexandria, Egypt, circa 300 BCE.
3
it was done by showing that prime numbers are more than any assigned multitude of prime numbers. Note that, the proof was specifically for only three prime numbers. Euclid did not consider any arbitrary finite set of prime numbers, as is commonly done nowadays [1, p.01].
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.

Downloads

157

Views

184

Comments

0

Subscription

Notify me about updates to this article or when a peer-reviewed version is published.

Email

Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

© 2025 MDPI (Basel, Switzerland) unless otherwise stated