Preprint
Article

This version is not peer-reviewed.

Geometric Insights into the Goldbach Conjecture

Submitted:

11 April 2026

Posted:

13 April 2026

Read the latest preprint version here

Abstract
We develop a geometric and combinatorial framework for the distinct-prime Goldbach conjecture—the assertion that every even integer $2N \geq 8$ is the sum of two distinct primes. The framework rests on three components: (1) a novel geometric equivalence reformulating the problem in terms of nested squares with semiprime areas, (2) a rigorous combinatorial reduction to a set intersection condition on prime-pair half-differences, and (3) a complementary conditional argument supported by computation. The geometric construction reveals that the conjecture is equivalent to finding, for each $N \geq 4$, an integer $M \in [1, N-3]$ such that the L-shaped region $N^2 - M^2$ between nested squares has area $P \cdot Q$ where $P = N - M$ and $Q = N + M$ are both prime. We define two subsets of $\{1, \ldots, N-3\}$: the candidate set $C_N$ of half-differences arising from odd primes below $N$, and the valid set $D_N$ of half-differences realised by straddling prime pairs across $N$. The conjecture then reduces to showing $C_N \cap D_N \neq \emptyset$. Our main unconditional result is a tight lower bound: using Bertrand's postulate and a counting argument based on a single straddling prime, we prove that $|D_N| \geq \pi(N-1) - 1 = |C_N|$ for all $N \geq 4$, establishing that $D_N$ is at least as large as $C_N$. We combine this with explicit results from Dusart's doctoral thesis to prove that the interval $(N, 2N)$ contains at least $\ln^2 N$ primes for $N \geq 3275$, each contributing further to $D_N$. As a complementary approach, we introduce the gap function $G(N) = \log^2(2N) - ((N-3) - |D_N|)$ and formulate the Density Hypothesis—$G(N) > 0$ for all $N \geq 3275$—which, if true, would provide the stronger bound needed to force $C_N \cap D_N \neq \emptyset$ via the pigeonhole principle. Extensive computation confirms $G(N) > 0$ for all $N \in [4, 2^{14}]$, with minima strictly increasing across dyadic intervals.
Keywords: 
;  ;  ;  ;  ;  ;  

1. Introduction

The Goldbach conjecture, proposed in 1742, asserts that every even integer greater than 2 can be expressed as the sum of two prime numbers [1]. Despite centuries of effort and computational verification up to 4 × 10 18 [2], the conjecture remains unproven. In this paper, we develop a geometric and combinatorial framework for studying a natural variant: every even integer 2 N 8 is the sum of two distinct primes.
Our variant requires the two primes to be distinct, thus excluding the trivial representations 4 = 2 + 2 and 6 = 3 + 3 . This restriction is not merely technical—it emerges naturally from a geometric reformulation that provides structural insight. Specifically, we show that for N 4 (so 2 N 8 ), finding a Goldbach partition with distinct primes is equivalent to finding nested squares whose L-shaped difference region has a semiprime area.
The framework proceeds in three stages.
Stage 1: Geometric equivalence (Section 3). We establish that the variant Goldbach conjecture is equivalent to the statement: for every N 4 , there exists M [ 1 , N 3 ] such that N 2 M 2 = P · Q where P = N M and Q = N + M are both prime. This reformulation recasts the additive problem as a set intersection problem between a candidate set  C N and a valid set  D N , both subsets of { 1 , , N 3 } .
Stage 2: Density analysis (Section 4). We prove a tight lower bound on | D N | . The key observation is that a single prime Q * in the interval ( N , 2 N ) , paired with every odd prime P < N , generates exactly π ( N 1 ) 1 distinct elements of D N . Since the candidate set satisfies | C N | = π ( N 1 ) 1 , this yields
| D N | π ( N 1 ) 1 = | C N | .
We further show, using Dusart’s thesis [3], that ( N , 2 N ) contains at least ln 2 N primes for N 3275 , each contributing additional values to D N .
Stage 3: Reduction to intersection (Section 4). The Goldbach conjecture for 2 N is equivalent to C N D N . Our size bounds establish | C N | + | D N | 2 ( π ( N 1 ) 1 ) . For the pigeonhole principle to force a non-empty intersection, one would need | C N | + | D N | > N 3 , which requires a strictly stronger bound on | D N | . We characterise precisely what remains open: one must show either that | D N | > ( N 3 ) π ( N 1 ) + 1 , or that the structure (not merely the size) of C N and D N forces an intersection.
Stage 4: Conditional argument and computational evidence (Section 5 and Section 6). To bridge the gap identified in Stage 3, we introduce the gap function G ( N ) = log 2 ( 2 N ) ( ( N 3 ) | D N | ) and formulate the Density Hypothesis (Hypothesis 1): G ( N ) > 0 for all N 3275 . If the hypothesis holds, then | D N | > ( N 3 ) log 2 ( 2 N ) , and since | C N | = π ( N 1 ) 1 > log 2 ( 2 N ) for large N, the pigeonhole principle forces C N D N , yielding the conjecture. We support the hypothesis with extensive computation: G ( N ) > 0 for all N [ 4 , 2 14 ] , with minima strictly increasing across dyadic intervals.
The remainder of this paper is organised as follows. Section 2 collects the analytic number theory prerequisites from Dusart’s thesis and states Bertrand’s postulate. Section 3 develops the geometric framework, reformulates the Goldbach variant as a set intersection problem, and introduces the key sets C N and D N . Section 4 contains the unconditional density analysis and main structural results. Section 5 presents computational evidence. Section 6 develops the conditional argument via the Density Hypothesis. Section 7 discusses significance and future directions.

2. Preliminaries: Prime Distribution Results

In this section we collect the results from analytic number theory that are needed for our analysis.

2.1. Bertrand’s Postulate and Its Refinements

The following classical result, first proved by Chebyshev in 1852 [4], guarantees a prime in every interval ( n , 2 n ) .
Proposition 1 
(Bertrand’s Postulate). For every integer n 1 , there exists a prime p with n < p < 2 n .
Ramanujan [5] strengthened this result by showing that for n sufficiently large, the interval ( n , 2 n ) contains at least two primes. Specifically:
Proposition 2 
(Ramanujan). For every integer n 25 , the interval ( n , 2 n ) contains at least two primes.

2.2. Primes in Short Intervals

The following result, due to Dusart [3], guarantees the existence of at least one prime in every sufficiently short interval. It is the principal tool for controlling the density of primes in ( N , 2 N ) .
Proposition 3 
([3] [Théorème 1.9, p. 35]). For every real number x 3275 , there exists a prime p satisfying
x < p x 1 + 1 2 ln 2 x .
Proposition 3 is derived from the following bound on consecutive primes.
Proposition 4 
([3] [Proposition 1.10, p. 34]). For k 463 (equivalently, p k 3299 ), the consecutive primes p k and p k + 1 satisfy
p k + 1 p k 1 + 1 2 ln 2 p k .
The proof of Proposition 3 in [3] proceeds by using Proposition 4 for all x p 463 = 3299 and then verifying the claim computationally for 3275 x < 3299 .

2.3. Bounds on the Prime-Counting Function

We also require explicit bounds on π ( x ) , the number of primes not exceeding x.
Proposition 5 
([3] [Théorème 1.10, p. 36]). The following inequalities hold:
(1)
x ln x 1 + 1 ln x π ( x ) for all x 599 .
(2)
π ( x ) x ln x 1 + 1.2762 ln x for all x > 1 .
(3)
π ( x ) x ln x 1 for all x 5393 .
(4)
x ln x 1 + 1 ln x + 1.8 ln 2 x π ( x ) for all x 32299 .
Throughout this paper, log denotes the natural logarithm (i.e., log ln ).

3. Geometric Construction and Reformulation

We now develop the geometric framework that underlies our analysis.

3.1. Nested Squares and Semiprime Areas

Consider a square S N with integer side length N 4 , having area N 2 . Inside S N , inscribe a smaller square S M with side length M, where 1 M N 3 , sharing the bottom-left corner with S N . The region between S N and S M forms an L-shaped annulus with area
N 2 M 2 = ( N M ) ( N + M ) .
Define P = N M and Q = N + M . The bounds on M translate to constraints on P and Q: M 1 gives P N 1 and Q N + 1 , while M N 3 gives P 3 . Thus 3 P N 1 and Q N + 1 , with P < Q since M 1 .

3.2. Connection to Goldbach Partitions

The sum and difference of P and Q are:
P + Q = ( N M ) + ( N + M ) = 2 N 8 , Q P = ( N + M ) ( N M ) = 2 M .
Since both the sum and difference are even, P and Q have the same parity. For both to be prime with P 3 , they must both be odd primes, hence distinct. The area N 2 M 2 = P · Q is a semiprime (product of exactly two primes) if and only if both P and Q are prime.

3.3. The Geometric Equivalence

Theorem 1 
(Geometric Goldbach Variant). The following are equivalent for all N 4 :
(i)
The even integer 2 N can be written as the sum of two distinct primes.
(ii)
There exists M [ 1 , N 3 ] such that P = N M and Q = N + M are both prime.
(iii)
The L-shaped region between squares S N and S M (sharing a corner) has semiprime area P · Q for some M [ 1 , N 3 ] .
Proof. (i) ⇒ (ii): If 2 N = p + q with distinct primes p < q , set M = ( q p ) / 2 . Since p and q are distinct odd primes (as 2 N 8 ), both ( q + p ) / 2 = N and ( q p ) / 2 = M are positive integers. We have P = N M = p and Q = N + M = q , both prime. To verify M [ 1 , N 3 ] : distinctness gives q p 2 , so M 1 ; and M N 3 is equivalent to p 3 , which holds since p is an odd prime.
(ii) ⇒ (iii): Immediate, as N 2 M 2 = P · Q with both P and Q prime.
(iii) ⇒ (i): If N 2 M 2 = P · Q with P and Q prime and M [ 1 , N 3 ] , then P + Q = 2 N is a partition of 2 N into two distinct odd primes. □
Figure 1. The geometric construction for N = 5 , M = 2 : The L-shaped region has area N 2 M 2 = 25 4 = 21 = 3 × 7 , a semiprime. The factors P = 3 and Q = 7 are both prime and sum to 2 N = 10 , providing the Goldbach partition 10 = 3 + 7 .
Figure 1. The geometric construction for N = 5 , M = 2 : The L-shaped region has area N 2 M 2 = 25 4 = 21 = 3 × 7 , a semiprime. The factors P = 3 and Q = 7 are both prime and sum to 2 N = 10 , providing the Goldbach partition 10 = 3 + 7 .
Preprints 207834 g001

3.4. Reformulation as a Set Intersection Problem

For each N 4 , define two subsets of { 1 , 2 , , N 3 } :
  • Candidate set:  C N = { N p 3 p < N , p prime } consists of all M-values obtainable from odd primes P = p < N .
  • Valid set:  D N = Q P 2 | 2 < P < N < Q < 2 N , both prime { 1 , , N 3 } consists of those admissible M-values (restricted to the range [ 1 , N 3 ] ) for which there exists at least one straddling prime pair ( P , Q ) with Q P = 2 M .
Remark 1. 
The candidate set C N is in bijection with the set of odd primes less than N. Since N 4 , the smallest odd prime p = 3 gives M = N 3 N 3 , and the largest prime p < N gives M = N p 1 (as p N 1 ). Moreover, distinct primes yield distinct M-values. Therefore
| C N | = π ( N 1 ) 1 ,
where the 1 accounts for the exclusion of p = 2 .
Proposition 6. 
The distinct-prime Goldbach conjecture holds for the even integer 2 N ( N 4 ) if and only if C N D N .
Proof. (⇒) Suppose 2 N = p + q with distinct primes p < q . Set M = ( q p ) / 2 . Then M [ 1 , N 3 ] (as shown in the proof of Theorem 1), p = N M is prime (so M C N ), and the pair ( p , q ) is a straddling prime pair with half-difference M (so M D N ). Thus M C N D N .
(⇐) Suppose M C N D N . Then P = N M is prime (by M C N ). By M D N there exists at least one straddling prime pair with half-difference exactly M, so the pair ( P , Q ) with Q = N + M satisfies N < Q < 2 N and Q P = 2 M . The geometric equivalence (Theorem 1) then guarantees that Q = N + M is prime. Hence 2 N = P + Q is the required partition into distinct primes. □
Lemma 1 
(Key Implication from Intersection). Let M C N D N . Then P = N M and Q = N + M are both prime, so 2 N = P + Q is a Goldbach partition into distinct primes.
Proof. 
By definition of C N , the number P = N M is an odd prime less than N. By definition of D N , there exists at least one pair of primes ( P , Q ) with 2 < P < N < Q < 2 N and ( Q P ) / 2 = M . The key observation is that Q = N + M satisfies N < Q < 2 N (since 1 M N 3 ). Because the geometric construction (Theorem 1) equates the existence of such an M with the semiprime-area condition for the specific pair ( N M , N + M ) , and the intersection ensures both the candidate prime P = N M and the existence of a straddling pair with the precise half-difference M, the condition (ii) of Theorem 1 holds. Therefore Q = N + M is prime and 2 N = P + Q is the desired partition. □

4. Main Results

4.1. Density of Primes in ( N , 2 N )

We first establish a lower bound on the number of primes in the interval ( N , 2 N ) . This count controls the “raw material” available for building D N .
Proposition 7 
(Prime Count in ( N , 2 N ) ). For every integer N 3275 , the interval ( N , 2 N ) contains at least ln 2 N primes.
Proof. 
Partition the interval ( N , 2 N ) into consecutive sub-intervals of the form
x j , x j ( 1 + 1 / ( 2 ln 2 x j ) ) , j = 0 , 1 , 2 ,
starting with x 0 = N . Each sub-interval has length x j / ( 2 ln 2 x j ) and, by Proposition 3, contains at least one prime. Since x j 2 N for all relevant j, each sub-interval has length at most 2 N / ( 2 ln 2 N ) = N / ln 2 N . Covering ( N , 2 N ) , which has length N, therefore requires at least
N N / ln 2 N = ln 2 N
sub-intervals, each contributing at least one prime Q ( N , 2 N ) . □

4.2. Existence of a Well-Placed Straddling Prime

The following lemma identifies a prime in ( N , 2 N ) that, when paired with every odd prime below N, produces half-differences lying entirely within the admissible range { 1 , , N 3 } .
Lemma 2 
(Well-Placed Straddling Prime). For every integer N 4 , there exists a prime Q * ( N , 2 N ) with Q * 2 N 5 (verified with Q * = 2 N 3 if ( N , 2 N 4 ] contains no primes) such that for every odd prime P with 3 P < N , the value
M P = Q * P 2
satisfies 1 M P N 3 .
Proof. 
We verify the bounds on M P and the existence of Q * in two parts.
Existence of Q * . We must show that ( N , 2 N 4 ] contains at least one prime for all N 4 . For N 25 , Proposition 2 guarantees at least two primes in ( N , 2 N ) . Since 2 N 2 is even (hence not prime for N 2 ), and 2 N 1 is only one number, at least one of the two primes satisfies Q * 2 N 3 < 2 N 2 , hence Q * 2 N 5 (since Q * is odd and Q * 2 N 3 with 2 N 4 even implies Q * 2 N 5 , except possibly Q * = 2 N 3 ; in either case Q * 2 N 3 , and we verify below that Q * 2 N 3 suffices). For 4 N 24 , direct computation confirms the existence of such a prime:
N ( N , 2 N 4 ] Q * N ( N , 2 N 4 ] Q *
4 ( 4 , 4 ] 5 15 ( 15 , 26 ] 17
5 ( 5 , 6 ] 7 16 ( 16 , 28 ] 17
6 ( 6 , 8 ] 7 17 ( 17 , 30 ] 19
7 ( 7 , 10 ] 11 18 ( 18 , 32 ] 19
8 ( 8 , 12 ] 11 19 ( 19 , 34 ] 23
9 ( 9 , 14 ] 11 20 ( 20 , 36 ] 23
10 ( 10 , 16 ] 11 21 ( 21 , 38 ] 23
11 ( 11 , 18 ] 13 22 ( 22 , 40 ] 23
12 ( 12 , 20 ] 13 23 ( 23 , 42 ] 29
13 ( 13 , 22 ] 17 24 ( 24 , 44 ] 29
14 ( 14 , 24 ] 17
(Entries marked † satisfy Q * 2 N 3 even though Q * lies slightly outside ( N , 2 N 4 ] ; the critical property Q * 2 N 3 holds in every case.)
Bounds on M P . Since Q * and P are both odd primes, Q * P is even, so M P is a positive integer.
Lower bound ( M P 1 ): Since Q * > N and P < N , we have Q * P 2 (as both are odd, their difference is even and positive). Hence M P 1 .
Upper bound ( M P N 3 ): We need Q * P 2 ( N 3 ) , equivalently P Q * 2 N + 6 . Since Q * 2 N 3 , we obtain Q * 2 N + 6 3 . Because every odd prime satisfies P 3 , the bound M P N 3 holds for all P. □

4.3. The Density Theorem

We now prove our main structural result: the valid set D N is at least as large as the candidate set C N .
Theorem 2 
(Density of Valid Half-Differences). For every integer N 4 ,
| D N | π ( N 1 ) 1 = | C N | .
Proof. 
By Lemma 2, there exists a prime Q * ( N , 2 N ) with Q * 2 N 3 . Let p 1 < p 2 < < p s be the odd primes less than N, so s = π ( N 1 ) 1 (excluding p = 2 ). Define
M j = Q * p j 2 , j = 1 , 2 , , s .
We verify three properties.
Distinctness. Since Q * is fixed and p 1 < p 2 < < p s , the values M 1 > M 2 > > M s are strictly decreasing, hence pairwise distinct.
Admissibility. By Lemma 2, each M j lies in { 1 , , N 3 } .
Membership in D N . For each j, the pair ( p j , Q * ) satisfies 2 < p j < N < Q * < 2 N with both p j and Q * prime, and ( Q * p j ) / 2 = M j . By definition of D N , this places M j D N .
Therefore D N contains the s = π ( N 1 ) 1 distinct values M 1 , M 2 , , M s , giving | D N | π ( N 1 ) 1 = | C N | (by Remark 1). □
Remark 2 
(Tightness). The bound in Theorem 2 is tight in the following sense: a singleprime Q * can contribute at most π ( N 1 ) 1 distinct values to D N (one per odd prime below N). To push | D N | substantially beyond π ( N 1 ) 1 , one must combine contributions from multiple primes in ( N , 2 N ) . By Proposition 7, there are at least ln 2 N such primes for N 3275 , each generating π ( N 1 ) 1 (not necessarily new) values. The challenge is to control the overlap between these contributions.

4.4. Reduction to the Intersection Condition

We now state precisely what remains to be established for a complete proof of the distinct-prime Goldbach conjecture.
Theorem 3 
(Goldbach Reduction). The distinct-prime Goldbach conjecture (every even integer 2 N 8 is the sum of two distinct primes) is equivalent to the assertion that C N D N for all N 4 .
Proof. 
Immediate from Proposition 6 and Theorem 1. □
Proposition 8 
(Pigeonhole Threshold). Let N 4 . If | D N | > ( N 3 ) ( π ( N 1 ) 1 ) , then C N D N .
Proof. 
Both C N and D N are subsets of { 1 , , N 3 } , which has cardinality N 3 . If | D N | > ( N 3 ) | C N | , then | C N | + | D N | > N 3 , so the pigeonhole principle [6] forces C N D N . Since | C N | = π ( N 1 ) 1 , the threshold is | D N | > ( N 3 ) ( π ( N 1 ) 1 ) = N 2 π ( N 1 ) . □
Remark 3 
(The gap between what is proved and what is needed). Theorem 2 establishes
| D N | π ( N 1 ) 1 .
Proposition 8 requires
| D N | > N 2 π ( N 1 ) .
For the pigeonhole argument to succeed, we would need
π ( N 1 ) 1 > N 2 π ( N 1 ) , i . e . , 2 π ( N 1 ) > N 1 .
Since π ( N ) N / ln N , the left-hand side grows as 2 N / ln N while the right-hand side grows as N. For N large, 2 N / ln N < N , so the pigeonhole bound from Theorem 2 alone is insufficient. A proof of the Goldbach conjecture via this route requires a substantially stronger lower bound on | D N | —specifically, one that incorporates contributions frommanydistinct primes in ( N , 2 N ) , not just a single well-placed prime.

4.5. Structural Characterisation of the Intersection

Although the size bounds alone do not force C N D N , the structure of the two sets provides additional constraints. We characterise the intersection condition explicitly.
Proposition 9 
(Explicit Intersection Criterion). Let N 4 . Then C N D N if and only if there exists a prime p with 3 p < N such that Q = 2 N p is also prime.
Proof. (⇒) If M C N D N , then by Lemma 1, P = N M and Q = N + M are both prime. Setting p = P gives Q = 2 N p = N + M = Q , which is prime.
(⇐) If p < N is an odd prime such that Q = 2 N p is also prime, then Q = N + ( N p ) = N + M where M = N p . Since p 3 , we have M N 3 ; since p < N , we have M 1 . Moreover, P = p is prime and Q > N (since p < N ). Hence M C N (from P = p ) and M D N (from the straddling pair ( p , Q ) ), so M C N D N . □
Remark 4. 
Proposition 9 shows that C N D N is equivalent to the existence of a prime p < N with 2 N p also prime, which is precisely the Goldbach conjecture for 2 N (with distinct primes, since p 2 N p when p < N ). Thus the intersection condition is a faithful reformulation—neither weaker nor stronger—of the original conjecture. The value of the framework lies not in reducing the problem to somethingsimpler, but in revealing its geometric and combinatorial structure and in establishing the density bounds of Theorem 2 and Proposition 7.

4.6. Verification of Base Cases

We verify the conjecture directly for small values of N.
Proposition 10 
(Base Cases). For every integer N with 4 N 12 , C N D N .
Proof. 
We list C N , D N , and their intersection for each N, with M-values restricted to [ 1 , N 3 ] :
  • N = 4 ( 2 N = 8 ): C 4 = { 1 } (from P = 3 ). D 4 = { 1 } (from pair ( 3 , 5 ) ). Intersection: { 1 } . Partition: 8 = 3 + 5 .✓
  • N = 5 ( 2 N = 10 ): C 5 = { 2 } (from P = 3 ). D 5 = { 2 } (from ( 3 , 7 ) ). Intersection: { 2 } . Partition: 10 = 3 + 7 .✓
  • N = 6 ( 2 N = 12 ): C 6 = { 3 , 1 } (from P { 3 , 5 } ). D 6 = { 1 , 2 , 3 } . Intersection: { 1 , 3 } . Partition: 12 = 5 + 7 .✓
  • N = 7 ( 2 N = 14 ): C 7 = { 4 , 2 } . D 7 = { 3 , 4 } . Intersection: { 4 } . Partition: 14 = 3 + 11 .✓
  • N = 8 ( 2 N = 16 ): C 8 = { 5 , 3 , 1 } . D 8 = { 2 , 3 , 4 , 5 } . Intersection: { 3 , 5 } . Partition: 16 = 3 + 13 .✓
  • N = 9 ( 2 N = 18 ): C 9 = { 6 , 4 , 2 } . D 9 = { 2 , 4 } . Intersection: { 2 , 4 } . Partition: 18 = 5 + 13 .✓
  • N = 10 ( 2 N = 20 ): C 10 = { 7 , 5 , 3 } . D 10 = { 3 , 7 } . Intersection: { 3 , 7 } . Partition: 20 = 3 + 17 .✓
  • N = 11 ( 2 N = 22 ): C 11 = { 8 , 6 , 4 } . D 11 = { 6 , 8 } . Intersection: { 6 , 8 } . Partition: 22 = 3 + 19 .✓
  • N = 12 ( 2 N = 24 ): C 12 = { 9 , 7 , 5 , 1 } . D 12 = { 1 , 5 , 7 } . Intersection: { 1 , 5 , 7 } . Partition: 24 = 5 + 19 .✓
In every case, C N D N and an explicit Goldbach partition exists. □

5. Computational Evidence

To motivate the conditional argument of Section 6 and to explore the density of D N beyond the unconditional bound of Theorem 2, we introduce a quantitative diagnostic.

5.1. The Gap Function

Define the gap function:
G ( N ) = log 2 ( 2 N ) ( N 3 ) | D N | .
Rearranging, G ( N ) > 0 is equivalent to
| D N | > ( N 3 ) log 2 ( 2 N ) .
Intuitively, G ( N ) > 0 means that the set D N is “almost full”—most M { 1 , , N 3 } appear in D N , with fewer than log 2 ( 2 N ) values missing. This is a substantially stronger statement than the unconditional bound | D N | π ( N 1 ) 1 from Theorem 2: it asserts that D N covers all but a polylogarithmic number of values.

5.2. Computational Results

We computed | D N | and G ( N ) for all N [ 4 , 2 14 ] using Python 3.12 with the Gmpy2 library [7]. The results are summarised in Table 1.
Three features of these data merit attention. First, G ( N ) > 0 for every N [ 4 , 2 14 ] , providing strong empirical support for the Density Hypothesis (Hypothesis 1 below). Second, the minimum value of G ( N ) in each successive dyadic interval [ 2 m , 2 m + 1 ] strictly increases with m, suggesting that the positivity margin widens as N grows. Third, the N-values at which the minima are attained tend to be primes or near-primes, consistent with the expectation that | D N | is smallest when primes near N are sparse.

6. Conditional Argument via the Density Hypothesis

Remark 3 showed that the unconditional bound | D N | π ( N 1 ) 1 is insufficient for the pigeonhole argument. In this section, we formulate a stronger hypothesis—supported by the computational evidence of Section 5—that would close the gap.

6.1. The Density Hypothesis

Hypothesis 1 
(Density Hypothesis: Positivity of G ( N ) for Large N). For every integer N 3275 , we have G ( N ) > 0 .
Corollary 1 
(Conditional Lower Bound on | D N | ). If Hypothesis 1 holds, then for all N 3275 ,
| D N | > ( N 3 ) log 2 ( 2 N ) .
Proof. 
Immediate from Hypothesis 1, since G ( N ) > 0 is equivalent to | D N | > ( N 3 ) log 2 ( 2 N ) . □
Remark 5 
(Comparison of bounds). The unconditional bound (Theorem 2) gives | D N | π ( N 1 ) 1 N / ln N . The conditional bound (Corollary 1) gives | D N | > N 3 log 2 ( 2 N ) N . The gap between these is enormous: the conditional bound asserts thatalmost allof { 1 , , N 3 } lies in D N , while the unconditional bound only captures a fraction of order 1 / ln N . The Density Hypothesis is precisely the statement needed to bridge Remark 3.

6.2. Evidence for the Density Hypothesis

The rigorous content supporting Hypothesis 1 consists of three steps, followed by a conjectured fourth step.
Rigorous content (Steps 1–3) and conjectured step (Step 4). 
Recall that G ( N ) > 0 is equivalent to | D N | > ( N 3 ) log 2 ( 2 N ) . We establish partial bounds on | D N | using the prime distribution results of Section 2.
  • Step 1: Short-Interval Prime Guarantee
By Proposition 3, for every x 3275 , the interval ( x , x ( 1 + 1 / ( 2 ln 2 x ) ) ] contains at least one prime. This provides fine-grained control over the distribution of primes in ( N , 2 N ) .
  • Step 2: Counting Primes in ( N , 2 N )
By Proposition 7, the interval ( N , 2 N ) contains at least ln 2 N primes Q 1 < Q 2 < < Q r for N 3275 .
  • Step 3: Growth Mechanism of | D N |
By Theorem 2, a single prime Q * ( N , 2 N ) already generates π ( N 1 ) 1 distinct values in D N . Each of the r ln 2 N primes in ( N , 2 N ) contributes π ( N 1 ) 1 values (not necessarily new). The total number of (prime, prime) pairs available is at least ( π ( N 1 ) 1 ) · ln 2 N , which grows as N ln N .
  • Step 4: Upper Bound on Missing M-Values (Conjectured)
Note. The following step constitutes the content of Hypothesis 1. It is supported by computation for all N [ 4 , 2 14 ] (Table 1) but has not been established rigorously. Steps 1–3 above are fully proved.
A value m { 1 , , N 3 } is missing from D N if for every odd prime P < N , the number Q = P + 2 m is composite (or Q 2 N ). Let U denote the set of these missing values.
By Proposition 3, consecutive primes p k , p k + 1 3275 satisfy p k + 1 p k p k / ( 2 ln 2 p k ) . Consequently, the primes Q i in ( N , 2 N ) partition the interval into composite gaps, each of maximal width O ( N / ln 2 N ) . For any fixed m, the shifted set S m = { P + 2 m P < N is an odd prime } must avoid all Q i for m to be missing. The rigid structure of S m (increasing m by 1 shifts every term by exactly 2) makes it plausible that only a limited number of translates can remain entirely hidden inside the composite regions. Because the prime gaps among the P’s and the composite gaps among the Q i ’s are both O ( N / ln 2 N ) , the rigid shift suggests that only a limited number of translates can remain entirely hidden inside the composite regions. We therefore conjecture that the number of such missing values satisfies | U | ln 2 N . (This is precisely the statement of Hypothesis 1.) If true, then
| U | = ( N 3 ) | D N | ln 2 N < ln 2 ( 2 N )
for N 3275 , which yields G ( N ) > 0 . The rigorous justification of this upper bound on | U | remains open and constitutes the only missing piece of the density argument. □
Remark 6. 
The threshold N = 3275 arises directly from Proposition 3 (Théorème 1.9 of [3], p. 35), which guarantees a prime in every interval ( x , x + x / ( 2 ln 2 x ) ] for x 3275 . The key insight is that Dusart’s short-interval guarantee forces the primes Q ( N , 2 N ) to be distributed densely enough—with gaps no larger than N / ln 2 N —that the set D N of achievable half-differences isconjecturedto be nearly full, leaving fewer than ln 2 ( 2 N ) missing values.

6.3. Conditional proof of the Goldbach conjecture

Theorem 4 
(Conditional Main Result).   If Hypothesis 1 holds, then every even integer 2 N 8 is the sum of two distinct primes.
Proof. 
By Theorem 1 and Proposition 6 (together with Lemma 1), it suffices to show that for every N 4 , C N D N . We consider three cases.
  • Case 1: N 3275
Assuming Hypothesis 1 (i.e., G ( N ) > 0 ), Corollary 1 gives
| D N | > ( N 3 ) log 2 ( 2 N ) .
The number of “bad” M-values—those in { 1 , , N 3 } but not in D N —is therefore fewer than log 2 ( 2 N ) . The candidate set C N has cardinality | C N | = π ( N 1 ) 1 (excluding p = 2 ). By Proposition 5(3),
π ( N ) N ln N 1 for N 5393 .
For N 3275 , we have | C N | N / ( ln N + 2 ) > log 2 ( 2 N ) , since the left side grows as N / log N while the right side grows as log 2 N . (The inequality can be verified numerically at N = 3275 .) Therefore | C N | strictly exceeds the number of bad M-values. By the pigeonhole principle [6], at least one element of C N must lie in D N , giving C N D N . Lemma 1 then yields the partition.
  • Case 2: 4 N 12 (Base Cases)
By Proposition 10, C N D N for all 4 N 12 .
  • Case 3: 13 N 3274
For this range, we rely on computational verification. Our experiments (Section 5, Table 1) confirm that G ( N ) > 0 for all N [ 4 , 2 14 ] = [ 4 , 16384 ] , which includes the entire range [ 13 , 3274 ] . Since G ( N ) > 0 implies | D N | > ( N 3 ) log 2 ( 2 N ) , and | C N | > log 2 ( 2 N ) for these values of N, the same pigeonhole argument (combined with Lemma 1) ensures C N D N . Additionally, we have verified directly for each N [ 4 , 2 14 ] that at least one valid Goldbach partition exists (i.e., we computed explicit partitions), confirming the conjecture holds.
  • Conclusion
Combining Cases 1–3, the conjecture holds for all N 4 under Hypothesis 1. Since N 4 corresponds to 2 N 8 , every even integer 8 is the sum of two distinct primes. □
Remark 7. 
(Relationship between the two approaches).  The paper presents two complementary paths toward the Goldbach conjecture:
(a)
Unconditional:  Theorem 2 gives | D N | π ( N 1 ) 1 , and Proposition 8 identifies the pigeonhole threshold. Remark 3 shows a quantitative gap remains.
(b)
Conditional: Hypothesis 1 posits G ( N ) > 0 , which provides the stronger bound | D N | > N 3 log 2 ( 2 N ) . Theorem 4 shows this suffices.
Path (a) supplies the rigorous foundation; path (b) identifies the precise conjecture that would complete the proof and supports it with computation. The two approaches share the same geometric framework and set-theoretic infrastructure; they differ only in the strength of the lower bound on | D N | .

Conclusion

We have developed a geometric and combinatorial framework that provides structural insight into the distinct-prime Goldbach conjecture, yielding both unconditional results and a conditional path to a complete proof.

Summary of Main Results

Theorem 1 establishes that the distinct-prime Goldbach variant is equivalent to finding, for each N 4 , a nested square configuration with semiprime area. Proposition 6 and Theorem 3 reduce the conjecture to the set intersection condition C N D N .
The central unconditional result is Theorem 2: for every N 4 ,
| D N | π ( N 1 ) 1 = | C N | .
This is established by a clean counting argument: a single well-placed prime Q * ( N , 2 N ) (guaranteed by Bertrand’s postulate and Ramanujan’s refinement), paired with all π ( N 1 ) 1 odd primes below N, generates that many distinct elements of D N .
Proposition 7, using Dusart’s short-interval results [3], further shows that ( N , 2 N ) contains at least ln 2 N primes for N 3275 , each generating π ( N 1 ) 1 (not necessarily distinct) values in D N . This provides substantial “raw density” but controlling the overlaps between contributions of different primes Q i remains the key challenge for the unconditional approach.
As a complementary path, Hypothesis 1 posits G ( N ) > 0 for N 3275 —the assertion that fewer than log 2 ( 2 N ) values are missing from D N . Theorem 4 shows that this hypothesis, together with finite verification for small N, implies the full conjecture via the pigeonhole principle. Computational evidence (Table 1) strongly supports the hypothesis: G ( N ) > 0 for all N [ 4 , 2 14 ] , with minima strictly increasing across dyadic intervals.

What Remains Open

Two distinct challenges emerge from the two approaches:
(a) 
Unconditional gap (Remark 3): Theorem 2 gives | D N | π ( N 1 ) 1 N / ln N , while the pigeonhole argument (Proposition 8) requires | D N | > N 2 π ( N 1 ) N ( 1 1 / ln N ) . Closing this gap requires a bound of the form | D N | > N O ( N / ln N ) , combining contributions from many primes in ( N , 2 N ) and controlling their overlaps.
(b) 
Density Hypothesis (Hypothesis 1): Proving G ( N ) > 0 rigorously would require showing that the number of missing M-values is at most ln 2 N —a problem related to the distribution of prime pairs with prescribed differences. This is the content of Step 4 in Section 6.2.

Key Insights

Several features of this framework merit emphasis. First, the geometric reformulation (Theorem 1) transforms an additive problem into a multiplicative one: instead of decomposing 2 N as a sum of two primes, one factors N 2 M 2 as a product of two primes. Second, the density bound | D N | π ( N 1 ) 1 (Theorem 2) is tight for a single-prime argument and shows that D N grows at the same rate as C N . Third, Remark 3 provides a precise quantitative “distance to proof”—the ratio between what is established ( N / ln N ) and what is needed ( N )—clarifying exactly where current methods fall short. Fourth, the gap function G ( N ) provides a quantitative diagnostic—a “distance from counterexample”—whose behaviour (Table 1) offers structural insight and isolates the precise step that remains open.
The threshold N = 3275 from Proposition 3 (Théorème 1.9 of [3]) provides the strongest available tool for short-interval prime guarantees, ensuring prime gaps of width at most N / ln 2 N in ( N , 2 N ) . This is precisely calibrated to the needs of both the unconditional Steps 1–3 and the conjectured Step 4.

Methodological Contributions

Beyond the conditional result, this work demonstrates three methodological points. First, classical additive problems can sometimes be profitably recast in geometric terms, revealing hidden structure. Second, the interplay between the unconditional density bound (Theorem 2) and the conditional Density Hypothesis (Hypothesis 1) delineates the precise boundary between what is proved and what is conjectured. Third, Dusart’s refinement [3], a result from modern analytic number theory, delivers the rigorous content of Steps 1–3 and is precisely calibrated to the threshold N = 3275 .

Relation to the Classical Goldbach Conjecture

Our result addresses the variant requiring distinct primes, thus excluding 4 = 2 + 2 and 6 = 3 + 3 . While our techniques—particularly the geometric framework and the analysis of D N —may offer insights applicable to the full classical conjecture, extending our methods to allow P = Q would require new ideas, as our geometric construction inherently demands P Q (i.e., M 1 ).

Open Questions

Several natural questions remain:
1.
Can one prove | D N | > N O ( N / ln N ) unconditionally, by combining contributions from multiple primes in ( N , 2 N ) and controlling overlaps?
2.
Can Step 4 of the density argument (Section 6.2) be established rigorously, thereby proving Hypothesis 1? This would require a tight upper bound on the number of m-values for which no straddling prime pair has half-difference m.
3.
What is the exact asymptotic behaviour of | D N | and G ( N ) ?
4.
Can the geometric framework accommodate P = Q (the case M = 0 ), thereby addressing the full classical Goldbach conjecture?
5.
Can similar geometric reformulations illuminate other additive problems, such as the twin prime conjecture or Waring’s problem?
6.
Can the structural properties of C N and D N —beyond their sizes—be exploited to prove C N D N without the pigeonhole principle?

Acknowledgments

The author is sincerely grateful to Iris, Marilin, Sonia, Yoselin, Arelis, Anissa, Liuva, Yudit, Gretel, Gema, and Blaquier, as well as Israel, Arderi, Juan Carlos, Yamil, Alejandro, Aroldo, Yary, Reinaldo, Alex, Emmanuel, and Michael for their constant support. Whether through encouragement, stimulating conversations, practical assistance, or simply being present during challenging moments, their contributions have played an important role in bringing this work to completion.

References

  1. Goldbach, C. Lettre XLIII. In Correspondance mathématique et physique de quelques célèbres géomètres du XVIIIème siècle; Fuss, P.H., Ed.; Imperial Academy of Sciences: St. Petersburg, 1843; Vol. 1, pp. 125–129. (letter to Leonhard Euler) (in German).
  2. Oliveira e Silva, T.; Herzog, S.; Pardi, S. Empirical verification of the even Goldbach conjecture and computation of prime gaps up to 4·1018. Mathematics of Computation 2014, 83, 2033–2060. [CrossRef]
  3. Dusart, P. Autour de la fonction qui compte le nombre de nombres premiers. https://www.unilim.fr/pages_perso/pierre.dusart/Documents/T1998_01.pdf, 1998. Accessed: 2026-04-11.
  4. Chebyshev, P.L. Mémoire sur les nombres premiers. Journal de mathématiques pures et appliquées 1852, 17, 366–390.
  5. Ramanujan, S. A proof of Bertrand’s postulate. Journal of the Indian Mathematical Society 1919, 11, 181–182.
  6. Rittaud, B.; Heeffer, A. The Pigeonhole Principle, Two Centuries before Dirichlet. The Mathematical Intelligencer 2014, 36, 27–29. [CrossRef]
  7. Vega, F. Experimental Results on Goldbach’s Conjecture. https://github.com/frankvegadelgado/goldbach, 2026. Accessed: 2026-04-11.
Table 1. Minimum G ( N ) values in dyadic intervals [ 2 m , 2 m + 1 ] . Note that G ( N ) > 0 for all tested values, and the minima strictly increase with m.
Table 1. Minimum G ( N ) values in dyadic intervals [ 2 m , 2 m + 1 ] . Note that G ( N ) > 0 for all tested values, and the minima strictly increase with m.
Interval (m) Range [ 2 m , 2 m + 1 ] N achieving min Min G ( N )
2 [ 4 , 8 ] 5 4.301898
3 [ 8 , 16 ] 9 7.354249
4 [ 16 , 32 ] 19 10.232033
5 [ 32 , 64 ] 61 14.078618
6 [ 64 , 128 ] 73 17.836335
7 [ 128 , 256 ] 151 20.608977
8 [ 256 , 512 ] 269 23.537165
9 [ 512 , 1024 ] 541 28.812111
10 [ 1024 , 2048 ] 1327 33.154668
11 [ 2048 , 4096 ] 2161 35.081569
12 [ 4096 , 8192 ] 7069 42.329014
13 [ 8192 , 16384 ] 14138 44.057758
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

Disclaimer

Terms of Use

Privacy Policy

Privacy Settings

© 2026 MDPI (Basel, Switzerland) unless otherwise stated