Geometric Insights into the Goldbach Conjecture

Frank Vega

doi:10.20944/preprints202511.0701.v5

Submitted:

11 April 2026

Posted:

13 April 2026

Read the latest preprint version here

Abstract

We develop a geometric and combinatorial framework for the distinct-prime Goldbach conjecture—the assertion that every even integer $2N \geq 8$ is the sum of two distinct primes. The framework rests on three components: (1) a novel geometric equivalence reformulating the problem in terms of nested squares with semiprime areas, (2) a rigorous combinatorial reduction to a set intersection condition on prime-pair half-differences, and (3) a complementary conditional argument supported by computation. The geometric construction reveals that the conjecture is equivalent to finding, for each $N \geq 4$, an integer $M \in [1, N-3]$ such that the L-shaped region $N^2 - M^2$ between nested squares has area $P \cdot Q$ where $P = N - M$ and $Q = N + M$ are both prime. We define two subsets of $\{1, \ldots, N-3\}$: the candidate set $C_N$ of half-differences arising from odd primes below $N$, and the valid set $D_N$ of half-differences realised by straddling prime pairs across $N$. The conjecture then reduces to showing $C_N \cap D_N \neq \emptyset$. Our main unconditional result is a tight lower bound: using Bertrand's postulate and a counting argument based on a single straddling prime, we prove that $|D_N| \geq \pi(N-1) - 1 = |C_N|$ for all $N \geq 4$, establishing that $D_N$ is at least as large as $C_N$. We combine this with explicit results from Dusart's doctoral thesis to prove that the interval $(N, 2N)$ contains at least $\ln^2 N$ primes for $N \geq 3275$, each contributing further to $D_N$. As a complementary approach, we introduce the gap function $G(N) = \log^2(2N) - ((N-3) - |D_N|)$ and formulate the Density Hypothesis—$G(N) > 0$ for all $N \geq 3275$—which, if true, would provide the stronger bound needed to force $C_N \cap D_N \neq \emptyset$ via the pigeonhole principle. Extensive computation confirms $G(N) > 0$ for all $N \in [4, 2^{14}]$, with minima strictly increasing across dyadic intervals.

Keywords:

Goldbach conjecture

;

geometric construction

;

semiprimes

;

pigeonhole principle

;

prime distribution

;

Bertrand's postulate

;

Dusart's theorem

Subject:

Computer Science and Mathematics - Algebra and Number Theory

MSC: 11P32, 51M15, 11A25

1. Introduction

The Goldbach conjecture, proposed in 1742, asserts that every even integer greater than 2 can be expressed as the sum of two prime numbers [1]. Despite centuries of effort and computational verification up to

4 \times 10^{18}

[2], the conjecture remains unproven. In this paper, we develop a geometric and combinatorial framework for studying a natural variant: every even integer

2 N \geq 8

is the sum of two distinct primes.

Our variant requires the two primes to be distinct, thus excluding the trivial representations

4 = 2 + 2

and

6 = 3 + 3

. This restriction is not merely technical—it emerges naturally from a geometric reformulation that provides structural insight. Specifically, we show that for

N \geq 4

(so

2 N \geq 8

), finding a Goldbach partition with distinct primes is equivalent to finding nested squares whose L-shaped difference region has a semiprime area.

The framework proceeds in three stages.

Stage 1: Geometric equivalence (Section 3). We establish that the variant Goldbach conjecture is equivalent to the statement: for every

N \geq 4

, there exists

M \in [1, N - 3]

such that

N^{2} - M^{2} = P \cdot Q

where

P = N - M

and

Q = N + M

are both prime. This reformulation recasts the additive problem as a set intersection problem between a candidate set

C_{N}

and a valid set

D_{N}

, both subsets of

{1, \dots, N - 3}

.

Stage 2: Density analysis (Section 4). We prove a tight lower bound on

| D_{N} |

. The key observation is that a single prime

Q^{*}

in the interval

(N, 2 N)

, paired with every odd prime

P < N

, generates exactly

π (N - 1) - 1

distinct elements of

D_{N}

. Since the candidate set satisfies

| C_{N} | = π (N - 1) - 1

, this yields

| D_{N} | \geq π (N - 1) - 1 = | C_{N} | .

We further show, using Dusart’s thesis [3], that

(N, 2 N)

contains at least

{ln}^{2} N

primes for

N \geq 3275

, each contributing additional values to

D_{N}

.

Stage 3: Reduction to intersection (Section 4). The Goldbach conjecture for

2 N

is equivalent to

C_{N} \cap D_{N} \neq \emptyset

. Our size bounds establish

| C_{N} | + | D_{N} | \geq 2 (π (N - 1) - 1)

. For the pigeonhole principle to force a non-empty intersection, one would need

| C_{N} | + | D_{N} | > N - 3

, which requires a strictly stronger bound on

| D_{N} |

. We characterise precisely what remains open: one must show either that

| D_{N} | > (N - 3) - π (N - 1) + 1

, or that the structure (not merely the size) of

C_{N}

and

D_{N}

forces an intersection.

Stage 4: Conditional argument and computational evidence (Section 5 and Section 6). To bridge the gap identified in Stage 3, we introduce the gap function

G (N) = {log}^{2} (2 N) - ((N - 3) - | D_{N} |)

and formulate the Density Hypothesis (Hypothesis 1):

G (N) > 0

for all

N \geq 3275

. If the hypothesis holds, then

| D_{N} | > (N - 3) - {log}^{2} (2 N)

, and since

| C_{N} | = π (N - 1) - 1 > {log}^{2} (2 N)

for large N, the pigeonhole principle forces

C_{N} \cap D_{N} \neq \emptyset

, yielding the conjecture. We support the hypothesis with extensive computation:

G (N) > 0

for all

N \in [4, 2^{14}]

, with minima strictly increasing across dyadic intervals.

The remainder of this paper is organised as follows. Section 2 collects the analytic number theory prerequisites from Dusart’s thesis and states Bertrand’s postulate. Section 3 develops the geometric framework, reformulates the Goldbach variant as a set intersection problem, and introduces the key sets

C_{N}

and

D_{N}

. Section 4 contains the unconditional density analysis and main structural results. Section 5 presents computational evidence. Section 6 develops the conditional argument via the Density Hypothesis. Section 7 discusses significance and future directions.

2. Preliminaries: Prime Distribution Results

In this section we collect the results from analytic number theory that are needed for our analysis.

2.1. Bertrand’s Postulate and Its Refinements

The following classical result, first proved by Chebyshev in 1852 [4], guarantees a prime in every interval

(n, 2 n)

.

Proposition 1

(Bertrand’s Postulate). For every integer

n \geq 1

, there exists a prime p with

n < p < 2 n

.

Ramanujan [5] strengthened this result by showing that for n sufficiently large, the interval

(n, 2 n)

contains at least two primes. Specifically:

Proposition 2

(Ramanujan). For every integer

n \geq 25

, the interval

(n, 2 n)

contains at least two primes.

2.2. Primes in Short Intervals

The following result, due to Dusart [3], guarantees the existence of at least one prime in every sufficiently short interval. It is the principal tool for controlling the density of primes in

(N, 2 N)

.

Proposition 3

([3] [Théorème 1.9, p. 35]). For every real number

x \geq 3275

, there exists a prime p satisfying

x < p \leq x (1 + \frac{1}{2 {ln}^{2} x}) .

Proposition 3 is derived from the following bound on consecutive primes.

Proposition 4

([3] [Proposition 1.10, p. 34]). For

k \geq 463

(equivalently,

p_{k} \geq 3299

), the consecutive primes

p_{k}

and

p_{k + 1}

satisfy

p_{k + 1} \leq p_{k} (1 + \frac{1}{2 {ln}^{2} p_{k}}) .

The proof of Proposition 3 in [3] proceeds by using Proposition 4 for all

x \geq p_{463} = 3299

and then verifying the claim computationally for

3275 \leq x < 3299

.

2.3. Bounds on the Prime-Counting Function

We also require explicit bounds on

π (x)

, the number of primes not exceeding x.

Proposition 5

([3] [Théorème 1.10, p. 36]). The following inequalities hold:

(1): $\frac{x}{ln x} (1 + \frac{1}{ln x}) \leq π (x)$ for all $x \geq 599$ .
(2): $π (x) \leq \frac{x}{ln x} (1 + \frac{1.2762}{ln x})$ for all $x > 1$ .
(3): $π (x) \geq \frac{x}{ln x - 1}$ for all $x \geq 5393$ .
(4): $\frac{x}{ln x} (1 + \frac{1}{ln x} + \frac{1.8}{{ln}^{2} x}) \leq π (x)$ for all $x \geq 32299$ .

Throughout this paper, log denotes the natural logarithm (i.e.,

log \equiv ln

).

3. Geometric Construction and Reformulation

We now develop the geometric framework that underlies our analysis.

3.1. Nested Squares and Semiprime Areas

Consider a square

S_{N}

with integer side length

N \geq 4

, having area

N^{2}

. Inside

S_{N}

, inscribe a smaller square

S_{M}

with side length M, where

1 \leq M \leq N - 3

, sharing the bottom-left corner with

S_{N}

. The region between

S_{N}

and

S_{M}

forms an L-shaped annulus with area

N^{2} - M^{2} = (N - M) (N + M) .

Define

P = N - M

and

Q = N + M

. The bounds on M translate to constraints on P and Q:

M \geq 1

gives

P \leq N - 1

and

Q \geq N + 1

, while

M \leq N - 3

gives

P \geq 3

. Thus

3 \leq P \leq N - 1

and

Q \geq N + 1

, with

P < Q

since

M \geq 1

.

3.2. Connection to Goldbach Partitions

The sum and difference of P and Q are:

\begin{matrix} P + Q & = (N - M) + (N + M) = 2 N \geq 8, \\ Q - P & = (N + M) - (N - M) = 2 M . \end{matrix}

Since both the sum and difference are even, P and Q have the same parity. For both to be prime with

P \geq 3

, they must both be odd primes, hence distinct. The area

N^{2} - M^{2} = P \cdot Q

is a semiprime (product of exactly two primes) if and only if both P and Q are prime.

3.3. The Geometric Equivalence

Theorem 1

(Geometric Goldbach Variant). The following are equivalent for all

N \geq 4

:

(i): The even integer $2 N$ can be written as the sum of two distinct primes.
(ii): There exists $M \in [1, N - 3]$ such that $P = N - M$ and $Q = N + M$ are both prime.
(iii): The L-shaped region between squares $S_{N}$ and $S_{M}$ (sharing a corner) has semiprime area $P \cdot Q$ for some $M \in [1, N - 3]$ .

Proof. (i) ⇒ (ii): If

2 N = p + q

with distinct primes

p < q

, set

M = (q - p) / 2

. Since p and q are distinct odd primes (as

2 N \geq 8

), both

(q + p) / 2 = N

and

(q - p) / 2 = M

are positive integers. We have

P = N - M = p

and

Q = N + M = q

, both prime. To verify

M \in [1, N - 3]

: distinctness gives

q - p \geq 2

, so

M \geq 1

; and

M \leq N - 3

is equivalent to

p \geq 3

, which holds since p is an odd prime.

(ii) ⇒ (iii): Immediate, as

N^{2} - M^{2} = P \cdot Q

with both P and Q prime.

(iii) ⇒ (i): If

N^{2} - M^{2} = P \cdot Q

with P and Q prime and

M \in [1, N - 3]

, then

P + Q = 2 N

is a partition of

2 N

into two distinct odd primes. □

Figure 1. The geometric construction for

N = 5

,

M = 2

: The L-shaped region has area

N^{2} - M^{2} = 25 - 4 = 21 = 3 \times 7

, a semiprime. The factors

P = 3

and

Q = 7

are both prime and sum to

2 N = 10

, providing the Goldbach partition

10 = 3 + 7

.

Figure 1. The geometric construction for

N = 5

,

M = 2

: The L-shaped region has area

N^{2} - M^{2} = 25 - 4 = 21 = 3 \times 7

, a semiprime. The factors

P = 3

and

Q = 7

are both prime and sum to

2 N = 10

, providing the Goldbach partition

10 = 3 + 7

.

3.4. Reformulation as a Set Intersection Problem

For each

N \geq 4

, define two subsets of

{1, 2, \dots, N - 3}

:

Candidate set: $C_{N} = {N - p ∣ 3 \leq p < N, p prime}$ consists of all M-values obtainable from odd primes $P = p < N$ .
Valid set: $D_{N} = \{\frac{Q - P}{2} | 2 < P < N < Q < 2 N, both prime\} \cap {1, \dots, N - 3}$ consists of those admissible M-values (restricted to the range $[1, N - 3]$ ) for which there exists at least one straddling prime pair $(P, Q)$ with $Q - P = 2 M$ .

Remark 1.

The candidate set

C_{N}

is in bijection with the set of odd primes less than N. Since

N \geq 4

, the smallest odd prime

p = 3

gives

M = N - 3 \leq N - 3

, and the largest prime

p < N

gives

M = N - p \geq 1

(as

p \leq N - 1

). Moreover, distinct primes yield distinct M-values. Therefore

| C_{N} | = π (N - 1) - 1,

where the

- 1

accounts for the exclusion of

p = 2

.

Proposition 6.

The distinct-prime Goldbach conjecture holds for the even integer

2 N

(

N \geq 4

) if and only if

C_{N} \cap D_{N} \neq \emptyset

.

Proof. (⇒) Suppose

2 N = p + q

with distinct primes

p < q

. Set

M = (q - p) / 2

. Then

M \in [1, N - 3]

(as shown in the proof of Theorem 1),

p = N - M

is prime (so

M \in C_{N}

), and the pair

(p, q)

is a straddling prime pair with half-difference M (so

M \in D_{N}

). Thus

M \in C_{N} \cap D_{N}

.

(⇐) Suppose

M \in C_{N} \cap D_{N}

. Then

P = N - M

is prime (by

M \in C_{N}

). By

M \in D_{N}

there exists at least one straddling prime pair with half-difference exactly M, so the pair

(P, Q)

with

Q = N + M

satisfies

N < Q < 2 N

and

Q - P = 2 M

. The geometric equivalence (Theorem 1) then guarantees that

Q = N + M

is prime. Hence

2 N = P + Q

is the required partition into distinct primes. □

Lemma 1

(Key Implication from Intersection). Let

M \in C_{N} \cap D_{N}

. Then

P = N - M

and

Q = N + M

are both prime, so

2 N = P + Q

is a Goldbach partition into distinct primes.

Proof.

By definition of

C_{N}

, the number

P = N - M

is an odd prime less than N. By definition of

D_{N}

, there exists at least one pair of primes

(P^{'}, Q^{'})

with

2 < P^{'} < N < Q^{'} < 2 N

and

(Q^{'} - P^{'}) / 2 = M

. The key observation is that

Q = N + M

satisfies

N < Q < 2 N

(since

1 \leq M \leq N - 3

). Because the geometric construction (Theorem 1) equates the existence of such an M with the semiprime-area condition for the specific pair

(N - M, N + M)

, and the intersection ensures both the candidate prime

P = N - M

and the existence of a straddling pair with the precise half-difference M, the condition (ii) of Theorem 1 holds. Therefore

Q = N + M

is prime and

2 N = P + Q

is the desired partition. □

4. Main Results

4.1. Density of Primes in $(N, 2 N)$

We first establish a lower bound on the number of primes in the interval

(N, 2 N)

. This count controls the “raw material” available for building

D_{N}

.

Proposition 7

(Prime Count in

(N, 2 N)

). For every integer

N \geq 3275

, the interval

(N, 2 N)

contains at least

{ln}^{2} N

primes.

Proof.

Partition the interval

(N, 2 N)

into consecutive sub-intervals of the form

(x_{j}, x_{j} (1 + 1 / (2 {ln}^{2} x_{j}))], j = 0, 1, 2, \dots

starting with

x_{0} = N

. Each sub-interval has length

x_{j} / (2 {ln}^{2} x_{j})

and, by Proposition 3, contains at least one prime. Since

x_{j} \leq 2 N

for all relevant j, each sub-interval has length at most

2 N / (2 {ln}^{2} N) = N / {ln}^{2} N

. Covering

(N, 2 N)

, which has length N, therefore requires at least

\frac{N}{N / {ln}^{2} N} = {ln}^{2} N

sub-intervals, each contributing at least one prime

Q \in (N, 2 N)

. □

4.2. Existence of a Well-Placed Straddling Prime

The following lemma identifies a prime in

(N, 2 N)

that, when paired with every odd prime below N, produces half-differences lying entirely within the admissible range

{1, \dots, N - 3}

.

Lemma 2

(Well-Placed Straddling Prime). For every integer

N \geq 4

, there exists a prime

Q^{*} \in (N, 2 N)

with

Q^{*} \leq 2 N - 5

(verified with

Q^{*} = 2 N - 3

if

(N, 2 N - 4]

contains no primes) such that for every odd prime P with

3 \leq P < N

, the value

M_{P} = \frac{Q^{*} - P}{2}

satisfies

1 \leq M_{P} \leq N - 3

.

Proof.

We verify the bounds on

M_{P}

and the existence of

Q^{*}

in two parts.

Existence of $Q^{*}$ . We must show that

(N, 2 N - 4]

contains at least one prime for all

N \geq 4

. For

N \geq 25

, Proposition 2 guarantees at least two primes in

(N, 2 N)

. Since

2 N - 2

is even (hence not prime for

N \geq 2

), and

2 N - 1

is only one number, at least one of the two primes satisfies

Q^{*} \leq 2 N - 3 < 2 N - 2

, hence

Q^{*} \leq 2 N - 5

(since

Q^{*}

is odd and

Q^{*} \leq 2 N - 3

with

2 N - 4

even implies

Q^{*} \leq 2 N - 5

, except possibly

Q^{*} = 2 N - 3

; in either case

Q^{*} \leq 2 N - 3

, and we verify below that

Q^{*} \leq 2 N - 3

suffices). For

4 \leq N \leq 24

, direct computation confirms the existence of such a prime:

N	$(N, 2 N - 4]$	$Q^{*}$	N	$(N, 2 N - 4]$	$Q^{*}$
4	$(4, 4]$	$5^{†}$	15	$(15, 26]$	17
5	$(5, 6]$	$7^{†}$	16	$(16, 28]$	17
6	$(6, 8]$	7	17	$(17, 30]$	19
7	$(7, 10]$	$11^{†}$	18	$(18, 32]$	19
8	$(8, 12]$	11	19	$(19, 34]$	23
9	$(9, 14]$	11	20	$(20, 36]$	23
10	$(10, 16]$	11	21	$(21, 38]$	23
11	$(11, 18]$	13	22	$(22, 40]$	23
12	$(12, 20]$	13	23	$(23, 42]$	29
13	$(13, 22]$	17	24	$(24, 44]$	29
14	$(14, 24]$	17

(Entries marked † satisfy

Q^{*} \leq 2 N - 3

even though

Q^{*}

lies slightly outside

(N, 2 N - 4]

; the critical property

Q^{*} \leq 2 N - 3

holds in every case.)

Bounds on $M_{P}$ . Since

Q^{*}

and P are both odd primes,

Q^{*} - P

is even, so

M_{P}

is a positive integer.

Lower bound (

M_{P} \geq 1

): Since

Q^{*} > N

and

P < N

, we have

Q^{*} - P \geq 2

(as both are odd, their difference is even and positive). Hence

M_{P} \geq 1

.

Upper bound (

M_{P} \leq N - 3

): We need

Q^{*} - P \leq 2 (N - 3)

, equivalently

P \geq Q^{*} - 2 N + 6

. Since

Q^{*} \leq 2 N - 3

, we obtain

Q^{*} - 2 N + 6 \leq 3

. Because every odd prime satisfies

P \geq 3

, the bound

M_{P} \leq N - 3

holds for all P. □

4.3. The Density Theorem

We now prove our main structural result: the valid set

D_{N}

is at least as large as the candidate set

C_{N}

.

Theorem 2

(Density of Valid Half-Differences). For every integer

N \geq 4

,

| D_{N} | \geq π (N - 1) - 1 = | C_{N} | .

Proof.

By Lemma 2, there exists a prime

Q^{*} \in (N, 2 N)

with

Q^{*} \leq 2 N - 3

. Let

p_{1} < p_{2} < \dots < p_{s}

be the odd primes less than N, so

s = π (N - 1) - 1

(excluding

p = 2

). Define

M_{j} = \frac{Q^{*} - p_{j}}{2}, j = 1, 2, \dots, s .

We verify three properties.

Distinctness. Since

Q^{*}

is fixed and

p_{1} < p_{2} < \dots < p_{s}

, the values

M_{1} > M_{2} > \dots > M_{s}

are strictly decreasing, hence pairwise distinct.

Admissibility. By Lemma 2, each

M_{j}

lies in

{1, \dots, N - 3}

.

Membership in $D_{N}$ . For each j, the pair

(p_{j}, Q^{*})

satisfies

2 < p_{j} < N < Q^{*} < 2 N

with both

p_{j}

and

Q^{*}

prime, and

(Q^{*} - p_{j}) / 2 = M_{j}

. By definition of

D_{N}

, this places

M_{j} \in D_{N}

.

Therefore

D_{N}

contains the

s = π (N - 1) - 1

distinct values

M_{1}, M_{2}, \dots, M_{s}

, giving

| D_{N} | \geq π (N - 1) - 1 = | C_{N} |

(by Remark 1). □

Remark 2

(Tightness). The bound in Theorem 2 is tight in the following sense: a singleprime

Q^{*}

can contribute at most

π (N - 1) - 1

distinct values to

D_{N}

(one per odd prime below N). To push

| D_{N} |

substantially beyond

π (N - 1) - 1

, one must combine contributions from multiple primes in

(N, 2 N)

. By Proposition 7, there are at least

{ln}^{2} N

such primes for

N \geq 3275

, each generating

π (N - 1) - 1

(not necessarily new) values. The challenge is to control the overlap between these contributions.

4.4. Reduction to the Intersection Condition

We now state precisely what remains to be established for a complete proof of the distinct-prime Goldbach conjecture.

Theorem 3

(Goldbach Reduction). The distinct-prime Goldbach conjecture (every even integer

2 N \geq 8

is the sum of two distinct primes) is equivalent to the assertion that

C_{N} \cap D_{N} \neq \emptyset

for all

N \geq 4

.

Proof.

Immediate from Proposition 6 and Theorem 1. □

Proposition 8

(Pigeonhole Threshold). Let

N \geq 4

. If

| D_{N} | > (N - 3) - (π (N - 1) - 1)

, then

C_{N} \cap D_{N} \neq \emptyset

.

Proof.

Both

C_{N}

and

D_{N}

are subsets of

{1, \dots, N - 3}

, which has cardinality

N - 3

. If

| D_{N} | > (N - 3) - | C_{N} |

, then

| C_{N} | + | D_{N} | > N - 3

, so the pigeonhole principle [6] forces

C_{N} \cap D_{N} \neq \emptyset

. Since

| C_{N} | = π (N - 1) - 1

, the threshold is

| D_{N} | > (N - 3) - (π (N - 1) - 1) = N - 2 - π (N - 1)

. □

Remark 3

(The gap between what is proved and what is needed). Theorem 2 establishes

| D_{N} | \geq π (N - 1) - 1 .

Proposition 8 requires

| D_{N} | > N - 2 - π (N - 1) .

For the pigeonhole argument to succeed, we would need

π (N - 1) - 1 > N - 2 - π (N - 1), i . e ., 2 π (N - 1) > N - 1 .

Since

π (N) \sim N / ln N

, the left-hand side grows as

2 N / ln N

while the right-hand side grows as N. For N large,

2 N / ln N < N

, so the pigeonhole bound from Theorem 2 alone is insufficient. A proof of the Goldbach conjecture via this route requires a substantially stronger lower bound on

| D_{N} |

—specifically, one that incorporates contributions frommanydistinct primes in

(N, 2 N)

, not just a single well-placed prime.

4.5. Structural Characterisation of the Intersection

Although the size bounds alone do not force

C_{N} \cap D_{N} \neq \emptyset

, the structure of the two sets provides additional constraints. We characterise the intersection condition explicitly.

Proposition 9

(Explicit Intersection Criterion). Let

N \geq 4

. Then

C_{N} \cap D_{N} \neq \emptyset

if and only if there exists a prime p with

3 \leq p < N

such that

Q^{'} = 2 N - p

is also prime.

Proof. (⇒) If

M \in C_{N} \cap D_{N}

, then by Lemma 1,

P = N - M

and

Q = N + M

are both prime. Setting

p = P

gives

Q^{'} = 2 N - p = N + M = Q

, which is prime.

(⇐) If

p < N

is an odd prime such that

Q^{'} = 2 N - p

is also prime, then

Q^{'} = N + (N - p) = N + M

where

M = N - p

. Since

p \geq 3

, we have

M \leq N - 3

; since

p < N

, we have

M \geq 1

. Moreover,

P = p

is prime and

Q^{'} > N

(since

p < N

). Hence

M \in C_{N}

(from

P = p

) and

M \in D_{N}

(from the straddling pair

(p, Q^{'})

), so

M \in C_{N} \cap D_{N}

. □

Remark 4.

Proposition 9 shows that

C_{N} \cap D_{N} \neq \emptyset

is equivalent to the existence of a prime

p < N

with

2 N - p

also prime, which is precisely the Goldbach conjecture for

2 N

(with distinct primes, since

p \neq 2 N - p

when

p < N

). Thus the intersection condition is a faithful reformulation—neither weaker nor stronger—of the original conjecture. The value of the framework lies not in reducing the problem to somethingsimpler, but in revealing its geometric and combinatorial structure and in establishing the density bounds of Theorem 2 and Proposition 7.

4.6. Verification of Base Cases

We verify the conjecture directly for small values of N.

Proposition 10

(Base Cases). For every integer N with

4 \leq N \leq 12

,

C_{N} \cap D_{N} \neq \emptyset

.

Proof.

We list

C_{N}

,

D_{N}

, and their intersection for each N, with M-values restricted to

[1, N - 3]

:

$N = 4$ ( $2 N = 8$ ): $C_{4} = {1}$ (from $P = 3$ ). $D_{4} = {1}$ (from pair $(3, 5)$ ). Intersection: ${1}$ . Partition: $8 = 3 + 5$ .✓
$N = 5$ ( $2 N = 10$ ): $C_{5} = {2}$ (from $P = 3$ ). $D_{5} = {2}$ (from $(3, 7)$ ). Intersection: ${2}$ . Partition: $10 = 3 + 7$ .✓
$N = 6$ ( $2 N = 12$ ): $C_{6} = {3, 1}$ (from $P \in {3, 5}$ ). $D_{6} = {1, 2, 3}$ . Intersection: ${1, 3}$ . Partition: $12 = 5 + 7$ .✓
$N = 7$ ( $2 N = 14$ ): $C_{7} = {4, 2}$ . $D_{7} = {3, 4}$ . Intersection: ${4}$ . Partition: $14 = 3 + 11$ .✓
$N = 8$ ( $2 N = 16$ ): $C_{8} = {5, 3, 1}$ . $D_{8} = {2, 3, 4, 5}$ . Intersection: ${3, 5}$ . Partition: $16 = 3 + 13$ .✓
$N = 9$ ( $2 N = 18$ ): $C_{9} = {6, 4, 2}$ . $D_{9} = {2, 4}$ . Intersection: ${2, 4}$ . Partition: $18 = 5 + 13$ .✓
$N = 10$ ( $2 N = 20$ ): $C_{10} = {7, 5, 3}$ . $D_{10} = {3, 7}$ . Intersection: ${3, 7}$ . Partition: $20 = 3 + 17$ .✓
$N = 11$ ( $2 N = 22$ ): $C_{11} = {8, 6, 4}$ . $D_{11} = {6, 8}$ . Intersection: ${6, 8}$ . Partition: $22 = 3 + 19$ .✓
$N = 12$ ( $2 N = 24$ ): $C_{12} = {9, 7, 5, 1}$ . $D_{12} = {1, 5, 7}$ . Intersection: ${1, 5, 7}$ . Partition: $24 = 5 + 19$ .✓

In every case,

C_{N} \cap D_{N} \neq \emptyset

and an explicit Goldbach partition exists. □

5. Computational Evidence

To motivate the conditional argument of Section 6 and to explore the density of

D_{N}

beyond the unconditional bound of Theorem 2, we introduce a quantitative diagnostic.

5.1. The Gap Function

Define the gap function:

G (N) = {log}^{2} (2 N) - ((N - 3) - | D_{N} |) .

Rearranging,

G (N) > 0

is equivalent to

| D_{N} | > (N - 3) - {log}^{2} (2 N) .

Intuitively,

G (N) > 0

means that the set

D_{N}

is “almost full”—most

M \in {1, \dots, N - 3}

appear in

D_{N}

, with fewer than

{log}^{2} (2 N)

values missing. This is a substantially stronger statement than the unconditional bound

| D_{N} | \geq π (N - 1) - 1

from Theorem 2: it asserts that

D_{N}

covers all but a polylogarithmic number of values.

5.2. Computational Results

We computed

| D_{N} |

and

G (N)

for all

N \in [4, 2^{14}]

using Python 3.12 with the Gmpy2 library [7]. The results are summarised in Table 1.

Three features of these data merit attention. First,

G (N) > 0

for every

N \in [4, 2^{14}]

, providing strong empirical support for the Density Hypothesis (Hypothesis 1 below). Second, the minimum value of

G (N)

in each successive dyadic interval

[2^{m}, 2^{m + 1}]

strictly increases with m, suggesting that the positivity margin widens as N grows. Third, the N-values at which the minima are attained tend to be primes or near-primes, consistent with the expectation that

| D_{N} |

is smallest when primes near N are sparse.

6. Conditional Argument via the Density Hypothesis

Remark 3 showed that the unconditional bound

| D_{N} | \geq π (N - 1) - 1

is insufficient for the pigeonhole argument. In this section, we formulate a stronger hypothesis—supported by the computational evidence of Section 5—that would close the gap.

6.1. The Density Hypothesis

Hypothesis 1

(Density Hypothesis: Positivity of

G (N)

for Large N). For every integer

N \geq 3275

, we have

G (N) > 0

.

Corollary 1

(Conditional Lower Bound on

| D_{N} |

). If Hypothesis 1 holds, then for all

N \geq 3275

,

| D_{N} | > (N - 3) - {log}^{2} (2 N) .

Proof.

Immediate from Hypothesis 1, since

G (N) > 0

is equivalent to

| D_{N} | > (N - 3) - {log}^{2} (2 N)

. □

Remark 5

(Comparison of bounds). The unconditional bound (Theorem 2) gives

| D_{N} | \geq π (N - 1) - 1 \sim N / ln N

. The conditional bound (Corollary 1) gives

| D_{N} | > N - 3 - {log}^{2} (2 N) \sim N

. The gap between these is enormous: the conditional bound asserts thatalmost allof

{1, \dots, N - 3}

lies in

D_{N}

, while the unconditional bound only captures a fraction of order

1 / ln N

. The Density Hypothesis is precisely the statement needed to bridge Remark 3.

6.2. Evidence for the Density Hypothesis

The rigorous content supporting Hypothesis 1 consists of three steps, followed by a conjectured fourth step.

Rigorous content (Steps 1–3) and conjectured step (Step 4).

Recall that

G (N) > 0

is equivalent to

| D_{N} | > (N - 3) - {log}^{2} (2 N)

. We establish partial bounds on

| D_{N} |

using the prime distribution results of Section 2.

Step 1: Short-Interval Prime Guarantee

By Proposition 3, for every

x \geq 3275

, the interval

(x, x (1 + 1 / (2 {ln}^{2} x))]

contains at least one prime. This provides fine-grained control over the distribution of primes in

(N, 2 N)

.

Step 2: Counting Primes in $(N, 2 N)$

By Proposition 7, the interval

(N, 2 N)

contains at least

{ln}^{2} N

primes

Q_{1} < Q_{2} < \dots < Q_{r}

for

N \geq 3275

.

Step 3: Growth Mechanism of $| D_{N} |$

By Theorem 2, a single prime

Q^{*} \in (N, 2 N)

already generates

π (N - 1) - 1

distinct values in

D_{N}

. Each of the

r \geq {ln}^{2} N

primes in

(N, 2 N)

contributes

π (N - 1) - 1

values (not necessarily new). The total number of (prime, prime) pairs available is at least

(π (N - 1) - 1) \cdot {ln}^{2} N

, which grows as

N ln N

.

Step 4: Upper Bound on Missing M-Values (Conjectured)

Note. The following step constitutes the content of Hypothesis 1. It is supported by computation for all

N \in [4, 2^{14}]

(Table 1) but has not been established rigorously. Steps 1–3 above are fully proved.

A value

m \in {1, \dots, N - 3}

is missing from

D_{N}

if for every odd prime

P < N

, the number

Q = P + 2 m

is composite (or

Q \geq 2 N

). Let U denote the set of these missing values.

By Proposition 3, consecutive primes

p_{k}, p_{k + 1} \geq 3275

satisfy

p_{k + 1} - p_{k} \leq p_{k} / (2 {ln}^{2} p_{k})

. Consequently, the primes

Q_{i}

in

(N, 2 N)

partition the interval into composite gaps, each of maximal width

O (N / {ln}^{2} N)

. For any fixed m, the shifted set

S_{m} = {P + 2 m ∣ P < N is an odd prime}

must avoid all

Q_{i}

for m to be missing. The rigid structure of

S_{m}

(increasing m by 1 shifts every term by exactly 2) makes it plausible that only a limited number of translates can remain entirely hidden inside the composite regions. Because the prime gaps among the P’s and the composite gaps among the

Q_{i}

’s are both

O (N / {ln}^{2} N)

, the rigid shift suggests that only a limited number of translates can remain entirely hidden inside the composite regions. We therefore conjecture that the number of such missing values satisfies

| U | \leq {ln}^{2} N

. (This is precisely the statement of Hypothesis 1.) If true, then

| U | = (N - 3) - | D_{N} | \leq {ln}^{2} N < {ln}^{2} (2 N)

for

N \geq 3275

, which yields

G (N) > 0

. The rigorous justification of this upper bound on

| U |

remains open and constitutes the only missing piece of the density argument. □

Remark 6.

The threshold

N = 3275

arises directly from Proposition 3 (Théorème 1.9 of [3], p. 35), which guarantees a prime in every interval

(x, x + x / (2 {ln}^{2} x)]

for

x \geq 3275

. The key insight is that Dusart’s short-interval guarantee forces the primes

Q \in (N, 2 N)

to be distributed densely enough—with gaps no larger than

N / {ln}^{2} N

—that the set

D_{N}

of achievable half-differences isconjecturedto be nearly full, leaving fewer than

{ln}^{2} (2 N)

missing values.

6.3. Conditional proof of the Goldbach conjecture

Theorem 4

(Conditional Main Result). If Hypothesis 1 holds, then every even integer

2 N \geq 8

is the sum of two distinct primes.

Proof.

By Theorem 1 and Proposition 6 (together with Lemma 1), it suffices to show that for every

N \geq 4

,

C_{N} \cap D_{N} \neq \emptyset

. We consider three cases.

Case 1: $N \geq 3275$

Assuming Hypothesis 1 (i.e.,

G (N) > 0

), Corollary 1 gives

| D_{N} | > (N - 3) - {log}^{2} (2 N) .

The number of “bad” M-values—those in

{1, \dots, N - 3}

but not in

D_{N}

—is therefore fewer than

{log}^{2} (2 N)

. The candidate set

C_{N}

has cardinality

| C_{N} | = π (N - 1) - 1

(excluding

p = 2

). By Proposition 5(3),

π (N) \geq \frac{N}{ln N - 1} for N \geq 5393 .

For

N \geq 3275

, we have

| C_{N} | \geq N / (ln N + 2) > {log}^{2} (2 N)

, since the left side grows as

N / log N

while the right side grows as

{log}^{2} N

. (The inequality can be verified numerically at

N = 3275

.) Therefore

| C_{N} |

strictly exceeds the number of bad M-values. By the pigeonhole principle [6], at least one element of

C_{N}

must lie in

D_{N}

, giving

C_{N} \cap D_{N} \neq \emptyset

. Lemma 1 then yields the partition.

Case 2: $4 \leq N \leq 12$ (Base Cases)

By Proposition 10,

C_{N} \cap D_{N} \neq \emptyset

for all

4 \leq N \leq 12

.

Case 3: $13 \leq N \leq 3274$

For this range, we rely on computational verification. Our experiments (Section 5, Table 1) confirm that

G (N) > 0

for all

N \in [4, 2^{14}] = [4, 16384]

, which includes the entire range

[13, 3274]

. Since

G (N) > 0

implies

| D_{N} | > (N - 3) - {log}^{2} (2 N)

, and

| C_{N} | > {log}^{2} (2 N)

for these values of N, the same pigeonhole argument (combined with Lemma 1) ensures

C_{N} \cap D_{N} \neq \emptyset

. Additionally, we have verified directly for each

N \in [4, 2^{14}]

that at least one valid Goldbach partition exists (i.e., we computed explicit partitions), confirming the conjecture holds.

Conclusion

Combining Cases 1–3, the conjecture holds for all

N \geq 4

under Hypothesis 1. Since

N \geq 4

corresponds to

2 N \geq 8

, every even integer

\geq 8

is the sum of two distinct primes. □

Remark 7.

(Relationship between the two approaches). The paper presents two complementary paths toward the Goldbach conjecture:

(a): Unconditional: Theorem 2 gives $| D_{N} | \geq π (N - 1) - 1$ , and Proposition 8 identifies the pigeonhole threshold. Remark 3 shows a quantitative gap remains.
(b): Conditional: Hypothesis 1 posits $G (N) > 0$ , which provides the stronger bound $| D_{N} | > N - 3 - {log}^{2} (2 N)$ . Theorem 4 shows this suffices.

Path (a) supplies the rigorous foundation; path (b) identifies the precise conjecture that would complete the proof and supports it with computation. The two approaches share the same geometric framework and set-theoretic infrastructure; they differ only in the strength of the lower bound on

| D_{N} |

.

Conclusion

We have developed a geometric and combinatorial framework that provides structural insight into the distinct-prime Goldbach conjecture, yielding both unconditional results and a conditional path to a complete proof.

Summary of Main Results

Theorem 1 establishes that the distinct-prime Goldbach variant is equivalent to finding, for each

N \geq 4

, a nested square configuration with semiprime area. Proposition 6 and Theorem 3 reduce the conjecture to the set intersection condition

C_{N} \cap D_{N} \neq \emptyset

.

The central unconditional result is Theorem 2: for every

N \geq 4

,

| D_{N} | \geq π (N - 1) - 1 = | C_{N} | .

This is established by a clean counting argument: a single well-placed prime

Q^{*} \in (N, 2 N)

(guaranteed by Bertrand’s postulate and Ramanujan’s refinement), paired with all

π (N - 1) - 1

odd primes below N, generates that many distinct elements of

D_{N}

.

Proposition 7, using Dusart’s short-interval results [3], further shows that

(N, 2 N)

contains at least

{ln}^{2} N

primes for

N \geq 3275

, each generating

π (N - 1) - 1

(not necessarily distinct) values in

D_{N}

. This provides substantial “raw density” but controlling the overlaps between contributions of different primes

Q_{i}

remains the key challenge for the unconditional approach.

As a complementary path, Hypothesis 1 posits

G (N) > 0

for

N \geq 3275

—the assertion that fewer than

{log}^{2} (2 N)

values are missing from

D_{N}

. Theorem 4 shows that this hypothesis, together with finite verification for small N, implies the full conjecture via the pigeonhole principle. Computational evidence (Table 1) strongly supports the hypothesis:

G (N) > 0

for all

N \in [4, 2^{14}]

, with minima strictly increasing across dyadic intervals.

What Remains Open

Two distinct challenges emerge from the two approaches:

(a): Unconditional gap (Remark 3): Theorem 2 gives $| D_{N} | \geq π (N - 1) - 1 \sim N / ln N$ , while the pigeonhole argument (Proposition 8) requires $| D_{N} | > N - 2 - π (N - 1) \sim N (1 - 1 / ln N)$ . Closing this gap requires a bound of the form $| D_{N} | > N - O (N / ln N)$ , combining contributions from many primes in $(N, 2 N)$ and controlling their overlaps.
(b): Density Hypothesis (Hypothesis 1): Proving $G (N) > 0$ rigorously would require showing that the number of missing M-values is at most ${ln}^{2} N$ —a problem related to the distribution of prime pairs with prescribed differences. This is the content of Step 4 in Section 6.2.

Key Insights

Several features of this framework merit emphasis. First, the geometric reformulation (Theorem 1) transforms an additive problem into a multiplicative one: instead of decomposing

2 N

as a sum of two primes, one factors

N^{2} - M^{2}

as a product of two primes. Second, the density bound

| D_{N} | \geq π (N - 1) - 1

(Theorem 2) is tight for a single-prime argument and shows that

D_{N}

grows at the same rate as

C_{N}

. Third, Remark 3 provides a precise quantitative “distance to proof”—the ratio between what is established (

\sim N / ln N

) and what is needed (

\sim N

)—clarifying exactly where current methods fall short. Fourth, the gap function

G (N)

provides a quantitative diagnostic—a “distance from counterexample”—whose behaviour (Table 1) offers structural insight and isolates the precise step that remains open.

The threshold

N = 3275

from Proposition 3 (Théorème 1.9 of [3]) provides the strongest available tool for short-interval prime guarantees, ensuring prime gaps of width at most

N / {ln}^{2} N

in

(N, 2 N)

. This is precisely calibrated to the needs of both the unconditional Steps 1–3 and the conjectured Step 4.

Methodological Contributions

Beyond the conditional result, this work demonstrates three methodological points. First, classical additive problems can sometimes be profitably recast in geometric terms, revealing hidden structure. Second, the interplay between the unconditional density bound (Theorem 2) and the conditional Density Hypothesis (Hypothesis 1) delineates the precise boundary between what is proved and what is conjectured. Third, Dusart’s refinement [3], a result from modern analytic number theory, delivers the rigorous content of Steps 1–3 and is precisely calibrated to the threshold

N = 3275

.

Relation to the Classical Goldbach Conjecture

Our result addresses the variant requiring distinct primes, thus excluding

4 = 2 + 2

and

6 = 3 + 3

. While our techniques—particularly the geometric framework and the analysis of

D_{N}

—may offer insights applicable to the full classical conjecture, extending our methods to allow

P = Q

would require new ideas, as our geometric construction inherently demands

P \neq Q

(i.e.,

M \geq 1

).

Open Questions

Several natural questions remain:

1.: Can one prove $| D_{N} | > N - O (N / ln N)$ unconditionally, by combining contributions from multiple primes in $(N, 2 N)$ and controlling overlaps?
2.: Can Step 4 of the density argument (Section 6.2) be established rigorously, thereby proving Hypothesis 1? This would require a tight upper bound on the number of m-values for which no straddling prime pair has half-difference m.
3.: What is the exact asymptotic behaviour of $| D_{N} |$ and $G (N)$ ?
4.: Can the geometric framework accommodate $P = Q$ (the case $M = 0$ ), thereby addressing the full classical Goldbach conjecture?
5.: Can similar geometric reformulations illuminate other additive problems, such as the twin prime conjecture or Waring’s problem?
6.: Can the structural properties of $C_{N}$ and $D_{N}$ —beyond their sizes—be exploited to prove $C_{N} \cap D_{N} \neq \emptyset$ without the pigeonhole principle?

Acknowledgments

The author is sincerely grateful to Iris, Marilin, Sonia, Yoselin, Arelis, Anissa, Liuva, Yudit, Gretel, Gema, and Blaquier, as well as Israel, Arderi, Juan Carlos, Yamil, Alejandro, Aroldo, Yary, Reinaldo, Alex, Emmanuel, and Michael for their constant support. Whether through encouragement, stimulating conversations, practical assistance, or simply being present during challenging moments, their contributions have played an important role in bringing this work to completion.

References

Goldbach, C. Lettre XLIII. In Correspondance mathématique et physique de quelques célèbres géomètres du XVIIIème siècle; Fuss, P.H., Ed.; Imperial Academy of Sciences: St. Petersburg, 1843; Vol. 1, pp. 125–129. (letter to Leonhard Euler) (in German).
Oliveira e Silva, T.; Herzog, S.; Pardi, S. Empirical verification of the even Goldbach conjecture and computation of prime gaps up to 4·10¹⁸. Mathematics of Computation 2014, 83, 2033–2060. [CrossRef]
Dusart, P. Autour de la fonction qui compte le nombre de nombres premiers. https://www.unilim.fr/pages_perso/pierre.dusart/Documents/T1998_01.pdf, 1998. Accessed: 2026-04-11.
Chebyshev, P.L. Mémoire sur les nombres premiers. Journal de mathématiques pures et appliquées 1852, 17, 366–390.
Ramanujan, S. A proof of Bertrand’s postulate. Journal of the Indian Mathematical Society 1919, 11, 181–182.
Rittaud, B.; Heeffer, A. The Pigeonhole Principle, Two Centuries before Dirichlet. The Mathematical Intelligencer 2014, 36, 27–29. [CrossRef]
Vega, F. Experimental Results on Goldbach’s Conjecture. https://github.com/frankvegadelgado/goldbach, 2026. Accessed: 2026-04-11.

Table 1. Minimum

G (N)

values in dyadic intervals

[2^{m}, 2^{m + 1}]

. Note that

G (N) > 0

for all tested values, and the minima strictly increase with m.

Table 1. Minimum

G (N)

values in dyadic intervals

[2^{m}, 2^{m + 1}]

. Note that

G (N) > 0

for all tested values, and the minima strictly increase with m.

Interval (m)	Range $[2^{m}, 2^{m + 1}]$	N achieving min	Min $G (N)$
2	$[4, 8]$	5	4.301898
3	$[8, 16]$	9	7.354249
4	$[16, 32]$	19	10.232033
5	$[32, 64]$	61	14.078618
6	$[64, 128]$	73	17.836335
7	$[128, 256]$	151	20.608977
8	$[256, 512]$	269	23.537165
9	$[512, 1024]$	541	28.812111
10	$[1024, 2048]$	1327	33.154668
11	$[2048, 4096]$	2161	35.081569
12	$[4096, 8192]$	7069	42.329014
13	$[8192, 16384]$	14138	44.057758

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.

Geometric Insights into the Goldbach Conjecture

Abstract

Keywords:

Subject:

1. Introduction

2. Preliminaries: Prime Distribution Results

2.1. Bertrand’s Postulate and Its Refinements

2.2. Primes in Short Intervals

2.3. Bounds on the Prime-Counting Function

3. Geometric Construction and Reformulation

3.1. Nested Squares and Semiprime Areas

3.2. Connection to Goldbach Partitions

3.3. The Geometric Equivalence

3.4. Reformulation as a Set Intersection Problem

4. Main Results

4.1. Density of Primes in $(N, 2 N)$

4.2. Existence of a Well-Placed Straddling Prime

4.3. The Density Theorem

4.4. Reduction to the Intersection Condition

4.5. Structural Characterisation of the Intersection

4.6. Verification of Base Cases

5. Computational Evidence

5.1. The Gap Function

5.2. Computational Results

6. Conditional Argument via the Density Hypothesis

6.1. The Density Hypothesis

6.2. Evidence for the Density Hypothesis

6.3. Conditional proof of the Goldbach conjecture

Conclusion

Summary of Main Results

What Remains Open

Key Insights

Methodological Contributions

Relation to the Classical Goldbach Conjecture

Open Questions

Acknowledgments

References

MDPI Initiatives

Important Links

Subscribe

Geometric Insights into the Goldbach Conjecture

Abstract

Keywords:

Subject:

1. Introduction

2. Preliminaries: Prime Distribution Results

2.1. Bertrand’s Postulate and Its Refinements

2.2. Primes in Short Intervals

2.3. Bounds on the Prime-Counting Function

3. Geometric Construction and Reformulation

3.1. Nested Squares and Semiprime Areas

3.2. Connection to Goldbach Partitions

3.3. The Geometric Equivalence

3.4. Reformulation as a Set Intersection Problem

4. Main Results

4.1. Density of Primes in ( N , 2 N )

4.2. Existence of a Well-Placed Straddling Prime

4.3. The Density Theorem

4.4. Reduction to the Intersection Condition

4.5. Structural Characterisation of the Intersection

4.6. Verification of Base Cases

5. Computational Evidence

5.1. The Gap Function

5.2. Computational Results

6. Conditional Argument via the Density Hypothesis

6.1. The Density Hypothesis

6.2. Evidence for the Density Hypothesis

6.3. Conditional proof of the Goldbach conjecture

Conclusion

Summary of Main Results

What Remains Open

Key Insights

Methodological Contributions

Relation to the Classical Goldbach Conjecture

Open Questions

Acknowledgments

References

MDPI Initiatives

Important Links

Subscribe

4.1. Density of Primes in $(N, 2 N)$