Proof of the Riemann Hypothesis via the Chebyshev Function and the Integral Convergence

Hao-Cong Wu

doi:10.20944/preprints202605.1525.v1

Submitted:

22 May 2026

Posted:

22 May 2026


Abstract

In this article, we offer a complete, self-contained, and entirely elementary proof of a mean-square estimate for the Chebyshev function. From this estimate we deduce the convergence of a certain integral, and thereby the validity of the Riemann hypothesis. The proof primarily employs elementary estimates of the Chebyshev function, the Cauchy-Schwarz inequality, and a dyadic decomposition (with Abel summation applied in the appendix). The results obtained are already optimal within the elementary framework and suffice to derive the convergence of the required integral, which is a sufficient condition for the Riemann hypothesis. In particular, the appendix provides a theoretical complement linking integral convergence, pointwise bounds and analyticity, and concludes that the well-known $o$-bound is valid, thereby reconfirming the validity of the Riemann hypothesis. In other words, we give a self-contained elementary proof of the mean-square estimate $\displaystyle\int_2^{X} \bigl(\psi(t)-t\bigr)^{2}\,dt = O(X^{2}\log^{2} X),$ where $\displaystyle \psi(x)$ is the Chebyshev function. From this we deduce that $\displaystyle\int_{1}^{\infty}\frac{|\psi(x)-x|}{x^{\frac{3}{2}+\varepsilon}}\,dx < \infty$ holds for every $\varepsilon>0,$ so that the integral $\displaystyle \int_1^{\infty} \frac{\psi(x)-x}{x^{\frac{3}{2}+\varepsilon}}\,dx$ converges absolutely, and hence also converges conditionally, for every $\varepsilon>0.$ Since the conditional convergence of this integral for every $\varepsilon>0$ is equivalent to RH, the Riemann hypothesis follows.
Moreover, the absolute convergence of the integral is equivalent to its conditional convergence, and each is equivalent to the $o$-bound $|\psi(x)-x| = o(x^{\frac{1}{2}+\varepsilon});$ all of these imply the $O$-bound $\psi(x)-x= O(x^{\frac{1}{2}+\varepsilon})$ for every $\varepsilon>0,$ thus reconfirming the validity of the Riemann hypothesis.

Keywords: Riemann hypothesis; Chebyshev function; asymptotic relation; mean-square estimate; integral convergence

Subject:

Computer Science and Mathematics - Algebra and Number Theory

MSC: Primary 11M26; Secondary 11N05, 11M06

1. Introduction

The Chebyshev function

ψ (x) = \sum_{n \leq x} Λ (n)

plays a central role in the study of the distribution of prime numbers. The prime number theorem (PNT), proved independently by Hadamard and de la Vallée Poussin in 1896 using complex analysis, states that

ψ (x) \sim x;

see Edwards [6].

In fact, the prime number theorem is equivalent to not only the asymptotic behavior

ψ (x) \sim x,

but also to the statement that the error term

ψ (x) - x

satisfies

ψ (x) - x = o (x)

as

x \to \infty .

A major breakthrough in the twentieth century was the discovery of elementary proofs of the prime number theorem by Selberg [3] and Erdős [4] in 1949, which avoid complex analysis; however, the resulting error term remained rather weak. In this paper, we deduce the stronger error term

ψ (x) - x = o (x^{\frac{1}{2} + ε})

holds for every

ε > 0 .

In this paper, we focus on the mean-square estimate of

ψ (x) - x .

Our goal is to prove, by purely elementary means, that

\int_{2}^{X} {(ψ (t) - t)}^{2} d t = O (X^{2} {log}^{2} X) .

(1)

This estimate is unconditional and, as we shall see, already suffices to prove the convergence of

\int_{1}^{\infty} | ψ (x) - x | x^{- \frac{3}{2} - ε} d x

for every

ε > 0 .

The proof mainly uses elementary estimates of the Chebyshev function and the Cauchy-Schwarz inequality. In particular, we avoid any complex analysis or unproven hypotheses.

The paper is organized as follows.

Section 2 recalls the necessary notation and preliminary estimates. Section 3 proves an upper bound for

U (N) = \sum_{k = 1}^{N} k A {(N / k)}^{2},

where

A (x) = ψ (x) - ⌊ x ⌋ .

Section 4 expands

U (N) / N .

Section 5 establishes a pointwise lower bound for the weights in

U (N) / N .

Section 6 uses the lower bound for the weights to conclude the core estimate

\sum_{m = 1}^{N} \frac{A {(m)}^{2}}{m} = O (N {log}^{2} N) .

Section 7 converts this discrete estimate into the continuous mean-square bound (1). Section 8 gives the proof of the convergence of

\int_{1}^{\infty} | ψ (x) - x | x^{- \frac{3}{2} - ε} d x

for every

ε > 0 .

Section 9 concludes with a summary, where we obtain

\int_{1}^{\infty} \frac{| ψ (x) - x |}{x^{\frac{3}{2} + ε}} d x < \infty

for every

ε > 0,

thus the integral

\int_{1}^{\infty} \frac{ψ (x) - x}{x^{\frac{3}{2} + ε}} d x

converges absolutely for every

ε > 0,

so that the integral

\int_{1}^{\infty} \frac{ψ (x) - x}{x^{\frac{3}{2} + ε}} d x

converges conditionally for every

ε > 0,

whereas the integral converges conditionally

\Leftrightarrow RH,

so then the Riemann hypothesis is true. In particular, an appendix provides a theoretical complement linking integral convergence, pointwise bounds and analyticity, where the absolute convergence of the integral is equivalent to the conditional convergence of the integral, either of which is equivalent to the o-bound:

| ψ (x) - x | = o (x^{\frac{1}{2} + ε}),

and all of them imply the O-bound:

ψ (x) - x = O (x^{\frac{1}{2} + ε})

is also valid for every

ε > 0,

thus reconfirming the validity of the Riemann hypothesis.

2. Preliminary

2.1. Notation Conventions and Logical Symbols

We use the following notation throughout the paper:

N = {1, 2, 3, \dots} .

p always denotes a prime.

log x

is the natural logarithm.

For a real number

x,

⌊ x ⌋

is the greatest integer

\leq x,

the fractional part is

{x} = x - ⌊ x ⌋,

which satisfies

0 \leq {x} < 1,

and

{x} = 0

if and only if x is an integer.

f (x) = O (g (x))

or

f (x) ≪ g (x)

means there exists a positive constant

C > 0

such that

| f (x) | \leq C g (x)

for all sufficiently large

x .

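Moreover, if

g (x)

is positive for all x in the domain and the ratio | f (x) | / g (x) is bounded on every bounded part of the domain, then the constant can be enlarged to some

C_{0} > 0

so that the estimate | f (x) | \leq C_{0} g (x) holds for all x in the domain, not only for sufficiently large

x .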

f (x) = o (g (x))

means

lim_{x \to \infty} |f (x) / g (x)| = 0 .

It is to be observed that the asymptotic relation

f (x) = o (g (x))

implies, and is stronger than, the asymptotic relation

f (x) = O (g (x)) .

f (x) \sim g (x)

means

lim_{x \to \infty} f (x) / g (x) = 1 .

Summations

\sum_{n \leq x}

are over integers n with

1 \leq n \leq x .

A {(N / n)}^{2}

means

{(A (N / n))}^{2} .

For the logical relations in theorems and lemmas we employ the standard symbols:

⇒: implies, if … then

⇐: is implied by

⇔: if and only if (iff)

The following terminology is used:

Sufficient condition:

a \Rightarrow b

means that if a is true, then b is true; a guarantees

b .

Necessary condition:

a \Leftarrow b

means that if b is true, then a must be true; without a we cannot have

b,

i.e., if a is not true, then b cannot be true.

Necessary and sufficient condition (equivalence):

a \Leftrightarrow b

means

a \Rightarrow b

and

b \Rightarrow a

; a and b are either both true or both false.

2.2. Basic Definitions

The Chebyshev

ψ

-function is defined by

ψ (x) = \sum_{p^{k} \leq x} log p,

where the sum runs over all prime powers

p^{k}

(p prime, and integer

k \geq 1

).

The Chebyshev

ϑ

-function is defined by

ϑ (x) = \sum_{p \leq x} log p,

where the sum runs over primes

p \leq x .

The von Mangoldt function

Λ (n)

is defined by

Λ (n) = \begin{cases} log p & if n = p^{k} for some prime p and integer k \geq 1, \\ 0 & otherwise . \end{cases}

A well-known identity connects it with the Chebyshev function

ψ (x) = \sum_{n \leq x} Λ (n) .

Define

a_{n} = Λ (n) - 1 and A (x) = \sum_{n \leq x} a_{n} .

Then

A (x) = \sum_{n \leq x} a_{n} = \sum_{n \leq x} Λ (n) - \sum_{n \leq x} 1 = ψ (x) - ⌊ x ⌋ = ψ (x) - x + {x}

with fractional part

0 \leq {x} < 1 .

Thus,

ψ (x) = A (x) + ⌊ x ⌋ = A (x) + x - {x},

and

ψ (x) - x = A (x) - {x} .

For integer

n,

⌊ n ⌋ = n,

so

A (n) = ψ (n) - n .
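As a numerical illustration of these definitions, the following minimal Python sketch evaluates Λ(n), ψ(x) and A(x) by brute force and checks the identity ψ(x) - x = A(x) - {x} at a sample point. The function names (von_mangoldt, psi, A, frac) and the trial-division approach are illustrative assumptions made for brevity; they are not taken from the paper.

```python
import math

def von_mangoldt(n):
    """Λ(n): log p if n = p^k for a prime p and integer k >= 1, otherwise 0."""
    if n < 2:
        return 0.0
    p = 2
    while p * p <= n:
        if n % p == 0:
            # p is the smallest prime factor of n; n is a prime power
            # exactly when dividing out every factor p leaves 1.
            while n % p == 0:
                n //= p
            return math.log(p) if n == 1 else 0.0
        p += 1
    return math.log(n)  # no factor up to sqrt(n), so n itself is prime

def psi(x):
    """Chebyshev function ψ(x) = sum of Λ(n) over integers 1 <= n <= x."""
    return sum(von_mangoldt(n) for n in range(1, math.floor(x) + 1))

def A(x):
    """A(x) = ψ(x) - floor(x)."""
    return psi(x) - math.floor(x)

def frac(x):
    """Fractional part {x} = x - floor(x)."""
    return x - math.floor(x)

x = 10.5
print(psi(x), math.log(2520))       # ψ(10.5) = log lcm(1, ..., 10) = log 2520
print(psi(x) - x, A(x) - frac(x))   # the two sides of the identity
```

For integer arguments the fractional part vanishes, so the same functions also reproduce A(n) = ψ(n) - n.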

2.3. The Prime Number Theorem

Let

π (x)

denote the number of primes not exceeding

x (> 0),

which is the familiar prime-counting function.

The classical prime number theorem asserts that

ψ (x) \sim x

as

x \to \infty,

or equivalently

π (x) \sim \frac{x}{log x} .

While this result is not needed in our derivations (all estimates we employ are elementary or established results analogous to the PNT), it provides the historical context and motivation for studying the error term

ψ (x) - x .

2.4. Non-trivial Zeros of Riemann’s Zeta Function and the Riemann Hypothesis

The Riemann zeta function is defined for

Re (s) > 1

by

ζ (s) = \sum_{n = 1}^{\infty} \frac{1}{n^{s}},

and it extends meromorphically to the whole complex plane, its only singularity being a simple pole at

s = 1

with residue

1 .

Non-trivial zeros of

ζ (s)

are the zeros of

ζ (s)

in the critical strip

0 < Re (s) < 1 .

As is well-known, the Riemann hypothesis (RH) states that all non-trivial zeros lie on the line

Re (s) = \frac{1}{2} .

In addition, we have the following facts.

Theorem 1.

(cf.[2], p.17.) If

ℜ e (s) > 1,

then

log ζ (s) = - \sum_{p} log (1 - p^{- s}) = \sum_{p, m} \frac{p^{- m s}}{m},

(2)

where p runs through all primes and m through all positive integers.

Theorem 2.

(cf.[2], pp.17-18.) If

ℜ e (s) > 1,

then

- \frac{ζ^{'} (s)}{ζ (s)} = \sum_{p} \frac{log p}{p^{s} - 1} = \sum_{p, m} (log p) p^{- m s} = \sum_{n = 1}^{\infty} \frac{Λ (n)}{n^{s}} = s \int_{1}^{\infty} \frac{ψ (x)}{x^{s + 1}} d x

(3)

where p runs through all primes and m through all positive integers.

Consequently, the well-known formula

- \frac{ζ^{'} (s)}{ζ (s)} - \frac{1}{s - 1} = 1 + s \int_{1}^{\infty} \frac{ψ (x) - x}{x^{s + 1}} d x

holds for

ℜ e (s) > 1 .

The left-hand side extends meromorphically to the half-plane

ℜ e (s) > 0,

where its only poles are at the non-trivial zeros of

ζ (s) ;

it has no other poles in the region

ℜ e (s) > 0 .
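The passage from (3) to the formula with ψ(x) - x in the integrand can be written out as follows. This is a routine calculation added for the reader's convenience, valid for ℜe(s) > 1, where all the integrals involved converge absolutely.

```latex
\begin{align*}
s \int_{1}^{\infty} \frac{x}{x^{s+1}}\,dx
  &= s \int_{1}^{\infty} x^{-s}\,dx = \frac{s}{s-1} = 1 + \frac{1}{s-1}, \\
-\frac{\zeta'(s)}{\zeta(s)}
  &= s \int_{1}^{\infty} \frac{\psi(x)}{x^{s+1}}\,dx
   = s \int_{1}^{\infty} \frac{\psi(x) - x}{x^{s+1}}\,dx + 1 + \frac{1}{s-1}.
\end{align*}
```

Subtracting 1/(s - 1) from both sides of the second line gives the stated identity.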

2.5. Well-known Equivalent Forms of RH in Terms of the Chebyshev Function

There are well-known equivalent forms of the Riemann hypothesis in terms of the Chebyshev function

ψ (x)

:

(1). Standard arithmetical form:

ψ (x) - x = O (x^{\frac{1}{2}} {log}^{2} x) \Leftrightarrow RH .

(2). Another arithmetical form: For every

ε > 0,

ψ (x) - x = O (x^{\frac{1}{2} + ε}) \Leftrightarrow RH .

(3). Integral form (analyticity):

\int_{1}^{\infty} \frac{ψ (x) - x}{x^{s + 1}} d x is analytic for Re (s) > \frac{1}{2} \Leftrightarrow RH .

(4). Integral form (conditional convergence): For every

ε > 0,

\int_{1}^{\infty} \frac{ψ (x) - x}{x^{\frac{3}{2} + ε}} d x converges conditionally \Leftrightarrow RH,

since

f (s) : = \int_{1}^{\infty} \frac{ψ (x) - x}{x^{s + 1}} d x

is analytic for

Re (s) > \frac{1}{2}

⇔

\int_{1}^{\infty} \frac{ψ (x) - x}{x^{\frac{3}{2} + ε}} d x

converges conditionally.

Remark 1.

For instance, these celebrated equivalent formulations of the Riemann hypothesis can be found in the paper by von Koch [1] and on pages

83 - 85

of the monograph by Ingham [2], which offer rich details and comprehensive perspectives.

2.6. Basic Analytical Tools

We shall employ the following standard results.

(i) Cauchy-Schwarz inequality.

For any real numbers

u_{1}, \dots, u_{N}

and

v_{1}, \dots, v_{N},

{(\sum_{n = 1}^{N} u_{n} v_{n})}^{2} \leq (\sum_{n = 1}^{N} u_{n}^{2}) (\sum_{n = 1}^{N} v_{n}^{2}) .

We also use its integral form: for functions

f, g

on an interval

I,

{(\int_{I} f (x) g (x) d x)}^{2} \leq (\int_{I} f {(x)}^{2} d x) (\int_{I} g {(x)}^{2} d x) .

For a reference, see G. H. Hardy, J. E. Littlewood and G. Pólya [5], Theorems 7 and 181.

(ii) A lemma on the O-notation.

Lemma 1.

(Bidirectional transfer). Let

f (N), g (N), h (N)

be functions defined on positive integers and suppose

f (N) = g (N) + O (h (N)) .

(4)

Then

g (N) = O (h (N)) if and only if f (N) = O (h (N)) .

Proof.

By (4), there exist constants

C_{0} > 0

and

N_{0}

such that

| f (N) - g (N) | \leq C_{0} | h (N) | for all N \geq N_{0} .

(5)

Sufficiency: Assume g(N) = O(h(N)). Then there exist constants

C_{1} > 0

and

N_{1}

with

| g (N) | \leq C_{1} | h (N) |

for

N \geq N_{1} .

For

N \geq max (N_{0}, N_{1}),

we have

| f (N) | \leq | f (N) - g (N) | + | g (N) | \leq (C_{0} + C_{1}) | h (N) |,

hence

f (N) = O (h (N)) .

Necessity: Assume

f (N) = O (h (N)) .

Then there exist constants

C_{2} > 0

and

N_{2}

with

| f (N) | \leq C_{2} | h (N) |

for

N \geq N_{2} .

For

N \geq max (N_{0}, N_{2}),

we have

| g (N) | \leq | g (N) - f (N) | + | f (N) | \leq (C_{0} + C_{2}) | h (N) |,

hence

g (N) = O (h (N)) .

□

Corollary 1.

(Absorption). If

h (N) = O (k (N)),

then

f (N) = g (N) + O (h (N)) ⟹ f (N) = g (N) + O (k (N)) .

Proof.

By

h (N) = O (k (N)),

there exist constants

C > 0

and

N_{0}

such that

| h (N) | \leq C | k (N) |

for all

N \geq N_{0} .

Hence any function that is

O (h (N))

is also

O (k (N)) .

Therefore the error term

O (h (N))

can be replaced by

O (k (N)) .

□

(iii) Two lemmas on the partition by floor values.

Lemma 2.

(Partition by Floor Values). For a fixed integer

N \geq 1,

define for each integer

m \geq 1

the set

J_{m} = \{k \in {1, \dots, N} : ⌊\frac{N}{k}⌋ = m\} .

Then the collection

{J_{m} : m = 1, 2, \dots, N}

forms a partition of

{1, 2, \dots, N} .

Furthermore, an equivalent description is

J_{m} = \{k \in N : \frac{N}{m + 1} < k \leq \frac{N}{m}\} .

Proof.

The proof of the lemma consists of several steps.

Step 1. The values of m range from 1 to

N .

Since

1 \leq k \leq N,

we have

⌊ N / k ⌋

is an integer between

⌊ N / N ⌋ = 1

and

⌊ N / 1 ⌋ = N .

Hence

m = ⌊ N / k ⌋

takes values in

{1, 2, \dots, N} .

Step 2. Disjointness. If

k \in J_{m}

and also

k \in J_{m^{'}}

with

m \neq m^{'},

then

⌊ N / k ⌋

would equal two different integers, which is impossible. So the sets

J_{m}

are pairwise disjoint.

Step 3. Covering. For any

k \in {1, \dots, N},

let

m = ⌊ N / k ⌋ .

Then

1 \leq m \leq N

and by definition

k \in J_{m} .

Hence every element of

{1, \dots, N}

belongs to some

J_{m} .

Thus

{J_{m}}_{m = 1}^{N}

is a partition of

{1, \dots, N} .

Step 4. Equivalent description. We show

J_{m} = \{k \in N : \frac{N}{m + 1} < k \leq \frac{N}{m}\} .

From

⌊ N / k ⌋ = m

we have

m \leq \frac{N}{k} < m + 1 .

Inverting (all terms positive) gives

\frac{N}{m + 1} < k \leq \frac{N}{m} .

Conversely, if k satisfies the inequality

\frac{N}{m + 1} < k \leq \frac{N}{m},

then

m \leq \frac{N}{k} < m + 1,

so

⌊ N / k ⌋ = m .

The condition

k \in N

together with

k \leq N / m \leq N

automatically implies

1 \leq k \leq N .

Hence the two descriptions coincide. Therefore, the lemma is proved. □
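The two descriptions of J_m in Lemma 2 can be compared on a small example. The sketch below is only an illustration: the function names floor_blocks and interval_block are hypothetical, and the interval condition N/(m+1) < k ≤ N/m is rewritten with integer multiplications to avoid rounding.

```python
def floor_blocks(N):
    """Group k = 1..N by the value m = floor(N / k); only non-empty blocks appear."""
    blocks = {}
    for k in range(1, N + 1):
        blocks.setdefault(N // k, []).append(k)
    return blocks

def interval_block(N, m):
    """J_m from the description N/(m+1) < k <= N/m, in exact integer arithmetic."""
    return [k for k in range(1, N + 1) if N < k * (m + 1) and k * m <= N]

N = 10
blocks = floor_blocks(N)
print(blocks)
# Compare both descriptions of J_m for every m = 1..N (empty blocks included).
print(all(interval_block(N, m) == blocks.get(m, []) for m in range(1, N + 1)))
```

In this example some values of m, such as m = 4, give an empty set J_m; the partition statement is unaffected, since empty blocks contribute no elements.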

Lemma 3.

For

J_{m} = \{k \in N : \frac{N}{m + 1} < k \leq \frac{N}{m}\},

we have

\sum_{k \in J_{m}} k = O (\frac{N^{2}}{m^{2}}) .

Proof.

The proof of the lemma consists of several steps.

Step 1. Every

k \in J_{m}

satisfies

k \leq N / m .

Hence

\sum_{k \in J_{m}} k \leq \frac{N}{m} \cdot | J_{m} | .

Step 2. The length of the interval is

\frac{N}{m} - \frac{N}{m + 1} = \frac{N}{m (m + 1)} .

Therefore the number of integers in it is at most

| J_{m} | \leq \frac{N}{m (m + 1)} + 1 .

Step 3. Substituting,

\sum_{k \in J_{m}} k \leq \frac{N}{m} (\frac{N}{m (m + 1)} + 1) = \frac{N^{2}}{m^{2} (m + 1)} + \frac{N}{m} .

Step 4. Bound each term:

\frac{N^{2}}{m^{2} (m + 1)} \leq \frac{N^{2}}{m^{3}},

and

\frac{N}{m} = \frac{N^{2}}{m^{2}} \cdot \frac{m}{N} \leq \frac{N^{2}}{m^{2}},

where

m \leq N .

Step 5. Thus

\sum_{k \in J_{m}} k \leq \frac{N^{2}}{m^{3}} + \frac{N^{2}}{m^{2}} \leq \frac{2 N^{2}}{m^{2}},

which proves

\sum_{k \in J_{m}} k = O (N^{2} / m^{2}) .

Therefore, the lemma is proved. □
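The explicit inequality obtained in Step 5 can be checked numerically for small N. The following sketch is an illustration under the assumption that a brute-force enumeration of J_m is acceptable; the names block_sum and lemma3_holds are hypothetical. The comparison is done in integers, in the form m² Σ k ≤ 2N².

```python
def block_sum(N, m):
    """Sum of the integers k in 1..N with floor(N / k) = m, i.e. the sum over J_m."""
    return sum(k for k in range(1, N + 1) if N // k == m)

def lemma3_holds(N):
    """Check m^2 * (sum over J_m of k) <= 2 * N^2 for every m = 1..N."""
    return all(m * m * block_sum(N, m) <= 2 * N * N for m in range(1, N + 1))

# Sums over J_1 = {6, ..., 10} and J_2 = {4, 5} for N = 10.
print(block_sum(10, 1), block_sum(10, 2))
print(all(lemma3_holds(N) for N in range(1, 101)))
```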

(iv) Dyadic decomposition. For sums over integers

n \geq 2,

we can decompose the range using powers of two.

Lemma 4.

Let

N \geq 2

be an integer and set

L = ⌈ {log}_{2} N ⌉,

so that

2^{L - 1} < N \leq 2^{L} .

Then the following identity holds:

\sum_{n = 2}^{N} f (n) = \sum_{k = 1}^{L} \sum_{2^{k - 1} < n \leq min (N, 2^{k})} f (n) .

(6)

Proof.

For

k = 1, 2, \dots, L - 1,

we have

2^{k} \leq 2^{L - 1} < N,

hence

min (N, 2^{k}) = 2^{k} .

The inner sum becomes

\sum_{2^{k - 1} < n \leq 2^{k}} f (n),

which covers the integers

2^{k - 1} + 1, \dots, 2^{k} .

For

k = L,

since

N \leq 2^{L},

we have

min (N, 2^{L}) = N,

and the inner sum becomes

\sum_{2^{L - 1} < n \leq N} f (n),

covering the remaining integers

2^{L - 1} + 1, \dots, N .

The intervals for

k = 1, \dots, L - 1

together with the last one partition the set

{2, 3, \dots, N}

without overlap. Thus the equality (6) holds. □

Remark 2.

Note that as

N \to \infty

the number of dyadic intervals in (6) grows without bound; this property is essential for the convergence arguments in Section 8. If the sum includes

n = 1,

we treat it separately:

\sum_{n = 1}^{N} f (n) = f (1) + \sum_{n = 2}^{N} f (n) .

This decomposition is used in Section 8 to handle integrals via dyadic intervals.
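A dyadic splitting of {2, ..., N} can be sketched as follows. In this illustration the blocks are (2^{k-1}, min(N, 2^k)] for k = 1, 2, ..., continued as long as 2^{k-1} < N, so that the last block is cut off at N. The function name dyadic_blocks and this stopping rule are choices made for the sketch.

```python
def dyadic_blocks(N):
    """Split {2, ..., N} into the blocks (2^(k-1), min(N, 2^k)] for k = 1, 2, ..."""
    blocks = []
    k = 1
    while 2 ** (k - 1) < N:
        lo, hi = 2 ** (k - 1), min(N, 2 ** k)
        blocks.append(list(range(lo + 1, hi + 1)))
        k += 1
    return blocks

def dyadic_sum(f, N):
    """Sum f(n) for n = 2..N block by block."""
    return sum(f(n) for block in dyadic_blocks(N) for n in block)

print(dyadic_blocks(5))   # → [[2], [3, 4], [5]]
print(dyadic_sum(lambda n: n * n, 20) == sum(n * n for n in range(2, 21)))
```

Every integer from 2 to N lands in exactly one block, which is the property used when a sum or integral is estimated block by block.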

(v) Abel summation (summation by parts), which will be applied in the appendix.

Lemma 5.

Let

{(a_{n})}_{n \geq 1}

and

{(b_{n})}_{n \geq 1}

be sequences of real numbers. Define

A_{0} = 0

and

A_{n} = \sum_{k = 1}^{n} a_{k}

for

n \geq 1 .

Then for any

N \geq 1,

we have

\sum_{n = 1}^{N} a_{n} b_{n} = A_{N} b_{N} + \sum_{n = 1}^{N - 1} A_{n} (b_{n} - b_{n + 1}) = A_{N} b_{N} - \sum_{n = 1}^{N - 1} A_{n} (b_{n + 1} - b_{n}) .

(7)

Proof.

Write

a_{n} = A_{n} - A_{n - 1}

(with

A_{0} = 0

). Then

\sum_{n = 1}^{N} a_{n} b_{n} = \sum_{n = 1}^{N} (A_{n} - A_{n - 1}) b_{n} = \sum_{n = 1}^{N} A_{n} b_{n} - \sum_{n = 1}^{N} A_{n - 1} b_{n} .

Shift the index in the second sum:

\sum_{n = 1}^{N} A_{n - 1} b_{n} = \sum_{k = 0}^{N - 1} A_{k} b_{k + 1} = \sum_{k = 1}^{N - 1} A_{k} b_{k + 1}

(since

A_{0} = 0

). Thus

\sum_{n = 1}^{N} a_{n} b_{n} = \sum_{n = 1}^{N} A_{n} b_{n} - \sum_{n = 1}^{N - 1} A_{n} b_{n + 1} = A_{N} b_{N} + \sum_{n = 1}^{N - 1} A_{n} (b_{n} - b_{n + 1}),

then we obtain (7). □
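Identity (7) can be confirmed on a concrete pair of sequences. The sketch below uses integer lists so that both sides are compared exactly; the function names abel_lhs and abel_rhs and the sample data are illustrative assumptions.

```python
def abel_lhs(a, b):
    """Left-hand side of (7): sum of a_n * b_n for n = 1..N."""
    return sum(x * y for x, y in zip(a, b))

def abel_rhs(a, b):
    """Right-hand side of (7): A_N b_N + sum_{n=1}^{N-1} A_n (b_n - b_{n+1})."""
    partial, total = [], 0
    for x in a:
        total += x
        partial.append(total)        # partial[n - 1] holds A_n
    N = len(a)
    tail = sum(partial[i] * (b[i] - b[i + 1]) for i in range(N - 1))
    return partial[-1] * b[-1] + tail

a = [3, -1, 4, 1, -5]
b = [2, 7, 1, 8, 2]
print(abel_lhs(a, b), abel_rhs(a, b))   # → 1 1
```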

2.7. Elementary Estimates

We shall apply the following well-known elementary facts.

Lemma 6.

(Asymptotic expansion of harmonic numbers). For

n \to \infty,

\sum_{k = 1}^{n} \frac{1}{k} = log n + γ + O (\frac{1}{n}),

where γ is the Euler-Mascheroni constant.

Proof.

See Apostol [7], Chapter 3, Theorems 3.1 and 3.2. □

Lemma 7.

We have the elementary estimates for the bounds:

ϑ (x) = O (x), ψ (x) = O (x), A (x) = O (x) .

(8)

The two Chebyshev functions

ψ (x)

and

ϑ (x)

satisfy

ψ (x) = ϑ (x) + O (\sqrt{x}) .

The details of the proof are presented below, consisting of three steps.

Proof.

Step 1. Proof of

ϑ (x) = O (x)

(Chebyshev upper bound)

Consider the binomial coefficient

\binom{2 n}{n} .

Upper bound:

\binom{2 n}{n} \leq {(1 + 1)}^{2 n} = 4^{n} .

Lower bound via primes: every prime p with

n < p \leq 2 n

divides

\binom{2 n}{n} .

Hence

\prod_{n < p \leq 2 n} p \leq \binom{2 n}{n} \leq 4^{n} .

Taking logarithms:

ϑ (2 n) - ϑ (n) \leq n log 4 .

Apply this repeatedly for

n = 2^{k}

and sum:

ϑ (2^{m}) \leq 2^{m} log 4 .

For general

x,

choose m such that

2^{m} \leq x < 2^{m + 1} .

Then

ϑ (x) \leq ϑ (2^{m + 1}) \leq 2^{m + 1} log 4 \leq 4 x log 2 = O (x) .

Thus there exists

C > 0

such that

ϑ (x) \leq C x

for all

x \geq 1 .

Step 2. Proof of

ψ (x) = ϑ (x) + O (\sqrt{x})

Decompose

ψ (x)

by prime power exponents:

ψ (x) = \sum_{k \geq 1} \sum_{p^{k} \leq x} log p = ϑ (x) + ϑ (\sqrt{x}) + ϑ (\sqrt[3]{x}) + \dots,

where only terms with

x^{1 / k} \geq 2

(i.e.,

k \leq {log}_{2} x

) are non-zero.

Using

ϑ (y) \leq C y

from step 1,

ψ (x) - ϑ (x) = \sum_{k = 2}^{⌊ {log}_{2} x ⌋} ϑ (x^{1 / k}) \leq C \sum_{k = 2}^{⌊ {log}_{2} x ⌋} x^{1 / k} .

Split the sum:

For

k = 2

: term

C \sqrt{x} .

For

k \geq 3 : x^{1 / k} \leq x^{1 / 3},

and there are at most

{log}_{2} x

such terms. Hence

\sum_{k = 3}^{⌊ {log}_{2} x ⌋} x^{1 / k} \leq x^{1 / 3} \cdot {log}_{2} x = o (\sqrt{x}) (x \to \infty) .

This part is

O (x^{1 / 3} log x),

which is certainly

O (\sqrt{x}) .

Therefore

ψ (x) - ϑ (x) = O (\sqrt{x}) .

(One often writes

O (\sqrt{x} log x),

but the above shows

O (\sqrt{x})

suffices because

x^{1 / 3} log x = o (\sqrt{x}) .

)

Step 3. From

ϑ (x) = O (x)

and

ψ (x) = ϑ (x) + O (\sqrt{x})

we immediately obtain

ψ (x) = O (x) .

Hence

| A (x) | = | ψ (x) - ⌊ x ⌋ | \leq ψ (x) + ⌊ x ⌋ = O (x),

whereas

⌊ x ⌋ \leq x .

Therefore, the lemma is proved. □
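The bounds of Lemma 7 can be inspected numerically on a finite range. The sketch below is an illustration only: the helper chebyshev_tables and the range limit 5000 are assumptions of this example, and a finite computation does not replace the proof. It tabulates ϑ(n) and ψ(n) with a sieve and prints the largest observed values of ϑ(n)/n and (ψ(n) - ϑ(n))/√n.

```python
import math

def chebyshev_tables(limit):
    """Return (theta, psi) with theta[n] = ϑ(n) and psi[n] = ψ(n) for 0 <= n <= limit."""
    composite = [False] * (limit + 1)
    theta = [0.0] * (limit + 1)
    psi = [0.0] * (limit + 1)
    for p in range(2, limit + 1):
        if not composite[p]:
            for q in range(p * p, limit + 1, p):
                composite[q] = True
            theta[p] += math.log(p)        # contribution of the prime p to ϑ
            power = p
            while power <= limit:          # contribution of each power p^k to ψ
                psi[power] += math.log(p)
                power *= p
    for n in range(1, limit + 1):          # turn increments into running sums
        theta[n] += theta[n - 1]
        psi[n] += psi[n - 1]
    return theta, psi

theta, psi = chebyshev_tables(5000)
# Largest ratio ϑ(n)/n on the range, to compare with the constant 4 log 2 from Step 1.
print(max(theta[n] / n for n in range(1, 5001)), 4 * math.log(2))
# Largest ratio (ψ(n) - ϑ(n)) / sqrt(n) on the range, illustrating Step 2.
print(max((psi[n] - theta[n]) / math.sqrt(n) for n in range(1, 5001)))
```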

3. Estimating U(N)

Lemma 8.

Let

U (N) = \sum_{k = 1}^{N} k A {(N / k)}^{2} .

Then

U (N) = O (N^{2} log N) .

(9)

Proof.

By Lemma 7 (Chebyshev’s bound), there is a constant

C > 0

such that

| A (x) | = | ψ (x) - ⌊ x ⌋ | \leq ψ (x) + ⌊ x ⌋ \leq C x

for all

x \geq 1,

since

ψ (x) = O (x)

and

⌊ x ⌋ \leq x .

Then

| U (N) | = |\sum_{k = 1}^{N} k A {(N / k)}^{2}| \leq \sum_{k = 1}^{N} k \cdot C^{2} \frac{N^{2}}{k^{2}} = C^{2} N^{2} \sum_{k = 1}^{N} \frac{1}{k} = O (N^{2} log N) .

Therefore, the lemma is proved. □

Corollary 2.

\frac{U (N)}{N} = O (N log N) .

(10)

Proof.

The proof follows immediately from Lemma 8. □
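For orientation, U(N) can be computed directly for moderate N. The sketch below is an illustration with hypothetical helper names (A_table, U); it uses the fact that ψ and the floor function change only at integers, so that A(N/k) = A(⌊N/k⌋). The printed ratios U(N)/(N² log N) are shown without any claim about their limiting behaviour.

```python
import math

def A_table(limit):
    """Return A with A[n] = ψ(n) - n for 0 <= n <= limit, via a sieve for Λ."""
    lam = [0.0] * (limit + 1)
    composite = [False] * (limit + 1)
    for p in range(2, limit + 1):
        if not composite[p]:
            for q in range(p * p, limit + 1, p):
                composite[q] = True
            power = p
            while power <= limit:          # Λ(p^k) = log p
                lam[power] = math.log(p)
                power *= p
    A, psi = [0.0] * (limit + 1), 0.0
    for n in range(1, limit + 1):
        psi += lam[n]
        A[n] = psi - n
    return A

def U(N, A):
    """U(N) = sum_{k=1}^{N} k * A(N/k)^2, with A(N/k) evaluated as A[floor(N/k)]."""
    return sum(k * A[N // k] ** 2 for k in range(1, N + 1))

A = A_table(2000)
for N in (10, 100, 1000, 2000):
    print(N, U(N, A), U(N, A) / (N * N * math.log(N)))
```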

4. Expanding U(N)/N

For each

k = 1, \dots, N

define

m = ⌊ N / k ⌋ .

According to Lemma 2 (Partition by Floor Values), we know that the sets

J_{m} = \{k \in {1, \dots, N} : ⌊\frac{N}{k}⌋ = m\} = \{k \in N : \frac{N}{m + 1} < k \leq \frac{N}{m}\}

form a partition of

{1, \dots, N} .

Hence

U (N) = \sum_{k = 1}^{N} k A {(N / k)}^{2} = \sum_{m = 1}^{N} \sum_{k \in J_{m}} k A {(N / k)}^{2} .

For

k \in J_{m}

we write

A (N / k) = A (m) + δ_{m, k} .

To bound

δ_{m, k},

note that

δ_{m, k} = A (N / k) - A (m) = (ψ (N / k) - ⌊ N / k ⌋) - (ψ (m) - m) = ψ (N / k) - ψ (m),

because

⌊ N / k ⌋ = m

and

⌊ m ⌋ = m .

Since

m \leq N / k < m + 1,

the interval

(m, N / k]

has length

\frac{N}{k} - m < (m + 1) - m = 1,

hence it contains at most one integer. Consequently,

ψ (N / k) - ψ (m) = \sum_{m < n \leq N / k} Λ (n)

is either 0 or

Λ (n_{0})

for a single integer

n_{0}

(if it exists). In either case,

| ψ (N / k) - ψ (m) | \leq log N,

(11)

because

Λ (n_{0}) \leq log n_{0} \leq log N

when

n_{0}

exists. Therefore

| δ_{m, k} | \leq log N .

Expanding the square, we have

\begin{matrix} U (N) & = \sum_{m = 1}^{N} \sum_{k \in J_{m}} k A {(N / k)}^{2} \\ & = \sum_{m = 1}^{N} A {(m)}^{2} \sum_{k \in J_{m}} k + 2 \sum_{m = 1}^{N} A (m) \sum_{k \in J_{m}} k δ_{m, k} + \sum_{m = 1}^{N} \sum_{k \in J_{m}} k δ_{m, k}^{2} . \end{matrix}

In this expansion, the cross-term and the quadratic term each contribute at most

O (N^{2} {log}^{2} N) .

(1). Cross-term:

|\sum_{m = 1}^{N} A (m) \sum_{k \in J_{m}} k δ_{m, k}| \leq log N \sum_{m = 1}^{N} | A (m) | \sum_{k \in J_{m}} k .

Since

\sum_{k \in J_{m}} k = O (N^{2} / m^{2})

and

| A (m) | = O (m),

we have

\sum_{m = 1}^{N} | A (m) | \sum_{k \in J_{m}} k ≪ N^{2} \sum_{m = 1}^{N} \frac{1}{m} = O (N^{2} log N) .

Hence the cross-term is

O (N^{2} {log}^{2} N) .

(2). Quadratic term:

|\sum_{m = 1}^{N} \sum_{k \in J_{m}} k δ_{m, k}^{2}| \leq {log}^{2} N \sum_{m = 1}^{N} \sum_{k \in J_{m}} k = {log}^{2} N \sum_{k = 1}^{N} k = O (N^{2} {log}^{2} N) .

Dividing by

N,

we obtain

\frac{U (N)}{N} = \sum_{m = 1}^{N} A {(m)}^{2} \sum_{k \in J_{m}} \frac{k}{N} + O (N {log}^{2} N) .

(12)
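The expansion of U(N) into a main term, a cross-term and a quadratic term can be reproduced numerically. The sketch below is an illustration with hypothetical names (mangoldt_table, expansion_terms). It evaluates ψ(N/k) by summing Λ(n) over the integers n with nk ≤ N. Because N/k < m + 1, the interval (m, N/k] contains no integer, so in this computation the increments δ_{m,k} come out as zero up to rounding; this is consistent with the bound |δ_{m,k}| ≤ log N used above.

```python
import math

def mangoldt_table(limit):
    """Return lam with lam[n] = Λ(n) for 0 <= n <= limit."""
    lam = [0.0] * (limit + 1)
    composite = [False] * (limit + 1)
    for p in range(2, limit + 1):
        if not composite[p]:
            for q in range(p * p, limit + 1, p):
                composite[q] = True
            power = p
            while power <= limit:
                lam[power] = math.log(p)
                power *= p
    return lam

def expansion_terms(N):
    """Return (U, main, cross, quad, max_abs_delta) for the block expansion of U(N)."""
    lam = mangoldt_table(N)
    A = [0.0] * (N + 1)                          # A[m] = ψ(m) - m
    for m in range(1, N + 1):
        A[m] = A[m - 1] + lam[m] - 1.0
    U = main = cross = quad = max_delta = 0.0
    for k in range(1, N + 1):
        m = N // k                               # k belongs to J_m
        psi_val = sum(lam[n] for n in range(1, N + 1) if n * k <= N)  # ψ(N/k)
        value = psi_val - m                      # A(N/k) = ψ(N/k) - floor(N/k)
        delta = value - A[m]                     # δ_{m,k}
        U += k * value ** 2
        main += k * A[m] ** 2
        cross += 2 * k * A[m] * delta
        quad += k * delta ** 2
        max_delta = max(max_delta, abs(delta))
    return U, main, cross, quad, max_delta

print(expansion_terms(200))
```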

5. A Pointwise Lower Bound for the Weights

Lemma 9.

For all

m \geq 1

and

N \geq 2,

we have

\sum_{k \in J_{m}} \frac{k}{N} \geq \frac{1}{2 m} .

Proof.

From the explicit description

J_{m} = {k : N / (m + 1) < k \leq N / m},

let

k_{min}

be the smallest element of

J_{m} .

Then

k_{min} > N / (m + 1) .

Hence

\sum_{k \in J_{m}} \frac{k}{N} \geq \frac{k_{min}}{N} > \frac{1}{m + 1} \geq \frac{1}{2 m},

because

2 m \geq m + 1

for all

m \geq 1 .

□
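The weights in Lemma 9 can be evaluated exactly for a small N. The sketch below is an illustration with hypothetical names (weights, check_weights). It checks the inequality for those m whose block J_m is non-empty, which is the case covered by the choice of a smallest element k_min in the proof, and it lists separately the indices m for which J_m turns out to be empty.

```python
from fractions import Fraction

def weights(N):
    """Map m -> sum_{k in J_m} k / N as an exact fraction, for non-empty J_m only."""
    sums = {}
    for k in range(1, N + 1):
        m = N // k
        sums[m] = sums.get(m, 0) + k
    return {m: Fraction(total, N) for m, total in sums.items()}

def check_weights(N):
    """Return (bound holds on all non-empty J_m, list of m in 1..N with empty J_m)."""
    w = weights(N)
    bound_ok = all(w[m] >= Fraction(1, 2 * m) for m in w)
    empty = [m for m in range(1, N + 1) if m not in w]
    return bound_ok, empty

print(check_weights(10))   # → (True, [4, 6, 7, 8, 9])
```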

6. The Core Estimate

Since

A {(m)}^{2} \geq 0,

multiplying the inequality

\sum_{k \in J_{m}} \frac{k}{N} \geq \frac{1}{2 m}

of Lemma 9 by

A {(m)}^{2}

and summing over m yields

\sum_{m = 1}^{N} A {(m)}^{2} \sum_{k \in J_{m}} \frac{k}{N} \geq \frac{1}{2} \sum_{m = 1}^{N} \frac{A {(m)}^{2}}{m} .

Insert this into (12):

\frac{U (N)}{N} \geq \frac{1}{2} \sum_{m = 1}^{N} \frac{A {(m)}^{2}}{m} + O (N {log}^{2} N) .

(13)

By (10), we have

\frac{U (N)}{N} = O (N log N),

and therefore

\frac{U (N)}{N} = O (N {log}^{2} N) .

Hence

\frac{1}{2} \sum_{m = 1}^{N} \frac{A {(m)}^{2}}{m} \leq \frac{U (N)}{N} + O (N {log}^{2} N) = O (N {log}^{2} N) .

Thus

\sum_{m = 1}^{N} \frac{A {(m)}^{2}}{m} = O (N {log}^{2} N) .

(14)

This is the central estimate of the paper.
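The left-hand side of (14) is straightforward to tabulate. The following sketch is an illustration with hypothetical helper names (A_table, core_sum); it prints the sum together with its ratio to N log² N for a few values of N, as raw data only and without any claim about the behaviour of that ratio.

```python
import math

def A_table(limit):
    """Return A with A[n] = ψ(n) - n for 0 <= n <= limit, via a sieve for Λ."""
    lam = [0.0] * (limit + 1)
    composite = [False] * (limit + 1)
    for p in range(2, limit + 1):
        if not composite[p]:
            for q in range(p * p, limit + 1, p):
                composite[q] = True
            power = p
            while power <= limit:          # Λ(p^k) = log p
                lam[power] = math.log(p)
                power *= p
    A, psi = [0.0] * (limit + 1), 0.0
    for n in range(1, limit + 1):
        psi += lam[n]
        A[n] = psi - n
    return A

def core_sum(N, A):
    """Sum of A(m)^2 / m for m = 1..N."""
    return sum(A[m] ** 2 / m for m in range(1, N + 1))

A = A_table(2000)
for N in (10, 100, 1000, 2000):
    S = core_sum(N, A)
    print(N, S, S / (N * math.log(N) ** 2))
```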

7. From the Core Estimate to an Upper Bound for the Mean-Square Integral

For

t \in [n, n + 1),

we have

ψ (t) = ψ (n),

hence

ψ (t) - t = ψ (n) - t = (ψ (n) - n) + (n - t) = A (n) + (n - t) .

Let

u = t - n \in [0, 1) .

Then

\int_{n}^{n + 1} {(ψ (t) - t)}^{2} d t = \int_{0}^{1} {(A (n) - u)}^{2} d u = A {(n)}^{2} - A (n) + \frac{1}{3} .
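The evaluation of the integral over a unit interval is elementary; written out with the abbreviation a = A(n) (a shorthand used only in this display), it reads:

```latex
\begin{align*}
\int_{0}^{1} (a - u)^{2}\,du
  &= \int_{0}^{1} \bigl(a^{2} - 2au + u^{2}\bigr)\,du
   = a^{2} - 2a \cdot \frac{1}{2} + \frac{1}{3}
   = a^{2} - a + \frac{1}{3}, \qquad a = A(n).
\end{align*}
```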

Summing from

n = 2

to

⌊ X ⌋

yields

\int_{2}^{X} {(ψ (t) - t)}^{2} d t = \sum_{n \leq X} A {(n)}^{2} - \sum_{n \leq X} A (n) + O (X) .

(15)

By (14), we have

\sum_{n = 1}^{N} \frac{A {(n)}^{2}}{n} = O (N {log}^{2} N),

and using the obvious inequality

1 \leq n \leq N

, we can bound

\sum_{n = 1}^{N} A {(n)}^{2}

:

\sum_{n = 1}^{N} A {(n)}^{2} = \sum_{n = 1}^{N} n \cdot \frac{A {(n)}^{2}}{n} \leq N \sum_{n = 1}^{N} \frac{A {(n)}^{2}}{n} = O (N^{2} {log}^{2} N) .

(16)

So, we have

\sum_{n \leq X} A {(n)}^{2} = O (X^{2} {log}^{2} X) .

By Cauchy-Schwarz, we get

|\sum_{n \leq X} A (n)| \leq {(\sum_{n \leq X} 1)}^{1 / 2} {(\sum_{n \leq X} A {(n)}^{2})}^{1 / 2} = O (X^{1 / 2} \cdot X log X) = O (X^{3 / 2} log X),

which is negligible compared with

O (X^{2} {log}^{2} X) .

Therefore

\int_{2}^{X} {(ψ (t) - t)}^{2} d t = O (X^{2} {log}^{2} X) .

(17)
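For integer X the mean-square integral can be computed exactly from the interval formula above and compared with a direct numerical quadrature. The sketch below is an illustration with hypothetical helper names (A_table, mean_square_exact, mean_square_midpoint); the midpoint rule is used only as an independent check, and the printed ratios to X² log² X are raw data without any claim about their behaviour.

```python
import math

def A_table(limit):
    """Return A with A[n] = ψ(n) - n for 0 <= n <= limit, via a sieve for Λ."""
    lam = [0.0] * (limit + 1)
    composite = [False] * (limit + 1)
    for p in range(2, limit + 1):
        if not composite[p]:
            for q in range(p * p, limit + 1, p):
                composite[q] = True
            power = p
            while power <= limit:          # Λ(p^k) = log p
                lam[power] = math.log(p)
                power *= p
    A, psi = [0.0] * (limit + 1), 0.0
    for n in range(1, limit + 1):
        psi += lam[n]
        A[n] = psi - n
    return A

def mean_square_exact(X, A):
    """Integral of (ψ(t) - t)^2 over [2, X] for an integer X, interval by interval."""
    return sum(A[n] ** 2 - A[n] + 1.0 / 3.0 for n in range(2, X))

def mean_square_midpoint(X, A, steps=200):
    """Midpoint-rule approximation of the same integral; on [n, n+1), ψ(t) - t = A(n) - u."""
    h = 1.0 / steps
    return sum((A[n] - (j + 0.5) * h) ** 2 * h
               for n in range(2, X) for j in range(steps))

A = A_table(1000)
for X in (10, 100, 1000):
    exact = mean_square_exact(X, A)
    print(X, exact, exact / (X * X * math.log(X) ** 2))
```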

8. Convergence of the Integral

Theorem 3.

For every

ε > 0,

the integral

\int_{1}^{\infty} \frac{| ψ (x) - x |}{x^{\frac{3}{2} + ε}} d x < \infty,

i.e., the integral

\int_{1}^{\infty} \frac{ψ (x) - x}{x^{\frac{3}{2} + ε}} d x

converges absolutely.

Proof.

For any

ε > 0,

we have

\int_{1}^{\infty} \frac{| ψ (x) - x |}{x^{\frac{3}{2} + ε}} d x \leq \int_{1}^{2} \frac{| ψ (x) - x |}{x^{\frac{3}{2} + ε}} d x + \sum_{n = 1}^{\infty} \int_{2^{n}}^{2^{n + 1}} \frac{| ψ (x) - x |}{x^{\frac{3}{2} + ε}} d x .

Let us examine the integral with weight

x^{- \frac{3}{2} - ε}

on dyadic interval

[2^{n}, 2^{n + 1}],

using (17) we have

\int_{2^{n}}^{2^{n + 1}} {(ψ (x) - x)}^{2} d x = O (2^{2 (n + 1)} {log}^{2} (2^{n + 1})) = O (2^{2 n} {log}^{2} (2^{n + 1})),

and by Cauchy-Schwarz, we get

\begin{matrix} \int_{2^{n}}^{2^{n + 1}} \frac{| ψ (x) - x |}{x^{\frac{3}{2} + ε}} d x & \leq {(2^{n})}^{- \frac{3}{2} - ε} {(\int_{2^{n}}^{2^{n + 1}} 1^{2} d x)}^{1 / 2} {(\int_{2^{n}}^{2^{n + 1}} {(ψ (x) - x)}^{2} d x)}^{1 / 2} \\ & ≪ {(2^{n})}^{- \frac{3}{2} - ε} \cdot {(2^{n})}^{1 / 2} \cdot (2^{n} log (2^{n + 1})) = (n + 1) 2^{- n ε} log 2 . \end{matrix}

Hence

\int_{2^{n}}^{2^{n + 1}} \frac{| ψ (x) - x |}{x^{\frac{3}{2} + ε}} d x ≪ (n + 1) 2^{- n ε},

and the series

\sum_{n = 1}^{\infty} (n + 1) 2^{- n ε}

converges. Adding the finite contribution from

[1, 2],

we obtain

\int_{1}^{\infty} \frac{| ψ (x) - x |}{x^{\frac{3}{2} + ε}} d x < \infty,

this completes the proof. □
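Two pieces of bookkeeping in this proof can be checked mechanically: the exponent of 2^n in the dyadic estimate, and the convergence of the series Σ (n + 1) 2^{-nε}. The sketch below is an illustration with hypothetical function names. It uses the standard closed form Σ_{n≥0} (n + 1) r^n = 1/(1 - r)² for 0 < r < 1, with r = 2^{-ε}, so that the series starting at n = 1 equals 1/(1 - r)² - 1.

```python
from fractions import Fraction

def dyadic_exponent(eps):
    """Exponent of 2^n in (2^n)^(-3/2 - eps) * (2^n)^(1/2) * 2^n."""
    return Fraction(-3, 2) - eps + Fraction(1, 2) + 1

def series_partial(eps, terms):
    """Partial sum of (n + 1) * 2^(-n * eps) for n = 1..terms."""
    r = 2.0 ** (-eps)
    return sum((n + 1) * r ** n for n in range(1, terms + 1))

def series_limit(eps):
    """Closed form of the full series: 1 / (1 - r)^2 - 1 with r = 2^(-eps)."""
    r = 2.0 ** (-eps)
    return 1.0 / (1.0 - r) ** 2 - 1.0

print(dyadic_exponent(Fraction(1, 10)))          # the exponent equals -eps
print(series_partial(0.5, 400), series_limit(0.5))
```

The smaller ε is, the larger the sum of the series, but it remains finite for every fixed ε > 0.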

9. Conclusions

We have shown: Elementary estimates of the Chebyshev function lead to

U (N) = O (N^{2} log N),

where

U (N) = \sum_{k = 1}^{N} k A {(N / k)}^{2}

and

A (x) = ψ (x) - ⌊ x ⌋ .

From this we derive the core estimate

\sum_{m = 1}^{N} \frac{A {(m)}^{2}}{m} = O (N {log}^{2} N) .

Using only this core estimate and the Cauchy-Schwarz inequality, we obtain

\int_{2}^{X} {(ψ (t) - t)}^{2} d t = O (X^{2} {log}^{2} X) .

Consequently,

\int_{1}^{\infty} \frac{| ψ (x) - x |}{x^{\frac{3}{2} + ε}} d x < \infty

holds, thus the integral

\int_{1}^{\infty} \frac{ψ (x) - x}{x^{\frac{3}{2} + ε}} d x

converges absolutely for every

ε > 0 .

All arguments are purely elementary, avoiding complex analysis and unproven hypotheses. The estimate

O (X^{2} {log}^{2} X)

is unconditional and best possible with elementary methods. Since we have proven the integral

\int_{1}^{\infty} \frac{ψ (x) - x}{x^{\frac{3}{2} + ε}} d x

converges absolutely for every

ε > 0,

which implies that

\int_{1}^{\infty} \frac{ψ (x) - x}{x^{\frac{3}{2} + ε}} d x

converges conditionally for every

ε > 0,

whereas the integral

\int_{1}^{\infty} \frac{ψ (x) - x}{x^{\frac{3}{2} + ε}} d x

converges conditionally for every

ε > 0

\Leftrightarrow RH,

so then the Riemann hypothesis is true.

Appendix A. A Theorem on Integral Convergence, Pointwise Bound and Analyticity

Theorem A1.

Let

ψ (x) = \sum_{n \leq x} Λ (n)

be the Chebyshev function. Define

a_{n} = Λ (n) - 1, A (x) = \sum_{n \leq x} a_{n} = ψ (x) - ⌊ x ⌋ .

For every

ε > 0

define

I_{abs} (ε) = \int_{1}^{\infty} \frac{| ψ (x) - x |}{x^{\frac{3}{2} + ε}} d x, I_{cond} (ε) = \int_{1}^{\infty} \frac{ψ (x) - x}{x^{\frac{3}{2} + ε}} d x .

Note that

ψ (x) - x = A (x) - {x}

where

{x} = x - ⌊ x ⌋

is the fractional part, bounded by

1 .

Hence the convergence of

I_{cond} (ε)

is equivalent to the convergence of

\int_{1}^{\infty} A (x) x^{- \frac{3}{2} - ε} d x

(since the integral of

{x} x^{- \frac{3}{2} - ε}

converges absolutely). Therefore we may work with

A (x)

instead of

ψ (x) - x .

Then the following statements are equivalent:

(1). Absolute convergence:

I_{abs} (ε) < \infty

for all

ε > 0 .

(2). Conditional convergence:

I_{cond} (ε)

converges for all

ε > 0 .

(3). Pointwise o-bound:

| ψ (x) - x | = o (x^{1 / 2 + ε})

for all

ε > 0

(hence also

O (x^{1 / 2 + ε})

).

(4). Analyticity: The functions

f (s) = \int_{1}^{\infty} \frac{| ψ (x) - x |}{x^{s + 1}} d x, g (s) = \int_{1}^{\infty} \frac{ψ (x) - x}{x^{s + 1}} d x

are analytic in the half-plane

Re (s) > \frac{1}{2} .

The details of the proof are presented below, consisting of two parts: A.1. Local bounded variation estimate, A.2. Proof of the equivalences.

Appendix A.1. Local Bounded Variation Estimate

Lemma A1.

There exists an absolute constant

C > 0

such that for all

x \geq 2

and all h with

1 \leq h \leq x,

| ψ (x + h) - ψ (x) | \leq C h log x .

(A1)

Proof.

The interval

(x, x + h]

contains at most

⌊ h ⌋ + 1 \leq h + 1

integers. For any integer n in this interval, if

n = p^{k}

is a prime power, then

Λ (n) = log p \leq log n \leq log (x + h);

otherwise

Λ (n) = 0,

where we have

ψ (x + h) - ψ (x) = \sum_{x < n \leq x + h} Λ (n) .

Hence

| ψ (x + h) - ψ (x) | \leq (h + 1) log (x + h) .

Because

h \leq x,

we have

x + h \leq 2 x

and

log (x + h) \leq log (2 x) = log 2 + log x .

For

h \geq 1

we also have

h + 1 \leq 2 h .

Thus

| ψ (x + h) - ψ (x) | \leq 2 h (log 2 + log x) .

Since

x \geq 2

gives

log 2 \leq log x,

the right-hand side is at most

4 h log x,

so the choice

C = 4

gives the desired inequality. □
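Inequality (A1) can be tested on a finite range of integer arguments. The sketch below is an illustration only: the helper names are hypothetical, the check is restricted to integers x and h with 2 ≤ x ≤ limit and 1 ≤ h ≤ x, and the constant C = 4 is the value assumed for this sketch.

```python
import math

def psi_table(limit):
    """Return psi with psi[n] = ψ(n) for 0 <= n <= limit, via a sieve for Λ."""
    lam = [0.0] * (limit + 1)
    composite = [False] * (limit + 1)
    for p in range(2, limit + 1):
        if not composite[p]:
            for q in range(p * p, limit + 1, p):
                composite[q] = True
            power = p
            while power <= limit:          # Λ(p^k) = log p
                lam[power] = math.log(p)
                power *= p
    psi = [0.0] * (limit + 1)
    for n in range(1, limit + 1):
        psi[n] = psi[n - 1] + lam[n]
    return psi

def check_increment_bound(limit, C=4.0):
    """Check ψ(x + h) - ψ(x) <= C * h * log x for integers 2 <= x <= limit, 1 <= h <= x."""
    psi = psi_table(2 * limit)
    for x in range(2, limit + 1):
        for h in range(1, x + 1):
            # A tiny tolerance guards against floating-point rounding.
            if psi[x + h] - psi[x] > C * h * math.log(x) + 1e-9:
                return False
    return True

print(check_increment_bound(200))
```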

Appendix A.2. Proof of the Equivalences

We prove the chain (1) ⇒ (3) ⇒ (1) and (2) ⇒ (3) ⇒ (2). Furthermore, (1) ⇒ (4), and (4) ⇒ (1). All of these form an equivalence chain.

Proof of (1) ⇒ (3). Assume $I_{abs}(\varepsilon) < \infty$ for every $\varepsilon > 0$, and fix $\varepsilon > 0$. If the $o$-bound were false, then there would exist $\delta > 0$ and a sequence $x_{n} \to \infty$ such that

$$|\psi(x_{n}) - x_{n}| \geq \delta x_{n}^{1/2+\varepsilon}. \tag{A2}$$

Set $h_{n} = x_{n}^{1/2}$. For sufficiently large $n$ we have $1 \leq h_{n} \leq x_{n}$, so (A1) applies. Since $\psi$ is nondecreasing, for any $t \in [x_{n}, x_{n}+h_{n}]$,

$$|\psi(t) - \psi(x_{n})| \leq \psi(x_{n}+h_{n}) - \psi(x_{n}) \leq C h_{n} \log x_{n}. \tag{A3}$$

Then, by the triangle inequality,

$$|\psi(t) - t| \geq |\psi(x_{n}) - x_{n}| - |\psi(t) - \psi(x_{n})| - |t - x_{n}| \geq \delta x_{n}^{1/2+\varepsilon} - C x_{n}^{1/2}\log x_{n} - x_{n}^{1/2}.$$

Since $x_{n}^{\varepsilon}/\log x_{n} \to \infty$, for large $n$ we have $\delta x_{n}^{\varepsilon} \geq 2(C\log x_{n} + 1)$, whence

$$|\psi(t) - t| \geq \frac{\delta}{2} x_{n}^{1/2+\varepsilon} \qquad (t \in [x_{n}, x_{n}+h_{n}]). \tag{A4}$$

Now estimate the integral over $[x_{n}, x_{n}+h_{n}]$. Since $t \leq x_{n}+h_{n} \leq 2x_{n}$ (as $h_{n} \leq x_{n}$), we have

$$t^{\frac{3}{2}+\varepsilon} \leq (2x_{n})^{\frac{3}{2}+\varepsilon} = 2^{3/2+\varepsilon} x_{n}^{3/2+\varepsilon}.$$

Using (A4) we obtain

$$\frac{|\psi(t)-t|}{t^{\frac{3}{2}+\varepsilon}} \geq \frac{\delta}{2^{\frac{5}{2}+\varepsilon}} x_{n}^{-1}.$$

Hence

$$\int_{x_{n}}^{x_{n}+h_{n}} \frac{|\psi(t)-t|}{t^{3/2+\varepsilon}}\,dt \geq \frac{\delta}{2^{5/2+\varepsilon}} x_{n}^{-1} \cdot h_{n} = \frac{\delta}{2^{5/2+\varepsilon}} x_{n}^{-1/2}. \tag{A5}$$
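The arithmetic leading from (A4) to the integrand bound can be checked on sample points. The sketch below is illustrative only: the function name `integrand_lower_bound_holds` is hypothetical, the lower bound (A4) is taken as a hypothesis rather than computed from $\psi$, and the values of $x$, $\delta$ and $\varepsilon$ are arbitrary test inputs.

```python
def integrand_lower_bound_holds(x, delta, eps, samples=1000):
    """Check (delta/2) x^{1/2+eps} / t^{3/2+eps} >= delta / 2^{5/2+eps} / x on [x, x + sqrt(x)].

    The numerator (delta/2) x^{1/2+eps} is the assumed lower bound (A4) for |psi(t) - t|.
    Requires x > 1 so that sqrt(x) < x and hence t < 2x.
    """
    h = x ** 0.5
    floor_value = delta / 2 ** (2.5 + eps) / x    # right-hand side of the integrand bound
    assumed = (delta / 2) * x ** (0.5 + eps)      # hypothesis (A4)
    for i in range(samples + 1):
        t = x + h * i / samples                   # sample point in [x, x + h]
        if assumed / t ** (1.5 + eps) < floor_value:
            return False
    return True

if __name__ == "__main__":
    print(integrand_lower_bound_holds(100.0, 0.5, 0.1))
```

Multiplying the pointwise bound by the interval length $h_{n} = x_{n}^{1/2}$ then gives the right-hand side of (A5).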

Choose a subsequence (still denoted by $x_{n}$) such that the intervals $[x_{n}, x_{n}+h_{n}]$ are pairwise disjoint and satisfy $x_{n} \leq n^{2}$ (this is always possible by thinning the sequence if necessary). Then

$$\sum_{n} x_{n}^{-1/2} \geq \sum_{n} \frac{1}{n} = \infty.$$

Because the intervals are disjoint, (A5) gives

$$I_{abs}(\varepsilon) \geq \frac{\delta}{2^{5/2+\varepsilon}} \sum_{n} x_{n}^{-1/2} = \infty,$$

contradicting the convergence of $I_{abs}(\varepsilon)$. Hence $|\psi(x)-x| = o(x^{\frac{1}{2}+\varepsilon})$.

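Proof of (3) ⇒ (1). Suppose $|\psi(x)-x| = o(x^{\frac{1}{2}+\varepsilon})$ for all $\varepsilon > 0$, and fix $\varepsilon > 0$. Applying the $o$-bound with $\varepsilon/2$ in place of $\varepsilon$, there exists $X_{0} \geq 1$ such that for all $x \geq X_{0}$,

$$|\psi(x)-x| \leq x^{\frac{1}{2}+\frac{\varepsilon}{2}}.$$

Thus, for $x \geq X_{0}$,

$$\frac{|\psi(x)-x|}{x^{\frac{3}{2}+\varepsilon}} \leq x^{-1-\frac{\varepsilon}{2}},$$

which is integrable on $[X_{0}, \infty)$, with $\int_{X_{0}}^{\infty} x^{-1-\frac{\varepsilon}{2}}\,dx = \frac{2}{\varepsilon} X_{0}^{-\varepsilon/2}$. On $[1, X_{0}]$ the integrand is bounded, so its integral there is finite. Hence $I_{abs}(\varepsilon) < \infty$.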

Proof of (2) ⇒ (3). Assume that $I_{cond}(\varepsilon)$ converges conditionally for every $\varepsilon > 0$. Since $\psi(x) - x = A(x) - \{x\}$, where $A(x) = \psi(x) - \lfloor x \rfloor$ and $\{x\}$ is the fractional part of $x$, and the integral of $\{x\}\,x^{-\frac{3}{2}-\varepsilon}$ converges absolutely, the conditional convergence of $I_{cond}(\varepsilon)$ implies the convergence of

$$\int_{1}^{\infty} \frac{A(x)}{x^{\frac{3}{2}+\varepsilon}}\,dx.$$

Now write $A(x) = \sum_{n \leq x} a_{n}$. For $\operatorname{Re}(s) > 1$, integration by parts gives

$$\int_{1}^{\infty} A(x)\,x^{-s-1}\,dx = \frac{1}{s}\sum_{n=1}^{\infty} a_{n} n^{-s}. \tag{A6}$$
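Identity (A6) can be illustrated with a finitely supported coefficient sequence, for which both sides have closed forms. This is a sketch under that simplifying assumption (the paper's sequence is not finitely supported), and the function names `integral_side` and `series_side` are hypothetical.

```python
def integral_side(a, s):
    """Exact value of the integral of A(x) x^{-s-1} over [1, inf), for s > 0.

    Assumes a[n-1] = a_n for 1 <= n <= len(a) and a_n = 0 for n > len(a),
    so that A(x) is a step function that is constant on [len(a), inf).
    """
    M = len(a)
    total = 0.0
    partial = 0.0                                  # running value of A(n)
    for n in range(1, M + 1):
        partial += a[n - 1]
        if n < M:
            # A(x) = A(n) on [n, n+1); integral of x^{-s-1} there is (n^{-s} - (n+1)^{-s}) / s
            total += partial * (n ** -s - (n + 1) ** -s) / s
        else:
            # A(x) = A(M) on [M, inf); integral of x^{-s-1} there is M^{-s} / s
            total += partial * M ** -s / s
    return total

def series_side(a, s):
    """Value of (1/s) * sum of a_n n^{-s} for the same finite sequence."""
    return sum(a_n * n ** -s for n, a_n in enumerate(a, start=1)) / s

if __name__ == "__main__":
    coeffs = [1.0, -2.0, 0.5, 3.0, -1.0]           # arbitrary test coefficients
    print(integral_side(coeffs, 1.5), series_side(coeffs, 1.5))
```

For a finite sequence the two sides agree for every $s > 0$; the restriction $\operatorname{Re}(s) > 1$ in the text is what guarantees convergence when infinitely many $a_{n}$ are nonzero.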

The right-hand side is analytic for $\operatorname{Re}(s) > 1$. We now use the following standard result on Dirichlet series.

Lemma A2.

If a Dirichlet series $D(s) = \sum_{n=1}^{\infty} a_{n} n^{-s}$ converges at a point $s = \sigma_{0}$ with $\sigma_{0} > 0$, then its coefficient partial sums satisfy

$$A(N) := \sum_{n \leq N} a_{n} = O(N^{\sigma_{0}}).$$

A short proof of $A(N) = O(N^{\sigma_{0}})$ using Abel summation is as follows. Let

$$H(N) = \sum_{n=1}^{N} a_{n} n^{-\sigma_{0}}$$

denote the partial sums of the series at $s = \sigma_{0}$. Since the series converges, the sequence $H(N)$ is bounded, say $|H(N)| \leq M$. Then by partial summation, we have

$$A(N) = \sum_{n=1}^{N} a_{n} = \sum_{n=1}^{N} a_{n} n^{-\sigma_{0}} \cdot n^{\sigma_{0}} = H(N) N^{\sigma_{0}} - \sum_{k=1}^{N-1} H(k)\bigl((k+1)^{\sigma_{0}} - k^{\sigma_{0}}\bigr).$$

Since $\sigma_{0} > 0$, each difference $(k+1)^{\sigma_{0}} - k^{\sigma_{0}}$ is positive, and the sum telescopes:

$$\sum_{k=1}^{N-1} \bigl((k+1)^{\sigma_{0}} - k^{\sigma_{0}}\bigr) = N^{\sigma_{0}} - 1.$$

Therefore,

$$\Bigl|\sum_{k=1}^{N-1} H(k)\bigl((k+1)^{\sigma_{0}} - k^{\sigma_{0}}\bigr)\Bigr| \leq M(N^{\sigma_{0}} - 1) = O(N^{\sigma_{0}}).$$

Together with $|H(N) N^{\sigma_{0}}| \leq M N^{\sigma_{0}}$, this gives $A(N) = O(N^{\sigma_{0}})$.
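The partial summation identity used in this proof, and the resulting bound $|A(N)| \leq 2M N^{\sigma_{0}}$, can be verified on an arbitrary finite sequence. The sketch below is illustrative only; the function names and the test coefficients are hypothetical, and $M$ is taken as the maximum of $|H(k)|$ over the finite range rather than a bound for an infinite series.

```python
def abel_identity_sides(a, sigma):
    """Return (A(N), H(N) N^sigma - sum_{k<N} H(k)((k+1)^sigma - k^sigma)) for N = len(a)."""
    N = len(a)
    H = [0.0]                                      # H[k] = sum_{n<=k} a_n n^{-sigma}, H[0] = 0
    for n in range(1, N + 1):
        H.append(H[-1] + a[n - 1] * n ** -sigma)
    lhs = sum(a)                                   # A(N)
    rhs = H[N] * N ** sigma - sum(
        H[k] * ((k + 1) ** sigma - k ** sigma) for k in range(1, N)
    )
    return lhs, rhs

def partial_sum_bound_holds(a, sigma):
    """Check |A(N)| <= 2 M N^sigma for every prefix length N, with M = max |H(k)|."""
    H, A = 0.0, 0.0
    weighted, plain = [], []
    for n, a_n in enumerate(a, start=1):
        H += a_n * n ** -sigma
        A += a_n
        weighted.append(H)
        plain.append(A)
    M = max(abs(h) for h in weighted)
    # small tolerance absorbs floating-point rounding
    return all(abs(A_N) <= 2 * M * N ** sigma + 1e-9
               for N, A_N in enumerate(plain, start=1))

if __name__ == "__main__":
    coeffs = [1.0, -3.0, 2.5, 0.0, 4.0, -1.5, 2.0]  # arbitrary test coefficients
    print(abel_identity_sides(coeffs, 0.75), partial_sum_bound_holds(coeffs, 0.75))
```

The factor 2 comes from adding the bound $M N^{\sigma_{0}}$ for the boundary term to the bound $M(N^{\sigma_{0}} - 1)$ for the telescoped sum.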

That is to say, if the integral converges at a point $s = \sigma_{0}$ (which is equivalent to the convergence of the Dirichlet series at $s = \sigma_{0}$), then the partial sums of the coefficients satisfy

$$\sum_{n \leq N} a_{n} = O(N^{\sigma_{0}}). \tag{A7}$$

Here the convergence of the integral at $s = \frac{1}{2}+\varepsilon$ (for every $\varepsilon > 0$) implies that the Dirichlet series $\sum_{n=1}^{\infty} a_{n} n^{-s}$ converges at $s = \frac{1}{2}+\varepsilon$. Therefore (A7) holds with $\sigma_{0} = \frac{1}{2}+\varepsilon$, giving

$$\psi(N) - \lfloor N \rfloor = A(N) = O(N^{\frac{1}{2}+\varepsilon}).$$

Extending the bound to real $x$. Let $x \geq 2$ and set $N = \lfloor x \rfloor$. Since $\psi$ is constant on $[N, N+1)$, we have $\psi(x) = \psi(N)$. Hence

$$|\psi(x) - x| = |\psi(N) - x| \leq |\psi(N) - N| + |N - x| \leq C N^{\frac{1}{2}+\varepsilon} + 1.$$

Because $N \leq x$ and $1 = O(x^{\frac{1}{2}+\varepsilon})$, we conclude

$$|\psi(x) - x| = O(x^{\frac{1}{2}+\varepsilon}).$$

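Therefore $|\psi(x) - x| = O(x^{\frac{1}{2}+\varepsilon})$ for every $\varepsilon > 0$; that is, for every $\delta > 0$ there exist a constant $C_{\delta} > 0$ and a threshold $X_{\delta}$ such that for all $x \geq X_{\delta}$,

$$|\psi(x) - x| \leq C_{\delta}\, x^{\frac{1}{2}+\delta}.$$

This actually implies the stronger $o$-bound: given any $\varepsilon > 0$, choose $\delta = \varepsilon/2$; then

$$|\psi(x) - x| \leq C_{\delta}\, x^{\frac{1}{2}+\delta} = C_{\delta}\, x^{-\varepsilon/2} \cdot x^{\frac{1}{2}+\varepsilon} = o(x^{\frac{1}{2}+\varepsilon}).$$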

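Proof of (3) ⇒ (2). Suppose $|\psi(x)-x| = o(x^{\frac{1}{2}+\varepsilon})$ for all $\varepsilon > 0$, and fix $\varepsilon > 0$. Applying the $o$-bound with $\varepsilon/2$ in place of $\varepsilon$, there exists $X_{0} \geq 1$ such that for $x \geq X_{0}$,

$$|\psi(x)-x| \leq x^{\frac{1}{2}+\frac{\varepsilon}{2}}.$$

Hence, for $x \geq X_{0}$,

$$\left|\frac{\psi(x)-x}{x^{\frac{3}{2}+\varepsilon}}\right| \leq x^{-1-\frac{\varepsilon}{2}},$$

which is integrable on $[X_{0}, \infty)$, while on $[1, X_{0}]$ the integrand is bounded. Therefore the integral $I_{cond}(\varepsilon)$ converges absolutely, and hence converges, which is (2).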

Thus (1), (2), and (3) are all equivalent. The equivalence with the analyticity of $f(s)$ and $g(s)$ follows from standard properties of Dirichlet integrals: absolute convergence in a half-plane implies analyticity there, and analyticity of the function defined by the integral at a point requires the integral to converge at that point. That is, (4) is equivalent to (1) and (2).

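Proof of (1) ⇒ (4). Assume $I_{abs}(\varepsilon) < \infty$ for all $\varepsilon > 0$. Let $\sigma_{0} > 1/2$ be arbitrary and set $\varepsilon = (\sigma_{0} - \frac{1}{2})/2$. By (1) ⇒ (3), there exists a constant $C > 0$ such that $|\psi(x)-x| \leq C x^{\frac{1}{2}+\varepsilon}$ for all $x \geq 1$ (the $o$-bound covers large $x$, and $C$ can be enlarged to cover the remaining bounded range). For $\operatorname{Re}(s) \geq \sigma_{0}$, we have

$$\left|\frac{|\psi(x)-x|}{x^{s+1}}\right| \leq C x^{\frac{1}{2}+\varepsilon-\sigma_{0}-1} = C x^{-1-\varepsilon},$$

and $\int_{1}^{\infty} x^{-1-\varepsilon}\,dx < \infty$. Hence the integral defining $f(s)$ converges uniformly on the half-plane $\operatorname{Re}(s) \geq \sigma_{0}$. The integrand is analytic in $s$, so by the Weierstrass theorem for parameter-dependent integrals, $f(s)$ is analytic in $\operatorname{Re}(s) > \sigma_{0}$; since $\sigma_{0} > \frac{1}{2}$ was arbitrary, $f(s)$ is analytic in $\operatorname{Re}(s) > \frac{1}{2}$. The same estimate applies to $g(s)$ because $|g(s)| \leq f(\operatorname{Re}(s))$. Thus (4) holds.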
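The exponent arithmetic in this step is easy to confirm with exact rational numbers. The following minimal sketch (the function name `majorant_exponent` is hypothetical) checks that the choice $\varepsilon = (\sigma_{0} - \frac{1}{2})/2$ turns the exponent $\frac{1}{2}+\varepsilon-\sigma_{0}-1$ into $-1-\varepsilon$, which is below $-1$ and therefore gives an integrable majorant on $[1, \infty)$.

```python
from fractions import Fraction

def majorant_exponent(sigma0):
    """Return (1/2 + eps - sigma0 - 1, -1 - eps) for eps = (sigma0 - 1/2) / 2."""
    eps = (sigma0 - Fraction(1, 2)) / 2
    exponent = Fraction(1, 2) + eps - sigma0 - 1   # exponent of x in the majorant
    return exponent, -1 - eps

if __name__ == "__main__":
    for sigma0 in (Fraction(3, 5), Fraction(1), Fraction(7, 2)):
        print(sigma0, majorant_exponent(sigma0))
```

The identity holds because $\sigma_{0} - \frac{1}{2} = 2\varepsilon$, so $\frac{1}{2} - \sigma_{0} + \varepsilon = -\varepsilon$.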

Proof of (4) ⇒ (1). If $f(s)$ is analytic for $\operatorname{Re}(s) > 1/2$, then in particular for any $\varepsilon > 0$, the point $s = \frac{1}{2}+\varepsilon$ lies in the domain of analyticity, so the integral

$$f\!\left(\tfrac{1}{2}+\varepsilon\right) = \int_{1}^{\infty} |\psi(x)-x|\,x^{-\frac{3}{2}-\varepsilon}\,dx$$

converges, i.e., $I_{abs}(\varepsilon) < \infty$. The same argument applied to $g(s)$ shows that $I_{cond}(\varepsilon)$ converges, but (1) already follows directly from $f(s)$. Hence (1) and (4) are equivalent. Since (1), (2), and (3) are equivalent, (4) is also equivalent to (2) and to (3). Thus (1), (2), (3), and (4) are all equivalent. This completes the proof of Theorem A1. □

Appendix B.1. Remarks

For the Chebyshev function $\psi(x)$, the following are equivalent:

(1). The absolute convergence of the integral $\int_{1}^{\infty} |\psi(x)-x|\,x^{-\frac{3}{2}-\varepsilon}\,dx$ for all $\varepsilon > 0$;

(2). The conditional convergence of the integral $\int_{1}^{\infty} (\psi(x)-x)\,x^{-\frac{3}{2}-\varepsilon}\,dx$ for all $\varepsilon > 0$;

(3). The pointwise estimate $|\psi(x)-x| = o(x^{\frac{1}{2}+\varepsilon})$ for all $\varepsilon > 0$;

(4). The analyticity of the Dirichlet-type integrals $f(s)$ and $g(s)$ in $\operatorname{Re}(s) > 1/2$.

These equivalences rely on the local bounded variation of $\psi$ and standard Dirichlet series theory. They are included as a theoretical complement to the main paper, where we have directly proved the absolute convergence of the integral

$$\int_{1}^{\infty} (\psi(x)-x)\,x^{-\frac{3}{2}-\varepsilon}\,dx$$

via the mean-square estimate

$$\int_{2}^{X} (\psi(t)-t)^{2}\,dt = O(X^{2}\log^{2} X)$$

and a dyadic decomposition. In short, the main proof does not rely on this appendix; the appendix is only a theoretical complement that shows the equivalence of the integral convergence, the pointwise $o$-bound, and the analyticity of $f(s)$, as well as the consequence of the analyticity of $g(s)$. It provides additional insight into the relationships between these properties for the Chebyshev function.

References

von Koch, H. Sur la distribution des nombres premiers. Acta Math. 1901, 24, 159–182.
Ingham, A. E. The Distribution of Prime Numbers; Cambridge Tracts in Mathematics and Mathematical Physics, No. 30; Cambridge University Press: Cambridge, 1932. [Reprinted 1990, Cambridge Mathematical Library.]
Selberg, A. An elementary proof of the prime-number theorem. Ann. Math. 1949, 50(2), 305–313.
Erdős, P. On a new method in elementary number theory which leads to an elementary proof of the prime number theorem. Proc. Natl. Acad. Sci. USA 1949, 35(7), 374–384.
Hardy, G. H.; Littlewood, J. E.; Pólya, G. Inequalities, 2nd ed.; Cambridge University Press, 1952.
Edwards, H. M. Riemann’s Zeta Function; Academic Press: New York, 1974.
Apostol, T. M. Introduction to Analytic Number Theory; Springer-Verlag: New York; Heidelberg/Berlin, 1976.
Hardy, G. H.; Wright, E. M. An Introduction to the Theory of Numbers, 5th ed.; Posts & Telecom Press under licence from Oxford University Press: Beijing, 2007.
Tenenbaum, G. Introduction to Analytic and Probabilistic Number Theory; American Mathematical Society: Providence, RI, 2015.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permits free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
