Preprint
Article

This version is not peer-reviewed.

Physics as the Solution to an Optimization Problem on Entropy

Submitted: 16 July 2024

Posted: 17 July 2024


Abstract
We propose a novel approach to quantum theory construction that involves solving a maximization problem on the Shannon entropy of all possible measurements of a system, relative to its initial preparation. This maximization problem is additionally constrained by a phase condition that vanishes under measurements. Specifically, enforcing a vanishing U(1)-valued phase constraint leads to standard quantum mechanics, while a vanishing Spin^c(3,1)-valued phase constraint extends the theory to relativistic quantum mechanics and to quantum gravity. The latter scenario derives the metric tensor as an operator via a double-copy mechanism applied to the Dirac current. Significantly, this solution is consistent exclusively in 3+1-dimensions as all other dimensional configurations lead to fundamental obstructions. Finally, the solution uniquely incorporates the SU(3)xSU(2)xU(1) symmetries of the Standard Model. This framework seamlessly integrates fundamental concepts from quantum mechanics, relativistic quantum mechanics, quantum gravity, the dimensional specificity of spacetime, and particle physics gauge symmetries as the solution to a simple entropy optimization problem.

1. Introduction

The canonical formalism of quantum mechanics (QM) is based on five principal axioms[1,2]:
QM Axiom 1 of 5
State Space: Each physical system corresponds to a complex Hilbert space, with the system’s state represented by a ray in this space.
QM Axiom 2 of 5
Observables: Physical observables correspond to Hermitian operators within the Hilbert space.
QM Axiom 3 of 5
Dynamics: The time evolution of a quantum system is dictated by the Schrödinger equation, where the Hamiltonian operator signifies the system’s total energy.
QM Axiom 4 of 5
Measurement: The act of measuring an observable results in the system’s transition to an eigenstate of the associated operator, with the measurement value being one of the eigenvalues.
QM Axiom 5 of 5
Probability Interpretation: The likelihood of a specific measurement outcome is determined by the squared magnitude of the state vector’s projection onto the relevant eigenstate.
Contrastingly, statistical mechanics (SM), the other statistical pillar of physics, derives its probability measure through entropy maximization, constrained by the following expression:
SM Constraint 1 of 1:
Average Energy Constraint: The average of the energy measurements of a system at thermodynamic equilibrium converges to a specific value $\overline{E}$:
$$\overline{E} = \sum_i \rho_i E_i$$
To maximize entropy while satisfying this constraint, the theory uses a Lagrange multiplier approach.
Definition 1 
(Fundamental Lagrange Multiplier Equation of SM).
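$$\mathcal{L} = \underbrace{-k_B \sum_i \rho_i \ln \rho_i}_{\text{Boltzmann entropy}} + \underbrace{\lambda \left( 1 - \sum_i \rho_i \right)}_{\text{Normalization Constraint}} + \underbrace{\beta \left( \overline{E} - \sum_i \rho_i E_i \right)}_{\text{Average Energy Constraint}}$$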
where λ and β are the Lagrange multipliers.
Theorem 1 
(Gibbs Measure). The solution to the Lagrange multiplier equation of SM is the Gibbs measure.
$$\rho_i = \underbrace{\frac{1}{\sum_j \exp(-\beta E_j)}}_{\text{Microcanonical Ensemble}} \exp(-\beta E_i)$$
Proof. 
This is a well-known result by E. T. Jaynes [3,4]. As a convenience, we replicate the proof in Appendix A.    □
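The entropy-maximizing property of the Gibbs measure can be checked numerically. The following is a minimal sketch, not the proof of Appendix A: the three energy levels, the value of `beta`, and the perturbation direction are hypothetical choices made for illustration, and $k_B$ is set to 1.

```python
import math

def gibbs_measure(energies, beta):
    """Return rho_i = exp(-beta * E_i) / Z for a list of energy levels."""
    weights = [math.exp(-beta * e) for e in energies]
    z = sum(weights)  # partition function
    return [w / z for w in weights]

def entropy(rho):
    """Shannon entropy -sum rho ln rho (k_B set to 1 in this sketch)."""
    return -sum(r * math.log(r) for r in rho if r > 0)

energies = [0.0, 1.0, 2.0]   # hypothetical three-level system
beta = 0.7                   # hypothetical inverse temperature
rho = gibbs_measure(energies, beta)
mean_energy = sum(r * e for r, e in zip(rho, energies))

# Perturb along d = (1, -2, 1), which preserves both constraints:
# sum(d) = 0 keeps normalization and sum(d * E) = 0 keeps the mean energy.
eps = 1e-3
direction = (1.0, -2.0, 1.0)
perturbed = [r + eps * d for r, d in zip(rho, direction)]

print("entropy at Gibbs measure:", entropy(rho))
print("entropy after perturbing:", entropy(perturbed))
```

Any distribution that satisfies the same two constraints has a lower entropy than the Gibbs measure, which is what the comparison of the two printed values illustrates.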
As evident from E. T. Jaynes’ methodological innovation, SM relies on a single constraint related to the nature of the measurements under consideration, which allows the formulation of an optimization problem sufficient to derive the relevant probability measure. This is an exceptionally parsimonious formulation of a physical theory.
We propose a generalization of E. T. Jaynes’ approach to the realms of Quantum Mechanics (QM), relativistic Quantum Mechanics (RQM) and Quantum Gravity (QG). For each domain, we will introduce a single constraint related to measurements, formulate a corresponding entropy maximization problem, and present a main theorem that encapsulates the theory. This formulation reduces fundamental physics to its most parsimonious expression, deriving the core theories as optimal solutions to a well-defined entropy maximization problem.

1.1. Quantum Mechanics

To reformulate QM as the solution to an entropy maximization problem, we propose the following constraint:
QM Constraint 1 of 1
Vanishing Complex-Phase: Quantum measurements admit a vanishing complex phase. The constraint is:
$$0 = \operatorname{tr} \sum_i \rho_i \begin{bmatrix} 0 & -E_i \\ E_i & 0 \end{bmatrix}$$
Here, the matrix representation engenders the complex phase, and the trace will cause it to vanish under measurement.
It is associated with the following equation:
Definition 2 
(Fundamental Lagrange Multiplier Equation of QM).
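$$\mathcal{L} = \underbrace{-\sum_i \rho_i \ln \frac{\rho_i}{p_i}}_{\text{Relative Shannon Entropy}} + \underbrace{\lambda \left( 1 - \sum_i \rho_i \right)}_{\text{Normalization Constraint}} + \underbrace{\tau \left( -\operatorname{tr} \sum_i \rho_i \begin{bmatrix} 0 & -E_i \\ E_i & 0 \end{bmatrix} \right)}_{\text{Vanishing Complex-Phase}}$$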
where λ and τ are the Lagrange multipliers.
The relative Shannon entropy[5,6] is utilized because we are solving for the least biased theory that connects an initial preparation p to its final measurement ρ .
Theorem 2. 
The least biased probability measure that connects an initial preparation p to its final measurement ρ, under the constraint of the vanishing complex-phase, is:
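$$\rho_i = \underbrace{\frac{1}{\sum_j p_j \left| \exp(-i t E_j / \hbar) \right|^2}}_{\text{Unitarily Invariant Ensemble}} \underbrace{\left| \exp(-i t E_i / \hbar) \right|^2}_{\text{Born Rule}} \underbrace{p_i}_{\text{Initial Preparation}}$$
where we have defined $\tau = t / \hbar$ (analogous to $\beta = 1 / (k_B T)$ in SM).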
The proof of this theorem will be presented in the results section. We will show that this solution entails the five axioms of QM, which are now promoted to theorems, yielding a parsimonious formulation of QM.

1.2. Relativistic Quantum Mechanics

Before we can discuss RQM, we first need to introduce some notation. Let $\mathbf{u} = a + \mathbf{x} + \mathbf{f} + \mathbf{v} + \mathbf{b}$, where $a$ is a scalar, $\mathbf{x}$ is a vector, $\mathbf{f}$ is a bivector, $\mathbf{v}$ is a pseudo-vector and $\mathbf{b}$ is a pseudo-scalar, be a multivector of the geometric algebra $\mathrm{GA}(3,1)$, and let $\mathbf{M}$ be its matrix representation. Then, the fundamental constraint is:
QG Constraint 1 of 1
Vanishing Relativistic Phase: Our formulation of RQM is based on a vanishing phase spanning the $\mathrm{Spin}^c(3,1)$ group. The constraint is:
$$0 = \operatorname{tr} \frac{1}{2} \sum_i \rho_i \mathbf{M}_i$$
where $\mathbf{M}_i$ is the matrix representation of the multivector $\mathbf{u}_i = \mathbf{f}_i + \mathbf{b}_i$ of $\mathrm{GA}(3,1)$. Using the real Majorana representation of the gamma matrices, the representation is as follows:
$$\mathbf{M} = \begin{bmatrix} f_{02} & b - f_{13} & -f_{01} + f_{12} & f_{03} + f_{23} \\ -b + f_{13} & f_{02} & f_{03} + f_{23} & f_{01} - f_{12} \\ -f_{01} - f_{12} & f_{03} - f_{23} & -f_{02} & -b - f_{13} \\ f_{03} - f_{23} & f_{01} + f_{12} & b + f_{13} & -f_{02} \end{bmatrix}$$
$$\cong f_{01} \gamma_0 \gamma_1 + f_{02} \gamma_0 \gamma_2 + f_{03} \gamma_0 \gamma_3 + f_{12} \gamma_1 \gamma_2 + f_{13} \gamma_1 \gamma_3 + f_{23} \gamma_2 \gamma_3 + b \gamma_0 \gamma_1 \gamma_2 \gamma_3$$
Similarly to the QM case, here the matrix representation engenders a $\mathrm{Spin}^c(3,1)$-phase and the trace will cause it to vanish under measurement.
The Lagrange multiplier equation is as follows:
Definition 3 
(Fundamental Lagrange Multiplier Equation of RQM).
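$$\mathcal{L} = \underbrace{-\sum_i \rho_i \ln \frac{\rho_i}{p_i}}_{\text{Relative Shannon Entropy}} + \underbrace{\lambda \left( 1 - \sum_i \rho_i \right)}_{\text{Normalization Constraint}} + \underbrace{\zeta \left( -\operatorname{tr} \frac{1}{2} \sum_i \rho_i \mathbf{M}_i \right)}_{\text{Vanishing } \mathrm{Spin}^c(3,1) \text{ Phase}}$$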
where λ and ζ are the Lagrange multipliers.
Theorem 3. 
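The least biased probability measure that connects an initial preparation $p_i$ to its final measurement $\rho_i$, under the constraint of the vanishing relativistic phase, is:
$$\rho_i = \underbrace{\frac{1}{\sum_j p_j \det \exp \left( -\zeta \frac{1}{2} \mathbf{M}_j \right)}}_{\mathrm{Spin}^c(3,1) \text{ Invariant Ensemble}} \underbrace{\det \exp \left( -\zeta \frac{1}{2} \mathbf{M}_i \right)}_{\mathrm{Spin}^c(3,1) \text{ Born Rule}} \underbrace{p_i}_{\text{Initial Preparation}}$$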
The proof of this theorem is presented in the results section.
In the results section, we demonstrate that the solution to this optimization problem provides a foundation for a quantum mechanical theory in 3+1-dimensional spacetime. The parameter $\zeta$ emerges as the generator of boosts, rotations, and phase transformations. This single parameter accounts for all change-of-basis transformations an observer can perform in spacetime before measurement. Furthermore, we show that the Dirac current resulting from this formulation exhibits invariance under $\mathrm{SU}(3) \times \mathrm{SU}(2) \times \mathrm{U}(1)$ gauge symmetries, aligning with the fundamental symmetries of the Standard Model.

1.3. Quantum Gravity

Our approach to RQM extends naturally to QG. While RQM utilizes only a portion of the solution space, QG leverages its full generality. The key innovation is the introduction of a double-copy mechanism applied to the Dirac current: the probability measure multiplies two Dirac currents to obtain a metric tensor expectation value. This mechanism allows us to construct the metric tensor as an observable from basis vectors, thereby establishing a direct link between the probability measure, metric measurements, and the geometric structure of spacetime.

1.4. Dimensional Obstructions

We conclude the results section with a series of theorems demonstrating that, except for SM (no vanishing phase) and QM (vanishing $\mathrm{U}(1)$ phase), the entropy maximization technique yields a solution only in 3+1-dimensional spacetime (vanishing $\mathrm{Spin}^c(3,1)$ phase). In all other dimensional configurations, various obstructions are encountered. These findings suggest an intriguing connection between the entropy maximization approach and the specific dimensionality of our universe, the implications of which are discussed.

2. Results

2.1. Quantum Mechanics

In statistical mechanics, the founding observation is that energy measurements of a thermally equilibrated system tend towards an average value. Comparatively, in QM, the founding observation involves the interplay between the systematic elimination of complex phases in measurement outcomes and the presence of interference effects in repeated measurement outcomes. To represent this observation, we introduce the Vanishing $\mathrm{U}(1)$-Phase Anti-Constraint:
$$0 = \operatorname{tr} \sum_i \rho_i \begin{bmatrix} 0 & -E_i \\ E_i & 0 \end{bmatrix}$$
where the $E_i$ are scalar-valued. The usage of the matrix generates a $\mathrm{U}(1)$ phase, and the trace causes it to vanish under measurements.
At first glance, this expression may seem to reduce to a tautology equating zero with zero, suggesting it imposes no restriction on energy measurements. However, this appearance is deceptive. Unlike a conventional constraint that limits the solution space, this expression serves as a formal device to expand it, allowing for the incorporation of complex phases into the probability measure. The expression’s role in broadening, rather than restricting, the solution space leads to its designation as an “anti-constraint.”
In general, the use of anti-constraints expands classical probability measures into larger domains, such as quantum probabilities.
Its significance will become evident upon the completion of the optimization problem. For the moment, this expression can be conceptualized as an ansatz that, when incorporated as an anti-constraint within an entropy-maximization problem, resolves into the axioms of quantum mechanics.
Our next procedural step involves solving the corresponding Lagrange multiplier equation, mirroring the methodology employed in statistical mechanics by E. T. Jaynes. We utilize the relative Shannon entropy because we wish to solve for the least biased probability measure that connects an initial preparation $p_i$ to its final measurement $\rho_i$. For that, we deploy the following Lagrange multiplier equation:
$$\mathcal{L} = \underbrace{-\sum_i \rho_i \ln \frac{\rho_i}{p_i}}_{\text{Relative Shannon Entropy}} + \underbrace{\lambda \left( 1 - \sum_i \rho_i \right)}_{\text{Normalization Constraint}} + \underbrace{\tau \left( -\operatorname{tr} \sum_i \rho_i \begin{bmatrix} 0 & -E_i \\ E_i & 0 \end{bmatrix} \right)}_{\text{Vanishing } \mathrm{U}(1)\text{-Phase}}$$
where $\lambda$ and $\tau$ are the Lagrange multipliers.
We solve the maximization problem as follows:
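$$\begin{aligned}
0 &= \frac{\partial \mathcal{L}[\rho_1, \dots, \rho_n]}{\partial \rho_i} \\
&= -\ln \frac{\rho_i}{p_i} - 1 - \lambda - \tau \operatorname{tr} \begin{bmatrix} 0 & -E_i \\ E_i & 0 \end{bmatrix} \\
0 &= \ln \frac{\rho_i}{p_i} + 1 + \lambda + \tau \operatorname{tr} \begin{bmatrix} 0 & -E_i \\ E_i & 0 \end{bmatrix} \\
\implies \ln \frac{\rho_i}{p_i} &= -1 - \lambda - \tau \operatorname{tr} \begin{bmatrix} 0 & -E_i \\ E_i & 0 \end{bmatrix} \\
\implies \rho_i &= p_i \exp(-1 - \lambda) \exp \left( -\tau \operatorname{tr} \begin{bmatrix} 0 & -E_i \\ E_i & 0 \end{bmatrix} \right) \\
&= \frac{1}{Z(\tau)} p_i \exp \left( -\tau \operatorname{tr} \begin{bmatrix} 0 & -E_i \\ E_i & 0 \end{bmatrix} \right)
\end{aligned}$$
The partition function is obtained as follows:
$$\begin{aligned}
1 &= \sum_i p_i \exp(-1 - \lambda) \exp \left( -\tau \operatorname{tr} \begin{bmatrix} 0 & -E_i \\ E_i & 0 \end{bmatrix} \right) \\
\implies \left( \exp(-1 - \lambda) \right)^{-1} &= \sum_i p_i \exp \left( -\tau \operatorname{tr} \begin{bmatrix} 0 & -E_i \\ E_i & 0 \end{bmatrix} \right) \\
Z(\tau) &:= \sum_i p_i \exp \left( -\tau \operatorname{tr} \begin{bmatrix} 0 & -E_i \\ E_i & 0 \end{bmatrix} \right)
\end{aligned}$$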
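Finally, the least biased probability measure that connects an initial preparation $p_i$ to its final measurement $\rho_i$, under the constraint of the vanishing $\mathrm{U}(1)$ phase, is:
$$\rho_i = \frac{1}{\sum_j p_j \exp \left( -\tau \operatorname{tr} \begin{bmatrix} 0 & -E_j \\ E_j & 0 \end{bmatrix} \right)} \exp \left( -\tau \operatorname{tr} \begin{bmatrix} 0 & -E_i \\ E_i & 0 \end{bmatrix} \right) p_i$$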
Though initially unfamiliar, this form effectively establishes a comprehensive formulation of quantum mechanics, as we will demonstrate.
Upon examination, we find that phase elimination is manifestly evident in the probability measure: since the trace evaluates to zero, the probability measure simplifies to classical probabilities, aligning precisely with the Born rule’s exclusion of complex phases:
$$\rho_i = \frac{p_i}{\sum_j p_j}$$
However, the significance of this phase elimination extends beyond this mere simplicity. As we will soon see, the partition function Z gains unitary invariance, allowing for the emergence of interference patterns and other quantum characteristics under appropriate basis changes.
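The collapse of the measure to classical probabilities can be illustrated with a short sketch. It assumes that the imaginary quantity $i E$ is carried by the real matrix $\begin{bmatrix} 0 & -E \\ E & 0 \end{bmatrix}$; the function names, the prior, and the energy values are hypothetical.

```python
import math

def phase_matrix(energy):
    """2x2 real matrix [[0, -E], [E, 0]] standing in for the imaginary number i*E."""
    return [[0.0, -energy], [energy, 0.0]]

def trace(m):
    """Trace of a 2x2 matrix."""
    return m[0][0] + m[1][1]

def measure(prior, energies, tau):
    """rho_i = p_i exp(-tau tr M_i) / sum_j p_j exp(-tau tr M_j)."""
    weights = [p * math.exp(-tau * trace(phase_matrix(e)))
               for p, e in zip(prior, energies)]
    z = sum(weights)  # partition function Z(tau)
    return [w / z for w in weights]

prior = [0.2, 0.3, 0.5]      # hypothetical initial preparation p_i
energies = [1.0, 2.5, 4.0]   # hypothetical energy values E_i
rho = measure(prior, energies, tau=0.9)
print(rho)  # the trace vanishes, so rho reproduces the prior
```

Because every trace is zero, the value of `tau` has no effect on the classical probabilities; the phase only becomes visible once the measure is rewritten through the determinant, as done next.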
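We will begin by aligning our results with the conventional quantum mechanical notation. As such, we transform the representation of complex numbers from $\begin{bmatrix} a & -b \\ b & a \end{bmatrix}$ to $a + i b$. For instance, the exponential of a complex matrix is:
$$\exp \begin{bmatrix} a & -b \\ b & a \end{bmatrix} = r \begin{bmatrix} \cos(b) & -\sin(b) \\ \sin(b) & \cos(b) \end{bmatrix}, \text{ where } r = \exp a$$
Then, we associate the exponential trace to the complex norm using $\exp \operatorname{tr} \mathbf{M} \equiv \det \exp \mathbf{M}$:
$$\begin{aligned}
\exp \operatorname{tr} \begin{bmatrix} a & -b \\ b & a \end{bmatrix} &= \det \exp \begin{bmatrix} a & -b \\ b & a \end{bmatrix} = r^2 \det \begin{bmatrix} \cos(b) & -\sin(b) \\ \sin(b) & \cos(b) \end{bmatrix}, \text{ where } r = \exp a \\
&= r^2 \left( \cos^2(b) + \sin^2(b) \right) \\
&= \left| r \left( \cos(b) + i \sin(b) \right) \right|^2 \\
&= \left| r \exp(i b) \right|^2
\end{aligned}$$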
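The identity $\det \exp \mathbf{M} = \exp \operatorname{tr} \mathbf{M}$ and its link to the squared complex modulus can be checked numerically. This is a sketch under stated assumptions: the matrix exponential is approximated by a truncated power series (the helper `mat_exp` is written here for illustration), and the values of `a` and `b` are arbitrary.

```python
import cmath
import math

def mat_mul(p, q):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(p[i][k] * q[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def mat_exp(m, terms=40):
    """Matrix exponential of a 2x2 matrix via a truncated power series."""
    result = [[1.0, 0.0], [0.0, 1.0]]
    term = [[1.0, 0.0], [0.0, 1.0]]
    for n in range(1, terms):
        term = mat_mul(term, m)                       # m^n / (n-1)!
        term = [[v / n for v in row] for row in term]  # m^n / n!
        result = [[result[i][j] + term[i][j] for j in range(2)]
                  for i in range(2)]
    return result

def det(m):
    """Determinant of a 2x2 matrix."""
    return m[0][0] * m[1][1] - m[0][1] * m[1][0]

a, b = 0.3, 1.1                 # hypothetical real and imaginary parts
m = [[a, -b], [b, a]]           # matrix form of the complex number a + i b
exp_m = mat_exp(m)
r = math.exp(a)

print("det exp M       :", det(exp_m))
print("exp tr M        :", math.exp(m[0][0] + m[1][1]))
print("|r exp(i b)|^2  :", abs(r * cmath.exp(1j * b)) ** 2)
```

All three printed values agree up to floating-point error, and the entries of `exp_m` match $r \cos(b)$ and $r \sin(b)$.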
Finally, substituting $\tau = t / \hbar$, analogously to $\beta = 1 / (k_B T)$, and applying the complex-norm representation to both the numerator and the denominator, consolidates the Born rule, normalization, and initial preparation into:
$$\rho_i = \underbrace{\frac{1}{\sum_j p_j \left| \exp(-i t E_j / \hbar) \right|^2}}_{\text{Unitarily Invariant Partition Function}} \underbrace{\left| \exp(-i t E_i / \hbar) \right|^2}_{\text{Born Rule}} \underbrace{p_i}_{\text{Initial Preparation}}$$
We are now in a position to explore the solution space.
The wavefunction is delineated by decomposing the complex norm into a complex number and its conjugate. It is then visualized as a vector within a complex n-dimensional Hilbert space. The partition function acts as the inner product. This relationship is articulated as follows:
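$$\sum_i p_i \left| \exp(-i t E_i / \hbar) \right|^2 = Z = \langle \psi | \psi \rangle$$
where
$$\begin{bmatrix} \psi_1(t) \\ \vdots \\ \psi_n(t) \end{bmatrix} = \begin{bmatrix} \exp(-i t E_1 / \hbar) & & \\ & \ddots & \\ & & \exp(-i t E_n / \hbar) \end{bmatrix} \begin{bmatrix} \psi_1(0) \\ \vdots \\ \psi_n(0) \end{bmatrix}$$
We clarify that $p_i$ represents the probability associated with the initial preparation of the wavefunction, where $p_i = \langle \psi_i(0) | \psi_i(0) \rangle$.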
We also note that Z is invariant under unitary transformations.
Let us now investigate how the axioms of quantum mechanics are recovered from this result:
  • The entropy maximization procedure inherently normalizes the vectors $|\psi\rangle$ with $1/Z = 1/\langle \psi | \psi \rangle$. This normalization links $|\psi\rangle$ to a unit vector in Hilbert space. Furthermore, as physical states are associated with the probability measure, and the probability is defined up to a phase, we conclude that physical states map to rays within Hilbert space. This demonstrates QM Axiom 1 of 5.
  • In $Z$, an observable must satisfy:
    $$\overline{O} = \sum_i p_i O_i \left| \exp(-i t E_i / \hbar) \right|^2$$
    Since $Z = \langle \psi | \psi \rangle$, any self-adjoint operator satisfying the condition $\langle \mathbf{O} \psi | \phi \rangle = \langle \psi | \mathbf{O} \phi \rangle$ will satisfy the above equation, simply because $\langle \mathbf{O} \rangle = \langle \psi | \mathbf{O} | \psi \rangle$. This demonstrates QM Axiom 2 of 5.
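  • Upon transforming Equation 32 out of its eigenbasis through unitary operations, we find that the energy, $E_i$, typically transforms in the manner of a Hamiltonian operator:
    $$|\psi(t)\rangle = \exp(-i t \mathbf{H} / \hbar) |\psi(0)\rangle$$
    The system’s dynamics emerge from differentiating the solution with respect to the Lagrange multiplier. This is manifested as:
    $$\begin{aligned}
    \frac{\partial}{\partial t} |\psi(t)\rangle &= \frac{\partial}{\partial t} \left( \exp(-i t \mathbf{H} / \hbar) |\psi(0)\rangle \right) \\
    &= -i \mathbf{H} / \hbar \, \exp(-i t \mathbf{H} / \hbar) |\psi(0)\rangle \\
    &= -i \mathbf{H} / \hbar \, |\psi(t)\rangle \\
    \implies \mathbf{H} |\psi(t)\rangle &= i \hbar \frac{\partial}{\partial t} |\psi(t)\rangle
    \end{aligned}$$
    which is the Schrödinger equation. This demonstrates QM Axiom 3 of 5.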
  • From Equation 32 it follows that the possible microstates $E_i$ of the system correspond to specific eigenvalues of $\mathbf{H}$. An observation can thus be conceptualized as sampling from $\rho$, with the measured state being the occupied microstate $i$. Consequently, when a measurement occurs, the system invariably emerges in one of these microstates, which directly corresponds to an eigenstate of $\mathbf{H}$. Measured in the eigenbasis, the probability measure is:
    $$\rho_i(t) = \frac{1}{\langle \psi | \psi \rangle} (\psi_i(t))^\dagger \psi_i(t).$$
    In scenarios where the probability measure $\rho_i(\tau)$ is expressed in a basis other than its eigenbasis, the probability $P(\lambda_i)$ of obtaining the eigenvalue $\lambda_i$ is given as a projection onto an eigenstate:
    $$P(\lambda_i) = |\langle \lambda_i | \psi \rangle|^2$$
    Here, $|\langle \lambda_i | \psi \rangle|^2$ signifies the squared magnitude of the amplitude of the state $|\psi\rangle$ when projected onto the eigenstate $|\lambda_i\rangle$. As this argument holds for any observable, this demonstrates QM Axiom 4 of 5.
  • Finally, since the probability measure (Equation 30) replicates the Born rule, QM Axiom 5 of 5 is also demonstrated.
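The recovered dynamics can be sketched numerically in the energy eigenbasis, where $\mathbf{H}$ is diagonal. The two-level preparation, the eigenvalues, and the choice $\hbar = 1$ are assumptions of this illustration; the sketch checks that $Z = \langle \psi | \psi \rangle$ and the Born probabilities do not change under the evolution, and that a finite-difference derivative satisfies the Schrödinger equation.

```python
import cmath

HBAR = 1.0  # natural units, an assumption of this sketch

def evolve(psi0, energies, t):
    """psi_i(t) = exp(-i t E_i / hbar) psi_i(0) in the energy eigenbasis."""
    return [cmath.exp(-1j * t * e / HBAR) * c for c, e in zip(psi0, energies)]

def norm_sq(psi):
    """Z = <psi|psi> = sum_i |psi_i|^2."""
    return sum(abs(c) ** 2 for c in psi)

def born(psi):
    """Born-rule probabilities |psi_i|^2 / <psi|psi>."""
    z = norm_sq(psi)
    return [abs(c) ** 2 / z for c in psi]

psi0 = [0.6, 0.8j]       # hypothetical two-level preparation
energies = [1.0, 3.0]    # hypothetical eigenvalues of H
t, dt = 0.4, 1e-6
psi_t = evolve(psi0, energies, t)

# Central finite difference approximating d/dt psi(t)
dpsi = [(p - m) / (2 * dt)
        for p, m in zip(evolve(psi0, energies, t + dt),
                        evolve(psi0, energies, t - dt))]

# Schrodinger residual i*hbar*dpsi/dt - H*psi, with H diagonal here
residual = [1j * HBAR * d - e * c for d, e, c in zip(dpsi, energies, psi_t)]

print("Z(0), Z(t):", norm_sq(psi0), norm_sq(psi_t))
print("Born probabilities at t:", born(psi_t))
print("largest residual:", max(abs(r) for r in residual))
```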
Revisiting quantum mechanics with this perspective offers a coherent and unified narrative. Specifically, the vanishing U(1) phase constraint (Equation 12) is sufficient to entail the foundations of quantum mechanics (Axiom 1, 2, 3, 4 and 5) through the principle of entropy maximization. Equation 12 becomes the formulation’s new singular foundation, and Axioms 1, 2, 3, 4, and 5 are now promoted to theorems.

2.2. RQM in 2D

In this section, we investigate RQM in 2D. Although all geometric configurations except 3+1D contain obstructions, which will be discussed later in this section, the 2D case provides a valuable starting point before addressing the more complex 3+1D case. In RQM 2D, the fundamental Lagrange Multiplier Equation is:
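$$\mathcal{L} = \underbrace{-\sum_i \rho_i \ln \frac{\rho_i}{p_i}}_{\text{Relative Shannon Entropy}} + \underbrace{\lambda \left( 1 - \sum_i \rho_i \right)}_{\text{Normalization Constraint}} + \underbrace{\theta \left( -\operatorname{tr} \frac{1}{2} \sum_i \rho_i \mathbf{M}_i \right)}_{\text{Vanishing Relativistic Phase}}$$
where $\lambda$ and $\theta$ are the Lagrange multipliers, and where $\mathbf{M}_i$ is the matrix representation of a multivector $\mathbf{b}_i$ of $\mathrm{GA}(2)$, where $\mathbf{b}_i$ is a pseudo-scalar. In general, a multivector $\mathbf{u} = a + \mathbf{x} + \mathbf{b}$ of $\mathrm{GA}(2)$, where $a$ is a scalar, $\mathbf{x}$ is a vector and $\mathbf{b}$ a pseudo-scalar, is represented as follows:
$$\begin{bmatrix} a + x & y - b \\ y + b & a - x \end{bmatrix} \cong a + x \sigma_x + y \sigma_y + b \sigma_x \sigma_y$$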
The basis elements are defined as:
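$$\sigma_x = \begin{bmatrix} 1 & 0 \\ 0 & -1 \end{bmatrix}, \quad \sigma_y = \begin{bmatrix} 0 & 1 \\ 1 & 0 \end{bmatrix}, \quad \sigma_x \sigma_y = \begin{bmatrix} 0 & -1 \\ 1 & 0 \end{bmatrix}$$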
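If we take $a \to 0$, $\mathbf{x} \to 0$, then $\mathbf{M}$ reduces as follows:
$$\mathbf{u} = a + \mathbf{x} + \mathbf{b} \, \Big|_{a \to 0, \, \mathbf{x} \to 0} = \mathbf{b} \implies \mathbf{M} = \begin{bmatrix} 0 & -b \\ b & 0 \end{bmatrix}$$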
The Lagrange multiplier equation can be solved as follows:
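$$\begin{aligned}
0 &= \frac{\partial \mathcal{L}[\rho_1, \dots, \rho_n]}{\partial \rho_i} \\
&= -\ln \frac{\rho_i}{p_i} - 1 - \lambda - \theta \operatorname{tr} \frac{1}{2} \begin{bmatrix} 0 & -b_i \\ b_i & 0 \end{bmatrix} \\
0 &= \ln \frac{\rho_i}{p_i} + 1 + \lambda + \theta \operatorname{tr} \frac{1}{2} \begin{bmatrix} 0 & -b_i \\ b_i & 0 \end{bmatrix} \\
\implies \ln \frac{\rho_i}{p_i} &= -1 - \lambda - \theta \operatorname{tr} \frac{1}{2} \begin{bmatrix} 0 & -b_i \\ b_i & 0 \end{bmatrix} \\
\implies \rho_i &= p_i \exp(-1 - \lambda) \exp \left( -\theta \operatorname{tr} \frac{1}{2} \begin{bmatrix} 0 & -b_i \\ b_i & 0 \end{bmatrix} \right) \\
&= \frac{1}{Z(\theta)} p_i \exp \left( -\theta \operatorname{tr} \frac{1}{2} \begin{bmatrix} 0 & -b_i \\ b_i & 0 \end{bmatrix} \right)
\end{aligned}$$
The partition function $Z(\theta)$, serving as a normalization constant, is determined as follows:
$$\begin{aligned}
1 &= \sum_i p_i \exp(-1 - \lambda) \exp \left( -\theta \operatorname{tr} \frac{1}{2} \begin{bmatrix} 0 & -b_i \\ b_i & 0 \end{bmatrix} \right) \\
\implies \left( \exp(-1 - \lambda) \right)^{-1} &= \sum_i p_i \exp \left( -\theta \operatorname{tr} \frac{1}{2} \begin{bmatrix} 0 & -b_i \\ b_i & 0 \end{bmatrix} \right) \\
Z(\theta) &:= \sum_i p_i \exp \left( -\theta \operatorname{tr} \frac{1}{2} \begin{bmatrix} 0 & -b_i \\ b_i & 0 \end{bmatrix} \right)
\end{aligned}$$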
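Consequently, the least biased probability measure that connects an initial preparation $p_i$ to a final measurement $\rho_i$, under the constraint of the vanishing relativistic phase in 2D, is:
$$\rho_i = \underbrace{\frac{1}{\sum_j p_j \det \exp \left( -\frac{1}{2} \theta \begin{bmatrix} 0 & -b_j \\ b_j & 0 \end{bmatrix} \right)}}_{\mathrm{Spin}(2) \text{ Invariant Ensemble}} \underbrace{\det \exp \left( -\frac{1}{2} \theta \begin{bmatrix} 0 & -b_i \\ b_i & 0 \end{bmatrix} \right)}_{\mathrm{Spin}(2) \text{ Born Rule}} \underbrace{p_i}_{\text{Initial Preparation}}$$
where $\det \exp \mathbf{M} = \exp \operatorname{tr} \mathbf{M}$.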
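In 2D, the Lagrange multiplier $\theta$ corresponds to an angle of rotation, and in 1+1D it would correspond to the rapidity $\zeta$:
$$\text{2D:} \quad \exp \theta \begin{bmatrix} 0 & -1 \\ 1 & 0 \end{bmatrix} = \begin{bmatrix} \cos \theta & -\sin \theta \\ \sin \theta & \cos \theta \end{bmatrix} \implies \theta \text{ is the angle of rotation}$$
$$\text{1+1D:} \quad \exp \zeta \begin{bmatrix} 0 & 1 \\ 1 & 0 \end{bmatrix} = \begin{bmatrix} \cosh \zeta & \sinh \zeta \\ \sinh \zeta & \cosh \zeta \end{bmatrix} \implies \zeta \text{ is the rapidity}$$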
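The reading of the multiplier as an angle or a rapidity rests on the fact that these exponentials compose additively. A minimal sketch, using the closed forms above with hypothetical parameter values, checks that two rotations (or two boosts) compose into one whose parameter is the sum, and that both families have unit determinant.

```python
import math

def rotation(theta):
    """exp(theta * [[0, -1], [1, 0]]): Euclidean 2D rotation by angle theta."""
    c, s = math.cos(theta), math.sin(theta)
    return [[c, -s], [s, c]]

def boost(zeta):
    """exp(zeta * [[0, 1], [1, 0]]): 1+1D Lorentz boost with rapidity zeta."""
    c, s = math.cosh(zeta), math.sinh(zeta)
    return [[c, s], [s, c]]

def mat_mul(p, q):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(p[i][k] * q[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def det(m):
    """Determinant of a 2x2 matrix."""
    return m[0][0] * m[1][1] - m[0][1] * m[1][0]

def close(p, q, tol=1e-12):
    """Entry-wise comparison of two 2x2 matrices."""
    return all(abs(p[i][j] - q[i][j]) < tol for i in range(2) for j in range(2))

# Hypothetical parameter values: angles 0.3 and 0.5, rapidities 0.2 and 0.7
print(close(mat_mul(rotation(0.3), rotation(0.5)), rotation(0.8)))
print(close(mat_mul(boost(0.2), boost(0.7)), boost(0.9)))
print(det(rotation(0.3)), det(boost(0.2)))
```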
The 2D solution may appear equivalent to the QM case because they are related by the isomorphism $\mathrm{Spin}(2) \cong \mathrm{SO}(2) \cong \mathrm{U}(1)$ under the replacement $\theta \to \tau$. However, isomorphic does not mean identical: in Spin(2) we gain extra structure related to a relativistic description, which is not available in the QM case.
To investigate the solution in more detail, we introduce the multivector conjugate, also known as the Clifford conjugate, which generalizes the concept of complex conjugation to multivectors.
Definition 4 
(Multivector conjugate (a.k.a. Clifford conjugate)). Let $\mathbf{u} = a + \mathbf{x} + \mathbf{b}$ be a multivector of the geometric algebra over the reals in two dimensions, $\mathrm{GA}(2)$. The multivector conjugate is defined as:
$$\mathbf{u}^\ddagger = a - \mathbf{x} - \mathbf{b}$$
The determinant of the matrix representation of a multivector can be expressed as a self-product:
Theorem 4 
(Determinant as a Multivector Self-Product).
$$\mathbf{u}^\ddagger \mathbf{u} = \det \mathbf{M}$$
Proof. 
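Let $\mathbf{u} = a + x \sigma_x + y \sigma_y + b \sigma_x \sigma_y$, and let $\mathbf{M}$ be its matrix representation $\begin{bmatrix} a + x & y - b \\ y + b & a - x \end{bmatrix}$. Then:
$$\begin{aligned}
1: \quad & \mathbf{u}^\ddagger \mathbf{u} \\
&= (a + x \sigma_x + y \sigma_y + b \sigma_x \sigma_y)^\ddagger (a + x \sigma_x + y \sigma_y + b \sigma_x \sigma_y) \\
&= (a - x \sigma_x - y \sigma_y - b \sigma_x \sigma_y)(a + x \sigma_x + y \sigma_y + b \sigma_x \sigma_y) \\
&= a^2 - x^2 - y^2 + b^2
\end{aligned}$$
$$\begin{aligned}
2: \quad & \det \mathbf{M} \\
&= \det \begin{bmatrix} a + x & y - b \\ y + b & a - x \end{bmatrix} \\
&= (a + x)(a - x) - (y - b)(y + b) \\
&= a^2 - x^2 - y^2 + b^2
\end{aligned}$$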
   □
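The theorem can also be checked numerically at the level of the matrix representation. In the sketch below, the conjugate is represented by flipping the signs of the vector and pseudo-scalar coefficients, following the definition of the multivector conjugate; the function names and the sample coefficients are hypothetical.

```python
def matrix_rep(a, x, y, b):
    """Matrix [[a + x, y - b], [y + b, a - x]] of a + x*sx + y*sy + b*sx*sy."""
    return [[a + x, y - b], [y + b, a - x]]

def conjugate_rep(a, x, y, b):
    """Matrix of the conjugate a - x*sx - y*sy - b*sx*sy."""
    return matrix_rep(a, -x, -y, -b)

def mat_mul(p, q):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(p[i][k] * q[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def det(m):
    """Determinant of a 2x2 matrix."""
    return m[0][0] * m[1][1] - m[0][1] * m[1][0]

a, x, y, b = 0.5, 1.2, -0.7, 2.0     # hypothetical coefficients
m = matrix_rep(a, x, y, b)
self_product = mat_mul(conjugate_rep(a, x, y, b), m)

print("conj(u) u as a matrix:", self_product)   # a multiple of the identity
print("det M                :", det(m))
print("a^2 - x^2 - y^2 + b^2:", a * a - x * x - y * y + b * b)
```

The product comes out as a scalar multiple of the identity matrix, and that scalar equals both the determinant and $a^2 - x^2 - y^2 + b^2$.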
Building upon the concept of the multivector conjugate, we introduce the multivector conjugate transpose, which serves as an extension of the Hermitian conjugate to the domain of multivectors.
Definition 5 
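(Multivector Conjugate Transpose). Let $\mathbf{V} \in (\mathrm{GA}(2))^n$:
$$\mathbf{V} = \begin{bmatrix} a_1 + \mathbf{x}_1 + \mathbf{b}_1 \\ \vdots \\ a_n + \mathbf{x}_n + \mathbf{b}_n \end{bmatrix}$$
The multivector conjugate transpose of $\mathbf{V}$ is defined as first taking the transpose and then the element-wise multivector conjugate:
$$\mathbf{V}^\ddagger = \begin{bmatrix} a_1 - \mathbf{x}_1 - \mathbf{b}_1 & \cdots & a_n - \mathbf{x}_n - \mathbf{b}_n \end{bmatrix}$$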
Definition 6 
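(Bilinear Form). Let $\mathbf{V}$ and $\mathbf{W}$ be two vectors valued in $\mathrm{GA}(2)$, with components $a_i + \mathbf{x}_i + \mathbf{b}_i$ and $a'_i + \mathbf{x}'_i + \mathbf{b}'_i$ respectively. We introduce the following bilinear form:
$$\langle \mathbf{V} | \mathbf{W} \rangle = (a_1 - \mathbf{x}_1 - \mathbf{b}_1)(a'_1 + \mathbf{x}'_1 + \mathbf{b}'_1) + \dots + (a_n - \mathbf{x}_n - \mathbf{b}_n)(a'_n + \mathbf{x}'_n + \mathbf{b}'_n)$$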
Theorem 5 
(Inner Product). Restricted to the even sub-algebra of GA ( 2 ) , the bilinear form is an inner product.
Proof. 
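$$\langle \mathbf{V} | \mathbf{W} \rangle \Big|_{\mathbf{x} \to 0} = (a_1 - \mathbf{b}_1)(a'_1 + \mathbf{b}'_1) + \dots + (a_n - \mathbf{b}_n)(a'_n + \mathbf{b}'_n)$$
where unprimed and primed coefficients denote the components of $\mathbf{V}$ and $\mathbf{W}$ respectively. This is isomorphic to the inner product of a complex Hilbert space, with the identification $i \cong \sigma_x \sigma_y$.    □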
Definition 7 
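(Spin(2)-valued Wavefunction).
$$\psi = \begin{bmatrix} e^{\frac{1}{2}(a_1 + \mathbf{b}_1)} \\ \vdots \\ e^{\frac{1}{2}(a_n + \mathbf{b}_n)} \end{bmatrix} = \begin{bmatrix} \sqrt{\rho_1} R_1 \\ \vdots \\ \sqrt{\rho_n} R_n \end{bmatrix}$$
where $\sqrt{\rho_i} = e^{\frac{1}{2} a_i}$ represents the square root of the probability and $R_i = e^{\frac{1}{2} \mathbf{b}_i}$ represents a rotor in 2D (or a boost in 1+1D).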
The partition function of the probability measure can be expressed using the bilinear form applied to the Spin(2)-valued Wavefunction:
Theorem 6 
(Partition Function). $Z = \langle \psi | \psi \rangle$
Proof. 
$$\langle \psi | \psi \rangle = \sum_i \psi_i^\ddagger \psi_i = \sum_i \rho_i R_i^\ddagger R_i = \sum_i \rho_i = Z$$
   □
Definition 8 
(Spin(2)-valued Evolution Operator).
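$$T = \begin{bmatrix} e^{-\frac{1}{2} \theta \mathbf{b}_1} & & \\ & \ddots & \\ & & e^{-\frac{1}{2} \theta \mathbf{b}_n} \end{bmatrix}$$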
Theorem 7. 
The partition function is invariant with respect to the Spin(2)-valued evolution operator.
Proof. 
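We note that:
$$\langle T v | T v \rangle = v^\ddagger T^\ddagger T v = \langle v | v \rangle \quad \text{whenever} \quad T^\ddagger T = I$$
Then, since $\begin{bmatrix} e^{\frac{1}{2} \theta \mathbf{b}_1} & & \\ & \ddots & \\ & & e^{\frac{1}{2} \theta \mathbf{b}_n} \end{bmatrix} \begin{bmatrix} e^{-\frac{1}{2} \theta \mathbf{b}_1} & & \\ & \ddots & \\ & & e^{-\frac{1}{2} \theta \mathbf{b}_n} \end{bmatrix} = I$, the relation $T^\ddagger T = I$ is satisfied.    □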
We note that the even sub-algebra of GA ( 2 ) , being closed under addition and multiplication and constituting an inner product through its bilinear form, allows for the construction of a Hilbert space. In this context, the Hilbert space is Spin(2)-valued. The primary distinction between a wavefunction in a complex Hilbert space and one in a Spin(2)-valued Hilbert space lies in the subject matter of the theory. Specifically, in the latter, the construction governs the change in orientation experienced by an observer, which in turn dictates the measurement basis used in the experiment, consistently with the rotational symmetry and freedom of the system.
The dynamics of observer orientation transformations are described by the Schrödinger equation, which is derived by taking the derivative of the wavefunction with respect to the Lagrange multiplier, θ :
Definition 9 
(Spin(2)-valued Schrödinger Equation).
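$$\frac{d}{d\theta} \begin{bmatrix} \psi_1(\theta) \\ \vdots \\ \psi_n(\theta) \end{bmatrix} = \begin{bmatrix} -\frac{1}{2} \mathbf{b}_1 & & \\ & \ddots & \\ & & -\frac{1}{2} \mathbf{b}_n \end{bmatrix} \begin{bmatrix} \psi_1(\theta) \\ \vdots \\ \psi_n(\theta) \end{bmatrix}$$
Here, $\theta$ represents a global one-parameter evolution parameter akin to time, which transforms the wavefunction under Spin(2), locally across the states of the Hilbert space. This is an extremely general equation that captures all transformations that can be done consistently with the symmetries of the wavefunction.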
Definition 10 
(David Hestenes’ Formulation). In 3+1D, David Hestenes’ formulation [7] of the wavefunction is $\psi = \sqrt{\rho} R e^{i b / 2}$, where $R = e^{\mathbf{f} / 2}$ is a Lorentz boost or rotation and where $e^{i b / 2}$ is a phase. In 2D, as the algebra only admits a bivector, his formulation would reduce to $\psi = \sqrt{\rho} R$, which is the form we have recovered.
The definition of the Dirac current applicable to our wavefunction follows the formulation of David Hestenes:
Definition 11 
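(Dirac Current). Given the basis $\sigma_x$ and $\sigma_y$, the Dirac current for the 2D theory is defined as:
$$J_1 \equiv \psi^\ddagger \sigma_x \psi = \rho \underbrace{R^\ddagger \sigma_x R}_{\mathrm{SO}(2)} = \rho \tilde{\sigma}_x$$
$$J_2 \equiv \psi^\ddagger \sigma_y \psi = \rho \underbrace{R^\ddagger \sigma_y R}_{\mathrm{SO}(2)} = \rho \tilde{\sigma}_y$$
where $\tilde{\sigma}_x$ and $\tilde{\sigma}_y$ are SO(2)-rotated basis vectors.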

2.2.1. Obstructions

As stated, all geometric configurations except 3+1D contain obstructions. Specifically, in 1+1D and 2D, we identify two obstructions:
  • In 1+1D: The 1+1D theory results in a split-complex quantum theory due to the bilinear form $(a - b e_0 e_1)(a + b e_0 e_1)$, which yields negative probabilities, $a^2 - b^2 \in \mathbb{R}$, for certain wavefunction states, in contrast to the non-negative probabilities $a^2 + b^2 \in \mathbb{R}_{\geq 0}$ obtained in the Euclidean 2D case. (This is why we had to use 2D instead of 1+1D in this two-dimensional introduction.)
  • In 1+1D and in 2D: The basis vectors ($\sigma_x$ and $\sigma_y$ in 2D, and $e_0$ and $e_1$ in 1+1D) are not self-adjoint. Although usable in the context of defining the Dirac current, their non-self-adjointness prevents the construction of the metric tensor as an observable. The benefits of having self-adjoint basis vectors will become obvious in the 3+1D case, where we will be able to construct the metric tensor from basis measurements. Specifically, in 2D:
    $$(e_\mu \mathbf{u})^\ddagger \mathbf{u} \neq \mathbf{u}^\ddagger e_\mu \mathbf{u}$$
    because $(e_\mu \mathbf{u})^\ddagger \mathbf{u} = \mathbf{u}^\ddagger e_\mu^\ddagger \mathbf{u} = \mathbf{u}^\ddagger (-e_\mu) \mathbf{u}$.
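The first obstruction can be made concrete with a small sketch. It models an even element $a + b e$ as a pair `(a, b)`, where the unit `e` squares to $-1$ in the Euclidean 2D case and to $+1$ in the 1+1D (split-complex) case; the function names and the sample amplitude are hypothetical.

```python
def multiply(u, v, unit_square):
    """Product of u = (a1, b1) and v = (a2, b2), read as a + b*e with e*e = unit_square."""
    a1, b1 = u
    a2, b2 = v
    return (a1 * a2 + unit_square * b1 * b2, a1 * b2 + b1 * a2)

def self_product(u, unit_square):
    """Scalar part of conj(u) * u, where conj flips the sign of the e component."""
    a, b = u
    scalar, e_part = multiply((a, -b), (a, b), unit_square)
    assert e_part == 0  # the e component always cancels
    return scalar

amplitude = (1.0, 2.0)  # hypothetical amplitude a + b*e with a = 1, b = 2

print("2D   (e*e = -1):", self_product(amplitude, -1))  # a^2 + b^2
print("1+1D (e*e = +1):", self_product(amplitude, +1))  # a^2 - b^2
```

With the same amplitude, the Euclidean signature gives a non-negative weight while the 1+1D signature gives a negative one, which cannot be read as a probability.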
In the following section, we will explore the obstruction-free 3+1D case.

2.3. RQM in 3+1D

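In this section, we extend the concepts and techniques developed for multivector amplitudes in 2D to the more physically relevant case of 3+1 dimensions. The Lagrange multiplier equation is as follows:
$$\mathcal{L} = \underbrace{-\sum_i \rho_i \ln \frac{\rho_i}{p_i}}_{\text{Relative Shannon Entropy}} + \underbrace{\lambda \left( 1 - \sum_i \rho_i \right)}_{\text{Normalization Constraint}} + \underbrace{\zeta \left( -\operatorname{tr} \frac{1}{2} \sum_i \rho_i \mathbf{M}_i \right)}_{\text{Vanishing } \mathrm{Spin}^c(3,1)\text{-Phase}}$$
The solution (proof in Appendix B) is obtained using the same step-by-step process as the 2D case, and yields:
$$\rho_i = \underbrace{\frac{1}{\sum_j p_j \det \exp \left( -\zeta \frac{1}{2} \mathbf{M}_j \right)}}_{\mathrm{Spin}^c(3,1) \text{ Invariant Ensemble}} \underbrace{\det \exp \left( -\zeta \frac{1}{2} \mathbf{M}_i \right)}_{\mathrm{Spin}^c(3,1) \text{ Born Rule}} \underbrace{p_i}_{\text{Initial Preparation}}$$
where $\zeta$ is a "twisted-phase" rapidity. (If the invariance group were $\mathrm{Spin}(3,1)$ instead of $\mathrm{Spin}^c(3,1)$, obtainable by posing $\mathbf{b} \to 0$, then it would simply be the rapidity.)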

2.3.1. Preliminaries

Our initial goal will be to express the partition function as a self-product of elements of the vector space. As such, we begin by defining a general multivector in the geometric algebra GA ( 3 , 1 ) .
Definition 12 
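(Multivector). Let $\mathbf{u}$ be a multivector of $\mathrm{GA}(3,1)$. Its general form is:
$$\begin{aligned}
\mathbf{u} &= a \\
&+ t \gamma_0 + x \gamma_1 + y \gamma_2 + z \gamma_3 \\
&+ f_{01} \gamma_0 \gamma_1 + f_{02} \gamma_0 \gamma_2 + f_{03} \gamma_0 \gamma_3 + f_{12} \gamma_1 \gamma_2 + f_{13} \gamma_1 \gamma_3 + f_{23} \gamma_2 \gamma_3 \\
&+ p \gamma_1 \gamma_2 \gamma_3 + q \gamma_0 \gamma_2 \gamma_3 + v \gamma_0 \gamma_1 \gamma_3 + w \gamma_0 \gamma_1 \gamma_2 \\
&+ b \gamma_0 \gamma_1 \gamma_2 \gamma_3
\end{aligned}$$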
where γ 0 , γ 1 , γ 2 , γ 3 are the basis vectors in the real Majorana representation.
A more compact notation for u is
u = a + x + f + v + b
where $a$ is a scalar, $\mathbf{x}$ a vector, $\mathbf{f}$ a bivector, $\mathbf{v}$ a pseudo-vector and $\mathbf{b}$ a pseudo-scalar.
This general multivector can be represented by a 4 × 4 real matrix using the real Majorana representation:
Definition 13 
(Matrix Representation of u ).
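$$\mathbf{M} = \begin{bmatrix}
a + f_{02} - q - z & b - f_{13} + w - x & -f_{01} + f_{12} - p + v & f_{03} + f_{23} + t + y \\
-b + f_{13} + w - x & a + f_{02} + q + z & f_{03} + f_{23} - t - y & f_{01} - f_{12} - p + v \\
-f_{01} - f_{12} + p + v & f_{03} - f_{23} + t - y & a - f_{02} + q - z & -b - f_{13} - w - x \\
f_{03} - f_{23} - t + y & f_{01} + f_{12} + p + v & b + f_{13} - w - x & a - f_{02} - q + z
\end{bmatrix}$$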
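As a consistency check on this representation, one can read the four gamma matrices off the coefficients of $t$, $x$, $y$ and $z$ and verify the Clifford relations of $\mathrm{GA}(3,1)$. The sketch below assumes the sign pattern $\gamma_0^2 = -1$, $\gamma_1^2 = \gamma_2^2 = \gamma_3^2 = +1$ and the entries as they appear in the matrix above; the constant names are chosen for illustration.

```python
def mat_mul(p, q):
    """Product of two square matrices given as nested lists."""
    n = len(p)
    return [[sum(p[i][k] * q[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def mat_add(p, q):
    """Entry-wise sum of two matrices of equal shape."""
    return [[p[i][j] + q[i][j] for j in range(len(p))] for i in range(len(p))]

def scaled_identity(s, n=4):
    """The n x n matrix s * I."""
    return [[s if i == j else 0 for j in range(n)] for i in range(n)]

# Gamma matrices read off as the coefficients of t, x, y, z in M
G0 = [[0, 0, 0, 1], [0, 0, -1, 0], [0, 1, 0, 0], [-1, 0, 0, 0]]
G1 = [[0, -1, 0, 0], [-1, 0, 0, 0], [0, 0, 0, -1], [0, 0, -1, 0]]
G2 = [[0, 0, 0, 1], [0, 0, -1, 0], [0, -1, 0, 0], [1, 0, 0, 0]]
G3 = [[-1, 0, 0, 0], [0, 1, 0, 0], [0, 0, -1, 0], [0, 0, 0, 1]]
GAMMAS = [G0, G1, G2, G3]
METRIC = [-1, 1, 1, 1]  # assumed signature: one negative, three positive squares

def clifford_relations_hold():
    """Check gamma_m gamma_n + gamma_n gamma_m = 2 * eta_mn * I for all m, n."""
    for m in range(4):
        for n in range(4):
            anti = mat_add(mat_mul(GAMMAS[m], GAMMAS[n]),
                           mat_mul(GAMMAS[n], GAMMAS[m]))
            expected = scaled_identity(2 * METRIC[m] if m == n else 0)
            if anti != expected:
                return False
    return True

# Pseudo-scalar gamma_0 gamma_1 gamma_2 gamma_3, i.e. the coefficient of b in M
PSEUDO = mat_mul(mat_mul(mat_mul(G0, G1), G2), G3)

print("Clifford relations hold:", clifford_relations_hold())
print("pseudo-scalar matrix   :", PSEUDO)
```

The computed pseudo-scalar matrix has its non-zero entries exactly where $b$ appears in $\mathbf{M}$, with matching signs, and it squares to $-I$.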
To manipulate and analyze multivectors in GA ( 3 , 1 ) , we introduce several important operations, such as the multivector conjugate, the 3,4 blade conjugate, and the multivector self-product.
Definition 14 
(Multivector Conjugate (in 4D)).
$$\mathbf{u}^\ddagger = a - \mathbf{x} - \mathbf{f} + \mathbf{v} + \mathbf{b}$$
Definition 15 
(3,4 Blade Conjugate). The 3,4 blade conjugate of $\mathbf{u}$ is
$$\lfloor \mathbf{u} \rfloor_{3,4} = a + \mathbf{x} + \mathbf{f} - \mathbf{v} - \mathbf{b}$$
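The results of Lundholm [8] demonstrate that the multivector norms in the following definition are the unique forms which carry the properties of the determinant, such as $N(\mathbf{u} \mathbf{v}) = N(\mathbf{u}) N(\mathbf{v})$, to the domain of multivectors:
Definition 16. 
The self-products associated with low-dimensional geometric algebras are:
$$\begin{aligned}
\mathrm{GA}(0,1) &: \varphi^\dagger \varphi \\
\mathrm{GA}(2,0) &: \varphi^\ddagger \varphi \\
\mathrm{GA}(3,0) &: \lfloor \varphi^\ddagger \varphi \rfloor_3 \varphi^\ddagger \varphi \\
\mathrm{GA}(3,1) &: \lfloor \varphi^\ddagger \varphi \rfloor_{3,4} \varphi^\ddagger \varphi \\
\mathrm{GA}(4,1) &: \left( \lfloor \varphi^\ddagger \varphi \rfloor_{3,4} \varphi^\ddagger \varphi \right)^\dagger \left( \lfloor \varphi^\ddagger \varphi \rfloor_{3,4} \varphi^\ddagger \varphi \right)
\end{aligned}$$
We can now express the determinant of the matrix representation of a multivector via the self-product $\lfloor \varphi^\ddagger \varphi \rfloor_{3,4} \varphi^\ddagger \varphi$. Again, this choice is not arbitrary: it is the unique choice which allows us to represent the determinant of the matrix representation of a multivector within $\mathrm{GA}(3,1)$: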
Theorem 8 
(Determinant as a Multivector Self-Product).
$$\lfloor \mathbf{u}^\ddagger \mathbf{u} \rfloor_{3,4} \mathbf{u}^\ddagger \mathbf{u} = \det \mathbf{M}$$
Proof. 
A computer-assisted proof of this equality is given in Appendix C.    □
Definition 17 
( GA ( 3 , 1 ) -valued Vector).
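$$\mathbf{V} = \begin{bmatrix} \mathbf{u}_1 \\ \vdots \\ \mathbf{u}_n \end{bmatrix} = \begin{bmatrix} a_1 + \mathbf{x}_1 + \mathbf{f}_1 + \mathbf{v}_1 + \mathbf{b}_1 \\ \vdots \\ a_n + \mathbf{x}_n + \mathbf{f}_n + \mathbf{v}_n + \mathbf{b}_n \end{bmatrix}$$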
These constructions allow us to express the partition function in terms of the multivector self-product:
Definition 18 
(Double-Copy Product). Instead of an inner product, we obtain what we call a double-copy product:
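$$\begin{aligned}
\langle \mathbf{V} | \mathbf{V} | \mathbf{V} | \mathbf{V} \rangle &= \sum_i \lfloor \underbrace{\psi_i^\ddagger \psi_i}_{\text{copy 1}} \rfloor_{3,4} \underbrace{\psi_i^\ddagger \psi_i}_{\text{copy 2}} \\
&= \Big\lfloor \underbrace{\begin{bmatrix} \mathbf{u}_1^\ddagger & \cdots & \mathbf{u}_n^\ddagger \end{bmatrix} \begin{bmatrix} \mathbf{u}_1 & & 0 \\ & \ddots & \\ 0 & & \mathbf{u}_n \end{bmatrix}}_{\text{copy 1}} \Big\rfloor_{3,4} \underbrace{\begin{bmatrix} \mathbf{u}_1^\ddagger & & 0 \\ & \ddots & \\ 0 & & \mathbf{u}_n^\ddagger \end{bmatrix} \begin{bmatrix} \mathbf{u}_1 \\ \vdots \\ \mathbf{u}_n \end{bmatrix}}_{\text{copy 2}}
\end{aligned}$$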
Theorem 9 
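(Partition Function). $Z = \langle \mathbf{V} | \mathbf{V} | \mathbf{V} | \mathbf{V} \rangle$
Proof. 
$$\begin{aligned}
& \langle \mathbf{V} | \mathbf{V} | \mathbf{V} | \mathbf{V} \rangle \\
&= \Big\lfloor \begin{bmatrix} \mathbf{u}_1^\ddagger & \cdots & \mathbf{u}_n^\ddagger \end{bmatrix} \begin{bmatrix} \mathbf{u}_1 & & 0 \\ & \ddots & \\ 0 & & \mathbf{u}_n \end{bmatrix} \Big\rfloor_{3,4} \begin{bmatrix} \mathbf{u}_1^\ddagger & & 0 \\ & \ddots & \\ 0 & & \mathbf{u}_n^\ddagger \end{bmatrix} \begin{bmatrix} \mathbf{u}_1 \\ \vdots \\ \mathbf{u}_n \end{bmatrix} \\
&= \Big\lfloor \begin{bmatrix} \mathbf{u}_1^\ddagger \mathbf{u}_1 & \cdots & \mathbf{u}_n^\ddagger \mathbf{u}_n \end{bmatrix} \Big\rfloor_{3,4} \begin{bmatrix} \mathbf{u}_1^\ddagger \mathbf{u}_1 \\ \vdots \\ \mathbf{u}_n^\ddagger \mathbf{u}_n \end{bmatrix} \\
&= \lfloor \mathbf{u}_1^\ddagger \mathbf{u}_1 \rfloor_{3,4} \mathbf{u}_1^\ddagger \mathbf{u}_1 + \dots + \lfloor \mathbf{u}_n^\ddagger \mathbf{u}_n \rfloor_{3,4} \mathbf{u}_n^\ddagger \mathbf{u}_n \\
&= \sum_{i=1}^n \det \mathbf{M}_{\mathbf{u}_i} \\
&= Z
\end{aligned}$$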
   □
Desirable properties for the double-copy product are introduced by reducing multivectors to their subgroups. First, non-negativity:
Theorem 10 
(Non-negativity). The double-copy product, applied to the even sub-algebra of GA ( 3 , 1 ) is always non-negative.
Proof. 
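Let $\mathbf{V} = \begin{bmatrix} a_1 + \mathbf{f}_1 + \mathbf{b}_1 \\ \vdots \\ a_n + \mathbf{f}_n + \mathbf{b}_n \end{bmatrix}$. Then,
$$\begin{aligned}
& \langle \mathbf{V} | \mathbf{V} | \mathbf{V} | \mathbf{V} \rangle \\
&= \lfloor (a_1 + \mathbf{f}_1 + \mathbf{b}_1)^\ddagger (a_1 + \mathbf{f}_1 + \mathbf{b}_1) \rfloor_{3,4} (a_1 + \mathbf{f}_1 + \mathbf{b}_1)^\ddagger (a_1 + \mathbf{f}_1 + \mathbf{b}_1) + \dots \\
&= \lfloor (a_1 - \mathbf{f}_1 + \mathbf{b}_1)(a_1 + \mathbf{f}_1 + \mathbf{b}_1) \rfloor_{3,4} (a_1 - \mathbf{f}_1 + \mathbf{b}_1)(a_1 + \mathbf{f}_1 + \mathbf{b}_1) + \dots \\
&= \lfloor a_1^2 + a_1 \mathbf{f}_1 + a_1 \mathbf{b}_1 - \mathbf{f}_1 a_1 - \mathbf{f}_1^2 - \mathbf{f}_1 \mathbf{b}_1 + \mathbf{b}_1 a_1 + \mathbf{b}_1 \mathbf{f}_1 + \mathbf{b}_1^2 \rfloor_{3,4} \dots \\
&= \lfloor a_1^2 - \mathbf{f}_1^2 + \mathbf{b}_1^2 \rfloor_{3,4} \dots
\end{aligned}$$
We note 1) $\mathbf{b}^2 = (b \mathbf{I})^2 = -b^2$ and 2) $\mathbf{f}^2 = -E_1^2 - E_2^2 - E_3^2 + B_1^2 + B_2^2 + B_3^2 + 4 e_0 e_1 e_2 e_3 (E_1 B_1 + E_2 B_2 + E_3 B_3)$
$$= \lfloor a_1^2 - b_1^2 + E_1^2 + E_2^2 + E_3^2 - B_1^2 - B_2^2 - B_3^2 - 4 e_0 e_1 e_2 e_3 (E_1 B_1 + E_2 B_2 + E_3 B_3) \rfloor_{3,4} \dots$$
We note that the terms are now complex numbers, which we rewrite as $\Re(z) = a_1^2 - b_1^2 + E_1^2 + E_2^2 + E_3^2 - B_1^2 - B_2^2 - B_3^2$ and $\Im(z) = -4 (E_1 B_1 + E_2 B_2 + E_3 B_3)$
$$\begin{aligned}
&= \Big\lfloor \begin{bmatrix} z_1 & \cdots & z_n \end{bmatrix} \Big\rfloor_{3,4} \begin{bmatrix} z_1 \\ \vdots \\ z_n \end{bmatrix} \\
&= \begin{bmatrix} z_1^\dagger & \cdots & z_n^\dagger \end{bmatrix} \begin{bmatrix} z_1 \\ \vdots \\ z_n \end{bmatrix} \\
&= z_1^\dagger z_1 + \dots + z_n^\dagger z_n
\end{aligned}$$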
which is always non-negative.    □
Then, positive-definiteness of the double-copy product is obtained by creating an equivalence class between the zero vector and any non-zero vector of length zero, and taking the zero vector as the representative of the class. To realize the equivalence class, we define the $\mathrm{Spin}^c(3,1)$-valued wavefunction, which is valued in the even sub-algebra of $\mathrm{GA}(3,1)$, as follows:
Definition 19 
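($\mathrm{Spin}^c(3,1)$-valued Wavefunction).
$$\psi = \begin{bmatrix} e^{\frac{1}{2}(a_1 + \mathbf{f}_1 + \mathbf{b}_1)} \\ \vdots \\ e^{\frac{1}{2}(a_n + \mathbf{f}_n + \mathbf{b}_n)} \end{bmatrix} = \begin{bmatrix} \sqrt{\rho_1} R_1 B_1 \\ \vdots \\ \sqrt{\rho_n} R_n B_n \end{bmatrix}$$
where $R_i = e^{\frac{1}{2} \mathbf{f}_i}$ is a rotor, $B_i = e^{\frac{1}{2} \mathbf{b}_i}$ is a phase, and where $\sqrt{\rho_i} = e^{\frac{1}{2} a_i} \geq 0$.
Any even multivector of $\mathrm{GA}(3,1)$ admits a unique exponential representation, except when $\rho_i = 0$, in which case it is surjective. Consequently, in this representation the double-copy product yields 0 only for the zero vector, rendering the double-copy product positive-definite.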
Now, let us turn our attention to the evolution operator, which leaves the partition function invariant:
Definition 20 
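($\mathrm{Spin}^c(3,1)$ Evolution Operator).
$$T = \begin{bmatrix} e^{-\frac{1}{2} \zeta (\mathbf{f}_1 + \mathbf{b}_1)} & & \\ & \ddots & \\ & & e^{-\frac{1}{2} \zeta (\mathbf{f}_n + \mathbf{b}_n)} \end{bmatrix}$$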
In turn, this leads to a Schrödinger equation obtained by taking the derivative of the wavefunction with respect to the Lagrange multiplier ζ :
Definition 21 
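($\mathrm{Spin}^c(3,1)$-valued Schrödinger equation).
$$\frac{d}{d\zeta} \begin{bmatrix} \psi_1(\zeta) \\ \vdots \\ \psi_n(\zeta) \end{bmatrix} = \begin{bmatrix} -\frac{1}{2} (\mathbf{f}_1 + \mathbf{b}_1) & & \\ & \ddots & \\ & & -\frac{1}{2} (\mathbf{f}_n + \mathbf{b}_n) \end{bmatrix} \begin{bmatrix} \psi_1(\zeta) \\ \vdots \\ \psi_n(\zeta) \end{bmatrix}$$
In this case, $\zeta$ represents a one-parameter evolution parameter akin to time, which transforms the measurement basis under the action of the $\mathrm{Spin}^c(3,1)$ group. This is an extremely general equation that captures all transformations that can be done consistently with the symmetries of the wavefunction.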
Theorem 11 
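($\mathrm{Spin}^c(3,1)$ invariance). Let $e^{\frac12 \mathbf{f}} e^{\frac12 \mathbf{b}}$ be a general element of $\mathrm{Spin}^c(3,1)$. Then, the equality:
$$\lfloor \psi^{\ddagger}\psi \rfloor_{3,4}\,\psi^{\ddagger}\psi = \lfloor (e^{\frac12 \mathbf{f}} e^{\frac12 \mathbf{b}} \psi)^{\ddagger} e^{\frac12 \mathbf{f}} e^{\frac12 \mathbf{b}} \psi \rfloor_{3,4}\,(e^{\frac12 \mathbf{f}} e^{\frac12 \mathbf{b}} \psi)^{\ddagger} e^{\frac12 \mathbf{f}} e^{\frac12 \mathbf{b}} \psi$$
is always satisfied.
Proof. 
$$\lfloor (e^{\frac12 \mathbf{f}} e^{\frac12 \mathbf{b}} \psi)^{\ddagger} e^{\frac12 \mathbf{f}} e^{\frac12 \mathbf{b}} \psi \rfloor_{3,4}\,(e^{\frac12 \mathbf{f}} e^{\frac12 \mathbf{b}} \psi)^{\ddagger} e^{\frac12 \mathbf{f}} e^{\frac12 \mathbf{b}} \psi$$
$$= \lfloor \psi^{\ddagger} e^{-\frac12 \mathbf{f}} e^{\frac12 \mathbf{b}} e^{\frac12 \mathbf{f}} e^{\frac12 \mathbf{b}} \psi \rfloor_{3,4}\,\psi^{\ddagger} e^{-\frac12 \mathbf{f}} e^{\frac12 \mathbf{b}} e^{\frac12 \mathbf{f}} e^{\frac12 \mathbf{b}} \psi$$
$$= \lfloor \psi^{\ddagger} e^{\mathbf{b}} \psi \rfloor_{3,4}\,\psi^{\ddagger} e^{\mathbf{b}} \psi$$
$$= \lfloor \psi^{\ddagger} \psi \rfloor_{3,4}\, e^{-\mathbf{b}} e^{\mathbf{b}}\, \psi^{\ddagger} \psi$$
$$= \lfloor \psi^{\ddagger} \psi \rfloor_{3,4}\,\psi^{\ddagger} \psi$$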
   □

2.3.2. RQM

Definition 22 
(David Hestenes’ Wavefunction). The $\mathrm{Spin}^c(3,1)$-valued wavefunction we have recovered is formulated identically to David Hestenes’[7] formulation of the wavefunction within GA(3,1).
$$\psi = e^{\frac12(a + \mathbf{f} + \mathbf{b})} = \sqrt{\rho}\, R\, e^{ib/2}$$
where $e^{\frac12 a} = \sqrt{\rho}$, $e^{\frac12 \mathbf{f}} = R$ and $e^{\frac12 \mathbf{b}} = e^{ib/2}$.
Before we continue the RQM investigation, let us note that the double-copy product contains two copies of a bilinear form $\psi^{\ddagger}\psi$:
$$\lfloor \underbrace{\psi^{\ddagger}\psi}_{\text{copy 1}} \rfloor_{3,4}\, \underbrace{\psi^{\ddagger}\psi}_{\text{copy 2}}$$
In the present section, we will investigate the properties of each copy individually, leaving the properties specific to the double-copy for the section on quantum gravity.
Taking a single copy, the Dirac current is obtained directly from the gamma matrices, as follows:
Definition 23 
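(Dirac Current). The definition of the Dirac current is the same as Hestenes’:
$$J \equiv \psi^{\ddagger} \gamma_\mu \psi = \rho R^{\ddagger} B^{\ddagger} \gamma_\mu B R = \rho R^{\ddagger} \gamma_\mu B^{-1} B R = \rho\, \underbrace{R^{\ddagger} \gamma_\mu R}_{\mathrm{SO}(3,1)} = \rho\, \tilde{\gamma}_\mu$$
where $\tilde{\gamma}_\mu$ is an SO(3,1)-rotated basis vector.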

2.3.3. Standard Model Gauge Symmetries

We will now demonstrate that the copied bilinear form is automatically invariant with respect to the $\mathrm{U}(1)$, $\mathrm{SU}(2)$, and $\mathrm{SU}(3)$ symmetries and the unitary $U^{\dagger}U = I$ symmetry, which play a fundamental role in the standard model of particle physics. These constitute the set of symmetries that stabilize the Dirac current $(T\psi)^{\ddagger} \gamma_0 T\psi = \psi^{\ddagger} \gamma_0 \psi$.
Theorem 12 
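(U(1) Invariance). Let $e^{\frac12 \mathbf{b}}$ be a general element of U(1). Then, the equality
$$\lfloor \psi^{\ddagger} \gamma_0 \psi \rfloor_{3,4}\,\psi^{\ddagger} \gamma_0 \psi = \lfloor \underbrace{(e^{\frac12 \mathbf{b}} \psi)^{\ddagger} \gamma_0 e^{\frac12 \mathbf{b}} \psi}_{\text{copy 1}} \rfloor_{3,4}\, \underbrace{(e^{\frac12 \mathbf{b}} \psi)^{\ddagger} \gamma_0 e^{\frac12 \mathbf{b}} \psi}_{\text{copy 2}}$$
is satisfied, yielding a U(1) symmetry for each copied bilinear form.
Proof. 
Equation 126 is invariant if this expression is satisfied:
$$e^{\frac12 \mathbf{b}} \gamma_0 e^{\frac12 \mathbf{b}} = \gamma_0$$
This is always satisfied simply because $e^{\frac12 \mathbf{b}} \gamma_0 e^{\frac12 \mathbf{b}} = \gamma_0 e^{-\frac12 \mathbf{b}} e^{\frac12 \mathbf{b}} = \gamma_0$.    □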
Theorem 13 
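(SU(2) Invariance). Let $e^{\frac12 \mathbf{f}}$ be a general element of Spin(3,1). Then, the equality:
$$\lfloor \psi^{\ddagger} \gamma_0 \psi \rfloor_{3,4}\,\psi^{\ddagger} \gamma_0 \psi = \lfloor \underbrace{(e^{\frac12 \mathbf{f}} \psi)^{\ddagger} \gamma_0 e^{\frac12 \mathbf{f}} \psi}_{\text{copy 1}} \rfloor_{3,4}\, \underbrace{(e^{\frac12 \mathbf{f}} \psi)^{\ddagger} \gamma_0 e^{\frac12 \mathbf{f}} \psi}_{\text{copy 2}}$$
is satisfied if $\mathbf{f} = \theta_1 \gamma_2 \gamma_3 + \theta_2 \gamma_1 \gamma_3 + \theta_3 \gamma_1 \gamma_2$ (which generates SU(2)), yielding an SU(2) symmetry for each copied bilinear form.
Proof. 
Equation 128 is invariant if this expression is satisfied[9]:
$$e^{-\frac12 \mathbf{f}} \gamma_0 e^{\frac12 \mathbf{f}} = \gamma_0$$
We now note that moving the left-most term to the right of the gamma matrix yields:
$$e^{-E_1 \gamma_0 \gamma_1 - E_2 \gamma_0 \gamma_2 - E_3 \gamma_0 \gamma_3 - \theta_1 \gamma_2 \gamma_3 - \theta_2 \gamma_1 \gamma_3 - \theta_3 \gamma_1 \gamma_2}\, \gamma_0\, e^{\frac12 \mathbf{f}}$$
$$= \gamma_0\, e^{E_1 \gamma_0 \gamma_1 + E_2 \gamma_0 \gamma_2 + E_3 \gamma_0 \gamma_3 - \theta_1 \gamma_2 \gamma_3 - \theta_2 \gamma_1 \gamma_3 - \theta_3 \gamma_1 \gamma_2}\, e^{\frac12 \mathbf{f}}$$
Therefore, the product $e^{-\frac12 \mathbf{f}} \gamma_0 e^{\frac12 \mathbf{f}}$ reduces to $\gamma_0$ if and only if $E_1 = E_2 = E_3 = 0$, leaving $\mathbf{f} = \theta_1 \gamma_2 \gamma_3 + \theta_2 \gamma_1 \gamma_3 + \theta_3 \gamma_1 \gamma_2$.
Finally, we note that $e^{\theta_1 \gamma_2 \gamma_3 + \theta_2 \gamma_1 \gamma_3 + \theta_3 \gamma_1 \gamma_2}$ generates $\mathrm{SU}(2)$.    □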
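The stabilizer conditions used in the two proofs above can be spot-checked numerically. The following is a minimal sketch, not the author's code: it assumes the real Majorana matrices of Appendix C as a representation of the basis vectors, restricts each exponential to a single plane so that a closed form is available, and uses hypothetical helper names (`matmul`, `combine`, `close`).

```python
import math


def matmul(a, b):
    """Product of two 4x4 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(4)) for j in range(4)]
            for i in range(4)]


def combine(c1, m1, c2, m2):
    """Linear combination c1*m1 + c2*m2 of two 4x4 matrices."""
    return [[c1 * m1[i][j] + c2 * m2[i][j] for j in range(4)] for i in range(4)]


def close(a, b, tol=1e-12):
    """Entry-wise comparison within a tolerance."""
    return all(abs(a[i][j] - b[i][j]) < tol for i in range(4) for j in range(4))


ID = [[1 if i == j else 0 for j in range(4)] for i in range(4)]
# Real Majorana matrices copied from Appendix C (g0 squares to -1, the rest to +1)
G0 = [[0, 0, 0, 1], [0, 0, -1, 0], [0, 1, 0, 0], [-1, 0, 0, 0]]
G1 = [[0, -1, 0, 0], [-1, 0, 0, 0], [0, 0, 0, -1], [0, 0, -1, 0]]
G2 = [[0, 0, 0, 1], [0, 0, -1, 0], [0, -1, 0, 0], [1, 0, 0, 0]]
G3 = [[-1, 0, 0, 0], [0, 1, 0, 0], [0, 0, -1, 0], [0, 0, 0, 1]]

theta, rapidity, beta = 0.7, 0.4, 1.3  # arbitrary illustrative parameters

# Spatial bivector g1 g2 squares to -1: exp(t/2 * g1g2) = cos(t/2) + sin(t/2) g1g2
B12 = matmul(G1, G2)
rot = combine(math.cos(theta / 2), ID, math.sin(theta / 2), B12)
rot_inv = combine(math.cos(theta / 2), ID, -math.sin(theta / 2), B12)
su2_ok = close(matmul(matmul(rot_inv, G0), rot), G0)

# Boost bivector g0 g1 squares to +1: exp(E/2 * g0g1) = cosh(E/2) + sinh(E/2) g0g1
B01 = matmul(G0, G1)
boost = combine(math.cosh(rapidity / 2), ID, math.sinh(rapidity / 2), B01)
boost_inv = combine(math.cosh(rapidity / 2), ID, -math.sinh(rapidity / 2), B01)
boost_ok = close(matmul(matmul(boost_inv, G0), boost), G0)

# Pseudoscalar I = g0 g1 g2 g3 squares to -1 and anticommutes with g0
I4 = matmul(matmul(G0, G1), matmul(G2, G3))
phase = combine(math.cos(beta / 2), ID, math.sin(beta / 2), I4)
u1_ok = close(matmul(matmul(phase, G0), phase), G0)

print(su2_ok, boost_ok, u1_ok)
```

In this sketch the spatial rotation and the pseudoscalar phase leave the matrix standing in for $\gamma_0$ unchanged, while the boost does not, in line with the requirement $E_1 = E_2 = E_3 = 0$.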
Theorem 14 
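(SU(3)). The generators of SU(3) in GA(3,1) are given by Anthony Lasenby in [10] and are as follows:
$$\hat{E}_{ij} = \hat{e}_i \hat{e}_j - \hat{f}_i \hat{f}_j \quad \text{where } i < j$$
$$\hat{F}_{ij} = \hat{e}_i \hat{f}_j + \hat{e}_j \hat{f}_i \quad \text{where } i < j$$
$$\hat{J}_i = \hat{e}_i \hat{f}_i \quad \text{where } i = 1, 2, 3$$
where
$\hat{e}_i$ = multiplication on the left by $\sigma_i$, so that $\hat{e}_i(F) = \sigma_i F$
$\hat{f}_i$ = multiplication on the right by $I\sigma_i$, so that $\hat{f}_i(F) = I \sigma_i F$
This defines the 9 generators of U(3).
With the additional restriction on $\hat{J}$
$$\alpha_1 \hat{J}_1 + \alpha_2 \hat{J}_2 + \alpha_3 \hat{J}_3, \quad \text{with } \alpha_1 + \alpha_2 + \alpha_3 = 0$$
the number of generators is reduced to 8, consistent with SU(3).
We now must show that the following equation is satisfied for all 8 generators:
$$\lfloor \psi^{\ddagger} \gamma_0 \psi \rfloor_{3,4}\,\psi^{\ddagger} \gamma_0 \psi = \lfloor \underbrace{(e^{\theta_i \lambda_i} \psi)^{\ddagger} \gamma_0 e^{\theta_i \lambda_i} \psi}_{\text{copy 1}} \rfloor_{3,4}\, \underbrace{(e^{\theta_i \lambda_i} \psi)^{\ddagger} \gamma_0 e^{\theta_i \lambda_i} \psi}_{\text{copy 2}}$$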
Proof. 
First, we note the following action:
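$$\mathbf{f}^{\ddagger} \gamma_0 \mathbf{f} = \gamma_0$$
which we can rewrite as follows:
$$-(E_1 \gamma_0 \gamma_1 + E_2 \gamma_0 \gamma_2 + E_3 \gamma_0 \gamma_3 + B_1 \gamma_2 \gamma_3 + B_2 \gamma_1 \gamma_3 + B_3 \gamma_1 \gamma_2)\, \gamma_0\, \mathbf{f}$$
The first three terms anticommute with $\gamma_0$, while the last three commute with $\gamma_0$:
$$= \gamma_0 (E_1 \gamma_0 \gamma_1 + E_2 \gamma_0 \gamma_2 + E_3 \gamma_0 \gamma_3 - B_1 \gamma_2 \gamma_3 - B_2 \gamma_1 \gamma_3 - B_3 \gamma_1 \gamma_2)\, \mathbf{f}$$
This can be written as:
$$\gamma_0 (\mathbf{E} - \mathbf{B})(\mathbf{E} + \mathbf{B})$$
$$= \gamma_0 (\mathbf{E}^2 + \mathbf{E}\mathbf{B} - \mathbf{B}\mathbf{E} - \mathbf{B}^2)$$
where $\mathbf{E} = E_1 \gamma_0 \gamma_1 + E_2 \gamma_0 \gamma_2 + E_3 \gamma_0 \gamma_3$ and $\mathbf{B} = B_1 \gamma_2 \gamma_3 + B_2 \gamma_1 \gamma_3 + B_3 \gamma_1 \gamma_2$.
Thus, for $\mathbf{f}^{\ddagger} \gamma_0 \mathbf{f} = \gamma_0$, we require: 1) $\mathbf{E}^2 - \mathbf{B}^2 = 1$ and 2) $\mathbf{E}\mathbf{B} = \mathbf{B}\mathbf{E}$. The first requirement expands as follows:
$$\mathbf{E}^2 - \mathbf{B}^2 = (E_1^2 + B_1^2) + (E_2^2 + B_2^2) + (E_3^2 + B_3^2) = 1$$
which is the defining condition for the $\mathrm{SU}(3)$ symmetry group.
Finally, as the SU(3) norm is a consequence of preserving the Dirac current, it follows that the SU(3) generators provided by Lasenby, acting on $\mathbf{f}$, cannot change the SU(3) norm, and hence must also preserve the Dirac current.    □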
Theorem 15 
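(Unitary invariance). Let $U$ be $n \times n$ unitary matrices. Then unitary invariance:
$$\langle \psi | \gamma_\mu \psi | \psi | \gamma_\nu \psi \rangle = \langle U\psi | \gamma_\mu U\psi | U\psi | \gamma_\nu U\psi \rangle, \qquad U^{\dagger} U = I$$
is individually satisfied for each copied bilinear form.
Proof. 
Equation 145 is satisfied if $U^{\ddagger} \gamma_\mu U = \gamma_\mu$. Since $U$ is valued in complex numbers, then $U^{\ddagger} = U^{T}$, and since $\gamma_\mu \gamma_0 \gamma_1 \gamma_2 \gamma_3 = -\gamma_0 \gamma_1 \gamma_2 \gamma_3 \gamma_\mu$, it follows that:
$$\gamma_\mu U^{\dagger} U = \gamma_\mu$$
which is satisfied when $U^{\dagger} U = I$.    □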
The invariances SU(3), SU(2) and U(1) discussed above can be promoted to local symmetries using the usual gauge symmetry construction techniques, along with the Dirac equation or field Lagrangian.
In conventional QM, the Born rule naturally leads to a U(1)-valued gauge theory due to the following symmetry:
$$(e^{i\theta(x)} \psi(x))^{\dagger} e^{i\theta(x)} \psi(x) = \psi^{\dagger}(x) \psi(x)$$
However, the SU(3) and SU(2) symmetries do not emerge from the probability measure in the same way and must instead be introduced manually, justified by experimental considerations. This raises the question: why these specific symmetries and not others? In contrast, within the double-copy product framework, all three symmetry groups (U(1), SU(2), and SU(3)), as well as the Spin(3,1) and unitary symmetries, follow naturally from the invariance of the probability measure, in the same way that U(1) symmetry follows from the Born rule.

2.3.4. Quantum Gravity

In the previous section, we developed a quantum theory valued in Spinc(3,1), which served as the arena for RQM. We then demonstrated how a single copy of this theory leads to the gauge symmetries of the standard model. The goal of this section is to extend this methodology to arbitrary basis vectors, in which the metric tensor emerges as an observable. To achieve this, we will utilize both copies.
Our formulation is reminiscent of the Bern-Carrasco-Johansson (BCJ) double-copy approach to perturbatively expanded quantum gravity [11]. However, our double-copy is applied directly at the level of the Dirac current, rather than to gauge theory amplitudes.
By applying the double-copy product to the Dirac current, we establish a connection between quantum theory and the geometrical structure of spacetime.
We recall the definition of the metric tensor in terms of basis vectors of geometric algebra, as follows:
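$$g_{\mu\nu} = \frac12 (e_\mu e_\nu + e_\nu e_\mu)$$
Then, we note that the double-copy product acts on a pair of basis elements $e_\mu$ and $e_\nu$, as follows:
$$\frac12 \Big( \lfloor \underbrace{\psi^{\ddagger} e_\mu \psi}_{\text{copy 1}} \rfloor_{3,4}\, \underbrace{\psi^{\ddagger} e_\nu \psi}_{\text{copy 2}} + \lfloor \underbrace{\psi^{\ddagger} e_\nu \psi}_{\text{copy 2}} \rfloor_{3,4}\, \underbrace{\psi^{\ddagger} e_\mu \psi}_{\text{copy 1}} \Big)$$
$$= \frac12 \Big( \tilde{R}\, \underbrace{\rho\, e^{-ib/2} e^{ib/2}}_{\text{Born rule copy 1}}\, e_\mu R\; \tilde{R}\, \underbrace{\rho\, e^{-ib/2} e^{ib/2}}_{\text{Born rule copy 2}}\, e_\nu R + \tilde{R}\, \underbrace{\rho\, e^{-ib/2} e^{ib/2}}_{\text{Born rule copy 2}}\, e_\nu R\; \tilde{R}\, \underbrace{\rho\, e^{-ib/2} e^{ib/2}}_{\text{Born rule copy 1}}\, e_\mu R \Big)$$
$$= \frac12 \rho^2 \big( \tilde{R} e_\mu R\, \tilde{R} e_\nu R + \tilde{R} e_\nu R\, \tilde{R} e_\mu R \big)$$
$$= \rho^2\, \underbrace{\frac12 (\tilde{e}_\mu \tilde{e}_\nu + \tilde{e}_\nu \tilde{e}_\mu)}_{\text{metric tensor}}$$
where $\tilde{e}_\mu$ and $\tilde{e}_\nu$ are SO(3,1)-rotated basis vectors.
As one can swap $e_\mu$ and $e_\nu$ and obtain the same metric tensor, the double-copy product guarantees that $g_{\mu\nu}$ is symmetric.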
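The defining relation $g_{\mu\nu} = \frac12(e_\mu e_\nu + e_\nu e_\mu)$ can be evaluated directly in a matrix representation. The sketch below is illustrative only: it assumes the real Majorana matrices of Appendix C as the basis vectors, and the helper names `matmul` and `metric_component` are hypothetical.

```python
def matmul(a, b):
    """Product of two 4x4 integer matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(4)) for j in range(4)]
            for i in range(4)]


# Real Majorana matrices copied from Appendix C
GAMMAS = [
    [[0, 0, 0, 1], [0, 0, -1, 0], [0, 1, 0, 0], [-1, 0, 0, 0]],
    [[0, -1, 0, 0], [-1, 0, 0, 0], [0, 0, 0, -1], [0, 0, -1, 0]],
    [[0, 0, 0, 1], [0, 0, -1, 0], [0, -1, 0, 0], [1, 0, 0, 0]],
    [[-1, 0, 0, 0], [0, 1, 0, 0], [0, 0, -1, 0], [0, 0, 0, 1]],
]


def metric_component(mu, nu):
    """Return the scalar g such that (e_mu e_nu + e_nu e_mu)/2 = g * identity."""
    ab = matmul(GAMMAS[mu], GAMMAS[nu])
    ba = matmul(GAMMAS[nu], GAMMAS[mu])
    # Entries of ab + ba are even integers here, so integer division is exact
    sym = [[(ab[i][j] + ba[i][j]) // 2 for j in range(4)] for i in range(4)]
    g = sym[0][0]
    # Check that the symmetrized product really is a multiple of the identity
    assert all(sym[i][j] == (g if i == j else 0)
               for i in range(4) for j in range(4))
    return g


metric = [[metric_component(mu, nu) for nu in range(4)] for mu in range(4)]
for row in metric:
    print(row)
```

For this representation the result is the flat metric with one negative and three positive entries, and the matrix is symmetric by construction because the expression is unchanged under swapping the two indices.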
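Furthermore, since $e_\mu^{\ddagger} = -e_\mu$, we get:
$$\lfloor (e_\mu \psi)^{\ddagger} \psi \rfloor_{3,4}\, (e_\nu \psi)^{\ddagger} \psi$$
$$= \lfloor \psi^{\ddagger} (-1) e_\mu \psi \rfloor_{3,4}\, \psi^{\ddagger} (-1) e_\nu \psi$$
$$= \lfloor \psi^{\ddagger} e_\mu \psi \rfloor_{3,4}\, \psi^{\ddagger} e_\nu \psi$$
which allows us to conclude that $e_\mu$ and $e_\nu$ are self-adjoint within the double-copy product, entailing the interpretation of $g_{\mu\nu}$ as an observable.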
In the double-copy product, the metric tensor emerges as a double copy of Dirac currents. This formulation suggests that the metric tensor encodes the probabilistic structure of a quantum theory of gravity in the form of a rank-2 tensor, analogous to how the Dirac current encodes the probabilistic structure of a special relativistic quantum theory in the form of a 4-vector.
Let us now investigate the dynamics. We recall that the evolution operator (Definition 20) is:
$$T = \begin{bmatrix} e^{\frac12 \zeta(\mathbf{f}_1 + \mathbf{b}_1)} & & \\ & \ddots & \\ & & e^{\frac12 \zeta(\mathbf{f}_n + \mathbf{b}_n)} \end{bmatrix}$$
Acting on the wavefunction, the effect of this operator cascades down to the basis vectors via the double-copy product:
$$\lfloor \underbrace{\psi^{\ddagger} T^{\ddagger} e_\mu T \psi}_{\text{copy 1}} \rfloor_{3,4}\, \underbrace{\psi^{\ddagger} T^{\ddagger} e_\nu T \psi}_{\text{copy 2}}$$
which realizes an $\mathrm{SO}(3,1)$ transformation of the metric tensor via the action of the exponential of a bivector, and a double-copy unitary invariant transformation via the action of the exponential of a pseudo-scalar:
$$\lfloor \underbrace{\psi^{\ddagger}\, \underbrace{e^{-\frac12 \zeta \mathbf{f}} e_\mu e^{\frac12 \zeta \mathbf{f}}}_{\mathrm{SO}(3,1)\text{ evolution}}\, \underbrace{e^{-\frac12 \zeta \mathbf{b}} e^{\frac12 \zeta \mathbf{b}}}_{\text{unitary evolution}}\, \psi}_{\text{copy 1}} \rfloor_{3,4}\; \underbrace{\psi^{\ddagger}\, \underbrace{e^{-\frac12 \zeta \mathbf{f}} e_\nu e^{\frac12 \zeta \mathbf{f}}}_{\mathrm{SO}(3,1)\text{ evolution}}\, \underbrace{e^{-\frac12 \zeta \mathbf{b}} e^{\frac12 \zeta \mathbf{b}}}_{\text{unitary evolution}}\, \psi}_{\text{copy 2}}$$
In summary, this initial investigation has identified a scenario in which the metric tensor is measured using basis vectors. The evolution operator, governed by the Schrödinger equation, dynamically realizes SO(3,1) transformations on the metric tensor. Furthermore, the amplitudes associated with possible metric tensors are derived from a double-copy of unitary quantum theories acting on the basis vectors. This formulation simultaneously preserves the SO(3,1) symmetry, essential for describing spacetime structure, and the unitary symmetry, fundamental to quantum mechanics. It describes all change-of-basis transformations that an observer in 3+1D spacetime can perform prior to measuring a quantum system.

2.3.5. Starting Point for a Quantum Theory of Gravity

The symmetries of interest can be approached through two distinct strategies:
  • Particle Physics Approach: We impose the condition $\left(e^{\frac12(\mathbf{f} + \mathbf{b})}\right)^{\ddagger} \gamma_0\, e^{\frac12(\mathbf{f} + \mathbf{b})} = \gamma_0$ on the double-copy Dirac current (Equation 158). This constraint leads to the symmetries of the standard model of particle physics, as detailed in Section 2.3.3.
  • Gravitational Approach: We allow the double-copy Dirac current (which is equivalent to the metric tensor) to transform freely under SO(3,1). Instead of constraining the current itself, we focus on constructing SO(3,1)-invariant quantities from it. These invariants are typically formed as specific combinations of the double-copy Dirac current and its derivatives. A key example is the Einstein tensor, which remains invariant under SO(3,1) transformations of the metric. The Einstein tensor is particularly significant because it arises from the variation of the Einstein-Hilbert action, which is the simplest action leading to such an invariant.
The first strategy provides the gauge symmetries for the standard model of particle physics, while the second offers a path towards a quantum theory of gravity. In the following section, we will explore the gravitational approach in more detail.

2.3.6. Gravitons

Since the double-copy product of the Dirac current holds for any non-degenerate symmetric rank-2 tensor, it should be possible to show that gravitons can be expressed as a special case of this double-copy mechanism. As such, let us now investigate the wave equation in linearized gravity.
It is well known that the Einstein-Hilbert action:
$$S[g_{\mu\nu}] = \frac{c^4}{16 \pi G} \int R \sqrt{|g|}\, d^4x$$
under the assumption of a small perturbation $g_{\mu\nu} = \eta_{\mu\nu} + h_{\mu\nu}$, and working in de Donder gauge $\partial^\alpha h_{\alpha\mu} - \frac12 \partial_\mu h = 0$, where $h = \eta^{\mu\nu} h_{\mu\nu}$, can be reduced to its linearized form, which is:
$$S_{EH}^{(1)}[h_{\mu\nu}] = \int d^4x \left( \frac12 \partial_\mu h_{\rho\sigma} \partial^\mu h^{\rho\sigma} - \frac14 \partial_\mu h\, \partial^\mu h \right)$$
Furthermore, varying this action with respect to $h_{\mu\nu}$ and applying the transverse-traceless gauge yields the wave equation as the equation of motion:
$$\frac{\delta S}{\delta h_{\mu\nu}} = 0 \implies \Box h_{\mu\nu} = 0$$
We now wish to express the wave equation for $h_{\mu\nu}$ in terms of basis vectors $h_\mu$ and $h_\nu$, such that $h_{\mu\nu} = \frac12 (h_\mu h_\nu + h_\nu h_\mu)$. The expression of the wave equation becomes:
$$\Box\, \frac12 (h_\mu h_\nu + h_\nu h_\mu) = 0$$
We identify the solution for $h_\mu$ and $h_\nu$ by an ansatz:
$$h_\mu(x, t) = \mathrm{Re} \int d^3k \left( A_+(k)\, e_\mu^{+}(k) + A_\times(k)\, e_\mu^{\times}(k) \right) e^{\frac12 i (k \cdot x - \omega_k t)}$$
$$h_\nu(x, t) = \mathrm{Re} \int d^3k \left( A_+(k)\, e_\nu^{+}(k) + A_\times(k)\, e_\nu^{\times}(k) \right) e^{\frac12 i (k \cdot x - \omega_k t)}$$
The symbols $\times$ and $+$ designate the two polarizations.
Then, we promote $h_\mu(x, t)$ and $h_\nu(x, t)$ to operators:
$$\hat{h}_\mu(x, t) = \mathrm{Re} \int d^3k \left( \hat{A}_+(k)\, e_\mu^{+}(k) + \hat{A}_\times(k)\, e_\mu^{\times}(k) \right) e^{\frac12 i (k \cdot x - \omega_k t)}$$
$$\hat{h}_\nu(x, t) = \mathrm{Re} \int d^3k \left( \hat{A}_+(k)\, e_\nu^{+}(k) + \hat{A}_\times(k)\, e_\nu^{\times}(k) \right) e^{\frac12 i (k \cdot x - \omega_k t)}$$
Finally, the probabilities associated with these operators, corresponding to a metric tensor expectation value, are given using the double-copy product as follows:
$$\langle \hat{h}_{\mu\nu} \rangle = \frac12 \left( \langle \psi | \hat{h}_\mu \psi | \psi | \hat{h}_\nu \psi \rangle + \langle \psi | \hat{h}_\nu \psi | \psi | \hat{h}_\mu \psi \rangle \right)$$
We note that each copy individually applies the Born rule to one of two operators. In contrast, in the conventional perturbative approach to quantum gravity, the metric tensor $\hat{h}_{\mu\nu}$ is quantized, and its expectation value is calculated using the Born rule as follows: $\langle \hat{h}_{\mu\nu} \rangle = \langle \psi | \hat{h}_{\mu\nu} | \psi \rangle$. Thus, our probability calculation differs from the conventional approach, as it involves the product of two separate Born rule applications rather than just one.
The double-copy mechanism, identified by Bern, Carrasco, and Johansson (BCJ) [11], has been shown to simplify calculations of scattering amplitudes for gravitons in many cases. However, it remains an open question whether our double-copy mechanism carries similar advantages.
While we have described gravitons in terms of perturbations $h_{\mu\nu}$ to the metric, it is important to note that these perturbations transform under SO(3,1) in a way that preserves the SO(3,1) invariance of the Einstein tensor. The Einstein tensor, constructed from $h_{\mu\nu}$ and its derivatives, remains invariant under these transformations, thus completing (a special case of) the second strategy.

2.4. Dimensional Obstructions

In this section, we explore the dimensional obstructions that arise when attempting to resolve the entropy maximization problem for other dimensional configurations. We found that all geometric configurations except those we have explored here (e.g. $\mathrm{GA}(0) \cong \mathbb{R}$, $\mathrm{GA}(0,1) \cong \mathbb{C}$ and $\mathrm{GA}(3,1)$) are either obstructed or incomplete. By obstructed, we mean that the solution to the entropy maximization problem, $\rho$, does not satisfy the properties of a probability measure, and by incomplete we refer to the $\mathrm{GA}(2,0)$ case, where the metric is not an observable.

| Dimensions | Obstruction |
|---|---|
| GA(0) | Unobstructed: statistical mechanics |
| GA(0,1) | Unobstructed: quantum mechanics |
| GA(1,0) | Negative probabilities in the RQM |
| GA(2,0) | Incomplete / No metric measurement |
| GA(1,1) | Negative probabilities in the RQM |
| GA(0,2) | Not isomorphic to a real matrix algebra |
| GA(3,0) | Not isomorphic to a real matrix algebra |
| GA(2,1) | Not isomorphic to a real matrix algebra |
| GA(1,2) | Not isomorphic to a real matrix algebra |
| GA(0,3) | Not isomorphic to a real matrix algebra |
| GA(4,0) | Not isomorphic to a real matrix algebra |
| GA(3,1) | Unobstructed: quantum gravity, SU(3) × SU(2) × U(1) |
| GA(2,2) | Negative probabilities in the RQM |
| GA(1,3) | Not isomorphic to a real matrix algebra |
| GA(0,4) | Not isomorphic to a real matrix algebra |
| GA(5,0) | Not isomorphic to a real matrix algebra |
| GA(6,0) | No probability measure as a self-product |
Let us now demonstrate the obstructions mentioned above.
Theorem 16 
(Not isomorphic to a real matrix algebra). The determinant of the matrix representation of the geometric algebras in this category is either complex-valued or quaternion-valued, making them unsuitable as a probability.
Proof. 
These geometric algebras are classified as follows:
$$\mathrm{GA}(0,2) \cong \mathbb{H}$$
$$\mathrm{GA}(3,0) \cong M_2(\mathbb{C})$$
$$\mathrm{GA}(2,1) \cong M_2^2(\mathbb{R})$$
$$\mathrm{GA}(1,2) \cong M_2(\mathbb{C})$$
$$\mathrm{GA}(0,3) \cong \mathbb{H}^2$$
$$\mathrm{GA}(4,0) \cong M_2(\mathbb{H})$$
$$\mathrm{GA}(1,3) \cong M_2(\mathbb{H})$$
$$\mathrm{GA}(0,4) \cong M_2(\mathbb{H})$$
$$\mathrm{GA}(5,0) \cong M_2^2(\mathbb{H})$$
The determinant of these objects is valued in $\mathbb{C}$ or in $\mathbb{H}$, where $\mathbb{C}$ denotes the complex numbers and $\mathbb{H}$ denotes the quaternions.    □
Theorem 17 
(Negative Probabilities in the RQM). The even sub-algebra of these dimensional configurations, which is associated with the RQM part of the theory, allows for negative probabilities, making them unsuitable as an RQM.
Proof. 
This category contains three dimensional configurations:
$\mathrm{GA}(1,0)$:
Let $\psi = a + b e_1$, then:
$$(a + b e_1)^{\ddagger} (a + b e_1) = (a - b e_1)(a + b e_1) = a^2 - b^2 e_1 e_1 = a^2 - b^2$$
which is valued in $\mathbb{R}$.
$\mathrm{GA}(1,1)$:
Let $\psi = a + b e_0 e_1$, then:
$$(a + b e_0 e_1)^{\ddagger} (a + b e_0 e_1) = (a - b e_0 e_1)(a + b e_0 e_1) = a^2 - b^2 e_0 e_1 e_0 e_1 = a^2 - b^2$$
which is valued in $\mathbb{R}$.
$\mathrm{GA}(2,2)$:
Let $\psi = a + b\, e_0 e_{\emptyset} e_1 e_2$, where $e_0^2 = -1$, $e_{\emptyset}^2 = -1$, $e_1^2 = 1$, $e_2^2 = 1$, then:
$$\lfloor (a + \mathbf{b})^{\ddagger} (a + \mathbf{b}) \rfloor_{3,4}\, (a + \mathbf{b})^{\ddagger} (a + \mathbf{b})$$
$$= \lfloor a^2 + 2 a \mathbf{b} + \mathbf{b}^2 \rfloor_{3,4}\, (a^2 + 2 a \mathbf{b} + \mathbf{b}^2)$$
We note that $\mathbf{b}^2 = b^2 e_0 e_{\emptyset} e_1 e_2 e_0 e_{\emptyset} e_1 e_2 = b^2$, therefore:
$$\lfloor \psi^{\ddagger}\psi \rfloor_{3,4}\, \psi^{\ddagger}\psi = (a^2 + b^2 - 2 a \mathbf{b})(a^2 + b^2 + 2 a \mathbf{b})$$
$$= (a^2 + b^2)^2 - 4 a^2 \mathbf{b}^2$$
$$= (a^2 + b^2)^2 - 4 a^2 b^2$$
which is valued in $\mathbb{R}$.
In all of these cases the RQM probability can be negative.    □
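The $\mathrm{GA}(1,0)$ and $\mathrm{GA}(1,1)$ computations above share one structure: a scalar plus a single blade that squares to $+1$. A minimal sketch of that two-component algebra follows; the pair encoding `(a, b)` and the function names are assumptions made for illustration, not notation from the text.

```python
def mul(u, v):
    """Product in the algebra spanned by 1 and a blade e with e*e = +1.

    Elements are encoded as pairs (a, b) standing for a + b*e.
    """
    a1, b1 = u
    a2, b2 = v
    return (a1 * a2 + b1 * b2, a1 * b2 + b1 * a2)


def conj(u):
    """Conjugation that flips the sign of the blade part."""
    a, b = u
    return (a, -b)


def self_product(u):
    """conj(u) * u, whose scalar part is a^2 - b^2."""
    return mul(conj(u), u)


print(self_product((2.0, 1.0)))  # scalar part is positive
print(self_product((1.0, 2.0)))  # scalar part is negative
```

Whenever the blade coefficient exceeds the scalar coefficient in magnitude, the scalar part is negative, which is the sign problem described for these two configurations.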
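We repeat the following self-products[8] (Definition 16), which will help us demonstrate the next theorem:
$$\mathrm{GA}(0,1): \quad \varphi^{\dagger} \varphi$$
$$\mathrm{GA}(2,0): \quad \varphi^{\ddagger} \varphi$$
$$\mathrm{GA}(3,0): \quad \lfloor \varphi^{\ddagger} \varphi \rfloor_{3}\, \varphi^{\ddagger} \varphi$$
$$\mathrm{GA}(3,1): \quad \lfloor \varphi^{\ddagger} \varphi \rfloor_{3,4}\, \varphi^{\ddagger} \varphi$$
$$\mathrm{GA}(4,1): \quad (\lfloor \varphi^{\ddagger} \varphi \rfloor_{3,4}\, \varphi^{\ddagger} \varphi)^{\dagger} (\lfloor \varphi^{\ddagger} \varphi \rfloor_{3,4}\, \varphi^{\ddagger} \varphi)$$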
Theorem 18 
(No Metric Measurements). This obstruction applies to $\mathrm{GA}(2,0)$. A probability measure of at least four self-products is required for the theory to be observationally complete with respect to its geometry.
Proof. 
A metric measurement requires a probability measure of 4 self-products because the metric tensor is defined using 2 self-products of the gamma matrices:
$$g_{\mu\nu} = \frac12 (e_\mu e_\nu + e_\nu e_\mu)$$
Each pair of wavefunction products fixes one basis element. Thus, two pairs of wavefunction products are required to fix the geometry from the wavefunction. As probability measures of four self-products only begin to appear in 3D, $\mathrm{GA}(2,0)$ cannot produce a metric measurement as a quantum observable; thus the theory is not observationally complete with respect to its geometry.    □
Conjecture 1 
(No observables in 6D). The multivector representation of the norm in 6D cannot satisfy any observables.
Proof 
(Argument). In six dimensions and above, the self-product patterns found in Definition 16 collapse. The research by Acus et al.[12] in 6D geometric algebra demonstrates that the determinant, so far defined through self-products of the multivector, fails to extend into 6D. The crux of the difficulty is evident in the reduced case of a 6D multivector containing only scalar and grade-4 elements:
$$s(B) = b_1 B f_5(f_4(B) f_3(f_2(B) f_1(B))) + b_2 B g_5(g_4(B) g_3(g_2(B) g_1(B)))$$
This equation is not a multivector self-product but a linear sum of two multivector self-products[12].
The full expression is given in the form of a system of 4 equations, which is too long to list in its entirety. A small characteristic part is shown:
$$a_0^4 - 2 a_0^2 a_{47}^2 + b_2 a_0^2 a_{47}^2 p_{412} p_{422} + \langle 72 \text{ monomials} \rangle = 0$$
$$b_1 a_0^3 a_{52} + 2 b_2 a_0 a_{47}^2 a_{52} p_{412} p_{422} p_{432} p_{442} p_{452} + \langle 72 \text{ monomials} \rangle = 0$$
$$\langle 74 \text{ monomials} \rangle = 0$$
$$\langle 74 \text{ monomials} \rangle = 0$$
From Equation 208, it is possible to see that no observable $O$ can satisfy this equation because the linear combination does not allow one to factor it out of the equation.
$$b_1 O B f_5(f_4(B) f_3(f_2(B) f_1(B))) + b_2 B g_5(g_4(B) g_3(g_2(B) g_1(B))) = b_1 B f_5(f_4(B) f_3(f_2(B) f_1(B))) + b_2 O B g_5(g_4(B) g_3(g_2(B) g_1(B)))$$
Any equality of the above type between $b_1 O$ and $b_2 O$ is frustrated by the factors $b_1$ and $b_2$, forcing $O = 1$ as the only satisfying observable. Since the obstruction occurs within grade-4, which is part of the even sub-algebra, it is questionable whether a satisfactory theory (with non-trivial observables) can be constructed in 6D using this method.    □
This conjecture proposes that the multivector representation of the determinant in 6D does not allow for the construction of non-trivial observables, which is a crucial requirement for a relevant quantum formalism. The linear combination of multivector self-products in the 6D expression prevents the factorization of observables, limiting their role to the identity operator.
Conjecture 2 
(No probability measures as a self-product (above 6D)). The norms beyond 6D are progressively more complex than the 6D case, which is already obstructed.
These theorems and conjectures provide additional insights into the unique role of the unobstructed 3+1D signature in our proposal.
It is also interesting that our proposal is able to rule out $\mathrm{GA}(1,3)$ even though, in relativity, the choice of metric signature $(+,-,-,-)$ versus $(-,-,-,+)$ does not influence the physics. However, in geometric algebra, $\mathrm{GA}(1,3)$ represents 1 space dimension and 3 time dimensions. Therefore, it is not the signature itself that is ruled out but rather the specific arrangement of 3 time and 1 space dimensions, as this configuration yields quaternion-valued "probabilities" (i.e. $\mathrm{GA}(1,3) \cong M_2(\mathbb{H})$ and $\det M_2(\mathbb{H}) \in \mathbb{H}$).
Consequently, 3+1D is the only dimensional configuration (other than the "non-geometric" configurations of $\mathrm{GA}(0) \cong \mathbb{R}$ and $\mathrm{GA}(0,1) \cong \mathbb{C}$) in which a ’least biased’ solution to the problem of maximizing the Shannon entropy of quantum measurements relative to an initial preparation exists. This is an extremely constraining result regarding the possible spacetime configurations of the universe, and our ability (or inability) to construct a least biased theory to investigate it.

3. Discussion

The principle of maximum entropy[3] states that the probability measure that best represents the current state of knowledge about a system is the one with the largest entropy, constrained by prior data. In QM, an experiment typically consists of three stages: an initial preparation, followed by some transformations, and concluding with a final measurement of the system, which yields the result of the experiment. Consistent with the maximum entropy principle, our aim is to derive the ’least biased’ theory that connects the initial preparation p to its final measurement ρ . By formulating the theory as a solution to a maximization problem, rather than merely by axiomatic stipulation, we ensure (by mathematical proof) that the resulting framework is as unbiased as possible given the available information.
Using this methodology, fundamental physics can be formulated as the general solution to a maximization problem involving the Shannon entropy of all possible measurements of an arbitrary system relative to its initial preparation, subject to a vanishing phase constraint. The structure of the inferred theory is thus determined by the nature and generality of the employed constraint. In this paper, we have investigated three specific entropy maximization problems, each characterized by a different constraint and corresponding to a distinct level of description in physics:

| Constraint | Vanishing Phase | Inferred Theory | Wavefunction |
|---|---|---|---|
| $\bar{E} = \sum_i \rho_i E_i$ | none | SM | N.A. |
| $0 = \operatorname{tr} \sum_i \rho_i \begin{bmatrix} 0 & -E_i \\ E_i & 0 \end{bmatrix}$ | $\mathrm{U}(1)$ | QM | $\mathbb{C}^n$ |
| $0 = \frac12 \operatorname{tr} \sum_i \rho_i M_i$ | $\mathrm{Spin}^c(3,1)$ | RQM / QG | $(\mathbb{R} \times \mathrm{Spin}^c(3,1))^n$ |

where $n$ represents the size of the ensemble.
Despite the differences in constraints, the three theories formulated here share a common logical genesis, adhere to the same principle of maximum entropy, and qualify as the least biased theory for their given constraint.

3.1. Guarantee of Epistemic Soundness

The Born rule is the least biased probability measure operating on a complex Hilbert space, as established in Theorem 2. However, when the framework is extended to include geometric considerations, the Born rule is no longer the least biased measure. Instead, as demonstrated in Theorem 3, the double-copy product emerges as the least biased probability measure in the 3+1D setting. Interestingly, no solutions exist for other geometric configurations, as demonstrated in Section 2.4.

3.2. Guarantee of Ontological Soundness

Our approach inverts the traditional theory construction paradigm: instead of postulating abstract mathematical entities like wavefunctions and Hilbert spaces, we derive them from the measurements, which are the theory’s only constraint. Remarkably, the logical foundation of the theory is its ontology. This guarantees that the theory’s foundation matches what transpired in the lab and also renders it inherently resistant to falsification by the very measurements used in its construction.

4. Conclusion

This paper presents a novel approach to quantum theory construction by solving a maximization problem on the Shannon entropy of all possible measurements of a system relative to its initial preparation, under the constraint of a vanishing phase. By selecting the appropriate group of the vanishing phase, the solution resolves to quantum mechanics, relativistic quantum mechanics, or a candidate for a theory of quantum gravity. The resulting measure is invariant under a wide range of geometric transformations, including those generated by the gauge groups of the Standard Model, and leads to the metric tensor as an operator involving a double copy of Dirac currents, without additional assumptions. Again without additional assumptions, the theory automatically fails in geometric configurations other than 3+1D. This result aligns with the observed dimensionality and gauge symmetries of the universe, suggesting a possible explanation for its specific structure. This approach may offer a promising avenue for the unification of fundamental physical theories and may provide new insights into the underlying principles governing the structure of our universe.

Statements and Declarations

  • Competing Interests: The author declares that he has no competing financial or non-financial interests that are directly or indirectly related to the work submitted for publication.
  • Data Availability Statement: No datasets were generated or analyzed during the current study.
  • During the preparation of this manuscript, we utilized a Large Language Model (LLM), for assistance with spelling and grammar corrections, as well as for minor improvements to the text to enhance clarity and readability. This AI tool did not contribute to the conceptual development of the work, data analysis, interpretation of results, or the decision-making process in the research. Its use was limited to language editing and minor textual enhancements to ensure the manuscript met the required linguistic standards.

Appendix A. SM

Here, we solve the Lagrange multiplier equation of SM.
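$$\mathcal{L} = \underbrace{-k_B \sum_i \rho_i \ln \rho_i}_{\text{Boltzmann Entropy}} + \underbrace{\lambda \Big( 1 - \sum_i \rho_i \Big)}_{\text{Normalization Constraint}} + \underbrace{\beta \Big( \bar{E} - \sum_i \rho_i E_i \Big)}_{\text{Average Energy Constraint}}$$
We solve the maximization problem as follows:
$$0 = \frac{\partial \mathcal{L}(\rho_1, \dots, \rho_n)}{\partial \rho_i}$$
$$= -\ln \rho_i - 1 - \lambda - \beta E_i$$
$$= \ln \rho_i + 1 + \lambda + \beta E_i$$
$$\implies \ln \rho_i = -1 - \lambda - \beta E_i$$
$$\implies \rho_i = \exp(-1 - \lambda) \exp(-\beta E_i)$$
$$= \frac{1}{Z(\tau)} \exp(-\beta E_i)$$
The partition function is obtained as follows:
$$1 = \sum_i \exp(-1 - \lambda) \exp(-\beta E_i)$$
$$\implies (\exp(-1 - \lambda))^{-1} = \sum_i \exp(-\beta E_i)$$
$$Z(\tau) := \sum_i \exp(-\beta E_i)$$
Finally, the probability measure is:
$$\rho_i = \frac{1}{\sum_i \exp(-\beta E_i)} \exp(-\beta E_i)$$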
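As a small numerical illustration of this result, the sketch below evaluates the measure for a made-up three-level system. The energies, the value of the multiplier, and the function name `gibbs_measure` are assumptions chosen only for the example.

```python
import math


def gibbs_measure(energies, beta):
    """Return (probabilities, partition function) for weights exp(-beta * E_i)."""
    weights = [math.exp(-beta * energy) for energy in energies]
    partition = sum(weights)
    return [w / partition for w in weights], partition


energies = [0.0, 1.0, 2.0]  # illustrative energy levels
rho, partition = gibbs_measure(energies, beta=1.0)
mean_energy = sum(r * e for r, e in zip(rho, energies))

print(rho)
print(partition)
print(mean_energy)
```

The probabilities sum to one by construction, and the average energy computed from them is the value that the average energy constraint would have to match for this choice of multiplier.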

Appendix B. RQM in 3+1D

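$$\mathcal{L} = \underbrace{-\sum_i \rho_i \ln \frac{\rho_i}{p_i}}_{\text{Relative Shannon Entropy}} + \underbrace{\lambda \Big( 1 - \sum_i \rho_i \Big)}_{\text{Normalization Constraint}} + \underbrace{\zeta \Big( -\operatorname{tr} \frac12 \sum_i \rho_i M_i \Big)}_{\text{Vanishing Relativistic-Phase Anti-Constraint}}$$
The solution is obtained using the same step-by-step process as the 2D case, and yields:
$$\rho_i = \underbrace{\frac{1}{\sum_i p_i \det \exp(-\zeta \frac12 M_i)}}_{\mathrm{Spin}^c(3,1)\text{ Invariant Ensemble}}\; \underbrace{\det \exp(-\zeta \tfrac12 M_i)}_{\mathrm{Spin}^c(3,1)\text{ Born Rule}}\; \underbrace{p_i}_{\text{Initial Preparation}}$$
Proof. 
The Lagrange multiplier equation can be solved as follows:
$$0 = \frac{\partial \mathcal{L}(\rho_1, \dots, \rho_n)}{\partial \rho_i}$$
$$= -\ln \frac{\rho_i}{p_i} - 1 - \lambda - \zeta \operatorname{tr} \frac12 M_i$$
$$= \ln \frac{\rho_i}{p_i} + 1 + \lambda + \zeta \operatorname{tr} \frac12 M_i$$
$$\implies \ln \frac{\rho_i}{p_i} = -1 - \lambda - \zeta \operatorname{tr} \frac12 M_i$$
$$\implies \rho_i = p_i \exp(-1 - \lambda) \exp\Big( -\zeta \operatorname{tr} \frac12 M_i \Big)$$
$$= \frac{1}{Z(\zeta)}\, p_i \exp\Big( -\zeta \operatorname{tr} \frac12 M_i \Big)$$
The partition function $Z(\zeta)$, serving as a normalization constant, is determined as follows:
$$1 = \sum_i p_i \exp(-1 - \lambda) \exp\Big( -\zeta \operatorname{tr} \frac12 M_i \Big)$$
$$\implies (\exp(-1 - \lambda))^{-1} = \sum_i p_i \exp\Big( -\zeta \operatorname{tr} \frac12 M_i \Big)$$
$$Z(\zeta) := \sum_i p_i \exp\Big( -\zeta \operatorname{tr} \frac12 M_i \Big)$$
The determinant form of the solution follows from the identity $\det \exp(A) = \exp(\operatorname{tr} A)$.
   □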
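The structure of this solution, a prior reweighted by an exponential of a trace, can be sketched numerically. The example below is illustrative only: the prior, the numbers standing in for $\operatorname{tr} \frac12 M_i$, the sign convention of the exponent, and the name `tilted_measure` are all assumptions, not values from the derivation.

```python
import math


def tilted_measure(prior, half_traces, zeta):
    """Normalize prior[i] * exp(-zeta * half_traces[i]) into a probability measure.

    half_traces[i] stands in for tr(M_i / 2); the minus sign in the exponent
    is an assumed convention for this sketch.
    """
    weights = [p * math.exp(-zeta * t) for p, t in zip(prior, half_traces)]
    partition = sum(weights)
    return [w / partition for w in weights]


prior = [0.5, 0.3, 0.2]         # illustrative initial preparation
half_traces = [0.0, 0.4, -0.7]  # illustrative stand-ins for tr(M_i / 2)

unchanged = tilted_measure(prior, half_traces, zeta=0.0)
tilted = tilted_measure(prior, half_traces, zeta=1.0)

print(unchanged)
print(tilted)
```

With the multiplier set to zero the measure reduces to the initial preparation, which is the expected behaviour of a relative-entropy solution when the constraint is switched off.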

Appendix C. SageMath Program Showing $\lfloor u^{\ddagger} u \rfloor_{3,4}\, u^{\ddagger} u = \det M_u$

    from sage.algebras.clifford_algebra import CliffordAlgebra
    from sage.quadratic_forms.quadratic_form import QuadraticForm
    from sage.symbolic.ring import SR
    from sage.matrix.constructor import Matrix
    from sage.calculus.var import var

    # Define the quadratic form for GA(3,1) over the Symbolic Ring
    Q = QuadraticForm(SR, 4, [-1, 0, 0, 0, 1, 0, 0, 1, 0, 1])

    # Initialize the GA(3,1) algebra over the Symbolic Ring
    algebra = CliffordAlgebra(Q)

    # Define the basis vectors
    e0, e1, e2, e3 = algebra.gens()

    # Define the scalar variables for each basis element
    a = var('a')
    t, x, y, z = var('t x y z')
    f01, f02, f03, f12, f23, f13 = var('f01 f02 f03 f12 f23 f13')
    v, w, q, p = var('v w q p')
    b = var('b')

    # Create a general multivector, grade by grade
    udegree0 = a
    udegree1 = t*e0 + x*e1 + y*e2 + z*e3
    udegree2 = f01*e0*e1 + f02*e0*e2 + f03*e0*e3 + f12*e1*e2 + f13*e1*e3 + f23*e2*e3
    udegree3 = v*e0*e1*e2 + w*e0*e1*e3 + q*e0*e2*e3 + p*e1*e2*e3
    udegree4 = b*e0*e1*e2*e3
    u = udegree0 + udegree1 + udegree2 + udegree3 + udegree4

    # First copy: Clifford conjugate of u times u
    u2 = u.clifford_conjugate()*u

    # Split the product by grade, then flip the sign of grades 3 and 4
    u2degree0 = sum(term for term in u2.terms() if term.degree() == 0)
    u2degree1 = sum(term for term in u2.terms() if term.degree() == 1)
    u2degree2 = sum(term for term in u2.terms() if term.degree() == 2)
    u2degree3 = sum(term for term in u2.terms() if term.degree() == 3)
    u2degree4 = sum(term for term in u2.terms() if term.degree() == 4)
    u2conj34 = u2degree0 + u2degree1 + u2degree2 - u2degree3 - u2degree4

    I = Matrix(SR, [[1, 0, 0, 0],
                    [0, 1, 0, 0],
                    [0, 0, 1, 0],
                    [0, 0, 0, 1]])

    # MAJORANA MATRICES
    y0 = Matrix(SR, [[0, 0, 0, 1],
                     [0, 0, -1, 0],
                     [0, 1, 0, 0],
                     [-1, 0, 0, 0]])

    y1 = Matrix(SR, [[0, -1, 0, 0],
                     [-1, 0, 0, 0],
                     [0, 0, 0, -1],
                     [0, 0, -1, 0]])

    y2 = Matrix(SR, [[0, 0, 0, 1],
                     [0, 0, -1, 0],
                     [0, -1, 0, 0],
                     [1, 0, 0, 0]])

    y3 = Matrix(SR, [[-1, 0, 0, 0],
                     [0, 1, 0, 0],
                     [0, 0, -1, 0],
                     [0, 0, 0, 1]])

    # Matrix representation of the same multivector
    mdegree0 = a
    mdegree1 = t*y0 + x*y1 + y*y2 + z*y3
    mdegree2 = f01*y0*y1 + f02*y0*y2 + f03*y0*y3 + f12*y1*y2 + f13*y1*y3 + f23*y2*y3
    mdegree3 = v*y0*y1*y2 + w*y0*y1*y3 + q*y0*y2*y3 + p*y1*y2*y3
    mdegree4 = b*y0*y1*y2*y3
    m = mdegree0 + mdegree1 + mdegree2 + mdegree3 + mdegree4

    # Compare the double-copy product with the determinant
    print(u2conj34*u2 == m.det())
The program outputs

    True

showing, by computer-assisted symbolic manipulation, that the determinant of the real Majorana representation of a multivector $u$ is equal to the double-copy form: $\det M_u = \lfloor u^{\ddagger} u \rfloor_{3,4}\, u^{\ddagger} u$.
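For readers without SageMath, a restricted numerical spot check can be run in plain Python. This is a sketch under stated assumptions, not a replacement for the symbolic program: it only treats an even multivector of the form $u = s\,R\,B$, with $s$ a non-negative scale, $R$ a rotor in a single spatial plane and $B$ a pseudoscalar phase, for which the double-copy product reduces to $s^4$. The helper names are hypothetical.

```python
import math


def matmul(a, b):
    """Product of two 4x4 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(4)) for j in range(4)]
            for i in range(4)]


def combine(c1, m1, c2, m2):
    """Linear combination c1*m1 + c2*m2 of two 4x4 matrices."""
    return [[c1 * m1[i][j] + c2 * m2[i][j] for j in range(4)] for i in range(4)]


def det(m):
    """Determinant by Laplace expansion along the first row."""
    n = len(m)
    if n == 1:
        return m[0][0]
    total = 0.0
    for j in range(n):
        minor = [row[:j] + row[j + 1:] for row in m[1:]]
        total += (-1) ** j * m[0][j] * det(minor)
    return total


ID = [[1 if i == j else 0 for j in range(4)] for i in range(4)]
# Real Majorana matrices copied from the program above
G0 = [[0, 0, 0, 1], [0, 0, -1, 0], [0, 1, 0, 0], [-1, 0, 0, 0]]
G1 = [[0, -1, 0, 0], [-1, 0, 0, 0], [0, 0, 0, -1], [0, 0, -1, 0]]
G2 = [[0, 0, 0, 1], [0, 0, -1, 0], [0, -1, 0, 0], [1, 0, 0, 0]]
G3 = [[-1, 0, 0, 0], [0, 1, 0, 0], [0, 0, -1, 0], [0, 0, 0, 1]]

s, theta, beta = 1.5, 0.9, 0.3  # arbitrary illustrative parameters

# Rotor in the g1 g2 plane and pseudoscalar phase, both in closed form
rotor = combine(math.cos(theta / 2), ID, math.sin(theta / 2), matmul(G1, G2))
pseudo = matmul(matmul(G0, G1), matmul(G2, G3))
phase = combine(math.cos(beta / 2), ID, math.sin(beta / 2), pseudo)

# Matrix of u = s * R * B and its determinant
u_matrix = combine(s, matmul(rotor, phase), 0.0, ID)
det_u = det(u_matrix)

print(det_u, s ** 4)
```

The agreement of the two printed numbers is only a consistency check on this restricted family; the general identity is what the symbolic program establishes.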

References

  1. Dirac, P.A.M. The Principles of Quantum Mechanics; Number 27, Oxford University Press, 1981.
  2. Von Neumann, J. Mathematical Foundations of Quantum Mechanics: New Edition; Vol. 53, Princeton University Press, 2018.
  3. Jaynes, E.T. Information theory and statistical mechanics. Physical Review 1957, 106, 620.
  4. Jaynes, E.T. Information theory and statistical mechanics. II. Physical Review 1957, 108, 171.
  5. Kullback, S.; Leibler, R.A. On information and sufficiency. The Annals of Mathematical Statistics 1951, 22, 79–86.
  6. Shannon, C.E. A mathematical theory of communication. Bell System Technical Journal 1948, 27, 379–423.
  7. Hestenes, D. Spacetime physics with geometric algebra. American Journal of Physics 2003, 71, 691–714.
  8. Lundholm, D. Geometric (Clifford) algebra and its applications. arXiv preprint math/0605280 2006.
  9. Hestenes, D. Space-time structure of weak and electromagnetic interactions. Foundations of Physics 1982, 12, 153–168.
  10. Lasenby, A. Some recent results for SU(3) and Octonions within the Geometric Algebra approach to the fundamental forces of nature. arXiv preprint arXiv:2202.06733 2022.
  11. Bern, Z.; Carrasco, J.J.M.; Johansson, H. Perturbative quantum gravity as a double copy of gauge theory. Physical Review Letters 2010, 105, 061602.
  12. Acus, A.; Dargys, A. Inverse of multivector: Beyond p + q = 5 threshold. arXiv preprint arXiv:1712.05204 2017.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permits free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.