2. Generalized Dirac Type Equation
Let us introduce notations, which will be used
further on. The speed of light and the rationalized Planck’s constant will be
considered as unity.
Matrices constructed from Pauli matrices
A set of arbitrary complex numbers and a vector of
its three components
Let us define a 2×2 matrix of Lorentz
transformations given by the set of real rotation angles
and boosts
and a similar 4×4 transformation matrix
We also define a 4×4 matrix of Lorentz transformations
,
where
μ and
ν take values 0,1,2,3
which can also be written explicitly using the 4×4 matrices of turn generators
and boosts
Let's
define a 4×4 matrix
In
fact, we consider a quaternion with complex coefficients, which we multiply by
its conjugate quaternion (due to the complexity of the coefficients, these are
biquaternions, but we still use quaternionic conjugation, without complex
conjugation).
Let
us subject the set of complex numbers to the Lorentz transformation
Let
us write a relation whose validity for an arbitrary set of complex numbers can
be checked directly
The matrix in the simplest case is diagonal with equal
complex elements on the diagonal equal to the square of the length of the
vector in the metric of Minkowski space, which we denote . Both and do not change under any rotations and boosts, in
physical applications the invariance of is usually used, in particular, for the
four-component momentum vector this quantity is called the square of mass.
Since the matrices anticommutate with each other, for a vector whose components commute with each other, we have
just the simplest case with a diagonal matrix with on the diagonal. But if the components of vector do not commute, the matrix already has a more complex structure and carries
additional physical information compared to . For example, the vector may include the electron momentum vector and the
electromagnetic potential vector. The four-component potential vector is a
function of the four-dimensional coordinates of Minkowski space. The components
of the four-component momentum do not commute with the components of the
coordinate vector, respectively, and the coordinate function does not commute
with the momentum components, and their commutator is expressed through the
partial derivative of this function by the corresponding coordinate. If the components
of the vector do not commute, the matrix will no longer be invariant with respect to
Lorentz transformations.
Suppose that the complex numbers we consider
commute with all matrices, and note that the squares of all matrices are equal
to the unit 4×4 matrix
I
Taking
into account anticommutative properties of matrices and expressions for their
pairwise products we obtain
Consider
the case when
is
the sum of the momentum vector and the electromagnetic potential vector, which
is a function of coordinates
For now, we'll stick with the Heisenberg approach,
that is, we will consider the components of the momentum vector as operators for which there are commutation
relations with coordinates or coordinate functions such as . In this approach, the operators do not have to
act on any wave function.
Taking into account the commutation relations of
the components of the momentum vector and the coordinate vector, the commutator
of the momentum component and the coordinate function is expressed through the
derivative of this function by the corresponding coordinate, e.g.
As
a result, we have the expression
Similarly,
it can be shown that
does
not change under Lorentz transformations involving any rotations and boosts.
Taking
into account the electron charge we have
Let
us summarize our consideration. There is a correlation
Let's
analyze the obtained equality
Note that the quantity
is invariant to the Lorentz transformations
irrespective of whether the momentum and field components commute or not. To
solve this equation, we have to make additional simplifications. For example,
to arrive at an equation similar to the Dirac equation, we must equate
with the matrix
, where
is the square of the mass of a free electron. Then
With this substitution the generalized equation
almost coincides with the equation [6],
formula (43.25), the difference is that there is a plus sign before
, and instead of
there is
, in which the matrices
have the following form
A similar equation is given by Dirac in [7], Para. 76, Equation 24]; he does not use the matrices
, only the matrices
, but the signs of
the contributions of the magnetic and electric fields are the same.
Along
with the original form
it is possible to consider the form with a
different order of the factors. It can be shown that this leads to a change in
the sign of the electric field contribution
Since , unlike , is invariant to Lorentz transformations, it would
be logical to replace it by . At least both these matrices are diagonal, and in
the case of a weak field their diagonal elements are close. Nevertheless, the
approach based on the Dirac equation leads to solutions consistent with
experiment.
The matrix in the general case has complex elements and is
not diagonal, and in the Dirac equations instead of it is substituted the
product of the unit matrix by the square of mass , the physical meaning of such a substitution is
not obvious. Apparently it is implied that it is the square of the mass of a
free electron. But the square of the length of the sum of the lengths of the
electron momentum vectors and the electromagnetic potential vector is not equal
to the sum of the squares of the lengths of these vectors, that is, it is not
equal to the square of the mass of the electron, even if the square of the
length of the potential vector were zero. But, for example, in the case of an
electrostatic central field, even the square of the length of one potential
vector is not equal to zero. Therefore, it is difficult to find a logical
justification for using the mass of a free electron in the Dirac equation in
the presence of an electromagnetic field. Due to the noted differences, the
solutions of the generalized equation can differ from the solutions arising
from the Dirac equation.
In
the case when there is a constant magnetic field directed along the z-axis, we
can write down
Here . Only when the field is directed along the z-axis,
the matrix is diagonal and real because the third Pauli
matrix is diagonal and real. And if the field is weak, can be approximated by the matrix. This is probably why it is customary to
illustrate the interaction of electron spin with the magnetic field by choosing
its direction along the z-axis. In any other direction is not only non-diagonal, but also complex, so
that it is difficult to justify the use of .
When the influence of the electromagnetic field was
taken into account, no specific characteristics of the electron were used. When
deriving a similar result using the Dirac equation, it is assumed that since
the electron equation is used, the result is specific to the electron. In our
case Pauli matrices and commutation relations are used, apparently these two
assumptions or only one of them characterize the properties of the electron,
distinguishing it from other particles with non-zero masses.
The proposed equation echoes the Dirac equation, at
least from it one can obtain the same formulas for the interaction of spin and
electromagnetic field as with the Dirac equation, and in the absence of a field
the proposed equation is invariant to the Lorentz transformations. In contrast,
to prove the invariance of the Dirac equation even in the absence of a field,
the infinitesimal Lorentz transformations are used, but the invariance at
finite angles of rotations and boosts is not demonstrated. The proof of
invariance of the Dirac equation is based on the claim that a combination of
rotations at finite angles can be represented as a combination of infinitesimal
rotations. But this is true only for rotations or boosts around one axis, and
if there are at least two axes, this statement is not true because of
non-commutability of Pauli matrices, which are generators of rotations, so that
the exponent of the sum is not equal to the product of exponents if the sum
includes generators of rotations or boosts around different axes. By a direct
check we can verify that the invariance of the Dirac equation takes place at
any combination of rotations, but only under the condition of zero boosts,
i.e., only in a rest frame of reference, any boost violates the invariance.
A test case for any theory is the model of the
central electrostatic field used in the description of the hydrogen atom, in
which the components of the vector potential are zero
If
again we equate the left part with
,
we obtain
Introducing
the notations (
)
we
obtain
If we substitute operators acting on the wave
function instead of momentum components into the equation, we obtain a
generalized analog of the relativistic Schrödinger equation, in which the wave
function has four components and changes as a spinor under Lorentz
transformations. Using the substitutions
the
equation for the four-component wave function
before
all transformations has the form
and
after transformations
Once
again, note that the matrix
is
not diagonal and real.
All the above deductions are also valid when
replacing 4×4 matrices
by 2×2 matrices
, since their commutative and anticommutative
properties are the same. The corresponding generalized equation is of the form
and
the equation for the now two-component wave function looks like
In deriving his equation, Dirac [7], paragraph 74 noted that as long as we are
dealing with matrices with two rows and columns, we cannot obtain a
representation of more than three anticommuting quantities; to represent four
anticommuting quantities, he turned to matrices with four rows and columns. In
our case, however, three anticommuting matrices are sufficient, so the wave
function can also be two-component. Dirac also explains that the presence of
four components results in twice as many solutions, half of which have negative
energy. In the case of a two-component wave function, however, no negative
energy solutions are obtained. Particles with negative energy in this case also
exist, but they are described by the same equation in which the signs of all
four matrices or are reversed.
One
would seem to expect similar results from other representations of the momentum
operator, e.g.,
[6]
, formula
(24.15)]
under the assumption that this representation can
describe a particle with spin one. But this expectation is not justified, since
the last three matrices do not anticommutate, and therefore the quadratic form
constructed on their basis is not invariant under Lorentz transformations.
Let's see what happens to
when we change the sign of the matrices. When
changing the sign of
we have
swapping the places of the multipliers. The
multipliers do not necessarily commute, so
is not invariant with respect to the change of
sign of
, which can be interpreted as a reflection in time.
The same picture of invariance absence we have at the sign change of matrices
, i.e. at spatial reflection
If
we change the signs of all matrices at once, we have
i.e. invariance. The physical interpretation of
this case can be given by taking into account the change of signs of the
matrices in equation
which
can be rewritten as
it can be interpreted as an equation for a particle
with negative energy and positive charge, i.e. for the positron. Thus, the
generalized equation with matrices describes a particle, and with matrices an antiparticle. However, in the next section of
the paper another approach is described in which we do not have to deal with
negative energy, it is positive for all particles, and the difference between
particles and antiparticles consist in a different sign of their mass. And the
mass in the generally accepted sense is simply equal to its modulus.
If one consistently adheres to the Heisenberg
approach and does not involve the notion of wave function, it is not very clear
how to search for solutions of the presented equations. The Schrödinger
approach with finding the eigenvalues of the
matrix and their corresponding eigenfunctions can
help here.
In the left-hand side are the operators acting on
the wave function, and in the right-hand side is a constant matrix on which the
wave function is simply multiplied. This equality must be satisfied for all
values of the four-dimensional coordinates at once. Then is not fixed but can take a set of possible
values, finding all these values is the goal of solving the equation.
Thus, we have arrived at an equation containing a
matrix which is non-diagonal, complex and in general
depends on the coordinates . After the standard procedure of separating the
time and space variables, we can go to a stationary equation in which there
will be no time dependence, but the dependence the matrix on the coordinates will remain. It is possible to
ignore the dependence of on the coordinates and its non-diagonality and
simply replace this matrix by a unit matrix with a coefficient in the form of
the square of the free electron mass. Then the equation will give solutions
coinciding with those of the Dirac equation. But this solution can be
considered only approximate and the question remains how far we depart from
strict adherence to the principle of invariance with respect to Lorentz
transformations and how far we deviate from the hypothetical true solution,
which is fully consistent with this principle. To find this solution, we need
to approach this equation without simplifying assumptions and look for a set of
solutions, each of which represents an eigenvalue matrix of arbitrary form and its corresponding
four-component eigenfunction.
When
searching for solutions, one can try to use two equations
successively applying the operators with first
order derivatives included in them to the eigenfunctions already found,
similarly as described in Schrödinger's work [8].
3. Equation for the Spinor Coordinate Space
Let
us return to the set of arbitrary complex numbers, for simplicity we will call
it a vector
Let
us consider in connection with it arbitrary four-component complex spinors
Among all possible vectors, let us select a set of
such vectors for which there is a representation of components through
arbitrary complex spinors
and
there is another way to calculate them
Further
we will assume that both spinors are identical, then the vector constructed
from them is
has real components, and we will assume that this
is the electron momentum vector constructed from the complex momentum spinor
Consider
the complex quantity
where we introduce one more complex spinor, which
in the future we will give the meaning of the complex coordinate spinor
Coordinate
vector of the four-dimensional Minkowski space
is
obtained from the coordinate spinor by the same formulas
Thus, the vector in the Minkowski space is not a
set of four arbitrary real numbers, but only such that are the specified
bilinear combinations of components of completely arbitrary complex spinors
Accordingly, the components of the vector in
Minkowski space are interdependent, from this dependence automatically follow
the relations of the special theory of relativity between space and time. For
the same reason, the coordinates of Minkowski space cannot serve as independent
variables in the equations. From the commutative properties of matrices, which are generators of rotations and
boosts with respect to which the length of vectors is invariant, quantum
mechanics automatically follows. Indeed, the commutation relations between the
components of momenta are related to the noncommutativity of rotations in some
way, and from them the commutation relations between the components of
coordinates and momenta are directly deduced. And from these relations the
differential equations are derived.
And since we do not doubt the truth of the theory
of relativity and quantum mechanics, we cannot doubt the reality of spinor
space, which by means of the simplest arithmetic operations generates our space
and time.
The quantity
is invariant under the Lorentz transformation
simultaneously applied to the momentum and coordinate spinor, which
automatically transforms both corresponding vectors as well
This
quantity does not change for any combination of turns and boosts
Accordingly,
the exponent
characterizes the propagation process of a plane
wave in spinor space with phase invariant to Lorentz transformations.
Let
us apply the differential operator to the spinor analog of a plane wave
Applying
this operator at another definition of the phase gives the same eigenvalue
that is, two different eigenfunctions correspond to
this eigenvalue, but in the second case the phase in the exponent is not
invariant with respect to the Lorentz transformation, so we will use the first
definition.
are complex spinors, which, under the transformation
is
affected by the same matrix
,
then the complex quantity
is invariant under the action on the momentum
spinor
of the transformation
.
is an eigenvalue of the differential operator, and
the plane wave is the corresponding
m eigenfunction, which is a solution
of the equation
Here
denotes
the complex function of complex spinor coordinates.
When substantiating the Schrödinger equation for a
plane wave in four-dimensional vector space, an assumption is made (further
confirmed in the experiment) about its applicability to an arbitrary wave
function. Let us make a similar assumption about the applicability of the
reduced spinor equation to an arbitrary function of spinor coordinates, that
is, we will consider this equation as universal and valid for all physical
processes.
Let us clarify that by the derivative on a complex
variable from a complex function we here understand the derivative from an
arbitrary stepped complex function using the formula that is valid at least for
any integer degrees
In
particular, this is true for the exponential function, which is an infinite
power series.
It is not by chance that we denote the eigenvalue
by the symbol
m, because if we form the momentum vector from the
momentum spinor
included in the expression for the plane wave
then
for the square of its length the following equality will be satisfied
That is the square of the modulus
m has the
sense of the square of the mass of a free particle, which is described by a
plane wave in spinor space as well as by a plane wave in vector space. For the
momentum spinor of a fermionic type particle having in the rest frame the
following form
is real and not equal to zero, and for the
bosonic-type momentum spinor having in the rest frame the following form
i.e.,
the boson satisfies the plane wave equation in spinor space with zero
eigenvalue.
For the momentum spinor of a fermion-type particle
we can consider another form in the rest system
then
the mass will be real and negative
This particle with negative mass can be treated as
an antiparticle, and in the rest frame its energy is equal to its mass modulo,
but it is always positive
To describe the behavior of an electron in the
presence of an external electromagnetic field, it is common practice to add the
electromagnetic potential vector to its momentum vector. We use the same
approach at the spinor level and to each component of the momentum spinor of
the electron we add the corresponding component of the electromagnetic
potential spinor. For simplicity, the electron charge is equal to unity.
Further we need an expression for the commutation
relation between the components of the momentum spinor, to which is added the
corresponding component of the electromagnetic potential spinor, which is a
function of the spinor coordinates
Let
us replace the momenta by differential operators
and
find the commutation relation
Let us apply the proposed equation to analyze the
wave function of the electron in a centrally symmetric electric field, this
model is used to describe the hydrogen-like atom. For the components of the
vector potential of a centrally symmetric electric field it is true that
As a result, it is possible to accept
We are looking for a solution of the spinor equation; we do not consider the electron's spin yet
This equation can be interpreted in another way.
Let us take the invariant expression
And let's do the substitution
We will consider this equation as an equation for determining the eigenvalues of
and the corresponding eigenfunctions
Let's introduce the notations
this quantity does not change under rotations and boosts and is some analog of the interval defined for Minkowski space and
this quantity represents time in four-dimensional vector space.
An interesting fact is that time is always a positive quantity. As an assumption it can be noted that since we observe that time value goes forward, i.e. the value of
t grows, and it is possible only due to scaling of all components of spinor space, such scaling leads to increase of distance between any two points of Minkowski space. As a result, with the passage of time the Minkowski space should expand, herewith at first relatively quickly, and then more and more slowly.
As a result, we have an equation for determining the eigenvalues of
m and their corresponding eigenfunctions
Instead of looking for solutions to this equation directly, we can first try substituting already known solutions to the Schrödinger equation for the hydrogen-like atom. If
is one of these solutions, we need to find its derivatives over all spinor components
Let’s pay attention to the shift in priorities. In the Schrödinger equation one looks for energy eigenvalues, while here it is proposed to look for mass eigenvalues, it seem more natural to us. The mass of a free particle is an invariant of the Lorentz transformations, and in the bound state the mass of the particle has a discrete series of allowed values, each of which corresponds to an energy eigenvalue, and the eigenfunction of these eigenvalues is the same. But these energy eigenvalues are not the same as the energy eigenvalues of the Schrödinger equation, because the equations are different. When an electron absorbs a photon, their spinors sum up and the mass of the electron changes. If the new mass coincides with some allowed value, the electron enters a new state. The kay idea here is the assumption that the interaction of spinors occurs simply by summing them.
The advantages of considering physical processes in spinor coordinate space may not be limited to electrodynamics. It may turn out, for example, that the spinor space is not subject to curvature under the influence of matter, as it takes place in the general theory of relativity for the vector coordinate space. On the contrary, it can be assumed that it is when the components of vector coordinate space are computed from the coordinate spinor that the momentum spinor with a multiplier of the order of the gravitational constant is added to this spinor. This results in a warp that affects other massive bodies.
To account for the electron spin, we will further represent the electron wave function as a four-component spinor function of four-component spinor coordinates
where the coefficients
are complex quantities independent of coordinates. In fact, as shown at the end of the paper, the wave function is a linear combination of such right-hand sides with operator coefficients.
We will search for the solution of the wave equation considered in the first part of this paper
Let's express the left part through the components of the momentum spinor
Let's distinguish the direct products of vectors in these matrices
Let's introduce the notations
Let us substitute differential operators instead of spinor components
Then the quantities included in the wave equation
will have the form
Let us consider the case of a free particle and represent the electron field as a four-component spinor function of four-component spinor coordinates
For a free particle, the components of the momentum spinor commute with each other, so all components of the matrix are zero.
Let us use the model of a plane wave in spinor space
Substituting the plane wave solution into the differential equation, we obtain the algebraic equation
Let us take into account the commutativity of the momentum components, besides, let us introduce the notations
for the quantities which are invariant under any rotations and boosts, then we obtain
Additionally, introducing notation for Lorentz invariant quantities
we obtain
We see that in the case of a plane wave in spinor space, the matrix in the left part of the equation is diagonal and remains so at any rotations and boosts, the diagonal element also does not change.
In this case we can consider the matrix
in the right part to be diagonal with the same elements on the diagonal
, then the equation can be rewritten as an equation for the problem of finding eigenvalues and eigenfunctions
Let us compare our equation with the Dirac equation [
6], formula (43.16)]
In the rest frame of reference, the three components of momentum are zero and the equation is simplified
That is, in the rest frame the Dirac equation and the spinor equation analyzed by us look identically and contain a diagonal matrix. The corresponding problem on eigenvalues and eigenvectors of these matrices has degenerate eigenvalues, which correspond to the linear space of eigenfunctions. In this space, one can choose an orthogonal basis of linearly independent functions, and this choice is quite arbitrary. For example, in [
9], formula (2.127)], solutions in the form of plane waves in the vector space have been proposed for the Dirac equation in the rest frame
and the following spinors are chosen as basis vectors
For transformation to a moving coordinate system in [
9], formula (2.133)] the following formula is used
The basis spinors form a complete system, that is, any four-component complex spinor can be represented as their linear combination and this arbitrary spinor will be a solution to the problem on eigenvalues and eigenfunctions in a resting coordinate system. The choice of the given particular basis has disadvantages, because if to find a four-dimensional current vector from any of these basis functions
then this current in the rest frame of reference
has non-zero components, and the square of the length of the current vector is zero. It turns out that a resting electron creates a current, which contradicts physical common sense. Since we have freedom of choice of the basis, it is reasonable to choose the spinor for the wave function as some set of momentum spinor components, for example
The proportionality factor is chosen so that in the rest frame the zero component of the current is equal to the charge of, for example, an electron or a positron. If the momentum spinor in the rest frame has the form
then the momentum vector in this rest frame of reference will be
and the current vector
The same momentum vector in the rest frame of reference can be obtained from different spinors, e.g,
after a 30-degree boost along the z-axis we get
After scaling the spinors by the factor, similar relations are true for the current vector. Thus, electrons can have the same momentum and current vector but different spinors, i.e., they are characterized by different spins. As it is supposed, the electron here has two physical degrees of freedom, since in a rest frame of reference one can choose the components and to be real.
Thus defined spinor wave function for a free particle is invariant to Lorentz transformations, since in this case the mass of electron
, its charge and the phase of the plane spinor wave
do not change at rotations and boosts. The matrix on the left side of the equation does not change either, remaining diagonal with
on the diagonal.
It is logical to use the same considerations when choosing the basis for the wave function of the photon, whose mass, i.e., the eigenvalues of the wave function equation, are also degenerate and thus equal to zero. In this case, the choice of the proportionality factor between the spinor of the wave function and the momentum spinor is not so obvious, one can, for example, consider the option of
For a fermion, which can be an electron or a positron in the rest frame takes place
, so the quantity
which, unlike the mass
M in the Dirac equation, is complex in the general case, is also real for the fermion and can be positive for the electron or negative for the positron. The charge is proportional to the mass with a minus sign, since the electron charge is considered negative and the positron positive. For simplicity it is possible to consider the mass of the electron as negative and that of the positron as positive, then the charge will be proportional to the mass without changing the sign.
For the momentum spinor of a boson, such as a photon, it is true that
, so its mass is zero
The given constructions are not abstract, but describe the physical reality, since the results of the processes occurring in the spinor space are displayed in the Minkowski vector space. In particular, the momentum vector corresponding to the momentum spinor has the following parameters
the square of the length is equal to the square of the mass of the electron or positron
And to the spinor wave function at some point in spinor space corresponds the vector wave function ()
(which for a plane wave coincides with the current vector), taking its value in the corresponding point of physical space with coordinates
The vector wave function can be compared in meaning to the square of the modulus of the conventional scalar wave function, in particular is equal to this square and has the meaning of probability. The conventional scalar wave function itself is closer in meaning to the spinor wave function considered here, they both have complex values, and the four-component wave functions of the electron have in both cases the same meaning.
The arbitrary choice of the basis of the linear space of the eigenvectors of the matrix takes place only for a free particle. In the general case the matrix K is not zero, the wave equation has no solution in the form of plane waves in spinor space and ceases to be invariant with respect to Lorentz transformations, and the eigenvalues become nondegenerate.
We propose to extend the scope of applicability of the presented equation consisting of differential operators in the form of partial derivatives on the components of coordinate spinors with a nonzero matrix
K
not only to the case of a plane wave, but to any situation in general. This transition is analogous to the transition from the application of the Schrödinger equation to a plane wave in vector space to its application in a general situation. The legitimacy of such transitions should be confirmed by the results of experiments.
This equation will be called the equation for the spinor wave function defined on the spinor coordinate space. Here the matrix
is, generally speaking, neither diagonal nor real, but it does not depend on the coordinates and is determined solely by the parameters of the electromagnetic field. Only in the case of a plane wave it is diagonal and has on the diagonal the square of the mass of the free particle. We can try to simplify the problem and require that the matrix
is diagonal with the same elements on the diagonal
, then the equation can be rewritten in the form of the equation for the problem of search of eigenvalues and eigenfunctions for any quantum states
This approach is pleasant in the Dirac equation, where the mass is fixed and equated to the mass of a free particle, and at the same time results giving good agreement with experiment are obtained.
We are of the opinion that the spinor equation is more fundamental than the relativistic Schrödinger and Dirac equations, it is not a generalization of them, it is a refinement of them, because it describes nature at the spinor level, and hence is more precise and detailed than the equations for the wave function defined on the vector space.
Let us consider the proposed equation for the special case when the particle is in an external electromagnetic field, which we will also represent by a four-component spinor function at a point of the spinor coordinate space
We will apply to the wave function of the electron the operators corresponding to the components of the momentum spinor, putting for simplicity the electron charge equal to unity
Note that the electromagnetic potential vector can be calculated from the electromagnetic potential spinor by the standard formula
The advantage of the spinor description over the vector description is that instead of summing up the components of the momentum and electromagnetic potential vectors as is usually done
now we sum the spinor components and then the resulting vector is
in addition to the usual momentum and field vectors, contains an additional term
taking real values and describing the mutual influence of the fields of the electron and photon.
After the addition of the electromagnetic field the components of the momentum spinor do not commute, the corresponding commutators are found above
Let's find commutators for other operators
Further we will use these and analogous relations
Since the second factor
in the left-hand side of the equation has a simpler structure than the first factor, perhaps as a first step we should find the eigenvalues and eigenfunctions of the equation
and use them when solving the equation as a whole.
Let's calculate the expressions included in the equation
Let us consider the situation when the electromagnetic potential can be described by a plane wave in spinor space
When the electromagnetic potential is represented by a plane wave, the field created by a charged particle is not taken into account, so this model adequately describes only the situation when the electromagnetic field is strong enough and the influence of the particle charge can be neglected.
It would be interesting in this context to consider for the presented spinor model the case of a centrally symmetric electric field and to find solutions of the spinor wave equation for the hydrogen-like atom, taking into account the presence of spin at the electron. For such a model we can take
As mentioned above, we can substitute into the equation the already known exact solutions of the Dirac equation for the hydrogen-like atom by expressing the components of the coordinate vector and derivatives on them through the components of the coordinate spinor and derivatives on them. It is likely that the solution of the Dirac equation would not make the spinor equation an identity; it would be evidence that more arbitrary assumptions are made in the Dirac equation than in the spinor equation, and that the latter claims to be a better description of nature.
We can also consider the case of a constant magnetic field directed along the z-axis
We see that the scalar potential
grows with time, but does not depend on spatial coordinates, and the vector potential does not depend on time, so that there is no electric field. In this case
The equation considered up to now is rather cumbersome, therefore we would like to have a simpler and compact relativistic invariant equation for the fermion, taking into account the presence of a half-integer spin. Such equation really exists, its derivation is given in section 4 of the paper. Here we will give its form for the electron in the presence of the electromagnetic field
where
In general case electric and magnetic fields are expressed through partial derivatives of components of the vector potential by components of the space vector. We also can find the expression through these fields for the derivatives of the spinor components of the electromagnetic potential by the components of the coordinate spinor. To do this, we first find all derivatives
then express the components of the vector potential through the components of the spinor potential, substitute the components of the electric and magnetic fields instead of the derivatives of the components of the vector potential by the components of the coordinate vector, and then find the required derivatives from the resulting system of linear equations.
From general considerations taking into account the substitutions
it is possible to write the commutation relations for the components of the momentum spinor and functions from the components of the coordinate spinor
All other combinations commute with each other. The constant coefficients
c and
d possibly include a minus sign, an imaginary unit and some degree of the rationalized Planck’s constant. Their values can be determined using known commutation relations for vector components, e.g.
and using expressions of vector components through spinor components
Let's return to the relations
In this form they are equivalent, but if an external field is added, a difference arises, since in one case the field is added at the vector level and in the other at the spinor level
These relations correspond to differential equations including the relativistic Schrödinger equation
For a free particle the eigenfunctions and eigenvalues solving these equations should coincide, but in the presence of an external field the eigenvalues and the corresponding eigenfunctions will differ because of the above mentioned difference in summation in one case of vector components and in the other case of spinor components.
While the Dirac equation is sometimes referred to as extracting the square root of the Klein-Gordon equation, here we see a different way of doing it.
Let us describe in more detail the square of the length of the momentum vector
To obtain this result, we did not have to make assumptions about commutability of the spinor components among themselves. Accordingly, a similar expression takes place for the phase of a plane wave in vector space
Further we assume that the components of the momentum spinor commute, which takes place for a free particle, then we obtain
On the other hand, we can write
Thus, the results of calculations coincide.
Let us compare the phases of plane waves in vector and spinor spaces. Let us hypothesize that the plane wave in spinor space has a more complicated form than it was supposed earlier in the paper, namely, it contains an additional conjugate multiplier
On the one hand, this assumption does not cancel the reasoning given in the paper concerning the equations and derivatives, since the derivatives on the components of the spatial spinor from the first conjugate factor are equal to zero, and on the other hand, the phase of the wave in this form is closer to the generally accepted phase of a plane wave in vector space.
For the boson the momentum spinor has a special form
Thus phases calculated by two methods do not coincide with each other, though both of them are invariant under Lorentz transformations
However, if to impose restrictions also on the form of the coordinate spinor
then
That is, there is an equality
The form of the coordinate spinor defined in this way leads to the zero length of the coordinate vector, but the boson is characterized by this property. That is, it turns out that the requirement of coincidence of two definitions of the phase allows revealing the form of the coordinate spinor of the particle.
If a fermion is at rest in some coordinate system, then all components of its momentum vector, except the zero component, i.e., energy, are zero. Therefore, the phase of the corresponding plane wave in vector space depends on time but does not depend on spatial coordinates. It turns out that oscillations in time occur synchronously throughout space, and there is no wave propagation in the usual sense. On the contrary, the phase of the wave in spinor space depends on the spatial coordinates in such a situation. In addition, two fermions with different spins correspond to the same momentum vector, so the phases of the corresponding waves do not differ. But the phases of the wave in spinor space for fermions with different spins are different even when they are stationary.
If we accept the proposed hypothesis, then we need to change the equation for which the plane wave is an eigenfunction
where
As a result, we have the equation
Although the complex multiplier in front of the exponent in the right-hand side does not change with rotations and boosts, it now depends on the coordinates.
Let's consider the equation
then the function corresponding to the free particle
is its solution with
This equation can be ascribed a universal character and its solutions can be sought at different possible values of
for real physical conditions, for example, in the presence of an electromagnetic field. Note that the function with imaginary unit under the exponent
tends to zero at removal from the origin, i.e. the wave function of the plane wave is localized in space.
It was an expression for the amplitude of probability; the probability itself has the form
This quantity will not be infinite even at zero coordinates, since the coordinate components enter the denominator only as a product with the momentum components, and such a product cannot be zero, since this is forbidden by commutation relations and the uncertainty relation.
The photon has a mass equal to zero, so the right side of the equation is also zero, and it does not have a multiplier in the denominator in the solution, so the photon is not localized in space.
4. Path Integral and Second Quantization in Spinor Coordinate Space
Based on the above, we can modify the theory of the integral over trajectories. We will consider it in the notations in which it is presented in [
10]. For a free scalar field with sources J(X) the trajectory integral has the form
It includes the action of
and the Lagrangian density for the free field
For convenience and clarity, the following notations are introduced
For the general case the Lagrangian density has the form
where V
-polynomial over the field
.
Substituting the Lagrangian into the Euler equation
the field equation of motion is obtained.
The free field theory is developed for a special kind of polynomial
In summary, Euler's equation defines the equation of motion
The notations used here are
Thus, there is a correspondence of the Lagrangian and the equation of motion for the free field
Our proposal is to replace the Lagrangian in vector coordinate space by the Lagrangian in spinor coordinate space. For this purpose we use the equation of motion in spinor coordinate space and we want to find the Lagrangian for which the Euler equation defines this equation of motion
For the sake of clarity, we use the same notation for the spinor coordinate derivative as for the vector coordinate derivative; the context allows us to distinguish between them
Let us write the Lagrangian plus sources in the form
And let's substitute the Lagrangian into the Euler equation
For the case of a free field the derivative operators commute, so we can write
It is pleasant that the Euler equation in invariant form works also in this situation, so that we obtain the desired form of the equation of motion in the spinor coordinate space. It is important that the proposed Lagrangian has a relativistically invariant form, even in the general case, and not only at commuting derivatives. The polynomial has the form
In the case of a free field we restrict ourselves to the first term of the polynomial
Then the Lagrangian density and the equation of motion for the scalar field in spinor coordinate space have the form
For a free field when the derivative operators commute, we obtain
In the spinor equation of motion there is a plus sign before the mass, although in the rest of the paper there was a minus sign. To return to the minus sign it is enough to put a plus sign in front of the polynomial in the Lagrangian.
Now we have to find the integral over the trajectories, which, along with the Lagrangian, includes the sources
The components of spinors are complex, and we have already noted that the derivatives on complex variables are applied to the degree functions, which, most likely, can describe physical fields, respectively, and at integration the finding of a first-form for the degree function of a complex variable can be treated similarly, i.e. as a first-form from the degree function.
It is possible to recover Planck's constant, which provides a transition to the classical limit
One of the steps in computing the integral over trajectories in [
10] is to find the free propagator from Eq.
the solution of which has the form
herewith
In our case, we want to find
After integration by parts by analogy with [
10], Chapter 1.3] we obtain for the special case of a free field
In the process of calculation, it is necessary to find the solution of the equation
For this purpose, we pass to the momentum space by means of the integral transformation
The assumed propagator has the form
which is verified by substitution into Eq. Here it is assumed that the representation of the delta function
One can see the difference between the propagators, since in one case is real and positive, while in spinor space m is complex in general.
We can use the relation
where
in which it is taken into account that the fermion mass is real. Now the propagator has the form
The derivatives of the scalar field on spinor coordinates can be expressed through the derivatives on vector coordinates
If in the right part to represent the wave function as a plane wave in vector space
then in the left part it should be represented as a plane wave of a special form in spinor space
Only in this case the left and right parts will be dimensionally consistent, e.g.
In any case, a complete coincidence will not be obtained due to the mismatch of dimensionless exponents of the exponents
Since we call the field under consideration a scalar field, we expect its value to be invariant to Lorentz transformations. But how to formalize this statement and to what exactly does this transformation apply? We propose to consider that the value of a scalar field is the scalar product of the representatives of a spinor field, which is the most fundamental field in nature, and vectors, tensors and, among others, scalars are formed from the spinors representing it. The scalar product is defined by means of the metric tensor of the spinor space. From any two spinors we can obtain a scalar, in general the complex case. But if we want to obtain a scalar with real values, we must impose some restrictions on the original spinors. For example, to any spinor u we can correspond a scalar
U taking real values, whose value does not change under the action of the Lorentz transformation on the spinor and the action of the same transformation on the conjugate spinor
When a spinor and its conjugate spinor are simultaneously rotated or boosted by some angle, the scalar undergoes a rotation or boost by zero angle.
We can find the derivatives of the scalar by the components of the coordinate spinor
The components of the coordinate spinor are complex quantities, the derivative on them is taken virtually formally, since physical fields can be represented by power functions of the components of the coordinate spinor and its conjugate.
What advantages does the transition from the integral over trajectories in vector space to the integral over trajectories in spinor space provide? A possible answer is that it may be easier to compute this integral, or the integration may not lead to divergences. If the spinor coordinate space is indeed more fundamental, and the vector coordinate space is an offspring of it, then we may benefit from this transition in any case.
Now let us move from the scalar field to the field of an electron, that is, the field of a particle with half-integer spin. We will use gamma matrices in the Weyl basis
Let us consider the linear combination of these matrices with components of the momentum vector as coefficients, substituting the expressions of the vector components through the components of the momentum spinor
Let us represent the matrix
as a sum of direct products of spinors
For a free field the components of the momentum spinor commute, therefore
Complex mass
does not change at rotations and boosts for an arbitrary complex spinor. Moreover, by a direct check it is possible to check that for an arbitrary spinor
For a free field, when all components of the momentum spinor commute, we can write the relativistic equation of motion of the fermionic field
Where the matrix of derivatives
is obtained from the matrix
by substitutions
However, it is generally accepted to write for this field another equation, the Dirac equation, which does not possess the invariance property anymore
And for the more general case, when the momentum components do not commute, we need to write the equation
Further we will consider the equation of motion for a free field
We again want to find the integral over trajectories
for which we need the Lagrangian, from which the Euler equation is derived equation of motion
It is suggested to use the Lagrangian
Let us substitute the Lagrangian into the Euler equation and obtain the equation of motion
Since the Lagrangian includes, along with the derivatives of
, the derivatives of
, it is logical to use a different definition of Euler's equation
Then for the free field case when the derivative operators commute with each other, we obtain the equation of motion
If the derivative operators do not commute, additional terms will appear in the equation of motion in the form of matrices similar to the
matrix, and these additional terms will not necessarily coincide with
. In this connection it is necessary to consider the Lagrangian as more fundamental notion than the equation of motion and to derive the equation of motion from the Lagrangian, i.e. to take as a basis not the derivation of the equation of motion in momentum space, with what we started, but to take as an axiom the form of the Lagrangian in the form of field derivatives in the relativistically invariant form. Then, if to follow the invariance principle quite strictly, we should start from the product of two matrices, i.e. to use the Lagrangian
Or, not limited to fermions,
Nevertheless, further we will search for the integral over trajectories in the simplest case with the originally proposed Lagrangian and in addition assume commutativity of all derivative operators
After integration by parts, we presumably obtain
Then it is necessary to find the solution of the equation
For this purpose, we pass to the momentum space by means of the integral transformation
We get the equation
with the decision
Here we use the equality, which is valid for an arbitrary complex spinor
It is based on the correlation verified earlier in our work
it is also taken into account that we consider fermions whose mass is real.
As a result, the propagator has the form
here we assume the validity of the relation
In the case of a fermion, the mass in integration is a fixed real quantity, and it can be considered negative for the electron and positive for the positron. Theoretically, the mass can be complex or purely imaginary. If we put mass equal to zero, it may be possible to apply this Lagrangian to describe massless particles. I wonder if there are particles with complex or purely imaginary mass. In the latter case, the square of the mass will still be positive and the particle will satisfy the Klein-Gordon equation. Such particles can interact among themselves, but not with particles whose mass is real.
Let's return to the question about the use of completely relativistically invariant Lagrangian
Let's find the product of matrices
The assumption that the following equalities hold is used
Further we find the product of matrices
Again we use the equality
and consider that the mass of the fermion is real, i.e.
therefore, the relations are valid
But the main advantage of the obtained matrix is the following
This matrix does not change at rotations and boosts, so it can be stated that the equation of motion, e.g., in the form of
where
is truly relativistically invariant, respectively we can use the invariant Lagrangian
to which corresponds the relativistically invariant propagator of the boson having a real mass, which is negative for the electron and positive for the positron
Let us compare the propagator in spinor space with the propagator of the fermion given in [
10], formula II.2.22 and formula II.5.18]
In [
10] this formula is obtained by applying the second quantization procedure or using Grassmann integrals. The results are similar, but the integration here is performed in the vector momentum space. The Dirac equation and the corresponding Lagrangian are not relativistically invariant. Besides, here the mass is considered always real and positive, but then it is not clear how electron and positron differ from the point of view of this formula.
Let us consider in detail the derivation of the expression for the fermion propagator in [
10], Sec. II.2. It is based on the assumption of relativistic invariance of the Dirac equation and therefore the calculations are carried out in the rest frame, and then the result is extended to an arbitrary frame of reference. Thus for the field spinor u the spinor
u_ is defined and it is asserted that the value of
is a Lorentz scalar. But it is not so, since in the spinor space the scalar is formed exclusively by the scalar product of two spinors, where the metric tensor of the spinor space is included
there are no other ways to construct a scalar in the spinor space.
Nevertheless, this fact and the fact of non-invariance of the Dirac equation itself do not cancel the value of the second quantization procedure and the final form of the fermion propagator, which allows to make accurate predictions of the experimental results.
We hope that the proposed Lagrangian for the spinor coordinate space can find application in the calculation of the integral over trajectories, but already in the spinor space. Whether such a calculation in spinor space has an advantage over the calculation of the integral over trajectories in vector space can be shown by their real comparison.
By analogy with the propagator of a photon, more precisely of a massive vector meson, given in [
10], formula I.5.3
we can assume the propagator form in the spinor space without revealing for compactness the expression of the momentum vector components through the momentum spinor components
Among other things, the equation
can be modified to take into account the electromagnetic potential, the electron charge is taken as a unit
and apply, in particular, to analyze the radiation spectrum of a hydrogen-like atom.
Let us formulate again the difference between the equations, the second of which is derived from the Dirac equation with gamma matrices in the Weyl basis
The difference is, the matrix
(p) remains unchanged under any rotations and boosts applied to the spinor
, while the matrix
(p) changes under any rotations and boosts.
Equally radically different are the corresponding Lagrangians and propagators.
By analogy with [
10], Chapter II.2] we will carry out the procedure of second quantization of the fermion field. Let us write the equation
in the momentum space, for which we apply the integral transformation
Let's substitute the wave function into the equation and obtain
Let us define two sets of four reference spinors
where
And let's express the matrix through them
Developing the idea of invariance, we pass to the set of reference spinors with wider filling, but continuing to form matrices possessing the invariance property
Let's express through the reference spinors the matrix
and matrix
here
Let us deviate from the canonical approach to the definition of antisymmetry of the field and consider the following scheme of reasoning. Let us decompose the fermion field into plane waves with operator coefficients
Let's impose the anticommutation conditions on the operator coefficients
We consider the rest anticommutators to be equal to zero. Then we can write the expression for the anticommutator of the field with its transposed version but without its complex conjugation
We will consider this relation as a proof of the anti-symmetry of the fermion wave function under the stipulated anticommutation relations.
Let us calculate the total energy of the fermion field
here
Each summand in brackets represents the operator of the number of particles with a certain reference spinor. The operator's action consists of consecutive application of the annihilation operator and the operator of the birth of a particle. On initial examination, it would appear that the energy associated with zero-point fluctuations in the vacuum has been overlooked. However, an examination of the final expression reveals that the field always possesses a constant energy, regardless of the particles that contribute to it. This constant energy of the field can be interpreted as the energy of zero-point fluctuations of the vacuum.
It is important that all the above deductions are valid in any frame of reference, while the proof of anticommutativity of the fermion field in [
10] is carried out for the rest frame.
The following relations were taken into account in the derivation
Other components of the total field momentum are calculated by the formula
Total momentum
is a vector in Minkowski space. Multiplied by the ratio of the electron or positron charge to their mass, it turns into the vector of the total current of the fermionic field
But the real interest is not the total current of the fermionic field, but the density of this current as a function of coordinates
where
is a four-dimensional probability density current, which is transformed as a four-dimensional vector by Lorentz transformations. Multiplication by
transforms it into a four-dimensional current density. Let us perform a series of transformations analogous to those presented by Dirac in [
11], Lecture 11].
For an arbitrary component of the total momentum we have
Following Dirac's argument in [
11], the value of
can be treated as either a Hamiltonian or a total energy operator, with
representing the birth operator and
representing the annihilation operator.
In [
11] the quantization procedure includes the use of one definite Lorentzian reference frame, i.e. it is not invariant. In our case all deductions are valid in any reference frame in the spinor space, and it means invariance to change of reference frames in the Minkowski space also.
The following relations are used in the transformations
The chain of reasoning can be organized in a slightly different way as well
Here it is taken into account that
Let us draw an analogy between our approach and the relations given in [
12], Volume 1, Chapter 3, Section 3.3.1]. There it is noted that the birth and annihilation operators of the fermionic field must satisfy such commutation relations that the equality expressing translational invariance is satisfied
which in differential form is written as
On the basis of these relations the anticommutation relations between the birth and annihilation operators are derived. The coordinates here are the components of the Minkowski vector space.
We can perform a similar consideration in the spinor coordinate space, describing for it the translational invariance by the relations
Or we can try to approximate the formulation of translational invariance in Minkowski space by means of the formula
Let us calculate the total mass of the fermion field
The ratios used in the derivation are
Let us give an interpretation of the operator coefficients for this approach
creates and destroys a particle with mass , spin up and momentum in the interval , is the operator of the number of such particles
creates and destroys a particle with mass , spin up and momentum in the interval , is the operator of the number of such particles
creates and destroys a particle with mass , spin up and momentum in the interval , is the operator of the number of such particles
creates and destroys a particle with mass , spin up and momentum in the interval , is the operator of the number of such particles
Note that
and
are translated into each other by a linear transformation, this is also true for other pairs of spinors
It is known [
10], formula II.1.30] that the charge conjugation operation transforms an electron into a positron with a change of the sign of the charge. Let us apply the charge conjugation to the reference spinor
As a result not only transforms to , but also changes a sign of mass due to the imaginary unit in the charge conjugation matrix. This confirms our thesis that the charge conjugation synchronously changes signs of charge and mass.
The properties of all particles and operators are summarized in a table
| creates |
destroys |
particle |
number |
mass |
spin |
wave sign |
|
|
|
|
|
up |
+ |
|
|
|
|
|
up |
- |
|
|
|
|
|
up |
- |
|
|
|
|
|
up |
+ |
|
|
|
|
|
down |
+ |
|
|
|
|
|
down |
- |
|
|
|
|
|
down |
- |
|
|
|
|
|
down |
+ |
|
|
|
|
|
up |
+ |
|
|
|
|
|
up |
- |
|
|
|
|
|
up |
- |
|
|
|
|
|
up |
+ |
|
|
|
|
|
down |
+ |
|
|
|
|
|
down |
- |
|
|
|
|
|
down |
- |
|
|
|
|
|
down |
+ |
By the words destroys the particle it should be understood that this operator transforms this particle into the particle , and the operator performs the reverse transformation of into . Since both of these particles have the same mass, the total mass of the fermionic field does not change from these transformations. The mass m itself can have any sign.
If the operator acts on the particle , it transforms it into the particle , the action on any other particle gives zero.
Let us see what result we get if we apply the canonical definition of anticommutativity of the fermionic field.
where
In deriving this result, the following relations are taken into account
The last operation of taking the value
out from under the sign of the integral seems doubtful because of its dependence on the momentum over which the integration is performed. If one closes one's eyes to this, as is generally accepted in the literature, in particular in [
10], this relation is taken to be interpreted as a proof of the anti-symmetry of the fermion wave function under the stipulated anticommutation relations. The only situation where this is unquestionably true is when considering in a rest system where boosts are excluded, energy is equal to mass, and invariant to rotations.
It is noteworthy that the antisymmetric treatment, whether or not complex conjugation is considered, yields a diagonal matrix that is invariant in one case but not in the other. It is encouraging to observe that the set of reference spinors remain consistent.
It is crucial to note that the proposed invariant approach cannot be realized within the Minkowski vector space. To achieve this, it is necessary to transition to the spinor space. This reiterates the secondary role of the Minkowski space in comparison to the spinor space.
Dirac's equation can be expressed in both spinor and vector spaces, a fact that led Dirac to discover it. In contrast, the invariant equation can be written in spinor space but not in vector space, which explains why it was unknown.
Let us write down the propagator of the fermionic field and the fermionic field invariant equation of motion using the proposed matrices
The equation of motion has the form
where
The equation is relativistically invariant, respectively we can use the invariant Lagrangian
to which corresponds the relativistically invariant fermion propagator
The equation can be modified to take into account the electromagnetic potential, the electron charge is taken as a unit
and apply, in particular, to analyze the emission spectrum of the hydrogen-like atom.
It is our intention to extend the proposed approach to the description of the bosonic field. In order to transition to such a description, it would be prudent to consider the existence of the fermion annihilation process as a potential indicator. It is reasonable to posit a relationship between electrons and photons, given their ability to transform into one another. It is therefore proposed that the photon field be expressed by means of analogous reference spinors, which can be obtained from the fermion spinors but which have zero mass. Let us utilize the table of properties of the reference spinors to identify suitable pairs for describing the annihilation process. The two spinors should exhibit identical properties, with the exception of their respective masses, which should have opposite signs. For example,
Let us represent the photon field as a decomposition
We are interested in the coordinate function
It represents a four-dimensional probability density current, which is transformed by the Lorentz transformations as a four-dimensional vector. It is proposed that this vector, possibly with some dimensional coefficient, be treated as a vector potential of the electromagnetic field
The final step is to verify that, under the specified commutation relations between the operator coefficients, the commutation relations of the following type are satisfied for the vector potential
or
with a certain constant
.