2. Generalized Dirac Type Equation
Let us introduce notations, which will be used further on. The speed of light and the rationalized Planck’s constant will be considered as unity.
Matrices constructed from Pauli matrices
A set of arbitrary complex numbers and a vector of its three components
Let us define a 2×2 matrix of Lorentz transformations given by the set of real rotation angles
and boosts
and a similar 4×4 transformation matrix
We also define a 4×4 matrix of Lorentz transformations
, where
μ and
ν take values 0,1,2,3
which can also be written explicitly using the 4×4 matrices of turn generators
and boosts
Let's define a 4×4 matrix
In fact, we consider a quaternion with complex coefficients, which we multiply by its conjugate quaternion (due to the complexity of the coefficients, these are biquaternions, but we still use quaternionic conjugation, without complex conjugation).
Let us subject the set of complex numbers to the Lorentz transformation
Let us write a relation whose validity for an arbitrary set of complex numbers can be checked directly
The matrix in the simplest case is diagonal with equal complex elements on the diagonal equal to the square of the length of the vector in the metric of Minkowski space, which we denote . Both and do not change under any rotations and boosts, in physical applications the invariance of is usually used, in particular, for the four-component momentum vector this quantity is called the square of mass.
Since the matrices anticommutate with each other, for a vector whose components commute with each other, we have just the simplest case with a diagonal matrix with on the diagonal. But if the components of vector do not commute, the matrix already has a more complex structure and carries additional physical information compared to . For example, the vector may include the electron momentum vector and the electromagnetic potential vector. The four-component potential vector is a function of the four-dimensional coordinates of Minkowski space. The components of the four-component momentum do not commute with the components of the coordinate vector, respectively, and the coordinate function does not commute with the momentum components, and their commutator is expressed through the partial derivative of this function by the corresponding coordinate. If the components of the vector do not commute, the matrix will no longer be invariant with respect to Lorentz transformations.
Suppose that the complex numbers we consider commute with all matrices, and note that the squares of all matrices are equal to the unit 4×4 matrix
I
Taking into account anticommutative properties of matrices and expressions for their pairwise products we obtain
Consider the case when
is the sum of the momentum vector and the electromagnetic potential vector, which is a function of coordinates
For now, we'll stick with the Heisenberg approach, that is, we will consider the components of the momentum vector as operators for which there are commutation relations with coordinates or coordinate functions such as . In this approach, the operators do not have to act on any wave function.
Taking into account the commutation relations of the components of the momentum vector and the coordinate vector, the commutator of the momentum component and the coordinate function is expressed through the derivative of this function by the corresponding coordinate, e.g.
As a result, we obtain
where
As a result, we have the expression
Similarly, it can be shown that
The matrix
does not change under Lorentz transformations involving any rotations and boosts.
Taking into account the electron charge we have
Let us summarize our consideration. There is a correlation
where
Let's analyze the obtained equality
Note that the quantity
is invariant to the Lorentz transformations irrespective of whether the momentum and field components commute or not. To solve this equation, we have to make additional simplifications. For example, to arrive at an equation similar to the Dirac equation, we must equate
with the matrix
, where
is the square of the mass of a free electron. Then
With this substitution the generalized equation almost coincides with the equation [[
6], formula (43.25)], the difference is that there is a plus sign before
, and instead of
there is
, in which the matrices
have the following form
A similar equation is given by Dirac in [[
7], Paragraph 76, Equation 24]; he does not use the matrices
, only the matrices
, but the signs of the contributions of the magnetic and electric fields are the same.
Along with the original form
it is possible to consider the form with a different order of the factors. It can be shown that this leads to a change in the sign of the electric field contribution
Since , unlike , is invariant to Lorentz transformations, it would be logical to replace it by . At least both these matrices are diagonal, and in the case of a weak field their diagonal elements are close. Nevertheless, the approach based on the Dirac equation leads to solutions consistent with experiment.
The matrix in the general case has complex elements and is not diagonal, and in the Dirac equations instead of it is substituted the product of the unit matrix by the square of mass , the physical meaning of such a substitution is not obvious. Apparently it is implied that it is the square of the mass of a free electron. But the square of the length of the sum of the lengths of the electron momentum vectors and the electromagnetic potential vector is not equal to the sum of the squares of the lengths of these vectors, that is, it is not equal to the square of the mass of the electron, even if the square of the length of the potential vector were zero. But, for example, in the case of an electrostatic central field, even the square of the length of one potential vector is not equal to zero. Therefore, it is difficult to find a logical justification for using the mass of a free electron in the Dirac equation in the presence of an electromagnetic field. Due to the noted differences, the solutions of the generalized equation can differ from the solutions arising from the Dirac equation.
In the case when there is a constant magnetic field directed along the z-axis, we can write down
Here . Only when the field is directed along the z-axis, the matrix is diagonal and real because the third Pauli matrix is diagonal and real. And if the field is weak, can be approximated by the matrix. This is probably why it is customary to illustrate the interaction of electron spin with the magnetic field by choosing its direction along the z-axis. In any other direction is not only non-diagonal, but also complex, so that it is difficult to justify the use of .
When the influence of the electromagnetic field was taken into account, no specific characteristics of the electron were used. When deriving a similar result using the Dirac equation, it is assumed that since the electron equation is used, the result is specific to the electron. In our case Pauli matrices and commutation relations are used, apparently these two assumptions or only one of them characterize the properties of the electron, distinguishing it from other particles with non-zero masses.
The proposed equation echoes the Dirac equation, at least from it one can obtain the same formulas for the interaction of spin and electromagnetic field as with the Dirac equation, and in the absence of a field the proposed equation is invariant to the Lorentz transformations. In contrast, to prove the invariance of the Dirac equation even in the absence of a field, the infinitesimal Lorentz transformations are used, but the invariance at finite angles of rotations and boosts is not demonstrated. The proof of invariance of the Dirac equation is based on the claim that a combination of rotations at finite angles can be represented as a combination of infinitesimal rotations. But this is true only for rotations or boosts around one axis, and if there are at least two axes, this statement is not true because of non-commutability of Pauli matrices, which are generators of rotations, so that the exponent of the sum is not equal to the product of exponents if the sum includes generators of rotations or boosts around different axes. By a direct check we can verify that the invariance of the Dirac equation takes place at any combination of rotations, but only under the condition of zero boosts, i.e., only in a rest frame of reference, any boost violates the invariance.
A test case for any theory is the model of the central electrostatic field used in the description of the hydrogen atom, in which the components of the vector potential are zero
If again we equate the left part with
, we obtain
Introducing the notations (
)
we obtain
If we substitute operators acting on the wave function instead of momentum components into the equation, we obtain a generalized analog of the relativistic Schrödinger equation, in which the wave function has four components and changes as a spinor under Lorentz transformations. Using the substitutions
the equation for the four-component wave function
before all transformations has the form
and after transformations
Once again, note that the matrix is not diagonal and real.
All the above deductions are also valid when replacing 4×4 matrices
by 2×2 matrices
, since their commutative and anticommutative properties are the same. The corresponding generalized equation is of the form
where
and the equation for the now two-component wave function looks like
In deriving his equation, Dirac [[
7], Paragraph 74] noted that as long as we are dealing with matrices with two rows and columns, we cannot obtain a representation of more than three anticommuting quantities; to represent four anticommuting quantities, he turned to matrices with four rows and columns. In our case, however, three anticommuting matrices are sufficient, so the wave function can also be two-component. Dirac also explains that the presence of four components results in twice as many solutions, half of which have negative energy. In the case of a two-component wave function, however, no negative energy solutions are obtained. Particles with negative energy in this case also exist, but they are described by the same equation in which the signs of all four matrices
or
are reversed.
One would seem to expect similar results from other representations of the momentum operator, e.g., [6, formula (24.15)]
under the assumption that this representation can describe a particle with spin one. But this expectation is not justified, since the last three matrices do not anticommutate, and therefore the quadratic form constructed on their basis is not invariant under Lorentz transformations.
If one consistently adheres to the Heisenberg approach and does not involve the notion of wave function, it is not very clear how to search for solutions of the presented equations. The Schrödinger approach with finding the eigenvalues of the
matrix and their corresponding eigenfunctions can help here.
In the left-hand side are the operators acting on the wave function, and in the right-hand side is a constant matrix on which the wave function is simply multiplied. This equality must be satisfied for all values of the four-dimensional coordinates at once. Then is not fixed but can take a set of possible values, finding all these values is the goal of solving the equation.
Thus, we have arrived at an equation containing a matrix which is non-diagonal, complex and in general depends on the coordinates . After the standard procedure of separating the time and space variables, we can go to a stationary equation in which there will be no time dependence, but the dependence the matrix on the coordinates will remain. It is possible to ignore the dependence of on the coordinates and its non-diagonality and simply replace this matrix by a unit matrix with a coefficient in the form of the square of the free electron mass. Then the equation will give solutions coinciding with those of the Dirac equation. But this solution can be considered only approximate and the question remains how far we depart from strict adherence to the principle of invariance with respect to Lorentz transformations and how far we deviate from the hypothetical true solution, which is fully consistent with this principle. To find this solution, we need to approach this equation without simplifying assumptions and look for a set of solutions, each of which represents an eigenvalue matrix of arbitrary form and its corresponding four-component eigenfunction.
Let us return to the question of Lorentz invariance of the expression
As it was noted, this expression does not change at rotations and boosts in Minkowski space only if the components of
commute with each other. If they do not commute, the matrix
changes under Lorentz transformations. Two parts can be distinguished in this matrix
The first row represents the unit matrix multiplied by a value that still does not change under Lorentz transformations. All changes occur in the last two rows. In the particular case of electrodynamics, we have
Here the first line is invariant, but the last three are not. The only way to ensure complete invariance of
is to require these three lines to be zero. Let us again consider the commutation relations, but now we will not assume that the momentum components commute with each other, only the potential components still commute with each other. Now we can write the relations of the form
Such values as
always enter
as a sum with the component of the field, in this case the electric one
If we formally define a new value
and suppose that m does not change at rotations and boosts, and also to take into account the presence of charge at the electron, it is possible to require for this and all other similar sums the fulfilment of the condition
The value
can be regarded as a component of velocity, and velocity not in the usual sense, as a derivative of the spatial coordinate by time, but simply as a component of momentum divided by the inertial mass m. Then the above equality can be interpreted in the spirit of Newton's law, namely, that the acceleration multiplied by the mass is equal to the force acting on the side of the electric field. If all such equalities are fulfilled, only the first line will remain in the quantity
, and it will be invariant under Lorentz transformations. It is possible to go further, and to assume equality of the masses appearing here, namely
As a result, we obtain a system of equations
It is the fulfilment of these equations that causes the mass
and
to have the meaning we are accustomed to, that is, not only invariant under Lorentz transformations, but also unchanged by changes in momentum. We can introduce tensor notations
where
The resulting system of equations describes not only uniform but also accelerated motion. The presence of an external field leads to a change in momentum, and vice versa, any change in momentum perturbs the potential and generates an electromagnetic field.
For quantum mechanics we can replace the momentum components in all equations by the derivative operators
This also applies to equations from the second group, where mixed derivatives arise
As a result, we obtain for the wave function a system of equations with second order derivatives, the innovation compared to the commonly used equations is the presence in the equations of mixed derivatives on all components of the coordinate vector.
The equations proposed here initially take into account the non-commutability of momentum components, their derivation relies only on the unconditional fulfilment (even in coupled systems) of the requirement of invariance to Lorentz transformations for the product of conjugate quaternions with arbitrary coefficients
Putting all equations together, we write a truly relativistic system of equations
This system is a generalization of the relativistic Schrödinger equation. The essence of the generalization consists not only in taking into account the spin of the electron, which takes place already in the Dirac equation, but also takes into account the non-commutability of the momentum components. It can be assumed that the solutions of this generalized system will give exact values for stationary electron energy levels in the atom, for which no radiative corrections will be needed.
If not to substitute the coordinate derivative instead of the momentum component and to remain in the framework of classical physics, the system of equations
describes the motion of a macroscopic charged particle in the presence of an electromagnetic or other potential field, for example, gravitational field. Let us note the nontrivial fact that even in classical physics, when an electric field acts on a charge, one should use the combination of derivatives
instead of the simple acceleration
.
By means of the antisymmetric Levy-Civita symbol we transform antisymmetric tensors into dual tensors
and we use Maxwell's equations written in compact form
Let us apply the derivative operator to our proposed equations
then taking into account Maxwell's equations we obtain
These formulas describe the behavior of field and charge that results from their mutual influence.
If in the presence of an arbitrary potential there is no particle in the moving point, then our equations are the homogeneous Maxwell equations for an arbitrarily moving point. In the particular case of uniform motion, they transform into the ordinary Maxwell equations. If a charge is placed in the point, we obtain inhomogeneous Maxwell equations for an arbitrarily moving source.
The equations we propose can even be considered as a derivation of Maxwell's equations. Taking our equations as a basis and equating all derivatives of momentum to zero, we obtain as a residue exactly Maxwell's equations for a stationary or uniformly moving point.
The conditions expressed by the second line of our equations may be too strong, since they require that each pair of brackets with derivatives is zero. But invariance can also be achieved with a weaker requirement that only their sum as a whole is zero. That is, each pair of brackets can deviate from zero; the main thing is that these deviations are compensated in the total sum. This can work both in classical and quantum mechanics. A hint on the validity of this approach is given by Maxwell's equations, in which conditions are imposed not on individual derivatives, but on their sums. In addition, it is intuitively clear that the components having similarity to velocity should be considered in the sum in order not to depend on the rotations of the coordinate system.
3. Equation for the Spinor Coordinates Space
Let us consider the set of arbitrary complex numbers, for simplicity we will call it a vector
and let us consider arbitrary four-component complex spinors
Among all possible vectors, let us select a set of such vectors for which there is a representation of components through arbitrary complex spinors
and there is another way to calculate them
Further we will assume that both spinors are identical, then the vector constructed from them is
has real components, and we will assume that this is the electron momentum vector constructed from the complex momentum spinor
Consider the complex quantity
where we introduce one more complex spinor, which in the future we will give the meaning of the complex coordinate spinor
and
Coordinate vector of the four-dimensional Minkowski space
is obtained from the coordinate spinor by the same formulas
Thus, the vector in the Minkowski space is not a set of four arbitrary real numbers, but only such that are the specified bilinear combinations of components of completely arbitrary complex spinors
Accordingly, the components of the vector in Minkowski space are interdependent, from this dependence automatically follow the relations of the special theory of relativity between space and time. For the same reason, the coordinates of Minkowski space cannot serve as independent variables in the equations. From the commutative properties of matrices, which are generators of rotations and boosts with respect to which the length of vectors is invariant, quantum mechanics automatically follows. Indeed, the commutation relations between the components of momenta are related to the noncommutativity of rotations in some way, and from them the commutation relations between the components of coordinates and momenta are directly deduced. And from these relations the differential equations are derived.
And since we do not doubt the truth of the theory of relativity and quantum mechanics, we cannot doubt the reality of spinor space, which by means of the simplest arithmetic operations generates our space and time.
The quantity
is invariant under the Lorentz transformation simultaneously applied to the momentum and coordinate spinor, which automatically transforms both corresponding vectors as well
This quantity does not change for any combination of turns and boosts
Accordingly, the exponent
characterizes the propagation process of a plane wave in spinor space with phase invariant to Lorentz transformations.
Let us apply the differential operator to the spinor analog of a plane wave
Applying this operator at another definition of the phase gives the same eigenvalue
that is, two different eigenfunctions correspond to this eigenvalue, but in the second case the phase in the exponent is not invariant with respect to the Lorentz transformation, so we will use the first definition.
Since
are complex spinors, which, under the transformation
is affected by the same matrix
, then the complex quantity
is invariant under the action on the momentum spinor
of the transformation
.
is an eigenvalue of the differential operator, and the plane wave is the corresponding
m eigenfunction, which is a solution of the equation
Here denotes the complex function of complex spinor coordinates.
When substantiating the Schrödinger equation for a plane wave in four-dimensional vector space, an assumption is made (further confirmed in the experiment) about its applicability to an arbitrary wave function. Let us make a similar assumption about the applicability of the reduced spinor equation to an arbitrary function of spinor coordinates, that is, we will consider this equation as universal and valid for all physical processes.
Let us clarify that by the derivative on a complex variable from a complex function we here understand the derivative from an arbitrary stepped complex function using the formula that is valid at least for any integer degrees
In particular, this is true for the exponential function, which is an infinite power series.
It is very important to emphasize that we consider the complex variable and the variable conjugate to it to be independent, so when finding the derivative of a complex variable from some function, we treat all the quantities which are conjugate to our variable and which are included in this function, as ordinary constants.
It is not by chance that we denote the eigenvalue by the symbol
m, because if we form the momentum vector from the momentum spinor
included in the expression for the plane wave
then for the square of its length the following equality will be satisfied
That is the square of the modulus
m has the sense of the square of the mass of a free particle, which is described by a plane wave in spinor space as well as by a plane wave in vector space. For the momentum spinor of a fermionic type particle having in the rest frame the following form
quantity
is real and not equal to zero, and for the bosonic-type momentum spinor having in the rest frame the following form
it is zero
i.e., the boson satisfies the plane wave equation in spinor space with zero eigenvalue.
For the momentum spinor of a fermion-type particle we can consider another form in the rest system
then the mass will be real and negative
This particle with negative mass can be treated as an antiparticle, and in the rest frame its energy is equal to its mass modulo, but it is always positive
To describe the behavior of an electron in the presence of an external electromagnetic field, it is common practice to add the electromagnetic potential vector to its momentum vector. We use the same approach at the spinor level and to each component of the momentum spinor of the electron we add the corresponding component of the electromagnetic potential spinor. For simplicity, the electron charge is equal to unity.
Further we need an expression for the commutation relation between the components of the momentum spinor, to which is added the corresponding component of the electromagnetic potential spinor, which is a function of the spinor coordinates
Let us replace the momenta by differential operators
and find the commutation relation
Let us apply the proposed equation to analyze the wave function of the electron in a centrally symmetric electric field, this model is used to describe the hydrogen-like atom. For the components of the vector potential of a centrally symmetric electric field it is true that
As a result, it is possible to accept
We are looking for a solution of the spinor equation; we do not consider the electron's spin yet
This equation can be interpreted in another way. Let us take the invariant expression
And let's do the substitution
We will consider this equation as an equation for determining the eigenvalues of
and the corresponding eigenfunctions
Let's introduce the notations
this quantity does not change under rotations and boosts and is some analog of the interval defined for Minkowski space and
this quantity represents time in four-dimensional vector space.
An interesting fact is that time is always a positive quantity. As an assumption it can be noted that since we observe that time value goes forward, i.e. the value of
t grows, and it is possible only due to scaling of all components of spinor space, such scaling leads to increase of distance between any two points of Minkowski space. As a result, with the passage of time the Minkowski space should expand, herewith at first relatively quickly, and then more and more slowly.
As a result, we have an equation for determining the eigenvalues of
m and their corresponding eigenfunctions
Instead of looking for solutions to this equation directly, we can first try substituting already known solutions to the Schrödinger equation for the hydrogen-like atom. If
is one of these solutions, we need to find its derivatives over all spinor components
Let’s pay attention to the shift in priorities. In the Schrödinger equation one looks for energy eigenvalues, while here it is proposed to look for mass eigenvalues, it seem more natural to us. The mass of a free particle is an invariant of the Lorentz transformations, and in the bound state the mass of the particle has a discrete series of allowed values, each of which corresponds to an energy eigenvalue, and the eigenfunction of these eigenvalues is the same. But these energy eigenvalues are not the same as the energy eigenvalues of the Schrödinger equation, because the equations are different. When an electron absorbs a photon, their spinors sum up and the mass of the electron changes. If the new mass coincides with some allowed value, the electron enters a new state. The kay idea here is the assumption that the interaction of spinors occurs simply by summing them.
The advantages of considering physical processes in spinor coordinate space may not be limited to electrodynamics. It may turn out, for example, that the spinor space is not subject to curvature under the influence of matter, as it takes place in the general theory of relativity for the vector coordinate space. On the contrary, it can be assumed that it is when the components of vector coordinate space are computed from the coordinate spinor that the momentum spinor with a multiplier of the order of the gravitational constant is added to this spinor. This results in a warp that affects other massive bodies.
To account for the electron spin, we will further represent the electron wave function as a four-component spinor function of four-component spinor coordinates
where the coefficients
are complex quantities independent of coordinates. In fact, as shown at the end of the paper, the wave function is a linear combination of such right-hand sides with operator coefficients.
We will search for the solution of the wave equation considered in the first part of this paper
Let's express the left part through the components of the momentum spinor
Let's distinguish the direct products of vectors in these matrices
Let's introduce the notations
Let us substitute differential operators instead of spinor components
Then the quantities included in the wave equation
will have the form
Let us consider the case of a free particle and represent the electron field as a four-component spinor function of four-component spinor coordinates
For a free particle, the components of the momentum spinor commute with each other, so all components of the matrix are zero.
Let us use the model of a plane wave in spinor space
Substituting the plane wave solution into the differential equation, we obtain the algebraic equation
Let us take into account the commutativity of the momentum components, besides, let us introduce the notations
for the quantities which are invariant under any rotations and boosts, then we obtain
Additionally, introducing notation for Lorentz invariant quantities
we obtain
We see that in the case of a plane wave in spinor space, the matrix in the left part of the equation is diagonal and remains so at any rotations and boosts, the diagonal element also does not change.
In this case we can consider the matrix
in the right part to be diagonal with the same elements on the diagonal
, then the equation can be rewritten as an equation for the problem of finding eigenvalues and eigenfunctions
Let us compare our equation with the Dirac equation [6, formula (43.16)]
In the rest frame of reference, the three components of momentum are zero and the equation is simplified
That is, in the rest frame the Dirac equation and the spinor equation analyzed by us look identically and contain a diagonal matrix. The corresponding problem on eigenvalues and eigenvectors of these matrices has degenerate eigenvalues, which correspond to the linear space of eigenfunctions. In this space, one can choose an orthogonal basis of linearly independent functions, and this choice is quite arbitrary. For example, in [[
8], formula (2.127)], solutions in the form of plane waves in the vector space have been proposed for the Dirac equation in the rest frame
and the following spinors are chosen as basis vectors
For transformation to a moving coordinate system in [[
8], formula (2.133)] the following formula is used
where
The basis spinors form a complete system, that is, any four-component complex spinor can be represented as their linear combination and this arbitrary spinor will be a solution to the problem on eigenvalues and eigenfunctions in a resting coordinate system. The choice of the given particular basis has disadvantages, because if to find a four-dimensional current vector from any of these basis functions
then this current in the rest frame of reference
has non-zero components, and the square of the length of the current vector is zero. It turns out that a resting electron creates a current, which contradicts physical common sense.
Since we have freedom of choice of the basis, it is reasonable to choose the spinor for the wave function as some set of momentum spinor components, for example
An exhaustive list of 16 spinors of this kind, each corresponding to some particle of the fermionic field, is given in the last section of the paper. The proportionality factor is chosen so that in the rest frame the zero component of the current is equal to the charge of, for example, an electron or a positron.
The mass of electron
and the phase of the plane spinor wave
do not change at rotations and boosts. The matrix on the left side of the equation does not change either, remaining diagonal with
on the diagonal.
For a fermion, which can be an electron or a positron in the rest frame takes place
, so the quantity
which, unlike the mass
M in the Dirac equation, is complex in the general case, is also real for the fermion and can be positive or negative. For simplicity it is possible to consider the mass of the electron as negative and that of the positron as positive.
For the momentum spinor of a boson, such as a photon, it is true that
, so its mass is zero
The given constructions are not abstract, but describe the physical reality, since the results of the processes occurring in the spinor space are displayed in the Minkowski vector space. In particular, the momentum vector corresponding to the momentum spinor has the following parameters
the square of the length is equal to the square of the mass of the electron or positron
A spinor wave function
at some point in spinor space can be given a probabilistic interpretation by establishing its correspondence with the vector wave function
(
)
taking its values in the corresponding point of physical space with coordinates
We act within the classical concepts of quantum mechanics, simply to describe the state of a physical system we use spinor coordinate and momentum representations along with vector coordinate and momentum representations. Both types of representations equally have the right to be more substantial and in principle there is no need to express the wave function in one representation through the wave function in the other, both wave functions equally describe the same physical state. Moreover, since vector coordinates and momenta are simply expressed through spinor analogues, we would prioritize the spinor representations as the more fundamental ones.
Let us summarize the relations between quantum-mechanical quantities for the spinor space
The complete orthonormalzed system of eigenvectors of the momentum operator
The complete orthonormalzed system of eigenvectors of the coordinate operator
The relation between wave function in momentum and coordinate representations and the relation between eigenvectors of the coordinate operator and the momentum operator
The arbitrary choice of the basis of the linear space of the eigenvectors of the matrix takes place only for a free particle. In the general case the matrix K is not zero, the wave equation has no solution in the form of plane waves in spinor space and ceases to be invariant with respect to Lorentz transformations, and the eigenvalues become nondegenerate.
We propose to extend the scope of applicability of the presented equation consisting of differential operators in the form of partial derivatives on the components of coordinate spinors to case of a nonzero matrix
K
that is not only to the case of a plane wave, but to any situation in general. This transition is analogous to the transition from the application of the Schrödinger equation to a plane wave in vector space to its application in a general situation. The legitimacy of such transitions should be confirmed by the results of experiments.
This equation will be called the equation for the spinor wave function defined on the spinor coordinate space. Here the matrix
is, generally speaking, neither diagonal nor real, but it does not depend on the coordinates and is determined solely by the parameters of the electromagnetic field. Only in the case of a plane wave it is diagonal and has on the diagonal the square of the mass of the free particle. We can try to simplify the problem and require that the matrix
is diagonal with the same elements on the diagonal
, then the equation can be rewritten in the form of the equation for the problem of search of eigenvalues and eigenfunctions for any quantum states
This approach is pleasant in the Dirac equation, where the mass is fixed and equated to the mass of a free particle, and at the same time results giving good agreement with experiment are obtained.
We are of the opinion that the spinor equation is more fundamental than the relativistic Schrödinger and Dirac equations, it is not a generalization of them, it is a refinement of them, because it describes nature at the spinor level, and hence is more precise and detailed than the equations for the wave function defined on the vector space.
Let us consider the proposed equation for the special case when the particle is in an external electromagnetic field, which we will also represent by a four-component spinor function at a point of the spinor coordinate space
We will apply to the wave function of the electron the operators corresponding to the components of the momentum spinor, putting for simplicity the electron charge equal to unity
Note that the electromagnetic potential vector can be calculated from the electromagnetic potential spinor by the standard formula
The advantage of the spinor description over the vector description is that instead of summing up the components of the momentum and electromagnetic potential vectors as is usually done
now we sum the spinor components and then the resulting vector is
in addition to the usual momentum and field vectors, contains an additional term
taking real values and describing the mutual influence of the fields of the electron and photon.
After the addition of the electromagnetic field the components of the momentum spinor do not commute, the corresponding commutators are found above
Let's find commutators for other operators
Further we will use these and analogous relations
Earlier, by giving absoluteness to the requirement of invariance of the mass squared to the Lorentz transformations, we obtained a system of equations for interacting fields in electrodynamics in the case when these fields exist in vector space. But we can apply this approach to interacting fields in spinor space as well. Let us analyze again the formula
If we deal with the field of a free particle, then
But since we want to make the invariance principle absolute, we require commutability also in the presence of the electromagnetic field
This can be achieved if we take into account the dependence of the momentum spinor components on the coordinates and impose the condition
As in the case of vector space, we can treat in the spirit of Newton's law equations of the form
If an external field is applied, the momentum of the electron field changes, if the momentum of the electron changes for some reason, the electromagnetic potential is perturbed and an electromagnetic field is generated.
Earlier we defined a matrix of switches
and noted that for a free particle it is equal to the zero-point matrix. We can require that this matrix is zero also in the presence of an arbitrary field. Absolutization of this requirement gives us an additional set of equations, besides the main one (for example, the Dirac equation), to describe the interaction between the field and the charged particle in the presence of spin. It is guaranteed that the basic equation remains true both for a free particle and for a particle in an external field.
We will not use the given considerations further in the paper, leaving them as an idea requiring a separate consideration.
Let's solve the equation
Since the second factor
in the left-hand side of the equation has a simpler structure than the first factor, perhaps as a first step we should find the eigenvalues and eigenfunctions of the equation
and use the when solving the equation as a whole.
Let's calculate the expressions included in the equation
It would be interesting in this context to consider for the presented spinor model the case of a centrally symmetric electric field and to find solutions of the spinor wave equation for the hydrogen-like atom, taking into account the presence of spin at the electron. For such a model we can take
As mentioned above, we can substitute into the equation the already known exact solutions of the Dirac equation for the hydrogen-like atom by expressing the components of the coordinate vector and derivatives on them through the components of the coordinate spinor and derivatives on them. It is likely that the solution of the Dirac equation would not make the spinor equation an identity; it would be evidence that more arbitrary assumptions are made in the Dirac equation than in the spinor equation, and that the latter claims to be a better description of nature.
We can also consider the case of a constant magnetic field directed along the z-axis
We see that the scalar potential
grows with time, but does not depend on spatial coordinates, and the vector potential does not depend on time, so that there is no electric field. In this case
The equation considered up to now is rather cumbersome, therefore we would like to have a simpler and compact relativistic invariant equation for the fermion, taking into account the presence of a half-integer spin. Such equation really exists; its derivation is given in section 4 of the paper. Here we will give its form for the electron in the presence of the electromagnetic field
where
In general case electric and magnetic fields are expressed through partial derivatives of components of the vector potential by components of the space vector. We also can find the expression through these fields for the derivatives of the spinor components of the electromagnetic potential by the components of the coordinate spinor. To do this, we first find all derivatives
then express the components of the vector potential through the components of the spinor potential, substitute the components of the electric and magnetic fields instead of the derivatives of the components of the vector potential by the components of the coordinate vector, and then find the required derivatives from the resulting system of linear equations.
From general considerations taking into account the substitutions
it is possible to write the commutation relations for the components of the momentum spinor and functions from the components of the coordinate spinor
All other combinations commute with each other. The constant coefficients c and d possibly include a minus sign, an imaginary unit and some degree of the rationalized Planck’s constant.
Let's return to the relations
In this form they are equivalent, but if an external field is added, a difference arises, since in one case the field is added at the vector level and in the other at the spinor level
These relations correspond to differential equations including the relativistic Schrödinger equation
The corresponding inhomogeneous equation is
where the delta function can be represented as
has a solution
For a free particle the eigenfunctions and eigenvalues solving these equations should coincide, but in the presence of an external field the eigenvalues and the corresponding eigenfunctions will differ because of the above mentioned difference in summation in one case of vector components and in the other case of spinor components.
While the Dirac equation is sometimes referred to as extracting the square root of the Klein-Gordon equation, here we see a different way of doing it.
Let us check the truth of the relation
To obtain this result, we did not have to make assumptions about commutability of the spinor components among themselves. Accordingly, a similar expression takes place for the phase of a plane wave in vector space
Further we assume that the components of the momentum spinor commute, which takes place for a free particle, then we obtain
On the other hand, we can write
Thus, the results of calculations coincide.
Let us compare the phases of plane waves in vector and spinor spaces. Let us hypothesize that the plane wave in spinor space has a more complicated form than it was supposed earlier in the paper, namely, it contains an additional conjugate multiplier
The phase of the wave in this form is closer to the generally accepted phase of a plane wave in vector space. But the phases calculated by two methods do not coincide with each other, although both of them are invariant under Lorentz transformations
Let us slightly modify the expression for the phase of the plane wave
where
Let’s change the order of derivatives
and write the difference of the two equations
Add an imaginary unit to the phase
Thus, we obtained a differential equation with an eigenvalue independent of coordinates
to which corresponds the eigenfunction
which is a plane wave with imaginary phase and bounded amplitude.
Now we can define the function
which satisfies to equation
where
thus,
has the properties of the Green's function.
4. Path Integral and Second Quantization in Spinor Coordinate Space
Based on the above, we can modify the theory of the path integral. We will consider it in the notations in which it is presented in [
9]. For a free scalar field with sources J(X) the path integral has the form
It includes the action of
and the Lagrangian density for the free field
For convenience and clarity, the following notations are introduced
For the general case the Lagrangian density has the form
where V
-polynomial over the field
.
Substituting the Lagrangian into the Euler equation
the field equation of motion is obtained.
The free field theory is developed for a special kind of polynomial
In summary, Euler's equation defines the equation of motion
The notations used here are
Thus, there is a correspondence of the Lagrangian and the equation of motion for the free field
Our proposal is to replace the Lagrangian in vector coordinate space by the Lagrangian in spinor coordinate space. For this purpose, we use the equation of motion in spinor coordinate space and we want to find the Lagrangian for which the Euler equation defines this equation of motion
For the sake of clarity, we use the same notation for the spinor coordinate derivative as for the vector coordinate derivative; the context allows us to distinguish between them
et us write the Lagrangian plus sources in the form
And let's substitute the Lagrangian into the Euler equation
For the case of a free field the derivative operators commute, so we can write
It is pleasant that the Euler equation in invariant form works also in this situation, so that we obtain the desired form of the equation of motion in the spinor coordinate space. It is important that the proposed Lagrangian has a relativistically invariant form, even in the general case, and not only at commuting derivatives. The polynomial has the form
In the case of a free field we restrict ourselves to the first term of the polynomial
Then the Lagrangian density and the equation of motion for the scalar field in spinor coordinate space have the form
For a free field when the derivative operators commute, we obtain
In the spinor equation of motion there is a plus sign before the mass, although in the rest of the paper there was a minus sign. To return to the minus sign it is enough to put a plus sign in front of the polynomial in the Lagrangian.
Now we have to find the path integral, which, along with the Lagrangian, includes the sources
The components of spinors are complex, and we have already noted that the derivatives on complex variables are applied to the degree functions, which, most likely, can describe physical fields, respectively, the finding of an indefinite integral for the function of a complex variable can be treated similarly, i.e. as an indefinite integral from the degree function.
It is possible to recover Planck's constant, which provides a transition to the classical limit
One of the steps in computing the path integral in [
9] is to find the free propagator from Eq.
the solution of which has the form
herewith
In our case, we want to find
After integration by parts by analogy with [[
9], Chapter 1.3] we obtain for the special case of a free field
In the process of calculation, it is necessary to find the solution of the equation
For this purpose, we pass to the momentum space by means of the integral transformation
The assumed propagator has the form
which is verified by substitution into Eq. Here it is assumed that the representation of the delta function
We added a conjugate phase to the exponent
which, on the one hand, provides convergence of the integral, and on the other hand, it does not affect the result of differentiation on variables
.
We note at once that there is no simple correspondence between the so defined phase of a plane wave in spinor space and the phase of a plane wave in vector space, e.g.
but both parts of the inequality are invariant under Lorentz transformations.
One can see the difference between the propagators, since in one case
is real and positive, while in spinor space m is complex in general. We can use the relation
where
in which it is taken into account that the fermion mass is real. Now the propagator has the form
The derivatives of the scalar field on spinor coordinates can be expressed through the derivatives on vector coordinates
If in the right part to represent the wave function as a plane wave in vector space
then in the left part it should be represented as a plane wave of a special form in spinor space
Only in this case the left and right parts will be dimensionally consistent, e.g.
In any case, a complete coincidence will not be obtained due to the mismatch of dimensionless exponents of the exponents
Since we call the field under consideration a scalar field, we expect its value to be invariant to Lorentz transformations. But how to formalize this statement and to what exactly does this transformation apply? We propose to consider that the value of a scalar field is the scalar product of the representatives of a spinor field, which is the most fundamental field in nature, and vectors, tensors and, among others, scalars are formed from the spinors representing it. The scalar product is defined by means of the metric tensor of the spinor space. From any two spinors we can obtain a scalar, in general the complex case. But if we want to obtain a scalar with real values, we must impose some restrictions on the original spinors. For example, to any spinor u we can correspond a scalar
U taking real values, whose value does not change under the action of the Lorentz transformation on the spinor and the action of the same transformation on the conjugate spinor
When a spinor and its conjugate spinor are simultaneously rotated or boosted by some angle, the scalar undergoes a rotation or boost by zero angle.
We can find the derivatives of the scalar by the components of the coordinate spinor
The components of the coordinate spinor are complex quantities, the derivative on them is taken formally, since physical fields can be represented by power functions of the components of the coordinate spinor and its conjugate.
What are the advantages of the transition from path integral in vector space to path integral in spinor space? A possible answer is that there are new conditions for working with divergent integrals. Now integration is performed over spinor space, so that in the numerator there is a four-dimensional differential element instead of element in the case of vector space. The spinor element has the order of magnitude instead of for the vector element, whish decreases the order of magnitude of the numerator, while the order of magnitude of the denominator does not change.
If the spinor coordinate space is indeed more fundamental, and the vector coordinate space is an offspring of it, then we ma y benefit from this transition in any case.
Now let us move from the scalar field to the field of an electron, that is, the field of a particle with half-integer spin. We will use gamma matrices in the Weyl basis
Let us consider the linear combination of these matrices with components of the momentum vector as coefficients, substituting the expressions of the vector components through the components of the momentum spinor
Let us represent the matrix
as a sum of direct products of spinors
For a free field the components of the momentum spinor commute, therefore
Complex mass
does not change at rotations and boosts for an arbitrary complex spinor. Moreover, by a direct check it is possible to check that for an arbitrary spinor
For a free field, when all components of the momentum spinor commute, we can write the relativistic equation of motion of the fermionic field
Where the matrix of derivatives
is obtained from the matrix
by substitutions
However, it is generally accepted to write for this field another equation, the Dirac equation, which does not possess the invariance property anymore
And for the more general case, when the momentum components do not commute, we need to write the equation
Further we will consider the equation of motion for a free field
We again want to find the path integral
for which we need the Lagrangian, from which the Euler equation is derived equation of motion
It is suggested to use the Lagrangian
Let us substitute the Lagrangian into the Euler equation and obtain the equation of motion
Since the Lagrangian includes, along with the derivatives of
, the derivatives of
, it is logical to use a different definition of Euler's equation
Then for the free field case when the derivative operators commute with each other, we obtain the equation of motion
If the derivative operators do not commute, additional terms will appear in the equation of motion in the form of matrices similar to the
matrix, and these additional terms will not necessarily coincide with
. In this connection it is necessary to consider the Lagrangian as more fundamental notion than the equation of motion and to derive the equation of motion from the Lagrangian, i.e. to take as a basis not the derivation of the equation of motion in momentum space, with what we started, but to take as an axiom the form of the Lagrangian in the form of field derivatives in the relativistically invariant form. Then, if to follow the invariance principle quite strictly, we should start from the product of two matrices, i.e. to use the Lagrangian
Or, not limited to fermions,
Nevertheless, further we will search for the path integral in the simplest case with the originally proposed Lagrangian and in addition assume commutativity of all derivative operators
After integration by parts, we presumably obtain
Then it is necessary to find the solution of the equation
For this purpose, we pass to the momentum space by means of the integral transformation
We get the equation
with the decision
Here we use the equality, which is valid for an arbitrary complex spinor
It is based on the correlation verified earlier in our work
it is also taken into account that we consider fermions whose mass is real.
As a result, the propagator has the form
here we assume the validity of the relation
In the case of a fermion, the mass in integration is a fixed real quantity, and it can be considered negative for the electron and positive for the positron. Theoretically, the mass can be complex or purely imaginary. If we put mass equal to zero, it may be possible to apply this Lagrangian to describe massless particles. I wonder if there are particles with complex or purely imaginary mass. In the latter case, the square of the mass will still be positive and the particle will satisfy the Klein-Gordon equation. Such particles can interact among themselves, but not with particles whose mass is real.
Let's return to the question about the use of completely relativistically invariant Lagrangian
Let's find the product of matrices
The assumption that the following equalities hold is used
Further we find the product of matrices
Again we use the equality
and consider that the mass of the fermion is real, i.e.
therefore, the relations are valid
But the main advantage of the obtained matrix is the following
This matrix does not change at rotations and boosts, so it can be stated that the equation of motion, e.g., in the form of
where
is truly relativistically invariant, respectively we can use the invariant Lagrangian
to which corresponds the relativistically invariant propagator of the boson having a real mass, which is negative for the electron and positive for the positron
Let us compare the propagator in spinor space with the propagator of the fermion given in [[
9], formula II.2.22 and formula II.5.18]
In [
9] this formula is obtained by applying the second quantization procedure or using Grassmann integrals. The results are similar, but the integration here is performed in the vector momentum space. The Dirac equation and the corresponding Lagrangian are not relativistically invariant. Besides, here the mass is considered always real and positive, but then it is not clear how electron and positron differ from the point of view of this formula.
Let us consider in detail the derivation of the expression for the fermion propagator in [[
9], Sec. II.2]. It is based on the assumption of relativistic invariance of the Dirac equation and therefore the calculations are carried out in the rest frame, and then the result is extended to an arbitrary frame of reference. Thus for the field spinor u the spinor
u_ is defined and it is asserted that the value of
is a Lorentz scalar. But it is not so, since in the spinor space the scalar is formed exclusively by the scalar product of two spinors, where the metric tensor of the spinor space is included
there are no other ways to construct a scalar in the spinor space.
Nevertheless, this fact and the fact of non-invariance of the Dirac equation itself do not cancel the value of the second quantization procedure and the final form of the fermion propagator, which allows to make accurate predictions of the experimental results.
We hope that the proposed Lagrangian for the spinor coordinate space can find application in the calculation of the path integral, but already in the spinor space. Whether such a calculation in spinor space has an advantage over the calculation of the path integral in vector space can be shown by their real comparison.
By analogy with the propagator of a photon, more precisely of a massive vector meson, given in [[
9], formula I.5.3]
we can assume the propagator form in the spinor space without revealing for compactness the expression of the momentum vector components through the momentum spinor components
Among other things, the equation
can be modified to take into account the electromagnetic potential, the electron charge is taken as a unit
and apply, in particular, to analyze the radiation spectrum of a hydrogen-like atom.
Let us formulate again the difference between the equations, the second of which is derived from the Dirac equation with gamma matrices in the Weyl basis
The difference is, the matrix
(p) remains unchanged under any rotations and boosts applied to the spinor
, while the matrix
(p) changes under any rotations and boosts.
Equally radically different are the corresponding Lagrangians and propagators.
By analogy with [9, Chapter II.2] we will carry out the procedure of second quantization of the fermion field. Let us write the equation
in the momentum space, for which we apply the integral transformation
Let's substitute the wave function into the equation and obtain
Let us define two sets of four reference spinors
where
And let's express the matrix through them
Developing the idea of invariance, we pass to the set of reference spinors with wider filling, but continuing to form matrices possessing the invariance property
Let's express through the reference spinors the matrix
and matrix
here
Let us decompose the fermion field into plane waves with operator coefficients
Let's impose the anticommutation conditions on the operator coefficients
We consider the rest anticommutators to be equal to zero. Then we can write the expression for the anticommutator of the field
We will consider this relation as a proof of the anti-symmetry of the fermion wave function under the stipulated anticommutation relations.
It is important that all the above deductions are valid in any frame of reference, while the proof of anticommutativity of the fermion field in [
9] is carried out for the rest frame.
Let us calculate the total energy of the fermion field
here
Each summand in brackets represents the operator of the number of particles with a certain reference spinor. The operator's action consists of consecutive application of the annihilation operator and the operator of the birth of a particle. On initial examination, it would appear that the energy associated with zero-point fluctuations in the vacuum has been overlooked. However, an examination of the final expression reveals that the field always possesses a constant energy, regardless of the particles that contribute to it. This constant energy of the field can be interpreted as the energy of zero-point fluctuations of the vacuum.
The following relations were taken into account in the derivation
Other components of the total field momentum are calculated by the formula
Total momentum
is a vector in Minkowski space. The density of the current as a function of coordinates is
where
is a four-dimensional probability density current, which is transformed as a four-dimensional vector by Lorentz transformations. Multiplication by
transforms it into a four-dimensional current density.
Let us perform a series of transformations analogous to those presented by Dirac in [10, Lecture 11]
For an arbitrary component of the total momentum we have
Following Dirac's argument in [
10], the value of
can be treated as either a Hamiltonian or a total energy operator, with
representing the birth operator and
representing the annihilation operator.
In [
10] the quantization procedure includes the use of one definite Lorentzian reference frame, i.e. it is not invariant. In our case all deductions are valid in any reference frame in the spinor space, and it means invariance to change of reference frames in the Minkowski space also.
The following relations are used in the transformations
The chain of reasoning can be organized in a slightly different way as well
Here it is taken into account that
Let us draw an analogy between our approach and the relations given in [[
11]], Volume 1, Chapter 3, Section 3.3.1]. There it is noted that the birth and annihilation operators of the fermionic field must satisfy such commutation relations that the equality expressing translational invariance is satisfied
which in differential form is written as
The coordinates here are the components of the Minkowski vector space. On the basis of these relations the anticommutation relations between the birth and annihilation operators are derived.
At substantiation of the Schrödinger equation we have to assume that the zero component of momentum, i.e. energy, commutes with the rest of the momentum components, which allows us to represent the exponent of the sum as a product of exponents.
and consider time and energy separately from spatial coordinates and momenta. In this case it is possible to independently perform translation in time and space.
In the spinor coordinate space, we can express the translational invariance of the field operator by the relations
It is interesting to find out in what relation these translational operators are - one operator acts in vector space, the other in spinor space. In both cases the following interpretation can be given. Suppose we know the result of an operator acting on an arbitrary state at a point in space 1, and we want to know the result of its action on a state at point 2. Then we translate the state from point 2 to point 1, act on it by the operator, and transfer the obtained result back to point 2.
Both operators act on the same state, but in one case the state is labeled by spinor coordinates and in the other by vector coordinates. The translation mechanism of the operators is essentially the same, but it is not possible to replace the action of one translation operator by some combination of actions of the other. Because of this, the question arises as to which of these operators better describes nature. Our point of view is that the translation operator in spinor space is primary, and the operator in vector space just successfully copies it, without being exact, but being some approximation. It attracted the attention of physicists first because vector space is more accessible for investigation. When integrating over a four-dimensional vector space in some cases there is a divergence, then use renormalization. When integrating over four-dimensional spinor space, the differential element has two orders of magnitude of the vector momentum component smaller, while the denominator in the integrand remains of the same order as when integrating over vector space. This difference possibly affects the convergence.
Let us calculate the total mass of the fermion field
The ratios used in the derivation are
Let us give an interpretation of the operator coefficients for this approach
creates and destroys a particle with mass , spin up and momentum in the interval , is the operator of the number of such particles
creates and destroys a particle with mass , spin up and momentum in the interval , is the operator of the number of such particles
creates and destroys a particle with mass , spin up and momentum in the interval , is the operator of the number of such particles
creates and destroys a particle with mass , spin up and momentum in the interval , is the operator of the number of such particles
Note that
and
are translated into each other by a linear transformation, this is also true for other pairs of spinors
It is known [[
9], formula II.1.30] that the charge conjugation operation transforms an electron into a positron with a change of the sign of the charge. Let us apply the charge conjugation to the reference spinor
As a result not only transforms to , but also changes a sign of mass due to the imaginary unit in the charge conjugation matrix. This confirms our thesis that the charge conjugation synchronously changes signs of charge and mass.
The properties of all particles and operators are summarized in a table
| creates |
destroys |
particle spinor |
vector |
number |
mass |
spin |
wave sign |
|
|
|
|
|
|
up |
+ |
|
|
|
|
|
|
up |
- |
|
|
|
|
|
|
up |
- |
|
|
|
|
|
|
up |
+ |
|
|
|
|
|
|
down |
+ |
|
|
|
|
|
|
down |
- |
|
|
|
|
|
|
down |
- |
|
|
|
|
|
|
down |
+ |
|
|
|
|
|
|
up |
+ |
|
|
|
|
|
|
up |
- |
|
|
|
|
|
|
up |
- |
|
|
|
|
|
|
up |
+ |
|
|
|
|
|
|
down |
+ |
|
|
|
|
|
|
down |
- |
|
|
|
|
|
|
down |
- |
|
|
|
|
|
|
down |
+ |
Here the column “vector” shows the vector obtained from the corresponding spinor by the formula of the form
and
Although we have used the term vector for quantities like
U1, they are not really vectors in the sense that if a Lorentz transformation is applied to a coordinate spinor and hence a coordinate vector, the true vector must undergo the same transformation. For a momentum vector this is the case, but if the sign of one or more components in the momentum vector is changed, it will no longer be transformed according to this law. For example, charge conjugation changes the signs of some components
so the electron current and the positron current cannot be vectors at the same time, and in fact, as can be seen from the table, neither is a vector.
By the words destroys the particle it should be understood that this operator transforms this particle into the particle, and the operator performs the reverse transformation of into . The action of the operator on any other particle gives zero. Since both these particles have the same mass, the total mass of the fermionic field does not change from these transformations. The mass itself can have any sign or even be complex.
Although we call the spinors presented in the table particles, they actually describe the same particle whose characteristic property is a mass with a certain sign. A particle with mass of opposite sign is described by the other sixteen spinors. Let's compare the momenta of two particles with different mass signs
If we add sixteen spinors of the field of one particle with sixteen corresponding spinors of another particle, it will look the same as if the momenta of the particles were summed directly. It is clear that the momenta themselves cannot be directly summed, but summing the fields does not look impossible and leads to the same result as adding the momenta directly. The result can be represented as a sum of two other momenta, the mass of each of which is zero
Perhaps, such summation is an adequate model for describing the phenomenon of annihilation of particles with different masses. As an illustrative example, consider the case of an electron and a positron at rest
As a result of addition and separation, two photons with zero mass are obtained, having oppositely directed spatial components of the momentum vector, i.e. flying apart. At this interaction the total energy, the total momentum and the total mass are conserved. We can also say that the total charge is conserved, although in our interpretation the charge is not a numerical characteristic that can be calculated, the sign of the charge is determined by the structure of the spinor. In turn, this structure is determined precisely by the sign of the mass. Thus, a change in the sign of the mass leads ultimately to a change in the sign of the charge.
In the table below two last columns with a set of spinors and vectors corresponding to the particle with opposite sign of mass are added. It is supposed that at annihilation the spinors of the particle and antiparticle, which are in the same row of the table, are summed. The set of 16 spinors remains the same, but the order of their arrangement changes when the sign of mass changes.
| creates |
destroys |
particle spinor |
vector |
mass |
spin |
wave sign |
antiparticle spinor |
vector |
|
|
|
|
|
up |
+ |
|
|
|
|
|
|
|
up |
- |
|
|
|
|
|
|
|
up |
- |
|
|
|
|
|
|
|
up |
+ |
|
|
|
|
|
|
|
down |
+ |
|
|
|
|
|
|
|
down |
- |
|
|
|
|
|
|
|
down |
- |
|
|
|
|
|
|
|
down |
+ |
|
|
|
|
|
|
|
up |
+ |
|
|
|
|
|
|
|
up |
- |
|
|
|
|
|
|
|
up |
- |
|
|
|
|
|
|
|
up |
+ |
|
|
|
|
|
|
|
down |
+ |
|
|
|
|
|
|
|
down |
- |
|
|
|
|
|
|
|
down |
- |
|
|
|
|
|
|
|
down |
+ |
|
|
It is possible to assume that the reason and condition of distinction between particles with different charge is the presence of their non-zero mass. If the mass is zero, then in the given table there are no differences between spinors of the particle and antiparticle, i.e. there is no mechanism for formation of the internal degree of freedom, which we treat as charge.
Let us see what result we get if we apply another definition of anticommutativity of the fermionic field.
where
In deriving this result, the following relations are taken into account
The last operation of taking the value
out from under the sign of the integral seems doubtful because of its dependence on the momentum over which the integration is performed. If one closes one's eyes to this, as is generally accepted in the literature, in particular in [
9], this relation is taken to be interpreted as a proof of the anti-symmetry of the fermion wave function under the stipulated anticommutation relations. The only situation where this is unquestionably true is when considering in a rest system where boosts are excluded, energy is equal to mass, and invariant to rotations.
It is noteworthy that the antisymmetric treatment, whether or not complex conjugation is considered, yields a diagonal matrix that is invariant in one case but not in the other. It is encouraging to observe that the set of reference spinors remain consistent.
It is crucial to note that the proposed invariant approach cannot be realized within the Minkowski vector space. To achieve this, it is necessary to transition to the spinor space. This reiterates the secondary role of the Minkowski space in comparison to the spinor space.
Dirac's equation can be expressed in both spinor and vector spaces, a fact that led Dirac to discover it. In contrast, the invariant equation can be written in spinor space but not in vector space, which explains why it was unknown.
Let us write down the propagator of the fermionic field and the fermionic field invariant equation of motion using the proposed matrices
The equation of motion has the form
where
The equation is relativistically invariant, respectively we can use the invariant Lagrangian
to which corresponds the relativistically invariant fermion propagator
The equation can be modified to take into account the electromagnetic potential, the electron charge is taken as a unit
and apply, in particular, to analyze the emission spectrum of the hydrogen-like atom.
Let us look for a representation of the electromagnetic field operator in vector space without first referring to spinor space. Let us define four vectors expressed through the components of the momentum vector
Why we have chosen these 4 vectors out of 8 possible combinations of signs of three spatial components? Because they are represented in the previously given table of variants of spinor particles. For these vectors the following relations are valid
Let us decompose the field into plane waves with operator coefficients and let's find the commutation relations for them. We will use the next notation for the scalar product of vectors
Let us apply the following commutation relations
Here it is taken into account that
We will consider this relation as a proof of the symmetry of the wave function under the stipulated commutation relations.
Let us find the commutation relations for the wave function and its time derivative, which in this case play the role of canonical momentum
where
The commutation relations remain the same
As one would expect, the field has only two degrees of freedom. This relation is valid for any reference frame, but the values of the momentum components in each of them are different.
Let us calculate the square of the field energy
here
If we consider the photon field, the mass is zero, so that only the energy of the field remains in the formula. Each summand in brackets under the integral represents the operator of number of particles with a certain reference vector, its action consists in the consecutive application of the annihilation operator and the particle birth operator. The last summand describes the energy of zero-point fluctuations of vacuum. When there is no particle, we have the equality
In this connection it is logical to use the normalization for the wave operator so that the energy of zero-point fluctuations of vacuum without taking into account the infinite component is unity
If the mass is not zero, then we can relate and to the current of electrons with different spins and, respectively, relate and to the current of positrons with different spins.
As we have seen, neither electron current vectors nor electromagnetic field vectors are true vectors. When transforming the coordinate system, the same transformation acts on the components of the momentum vector, from these transformed components in each frame of reference the pseudovectors of the field are formed. But we know that the interaction between current and electromagnetic field is described by an additional term in the Lagrangian of the electrodynamics theory. This term is the scalar product of the current and the electromagnetic potential and it is necessary for this product to be a scalar. But to form a scalar using a metric tensor, two true vectors are needed, and these are not available. There remains only one way to provide the scalar, it is necessary that signs of components in pseudovectors of current and field coincide, then they will compensate each other, and in fact we will get the scalar product of two vectors, and hence we will get a scalar.
Thus, there is a direct connection between the spinor description of the field and its vector description. 16 pseudospinors pass into 4 pseudovectors, moreover, the modulus of the complex mass in spinor space is equal to the mass in vector space. At all this by the value of the phase of a plane wave in spinor space by any direct way it is not possible to calculate the phase of a plane wave in vector space. Hence the assumption arises that operators in spinor space describe nature exactly, while operators in vector space provide only an approximate description. This may partly explain the problems with divergence when integrating in vector space.
To describe the evolution of the field state, we consider the vacuum averaged expression having the sense of the propagator. Let us explain the meaning of operators included in the field decomposition
For example, is an operator of annihilation of a particle with pseudovector , similarly, other operators without asterisks annihilate particles with pseudovector which stands in expansion with these operators. Accordingly, the operator is the operator of the number of particles with pseudovector .
Let us define a vacuum state of the field with zero filling numbers of particles of each of four varieties
by specifying its properties with respect to the action of annihilation operators
It follows from these relations that
Let us construct the amplitude of the field component, which is born at the point with coordinates
and annihilated at the point with coordinates
For the reasons given above, let us apply the normalization of the field operator
If the mass is zero, this expression is the propagator of the photon.
Note that the matrix entering the propagator has no inverse, so we do not try to find the equation of motion or Lagrangian, they are not necessary in this case, since we have an explicit expression for the field operator. We do not have to worry about following the principles of Lorentzian covariance, gauge invariance, or following ideas of symmetry. Instead, we rely only on the fulfilment of canonical commutation relations for the field operator. The field operator is written identically in any frame of reference, and to pass to another frame it is enough to know how the momentum vector is transformed, which is transformed by exactly the same law as the coordinate vector, which ensures the invariance of the phase of the plane wave. In other words, the field is not a vector but a set of pseudovectors (pseudospinors in spinor space), only momentum and coordinate are vectors (spinor).
We can make our reasoning more intuitively clear if we define the birth and annihilation operators of the field particle
Let us find the commutation relations between the components of these operators
As we see, the commutation relations are satisfied for the birth and annihilation operators.
Let us define the total particle number operator in the form
Let's find the commutator
Let's define the vacuum state using the relations
which implies
Let's act on vacuum by the birth operator and for the obtained state we find eigenvalues of the particle number operator
If we apply normalization
then the eigenvalues will have the form
Note that in the case of the photon field, the matrix, taking into account the normalization, contains elements whose modulus is less than or equal to ½, since at zero mass .
The fact that for the birth and annihilation operators commutation relations are fulfilled, allows to conclude that quanta of the field obey Bose statistics, therefore a single action of the birth operator increases the number of particles in the field by one, and the action of the annihilation operator decreases this number by one. Hence, by means of these operators it is possible to write the propagator not only for the case when the initial and final states are vacuum, but also for the initial state with an arbitrary number of particles
For illustration let us consider a one-particle state
and act on it with the birth operator. Again, let's take into account
The eigenvalue of the particle number operator has increased, instead of a one-particle state we have a two-particle state
Further application of the birth operator increases the number of particles to any value. Now let us find a commutator for the annihilation operator, without taking into account the normalization for the moment
The ratios have been taken into account here
Let's act on the two-particle state by the annihilation operator and for the obtained state we find the eigenvalues of the particle number operator
Here, the fact that without taking into account the rationing
Thus, the annihilation operator reduces the number of particles and puts the field into a single-particle state.
Separate application of the birth and annihilation operators more corresponds to the ideology of second quantization than their use only as a sum, i.e. only as a field operator
In particular, since
then the propagator really acquires the sense of the amplitude of the probability that the particle is born at the origin and annihilated at the point with coordinates
.
Moreover, now the propagator can be not bound to the vacuum state, but can be applied to the field state with arbitrary number of particles n>0. The application of the sum of operators to some state makes sense only in the case when all operators except one give as a result zero. Therefore, at the usual approach we have to work only with the vacuum state so that at calculation of the propagator the annihilation operator gives zero. In our approach this restriction is removed, the operators are not summed, but only multiplied, and they can be applied to a state with any number of particles. For this purpose, let us take into account the following relations
The assumption used here is that the scalar products 〈
are the same for any values of momentum. Taking into account the normalization
At non-zero number of particles we can change the order of operators and first apply the annihilation operator
After normalization we obtain
Let’s return to the previously used definition of the vacuum state by means of relations
which implies
As we have seen, the action of the birth operator transforms the zero-particle state into a one-particle state
At that, none of the operators of the number of particles with a particular value of momentum
has no definite meaning, since the particle is only one. In this connection it makes sense not to define the vacuum in such a detailed way, it is enough to define that the vacuum state is characterized by only one condition
At this approach the field energy is not equal to the sum of energies of partial oscillations, accordingly the question about the energy of zero-point oscillations of each oscillator constituting the field is removed. We get rid of the problem of infinite energy of the sum of zero-point vibrations of an infinite number of oscillators.
We would like the propagator to have properties of the Green's function, i.e., to satisfy the Klein-Gordon equation of motion
The solution of this equation has the form
Therefore, we add the same multiplier to the denominator of the integrand expression
By analogy with the introduced birth and annihilation operators for fields in vector space, let us describe the corresponding operators for fields in spinor space. As an initial one we use the previously described field operator for the fermionic field
Let us define the birth and annihilation operators
Let's find anticommutation relations between components of these operators
Besides these relations, the following `anti-commutation relations take place between the components of the annihilation and birth operators
Let's define operators of the total number of particles in the form
Let's find the commutators
Instead of defining the vacuum state through its properties under the action of annihilation operators
which would entail the ratios
we will not require from operators all these properties, but we will be limited by a weaker and simpler definition of vacuum, namely, absence of particles in vacuum
Let's use the found commutator
On the obtained one-particle state let's act on the obtained one-particle state by the birth operator again
We have obtained a state with two particles and we can thus increase the number of particles to infinity. All particles are identical and indistinguishable from each other, each of them is in all allowed states, of which the free field has infinitely many. Electrons in an atom have fewer allowed states, but still any electron occupies all of them equally with the others. This theory describes both electron and positron, the difference between them being only in the sign of the mass, it being convenient to consider that the electron has a negative mass and the positron a positive one.
Similarly, we use the commutator of the annihilation operator
Thus, the action of the annihilation operator has transformed the two-particle state into a one-particle state. Using the same calculations, we obtain the result of the annihilation operator action on the one-particle state
And in the same way we define the result of its action on the null state
We obtain a state with the number of particles minus one, but we see that in fact it is a state with one particle whose mass is negative. Thus, the positron annihilation operator is also the electron birth operator. It destroys positrons until they run out, after which it begins to give birth to electrons. The birth operator, on the contrary, destroys electrons, and when they run out, begins to give birth to positrons. Thus, since there are many electrons in our universe, this operator cannot give birth to positrons because it cannot destroy all electrons due to their number. Moreover, the operator of annihilation of positrons because of the absence of the latter, only gives birth to more and more electrons.
If the mass is zero, then in any state the number of particles is zero, i.e., for example, the electromagnetic field in spinor space, where it should be fermionic, simply has no particles. The absence of particles does not contradict the presence of the field, which is represented by the same 16 spinors, this field obeys Fermi statistics, and it has no charge and can be treated as a Majorana fermion. This field interacts with electrons in spinor space, and the result of the interaction manifests itself in vector space.
With the help of the birth and annihilation operators we can write the propagator for the situation when the initial and final states are states with arbitrary number of particles
We would like the spinor propagator also to have properties of the Green's function, i.e. to satisfy the equations which for this case are given below and which can be combined into one equation by summation
where the delta function can be represented as
The solution of the combined equation has the form
Therefore, we must add to the denominator of the integrand an appropriate multiplier
The electron and positron have different mass sign, so their propagators will be different.
Instead of the sum of equations we can use their product, then the corresponding inhomogeneous equation
has a solution
Correspondingly, it is necessary to add to the propagator the denominator in any of these forms, since it was shown earlier that for a free field
We can repeat the above calculations, keeping the annihilation operator, but defining the birth operator differently
As a result, we obtain the anticommutator
As before, using the birth and annihilation operators, we construct propagators for a state with an arbitrary number of particles
Now instead of mass the propagator includes energy, therefore such theory is applicable also to the field with zero mass, i.e. it can serve as a model not only for the electron, but also for the electromagnetic field in spinor space. The only problem is that if earlier the action of the annihilation operator on the zero-point state gave a particle with negative mass, now this action gives a particle with negative energy, which makes the interpretation of such theory more difficult.
We can reformulate the above reasoning in a more consistent and logical form. Let us again write down the equations
and put in correspondence with them the birth and annihilation operators, which in this version are conjugate to each other
Then the propagator without additional assumptions will have the form
One can even propose to use plane waves in spinor space immediately together with the denominator in any field operators
The considered free field propagators describe the situation when there is a point source with coordinate
and a point sink with coordinate
. In the general case in the spinor space the distribution of source-stocks
J(x) can be given and the value of
which is used for finding the integral over the trajectories and which can be written using the Fourier transform for the spinor space
In quantum field theory it is customary to calculate a similar quantity
in which the coordinates, momenta and the Fourier transform connecting them belong to the vector space. In our opinion, the transition to spinor space, more fundamental than vector space, which is a superstructure over spinor space, can eliminate divergences in calculating integrals in the framework of the formalism of the integral over trajectories. In momentum space the similarity is even more obvious, the kernels of the integrals are the same, the only difference is in the space where the integration takes place and the way of calculating the Fourier transform - either in vector or in spinor space
The spinor space has the additional advantage that the integrand is factorized
This factorization in momentum space looks like a consequence of a more fundamental property of factorization in coordinate space
We can assume that first it makes sense to perform integration separately on
x and
y, and only then to perform multiplication
Since earlier we have obtained an explicit representation of field operators in both vector and spinor space, we do not need to refer to the equation of motion and the Lagrangian. Proceeding from these representations, we define the birth and annihilation operators, and from them we construct the propagator as a function of relative coordinates. For example, for a spinor space
But we need to take two more steps. The first is to find an equation for which this propagator is an eigenfunction of the relative coordinates. Two equations can be proposed for this role
It is important that in both cases the eigenvalue is a real value independent of the coordinates. The second step is to ensure that the propagator has the properties of the Green's function, i.e., that the right-hand side of the equation has a delta function. This is necessary so that for an arbitrary distribution of sources we can use this propagator to construct a complete picture of the field propagation. To satisfy these equations, we must include in the solution a normalizing multiplier of the appropriate kind for each equation. This multiplier does not depend on the coordinates, but depends on the momentum. As a result, we obtain the normalized propagator in two variants
In our opinion, the second variant is more preferable, since the denominator consists of two conjugate terms, so we can consider them as an integral part of the birth and annihilation operators, which are also conjugate copies of each other
The obtained results allow us to answer the question how the fermion field changes under the action of Lorentz transformations on the coordinates. Exactly, if we move to another frame of reference by rotations and boosts, the coordinate spinor changes. As a consequence, the momentum spinor changes, the components of which are the coefficients of the expansion on the new coordinates, and the momentum spinor undergoes exactly the same transformation as the coordinates, so that the phases of all plane waves in spinor space do not change. The components of the new momentum spinor are substituted into the 16 spinors describing the fermion field. Thus, there is no any uniform law of transformation of a spinor of the fermionic field, each of 16 spinors corresponding to the particles forming it, is transformed in its own way.
However, if, following Heisenberg [[
12], Chapter 3, Paragraph 1], we index the field components differently
Then it can appear that this field on the first index will be transformed by three spatial rotations and three boosts, and on the second index it will be transformed by three rotations in isotopic space. In this case the additional quantum number related to the sign of mass may be an isotopic spin.
Let us suggest that the coordinate and momentum spinor spaces can also be indexed in a similar way
Thus, we are in a space that is subject to three rotations, three boosts, and three isotopic rotations. All of these transformations are equally real, but there is an imbalance due to the luck of isotopic boosts. After all, isotopic rotations, like spatial rotations, are generated by Pauli matrices; these rotations also do not form a group. Therefore, the full isotopic group must also consist of three rotations and three boosts.
Let's rewrite the previously used quantities with new variables
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Summarizing, we can formulate the following theses. The initial coordinate space is described by complex quantities, which can be represented as a square matrix
The field is a superposition of plane waves with complex phase
where the momentum coefficients of the decomposition are represented as
The phase of a plane wave is constructed using two metric tensors of spinor space and therefore does not change if and are affected by the same transformation, which is a combination of three rotations and three boosts with arbitrary angles at the first index and a combination of three rotations and three boosts with arbitrary angles at the second index. Any transformation is given by 12 real values representing the angles of the turns and boosts. When we considered a four-component spinor, we made do with 6 angles, since we took the same rotation and boost angles for both indexes. Note also that only under this condition the mass invariance takes place.
Each plane wave in superposition has a multiplier in the form of a matrix
which may be any matrix of 16 pulse combinations given in the table, e.g.
Each of these matrices can be compared to some elementary particle, and at transformation of coordinate and momentum space it is transformed according to some inherent law. The field operator has the form
In addition, a complex conjugate version of the phase should be added to both exponents, as was done above, then there would be an imaginary value in the exponent.
For the field, we can obtain the equation of motion as an equation in partial derivatives on the complex variables by substituting the derivatives on these variables instead of the derivatives on in the previously discussed equations.
We can also consider the decomposition of the field by the previously considered plane waves of the form
For the simpler case of a scalar field these plane waves correspond to the Green's function
satisfying the equation
Recall that the transition from spinor space to vector space is performed by transformations
Lorentz transformations are given by 2×2 matrices with a set of valid rotation angles and boosts
After acting on both spinors of the Lorentz transformation with 12 arbitrary angles
and corresponding transformations in the vector space
there is still equality of masses
and invariance of the plane wave phase in spinor space
However, at arbitrary 12 angles, the mass is not invariant
and the phase of a plane wave in vector space also changes at Lorentz transformations
And only under the condition of equality of 6 corresponding angles in the transformation matrices, i.e. under equality
both these invariance properties are restored.
Thus, a plane wave with invariant phase in spinor space is a more general concept than a plane wave in vector space, although the concept of invariant mass cannot be introduced for it in the general case.