1. Introduction
The traditional version of the Lorentz transformation is used to relate the coordinates of an event (a point in spacetime) between different inertial coordinate systems in the absence of any gravitational effects. It is limited to those cases in which both reference frames are inertial, so they have a constant velocity relative to each other. This report utilizes a familiar postulate (stated below in (1.1)) together with the traditional version of the Lorentz transformation to derive a generalized version that allows one of the reference frames to be accelerating. This is not a new topic but the presentation and all derivations in this report are the author’s own inventions. After starting with familiar first principles, the report is self-contained in that derivations are provided for all conclusions. This accounts for the scarcity of references. By defining suitable quantities and introducing suitable notation, the generalized version can be written in a way that is almost as simple as the traditional (constant velocity) version when calculating the coordinates of an event in an inertial system when given the coordinates in an accelerating system. Unfortunately, calculations of the inverse transformation, i.e., calculating the coordinates in the accelerating system when given coordinates in an inertial system, are more cumbersome. Worse yet, while a suitably selected history and future ensure the existence of an inverse transformation, there can exist spacetime points for which it is not unique. However, the metric tensor can be derived in the accelerating system for the general case and is included in this report. This is used to calculate time dilations and Doppler effects that are outside the scope of inertial coordinate systems.
This report frequently uses the phrase “as seen by an observer”. In the context of this report, the word “seen” does not refer to a literal visual image. A visual image of an event (an “event” is a point in spacetime) is received by an observer after a time delay, the time required for a light signal sent by the event to reach the observer. In the context of this report, the phrase “as seen by an observer” does not refer to a visual image but, instead, means that the observer assigned spacetime coordinates to the event.
1 For example, the phrase “the observer sees a clock to be running slow” means that if the observer assigns a time coordinate to one tick of that clock, and another time coordinate to the next tick of that clock, the time between consecutive ticks is longer than the time between consecutive ticks of the observer’s own clock.
This report considers two observers. One, called the home observer, uses a clock at rest at the origin of his coordinate system, which is an inertial coordinate system (recognized to be inertial by freely moving particles having constant velocities).
2 The other observer, called the traveler (with travel defined to be relative to home), uses a clock at the origin of his coordinate system, called the traveler’s system. The traveler’s system is not rotating but may have a translational acceleration relative to the home system. A spacetime point denoting the start of the traveler’s journey has an arbitrary initial spatial translation relative to the home system and an arbitrary initial velocity in the home system, but the clocks are synchronized so that the traveler’s time coordinate and home time coordinate of the start of the journey are both zero. The goal is to relate, for an arbitrary event, the traveler’s coordinates of that event to the home coordinates of the same event.
The traveler’s clock traces out a worldline as seen by the home observer. Information that is assumed to be known includes the three-dimensional (spatial) vector function, denoted
X(
t), which is a parameterization of the traveler’s worldline expressing the spatial coordinates
x, in the home system, of a point on the traveler’s worldline, in terms of the coordinate time
t in the home system.
3 From this parametrization, the velocity of the traveler’s clock as seen by the home observer is the function
V(
t) calculated from
V(
t) =
dX(
t)/
dt.
All conclusions in this report are derived from two postulates. One, taken from [
1] with minor paraphrasing, is the statement:
The second postulate that all results in this report are derived from is the Lorentz transformation applicable to the case in which the traveler has a constant velocity
v as seen by the home observer. This is presented in various formats in [
2] through [
8] but the format preferred for this work is taken from [
9] and given as follows. Consider an event with home coordinates denoted
and traveler coordinates denoted
, and a second event with home coordinates denoted
and traveler coordinates denoted
. The standard (constant velocity) Lorentz transformation in [
9] relates the displacement between these events expressed in each of the two systems according to
4
where the parameter
γ and the unit vector
n are defined by
2. Analysis
Let
P denote an arbitrary spacetime point on the traveler’s worldline. The coordinates of
P in the home system are denoted
, and the coordinates in the traveler’s system are denoted
. The traveler’s clock is at the traveler’s origin and the worldline considered here is the worldline of the traveler’s clock so
Also, even if the traveler’s clock is accelerating, the hypothesis (1.1) states that it is still true that increments of proper time for displacements along the worldline of the traveler’s clock equal increments of time measured by the traveler’s clock. With proper time and the traveler’s clock both set to zero when the traveler starts his journey, the proper time along the worldline up to the point
P equals
. The proper time is an invariant that can be calculated in the home system, giving
where
τ is the proper time calculated in the home system according to
with
γ(
t) defined by
Now consider an inertial system that is “local” with the traveler’s system at the arbitrary point
P on the traveler’s worldline. This statement is defined to mean that there exists a time at which the origins of the two systems are at the same location and with each axis of the two systems aligned, the two systems are at rest relative to each other at this time, the spacetime coordinates of the origins at this time are the spacetime coordinates of
P, and the clock in the inertial system is set to
at this time. We will call this new inertial system the
P-system. We use the postulate, stated in (1.1), that the traveler’s clock will momentarily (while at
P) tick at the same rate as the
P-system’s clock, and the traveler’s measuring rods will momentarily (while at
P) measure the same lengths as the
P-system’s measuring rods. To utilize this local inertial system in the analysis we use the following approach. Instead of selecting an arbitrary event and attempting to calculate traveler coordinates of it, we work in the opposite direction by selecting traveler coordinates and then identify what the event is that has those coordinates (this identification can be done by finding the home coordinates). Because the spacetime point
P is arbitrary, it is sufficiently general to confine our attention to those events that the
P-system declares to be simultaneous with the event
P. The event to be constructed will be called
E, with traveler coordinates denoted
and home coordinates denoted
. With the
P-system declaring
E to be simultaneous with
P, we conclude from the hypothesis (1.1) that the traveler also declares
E to be simultaneous with
P,
5 implying that
A constraint that the event
E must satisfy to be treated in this analysis is that there must exist a point
P on the traveler’s worldline such that the inertial system local with the traveler’s system at
P sees
E to be simultaneous with
P. Given that
E is such an “allowed” event, so that
E is simultaneous with
P as seen by the
P-system, the coordinates of
E in the
P-system are the same as the coordinates in the traveler’s system. Therefore, the coordinates in the
P-system are also
. Also, the coordinates of
P in the
P-system are
=
. The
P-system has velocity
V(
tP) as seen in the home system, so the transformation from barred coordinates to unbarred coordinates is obtained from (1.2) by replacing subscripts (with 2 replaced by
E and 1 replaced by
P), replacing
v with
V(
tP), and replacing
γ with
γ(
tP). The results, after simplification by using (2.1) and (2.5) are
3. Summary and Discussion of the Transformation to Home Coordinates
Information assumed to be available includes the functions
X(
t) and
V(
t) explained in
Section 1. From these we construct the functions
γ(
t) and
τ(
t) via (2.4) and (2.3), so all these functions, which refer to the traveler’s worldline as seen by the home observer, are regarded as given. The goal here is to calculate the home coordinates of an arbitrary event
E when the given information, in addition to the above functions, consists of the traveler’s coordinates
. The following steps are used:
The first step calculates
tP. This is done by combining (2.5) with (2.2) so
tP is calculated from the given
via
Note that
γ(
t) is a strictly increasing function so there exists a unique
tP satisfying (3.1) providing only that
is between zero and the largest display of the traveler’s clock (finite if its worldline has a stopping point). With
tP now solved, we calculate
xP via
With this done, all terms appearing in (2.6) have been evaluated except for the terms,
xE and
tE, to be solved. They can now be solved by writing (2.6) as
It is interesting that (3.3a) implies that tE = tP if either V(tP) = 0 or = 0. This could have been anticipated even before deriving (3.3). Recall that the events P and E are always simultaneous as seen by the P-system, because this is a condition that was imposed on E. If V(tP) = 0, the P-system is stationary relative to home, so P and E are also simultaneous as seen by the home system. Therefore, we could have anticipated that tE = tP when V(tP) = 0 even before deriving (3.3). Now suppose = 0. Recall again that the event E is simultaneous with P as seen by the P-system, and therefore also as seen by the traveler, so they have the same time coordinates in the traveler’s system. Also, P is on the traveler’s worldline, so its spatial coordinates are zero in the traveler’s system. Therefore, = 0 if and only if P and E have the same spacetime coordinates in the traveler’s system, in which case they are the same spacetime points and therefore have the same spacetime coordinates in all systems. Therefore, we could have anticipated that tE = tP and xE = xP when = 0 even before deriving (3.3).
A topic that can be discussed here is time dilation. However, there are two versions of time dilation because the traveler and home observer can disagree on simultaneity.
6 Specifically, they can disagree on which readings on their two clocks are simultaneous. One way to clearly state which version of time dilation is being considered starts with the realization that a clock display is not only a time coordinate for some observer, but it can also be regarded as an event (think of a 5:00 o’clock whistle which is clearly an event). The version of time dilation that is being considered is clearly stated when stating which clock is the one whose displays are the events for which coordinates are to be determined. Here we discuss the version of time dilation in which displays on the traveler’s clock are not only traveler time coordinates of the displays but also events. The goal is to calculate home time coordinates of these events, so we will calculate how fast the traveler’s clock is ticking as seen by the home observer. This calculation is very simple. The traveler’s worldline is taken to be the worldline of the traveler’s clock (the clock defines the traveler’s origin) so traveler clock displays are events for which
=
0. From the previous paragraph we conclude that
tE =
tP so (3.1) becomes
which can be inverted via (2.3) and (2.4) to solve for
tE. Also, differentiating (3.4a) while using (2.3) gives
The instantaneous result (3.4b) is the same simple result that is obtained without acceleration. This is a time dilation in that
dtE, which is the time seen by the home observer that is needed to change the display of the traveler’s clock, is greater than
, which is the change of the display of the traveler’s clock and therefore also equal to the time seen by the traveler needed to change the display of the traveler’s clock. A larger time between displays corresponds to a slower clock, so the home observer sees the traveler’s clock to be running slow. Unfortunately, the version of time dilation in which displays on the home clock are events for which traveler time coordinates are to be calculated, i.e., that calculates how fast the home clock is ticking as seen by the traveler, is not so simple but an example is given in
Section 12.
6. The Inverse Transformation
When traveler coordinates of an event are given, the home coordinates are obtained from a simple procedure in
Section 3. We now consider the case in which the home coordinates are given, and the goal is to calculate the traveler’s coordinates. Using the transformation in either direction, one of the unknowns to be solved is
tP. When the traveler coordinates are given,
tP is solved via (3.1). When the home coordinates
xE and
tE are the givens, we need to construct an equation that contains the unknown
tP with all other terms known. This is done by taking the dot product of (3.3b) with
V(
tp) to get
which gives
Substituting this into (3.3a) while using (3.2) gives
With the parameters
tE and
xE given, and the functions
V and
X given, (6.1) is the equation governing
tP. An unfortunate property of (6.1) is that there are example traveler worldlines in which (6.1) does not have a unique solution for
tP for some inputs
tE and
xE. In some cases, no solution exists. For some other cases, solutions exist but are not unique. This is discussed in more detail in
Section 8. For the remainder of this discussion we assume that event
E is one in which (6.1) has a unique solution for
tP and this solution has been found (by some numerical root finding routine if necessary).
Given that
tP satisfying (6.1) can be found and has been found, we calculate
via (3.1). The last step of the inverse transformation solves for
as follows. To shorten the notation, we define, for an arbitrary vector
W, a parallel part
and a perpendicular part
, with respect to the direction of
V(
tP), according to
Using this notation we can write (3.3b) as
The parallel part of the left side equals the parallel part of the far right side, and the perpendicular part of the left side equals the perpendicular part of the far right side so
and
Now express
as the sum of the parallel part plus perpendicular part while using (6.4) to get
Now use the second equation in (6.2) to write this as
Finally, we use the first equation in (6.2) to write this as
Before summarizing the above results, we digress by noting two interesting properties of (6.1). One property is that
tE =
tP if
V(
tP) = 0. This could have been anticipated even before deriving (6.1) as already explained in
Section 3. The second interesting property deserves more discussion. That property is the implication that if
xE =
X(
tP) then
tE =
tP. This could have been anticipated even before deriving (6.1) as follows. Suppose
xE =
X(
tP), i.e.,
xE =
xP. Then the events
E and
P have the same spatial coordinates in the home system. They also have the same time coordinates in the
P-system (by choice of point
P) so it is not surprising that
E and
P are the same spacetime point, implying
tE =
tP. This conclusion is made rigorous by (6.1), showing that it is true that the condition
xE =
X(
tP) implies
tE =
tP. Therefore, the condition
xE =
X(
tP) implies the simultaneous conditions
xE =
xP and
tE =
tP, which implies that
E and
P are the same spacetime points, which implies that they have the same coordinates in all systems. A variety of implications follow by combining this implication with others already stated (e.g., that the event
E is on the traveler’s worldline if and only if
=
0, and the conclusion from
Section 3 that
tE =
tP and
xE =
xP when
=
0). Putting the implications all together, we obtain the following summary of implications:
We now summarize the results derived in this section. It is assumed that we are given
xE and
tE, and the goal is to calculate
and
. The first step calculates
tP from
where it is understood that we are treating the case in which there exists such a
tP (see
Section 8 for more discussion). Having done this, the quantities
X(
tP),
V(
tP),
γ(
tP), and
τ(
tP) all become knowns quantities, via the given worldline of the traveler in the home system together with (2.3) and (2.4), in addition to
xE and
tE being known quantities. The next set of steps calculate
xP,
, and
from
8. Existence and Uniqueness of the Inverse Transformation
Whether or not the inverse transformation in
Section 6 exists and is unique is completely determined by the very first step of the calculation, which is to solve (6.6) for
tP. As pointed out in
Section 2, a constraint that the event
E must satisfy to be treated in this analysis is that there must exist a point
P on the traveler’s worldline such that the inertial system local with the traveler’s system at
P sees
E to be simultaneous with
P. There exists a solution for
tP to (6.6) if and only if
E is such an “allowed” event. However, it is also possible that
E is such an allowed event, so there exists a point
P satisfying the above condition, but such a point
P is not unique. In this case there are multiple solutions for
tP to (6.6). Lack of existence is possible if the traveler’s acceleration continues forever, and an example is given later in
Section 12. However, it is shown in this section that if the acceleration always has a finite magnitude and has a finite time duration, solutions must exist but are not always unique.
The goal of this section is to show that if the acceleration always has a finite magnitude and has a finite time duration, there necessarily exists a solution for
tP to (6.6) (albeit not necessarily unique). The first step towards this goal shortens the notation in (6.6) by defining the function
F(
tP,
xE), a function of
tP and the vector
xE, by
so that (6.6) becomes
The traveler’s journey begins at
t = 0 but nothing was said about his prior history. It is convenient to stipulate that, before starting the journey, the traveler was at rest in the home system, so
V(
tP) =
0 if
tP < 0. Also, the acceleration is taken to have a finite time duration so there exists some cutoff time in the home system, denoted
tC, such that the acceleration is zero when
t >
tC. We therefore have
The cutoff for the velocity implies that
X(
tP) satisfies
Substituting (8.3) into (8.1) and regrouping terms for the case in which
tP >
tC gives
where
γ is defined by (2.4). It is evident from inspection of (8.4) that
Given that the magnitude of acceleration is always finite, so the velocity is a continuous function of time, F(tP,xE) is a continuous function of each argument. Continuity together with the mapping property (8.5) implies that (8.2) has a solution for tP for any value of tE.
Having established that there necessarily exists a solution for
tP to (6.6) if the acceleration always has a finite magnitude and has a finite time duration, the next question asks for the conditions in which the solution is unique under the assumed conditions regarding the acceleration. Note from (8.4) that
F(
tP,
xE) is strictly increasing in
tP when
tP < 0 and when
tP >
tC. If (a big “if”)
F(
tP,
xE) is strictly increasing in
tP when 0 <
tP <
tC, then a solution for
tP to (6.6) is unique. However, if (another big “if”) a solution for
tP to (6.6) is a value of
tP at which
F(
tP,
xE) is strictly decreasing in
tP, then this solution for
tP is not unique. Therefore, the question of uniqueness is answered by considerations of whether
F(
tP,
xE) is increasing or decreasing in
tP. It will be shown below that (given a nonzero acceleration at some point in time) there necessarily exists home coordinates (
tE,
xE) for which a solution for
tP to (6.6) is not unique. A sufficient condition, imposed on
xE, for uniqueness will also be given. The arguments below will refer to the derivative of
F(
tP,
xE) obtained from (8.1) and given by
We will show that there exist home coordinates (
tE,
xE) for which a solution for
tP to (6.6) is not unique by constructing them. Select a
tP for which
A(
tP) ≠ 0 but is otherwise arbitrary. Now select any
xE satisfying
Note that such an xE satisfying (8.7) necessarily exists when A(tP) ≠ 0. Also note, from (8.6), that this choice for xE makes F(tP,xE) strictly decreasing in tP at the selected value of tP. Now use (8.2) (equivalent to (6.6)) to define tE. These steps have constructed a set of home coordinates (tE, xE) such that F(tP,xE) strictly decreasing in tP at the solution tP, implying that the solution is not unique. The conclusion is that, given that there is some point in time at which the acceleration is not zero. There exist home coordinates (tE, xE) for which a solution for tP to (6.6) is not unique.
A fairly simple constraint imposed on the home coordinates (tE, xE) that is a sufficient condition for the solution for tP to (6.6) to be unique is for xE to be selected to make the right side of (8.6) positive for all tP. This constraint makes F(tP,xE) strictly increasing in tP for all tP, ensuring uniqueness.
9. Some Identities for Later Use
The next section calculates the metric tensor in the traveling system. That analysis is less cumbersome if some identities are available, and one goal of this section is to make them available. A few other miscellaneous identities useful in later sections are also derived.
The first goal is to derive expressions for the derivatives of parallel parts and perpendicular parts of arbitrary spatial vectors as they are defined in (6.2). This is accomplished in several steps. The first step calculates the derivative of the magnitude of the velocity vector. The second step uses this result to calculate the derivative of the unit vector in the direction of the velocity vector, and the last step calculates derivative of parallel and perpendicular parts of arbitrary vectors.
Note from (6.2) that parallel and perpendicular parts of an arbitrary spatial vector
W can be written as
where the unit vector
n is defined by
The first step that is used to calculate derivatives of parallel and perpendicular parts of
W notes that
which gives
Substituting (9.3) into the above gives
From the definition (9.1) of parallel and perpendicular parts, we recognize the square bracket in (9.4) as the perpendicular part of the acceleration so
The next step takes the derivative of the parallel part of an arbitrary vector
W using
The first term on the right is seen to be the parallel part of the derivative of
W. Substituting (9.5) into the second and third terms on the right produces
The derivative of the perpendicular part is obtained from
Substituting (9.6) into the above far right term while using
gives
Another identity useful in later sections is an expression for the derivative of
γ(
tP). Using (2.4) gives
which finally gives
Still another identity that will be useful later and is trivial to prove is
Now consider an arbitrary trajectory, or worldline, seen in an arbitrary inertial system. This could be the worldline of the traveler’s clock in the home system but not necessarily. Along this worldline we can define increments of proper time, denoted
dτ, and increments of coordinate time, denoted
dt. From these increments we can define the derivative
dt/
dτ, which is a derivative defined on a curve. Two different expressions for this derivative are derived. Both start with the equation
and then divide by (
dτ)
2 to get
One expression for
dt/
dτ is obtained by rearranging terms in (9.10) to get
A second expression for the same derivative is obtained by using the chain rule to write (9.10) as
and rearranging terms to get
with
γ defined by the second equality in (9.11b), consistent with the definition in earlier sections. Combining (9.11a) with (9.11b) gives
when
dτ is defined by (9.10).
10. The Metric Tensor in the Traveling System
This section calculates the metric tensor in the traveling system. This metric, denoted
, is obtained by starting with the Lorentz metric, denoted
g, applicable to the home system and then transforming it via the transformation of a double covariant tensor. This transformation is defined by
where we use the notation
x0 =
ctE,
x1 =
xE,
x2 =
yE,
x3 =
zE, with corresponding notation for the traveler coordinates. The Lorentz metric is diagonal with
g0,0 = 1 and all other diagonal elements equal to −1, so (10.1) becomes
Recognizing the sum on the right as a three dimensional (spatial) vector dot product gives
It is evident from (3.1) that partial derivatives that treat
as independent variables (meaning that a partial derivative that varies one variable in the list does so with all others in the list held fixed) also treat
as independent variables. Therefore, a derivative with respect
can be expressed in terms of a derivative with respect to
by using
Using (3.1) and (2.3) to evaluate the derivative in the parenthesis on the right allows the above to be written as
Using (10.3) with (10.2), components of the metric tensor can be written as
where we shortened the notation by omitting the subscripts
E to the event coordinates
and
. The remaining off-diagonal elements of the metric tensor are implied by the symmetry condition
=
.
The next step is to calculate the derivatives in (10.4). One equation needed for this is (3.3a), which is written below with the
E subscripts omitted as
The other needed equation is (3.3b). It is convenient to write it in terms of parallel and perpendicular parts. Doing so while omitting the
E subscripts to shorten the notation gives
The first derivative to be calculated is obtained from (10.5), (9.8), and a direct application of the product rule which gives
which can also be written as
Now use
to write the above as
Next, use
to write the above as
Finally, use the identity (9.9) to write the above as
For the next derivative it is convenient to write (10.6) as
Using (9.6) and (9.8) gives
Regrouping terms and using and allows the above to be rewritten as
Once more we use
to write the above as
The next derivatives are simpler to calculate but some notation is needed. Let
e(x),
e(y), and
e(z) be the unit vectors (in the context of three dimensional Euclidean geometry) in the directions of the
x-axis,
y-axis, and
z-axis, respectively, in the traveling system. The subscripts appear in parentheses to emphasize that they are vector names or labels as opposed to vector components. Using this notation we easily obtain from (10.5) that
To calculate the remaining derivatives, we note that partial derivatives that hold
tP fixed, so they hold
n(
tP) fixed, commute with the taking of parallel parts or perpendicular parts. Specifically,
with analogous results for
and
derivatives. Using this fact with (10.6) easily gives
Various products of derivatives are prepared in advance to assist with the evaluation of the metric tensor. We first shorten the notation by defining
T-functions that are characterized by the property of being zero if either
or
A(
tP) are null vectors. They are defined by
where the perpendicular symbol is included as a reminder that the vector
is orthogonal (in the context of three-dimensional Euclidean geometry) to
n(
tP) (recall that
n(
tP) is the unit vector, in the context of three-dimensional Euclidean geometry, in the direction of
V(
tP)). With these definitions we can write (10.7a) and (10.7b) as
We now list several products of derivatives. We temporarily shorten the notation by not displaying the arguments of the various functions. Full notation will be restored in final results derived for the metric tensor. It is evident from (10.9a) that
Also, orthogonality between
and
n(
tP) together with (10.9b) gives
We next use (10.9a) with (10.7c) to get
Next use (10.9b) with (10.7f) while paying attention to which vectors are parallel to each other, which are orthogonal to each other, and use the fact that
=
to get
The next products considered are obtained from (10.7c) and (10.7f) which give
Expanding the product in the second equation gives
Or
so the above equation becomes
We now use the identity (9.9) to write the above as
or
The last pair of products considered are obtained from (10.7c), (10.7d), (10.7f), and (10.7g). These give
Expanding the product in the second equation gives
But we also have
or
so the above equation becomes
We now use the identity (9.9) to write the above as
or
We can now assemble various products of derivatives to produce the metric tensor. The two equations in (10.10) together with (10.4) give
Another component is obtained from the two equations in (10.11) together with (10.4) which give
where we used
=
=
. Expressing the parenthesis in terms of
γ gives.
Similarly
The next diagonal element,
, is obtained by combining the two equations in (10.12) with (10.4). Including analogous results for
and
we obtain
The off diagonal element
is obtained from the two equations in (10.13) together with (10.4). The result is obvious, and including analogous results for other terms gives
All other components of the metric tensor are implied by the symmetric condition = .
The equations in (10.14) give the components of the metric tensor as functions of
tP and
. However,
tP is itself a function of
, obtained by inverting (3.1) which is written below with the subscript
E omitted as
Recognizing this implicit dependence on , (10.14) gives the components of the metric tensor as functions of and .
Recall that the
T-functions are each zero if either
= 0 or
tP is a point at which
A(
tP) = 0. If either of these conditions are satisfied, so all
T-functions are zero, the metric reduces to the Lorentz metric. This is seen by a casual inspection for all elements except
. That
= 1 when all
T-functions are zero is seen by noting that (10.14a) reduces to
when all
T-functions are zero. This gives
= 1 via (9.9), confirming that the metric reduces to the Lorentz metric if either
= 0 or
tP is a point at which
A(
tP) = 0.
11. 4-Vectors
Previous sections explain how various quantities can be calculated when the given information is the trajectory of the traveler’s clock relative to the home system and expressed in terms of the home system coordinates. However, the next section gives an example in which this trajectory is not the given information. Instead, it is necessary to deduce this trajectory from other information that is given. Quantities called 4-vectors provide computational conveniences that make this deduction easier, so a review of 4-vectors is given in this section. This review emphasizes a distinction between vectors and vector components. Also, the notation used here uses bold font for 4-vectors, as was previously done for three-dimensional spatial vectors, but distinguishes between them by using cursive font for 4-vectors and block letters for three-dimensional spatial vectors.
A good illustrative example of a 4-vector is the 4-velocity, denoted
V in the present discussion (
V will be an arbitrary 4-vector in later discussions in this section) of some particle at some point on its worldline. This will be defined after explaining one of the distinctions between it and the three-dimensional velocity
V which is the derivative of spatial coordinates with respect to the time coordinate. The three-dimensional velocity of the same particle is a different vector in different reference frames that move relative to each other. In contrast, the 4-velocity is the same vector in different reference frames, it is only its components with respect to different basis vectors that are different. Therefore the 4-velocity is completely defined when we specify its components in any convenient set of basis vectors. It is convenient to use the basis vectors assigned to the home system to specify
V, but we must first decide on what the home system basis vectors will be. We will use the same basis vectors for the home system that were denoted
e(x),
e(y), and
e(z) that were used in the traveling system,
7 except that we include another vector
e(ct) in the direction of the time axis. We also change notation by using
e(0),
e(1),
e(2), and
e(3) in cursive font and with integers for indices, instead of
e(ct),
e(x),
e(y), and
e(z), to emphasize four dimensions. As discussed above, the 4-velocity is completely determined in all reference frames if we specify its components in the home basis vectors. It is therefore completely defined by the equation
where
dτ is an increment of proper time along the particle’s worldline, and we use the notation
The velocity vector was selected for illustration of a 4-vector but in the remainder of this section,
V is an arbitrary 4-vector. It is uniquely determined (as shown later) in all systems if we specify the numbers
V 0,
V 1,
V 2, and
V 3 satisfying
The coefficients to the basis vectors in such an expansion are called the contravariant components of
V.
8 We use the notation in which
V i denotes the
ith contravariant component in whatever inertial system that was selected to be called the unbarred system and that uses the basis vectors in (11.2). Therefore (11.2) can also be written as
Now consider the contravariant components in another system, called the barred system, that uses basis vectors
. Again, the contravariant components in that system are the coefficients in the expansion in those basis vectors, so if we let
denote the
ith contravariant component in the barred system we have
However, in order for contravariant components in different systems to be related by the transformations traditionally used to define contravariant components, the basis vectors in the barred system are not arbitrary. They are tangent vectors to coordinate curves and are uniquely determined after the coordinates in the barred system have been selected. Denoting these coordinates as
, or more briefly as
, the barred basis vectors, which can be functions of the barred coordinates, are defined by
where
x is defined by
so (11.5a) becomes
with the understanding that a summation that does not explicitly show the range of the index uses the range of 0 to 3 (the Einstein summation convention is not used because there are exceptions to the rule, and it is not difficult to include a summation symbol to avoid any possible confusion).
9
An important and convenient consequence of defining the barred basis vectors by (11.6) is that the transformation between basis vectors can easily be inverted to solve for the unbarred basis vectors in terms of the barred basis vectors. To perform this inversion, note that the Kronecker delta can be expressed as
When the selected spacetime coordinates are in neighborhoods for which there is an invertible transformation between barred and unbarred coordinates, the chain rule applied to the above gives
Now multiply both sides of (11.6) by
/
and sum in
i to get
Next, we use (11.7) to write this as
Changing dummy symbols gives
which is the inverse transform between basis vectors.
We can now confirm that contravariant components satisfy the transformation typically used to define contravariant components. Combine (11.3) with (11.4) to get
Now substitute (11.8) into the right side of (11.9) and interchange dummy symbols to get
The barred basis vectors are linearly independent so the above gives
which is the transformation typically taken to be the definition of contravariant vector components.
Covariant vector components can also be defined but to help with notation we start with a definition of the solid dot product denoted
. This dot product between two arbitrary 4-vectors
V and
U is defined by
where the superscripts on the right denote contravariant components in the unbarred system, and
g is the metric tensor in the unbarred system. The unbarred system is inertial and rectangular, so its metric tensor is the Lorentz metric, which allows us to rewrite (11.11) as
However, the notation in (11.11a) is more convenient than the notation in (11.11b) for deriving an expression for the dot product in the barred system. Recall that the barred metric tensor is defined by the transformation (10.1), and we already established the transformation (11.10) for contravariant components of 4-vectors. We can show via (11.7) that the composite of these transformations applied to the right side of (11.11a) produces
In summary, three expressions derived (so far) for solid dot products are given by the three equations in (11.11).
10
Special interesting cases are dot products between basis vectors. Recall that the contravariant component
is the
jth coefficient in the expansion of
in the unbarred basis vectors. Similarly, the contravariant component
is the
jth coefficient in the expansion of
in the barred basis vectors.
11 Therefore we have
Also, the definition of contravariant components as coefficients in expansions together with (11.6) and (11.8) give
where a coordinate dependence of
is indicated and is due to a coordinate dependence of the basis vectors. Substituting (11.12a) into (11.11a) for the unbarred case, and into (11.11c) for the barred case, gives
Listing the individual products for the unbarred case gives
Note that the middle equation is one of those exceptions to the Einstein summation convention and is an example of the reason for not using that convention.
The covariant components of an arbitrary 4-vector
V in the unbarred and barred systems, with the
ith component denoted
in the unbarred system and denoted
in the barred system, are defined by
The transformation between barred and unbarred covariant components is easily derived by starting with (11.6) to get
and using (11.15) gives
which is the transformation traditionally used to define covariant vector components.
12
Covariant and contravariant components of vectors can be related to each other by using (11.11) to get
Substituting (11.12a) and (11.15) into the above gives
where we used the fact that the metric tensors are symmetric.
Several expressions for dot products were listed in (11.11). Two more expressions are obtained by combining (11.17a) with (11.11a) and combining (11.17c) with (11.11c). The results are
Now consider an arbitrary worldline, the trajectory in spacetime of some moving point, as seen by the inertial unbarred system. Using equations in
Section 2, an increment of proper time along this worldline can be obtained by taking the differential of (2.3) and then using (2.4) to get
which can also be written as
If we now define the 4-vector
and use (11.11b) to express
and compare that expression to the right side of (11.19) we find that (11.19) can be written as
One implication of (11.20) is seen by using (11.11c) to write (11.20) as
If we now take the barred system to be attached to the moving point, the differentials of spatial coordinates are zero so the above reduces to
It was concluded at the end of
Section 11 that if we take the origin (the location of the clock) of the barred system to be attached to the moving point (the spatial coordinates of the point are zero in the barred system) then
= 1 so the above becomes
This is consistent with the conclusion already used in
Section 2, but obtained from the hypothesis (1.1), that the time coordinate of the traveling system at any given point on the worldline of the systems clock is equal to the proper time assigned to that point on the worldline of the systems clock.
A second implication of (11.20) is that
dτ is a scaler invariant. This implies that if
V is an arbitrary 4-vector defined on each point on the given worldline, the derivative
dV /
dτ, which is a derivative on a curve, is another 4-vector. From this fact we can derive the product rule for derivatives of dot products. Using the expression (11.11a) for dot products, together with the ordinary product rule and the fact that the Lorentz metric is constant, we obtain
We next use the fact that
and
are contravariant components of 4-vectors in the unbarred system. This fact together with (11.11a) gives
Combining (11.22) with (11.23) produces the product rule
Note that expressing components in the unbarred system is the simplest way to derive (11.24). If components were expressed in the barred system, there would be two complications. First, the metric tensor in the barred system need not be constant so there would be a third term on the right side of the equation analogous to (11.22). Second, and are not in general contravariant components of 4-vectors in the barred system, so the equation analogous to (11.23) is incorrect. Hence the unbarred system provides the simplest derivation of (11.24).
The previous paragraph stated that
and
are not in general contravariant components of 4-vectors in the barred system. It is interesting to find out what the contravariant components of
dV /
dτ are in the barred system. These components can be derived by using the product rule with (11.4) to get
Note that (11.6) together with the chain rule gives
To express this in terms of the barred basis vectors we substitute (11.8) into the right side of the above to get
To shorten the notation, define the Christoffel symbol by
so (11.26) becomes
From the definition of contravariant components as coefficients in expansions, we conclude from (11.28) that
Also, substituting (11.28) into (11.25) gives
where we define
From the definition of contravariant components as coefficients in expansions, we conclude from (11.30) that
12. An Example: Constant Acceleration Felt by Traveler
The example in this section is one in which the velocity of the traveler relative to the home system and expressed as a function of time in the home system is not the immediately given information. Instead, this must be deducted from other information that will be given. Invariance of solid dot products between 4-vectors facilitates this deduction so the given information will be a statement about the 4-vector version of acceleration. This 4-vector, denoted
A, is defined in general in terms of the home system basis vectors by
The example in this section is the special case in which the acceleration has a constant spatial direction, say in the
x-direction. The remaining given information, which implies (after some deduction to follow) the traveler worldline as expressed in home coordinates, is
where
a is a positive constant. Expressing the dot product in any inertial system produces
where we now write
x instead of
x1. An alternate way to express (12.3) is obtained by first using (9.11a) to derive an expression for
d2t/
dτ2, which is obtained from
and then substitute this into (12.3) and combine some terms to get
From (12.4) we can recognize the physical interpretation of the example considered here. This is seen by letting the acceleration refer to the traveler’s system and letting the inertial system be the
P-system, the inertial system momentarily at rest in the traveler’s system. In the
P-system, and at the time it is at rest in the traveler’s system, the velocity
dx/
dt is zero, and there is no distinction between
τ derivatives and
t derivatives, so (12.4) reduces to
If the traveler is carrying an accelerometer, it will measure an acceleration equal to
a when at rest in the
P-system, but the point
P is an arbitrary point on the traveler’s worldline, so the physical interpretation of this example 4-vector
A is that it produces the case in which the traveler feels (or measures via an accelerometer) a constant acceleration equal to
a [
11].
The next application of (12.4) is to determine the traveler’s worldline as seen in the home system. Taking the sign of
a to be the same as the sign of
d2x/
dτ2, (12.4) gives
Endpoint conditions are not important to the analysis as long as singularities are avoided as
a → 0. A particular choice that avoids singularities takes the endpoint conditions to be
x = 0 and
dx/
dτ = 0 at
τ = 0. The solution to (12.5) subject to these endpoint conditions is easily verified to be
Substituting this into (9.11a) gives
The solution satisfying
t = 0 when
τ = 0 is
Note that if we take the limit as the acceleration a → 0 we find, via L’Hôpital’s rule, that t → τ while x → 0 for all τ, which is the expected result for the selected initial conditions without acceleration.
An identity relating hyperbolic functions allows (12.6a) to be written as
Combining this with (12.6b) gives
A plot of
x versus
t produced by (12.7) produces a hyperbola, so this example is often called hyperbolic motion [
11]. Another important quantity for this example is the coordinate velocity (not a 4-vector) defined to be
dx/
dt, which we find by differentiating (12.7) to be given by
Also, substituting (12.8) into (9.11b), using the second equality in (9.11b) gives
As a reminder that the calculated
x and
dx/
dt refer to the worldline of the traveler’s clock, we go back to the notation in
Section 3 and
Section 6. Instead of (12.7) we write
Instead of (12.8) we write
Instead of (12.9) we write
and instead of (12.6b) we write
When difficulties are encountered regarding the existence of traveler coordinates to be calculated for a given spacetime point, i.e., calculated from a given set of home coordinates, they are encountered in the first step of the calculation which is to solve (6.6) for
tP. The example in this section provides an illustration. Substituting (12.10) into (6.6), the equation to be solved for
tP becomes
First consider the case in which the parenthesis on the right side of (12.11) is positive. It is easy to show that this condition makes the right side of (12.11) a strictly increasing function of
tP, implying that the solution of (12.11) for
tP is unique if it exists. Existence requires that the left side,
tE, be in the range of the function of
tP on the right. The function on the right is zero at
tP = 0, and it is easy to show that the function on the right asymptotically approaches the value
c/
a +
xE/
c as
tP → ∞. We therefore conclude that:
The upper bound for
tE in (12.12) can be explained in terms of time dilation. Imagine a clock stationary at the location
xE in the home system. This clock as seen by the traveler runs slower and slower as the traveler speeds up in the home system, in such a way that the clock display, as seen by the traveler, asymptotically approaches a finite limiting value. This limiting value is the upper bound in (12.12). A more rigorous discussion of this time dilation can be given, for the example worldline considered in this section, as follows. Recall that (3.4) is the version of time dilation in which the home observer is observing the traveler’s clock. For the example in this section, we are now able to calculate the version of time dilation in which the traveler is observing a home clock located at
xE. We start by combining (6.8a) with (12.10d) and invert the equation to get
Substituting (12.13) into (12.11) and using some identities for hyperbolic functions gives
If we now let the event
E be a given display on a home clock located at
xE, then
tE and
xE become the home coordinates of that event. If
tE +
dtE denotes the time coordinate of the next tick (the next event) of the home clock, with the clock still at the spatial coordinate
xE, we relate
dtE to the time increment
d, that the traveler sees to be the time between ticks of the home clock, by differentiating (12.14) while holding
xE fixed (as opposed to holding
fixed as was done when deriving (3.4b)). The result is
Some identities for hyperbolic functions allow (12.14) to be rearranged into
and combining the two above equations gives
This becomes a time dilation when tE is sufficiently close to the upper bound in (12.12) so that the right side of (12.15) is greater than 1, which is seen as follows. Recall that tE is a display on the home clock. When the right side of (12.15) exceeds 1, the time seen by the traveler that is needed to change the display of the home clock is greater than dtE, which is the change of the display of the home clock and therefore also equal to the time seen by the home observer needed to change the display of the home clock. A larger time between displays corresponds to a slower clock, so the traveler sees the home clock to be running slow when the home clock display is sufficiently close to the allowed upper limit.
Note a lack of symmetry between (3.4b) and (12.15) for the example acceleration considered in this section. When making this comparison we must be careful to recognize that tE has a different meaning in the derivation of (3.4) than in the derivation of (12.15) and satisfies different equations for the two cases (e.g., tE = tP for the former case but not the latter) because the event E has a different meaning for the two cases. In the derivation of (3.4), the event E is a given display on the traveler’s clock. In the derivation of (12.15), the event E is a given display on a home clock located at xE. Hence, we must use caution when comparing (3.4b) to (12.15) by recognizing that terms represented by the same symbols do not have the same meanings. However, despite this subtle issue, there is still an obvious distinction between (3.4b) and (12.15). The expression in (3.4b) does not depend on the relative distance between clocks when all cases being compared have the same relative velocity between clocks. In contrast, the expression in (12.15), which is a time dilation pertaining to a home clock located at xE, does depend on the location xE of the home clock.
The upper bound for
tE in (12.12), which limits the spacetime points that can be assigned coordinates in the traveler’s system, can be removed by allowing the acceleration to persist for only a finite time, with the traveler having a constant velocity after that time. This was explained in
Section 8. It was also explained there that there exist home coordinates at which the transformation to traveler’s coordinates is not unique.
13. Metric Tensor for the Constant Acceleration Felt by Traveler
The next section has a need for the zero-zero component of the metric tensor in the traveler’s system and the goal of this section is to calculate the metric tensor for the acceleration example considered. This quantity depends on the space-time point of evaluation and is most conveniently expressed in the traveler’s coordinates of a given point. While
Section 2 through
Section 7 and
Section 12 applied
E subscripts to coordinates to emphasize that they are coordinates of arbitrary events, we shorten the notation here by omitting those subscripts, so it is understood here that
are the traveler’s coordinates of an arbitrary event. When expressed as a function of these coordinates, the
i,
j component of the metric tensor is denoted
.
Section 10 showed explicit dependences that various quantities, used to construct the metric tensor, have on
, but the dependences on
was implicit, through a dependence on
which in turn is a function of
To explicitly show the dependences on
, note that when the travelers time coordinate
is the given quantity, as opposed to home coordinates,
is calculated from (12.13) (with the subscript
E omitted in the notation used here), i.e., calculated from
Quantities previously expressed in terms of can now be expressed explicitly in terms of via the substitution given by (13.1).
Quantities that must be evaluated to calculate the metric tensor include
γ(
tP) and
V(
tP), which, for the acceleration example considered here, were shown in
Section 12 to be given by
Another quantity that must be evaluated is the three-dimensional acceleration, which is the derivative of
V with respect to coordinate time in the home system, and can be calculated from
Using (13.2b) to calculate the derivative in the above gives
We now express the quantities in (13.2) in terms of
by using (13.1) together with some identities for hyperbolic functions to get
For the acceleration example considered here, all motion is along the
x-axis so the
T-functions defined by (10.8) reduce to
together with
and
. Using (13.3) to express
in terms of
gives
We see from inspection of (10.14) that when
and
, each component of the metric tensor in the traveler’s system equals the corresponding component of the Lorentz metric tensor except the zero-zero component, i.e.,
where
is the Lorentz metric. Also, when
and
, the zero-zero component given by (10.14a) reduces to
Substituting (13.3) into the above gives
14. Another Kind of Time Dilation and Doppler Effect
Two kinds of time dilation were previously discussed.
Section 3 discussed the case in which a given display on the traveler’s clock was the event, and the calculated quantity was the home observer’s time coordinate of this event (this is equivalent to saying that the calculated quantity is the display on the home clock that is simultaneous with a given display on the traveler’s clock when the home observer defines simultaneity).
Section 12 discussed the case in which a given display on the home clock was the event, and the calculated quantity was the traveler’s time coordinate of this event (this is equivalent to saying that the calculated quantity is the display on the traveler’s clock that is simultaneous with a given display on the home clock when the traveler defines simultaneity). We now consider a third case in which two clocks are not moving relative to each other, they are both stationary in the traveler’s system, but one clock is spatially displaced relative to the other. One clock, still called the traveler’s clock, is at the traveler’s origin so the traveler’s worldline is the worldline of that clock. The traveler is at the origin of the reference frame and uses this clock to define time coordinates of events. Displays on a second clock, called the displaced clock, are treated as events, and the quantity to be calculated is the traveler’s time coordinate of such an event. This is equivalent to calculating the display on the traveler’s clock that is simultaneous with a given display on the displaced clock when the traveler located at the traveler’s clock defines simultaneity.
A coordinate transformation between the traveler and a displaced traveler has not been derived but the above calculation can be performed by utilizing the metric tensor. Details are as follows. We start with the fact that an increment of time display, denoted
, on the displaced clock is equal to the increment of proper time, denoted
dτ, between nearby points on the worldline of the displaced clock, i.e.,
A second relevant fact is that the increment of proper time is a scaler invariant that can be calculated using the metric tensor of any convenient reference frame. Because the displaced clock is stationary in the traveler’s reference frame, it is convenient to use the traveler’s metric tensor for this calculation. Using the acceleration example of
Section 12 and
Section 13, let
denote the location of the displaced clock, along the
x-axis, in the traveler’s coordinate system. A given point on the worldline of the displaced clock has traveler’s coordinates
and a nearby point on the same worldline has coordinates
. There is no displacement of the spatial coordinate so the relationship between
and coordinate displacements reduces to
Combining this with (14.1) while using (13.4b) gives
If the displaced clock is displaced in the direction of the traveler’s acceleration, so is positive, we have , so the displaced clock appears to the traveler to be running fast. Displaced in the opposite direction, but not far enough to make the parenthesis in (14.3) negative, makes the displaced clock appear to the traveler to be running slow.
Now consider another situation in which the displaced clock is accompanied by a light source that is stationary in the traveler’s system and is at the same location as the displaced clock. There is no motion between the light source and traveler as seen by the accelerating traveler, but the light source and traveler are at different locations.
13 The goal is to derive a Doppler effect as seen by the traveler for this case. This effect is caused by acceleration, which is equivalent to gravity, so we will call this a gravitational Doppler effect. It is interesting to compare this to the effect, which we will call the Lorentz Doppler effect, seen by an inertial system when a light source moves relative to the system. The Lorentz Doppler effect is easily found in the literature, so details need not be included here, and we will merely focus on some points of comparison. The Lorentz effect is a combination of two effects. One is from consecutive wave peaks produced by a moving source being emitted at different locations relative to the observer, which results in different times of travel to the observer. These different travel times contribute to the different arrival times of the wave peaks at the observer location (the other contribution to the different arrival times is the wave period of the source which is the difference in wave peak emission times). This effect, taken by itself, would produce the Doppler effect derived from a nonrelativistic treatment of sound waves in which the observer is at rest in the medium and the light source moves relative to the medium. The second effect that contributes to the Lorentz Doppler effect is a time dilation, which modifies the result derived for sound waves. In contrast, the gravitational Doppler effect is much easier to derive because, as seen below, only time dilation is relevant.
Using the acceleration example of
Section 12 and
Section 13, let
denote the location of the light source, along the
x-axis, in the traveler’s coordinate system. Consider two consecutive wave peaks emitted by the light source. They are both emitted at the same location in the traveler’s system so they both have the same time of travel, as seen by the traveler, between emission and reaching the traveler. Therefore, according to the traveler, the difference in arrival times between the consecutive peaks, which is the wave period denoted
seen by the traveler, is the time difference, seen by the traveler, between emissions of consecutive wave peaks. Recall that
in (14.3) is an increment of time display of the displaced clock, while
is the amount of time seen by the traveler to produce that change in clock display. Let the change in displaced clock display
equal one clock tick, which is also the time between consecutive wave peak emissions produced by the accompanying light source, so
=
T where
T is the wave period in the reference frame of the light source. Because travel time of the light wave is irrelevant, the wave period
seen by the traveler is the difference
in traveler time coordinates between consecutive wave peak emissions, so
=
. Substituting this together with
=
T into (14.3) gives
The frequency
f in the reference frame of the light source and frequency
at the location of the traveler are related by the inverse relation
or