Preprint
Essay

This version is not peer-reviewed.

An Original Presentation of an Old Theory: Lorentz Transformation Generalized to Allow for an Accelerating Coordinate System

Submitted:

10 December 2024

Posted:

11 December 2024

You are already at the latest version

Abstract

The traditional version of the Lorentz transformation is used to relate the coordinates of an event (a point in spacetime) between different inertial coordinate systems in the absence of any gravitational effects. It is limited to those cases in which both reference frames are inertial, so they have a constant velocity relative to each other. This report utilizes a familiar postulate, regarding the equivalence between coordinates measured in an accelerating reference frame and coordinates measured in a reference frame instantaneously at rest with the accelerating frame, together with the traditional version of the Lorentz transformation to derive a generalized version that allows one of the reference frames to be accelerating. This is not a new topic but the presentation and all derivations in this report are the author’s own inventions. By defining suitable quantities and introducing suitable notation, the generalized version can be written in a way that is almost as simple as the traditional (constant velocity) version when calculating the coordinates of an event in an inertial system when given the coordinates in an accelerating system. Unfortunately, calculations of the inverse transformation, i.e., calculating the coordinates in the accelerating system when given coordinates in an inertial system, are more cumbersome. Worse yet, while a suitably selected history and future ensure the existence of an inverse transformation, there can exist spacetime points for which it is not unique. However, the metric tensor can be derived in the accelerating system for the general case and is included in this report. This is used to calculate time dilations and Doppler effects that are outside the scope of inertial coordinate systems.

Keywords: 
;  ;  ;  

1. Introduction

The traditional version of the Lorentz transformation is used to relate the coordinates of an event (a point in spacetime) between different inertial coordinate systems in the absence of any gravitational effects. It is limited to those cases in which both reference frames are inertial, so they have a constant velocity relative to each other. This report utilizes a familiar postulate (stated below in (1.1)) together with the traditional version of the Lorentz transformation to derive a generalized version that allows one of the reference frames to be accelerating. This is not a new topic but the presentation and all derivations in this report are the author’s own inventions. After starting with familiar first principles, the report is self-contained in that derivations are provided for all conclusions. This accounts for the scarcity of references. By defining suitable quantities and introducing suitable notation, the generalized version can be written in a way that is almost as simple as the traditional (constant velocity) version when calculating the coordinates of an event in an inertial system when given the coordinates in an accelerating system. Unfortunately, calculations of the inverse transformation, i.e., calculating the coordinates in the accelerating system when given coordinates in an inertial system, are more cumbersome. Worse yet, while a suitably selected history and future ensure the existence of an inverse transformation, there can exist spacetime points for which it is not unique. However, the metric tensor can be derived in the accelerating system for the general case and is included in this report. This is used to calculate time dilations and Doppler effects that are outside the scope of inertial coordinate systems.
This report frequently uses the phrase “as seen by an observer”. In the context of this report, the word “seen” does not refer to a literal visual image. A visual image of an event (an “event” is a point in spacetime) is received by an observer after a time delay, the time required for a light signal sent by the event to reach the observer. In the context of this report, the phrase “as seen by an observer” does not refer to a visual image but, instead, means that the observer assigned spacetime coordinates to the event.1 For example, the phrase “the observer sees a clock to be running slow” means that if the observer assigns a time coordinate to one tick of that clock, and another time coordinate to the next tick of that clock, the time between consecutive ticks is longer than the time between consecutive ticks of the observer’s own clock.
This report considers two observers. One, called the home observer, uses a clock at rest at the origin of his coordinate system, which is an inertial coordinate system (recognized to be inertial by freely moving particles having constant velocities).2 The other observer, called the traveler (with travel defined to be relative to home), uses a clock at the origin of his coordinate system, called the traveler’s system. The traveler’s system is not rotating but may have a translational acceleration relative to the home system. A spacetime point denoting the start of the traveler’s journey has an arbitrary initial spatial translation relative to the home system and an arbitrary initial velocity in the home system, but the clocks are synchronized so that the traveler’s time coordinate and home time coordinate of the start of the journey are both zero. The goal is to relate, for an arbitrary event, the traveler’s coordinates of that event to the home coordinates of the same event.
The traveler’s clock traces out a worldline as seen by the home observer. Information that is assumed to be known includes the three-dimensional (spatial) vector function, denoted X(t), which is a parameterization of the traveler’s worldline expressing the spatial coordinates x, in the home system, of a point on the traveler’s worldline, in terms of the coordinate time t in the home system.3 From this parametrization, the velocity of the traveler’s clock as seen by the home observer is the function V(t) calculated from V(t) = dX(t)/dt.
All conclusions in this report are derived from two postulates. One, taken from [1] with minor paraphrasing, is the statement:
Carefully   constructed   clocks   ( e . g . ,   atomic   clocks ) ,   when   accelerated ,   will   tick at   the   same   rate   as   unaccelerated   clocks   moving   momentarily   along   with   them , and   sufficiently   rigid   accelerated   measuring   rods   will   measure   the   same   lengths as   unaccelrated   rods   moving   momentarily   along   with   them .  
The second postulate that all results in this report are derived from is the Lorentz transformation applicable to the case in which the traveler has a constant velocity v as seen by the home observer. This is presented in various formats in [2] through [8] but the format preferred for this work is taken from [9] and given as follows. Consider an event with home coordinates denoted ( x 1 , t 1 ) and traveler coordinates denoted ( x ¯ 1 , t ¯ 1 ) , and a second event with home coordinates denoted ( x 2 , t 2 ) and traveler coordinates denoted ( x ¯ 2 , t ¯ 2 ) . The standard (constant velocity) Lorentz transformation in [9] relates the displacement between these events expressed in each of the two systems according to 4
t 2 t 1 = γ t ¯ 2 t ¯ 1 + v x ¯ 2 x ¯ 1 c 2
x 2 x 1 = x ¯ 2 x ¯ 1 + γ 1 x ¯ 2 x ¯ 1 n n + γ v t ¯ 2 t ¯ 1
where the parameter γ and the unit vector n are defined by
γ 1 1 v 2 c 2 , n v v

2. Analysis

Let P denote an arbitrary spacetime point on the traveler’s worldline. The coordinates of P in the home system are denoted ( x P , t P ) , and the coordinates in the traveler’s system are denoted ( x ¯ P , t ¯ P ) . The traveler’s clock is at the traveler’s origin and the worldline considered here is the worldline of the traveler’s clock so
x ¯ P = 0
Also, even if the traveler’s clock is accelerating, the hypothesis (1.1) states that it is still true that increments of proper time for displacements along the worldline of the traveler’s clock equal increments of time measured by the traveler’s clock. With proper time and the traveler’s clock both set to zero when the traveler starts his journey, the proper time along the worldline up to the point P equals t ¯ P . The proper time is an invariant that can be calculated in the home system, giving
t ¯ P = τ t P
where τ is the proper time calculated in the home system according to
τ t = 0 t 1 γ ( ξ ) d ξ
with γ(t) defined by
γ ( t ) 1 1 V 2 ( t ) c 2
Now consider an inertial system that is “local” with the traveler’s system at the arbitrary point P on the traveler’s worldline. This statement is defined to mean that there exists a time at which the origins of the two systems are at the same location and with each axis of the two systems aligned, the two systems are at rest relative to each other at this time, the spacetime coordinates of the origins at this time are the spacetime coordinates of P, and the clock in the inertial system is set to t ¯ P at this time. We will call this new inertial system the P-system. We use the postulate, stated in (1.1), that the traveler’s clock will momentarily (while at P) tick at the same rate as the P-system’s clock, and the traveler’s measuring rods will momentarily (while at P) measure the same lengths as the P-system’s measuring rods. To utilize this local inertial system in the analysis we use the following approach. Instead of selecting an arbitrary event and attempting to calculate traveler coordinates of it, we work in the opposite direction by selecting traveler coordinates and then identify what the event is that has those coordinates (this identification can be done by finding the home coordinates). Because the spacetime point P is arbitrary, it is sufficiently general to confine our attention to those events that the P-system declares to be simultaneous with the event P. The event to be constructed will be called E, with traveler coordinates denoted ( x ¯ E , t ¯ E ) and home coordinates denoted ( x E , t E ) . With the P-system declaring E to be simultaneous with P, we conclude from the hypothesis (1.1) that the traveler also declares E to be simultaneous with P, 5 implying that
t ¯ E = t ¯ P
A constraint that the event E must satisfy to be treated in this analysis is that there must exist a point P on the traveler’s worldline such that the inertial system local with the traveler’s system at P sees E to be simultaneous with P. Given that E is such an “allowed” event, so that E is simultaneous with P as seen by the P-system, the coordinates of E in the P-system are the same as the coordinates in the traveler’s system. Therefore, the coordinates in the P-system are also ( x ¯ E , t ¯ E ) . Also, the coordinates of P in the P-system are ( x ¯ P , t ¯ P ) = ( 0 , t ¯ P ) . The P-system has velocity V(tP) as seen in the home system, so the transformation from barred coordinates to unbarred coordinates is obtained from (1.2) by replacing subscripts (with 2 replaced by E and 1 replaced by P), replacing v with V(tP), and replacing γ with γ(tP). The results, after simplification by using (2.1) and (2.5) are
t E t P = γ t P V t P x ¯ E c 2
x E x P = x ¯ E + γ t P 1 x ¯ E n n

3. Summary and Discussion of the Transformation to Home Coordinates

Information assumed to be available includes the functions X(t) and V(t) explained in Section 1. From these we construct the functions γ(t) and τ(t) via (2.4) and (2.3), so all these functions, which refer to the traveler’s worldline as seen by the home observer, are regarded as given. The goal here is to calculate the home coordinates of an arbitrary event E when the given information, in addition to the above functions, consists of the traveler’s coordinates ( x ¯ E , t ¯ E ) . The following steps are used:
The first step calculates tP. This is done by combining (2.5) with (2.2) so tP is calculated from the given t ¯ E via
t ¯ E = τ t P
Note that γ(t) is a strictly increasing function so there exists a unique tP satisfying (3.1) providing only that t ¯ E is between zero and the largest display of the traveler’s clock (finite if its worldline has a stopping point). With tP now solved, we calculate xP via
x P = X t P
With this done, all terms appearing in (2.6) have been evaluated except for the terms, xE and tE, to be solved. They can now be solved by writing (2.6) as
t E = t P + γ t P V t P x ¯ E c 2
x E = x P + x ¯ E + γ t P 1 x ¯ E V t P V t P V 2 t P
It is interesting that (3.3a) implies that tE = tP if either V(tP) = 0 or x ¯ E = 0. This could have been anticipated even before deriving (3.3). Recall that the events P and E are always simultaneous as seen by the P-system, because this is a condition that was imposed on E. If V(tP) = 0, the P-system is stationary relative to home, so P and E are also simultaneous as seen by the home system. Therefore, we could have anticipated that tE = tP when V(tP) = 0 even before deriving (3.3). Now suppose x ¯ E = 0. Recall again that the event E is simultaneous with P as seen by the P-system, and therefore also as seen by the traveler, so they have the same time coordinates in the traveler’s system. Also, P is on the traveler’s worldline, so its spatial coordinates are zero in the traveler’s system. Therefore, x ¯ E = 0 if and only if P and E have the same spacetime coordinates in the traveler’s system, in which case they are the same spacetime points and therefore have the same spacetime coordinates in all systems. Therefore, we could have anticipated that tE = tP and xE = xP when x ¯ E = 0 even before deriving (3.3).
A topic that can be discussed here is time dilation. However, there are two versions of time dilation because the traveler and home observer can disagree on simultaneity.6 Specifically, they can disagree on which readings on their two clocks are simultaneous. One way to clearly state which version of time dilation is being considered starts with the realization that a clock display is not only a time coordinate for some observer, but it can also be regarded as an event (think of a 5:00 o’clock whistle which is clearly an event). The version of time dilation that is being considered is clearly stated when stating which clock is the one whose displays are the events for which coordinates are to be determined. Here we discuss the version of time dilation in which displays on the traveler’s clock are not only traveler time coordinates of the displays but also events. The goal is to calculate home time coordinates of these events, so we will calculate how fast the traveler’s clock is ticking as seen by the home observer. This calculation is very simple. The traveler’s worldline is taken to be the worldline of the traveler’s clock (the clock defines the traveler’s origin) so traveler clock displays are events for which x ¯ E = 0. From the previous paragraph we conclude that tE = tP so (3.1) becomes
t ¯ E = τ t E ( when   the   events   are   traveling   clock   displays   denoted   t ¯ E )
which can be inverted via (2.3) and (2.4) to solve for tE. Also, differentiating (3.4a) while using (2.3) gives
d t E d t ¯ E = γ t E ( when   the   events   are   traveling   clock   displays   denoted   t ¯ E )
The instantaneous result (3.4b) is the same simple result that is obtained without acceleration. This is a time dilation in that dtE, which is the time seen by the home observer that is needed to change the display of the traveler’s clock, is greater than d t ¯ E , which is the change of the display of the traveler’s clock and therefore also equal to the time seen by the traveler needed to change the display of the traveler’s clock. A larger time between displays corresponds to a slower clock, so the home observer sees the traveler’s clock to be running slow. Unfortunately, the version of time dilation in which displays on the home clock are events for which traveler time coordinates are to be calculated, i.e., that calculates how fast the home clock is ticking as seen by the traveler, is not so simple but an example is given in Section 12.

4. A Check for Consistency

Note that (3.1) through (3.3) are as simple to write as (1.2) while having enough generality to apply to an accelerating traveler. However, (3.1) through (3.3) have a different format than (1.2). As a check for consistency, we show that the two formats are equivalent for the case in which the traveler has a constant velocity so that (1.2) also applies. Specifically, we will show that (3.1) through (3.3) produce (1.2) when the traveler’s velocity is constant.
Given a constant traveler velocity there is no loss of generality by orienting the coordinate axis so that the velocity is in the x-direction with x-component denoted v. In this case, γ is a constant given by (1.3), so the definition (2.3) of τ together with (3.1) gives
t P = γ t ¯ E ( constant   velocity )
Also, the vector equation (3.2) has components
x P = v t P , y P = 0 , z P = 0 ( constant   velocity )
where we treat the case in which the traveler starts the journey from the home origin. The above can be written via (4.1) as
x P = γ v t ¯ E , y P = 0 , z P = 0 ( constant   velocity )
Next, the scaler equation (3.3a) becomes
t E = t P + γ v x ¯ E c 2 ( constant   velocity )
while the vector equation (3.3b) has the components
x E = x P + γ x ¯ E , y E = y ¯ E , z E = z ¯ E , ( constant   velocity )
The last step substitutes (4.1) into (4.3) and uses (4.2) together with (4.4) to get
t E = γ t ¯ E + v x ¯ E c 2 ( constant   velocity )
x E = γ x ¯ E + v t ¯ E , y E = y ¯ E , z E = z ¯ E , ( constant   velocity )
which is the familiar format for expressing the Lorentz transformation applicable to inertial reference frames.

5. A Simple Application (Twin Paradox)

A simple application of the transformation given by (3.1) through (3.3) is the case in which the traveler departs from home with a constant speed v but changes direction as needed to return home. The goal is to compare clock readings (home versus traveler) when the traveler returns home. This comparison was already given by the time-dilation equation (3.4a) but the goal here is to provide another derivation of the same result. We first continue with the derivation that starts with (3.4a). When v is constant this equation reduces to
t E = γ t ¯ E ( constant   v ,   E   is   display   of   t ¯ E   on   traveler s   clock )
In (5.1), the display t ¯ E on the traveler’s clock is the event (also the traveler’s time coordinate of the event) while tE is the home time coordinate of the event.
Next, we use a different derivation of the same result (5.1) just to demonstrate consistency. Instead of the event E consisting of the traveler’s clock displaying the reading t ¯ E , which is the interpretation of E in (3.4) and (5.1), here we take the event E to be the traveler’s return home. In this interpretation, t ¯ E is the traveler’s time coordinate of his return home while tE is the home time coordinate of the traveler’s return home. The first step in the analysis notes that a constant γ in (2.3) together with (3.1) gives
t P = γ t ¯ E ( constant   v )
But the event E is a point on the traveler’s worldline, so x ¯ E = 0 and (3.3a) gives
t E = t P ( E   on   traveler s   world   line )
Combining (5.3) with (5.2) produces agreement with (5.1).

6. The Inverse Transformation

When traveler coordinates of an event are given, the home coordinates are obtained from a simple procedure in Section 3. We now consider the case in which the home coordinates are given, and the goal is to calculate the traveler’s coordinates. Using the transformation in either direction, one of the unknowns to be solved is tP. When the traveler coordinates are given, tP is solved via (3.1). When the home coordinates xE and tE are the givens, we need to construct an equation that contains the unknown tP with all other terms known. This is done by taking the dot product of (3.3b) with V(tp) to get
V t P x E = V t P x P + V t P x ¯ E + γ t P 1 x ¯ E V t P = V t P x P + γ t P x ¯ E V t P
which gives
γ t P V t P x ¯ E = V t P x E x P
Substituting this into (3.3a) while using (3.2) gives
t E = t P + 1 c 2 V t P x E X ( t P )
With the parameters tE and xE given, and the functions V and X given, (6.1) is the equation governing tP. An unfortunate property of (6.1) is that there are example traveler worldlines in which (6.1) does not have a unique solution for tP for some inputs tE and xE. In some cases, no solution exists. For some other cases, solutions exist but are not unique. This is discussed in more detail in Section 8. For the remainder of this discussion we assume that event E is one in which (6.1) has a unique solution for tP and this solution has been found (by some numerical root finding routine if necessary).
Given that tP satisfying (6.1) can be found and has been found, we calculate t ¯ E via (3.1). The last step of the inverse transformation solves for x ¯ E as follows. To shorten the notation, we define, for an arbitrary vector W, a parallel part W and a perpendicular part W , with respect to the direction of V(tP), according to
W W V t P V t P V 2 t P , W W W
Using this notation we can write (3.3b) as
x E x P = x ¯ E + γ t P 1 x ¯ E , = x ¯ E , + γ t P x ¯ E ,
The parallel part of the left side equals the parallel part of the far right side, and the perpendicular part of the left side equals the perpendicular part of the far right side so
x E x P = γ t P x ¯ E , , or x ¯ E , = 1 γ t P x E x P
and
x ¯ E , = x E x P
Now express x ¯ E as the sum of the parallel part plus perpendicular part while using (6.4) to get
x ¯ E = x ¯ E , + x ¯ E , = x E x P + 1 γ t P x E x P
Now use the second equation in (6.2) to write this as
x ¯ E = x E x P x E x P + 1 γ t P x E x P = x E x P + 1 γ t P 1 x E x P
Finally, we use the first equation in (6.2) to write this as
x ¯ E = x E x P + 1 γ t P 1 x E x P V t P V t P V 2 t P
Before summarizing the above results, we digress by noting two interesting properties of (6.1). One property is that tE = tP if V(tP) = 0. This could have been anticipated even before deriving (6.1) as already explained in Section 3. The second interesting property deserves more discussion. That property is the implication that if xE = X(tP) then tE = tP. This could have been anticipated even before deriving (6.1) as follows. Suppose xE = X(tP), i.e., xE = xP. Then the events E and P have the same spatial coordinates in the home system. They also have the same time coordinates in the P-system (by choice of point P) so it is not surprising that E and P are the same spacetime point, implying tE = tP. This conclusion is made rigorous by (6.1), showing that it is true that the condition xE = X(tP) implies tE = tP. Therefore, the condition xE = X(tP) implies the simultaneous conditions xE = xP and tE = tP, which implies that E and P are the same spacetime points, which implies that they have the same coordinates in all systems. A variety of implications follow by combining this implication with others already stated (e.g., that the event E is on the traveler’s worldline if and only if x ¯ E = 0, and the conclusion from Section 3 that tE = tP and xE = xP when x ¯ E = 0). Putting the implications all together, we obtain the following summary of implications:
x E = X t P   if   and   only   if   E   is   the   same   spacetime   point   as   P if   and   only   if   E   is   a   point   on   the   traveler s   worldline   if   and   only   if x ¯ E = 0 . , If   any   of   the   above   are   satisfied   then   t E = t P . If   V t P = 0   then   t E = t P .
We now summarize the results derived in this section. It is assumed that we are given xE and tE, and the goal is to calculate x ¯ E and t ¯ E . The first step calculates tP from
t E = t P + 1 c 2 V t P x E X ( t P )
where it is understood that we are treating the case in which there exists such a tP (see Section 8 for more discussion). Having done this, the quantities X(tP), V(tP), γ(tP), and τ(tP) all become knowns quantities, via the given worldline of the traveler in the home system together with (2.3) and (2.4), in addition to xE and tE being known quantities. The next set of steps calculate xP, t ¯ E , and x ¯ E from
x P = X t P
t ¯ E = τ t P
x ¯ E = x E x P + 1 γ t P 1 x E x P V t P V t P V 2 t P

7. Another Check for Consistency

The goal of this section is to verify, merely as a check for mistakes, that (6.6) through (6.8) reduce to the familiar Lorentz transformation when the traveler has a constant velocity.
Given a constant traveler velocity there is no loss of generality by orienting the coordinate axis so that the velocity is in the x-direction with x-component denoted v. For this case, only x-components of terms in (6.6) have contributions and using X(tP) = vtP with (6.6) gives
t E = t P + 1 c 2 v x E v t P
with solution
t P = 1 1 v 2 c 2 t E v x E c 2
or
t P = γ 2 t E v x E c 2 ( constant   velocity )
Also, the vector equation (6.7) has the components
x P = v t P , y P = 0 , z P = 0 ( constant   velocity )
which can be written via (7.1) as
x P = v γ 2 t E v x E c 2 , y P = 0 , z P = 0 ( constant   velocity )
Next, the scaler equation (6.8a), together with a constant γ in (2.3) gives
t ¯ E = 1 γ t P
and using (7.1) gives
t ¯ E = γ t E v x E c 2 ( constant   velocity )
Finaly, the vector equation (6.8b) has the components
x ¯ E = 1 γ x E x P , y ¯ E = y E y P , z ¯ E = z E x P , ( constant   velocity )
Substituting (7.2) into this and simplifying terms via the identity
1 γ 2 + v 2 c 2 = 1
gives
x ¯ E = γ x E v t E , y ¯ E = y E , z ¯ E = z E , ( constant   velocity )
The transformation given by (7.3) is the familiar form for the case of a constant velocity.

8. Existence and Uniqueness of the Inverse Transformation

Whether or not the inverse transformation in Section 6 exists and is unique is completely determined by the very first step of the calculation, which is to solve (6.6) for tP. As pointed out in Section 2, a constraint that the event E must satisfy to be treated in this analysis is that there must exist a point P on the traveler’s worldline such that the inertial system local with the traveler’s system at P sees E to be simultaneous with P. There exists a solution for tP to (6.6) if and only if E is such an “allowed” event. However, it is also possible that E is such an allowed event, so there exists a point P satisfying the above condition, but such a point P is not unique. In this case there are multiple solutions for tP to (6.6). Lack of existence is possible if the traveler’s acceleration continues forever, and an example is given later in Section 12. However, it is shown in this section that if the acceleration always has a finite magnitude and has a finite time duration, solutions must exist but are not always unique.
The goal of this section is to show that if the acceleration always has a finite magnitude and has a finite time duration, there necessarily exists a solution for tP to (6.6) (albeit not necessarily unique). The first step towards this goal shortens the notation in (6.6) by defining the function F(tP,xE), a function of tP and the vector xE, by
F t P , x E t P + 1 c 2 V t P x E X ( t P )
so that (6.6) becomes
t E = F t P , x E
The traveler’s journey begins at t = 0 but nothing was said about his prior history. It is convenient to stipulate that, before starting the journey, the traveler was at rest in the home system, so V(tP) = 0 if tP < 0. Also, the acceleration is taken to have a finite time duration so there exists some cutoff time in the home system, denoted tC, such that the acceleration is zero when t > tC. We therefore have
V t P = V t C if t P > t C 0 if t P < 0
The cutoff for the velocity implies that X(tP) satisfies
X ( t P ) = X ( t C ) + V t C t P t C if t P > t C
Substituting (8.3) into (8.1) and regrouping terms for the case in which tP > tC gives
F t P , x E = 1 γ 2 t C t P t C + F t C , x E if t P > t C , t P + 1 c 2 V t P x E X ( t P ) if 0 t P t C , t P if t P < 0
where γ is defined by (2.4). It is evident from inspection of (8.4) that
F t P , x E as t P , and F t P , x E + as t P +
Given that the magnitude of acceleration is always finite, so the velocity is a continuous function of time, F(tP,xE) is a continuous function of each argument. Continuity together with the mapping property (8.5) implies that (8.2) has a solution for tP for any value of tE.
Having established that there necessarily exists a solution for tP to (6.6) if the acceleration always has a finite magnitude and has a finite time duration, the next question asks for the conditions in which the solution is unique under the assumed conditions regarding the acceleration. Note from (8.4) that F(tP,xE) is strictly increasing in tP when tP < 0 and when tP > tC. If (a big “if”) F(tP,xE) is strictly increasing in tP when 0 < tP < tC, then a solution for tP to (6.6) is unique. However, if (another big “if”) a solution for tP to (6.6) is a value of tP at which F(tP,xE) is strictly decreasing in tP, then this solution for tP is not unique. Therefore, the question of uniqueness is answered by considerations of whether F(tP,xE) is increasing or decreasing in tP. It will be shown below that (given a nonzero acceleration at some point in time) there necessarily exists home coordinates (tE, xE) for which a solution for tP to (6.6) is not unique. A sufficient condition, imposed on xE, for uniqueness will also be given. The arguments below will refer to the derivative of F(tP,xE) obtained from (8.1) and given by
t P F t P , x E = 1 1 c 2 V 2 t P + 1 c 2 A t P x E X ( t P )
We will show that there exist home coordinates (tE, xE) for which a solution for tP to (6.6) is not unique by constructing them. Select a tP for which A(tP) ≠ 0 but is otherwise arbitrary. Now select any xE satisfying
1 1 c 2 V 2 t P + 1 c 2 A t P x E X ( t P ) < 0 ( for   discussion )
Note that such an xE satisfying (8.7) necessarily exists when A(tP) ≠ 0. Also note, from (8.6), that this choice for xE makes F(tP,xE) strictly decreasing in tP at the selected value of tP. Now use (8.2) (equivalent to (6.6)) to define tE. These steps have constructed a set of home coordinates (tE, xE) such that F(tP,xE) strictly decreasing in tP at the solution tP, implying that the solution is not unique. The conclusion is that, given that there is some point in time at which the acceleration is not zero. There exist home coordinates (tE, xE) for which a solution for tP to (6.6) is not unique.
A fairly simple constraint imposed on the home coordinates (tE, xE) that is a sufficient condition for the solution for tP to (6.6) to be unique is for xE to be selected to make the right side of (8.6) positive for all tP. This constraint makes F(tP,xE) strictly increasing in tP for all tP, ensuring uniqueness.

9. Some Identities for Later Use

The next section calculates the metric tensor in the traveling system. That analysis is less cumbersome if some identities are available, and one goal of this section is to make them available. A few other miscellaneous identities useful in later sections are also derived.
The first goal is to derive expressions for the derivatives of parallel parts and perpendicular parts of arbitrary spatial vectors as they are defined in (6.2). This is accomplished in several steps. The first step calculates the derivative of the magnitude of the velocity vector. The second step uses this result to calculate the derivative of the unit vector in the direction of the velocity vector, and the last step calculates derivative of parallel and perpendicular parts of arbitrary vectors.
Note from (6.2) that parallel and perpendicular parts of an arbitrary spatial vector W can be written as
W W n t P n t P , W W W
where the unit vector n is defined by
n t P V t P V t P
The first step that is used to calculate derivatives of parallel and perpendicular parts of W notes that
2 V t P d V t P d t P = d V 2 t P d t P = d V t P V t P d t P = 2 V t P d V t P d t P = 2 V t P A t P
which gives
d V t P d t P = A t P n t P
Now use
d n t P d t P = d d t P V t P V t P = 1 V t P d V t P d t P V t P V 2 t P d V t P d t P = 1 V t P A t P n t P V t P d V t P d t P .
Substituting (9.3) into the above gives
d n t P d t P = 1 V t P A t P A t P n t P n t P
From the definition (9.1) of parallel and perpendicular parts, we recognize the square bracket in (9.4) as the perpendicular part of the acceleration so
d n t P d t P = 1 V t P A t P
The next step takes the derivative of the parallel part of an arbitrary vector W using
d W t P d t P = d d t P W t P n t P n t P = d W t P d t P n t P n t P + W t P d n t P d t P n t P + W t P n t P d n t P d t P
The first term on the right is seen to be the parallel part of the derivative of W. Substituting (9.5) into the second and third terms on the right produces
d W t P d t P = d W t P d t P + 1 V t P W t P A t P n t P + W t P n t P A t P
The derivative of the perpendicular part is obtained from
d W t P d t P = d W t P W t P d t P = d W t P d t P d W t P d t P
Substituting (9.6) into the above far right term while using
d W t P d t P = d W t P d t P d W t P d t P
gives
d W t P d t P = d W t P d t P 1 V t P W t P A t P n t P + W t P n t P A t P
Another identity useful in later sections is an expression for the derivative of γ(tP). Using (2.4) gives
d γ t P d t P = d d t P 1 V 2 ( t P ) c 2 1 / 2 = 1 2 c 2 1 V 2 ( t P ) c 2 3 / 2 d V 2 ( t P ) d t P = 1 2 c 2 1 V 2 ( t P ) c 2 3 / 2 d V ( t P ) V ( t P ) d t P
which finally gives
d γ t P d t P = 1 c 2 γ 3 t P V ( t P ) A ( t P )
Still another identity that will be useful later and is trivial to prove is
γ 2 t P 1 = 1 c 2 γ 2 t P V 2 t P
Now consider an arbitrary trajectory, or worldline, seen in an arbitrary inertial system. This could be the worldline of the traveler’s clock in the home system but not necessarily. Along this worldline we can define increments of proper time, denoted , and increments of coordinate time, denoted dt. From these increments we can define the derivative dt/, which is a derivative defined on a curve. Two different expressions for this derivative are derived. Both start with the equation
d τ 2 = d t 2 1 c 2 d x d x
and then divide by ()2 to get
1 = d t d τ 2 1 c 2 d x d τ d x d τ
One expression for dt/ is obtained by rearranging terms in (9.10) to get
d t d τ = 1 + 1 c 2 d x d τ d x d τ
A second expression for the same derivative is obtained by using the chain rule to write (9.10) as
1 = d t d τ 2 1 c 2 d t d τ d x d t d t d τ d x d t = 1 1 c 2 d x d t d x d t d t d τ 2
and rearranging terms to get
d t d τ = 1 1 1 c 2 d x d t d x d t = γ
with γ defined by the second equality in (9.11b), consistent with the definition in earlier sections. Combining (9.11a) with (9.11b) gives
1 1 1 c 2 d x d t d x d t = 1 + 1 c 2 d x d τ d x d τ
when is defined by (9.10).

10. The Metric Tensor in the Traveling System

This section calculates the metric tensor in the traveling system. This metric, denoted g ¯ , is obtained by starting with the Lorentz metric, denoted g, applicable to the home system and then transforming it via the transformation of a double covariant tensor. This transformation is defined by
g ¯ i , j = k , l = 0 3 x k x ¯ i x l x ¯ j g k , l for   i , j = 0 , 1 , 2 , 3
where we use the notation x0 = ctE, x1 = xE, x2 = yE, x3 = zE, with corresponding notation for the traveler coordinates. The Lorentz metric is diagonal with g0,0 = 1 and all other diagonal elements equal to −1, so (10.1) becomes
g ¯ i , j = x 0 x ¯ i x 0 x ¯ j k = 1 3 x k x ¯ i x k x ¯ j for   i , j = 0 , 1 , 2 , 3
Recognizing the sum on the right as a three dimensional (spatial) vector dot product gives
g ¯ i , j = c 2 t E x ¯ i t E x ¯ j x E x ¯ i x E x ¯ j for   i , j = 0 , 1 , 2 , 3
It is evident from (3.1) that partial derivatives that treat t ¯ E , x ¯ E , y ¯ E , and   z ¯ E as independent variables (meaning that a partial derivative that varies one variable in the list does so with all others in the list held fixed) also treat t P , x ¯ E , y ¯ E , and   z ¯ E as independent variables. Therefore, a derivative with respect t ¯ E can be expressed in terms of a derivative with respect to t P by using
t ¯ E = d t P d t ¯ E t P = d t ¯ E d t P 1 t P
Using (3.1) and (2.3) to evaluate the derivative in the parenthesis on the right allows the above to be written as
t ¯ E = γ t P t P
Using (10.3) with (10.2), components of the metric tensor can be written as
g ¯ 0 , 0 = γ 2 t P t t P 2 1 c 2 x t P x t P , g ¯ 0 , 1 = γ t P c t t P t x ¯ 1 c x t P x x ¯
g ¯ 0 , 2 = γ t P c t t P t y ¯ 1 c x t P x y ¯ , g ¯ 0 , 3 = γ t P c t t P t z ¯ 1 c x t P x z ¯
g ¯ 1 , 1 = c 2 t x ¯ 2 x x ¯ x x ¯ , g ¯ 1 , 2 = c 2 t x ¯ t y ¯ x x ¯ x y ¯
g ¯ 1 , 3 = c 2 t x ¯ t z ¯ x x ¯ x z ¯ , g ¯ 2 , 2 = c 2 t y ¯ 2 x y ¯ x y ¯
g ¯ 2 , 3 = c 2 t y ¯ t z ¯ x y ¯ x z ¯ , g ¯ 3 , 3 = c 2 t z ¯ 2 x z ¯ x z ¯
where we shortened the notation by omitting the subscripts E to the event coordinates x ¯ E , y ¯ E , z ¯ E and t E , x E , y E , and   z E . The remaining off-diagonal elements of the metric tensor are implied by the symmetry condition g ¯ i , j = g ¯ j , i .
The next step is to calculate the derivatives in (10.4). One equation needed for this is (3.3a), which is written below with the E subscripts omitted as
t = t P + γ t P V t P x ¯ c 2
The other needed equation is (3.3b). It is convenient to write it in terms of parallel and perpendicular parts. Doing so while omitting the E subscripts to shorten the notation gives
x = X t P + x ¯ + γ t P x ¯
The first derivative to be calculated is obtained from (10.5), (9.8), and a direct application of the product rule which gives
t t P = 1 + 1 c 4 γ 3 t P V ( t P ) A ( t P ) V t P x ¯ + 1 c 2 γ t P A t P x ¯
which can also be written as
t t P = 1 + 1 c 4 γ 3 t P V 2 ( t P ) n ( t P ) A ( t P ) n t P x ¯ + 1 c 2 γ t P A t P x ¯
Now use
A ( t P ) x ¯ = A ( t P ) n ( t P ) n ( t P ) x ¯ = n ( t P ) A ( t P ) n t P x ¯
to write the above as
t t P = 1 + 1 c 4 γ 3 t P V 2 ( t P ) A ( t P ) x ¯ + 1 c 2 γ t P A t P x ¯
Next, use
A t P x ¯ = A ( t P ) x ¯ + A ( t P ) x ¯
to write the above as
t t P = 1 + 1 c 2 γ t P γ 2 t P V 2 ( t P ) c 2 + 1 A ( t P ) x ¯ + 1 c 2 γ t P A ( t P ) x ¯
Finally, use the identity (9.9) to write the above as
t t P = 1 + 1 c 2 γ 3 t P A ( t P ) x ¯ + 1 c 2 γ t P A ( t P ) x ¯
For the next derivative it is convenient to write (10.6) as
x = X t P + x ¯ + γ t P 1 x ¯
So
x t P = V t P + γ t P t P x ¯ + γ t P 1 x ¯ t P
Using (9.6) and (9.8) gives
x t P = V t P + 1 c 2 γ 3 t P V ( t P ) A ( t P ) x ¯ + 1 V t P γ t P 1 x ¯ A t P n t P + x ¯ n t P A t P .
Regrouping terms and using x ¯ = x ¯ n ( t P ) n ( t P ) and V ( t P ) = V ( t P ) n ( t P ) allows the above to be rewritten as
Once more we use
x t P = V t P + 1 c 2 γ 3 t P V ( t P ) n ( t P ) A ( t P ) x ¯ n ( t P ) + 1 V t P γ t P 1 x ¯ A t P n t P + 1 V t P γ t P 1 x ¯ n t P A t P
to write the above as
x t P = V t P + 1 c 2 γ 3 t P V ( t P ) A ( t P ) x ¯ + 1 V t P γ t P 1 x ¯ A t P n t P + 1 V t P γ t P 1 x ¯ n t P A t P
The next derivatives are simpler to calculate but some notation is needed. Let e(x), e(y), and e(z) be the unit vectors (in the context of three dimensional Euclidean geometry) in the directions of the x-axis, y-axis, and z-axis, respectively, in the traveling system. The subscripts appear in parentheses to emphasize that they are vector names or labels as opposed to vector components. Using this notation we easily obtain from (10.5) that
t x ¯ = 1 c 2 γ t P e ( x ) V t P
t y ¯ = 1 c 2 γ t P e ( y ) V t P
t z ¯ = 1 c 2 γ t P e ( z ) V t P
To calculate the remaining derivatives, we note that partial derivatives that hold tP fixed, so they hold n(tP) fixed, commute with the taking of parallel parts or perpendicular parts. Specifically,
x ¯ x ¯ = x ¯ x ¯ , = e ( x ) , x ¯ x ¯ = x ¯ x ¯ , = e ( x )
with analogous results for y ¯ and z ¯ derivatives. Using this fact with (10.6) easily gives
x x ¯ = e ( x ) , + γ t P e ( x ) ,
x y ¯ = e ( y ) , + γ t P e ( y ) ,
x z ¯ = e ( z ) , + γ t P e ( z ) ,
Various products of derivatives are prepared in advance to assist with the evaluation of the metric tensor. We first shorten the notation by defining T-functions that are characterized by the property of being zero if either x ¯ or A(tP) are null vectors. They are defined by
T 1 t P , x ¯ 1 c 2 γ 3 t P A ( t P ) x ¯
T 2 t P , x ¯ 1 V 2 t P A ( t P ) x ¯
T t P , x ¯ 1 V t P γ t P 1 x ¯ n t P A t P
where the perpendicular symbol is included as a reminder that the vector T t P , x ¯ is orthogonal (in the context of three-dimensional Euclidean geometry) to n(tP) (recall that n(tP) is the unit vector, in the context of three-dimensional Euclidean geometry, in the direction of V(tP)). With these definitions we can write (10.7a) and (10.7b) as
t t P = 1 + T 1 t P , x ¯ + V 2 t P c 2 γ t P T 2 t P , x ¯
x t P = V t P 1 + T 1 t P , x ¯ + γ t P 1 T 2 t P , x ¯ n t P + T t P , x ¯
We now list several products of derivatives. We temporarily shorten the notation by not displaying the arguments of the various functions. Full notation will be restored in final results derived for the metric tensor. It is evident from (10.9a) that
t t P 2 = 1 + T 1 + γ V 2 c 2 T 2 2
Also, orthogonality between T t P , x ¯ and n(tP) together with (10.9b) gives
x t P x t P = V 2 1 + T 1 + γ 1 T 2 2 + T T
We next use (10.9a) with (10.7c) to get
t t P t x ¯ = 1 c 2 γ V 1 + T 1 + V 2 c 2 γ T 2 e ( x ) n
Next use (10.9b) with (10.7f) while paying attention to which vectors are parallel to each other, which are orthogonal to each other, and use the fact that n t P e ( x ) , = n t P e ( x ) to get
x t P x x ¯ = γ V 1 + T 1 + γ 1 T 2 n e ( x ) + T e ( x ) ,
The next products considered are obtained from (10.7c) and (10.7f) which give
t x ¯ 2 = 1 c 4 γ 2 V 2 e ( x ) n 2
x x ¯ x x ¯ = e ( x ) , + γ e ( x ) , e ( x ) , + γ e ( x ) ,
Expanding the product in the second equation gives
x x ¯ x x ¯ = e ( x ) , e ( x ) , + γ 2 e ( x ) , e ( x ) ,
But we also have
1 = e ( x ) e ( x ) = e ( x ) , + e ( x ) , e ( x ) , + e ( x ) , = e ( x ) , e ( x ) , + e ( x ) , e ( x ) ,
Or
e ( x ) , e ( x ) , = 1 e ( x ) , e ( x ) ,
so the above equation becomes
x x ¯ x x ¯ = 1 + γ 2 1 e ( x ) , e ( x ) ,
We now use the identity (9.9) to write the above as
x x ¯ x x ¯ = 1 + 1 c 2 γ 2 V 2 e ( x ) , e ( x ) , = 1 + 1 c 2 γ 2 V 2 e ( x ) n n e ( x ) n n = 1 + 1 c 2 γ 2 V 2 e ( x ) n 2
or
x x ¯ x x ¯ = 1 + 1 c 2 γ 2 V 2 e ( x ) n 2
The last pair of products considered are obtained from (10.7c), (10.7d), (10.7f), and (10.7g). These give
t x ¯ t y ¯ = 1 c 4 γ 2 V 2 e ( x ) n e ( y ) n
x x ¯ x y ¯ = e ( x ) , + γ e ( x ) , e ( y ) , + γ e ( y ) ,
Expanding the product in the second equation gives
x x ¯ x y ¯ = e ( x ) , e ( y ) , + γ 2 e ( x ) , e ( y ) ,
But we also have
0 = e ( x ) e ( y ) = e ( x ) , + e ( x ) , e ( y ) , + e ( y ) , = e ( x ) , e ( y ) , + e ( x ) , e ( y ) ,
or
e ( x ) , e ( y ) , = e ( x ) , e ( y ) ,
so the above equation becomes
x x ¯ x y ¯ = γ 2 1 e ( x ) , e ( y ) ,
We now use the identity (9.9) to write the above as
x x ¯ x y ¯ = 1 c 2 γ 2 V 2 e ( x ) , e ( y ) , = 1 c 2 γ 2 V 2 e ( x ) n n e ( y ) n n = 1 c 2 γ 2 V 2 e ( x ) n e ( y ) n
or
x x ¯ x y ¯ = 1 c 2 γ 2 V 2 e ( x ) n e ( y ) n
We can now assemble various products of derivatives to produce the metric tensor. The two equations in (10.10) together with (10.4) give
g ¯ 0 , 0 = γ 2 t P 1 + T 1 t P , x ¯ + γ V 2 t P c 2 T 2 t P , x ¯ 2 γ 2 t P V 2 t P c 2 1 + T 1 t P , x ¯ + γ t P 1 T 2 t P , x ¯ 2 γ 2 t P 1 c 2 T t P , x ¯ T t P , x ¯
Another component is obtained from the two equations in (10.11) together with (10.4) which give
g ¯ 0 , 1 = 1 c γ V γ 1 V 2 c 2 γ 2 T 2 n e ( x ) T e ( x )
where we used T e ( x ) = T e ( x ) , + e ( x ) , = T e ( x ) , . Expressing the parenthesis in terms of γ gives.
g ¯ 0 , 1 = 1 c γ t P V t P γ t P 1 T 2 t P , x ¯ n t P e ( x ) T t P , x ¯ e ( x )
g ¯ 0 , 2 = 1 c γ t P V t P γ t P 1 T 2 t P , x ¯ n t P e ( y ) T t P , x ¯ e ( y )
g ¯ 0 , 3 = 1 c γ t P V t P γ t P 1 T 2 t P , x ¯ n t P e ( z ) T t P , x ¯ e ( z )
Similarly
The next diagonal element, g ¯ 1 , 1 , is obtained by combining the two equations in (10.12) with (10.4). Including analogous results for g ¯ 2 , 2 and g ¯ 3 , 3 we obtain
g ¯ 1 , 1 = g ¯ 2 , 2 = g ¯ 3 , 3 = 1
The off diagonal element g ¯ 1 , 2 is obtained from the two equations in (10.13) together with (10.4). The result is obvious, and including analogous results for other terms gives
g ¯ 1 , 2 = g ¯ 1 , 3 = g ¯ 2 , 3 = 0
All other components of the metric tensor are implied by the symmetric condition g ¯ i , j = g ¯ j , i .
The equations in (10.14) give the components of the metric tensor as functions of tP and x ¯ . However, tP is itself a function of t ¯ , obtained by inverting (3.1) which is written below with the subscript E omitted as
t ¯ = τ t P
Recognizing this implicit dependence on t ¯ , (10.14) gives the components of the metric tensor as functions of t ¯ and x ¯ .
Recall that the T-functions are each zero if either x ¯ = 0 or tP is a point at which A(tP) = 0. If either of these conditions are satisfied, so all T-functions are zero, the metric reduces to the Lorentz metric. This is seen by a casual inspection for all elements except g ¯ 0 , 0 . That g ¯ 0 , 0 = 1 when all T-functions are zero is seen by noting that (10.14a) reduces to
g ¯ 0 , 0 = γ 2 t P γ 2 t P V 2 t P c 2
when all T-functions are zero. This gives g ¯ 0 , 0 = 1 via (9.9), confirming that the metric reduces to the Lorentz metric if either x ¯ = 0 or tP is a point at which A(tP) = 0.

11. 4-Vectors

Previous sections explain how various quantities can be calculated when the given information is the trajectory of the traveler’s clock relative to the home system and expressed in terms of the home system coordinates. However, the next section gives an example in which this trajectory is not the given information. Instead, it is necessary to deduce this trajectory from other information that is given. Quantities called 4-vectors provide computational conveniences that make this deduction easier, so a review of 4-vectors is given in this section. This review emphasizes a distinction between vectors and vector components. Also, the notation used here uses bold font for 4-vectors, as was previously done for three-dimensional spatial vectors, but distinguishes between them by using cursive font for 4-vectors and block letters for three-dimensional spatial vectors.
A good illustrative example of a 4-vector is the 4-velocity, denoted V in the present discussion (V will be an arbitrary 4-vector in later discussions in this section) of some particle at some point on its worldline. This will be defined after explaining one of the distinctions between it and the three-dimensional velocity V which is the derivative of spatial coordinates with respect to the time coordinate. The three-dimensional velocity of the same particle is a different vector in different reference frames that move relative to each other. In contrast, the 4-velocity is the same vector in different reference frames, it is only its components with respect to different basis vectors that are different. Therefore the 4-velocity is completely defined when we specify its components in any convenient set of basis vectors. It is convenient to use the basis vectors assigned to the home system to specify V, but we must first decide on what the home system basis vectors will be. We will use the same basis vectors for the home system that were denoted e(x), e(y), and e(z) that were used in the traveling system,7 except that we include another vector e(ct) in the direction of the time axis. We also change notation by using e(0), e(1), e(2), and e(3) in cursive font and with integers for indices, instead of e(ct), e(x), e(y), and e(z), to emphasize four dimensions. As discussed above, the 4-velocity is completely determined in all reference frames if we specify its components in the home basis vectors. It is therefore completely defined by the equation
V = d x 0 d τ e ( 0 ) + d x 1 d τ e ( 1 ) + d x 2 d τ e ( 2 ) + d x 3 d τ e ( 3 )
where is an increment of proper time along the particle’s worldline, and we use the notation
x 0 = c t , x 1 = x , x 2 = y , x 3 = z
The velocity vector was selected for illustration of a 4-vector but in the remainder of this section, V is an arbitrary 4-vector. It is uniquely determined (as shown later) in all systems if we specify the numbers V 0, V 1, V 2, and V 3 satisfying
V = V 0 e ( 0 ) + V 1 e ( 1 ) + V 2 e ( 2 ) + V 3 e ( 3 )
The coefficients to the basis vectors in such an expansion are called the contravariant components of V.8 We use the notation in which V i denotes the ith contravariant component in whatever inertial system that was selected to be called the unbarred system and that uses the basis vectors in (11.2). Therefore (11.2) can also be written as
V = V 0 e ( 0 ) + V 1 e ( 1 ) + V 2 e ( 2 ) + V 3 e ( 3 )
Now consider the contravariant components in another system, called the barred system, that uses basis vectors e ¯ ( 0 ) ,   e ¯ ( 1 ) ,   e ¯ ( 2 ) ,   and   e ¯ ( 3 ) . Again, the contravariant components in that system are the coefficients in the expansion in those basis vectors, so if we let V i ¯ denote the ith contravariant component in the barred system we have
V = V 0 ¯ e ¯ ( 0 ) + V 1 ¯ e ¯ ( 1 ) + V 2 ¯ e ¯ ( 2 ) + V 3 ¯ e ¯ ( 3 )
However, in order for contravariant components in different systems to be related by the transformations traditionally used to define contravariant components, the basis vectors in the barred system are not arbitrary. They are tangent vectors to coordinate curves and are uniquely determined after the coordinates in the barred system have been selected. Denoting these coordinates as ( x ¯ 0 , x ¯ 1 , x ¯ 2 , x ¯ 3 ) , or more briefly as 𝓍 ¯ , the barred basis vectors, which can be functions of the barred coordinates, are defined by
e ¯ ( i ) 𝓍 ¯ x 𝓍 ¯ i for   i = 0 , 1 , 2 , 3
where x is defined by
𝓍 = x 0 e ( 0 ) + x 1 e ( 1 ) + x 2 e ( 2 ) + x 3 e ( 3 )
so (11.5a) becomes
e ¯ ( i ) 𝓍 ¯ x 0 x ¯ i e ( 0 ) + x 1 x ¯ i e ( 1 ) + x 2 x ¯ i e ( 2 ) + x 3 x ¯ i e ( 3 ) = j x j x ¯ i e ( j ) for   i = 0 , 1 , 2 , 3
with the understanding that a summation that does not explicitly show the range of the index uses the range of 0 to 3 (the Einstein summation convention is not used because there are exceptions to the rule, and it is not difficult to include a summation symbol to avoid any possible confusion).9
An important and convenient consequence of defining the barred basis vectors by (11.6) is that the transformation between basis vectors can easily be inverted to solve for the unbarred basis vectors in terms of the barred basis vectors. To perform this inversion, note that the Kronecker delta can be expressed as
δ k , j = x j x k
When the selected spacetime coordinates are in neighborhoods for which there is an invertible transformation between barred and unbarred coordinates, the chain rule applied to the above gives
δ k , j = i x j x ¯ i x ¯ i x k
Now multiply both sides of (11.6) by x ¯ i / x k and sum in i to get
i x ¯ i x k e ¯ ( i ) 𝓍 ¯ = j , i x j x ¯ i x ¯ i x k e ( j )
Next, we use (11.7) to write this as
i x ¯ i x k e ¯ ( i ) 𝓍 ¯ = j δ k , j e ( j ) = e ( k )
Changing dummy symbols gives
e ( i ) = j x ¯ j x i e ¯ ( j ) 𝓍 ¯
which is the inverse transform between basis vectors.
We can now confirm that contravariant components satisfy the transformation typically used to define contravariant components. Combine (11.3) with (11.4) to get
i V i ¯ e ¯ ( i ) ( 𝓍 ¯ ) = i V i e ( i )
Now substitute (11.8) into the right side of (11.9) and interchange dummy symbols to get
i V i ¯ e ¯ ( i ) 𝓍 ¯ = i , j V j x ¯ i x j e ¯ ( i ) 𝓍 ¯
The barred basis vectors are linearly independent so the above gives
V i ¯ = j V j x ¯ i x j
which is the transformation typically taken to be the definition of contravariant vector components.
Covariant vector components can also be defined but to help with notation we start with a definition of the solid dot product denoted . This dot product between two arbitrary 4-vectors V and U is defined by
V U i , j g i , j V i U j
where the superscripts on the right denote contravariant components in the unbarred system, and g is the metric tensor in the unbarred system. The unbarred system is inertial and rectangular, so its metric tensor is the Lorentz metric, which allows us to rewrite (11.11) as
V U = V 0 U 0 i = 1 3 V i U i
However, the notation in (11.11a) is more convenient than the notation in (11.11b) for deriving an expression for the dot product in the barred system. Recall that the barred metric tensor is defined by the transformation (10.1), and we already established the transformation (11.10) for contravariant components of 4-vectors. We can show via (11.7) that the composite of these transformations applied to the right side of (11.11a) produces
V U = i , j g ¯ i , j V i ¯ U j ¯
In summary, three expressions derived (so far) for solid dot products are given by the three equations in (11.11). 10
Special interesting cases are dot products between basis vectors. Recall that the contravariant component e ( i ) , j is the jth coefficient in the expansion of e ( i ) in the unbarred basis vectors. Similarly, the contravariant component e ¯ ( i ) , j ¯ is the jth coefficient in the expansion of e ¯ ( i ) in the barred basis vectors.11 Therefore we have
e ( i ) , j = e ¯ ( i ) , j ¯ = δ i , j
Also, the definition of contravariant components as coefficients in expansions together with (11.6) and (11.8) give
e ¯ ( i ) , j 𝓍 ¯ = x j x ¯ i , e ( i ) , j ¯ 𝓍 = x ¯ j x i
where a coordinate dependence of e ( i ) , j ¯ is indicated and is due to a coordinate dependence of the basis vectors. Substituting (11.12a) into (11.11a) for the unbarred case, and into (11.11c) for the barred case, gives
e ( i ) e ( j ) = g i , j , e ¯ ( i ) e ¯ ( j ) = g ¯ i , j
Listing the individual products for the unbarred case gives
e ( 0 ) e ( 0 ) = 1 , e ( i ) e ( i ) = 1   for   i = 1 , 2 , 3 , e ( i ) e ( j ) = 0   for   i j
Note that the middle equation is one of those exceptions to the Einstein summation convention and is an example of the reason for not using that convention.
The covariant components of an arbitrary 4-vector V in the unbarred and barred systems, with the ith component denoted V i in the unbarred system and denoted V i ¯ in the barred system, are defined by
V i V e ( i ) V i ¯ V e ¯ ( i )
The transformation between barred and unbarred covariant components is easily derived by starting with (11.6) to get
V e ¯ ( i ) = V j x j x ¯ i e ( j ) = j x j x ¯ i V e ( j )
and using (11.15) gives
V i ¯ = j x j x ¯ i V j
which is the transformation traditionally used to define covariant vector components.12
Covariant and contravariant components of vectors can be related to each other by using (11.11) to get
V e ( i ) j , k g j , k V j e ( i ) , k , V e ( i ) = V 0 e ( i ) , 0 j = 1 3 V j e ( i ) , j , V e ¯ ( i ) = j , k g ¯ j , k V j ¯ e ¯ ( i ) , k ¯
Substituting (11.12a) and (11.15) into the above gives
V i = j g j , i V j = j g i , j V j
V 0 = V 0 , V i = V i for   i = 1 , 2 , 3
V i ¯ = j g ¯ j , i V j ¯ = j g ¯ i , j V j ¯
where we used the fact that the metric tensors are symmetric.
Several expressions for dot products were listed in (11.11). Two more expressions are obtained by combining (11.17a) with (11.11a) and combining (11.17c) with (11.11c). The results are
V U = i V i U i
V U = i V i ¯ U i ¯
Now consider an arbitrary worldline, the trajectory in spacetime of some moving point, as seen by the inertial unbarred system. Using equations in Section 2, an increment of proper time along this worldline can be obtained by taking the differential of (2.3) and then using (2.4) to get
d τ = 1 V 2 ( t ) c 2 d t
which can also be written as
d τ 2 = 1 V 2 ( t ) c 2 d t 2 = 1 c 2 d x 0 2 d x d x
If we now define the 4-vector
d 𝓍 = d x 0 e ( 0 ) + d x 1 e ( 1 ) + d x 2 e ( 2 ) + d x 3 e ( 3 )
and use (11.11b) to express d 𝓍 d 𝓍 and compare that expression to the right side of (11.19) we find that (11.19) can be written as
d τ 2 = 1 c 2 d 𝓍 d 𝓍
One implication of (11.20) is seen by using (11.11c) to write (11.20) as
d τ 2 = 1 c 2 i , j g ¯ i , j d 𝓍 i ¯ d 𝓍 j ¯
If we now take the barred system to be attached to the moving point, the differentials of spatial coordinates are zero so the above reduces to
d τ 2 = 1 c 2 g ¯ 0 , 0 d 𝓍 0 ¯ d 𝓍 0 ¯
It was concluded at the end of Section 11 that if we take the origin (the location of the clock) of the barred system to be attached to the moving point (the spatial coordinates of the point are zero in the barred system) then g ¯ 0 , 0 = 1 so the above becomes
d τ = 1 c d 𝓍 0 ¯ (comoving system) .
This is consistent with the conclusion already used in Section 2, but obtained from the hypothesis (1.1), that the time coordinate of the traveling system at any given point on the worldline of the systems clock is equal to the proper time assigned to that point on the worldline of the systems clock.
A second implication of (11.20) is that is a scaler invariant. This implies that if V is an arbitrary 4-vector defined on each point on the given worldline, the derivative dV /, which is a derivative on a curve, is another 4-vector. From this fact we can derive the product rule for derivatives of dot products. Using the expression (11.11a) for dot products, together with the ordinary product rule and the fact that the Lorentz metric is constant, we obtain
d V U d τ = i , j g i , j d V i d τ U j + i , j g i , j V i d U j d τ
We next use the fact that d V i / d τ and d U i / d τ are contravariant components of 4-vectors in the unbarred system. This fact together with (11.11a) gives
d V d τ U = i , j g i , j d V i d τ U j , V d U d τ = i , j g i , j V i d U j d τ
Combining (11.22) with (11.23) produces the product rule
d V U d τ = d V d τ U + V d U d τ
Note that expressing components in the unbarred system is the simplest way to derive (11.24). If components were expressed in the barred system, there would be two complications. First, the metric tensor in the barred system need not be constant so there would be a third term on the right side of the equation analogous to (11.22). Second, d V i ¯ / d τ and d U i ¯ / d τ are not in general contravariant components of 4-vectors in the barred system, so the equation analogous to (11.23) is incorrect. Hence the unbarred system provides the simplest derivation of (11.24).
The previous paragraph stated that d V i ¯ / d τ and d U i ¯ / d τ are not in general contravariant components of 4-vectors in the barred system. It is interesting to find out what the contravariant components of dV / are in the barred system. These components can be derived by using the product rule with (11.4) to get
d V d τ = i d V i ¯ d τ e ¯ ( i ) + i V i ¯ d e ¯ ( i ) d τ
Note that (11.6) together with the chain rule gives
d e ¯ ( i ) 𝓍 ¯ d τ = j d d τ x j x ¯ i e ( j ) = j , k 2 x j x ¯ i x ¯ k d x ¯ k d τ e ( j )
To express this in terms of the barred basis vectors we substitute (11.8) into the right side of the above to get
d e ¯ ( i ) 𝓍 ¯ d τ = j , k , l 2 x l x ¯ i x ¯ j x ¯ k x l d x ¯ j d τ e ¯ ( k ) 𝓍 ¯
To shorten the notation, define the Christoffel symbol by
k i , j l 2 x l x ¯ i x ¯ j x ¯ k x l
so (11.26) becomes
d e ¯ ( i ) 𝓍 ¯ d τ = j , k k i , j d x ¯ j d τ e ¯ ( k ) 𝓍 ¯
From the definition of contravariant components as coefficients in expansions, we conclude from (11.28) that
d e ¯ ( i ) 𝓍 ¯ d τ k ¯ = j k i , j d x ¯ j d τ
Also, substituting (11.28) into (11.25) gives
d V d τ = i δ V i ¯ δ τ e ¯ ( i )
where we define
δ V i ¯ δ τ d V i ¯ d τ + j , k i j , k d x ¯ j d τ V k ¯
From the definition of contravariant components as coefficients in expansions, we conclude from (11.30) that
d V d τ i ¯ = δ V i ¯ δ τ

12. An Example: Constant Acceleration Felt by Traveler

The example in this section is one in which the velocity of the traveler relative to the home system and expressed as a function of time in the home system is not the immediately given information. Instead, this must be deducted from other information that will be given. Invariance of solid dot products between 4-vectors facilitates this deduction so the given information will be a statement about the 4-vector version of acceleration. This 4-vector, denoted A, is defined in general in terms of the home system basis vectors by
A = i d 2 x i d τ 2 e ( i )
The example in this section is the special case in which the acceleration has a constant spatial direction, say in the x-direction. The remaining given information, which implies (after some deduction to follow) the traveler worldline as expressed in home coordinates, is
A A = a 2
where a is a positive constant. Expressing the dot product in any inertial system produces
A A = d 2 x d τ 2 2 c d 2 t d τ 2 2 = a 2
where we now write x instead of x1. An alternate way to express (12.3) is obtained by first using (9.11a) to derive an expression for d2t/2, which is obtained from
d 2 t d τ 2 = d d τ 1 + 1 c 2 d x d τ 2 = 1 c 2 1 1 + 1 c 2 d x d τ 2 d x d τ d 2 x d τ 2
and then substitute this into (12.3) and combine some terms to get
A A = 1 1 + 1 c 2 d x d τ 2 d 2 x d τ 2 2 = a 2
From (12.4) we can recognize the physical interpretation of the example considered here. This is seen by letting the acceleration refer to the traveler’s system and letting the inertial system be the P-system, the inertial system momentarily at rest in the traveler’s system. In the P-system, and at the time it is at rest in the traveler’s system, the velocity dx/dt is zero, and there is no distinction between τ derivatives and t derivatives, so (12.4) reduces to
d 2 x d t 2 = a in   P - system   when   at   rest   in   traveler s   system
If the traveler is carrying an accelerometer, it will measure an acceleration equal to a when at rest in the P-system, but the point P is an arbitrary point on the traveler’s worldline, so the physical interpretation of this example 4-vector A is that it produces the case in which the traveler feels (or measures via an accelerometer) a constant acceleration equal to a [11].
The next application of (12.4) is to determine the traveler’s worldline as seen in the home system. Taking the sign of a to be the same as the sign of d2x/2, (12.4) gives
d 2 x d τ 2 = a 1 + 1 c 2 d x d τ 2
Endpoint conditions are not important to the analysis as long as singularities are avoided as a → 0. A particular choice that avoids singularities takes the endpoint conditions to be x = 0 and dx/ = 0 at τ = 0. The solution to (12.5) subject to these endpoint conditions is easily verified to be
x = c 2 a cosh a c τ 1
Substituting this into (9.11a) gives
d t d τ = cosh a c τ
The solution satisfying t = 0 when τ = 0 is
t = c a sinh a c τ
Note that if we take the limit as the acceleration a → 0 we find, via L’Hôpital’s rule, that tτ while x → 0 for all τ, which is the expected result for the selected initial conditions without acceleration.
An identity relating hyperbolic functions allows (12.6a) to be written as
x = c 2 a 1 + sinh 2 a c τ 1
Combining this with (12.6b) gives
x = c 2 a 1 + a c t 2 1
A plot of x versus t produced by (12.7) produces a hyperbola, so this example is often called hyperbolic motion [11]. Another important quantity for this example is the coordinate velocity (not a 4-vector) defined to be dx/dt, which we find by differentiating (12.7) to be given by
d x d t = a t 1 + a c t 2
Also, substituting (12.8) into (9.11b), using the second equality in (9.11b) gives
γ = 1 + a c t 2
As a reminder that the calculated x and dx/dt refer to the worldline of the traveler’s clock, we go back to the notation in Section 3 and Section 6. Instead of (12.7) we write
X ( t P ) = c 2 a 1 + a c t P 2 1
Instead of (12.8) we write
V ( t P ) = a t P 1 + a c t P 2
Instead of (12.9) we write
γ ( t P ) = 1 + a c t P 2
and instead of (12.6b) we write
τ ( t P ) = c a sinh 1 a c t P
When difficulties are encountered regarding the existence of traveler coordinates to be calculated for a given spacetime point, i.e., calculated from a given set of home coordinates, they are encountered in the first step of the calculation which is to solve (6.6) for tP. The example in this section provides an illustration. Substituting (12.10) into (6.6), the equation to be solved for tP becomes
t E = 1 + a c 2 x E t P 1 + a c t P 2
First consider the case in which the parenthesis on the right side of (12.11) is positive. It is easy to show that this condition makes the right side of (12.11) a strictly increasing function of tP, implying that the solution of (12.11) for tP is unique if it exists. Existence requires that the left side, tE, be in the range of the function of tP on the right. The function on the right is zero at tP = 0, and it is easy to show that the function on the right asymptotically approaches the value c/a + xE/c as tP → ∞. We therefore conclude that:
If 1 + a c 2 x E > 0 then   the   event   E   has   coordinates   in   the   traveler s   system ,   for   the   example in   which   A A = a 2 ,   if   and   only   if     0 t E < c a + x E c .
The upper bound for tE in (12.12) can be explained in terms of time dilation. Imagine a clock stationary at the location xE in the home system. This clock as seen by the traveler runs slower and slower as the traveler speeds up in the home system, in such a way that the clock display, as seen by the traveler, asymptotically approaches a finite limiting value. This limiting value is the upper bound in (12.12). A more rigorous discussion of this time dilation can be given, for the example worldline considered in this section, as follows. Recall that (3.4) is the version of time dilation in which the home observer is observing the traveler’s clock. For the example in this section, we are now able to calculate the version of time dilation in which the traveler is observing a home clock located at xE. We start by combining (6.8a) with (12.10d) and invert the equation to get
t P = c a sinh a c t ¯ E
Substituting (12.13) into (12.11) and using some identities for hyperbolic functions gives
t E = c a + x E c tanh a c t ¯ E
If we now let the event E be a given display on a home clock located at xE, then tE and xE become the home coordinates of that event. If tE + dtE denotes the time coordinate of the next tick (the next event) of the home clock, with the clock still at the spatial coordinate xE, we relate dtE to the time increment d t ¯ E , that the traveler sees to be the time between ticks of the home clock, by differentiating (12.14) while holding xE fixed (as opposed to holding x ¯ E fixed as was done when deriving (3.4b)). The result is
d t E d t ¯ E = 1 + a c 2 x E 1 cosh 2 a c t ¯ E
Some identities for hyperbolic functions allow (12.14) to be rearranged into
1 cosh 2 a c t ¯ E = 1 t E , 2 c a + x E c 2
and combining the two above equations gives
d t ¯ E d t E = c a c a + x E c 1 1 t E , 2 c a + x E c 2 1 ( when   the   events   are   home   clock   displays )
This becomes a time dilation when tE is sufficiently close to the upper bound in (12.12) so that the right side of (12.15) is greater than 1, which is seen as follows. Recall that tE is a display on the home clock. When the right side of (12.15) exceeds 1, the time d t ¯ E seen by the traveler that is needed to change the display of the home clock is greater than dtE, which is the change of the display of the home clock and therefore also equal to the time seen by the home observer needed to change the display of the home clock. A larger time between displays corresponds to a slower clock, so the traveler sees the home clock to be running slow when the home clock display is sufficiently close to the allowed upper limit.
Note a lack of symmetry between (3.4b) and (12.15) for the example acceleration considered in this section. When making this comparison we must be careful to recognize that tE has a different meaning in the derivation of (3.4) than in the derivation of (12.15) and satisfies different equations for the two cases (e.g., tE = tP for the former case but not the latter) because the event E has a different meaning for the two cases. In the derivation of (3.4), the event E is a given display on the traveler’s clock. In the derivation of (12.15), the event E is a given display on a home clock located at xE. Hence, we must use caution when comparing (3.4b) to (12.15) by recognizing that terms represented by the same symbols do not have the same meanings. However, despite this subtle issue, there is still an obvious distinction between (3.4b) and (12.15). The expression in (3.4b) does not depend on the relative distance between clocks when all cases being compared have the same relative velocity between clocks. In contrast, the expression in (12.15), which is a time dilation pertaining to a home clock located at xE, does depend on the location xE of the home clock.
The upper bound for tE in (12.12), which limits the spacetime points that can be assigned coordinates in the traveler’s system, can be removed by allowing the acceleration to persist for only a finite time, with the traveler having a constant velocity after that time. This was explained in Section 8. It was also explained there that there exist home coordinates at which the transformation to traveler’s coordinates is not unique.

13. Metric Tensor for the Constant Acceleration Felt by Traveler

The next section has a need for the zero-zero component of the metric tensor in the traveler’s system and the goal of this section is to calculate the metric tensor for the acceleration example considered. This quantity depends on the space-time point of evaluation and is most conveniently expressed in the traveler’s coordinates of a given point. While Section 2 through Section 7 and Section 12 applied E subscripts to coordinates to emphasize that they are coordinates of arbitrary events, we shorten the notation here by omitting those subscripts, so it is understood here that ( t ¯ , x ¯ ) are the traveler’s coordinates of an arbitrary event. When expressed as a function of these coordinates, the i, j component of the metric tensor is denoted g ¯ i , j ( t ¯ , x ¯ ) . Section 10 showed explicit dependences that various quantities, used to construct the metric tensor, have on x ¯ , but the dependences on t ¯ was implicit, through a dependence on t P which in turn is a function of t ¯ . To explicitly show the dependences on t ¯ , note that when the travelers time coordinate t ¯ is the given quantity, as opposed to home coordinates, t ¯ is calculated from (12.13) (with the subscript E omitted in the notation used here), i.e., calculated from
t P = c a sinh a c t ¯
Quantities previously expressed in terms of t P can now be expressed explicitly in terms of t ¯ via the substitution given by (13.1).
Quantities that must be evaluated to calculate the metric tensor include γ(tP) and V(tP), which, for the acceleration example considered here, were shown in Section 12 to be given by
γ ( t P ) = 1 + a c t P 2
V ( t P ) = a t P 1 + a c t P 2
Another quantity that must be evaluated is the three-dimensional acceleration, which is the derivative of V with respect to coordinate time in the home system, and can be calculated from
A ( t P ) = d V ( t P ) d t P
Using (13.2b) to calculate the derivative in the above gives
A ( t P ) = a 1 + a c t P 2 3
We now express the quantities in (13.2) in terms of t ¯ by using (13.1) together with some identities for hyperbolic functions to get
γ ( t P ) = cosh a c t ¯
V ( t P ) = c tanh a c t ¯
A ( t P ) = a cosh 3 a c t ¯
For the acceleration example considered here, all motion is along the x-axis so the T-functions defined by (10.8) reduce to
T 1 t P , x ¯ = 1 c 2 γ 3 t P A ( t P ) x ¯
together with T 2 t P , x ¯ = 0 and T t P , x ¯ = 0 . Using (13.3) to express T 1 t P , x ¯ in terms of t ¯ gives
T 1 t P , x ¯ = a c 2 x ¯
We see from inspection of (10.14) that when T 2 t P , x ¯ = 0 and T t P , x ¯ = 0 , each component of the metric tensor in the traveler’s system equals the corresponding component of the Lorentz metric tensor except the zero-zero component, i.e.,
g ¯ i , j = g i , j if   ( i , j ) ( 0 , 0 )
where g i , j is the Lorentz metric. Also, when T 2 t P , x ¯ = 0 and T t P , x ¯ = 0 , the zero-zero component given by (10.14a) reduces to
g ¯ 0 , 0 = γ 2 t P 1 V 2 t P c 2 1 + T 1 t P , x ¯ 2 = 1 + T 1 t P , x ¯ 2
Substituting (13.3) into the above gives
g ¯ 0 , 0 = 1 + a c 2 x ¯ 2

14. Another Kind of Time Dilation and Doppler Effect

Two kinds of time dilation were previously discussed. Section 3 discussed the case in which a given display on the traveler’s clock was the event, and the calculated quantity was the home observer’s time coordinate of this event (this is equivalent to saying that the calculated quantity is the display on the home clock that is simultaneous with a given display on the traveler’s clock when the home observer defines simultaneity). Section 12 discussed the case in which a given display on the home clock was the event, and the calculated quantity was the traveler’s time coordinate of this event (this is equivalent to saying that the calculated quantity is the display on the traveler’s clock that is simultaneous with a given display on the home clock when the traveler defines simultaneity). We now consider a third case in which two clocks are not moving relative to each other, they are both stationary in the traveler’s system, but one clock is spatially displaced relative to the other. One clock, still called the traveler’s clock, is at the traveler’s origin so the traveler’s worldline is the worldline of that clock. The traveler is at the origin of the reference frame and uses this clock to define time coordinates of events. Displays on a second clock, called the displaced clock, are treated as events, and the quantity to be calculated is the traveler’s time coordinate of such an event. This is equivalent to calculating the display on the traveler’s clock that is simultaneous with a given display on the displaced clock when the traveler located at the traveler’s clock defines simultaneity.
A coordinate transformation between the traveler and a displaced traveler has not been derived but the above calculation can be performed by utilizing the metric tensor. Details are as follows. We start with the fact that an increment of time display, denoted d t ¯ ¯ , on the displaced clock is equal to the increment of proper time, denoted , between nearby points on the worldline of the displaced clock, i.e.,
d t ¯ ¯ = d τ
A second relevant fact is that the increment of proper time is a scaler invariant that can be calculated using the metric tensor of any convenient reference frame. Because the displaced clock is stationary in the traveler’s reference frame, it is convenient to use the traveler’s metric tensor for this calculation. Using the acceleration example of Section 12 and Section 13, let x ¯ C denote the location of the displaced clock, along the x-axis, in the traveler’s coordinate system. A given point on the worldline of the displaced clock has traveler’s coordinates ( t ¯ , x ¯ C ) and a nearby point on the same worldline has coordinates ( t ¯ + d t ¯ , x ¯ C ) . There is no displacement of the spatial coordinate so the relationship between d τ and coordinate displacements reduces to
d τ 2 = g ¯ 0 , 0 ( t ¯ , x ¯ C ) d t ¯ 2
Combining this with (14.1) while using (13.4b) gives
d t ¯ ¯ = 1 + a c 2 x ¯ C d t ¯
If the displaced clock is displaced in the direction of the traveler’s acceleration, so x ¯ C is positive, we have d t ¯ ¯ > d t ¯ , so the displaced clock appears to the traveler to be running fast. Displaced in the opposite direction, but not far enough to make the parenthesis in (14.3) negative, makes the displaced clock appear to the traveler to be running slow.
Now consider another situation in which the displaced clock is accompanied by a light source that is stationary in the traveler’s system and is at the same location as the displaced clock. There is no motion between the light source and traveler as seen by the accelerating traveler, but the light source and traveler are at different locations.13 The goal is to derive a Doppler effect as seen by the traveler for this case. This effect is caused by acceleration, which is equivalent to gravity, so we will call this a gravitational Doppler effect. It is interesting to compare this to the effect, which we will call the Lorentz Doppler effect, seen by an inertial system when a light source moves relative to the system. The Lorentz Doppler effect is easily found in the literature, so details need not be included here, and we will merely focus on some points of comparison. The Lorentz effect is a combination of two effects. One is from consecutive wave peaks produced by a moving source being emitted at different locations relative to the observer, which results in different times of travel to the observer. These different travel times contribute to the different arrival times of the wave peaks at the observer location (the other contribution to the different arrival times is the wave period of the source which is the difference in wave peak emission times). This effect, taken by itself, would produce the Doppler effect derived from a nonrelativistic treatment of sound waves in which the observer is at rest in the medium and the light source moves relative to the medium. The second effect that contributes to the Lorentz Doppler effect is a time dilation, which modifies the result derived for sound waves. In contrast, the gravitational Doppler effect is much easier to derive because, as seen below, only time dilation is relevant.
Using the acceleration example of Section 12 and Section 13, let x ¯ C denote the location of the light source, along the x-axis, in the traveler’s coordinate system. Consider two consecutive wave peaks emitted by the light source. They are both emitted at the same location in the traveler’s system so they both have the same time of travel, as seen by the traveler, between emission and reaching the traveler. Therefore, according to the traveler, the difference in arrival times between the consecutive peaks, which is the wave period denoted T ¯ seen by the traveler, is the time difference, seen by the traveler, between emissions of consecutive wave peaks. Recall that d t ¯ ¯ in (14.3) is an increment of time display of the displaced clock, while d t ¯ is the amount of time seen by the traveler to produce that change in clock display. Let the change in displaced clock display d t ¯ ¯ equal one clock tick, which is also the time between consecutive wave peak emissions produced by the accompanying light source, so d t ¯ ¯ = T where T is the wave period in the reference frame of the light source. Because travel time of the light wave is irrelevant, the wave period T ¯ seen by the traveler is the difference d t ¯ in traveler time coordinates between consecutive wave peak emissions, so d t ¯ = T ¯ . Substituting this together with d t ¯ ¯ = T into (14.3) gives
T = 1 + a c 2 x ¯ C T ¯
The frequency f in the reference frame of the light source and frequency f ¯ at the location of the traveler are related by the inverse relation
f ¯ = 1 + a c 2 x ¯ C f
or
f ¯ f = 1 + a c 2 x ¯ C

References

  1. C. W. Misner, K. S. Thorne, and J. A. Wheeler, Gravitation, Princeton University Press, pp. 164-165, 2017.
  2. J. D. Jackson, Classical Electrodynamics, Second Edition, John Wiley & Sons, Inc., p. 516, 1975.
  3. H. Goldstein, Classical Mechanics, Addison-Wesley Publishing Company, pp. 187-191, 1965.
  4. G. Arfken, Mathematical Methods for Physicists, Second Edition, Academic Press, p. 151, 1970.
  5. D. McMahon, Quantum Field Theory Demystified, McGraw-Hill, p. 8, 2008.
  6. J. A. Stratton, Electromagnetic Theory, McGraw-Hill, p. 77, 1941.
  7. R. Adler, M. Bazen, and M. Schiffer, Introduction to General Relativity, Second Edition, McGraw-Hill, p. 7, 1975.
  8. R. Penrose, The Emperor’s New Mind, Oxford University Press, p. 200, 1989.
  9. D. F. Lawden, An Introduction to Tensor Calculus and Relativity, Halsted Press, John Wiley & Sons, Inc., p. 22 (problem 10), 1975.
  10. C. W. Misner, K. S. Thorne, and J. A. Wheeler, Gravitation, Princeton University Press, p. 169, 2017.
  11. C. W. Misner, K. S. Thorne, and J. A. Wheeler, Gravitation, Princeton University Press, p. 166, 2017.
1
If the only information available to the observer with which to assign spacetime coordinates to an event is a visual image, it is assumed that the observer accounted for the travel time of the light signal when assigning these coordinates so that the assigned time coordinate does not include this travel time.
2
Gravity is not considered here so we say freely moving instead of free falling.
3
The notation used here uses bold block font for three-dimensional spatial vectors. Bold cursive font will be used later for 4-vectors discussed later.
4
The open symbol denotes the Euclidean dot product between three-dimensional (spatial) vectors. The unit vector n is a unit vector in the context of this dot product. This dot product is distinguished from a four-dimensional dot product, denoted by a solid dot, defined later.
5
While the P-system declares the events E and P to be simultaneous, it is not necessarily true that the home observer declares them to be simultaneous.
6
There is a third version of time dilation in Section 14.
7
It will be seen later that different basis vectors will be needed for the traveling system in order for vector components to obey the transformations of contravariant vector components.
8
In contrast, covariant components are defined in terms of dot products. There is no distinction between contravariant and covariant components when the basis vectors are an orthonormal set, but it will be seen later that the basis vectors used here are not an orthonormal set as defined by a solid dot-product denoted and defined later (the basis vectors are mutually orthogonal but not all of them have unit norm), so there is a distinction between contravariant and covariant components.
9
It might be noted that [10] stated that the derivative / x ¯ 0 is a derivative with respect to proper time, making e ¯ ( 0 ) in (11.6) the 4-velocity. In reality, x ¯ 0 is proper time only on the worldline of the traveler’s origin. Therefore, e ¯ ( 0 ) is the 4-velocity when evaluated at x ¯ E = 0 but is more complicated when evaluated at x ¯ E 0 . The general construction of the barred basis vectors combines (11.6) with (10.7).
10
Symmetry of the transformation (10.1) together with symmetry of the Lorentz metric implies symmetry of all metrics so the order of dot-product multiplication is seen to be reversible in any of the expressions used for it.
11
We sometimes shorten the notation by not displaying the coordinate dependence of the barred basis vectors, but when that is done, it must be remembered that they may be functions of coordinates.
12
Covariant components defined by other dot products (e.g., a Euclidean dot product in which the unbarred metric tensor is the identity matrix, and the barred metric tensor is defined by (10.1)), will also satisfy (11.16) providing that the definition of the dot product is used consistently as needed to satisfy (11.11a) and (11.11c). The case in which the metric in the unbarred system is the Lorentz metric, and produces (11.11b), is assumed throughout this analysis but is not required by (11.16).
13
Although incidental, because the home observer is not relevant in this discussion, it is interesting that, due to length contractions having a velocity dependence, a constant separation between traveler and light source as seen by the traveler (a given condition) does not imply a constant separation as seen by the home observer when velocities change.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

Disclaimer

Terms of Use

Privacy Policy

Privacy Settings

© 2026 MDPI (Basel, Switzerland) unless otherwise stated