Академический Документы
Профессиональный Документы
Культура Документы
J A N O S S Y
Theory of
relativity
based on
physical
reality
L. JÁNOSSY
INTRODUCTION 13
£( m )
of a mechanical system, and so on. Among these tensors the tensor g
plays a great role which is hormally supposed to give the "metric of the four-
dimensional space-time continuum". We introduce the tensor as propa-
gation tensor of electromagnetic waves — it appears at the beginning of
our deliberations as a tensor with the help of which the mode of propaga-
tion of light can be described. It turns out that the tensor g appears alsó
* D r . M. von Laue: Die Relativitátstheorie. Band II, p. 25. Verlag Friedr. Vieweg
und Sohn, Braunschweig, 1923.
in the equations of motion of a partiele and alsó in other physical laws.
It seems natural to connect g with the energy-momentum tensor of the
carrier of the various phenomena, i.e. with the energy-momentum tensor
of the ether. Laue in the newer editions of his book speaks about g as
giving a "Führungsfeld", thus he is alsó of the opinion that g deseribes
a kind of field.
10. The tensors % , $ , g,. . . have elements the numerical values
íel) ( m )
J
FIZIKUS
TAN5Z£KcK)PORT r
«* KÖNYVTARA ^ ,
2 Relativity
CHAPTER I
11. In any attempt of dealing with the problems of the theory of relativity
the question of the mode of propagation of light plays an important role.
Maxwell's electromagnetic theory of light gives a concise description of
the phenomena of light. We shall describe somé experiments which support
Maxwell's theory. In particular according to Maxwell's theory the velocity
c of propagation of light can be determined from measurement on the inter-
action of charges and magnets without using observations which are con-
cerned directly with the propagation of light.
Similarly, the force with which a magnetic pole m acts upon another
x
F&> = ^ . (2)
However, as single magnetic poles do not exist in nature, the above equa-
tion can be verified only indirectly, e.g. by using long raagnetized rods,
which behave as if they possessed opposite magnetic charges at their ends.
14. An electric charge does not exert a force upon a magnetic pole if
the charge and the pole are both at rest. However, if, say the pole is at rest
and the charge is moving with a velocity v, we find that the force acting
upon the magnetic pole is given by
^) = _ ^LL,
a ( 3 )
r
where r is the vector pointing from m to e and a is a constant. Comparing
(1), (2) ánd (3) we see that the dimension of a is that of an inverse velocity.
The íatter velocity can be determined numerically by measuring the forces
acting between moving charges and magnetic poles. The velocity
c' = l/a
15. The fact that a moving charge acts upon a magnet was first shown
by Rowland.
Exact experiments for the determination of the critical velocity appearing
in (4) were carried out by Kohlrausch and Weber. In the latter experiment
the magnetic action of the current arising when a condenser is discharged
is compared with the action of the same charge when distributed electro-
statically on the plates of the condenser.
16. From the empirically established fact that c = c' Maxwell con-
cluded that light is an electromagnetic phenomenon. As it will be seen
further below (in chapt. VIII) Maxwell's results apply to two phenomena.
1. A GENERAL REMARK
and P one near A and the other near B have to run synchronously with
B
seen through a telescope situated near B — with the clock P , one has to B
remember that light takes a certain time to travel from A to B and there-
fore the image P' must be expected to be delayed in phase relatíve to P .
A A
This means that for the synchronization of P and P in this way it is already
A B
tion thus described supposes that light is propagated with the same velocity
along AC and BC. This method, although feasible, has not in fact been
used in any real experiment.
2. THE SUGGESTION OF GALILEI
19. Galilei* supposed that light is propagated with a finite velocity and
for its determination he suggested the following experiment. Two covered
lanterns are situated at a certain distance from each other and near each
there stands an observer. The first lantern is suddenly uncovered by the
first observer. The second observer near the other lantern uncovers his own
lantern as soon as he notices the light from the first. Now the first observer
must be expected to see the lighting up of the distant lantern with a delay
equal to the time the light takes to travel to and fro between the two lanterns.
When the experiment was actually carried out no delay could be observed
in this way when the lanterns were placed a few miles apart, and Galilei
came correctly to conclude that the velocity of propagation of light is very
large. He suggested as a possible method for the determination of this
velocity with the help of the observations of the Satellites of Jupiter.
3. ASTRONOMICAL METHODS
a. OBSERVATIONS OF RÖMER
other and are thus synchronized. The synchronized clocks are then trans-
ported to their positions near A and B respectively. If the transport takes
place with sufücient care, we may hope that the clocks remain synchronized
during their travel to their final positions and the experiment described in
17 can now be carried out with these clocks.
In his investigations which led to the first numerical determination of the
velocity of light (in 1676) the astronomer Olaf Römer used the same prin-
ciple. The Satellites of Jupiter circling periodically around Jupiter provide
a clock in space which can be observed from the Earth. While the Earth
moves along its orbit the distance between Jupiter and Earth changes and
the Jupiter-clock appears to be slow when the Earth is moving away from
Jupiter and to be fast when it is approaching Jupiter.
The actual method is as follows: Suppose that one of the Satellites of
Jupiter actually completes its revolutions at times
í =
v t + vT,
0 v = 1, 2 , . . . , k (5)
where Tis the time taken for one revolution and t is the time when the first
0
* Discorsi e demostrazione mat., Elzevir, 1638. p. 43; see alsó R. J. Seeger: Galileo
Galilei, his life and his works. Pergamon Press, Oxford, 1966. pp. 1 8 5 - 1 8 7 .
lutions at times
t' = t + IJc,
v P (6)
where l is the distance Earth-Jupiter at the time t' *
v v
From (5) and (6) we find for the times observed for the first k revolutions
t' -t'
k 0 = kT+ (l - l )/c,
k 0
The difference between the times observed for the first and second sequence
of k revolutions is thus
4 - 2/* + t' = (l 0 2k - 24 + l )/c, 0
thus
_ hk — 2/fc + /p
tzk — 2íjt + t' 0
Römer did not observe complete revolutions of the Satellites, but observed
the times of their eclipses; for the sake of simplicity we have described here
the method somewhat simplified in terms of full revolutions.
Römer observing a few revolutions öf the Satellites found
c = 220 000 km/sec.** (Römer)
The above value is rather inaccurate, nevertheless the fact that the correct
order of magnitude was obtained at all must be regarded as a remarkable
achievement.
The exact instant of an eclipse cannot be determined with an uncertainty
less than somé fraction of one minute. The procedure — even using modern
methods of astronomical observation — can therefore not be improved
sufficiently to give a precise determination of c.
* It may be supposed that the velocity of the Earth relatíve to that of light is so
small that it makes no noticeable difference whether we consider / to be the distance v
Earth-Jupiter at í or at t' .
v v
4. LABORATORY METHODS
its arrival t back in A by one and the same clock P . We may thus write
3 A
21
'3 'l
When using (7) we suppose that t , the time of arrival of the signal in B, is
2
given by
near A we find the phase shift At = 21)'c between the phase of the clock
P and that of its mirror image P' . The velocity of light is thus obtained
A A
c = 21/At.
* H. Fizeau: Compt. Rend. Hebd., 29, 90, 1849; Ann. d. Phys., 79, 167, 1850.
and is passing between the cogs to a mirror placed at a distance /. The beam
refiected from there is projected by means of a suitable optical arrangement
back on the cog-wheel.
The optical arrangement, which is shown in a little more detail in Fig. 1, is
so adjusted that when the cog-wheel is at rest the returning beam falls on
the same part of the rim of the wheel through which it has passed originally
and in this way the light source can be seen through the gap between the
cogs. If, however, the cog-wheel is set in motion and has turnéd by, say,
half a cog during the to and fro passage of the light beam, this can no longer
pass through the gap and the image of the light source is extinguished.
When the wheel moves faster the returning beam may just fali on the gap
following the one through which the originál beam has passed and the
image of the source becomes again visible.
In the experiment the relation between the angular velocity of the cog-
wheel and the re-appearances of the image were registered.
In the actual experiment of Fizeau, the distance was taken to be /=8633 m
and a wheel with n = 720 cogs was chosen. The rate of revolutions at which
the image appeared were therefore the integrál multiples of
c
N = —— = 22.6 revolutions/sec.
2nl
24. In a further experiment Fizeau and Foucault* used an improved
method suggested by Arago.** In the latter arrangement a rotating mirror
is made use of. A beam of light falling on a rotating mirror placed near A
is reflected in such a way that it falls on a stationary mirror placed near B at
a distance / from A (see Fig. 2). The mirror in B reflects the beam back to
the mirror in A but during the time At = 2l/c which the light takes to travel
from A to B and back the mirror turns by an angle ö=a>At where co is
the angular velocity of the mirror. Measuring the angle fi the value of At
can be obtained as the value of co is known and thus c can be determined.
The results of the most accurate determination of c were quoted in 14.
V = cjn, - (8)
* L . Foucault: Compt. Rend. Hebd., 30, 551, 1850; ibid. 55, 501, 1862.
* * F . Arago: Compt. Rend. Hebd., 7, 954, 1838; 30, 489, 1850; 55, 792, 1862.
A more detailed analysis shows that (8) is valid for monochromatic light
only the propagation of planes of constant phase of a monochromatic beam
having velocities
V=cln(y), (9)
where «(v) is the refractive index for light of frequency v.
2) V gives not only the velocity of propagation of phase planes but alsó
that of wave packets, the width of which much exceeds the wave length.
Relation (9) can be checked experimentally by measuring directly the
velocity V of propagation of planes of constant phases of light of frequency
v or by measuring the velocity of wave packets in a transparent médium.
The refractive index n(v) may be determined from the observed angles of
refraction of a beam of light of frequency v. Thus V and n(v) may be deter-
mined by independent experiments and the relation (9) thus checked. The
actual experimentál findings described below support the validity of rela-
tion (9).
time of travel taken for a distance / in air. He found the times to be equal
and thus verified the relation (9).
27. The methods described so far were used to determine the velocity
of signals of light. Indeed, in the experiment of Fizeau the rotating cog-
wheel cuts the incoming beam into packets (which may be regarded as
signals) and it is the velocity of these packets that is determined. Similar-
ly in the experiment of Foucault the rotating mirror produces short flashes
and the speed of propagation of these flashes is measured.
We shall discuss presently interferometric methods of the measurement
of the velocity of light. These methods make use of beams of monochromatic
light and sérve for the observation of the velocities of planes of constant
phase.
1. THE MICHELSON INTERFEROMETER
again and we obtain one component of each returning beam which passes
into the telescope T. In the telescope we obtain a system of interference
fringes with the help of which it is possible to determine the difference of the
lengths of the return paths
2/j = SM -* M - SM
x and 2/ =
2 SM -* M -» SM
2
* A. A. Michelson: Sill. Journ., 15, 394, 1878; 18, 390, 1879; Nature 21, 94 and
120, 1880; Naut. Alm. p. 235, 1885; Astrophys. J., 60, 256, 1924; 65, 1, 1927.
of the components of the incident beam. More precisely the system of fringes
permits to determine the difference AT = T — T of the times of return
2 x
29. For the better understanding of what happens in the Michelson inter-
ferometer it is useful to give a somewhat fuller description. As shown in
Fig. 4, light coming from the source S passes through a lens L and an approxi-
mately parallel beam is obtained; this beam appears to come from a source
5 ' far to the left from S.
s,f s 2
Obstacle
-s' s
SM'
The parallel beam is now split by the semi-transparent mirror, the com-
ponents are reflected back and produce components passing into the tele-
scope. If we put an obstacle between SM and M and thus prevent the latter
2
component to fali into the telescope, then we see in the telescope an image
Sí of 5 which appears to come from a source S situated at somé distance
x
The images Sí and S are coherent and therefore if both paths SM -> M
2 x
and SM -> M are free we see in the telescope the interference pattern which
2
arises from the superposition of the two images — the pattern is such as if
we observed through the telescope the images of two coherent sources S[
and S .2
placed exactly in the pláne into which falls the mirror image of M produced x
by SM, then the virtual sources S and S coincide and no interferences are
± 2
"1
the wavelength X of the light of the source, we can determine from the dis-
tribution of the interference rings the numerical value of Al.
30. Indeed, suppose the distance between the virtual sources Sy and S 2
to be
a = nk, n = n + e, 0
and F being the effective focal length of the telescope. If the & are small k
íi«M
\ n +e 0
*= 0,1,2,...
The concentric rings appearing in the field of view show decreasing spac-
ing when going outwards (see Fig. 6).
From the pattern both n and £ can be determined.
0
parallel to itself until the interference fringes disappear and we obtain the
zero interference pattern, i.e. we see a bright image of the light source in
the telescope. We have then n = 0 and therefore
Ál = 0.
position, then the virtual images Sí and S will bé placed side by side and
2
say, with velocities C] and c , then from the interference pattern we do not
2
The fact that we are not measuring the difference of lengths of light paths
but rather the difference of times of fiight was made use of for to measure
the velocity of propagation of wave fronts in a refracting médium. Inserting
a refracting médium in one of the arms of an interferometer and adjusting
the interferometer so as to give AT = 0, we find from (10)
c
i '• c
2 — h '• h-
Thus, if e.g. Cj = c, c — c/n, we have
2
n = ljl .
2
The above experiment extends the result of that described in 26. Consider-
ing both types of experiments we find that wave fronts like wave packets
both propagate with velocities c/«(v). .•
K . - in- r í l - : ' 1
l »(v ) J'
2
2
where n(vj) and n(v ) are the values of the refractive index for frequencies and
2
v =v(l
2 + |J,
where v is the originál frequency of the source and \\ and v respectively are2
the frequencies shifted by the Doppler effect, i.e. the frequencies with which
the light acts upon the atoms of the moving liquid. We shall come back
to the theoretical interpretation of this effect in chapt IX.
3 Relativity
the fact that the velocity of propagation of light is independent of the state
of motion of the source.
Indeed, the frequency of light we observe is determined by the rhythm in
which consecutive wave fronts reach us. This rhythm may be affected by the
motion of the source but the wave fronts themselves will always proceed
with the velocity c independent of the motion of the source.
34. The emission of waves by a moving atom can be treated with the
help of Maxwell's theory. The exact treatment shows that an atom emits
spherical waves, the surfaces of equal phase expanding isotropically with
a velocity c. That this treatment leads to correct results can be seen most
clearly if we express the field of the emitting atom in terms of retarded
potentials (see chapt. VIII).
Consider thus an atom which is at rest in the point r = 0 and which emits
radiation of constant frequency v. Fronts of constant phase are then emitted
at intervals
T =- , (12)
v
1. MOVING SOURCE
35. If the emitting atom moves with a constant velocity v the position
of the atom at the time t is given by
r(í) = yt
If the fronts are observed from a point A towards which the atom moves
(see Fig. 8), they will appear to be crowded together and to arrive at intervals
shorter than T while in a point B from which the atom moves away the wave
fronts will arrive at intervals longer than T. Of course the velocity of the
individual fronts is equal to c both when passing through A and alsó when
passing through B.
The points of the fc-th front at the time t > t obey the relation
k
(r - yt f
k = c\t - t f
k for t > t.
k
The k-th front arrives at the time t' in the point A with coordinate vector
k
x = R so that
A
(K-yt f =k c\t' -t f, k k
thus
= + | R - yt l/c. k
_ , „r Wtcos^i
|R-tM=JI 1 V " ) '
where cos = Rv/Rv is the cosine of the angle between R and v. Thus we
have in this approximation
t' i-t
k+ k = T' and v' = ^ L , (14)
• —i • (15)
1 cos &
c
We see thus that the observed frequency v' varies with the angle & between
v and the direction of observation.
We note that formula (15) is asymptotically correct for large distances R.
However, in practice we observe the radiation of atoms always at such great
distances that the deviation from expression (15) which arises from having
neglected terms of the order of vt /R can always be neglected. The deviation
k
36. The Doppler effect was first observed by Galitzin and Belepolsky
(1895);* they compared the frequency of the light emitted by a mono-
chromatic light source with the frequency which the mirror image of the
The Doppler effect was observed later by many others. Stark observed
the Doppler effect of the radiation emitted by atoms moving in a beam.
He alsó confirmed relation (15).*
The Doppler effect can now be considered to be a well-established effect
and it may be made use of for the determination of the velocity of moving
sources. The velocity of fast moving atoms can conveniently be determined
by measuring the frequencies of the light emitted by them.
37. In astronomy the velocity of motion of the components of double
stars can be obtained with the help of the Doppler effect. From the observa-
tion of double stars evidence was obtained for the fact that the velocity
of light is independent of the motion of the source. If we were to suppose
that the velocity of light depended on the motion of the source, then we had
to expect that a component of a double star emitted light which is propagated
towards us with a velocity changing with the state of motion of the star.
Because of the large distance the light has to travel we would observe the
motion of the star in a distorted fashion.
By analysing the observational data of the double star /?-Auriga, it was
found that these are incompatible with the assumption that the velocity of
light depends on that of the source.
1. EXPERIMENTÁL OBSERVATIONS
38. More recent experiments by íves and Stillwell** in 1938 and later
by Otting*** in 1939 showed that the frequencies of radiations emitted by
very fast atoms are given by
v-wW -^', 1
(16)
1 cosi?
c
the observed frequencies v" being a little smaller than the frequencies pre-
dicted from simple geometrical considerations and given by (15). It fol-
* J. Stark: Phys. Zs., 6, 893, 1905; Ann. d. Phys., 21, 401, 1906.
** H. E. íves and G. R. Stillwell: J. Opt. S o c , 28, 215, 1938.
*** G. Otting: Phys. Zs„ 40, 681, 1939.
ows in particular from (16) that for •b = nj2 we find
= v / l -v lc* = v*.
N
2
(17)
v* = Vy/l-v lc .
2 2
2. PHYSICAL INTERPRETATION
OF THE PERPENDICULAR DOPPLER EFFECT
1 ——cos#
c
v"(#) (l — - c o s # = independent of d
l c
I (19)
= frequency of moving atom = v(v).
Observing v"(é) the numerical values of the left hand expression of relation
(19) can be determined for various values of é by experiment. If the latter
expression is found to be independent of % indeed, then we have proved
by experiment our theory of the Doppler effect to be correct.
We have to identify the empirically found value v(v) with the frequency
v* emitted by the moving atom. Carrying out the experiment for various
values of v we may establish empirically the relation
/ ^
v(f) = v ( 0 ) / l - - = v*.
x ?
41. That the above interpretation of the Doppler effect is correct indeed
— and that the factor y/l — v /c is not caused by somé "geometrical"
2 2
X
Fig. 10. x — t diagram of comparison of rates of moving clocks
left with a velocity v and the observer to the right with a velocity w (see Fig.
10). The coordinates of the atom and observer at a time t can thus be written
XA( )
1
= a-vt, x (t) = b + wt.
B (20)
t = t + kT,
k Q T = —, k = 0,1,2,... (21)
v
from points with coordinates x (t ); the latter arrive to the observer at times
A k
t' so that
k
v' = ^ v , (23)
C+V
where Y is the frequency with which the phase planes arrive to the observer
in B.
while in the case when the source moves and the observer is at rest, i.e. if
we put v = V, w = 0 we have
"A v 0 V1 - v /c\ 2
v = v
B 0 71 - v?\c\ (24)
where v„ is the common frequency of A and B at rest. The atom A emits
radiation with a frequency v , the phase planes thus emitted reach the
A
moving observer B, they arrive with a frequency v' and according to (23) A
we expect
c—w
v' = . (25)
c+v AA A v ;
e = ^ = í ^ . . ^ ] ' . 1 2
(26)
v [c + v c + w)
B
v of B itself.
B
^ c+v c+w v
'
vw
1
+—
c l
The velocity V is, apart from small terms, equal to the sum of the velocities
v and w. From the point of view of the Doppler effect the latter quantity
characterizes the relatíve motion of observer and source. Provided relations
(24) hold the Doppler effect is characterized by the quantity V which may
be called the "relativistic relatíve velocity". We shall come back to this
question in greater detail.
46. We discuss now two effects which are not connected directly with
the propagation of light, the results of which will be, however, of importance
further below.
47. As has been seen in 38, we are led to conclude that the frequency
of oscillations of an atom is slowed down if the atom is made to move with
a velocity v. A similar effect was observed with unstable elementary particles.
Direct observation of /í-mesons in cosmic radiation showed that the mean
life of a |t-meson when it is brought to rest in an absorber is equal to
* See for details e.g. L. Jánossy: Cosmic Rays. Clarendon Press, Oxford, 1950.
2nd ed.
** From Review of Partiele Properties UCRL-8030.
The cosmic ray /i-mesons are förmed in the higher layers of the atmosphere
by primary cosmic rays. Supposing that the mesons move with velocities
approximately equal to that of light, the mean rangé of them would be of
the order
I « CT « 650 m.
0 0
T = Wl - t> /c
2 2
(30)
Experiments of Rossi* and others have shown this relation to be correct.
It is interesting that the cosmic ray experiments give evidence concerning
(30) for velocities v x c, where the change of the decay time is considerable
and the ratio T/T is of the order 10-30.
0
48. Another effect which will be found relevant to our analysis is the
change of the mass of a partiele with velocity.
From theoretical arguments Ábrahám** came to conclude that the mass
of an electron should increase, when the electron is accelerated; the increase
of mass was derived considering the action of an accelerated charge upon
itself.
The theory of Ábrahám was modified by Lorentz*** who obtained an
expression somewhat different from that of Ábrahám.
The experiments of Kaufmann**** (1901) proved that the mass of the
electrons changes indeed with velocity, his measurements were, however,
* B. Rossi and D. B. Hall: Phys. Rev., 59, 223, 1941; B. Rossi and others: Phys.
Rev., 61, 675, 1942.
** M. Ábrahám: Ann. d. Phys., 10, 105, 1903; see alsó A. Sommerfeld: Atombau
u. Spektrallinien, 1950. p. 313.
*** H. A. Lorentz: The Theory of Electrons. Leipzig, 1909.
**** W. Kaufmann: Ann. d. Phys., 19, 487, 1906.
not sufficiently accurate to decidé between the theoretical formuláé of
Lorentz and those of Ábrahám.
49. From theoretical considerations it was supposed that the formuláé
of Lorentz and not those of Ábrahám are correct and a large number of
experiments were carried out later with the object of trying to proveLorentz's
formuláé to be correct.
The latter experiments claimed to support the Lorentz formuláé, how-
ever, as a detailed analysis showed on the whole they did not have sufficient
accuracy to decidé the question experimentally (see the analysis given by
Faragó and Jánossy*).
The first experiment which proved convincingly that Lorentz's formula
and not that of Ábrahám described the change of mass of the electron
correctly is that carried out by Rogers, Reynolds and Rogers.**
The experiment was later improved by Staub*** who attained still
greater accuracy.
50. Experiments to determine the change of thé mass of protons with
velocity were carried out by Zreliov, Tapkin and Faragó.**** In the latter
investigation the velocity of the protons which was 83 per cent of that of light
was determined directly with the help of Cerenkov radiation. The Lorentz
formula at that velocity was found to be correct inside the margin of experi-
mentál error which was estimated not to exceed 0.1 per cent.
Taken all the evidence together it supports the validity of the Lorentz
formula for electrons and for protons. It seems reasonable to suppose that
the formula has generál validity.
51. The experiments for determining the change of mass with velocity
are based on the observation of the orbits of charged particles in electric
and magnetic fields. The analysis of an orbit may give information about
velocity and acceleration of the partiele — so as to obtain information about
the mass of the partiele dynamical considerations have to be introduced.
For the analysis of the experiments it is supposed that the force F acting
upon a partiele of charge e and velocity v is given by the Lorentz formula.
F = e |E + — (v x B) ,
l c
I
f = F
- < 3 i
>
Supposing that
p = m(v)y,
with
m(v) = ° m
yi-^/c ' 2
F=
0 0
J
dt j l - v \c ) 2 2
v / l - v jc 2 2
(1 - W 2
where
_ fn 0 _ w 0
m
l (l-v2JcY'
~ ~T\ 2
'
..2f.2\3/2 ' m, ' — (1 - «; /c )
2 2 1/2
and
V l = (vv)v/i; , 2
v = v - Vi
2
are the components of the acceleration which are parallel respectively per-
pendicular to the velocity. We see thus that if we define the mass as the
ratio of force and acceleration then the mass thus defined depends on the
angle between velocity and acceleration. The extrémé cases are ni] the mass
which appears when the partiele is accelerated in the direction of velocity
and iw, the mass which appears when the partiele is accelerated in a direc-
tion perpendicular to v.
52. The real experiments can be divided into two types. In the experi-
ment of Rogers and co-workers and alsó of Staub and co-workers particles
are made to move along a circular orbit. So as to maintain a partiele of
velocity c o n a circular orbit with radius R a radial force
F = mv lR 2
54. From Maxwell's theory it follows that light in particular, and all
electromagnetic action in generál, is propagated with a velocity c = c' where
c' is the critical velocity. We shall analyse Maxwell's equations in more
detail in chapL VIII and show there that the above conclusion is indeed an
integrál part of Maxwell's theory.
The question cannot be avoided relatíve to what are electromagnetic
waves propagated with the velocity cl
A simple answer to this question could be obtained claiming that light is
propagated with the velocity c relative to its source. The latter assumption
contradicts, however, the well-estabüshed theory of Maxwell and seems alsó
to be contradicted directly by experiment. The latter assumption, some-
times referred to alsó as the "ballistic theory of light", must therefore be
rejected.
An electromagnetic perturbation once it has left its source is propagated
thus with a velocity c independent of how the perturbation has come about.
The only reasonable interpretation of this is to assume that the perturbation
moves with a velocity c relative to its carrier. The carrier may be denoted,
using Maxwell's terminology, the ether. We shall in accord with the ideas
of Maxwell alsó assume that light is propagated with a velocity c relative
to the ether.
55. So as to avoid misconceptions we wish to emphasize that we regard
the ether merely as the carrier of electromagnetic waves and possibly of the
waves associated with other fields and of elementary particles.
In the last century a number of mechanical models were proposed so as
to explain the properties of the ether by rather artificial mechanisms. Such
models are meaningless as there is no reason why the ether should have
properties like e.g. solids consisting of atoms and molecules. On the con-
trary one might suppose that the properties of atoms, molecules and solids
are ultimately determined by the properties of the ether. We think of the
ether more or less in a manner which was discussed by Einstein in a not
very known article.*
* A. Einstein: Über den Áther. Verh. d. Schweizer. Nat. Ges., 105, Teil II. 8 5 - 9 3 ,
1924.
56. Einstein's polemic against the ether concerned mainly the assump-
tion that the ether is at "absolute rest". Thus Einstein denied the existence
of a system of reference K which is at "absolute rest".
0
to the ether, i.e. at rest relatíve to the carrier of the electromagnetic waves.
As clearly explained above we do not suppose the system K to be that 0
of "absolute rest". Whether or not the ether, i.e. the carrier of electro-
magnetic waves, is at rest or even at "absolute rest" is a question which
does not arise here and certainly has no significance in relation to our
problems.
"Der Áther der allgemeinen Relativitatstheorie unterscheidet sich alsó von dem-
jenigen der klassischen Mechanik bezw. der speziellen Relativitatstheorie dadurch,
dass er nicht 'absolut', sondern in seinen örtlich variablen Eigenschaften durch die
ponderable Materié bestimmt i s t . . . Dass es in der allgemeinen Relativitatstheorie
keine bevorzugten, mit der Metrik eindeutig verknüpften raumzeitlichen Koordina-
ten gibt, ist mehr für die mathematische Form dieser Theorie als für ihren physikali-
schen Gehalt charakteristisch."
"Aber selbst wenn diese Möglichkeiten zu wirklichen Theorien heranreifen, wer-
den wir des Áthers, d. h. des mit physikalischen Eigenschaften ausgestatteten Kon-
tinuums, in der theoretischen Physik nicht entbehren können; denn die allgemeine
Relativitatstheorie, an derén grundsátzlichen Gesichtspunkten die Physiker wohl stets
festhalten werden, schliesst eine unvermittelte Fernwirkung aus: jede Nahewirkungs-
Theorie aber setzt kontinuierliche Felder voraus, alsó auch die Existenz eines 'Áthers'."
The translation of this remarkable statement is as follows:
"The ether of the generál theory of relativity diífers from that of classical mechanics
or from that of the special theory of relativity in so far as it is not 'absolute' but its spatial
distribution is determined by that of matter. The fact that, in the framework of the
generál theory of relativity, there are no distinguished space-time representations
connected in an unambiguous manner with the metric — is rather a characteristic of the
mathematical methods of the theory than a characteristic of its physical contents."
"However, even if these possibilities developed into a real theory we shall not be
able to dispense, in the field of theoretical physics, with the ether, i.e. a médium which
possesses physical properties; indeed, the generál theory of relativity — to the principles
of which physicists will probably always adhere — excludes any direct distant
action. Every theory based on close action supposes the existence of continuous fields
thus they suppose alsó the existence of an 'ether'." (My own translation.)
4 Relativity
Furthermore, for our considerations it is alsó immaterial whether or not
various parts of the ether move relative to each other. It seems quite plausible
that considered on. a cosmic scale distant parts of the ether are streaming
with various velocities and thus the system K , we consider, has only local
0
that in a certain vicinity of the origin of K the ether has negligible velocity
0
relative to K .
0
B. EXPERIMENTÁL INVESTIGATIONS
59. The fact that the Earth is rotating around its axis can be seen from
the apparent motion of the stars in the sky. The rotation can alsó be
observed by mechanical experiments carried out on the surface of the Earth,
i.e. with the help of Foucault's pendulum, or by observing the motion of a
fast rotating gyroscope. It is interesting that the rotation of the Earth can
alsó be observed by optical experiments. The experiment in question is the
extended form of the experiment of Sagnac (1913), i.e. the experiment of
Michelson and Gale (1925). We give a description of the experiments.
We note that if we speak of the rotation of the Earth, we imply thus
rotation relative to the carrier of electromagnetic waves, i.e. rotation relative
to the ether.
2. THE S A G N A C EXPERIMENT*
signal has to catch up the point A which is moving away and it will be
reached by the signal at a later time T so that
+
cT +
= 2nR + RcoT , +
therefore
2%R
T
= =— > - T
c — Rco
T could alsó have been obtained by supposing that the light signal
+
The latter result could be obtained directly if we were to assume that the
signal moves with the velocity
c~ = c + Rco, (2)
relatíve to the edge of the disc. Here again we note that (2) üke (1) is not an
assumption but follows from the consideration of the time of flight of a
signal.
The difference in the times needed to circle around the disc in opposite
directions is thus
meet on SM, and split again. The components of the two returning beams
which are moving back towards the source are led with the help of a second
semitransparent mirror SM' into the telescope T, where an interference
pattern is produced. Just üke in the case of the Michelson interferometer
M 2
(see 31) the interference pattern can be used to determine the difference
of the times of füght of the beams circling round SM, M M , M in opposite
1; 2 3
directions.
In the originál experiment carried out by Sagnac and that repeated later
more carefully by Pogány* (1926) the phase difference between the two
beams was observed when the arrangement was at rest. The arrangement
including light source and telescope was made to rotate with an angular
velocity co and a shift of fringes of magnitude
AX = 4Sco/c
was found. S is the area which the beams are made to circle round. This
shift is exactly what is expected from the calculations.
* B . Pogány: Ann. d. Phys., 80, 217, 1926; 85, 244, 1928; Naturwiss., 15, 177,
1927.
3. THE EXPERIMENT OF MICHELSON A N D GALE*
JA = 10 000 Á
It must be noted that the experiment is carried out on the rotating Earth;
it is impossible to "stop" the Earth while adjusting the arrangement. The
required adjustment is, however, rendered possible by the fact that the
effect of rotation increases with increasing area S.
In particular for a return beam the encircled area is zero and the effect
of rotation is not felt in this case. One might in principle adjust the lengths
of the sides of the square path with the help of return signals and measure
the fringe shift obtained with beams circling round the square path, the sides
of which have thus been adjusted. In the actual experiment Michelson and
Gale compared interferences obtained with beams circling round a smaller
rectangular and a larger quadratic area. From the difference of the observed
fringe shifts they could determine the angular velocity of the system as
a whole.
The Michelson-Gale experiment can be repeated by making use of lasers.
The fringe shift thus obtained can be observed more easily because the line
width of the laser beam is extremely small.
63. Unlike the rotation of the Earth its translational motion relatíve to
the ether cannot be observed by mechanical experiments. The fact that the
translational motion of a system cannot be observed was already recognized
by Galilei.**
We quote an interesting passage:
"For a final indication of the nullity of the experiments brought forth,
this seems to me the place to show you a way to test them all very easily.
Shut yourself up with somé friend in the main eabin below decks on somé
a time
T = l/c.
and B with B , then the time T which is needed for the signal emitted from
0
+
x {t) = ct.
s (4)
When writing down (4) we remember that the propagation of light is inde-
pendent of the motion of the source, therefore (4) gives the motion of the
signal no matter whether it was emitted from A or from A . (We may e.g. 0
*s(T ) = x (T ).
+
B
+
(5)
Thus introducing into (5) the expressions (3) and (4) we find
the points A and B is affected by the motion of these points relatíve to the
ether.
65. Relations (6) and (7) can alsó be interpreted as follows.
Consider a number of points C, D, . . . which all like A and B move with
the same constant velocity v relatíve to K. The points A, B, C, D,.. . define
a system of reference K' relatíve to which all the above points are at rest.
We can define as the velocity of light relatíve to K' in the direction from
A -* B respectively B -» A as
c+ = //r+, c- = //r-. (8)
Thus we find from (6), (7) and (8)
c +
= c— V, C~ = C + V.
66. Formulating the problem a little more generally we may state that
a light flash emitted at a time t = 0 will expand and it will be found at
a time t distributed on a sphere with points
r (0 = ch\
5
2
(9)
Considering a point B moving with a velocity v the coordinate vector
of which is given as
x (i)
B = 1 + vf, (10)
we find that the flash emitted at t = 0 from a point A the coordinate vector
of which at t = 0 is given by
r (0) = 0,
A
/, V c 2 •2 Q
V
sir # + — COS 1
1
c 2
<r c
?
r- = - r - - ? • O)3
T +
= . (14)
y/c 2
— v sin # — r cos #
2 2
2. PROPAGATION OF LIGHT
RELATÍVE TO A MOVING SYSTEM OF REFERENCE
c(#) = Jc - 2
i; sin # - v cos
2 2
(15)
According to this definition light is propagated unisotropically relatíve
to K.
It must be emphasized that (15) is obtained merely as the result of a defini-
tion. The time of travel T from A to B follows, however, from a purely
+
21
T(&) = T +
+ T~ = — • — = . (16)
c 1 v
mation
r(#) = ^ ( l + ^jl-I in^J).
s (17)
From the experiment T(fi) is determined; the velocity of light can be taken
as the solution of (16) into c. Let us have
c = 21IW)
0
the Fizeau experiment without attempting to correct the value for the
motion of the arrangement. We find from (16)
1 v*_ 1 / v V'
2 2
1 + 4-^-cos ^} 2
2 +
4 +
2 V " ^ „ 2
were to take v of the order of 30 km/sec, i.e. supposing that the Sun is about
at rest relative to the ether, then the relative difference between c and c 0
would be of the order of 10~ ; this difference is thus smaller than the experi-
8
mentál error of the most precise measurements. Thus from the practical
point of view the motion of the Earth can be neglected when evaluating
the results of the Fizeau experiment.
70. Although the correction (18) is of no importance for the practical
determination of c it remains an interesting question of principle whether
or not the translational motion of the Earth relative to the ether can be
observed through an experiment. This question was already raised by
Maxwell.* In his paper Maxwell considers a type of Michelson-Morley
experiment but remarks that the expected effect in such an experiment is
3. THE M I C H E L S O N - M O R L E Y EXPERIMENT
71. It follows from (16) or (17) that in an arrangement of the type used
by Fizeau the time T(é) of to and fro journey of the light signal should vary
with the angle •& subtended by the direction of the light path with the velocity
vector v with which the arrangement moves relatíve to the ether. The differ-
ence between c and c is too small to be observable directly, but it can be
0
Á T =
r ^ w - ^ W - ( 1 9 )
find alsó
j r _ % z U ( 1 + í + A . 4 . ( 2 0 )
c [ ej c c l
- r - 1 . 4
C C E
whether or not the arms are of equal lengths unless we make a supposition
of the value of v.
72. Information as to the value of v could be expected to become avail-
able if we turn round the interferometer. Indeed, adjusting the arms until
we find AT = 0 we expect in accord with (19) that
AT(9Q°) = - .
c c
sl
ment thus modiíied gave the same negatíve result as the experiment in the
originál form; no fringe shift was observed while the interferometer was
turnéd round.
74. The fact that turning round the interferometer produces no shift of
fringes could be explained in a rather trivial way.
Let us suppose that at the time of the experiment the Earth happened to be
at rest relatíve to the ether. Thus one might suppose that the solar system
* A. A. Michelson and E. W. Morley: Amer. J. Sci., 31, 377, 1886.
** R. J. Kennedy and E. M. Thorndike: Phys. Rev., 42, 400, 1932.
happened to move relatíve to the ether with a velocity which coincided with
that of the orbital velocity of the Earth — and that by chance the experi-
ment was carried out during a period when the Earth was moving relatíve
to the Sun in the same direction as the ether.
So as to guard against such a trivial interpretation of the negatíve out-
come of the interferometer experiment, the latter was carried out severaí
times in the course of a year. As the direction of the orbital velocity of the
Earth changes in the course of a year the velocity of the Earth relatíve to
the ether — if it happened to be zero at one period — should increase to 2v
after six months where v = 30 km/sec is the orbital velocity. Thus the
Michelson-Morley experiment proved that
and thus we must conclude that the turning round of the interferometer
produces no fringe shift whether or not the Earth moves relatíve to the carrier
of light.
Another supposition which might explain the negatíve outcome of the
Michelson-Morley experiment would be to suppose that the ether is stick-
ing to the Earth so that it carried the surrounding ether with itself. The latter
assumption is disproved to somé extent by the Michelson-Gale experiment
which shows that at any rate the rotation of the Earth is not shared by the
ether. The assumption of the Earth dragging the ether along with itself seems
to be very unlikely and such a hypothesis need not be considered seri-
ously.
75. The only acceptable explanation — in our view — of the negatíve
outcome of the Michelson-Morley experiment is to assume that the inter-
ferometer, however carefully it is turnéd round, nevertheless it deforms;
the deformation being such that it compensates exactly the phase shift
which would appear without the deformation.
76. A hypothesis to this effect was put forward by Lorentz and Fitz-
Gerald. In this hypothesis they suggest that a solid when accelerated so as
to move with a velocity v relatíve to the ether its dimensions parallel to v
contract by a factor *Jl — v /c while the dimensions perpendicular to v
2 2
remain unaffected.
The effect of contraction could be pictured in a rather primitive way sup-
posing that a solid moving in the ether is subject to "pressure" caused by
the ether wind penetrating it and that this pressure causes the contraction.
From the Lorentz-FitzGerald hypothesis it follows that a solid moving
with a velocity v is compressed in a direction parallel to v. If we turn round
the solid by 90° so that the dimensions which were originally parallel to v
are turnéd round to stand perpendicular to v then the dimensions which
were originally compressed are now being released and thus expand by
a factor l/^/l — v /c . Similarly the dimensions which are turnéd into the
2 2
move with a velocity v relative to the ether but it is turnéd into a direction
perpendicular to v then its length remains a . If the rod is now turnéd so as
0
where
«i = Ö(0) sin •&, a = a(é) cos ö,
2 (22)
thus a and a are the projections of the rod perpendicular respectively
x 2
where a' would be the length of the projection if the Lorentz contraction
2
and thus with the help of (22), (23) and (24) we find
cos # 2
a (#)/sin # +
2 2
* H. Dingle: The Special Theory of Relativity. London, Methuen and Co. Ltd.,
New York, J. Wiley and Sons, Inc., 1940.
We see therefore that the time T(ff) of the to and fro flight along a rod
remains constant while the rod is turnéd round if the rod suffers Lorentz
contractions, so that it is adapting itself continuously to its orientation
relatíve to v.
The system of interference fringes observed with the Michelson inter-
ferometer depends on the difference
where é and •& + 90° are the angles subtended between the arms of the
interferometer and v; 7i(#) and r (# + 90°) are thus the times of return
2
flights along the two arms. Since neither T^ö) nor T (fi) vary with
z we
expect the difference alsó to remain constant, i.e.
AT(fi) = independent of fi.
82. Consider two opposite point charges + e and — e; the radius vector
pointing from — e to + e be denoted by r. If the charges are at rest the force
acting upon +e can be written
F = eE =
0 - 7 3 - .
As the force acts in the direction of r the moment of force produced by the
pair of charges vanishes, i.e.
M = r x F = 0,
0 0
B = — (vxr)/r . 3
c
Thus the totál force acting upon e is given by
F(v) = e |E + — (v x B) F
O + - J T V x (v x r)
c
since
v x (v x r) = v(vr) — VT
2
we find that the moment of force produced by the pair of charges is equal to
e 2
c
* Fr. T. Trouton and H. R. Noble: Proc. Roy. S o c , 72, 132, 1903.
Denoting the angle between v and r by we find for the absolute value of
the moment of force
eV
M = sin 20. (26)
dt '
and we expect that the radiation of constant frequency v emitted from A
will act upon B with the same frequency.
If A moves with a constant velocity v and B moves round A as before,
then the coordinate vectors A and B at t can be written
r (t) = yt,
A r (í) = vt + R(í),
B (27)
where R(/) is the vector pointing from A to B at the time t.
A signal starting from A at t will reach the point B at the time f so that
(r (t) -
A r*(0)2 - c (f - tf = 0.
2
(28)
Differentiating (28) into t we find with the help of (27)
«.-o.-w)(.(.-Í)-i«Í)-«v-o(.-Í).
(c* - t, ) - R(/)v -
2
( " "íj
! + vk
C> = °-
We note that Rif)l{t' — f) ~ c, thus neglecting small terms of the order
(D/C) we find from the above relation
2
dt' , vR(/')
— « 1 +—
dt c 2
(29)
. vR(í') V )
c 2
v+w
r v-w \Y
Fig. 13. The orbit of the source used in the experiment of Isaak and co-workers
v*(0 = v ^ 1 - " ^ .
o (31)
then we find inserting (30) into (31) and neglecting small terms
vR(0)
r
v*(í') = v'(0,
.e. the frequency v*(í') of B at any time t' will be equal to the frequency
v'(í') falling upon B.
Summarizing the above considerations we can thus state the following.
If the emitter A moves with a constant velocity v relative to the ether, then
the absorber B circling round A moves along a cycloid path (see Fig. 13).
The phase surfaces emitted by A in equidistant times will reach B in a
sequence corresponding to a periodically varying frequency v'(f). How-
ever, the velocity w(í) = v + R(í) of the absorber along the cycloid alsó
changes periodically and if the inner frequency \*{f) of B adjusts itself at
any instant t' to the velocity of motion of w(í) according to (31), then the
radiation of A falling on B with periodically changing frequency v ' ( 0 will
be exactly in resonance with the changing inner frequency v*(/') of B.
Thus we find just as in the cases of the Michelson-Morley and Trouton-
Noble experiments that alsó in the experiment of Isaak and co-workers
the effects which occurred on purely geometrical grounds are compensated
by other effects and therefore the two types of effects caused by the
translational motion of the system compensate each other and no observable
effect remains.
87. The series of failures to find effects suitable for to determine the
translational velocity v of the Earth relatíve to the ether might appear at
first sight as a series of "accidents".
In this view the Michelson-Morley experiment might have been suitable
for to determine v, but for the "accidental" fact that the arms of the inter-
ferometer deform. Similarly the Trouton-Noble experiment is rendered
inconclusive by the apparently accidental fact that mechanical torques
appear to compensate the electric ones. The fact that the mechanical torque
should change exactly to the same amount as the torque inside the suspended
condenser must be regarded as a rather remarkable fact which, however,
is proved directly by the experiment itself.
In the Isaak experiment we are prevented from determining v by the
apparently accidental fact that the inner frequencies of the absorber vary
with the velocity w(í) of the absorber relatíve to the ether.
Similar apparent accidents prevent the determination of v by a number
of other methods.
In a scientific analysis a series of apparent accidents cannot be accepted
as real accidents.
88. So as to remind of precedents in the history of science, we note that
all the attempts to build perpetuum mobile seemed at that time to be pre-
vented by a series of "accidents". Somehow the proposed mechanisms did
not work because the one or the other disturbing circumstance was under-
estimated.
The fact that this series of accidents were not real accidents but they
reflected somé generál law of nature, was first clearly pronounced when the
French Academy of Sciences declared that it will not be concerned in
future with perpetuum mobiles.
The above declaration is only a negatíve result. It was realized that the
series of failures encountered cannot be accidental but that there must be
a generál law of nature which makes it impossible to construct such a
machine.
The second step was to find and to formulate the law underlying the
negatíve experiences. This law was the law of conservation of energy.
It must be emphasized that the law of conservation of energy is a law
which (as far as we know) is valid for all phenomena. This law gives, how-
ever, only a framework for many particular laws of physics without deter-
mining these laws precisely.
89. It must be noted that should one find somé new phenomena where
the conservation of energy is not satisfied, then this phenomenon would
restrict the validity of the conservation law without making the law itself
invalid. Indeed, the law of conservation of energy has proved its validity
upon a very large field of phenomena and thus it is clear that this principle
does indeed refíect correctly an aspect of laws of nature. Its success would
not be undone even if we were to find somé particular phenomena outside
its validity.
The above remark can be supported if we remember the activities of the
alchemists. Trying unsuccessfully to produce gold, the alchemists found
a large number of laws of chemistry. Eventually it was realized that there
exists a law of nature which had made the efforts of the alchemists futile.
The law itself is that the chemical elements consist of indivisible atoms.
Today we can produce gold from suitable elements by nuclear chemical
methods. Nevertheless the laws discovered by the alchemists remain the
foundation of chemistry. The fact that we know now the limitations of
a law does not affect its importance inside its region of validity.
90. It is often pronounced as a principle of science that a hypothesis can
only be maintained until no single fact is known which contradicts it and
it is claimed as soon as there is one fact which contradicts a given hypo-
thesis that the hypothesis must be dropped.
It is a fortunate thing for the development of science that this principle
is only pronounced in textbooks but is not taken seriously by scientists.
Indeed, if a hypothesis proved its worth by explaining a series of facts then
this is an indication that this hypothesis reflects correctly at least part of
reality and therefore such a hypothesis has a content of reality. If new
phenomena seem to contradict the hypothesis a careful analysis is necessary
to the effect of whether and how the new phenomena affect the validity
of the hypothesis. It is very important for the stability of scientific progress
that hypotheses which have proved their worth are not thrown away when
the first difficulty arises.
1. GENERALIZATION OF NEGATÍVE EXPERIENCES
1. REPRESENTATIONS
e, % G, • • • (i)
for certain real quantities and write
* We use the term "real quantity" because we cannot find a better expression.
We understand as explained in the text the quantity as it exists objectively as distinct
from its measure.
alsó be described in a different representation, say R', and we have thus
R'(&) = E', R'{%) = P\ R'(£i) = Q'...
The E', P', Q are of course functions of the E, P,Q; the latter functions
give the transformation between the representations R and K.
95. In particular if © stands for an event then it can be represented
relatíve to a system of coordinates K by
K{<g) = x
where
x = r, t
and r is the representation of the coordinate vector r of the point in which
the event took place while t is the measure of the time t at which the event
took place.
A series of events (Sj, 6 , . . . (S„ can thus be represented in a system of
2
coordinates as
m) k = xk k= 1, 2 , . .., n (3)
with
and F are the measures of the distance r between the charges respectively
that of the force % acting between them. For the moment we take it for
granted that % and t can be expressed by Fand r in the usual way and we
discuss merely the problem of how to determine the measures of the charges.
Measuring F and r and taking the validity of (6) for granted we obtain
one equation for the two unknown quantities e and e . Considering the x 2
F, = ^ r
k *, 7 = 1 , 2 , 3 (7)
where F is the force acting between e and e, from a distance r when the
kl k
third charge (e.g. e ) is moved away. From (7) we find for the measures
m
e = r /
k
k l km
- , k,l,m = permutation of 1, 2, 3.
The sign of the square root has to be taken properly so that the right hand
side of (7) should be positive in the case of repulsion and negatíve in the
case of attraction.
97. If we have n > 3 charges e e , . . ., e„ then we can determine the
1; 2
measure e of e in more than one manner. We find thus with the help of (7)
k k
' 1
lm ' i l' '
m
where k, l, m and k, /', m' refer to groups of three charges with the help
of which we attempt to determine e . k
We must expect that the various methods for determining e lead to the k
same numerical value. In other words relations (8) give a check of consistency
of the Coulomb formuláé (7). If we carry out measurements with one fixed
distance r only, then the checks support the assumption that the force act-
ing between two charges e and e, is indeed proportional to the measures
k
98. Sometimes it is claimed that relation (7) gives simply the definition
of the measures of charges. This claim cannot, however, be maintained
altogether. — Indeed we have seen that relation (8) provides checks of
relation (7); if e.g. having four charges and we were to find from measure-
ment that
^12^13 , ^12^14
TI '
••23 ^24
then we are led to a result contradicting the law (7) in a particular instance.
Thus (7) is not merely a definition; nevertheless it contains a definition
also.
Indeed, if a set of checks of the form (8) is satisfied, we cannot conclude
that Coulomb's law has necessarily the form (7). If we were to assume
instead of (7) a law of the form
*u = p , (9)
It is seen immediately that if (8) is fulfilled also (10) is fulfilled and vice
versa. Thus introducing a number of different functions q> into (9) we find
consistency for any postulated function <p if we find consistency for one
of them.
99. We see thus that measuring forces between charges z , k = 1, 2,
k
3 , . . . we can establish that the law of force (regarding its dependence upon
the charges) is of the form (9). No information can be obtained, however,
upon the function 95. The choice of the function <p is in fact quite arbitrary
and the law in the form (7) contains a definition in so far as it puts the
function cp(e) = e.
The law (7) might also be written as
F =C ^ ~ ,
kl (11)
where C is a constant. The difference between (7) and (11) is merely that
a different unit is chosen for the charge. We saw, however, that one can
generalize (7) also to a larger extent; replacing (7) by (9) in this way one does
not change merely the unit but one deforms the scale used for the measures
of the charges.
100. Let us denote for clarity the measure of a charge e by e if the
fc k
measure is taken to correspond to the ordinary Coulomb law (7) and let
us denote e' the measure of t we obtain from Coulomb's law in the form
k k
(9) (with a fixed function <f). We find
thus there exists a one to one correspondence between the measures e' and
the measures e of charges e.
We may thus regard the e and the e' just as different representations
k k
e = R{t )
k k e' = R'{t ),
k k k = 1, 2, 3 , . . . , n
where R and R' stand for two possible representations.
3. DISTINGUISHED REPRESENTATIONS
101. The Coulomb law can be expressed taking the measures of charges
in the representation R or in the representation R'. The representation R
is simpler than the representation R' as in terms of the former the Coulomb
law contains simply the product of measures of charges while in the repre-
sentation R' the force between two charges is obtained by a more complicated
combination of the respective measures.
The representation R has therefore a certain advantage over other repre-
sentations. Apart from this convenience there is a further reason for to
prefer the representation R.
Joining two charges t and e together they appear to act like one effective
x 2
ei í e = e ,
2 12 (12)
where we write the sign í for "joining the charges together". In the repre-
sentation R relation (12) can be written
e + e = e
x 2 12 (13)
while in the representation R' we find in place of (13)
<p-\<p{eí) + <p(e )) = e' . 2 12 (14)
4. MEASURES OF LENGTHS
105. When we jóin rods together, we find that (in terms of proper
measures) two rods I and I of length l and / joined together are equal
x 2 x 2
h + h = li2-
The scale in which the measures are additive can be taken as the distin-
guished scale of lengths. If we were to use a distorted scale in terms of the
latter, the length of the rods l and í might appear as
x 2 = <p~\lx) and
l'i = 9> (4) and the length of the rod 1 appeared as
-1
12
rod x „ l > k = 0, 1, 2
k n fits exactly between the points % and ty,. k
We can now ascertain whether or not the rod x when turnéd round still kl
tained by (17) are additive; they give aconsistent system of measures if the
k — 1, 2 , . . . can be chosen in such a manner that the r obtained by
r k í kl
erties of the rods. The everyday experience shows that using solid rods,
additive measures can be obtained indeed inside the margin of error of
measurement.
We may also reverse the argument and state that we define as ideál solid
rods the ones with the help of which an additive scale of lengths can be
obtained.
Constructing a scale of measures with real rods we may find small incon-
sistencies. Such inconsistencies can be made use of to determine deforma-
tions which real rods suffer when moved about and these deformations give
the deviations in behaviours of the real rods from ideál solid rods.
109. Further we note that if we obtain additive measures r , for the k
r'ki = ar , k
a>0.
The r' are the lengths expressed in a new scale. We may also introduce
kl
a new scale using a factor a < 0. Doing so we reverse our convention and
ascribe smaller measures to larger rods. Such a convention is unusual but
nevertheless is internally consistent.
K(*ki) = r„ k,l = 0, 1, 2 , . . . N
of the distances between various pairs ?$ and % of points.
k
We expect the measures thus obtained to obey the relations
ri k +r lm = r k m , (18)
where the equality sign stands if the points ty , ty, and ty lie along a straight k m
line. If the relations (18) are found to hold for the measures of distances
taken for any group of three points then this can be taken as a qualitative
check of consistency of our method. The above result supports in particular
the assumption that our measuring rods behave like solids.
111. For to obtain measures of the coordinates of the points ty we may k
= *«
define (in a representation K) the position of % relative to % . From experi- t k
ence we can take it that the position of a point relative to another is given
by a three component quantity, therefore we suppose
T
kl — kl,l> r
OT/,2' kl,3 r
The relation (19) stands for three relations, i.e. for one relation for each
of the three components of the vectors.
The relations (19) are automatically satisfied if we suppose
r « = i-/ - r fc /, k = 0, 1,. .., n (20)
where
r
/ = r
0/» T
k = T
0k •
with solid rods, therefore we suppose the law of Pythagoras to be valid, thus
r£, = rf, for k, 1 = 0, 1 , . . ., n (21)
where we have written short
3
T
kl =
X kl,i • r
I=L
Relation (21) is valid in an orthogonal system of reference. In terms of
skew coordinates we can suppose in the place of (21)
rf, = T Gr
kl kl (22)
where G is a symmetric and positive definite mátrix. In particular supposing
G = 1, (22) reduces to (21).
The following considerations will be carried through in representations
where we do not specify whether they are orthogonal or skew. In this way
we can considerably simplify the actual calculations and the results can be
applied in orthogonal representations if this is desirable.
For the moment we take thus (22) as a hypothetical connection between
coordinate measures r and measures r of distances. The precise meaning
w kl
with them can be expressed in terms of quadratic expressions of the form (23).
choosing the units along the axis suitably we can take the coordinate vectors
of these points to be given by
r = 0,0,0,
0 r 1 = 1,0,0, r = 0, 1, 0,
2 r = 0, 0, 1.
3 (25)
Writing down (23) for k = 0, / = 1, 2, 3 we find
G„ = rf, / = 1, 2, 3 (26)
G = j{ri
u + r?-rl). (27)
Thus (26) and (27) give the elements of G if the coordinates of the four
points (24) are to be given by (25).
It will be necessary to suppose further below that
det G # 0. (28)
Whether or not the above relation stands, depends on the numerical values
of the distances r in terms of which we express the elements of G. We have
M
to require therefore that the points (24) should have distances in terms of
which (28) is fulfilled. The latter requirement is equivalent to the require-
ment that the points (24) should not lie in one pláne. Thus we have to start
our procedure with four points (24) with mutual distances satisfying (28).
115. Considering five points
where D ( m )
is a vector with components
thus
r m = G -i D ( m ) ( 3 3 )
The relation (33) gives explicitly the coordinate vector r m of a point ty„
in terms of measured distances only.
3. QUESTION OF CONSISTENCY
116. Introducing (33) into (23) we find that the relations (23) are indeed
satisfied for the coordinate vectors of the five points (29) provided (31) is
satisfied. Introducing (33) into (31) we find a relation
D( >G- D > = r .
m 1 (m 2
m (34)
The latter relation gives a quadratic equation for Solving the above
equation, we can express r in terms of the remaining nine distances be-
m
tween the five points (29). [Since (34) is a quadratic equation, we obtain
in generál two solutions for r . ] If the measured value of r coincides with
m m
one of the solutions of (34), then (25) and (33) give a consistent set of
coordinate vectors for the points (29).
Relation (34) gives thus a test of consistency; it shows whether or not
the distances between the five points (29) can be expressed in a form (23).
117. Adding a sixth point to the five points (29) we can determine
the coordinate vectors r„ of *)3„ as
r„ = G-W>
where the elements of D are obtained from an expression of the form (32).
(n)
r Gr„ = r\
n and r„Gr = i - (r* + r%- r * J .
m
conditions.
4. VARIOUS REPRESENTATIONS
of the mátrix G should not vanish. It can be taken that the latter condition is
fulfilled if we choose the four standard points (24) so as not to lie in one pláne.
det G = 0 (35)
to N + 1 points such that these vectors satisfy all the relations (23) then
we can take linear transforms
r' = Sr + s
k k k = 0, 1, 2,... N (36)
Thus we see that provided the coordinate vectors r satisfy the overdeter- k
mined systems (23), then the transformed coordinate vectors r' satisfy the k
relations (38).
120. When constructing coordinate measures for the coordinate vectors
t of the points ty we assumed in 114 (25) particular values for the
k k
were to assume in place of (25) that in somé representations K' the coordi-
nate vectors are given by
rí = r ,
kr2 r' ktS k = 0,1,2,3 (40)
where the r' are chosen arbitrarily, then we can regard the r' as transforms
kJ k
t = Sr + s
k fc k = 1, 2, 3 (41)
with
$ik — k,i ~ 'o,t
r r
k,l= 1,2,3.
— ó.i r
Applying the transformation (41) to any of the coordinate measures of ty k
dct|r^-.ri|5É0
the latter condition is necessary for to make det S # 0 and to make the
transformation (41) a reversible one.
From the above considerations it follows that providéd consistent coor-
dinate vectors r can be introduced, supposing distances to be given by (23)
k
Thus the coordinate vector of the point ty„ in the orthogonal representation
is obtained as
r ' m = G -l/2 (m)_
D (42)
123. The above statements can also be formulated in another way. If the
measured distances r between the points of a set can be expressed by a
u
quadratic form (23), then one might conclude that the space in which the
points are situated is "Euclidean". Or if no consistent coordinate measures
can be obtained one might conclude that the space involved is "non-
Euclidean".
We do not think, however, that such a conclusion has any meaning. The
fact that the overdetermined system (23) possesses solutions r , k = 0, 1,
k
rods used. Roughly speaking one may conclude from the consistency of the
measures that the measuring rods made use of are behaving like rigid
bodies, i.e. if the measuring rods are turnéd or shifted they do not change
their length.
Of course the procedure described provides only necessary conditions
for the rods to behave like rigid rods.
124. A further aspect of the question is as follows. As relation (23) is
a (generalized) form of the law of Pythagoras we come therefore to conclude
that the latter law can be tested experimentally. This statement appears
at first sight paradox as the law of Pythagoras is usually proved with the
help of the axioms of geometry.
In fact no paradox is involved. The axioms of geometry simply reflect
the properties of ideál solids. The experimentál test described above is
a test to the effect that our measuring rods behave like ideál solids.
x
nr> m
= 1. 2 , . .., n. In a particular representation K we have
K(tJ = r .
m
We obtain the coordinate vectors in another representation K' with the
help of a linear transformation; we may write
K\i )
m = t' = Sr + s.
m m (43)
A linear transformation of the coordinate vectors may alsó be inter-
preted in a different manner, by writing in place of (43)
C = Tr m + t (44)
where detT # 0 and t is a constant vector.
5*
r = S - ^ - S - V
m (46)
C = Tr' m + t\ (47)
T' = S T S - 1
(48a)
i.e. it gives the connection between the coordinate vectors of $ m and ^3*
in the representation K'.
127. In a more generál notation we may also write
mj=r , m (49)
where % stands for the deformation which shifts the points S$ into points
m
K(%) = T, t
The relations between the representations T, t and T', t' of £ are given by
(48a, b).
The points ty m = 1, 2 , . . . n can also be taken to be the points con-
m
a. DEFINITIONS
thus
r£ = r Ő G O r .
2
w w (51)
Provided O satisfies the relation
ŐGO = G (52)
we find from (51)
r
r*i = ki k,l=\,2, ...
Thus the deformation mátrix O produces deformations which leave the
measures r unchanged. We denote such matrices orthogonal matrices and
kl
to
ŐO = 1.
129. Equation (52) gives the definition of the representation of an ortho
gonal mátrix in one particular system of reference. We may write
O = X(£>).
From (48a) and (39) we find
1 1 1
O' = S O S , G' = S G S , (53)
where
O' = K'(£>) and G' = K'{®).
Thus introducing (53) into (52) we find
Ő G O ' = G'.
We see therefore that the definition (52) is independent of the system of
reference in which we represent © and £).
130. Relation (52) gives nine equations for the nine elements of O.
However, since G is a symmetric mátrix only six out of the nine equations
are independent and the matrices obeying (52) form a set depending on
three independent parameters. We may denote this by writing O in place p
K(£) ) = O
v p or K(ti) = p.
Thus we write p for the parameters defining a particular orthogonal de
formation mátrix and write p for its representation in a system K.
gonal group.
Indeed, taking the determinant of both sides of (52) we find since
det G > 0
det O = ± 1 . (54)
We see thus that there exist orthogonal matrices with determinant + 1
and others with determinant —1. From (54) it follows that any orthogonal
mátrix O possesses an inverse O p ; thus multiplying (52) from the left
p
l
Őp^GOp = G, 1
ŐpGOp = Ö „ G O , = G (55)
thus the product
O, = O O p q
Ő G O = G.
r r
The unit mátrix obeys also (52) and matrices are always associative, thus
we see that the matrices obeying (52) fulfil the postulates of a group and
thus they form indeed a group.
The matrices with det O = + 1 form a subgroup which subgroup may
be called the proper orthogonal group.
Since the O form a group thus their transforms to another system of
p
O;. = OWOPO'o- . 1
(57)
r;; = 0 « O J R W = r w
133. The question may be raised as to what are the common features
of the various representations O , O -, OJ-, . . . of a deformation mátrix
p p
e'V-'M,
thus they depend on one real paraméter q> only. As can be seen easily an
orthogonal deformation
r* = O r„ + t
p
can always be regarded as a shift and a turning round through an angle <p
where <p can be determined with the help of the eigenvalues of O . In partic
p
ular for any orthogonal mátrix £> a representant K exists, so that
p 0
K (&) = 1
0
\ 0 0 £
where e = + 1 is found for the proper and e = — 1 for the improper
orthogonal matrices.
The measure q> of the angle is independent of the representation, we
have thus
4. RIGID BODIES
1. GENERAL REMARKS
which are distributed about uniformly in somé region 8t; further, clocks
Ei, ©> • • • E« near the points. In a given representation K we can aseribe
2
K(t ) = r„
r v = 1, 2 , . . . , n
to the points and we may adjust the clocks (S„ in a more or less arbitrary
fashion.
We may require in a purely qualitative manner that measures of co-
ordinate vectors of points close to each other should not differ too much
and similarly the readings of close clocks should not deviate appreciably
from each other. Having defined a system of reference with the help of
standard points and standard clocks, we can — interpolating between the
standard coordinates and time measures — determine the four-coordinates
x(p) of the orbit of a partiele crossing the region 31 by expressions of
the form (1).
137. In the previous chapter we have shown that making use of solid
measuring rods we can obtain a distinguished representation of coordinate
vectors. These distinguished coordinates can be taken to reflect particularly
clearly the properties of solids. The question arises whether it is possible
to find a distinguished representation for time measures alsó which reflect
particularly clearly certain physical processes?
138. We shall see that it is possible indeed to find distinguished time
measures starting from actual physical processes. Before describing the
above methods we want to emphasize that time as such has no particular
rhythm and time can adequately be expressed in terms of very different
measures. All that we can say about time is that it flows into one direction
only.
In the literature there exists a considerable confusion because the selec-
tion of a distinguished scale for the measure of time is confused with what
is called a "definition of time".
By analyzing actual phenomena we can compare the rhythms of processes
with each other; we can find e.g. that one process may be accelerated or
slowed down relative to another but our observations are always confined
to physical processes which proceed in time, and the comparison of such
processes.
139. An apparent method for the determination of a distinguished time
scale is provided by Newton's first law. Indeed if Newton's first law is to
be valid then there exist systems of reference K such that in terms of their
measures the orbits of free particles are given by linear expressions, i.e.
r(í) = r + \t.
0 (2)
(3)
where t is the measure of the atomic time and t that of ephemeric time.
A E
Should future observation show that (3) is not satisfied — this would be an
interesting result from the theoretical point of view.
It must be emphasized that if there existed a deviation from (3) one
could not put the question whether the ephemeric time scale or the atomic
time scale gives the true scale of time?
A deviation from relation (3) would simply show that somé of the
quantities we suppose to be constant do in fact vary in time. We remark
that the introduction of sidereal time depended on the assumption that
the Earth rotates uniformly and one of the main conditions for this to
be is the assumed fact that the moment of inertia M of the Earth is E
can check the consistency of the adjustment. Indeed, we expect that the
clocks thus adjusted should be synchronous to each other. Thus viewing
from $ not E but, say, E, / > 0 the image E$ of E/ seen in the telescope
<
k 0
fc)
The latter test when fulfilled supports not only the originál hypothesis
about the mode of propagation of light, but it also supports the assumption
that the rate of the standard clock E was uniform indeed and that the 0
Exchanging light signals between the various pairs of points we can deter-
mine according to (4) the numerical values of the r experimentally. So as kl
( r - r ) = f|,
f c l
a
' M=l,2, ...«. (4a)
As it was pointed out in 115 the above set of relations is mathematically
overdetermined if n > 4. However, in the cases where (4a) admits of solu-
tions, these can be obtained with the help of (33) in 115.
In particular using equ. (42) in 121 we may obtain systems of coordinates
in an orthogonal representation.
Once the r , are determined by return signals and are found to give a
k
Atkl = r ijc.
k
148. Summarizing the procedure we see that using the exchange of light
signals between the points 5Js k = 0, 1, 2 , . . . we can synchronize the
fc
points
The system of coordinate vectors and alsó the modes of synchronization
of clocks obtained by the above procedure can be subjected to a number
of tests of consistency. If the tests lead to positive results, then it can be
taken to support the hypothesis regarding to the mode of propagation of
light relative to the system of reference K in which the points 9p are at fc
rest. However, it is important to point out that the fact that there exist
consistent coordinate measures in terms of which the light appears to prop-
agate isotropically is only a necessary condition which has to be fulfilled
and this condition may be fulfilled even if the mode of propagation of light
is not isotropic; we shall return to this question further below.
Relation (5) can also be written in a different manner. Let us for this
purpose describe events by four-vectors; thus we write
£(<£) = x = Xx, x , x , x 2 3 4
where (£ is the event taking place at a time / in a point 5(5 with the coordinate
vector r and
r = Xi, x , x$ / = x± . *
2
signal from ?$ , and (5/ the arrival of that signal in ^ we may write in the
k h
where
X
kl = / x — x
t
is the vector of the "four-distance" between the events @ and (£/ and
0 0 0 \
fc
(1
0 1 0 0
0 0 1 0
r =I: 0:
\o 0: X 1 • (6a)
The symmetries which can be obtained in various formuláé by the latter definitions
of the fourth coordinates are misleading ones; by putting e.g. x, = ict we obtain
such apparent symmetries between space and time coordinates which do not cor-
respond to anything in reality.
150. If we succeed in constructing consistent coordinate vectors and
succeed in synchronizing clocks in a consistent manner, then we may assume
that light is propagated according to (5) or according to the equivalent
relation (6). However, the tests which prove the consistency of our pro-
cedure give only a necessary condition for light to be propagated iso-
tropically.
Indeed, let us replace the coordinates x of events (5 by
x' = Ax + a, (7)
x« = xj - x*' = A x . w (8)
= 0
thus if
ATA = 0T 0*0 (9)
Multiplying (10a) from the left by F " 1 and from theright by A - 1 we obtain
in place of (10a) a relation which is identical with (10a), namely
A" = 1
r'Xr. (10b)
A íQ) (Q)
rA =r
(12)
where q stands for the six parameters defining one particular solution
of(12).
153. Repeating the considerations of 131 for the orthogonal matrices
O we find that the A form a group, the so-called Lorentz group.
w ( , )
O 0
(a)
0 1
(13)
with
OO = 1 (b)
where O is an orthogonal mátrix of third order. We find that A satis- ( t )
( B 0 0 -vB\
0 1 0 o.
A<°> = (14)
0 0 1 0
0 0 B )
with
-1/2
B= 1-
/ V o V
l+—-(B-l) -vB
\ - yfi/c 2
B J
Inserting (14) or (15) into (12) one verifies that A is a Lorentz mátrix (,)
indeed.
155. Both (13) and (15) represent a three-parameter manyfold. From
the group character of the Lorentz matrices it follows that
A<*>A<'> = A<*-> T
Vov
o + (B-l) -XB
A<*-*> = (16)
vJJ/c 2
B
where
V Ov,
and thus
V.
2
Writing
A »- > = A *
( T 1
q = tp, v
uoU
L= O +
1+ B
1/2
B= = (1 + u O U )1/2
c 2
With the help of (10b) we obtain also an explicit expression for the reciprocal
of a Lorentz mátrix, i.e.
transformation.
With the help of (19) and (20) we can clarify the physical significance
of the parameters q occurring in the Lorentz matrices.
159. Supposing that x(p) and x'(p) represent the orbit of a point A t
relatíve to K respectively K' we can write for the velocity of that point
relative to K and K'
dt r „, di' r'
V
^ - ^„v^T ( A ) V
" - UVTTB (B)
- ( 2 3 )
\ A = - cHJ/B = v. (24a)
From (24a) and (24b) we conclude that the velocity of K' relative to K in
measures of K is equal to v, further the velocity of K relative to K' in measures
of K' is equal to — V.
We see therefore that systems of reference K and K' (connected by a
Lorentz transformation) are in generál in a translational motion relative
to each other; the velocity v of the translation is closely connected with
the k4 and 4k (k = 1, 2, 3) elements of the transformation mátrix.
The mátrix
L = O + terms of the order of v /c 2 2
thus for velocities v <^ c, the mátrix L defines a nearly orthogonal trans-
formation connecting the axis of K and K'. Thus L defines the orientation
of the axis of AT relative to K'. Taking the higher order terms into considera-
tion, we find that the axes of K' are not exactly orthogonal relative to each
other when considered in the measures of K.
160. With the help of Lorentz transformations we can thus construct
a six-parameter manyfold of systems of references. The systems have
translational velocities relative to each other and their coordinate axes
are turnéd round relative to each other. In terms of the coordinate measures
of any of these systems light signals appear to be propagated isotropically
with a constant velocity c. We may denote a system of reference thus obtained
a Lorentz system of reference. A Lorentz system is therefore an inertial system
in which the coordinate measures are chosen in such a way that the prop-
agation of light appears to be isotropical expressed in terms of them.
C. HOMOGENEOUS PROPAGATION OF LIGHT
1. THE CONCEPT
where
x* = r , tk k and x, = r,, t,
are the four-coordinates of the departure of a signal from a point ^ and k
of which was r, at the time of the departure of the signal, which point how-
ever, at the time t, has shifted to a position with radius vector
i> + v(/, - í ).
k
and alsó
det g = - c det G. 2
clocks (S , Ei, ©2> • • • near the points. Using signals of lights we can syn-
0
chronize the rates of the clocks just as described in 145. The fact that this
synchronization can be carried out consistently is a test of the hypothesis
regarding the mode of propagation of light.
Once the clocks © k = 0, 1, 2 , . . . are synchronized we can observe
fc
the time
t = t + t ,
kI
m m
(29)
a signal takes to move from % to % and back. From (27) and (28) it fol*
k
lows that
rGr + 2Vr/< > - C í > = 0
1 2 (1
(30) 2
where r = r, — r and
k is the time the signal takes to move from ty k
_ Vr + Y C ( r G r ) + (Vr)
2 2
C2
The time t^ of the flight back from % to *p is obtained from the above
}
fc
í (i) = V (r -r )/C + y í .
/ f c
2
w
we suppose
r _ _ 2 R
1
u — 0 •
c
Further we take
C 2 = C 2 - Vv.
«-I = Í G _ 1 / 2
-G-^O/C]
(O C0/C
C-k )• (4i
>
Comparing (38) and (40) we find
a ' S g a a ' - g'.
- 1 - 1
Thus putting
S = a'-V (42)
we find
S ^ g S " = g'. 1
given by (42) or (43) lead from a representation K to one K' satisfying (37).
168. We shall always suppose if we speak of a propagation mátrix (or
of a propagation tensor) that it is represented by a symmetric mátrix with
three positive and one negatíve eigenvalue such that g < 0. u
where the r are the distances between points as measured with the help
tk
of rods.
Similarly, using signals of light instead of measuring rods we can deter-
mine measures of distances r in accord with 163 and we obtain coordinate
lk
l VO V \
(r, - r )|G + - ^ - j (r, - r ) = r? .
fc k k (45)
The question arises whether there exist such representations K where the
coordinate measures i satisfying (44) coincide with the measures r satisfy-
k k
ing (45) in this turn. So as to answer this question we have first to investigate
the relation between the measures r and r of the distances between kl kI
points ty and
k
no reason why these measures should or should not be equal. It is, however,
an empirical fact that
r
kfiki — constant,
or choosing suitable units
r i = hi •
k (46)
170. The experiment proving (46) is the Michelson-Morley experiment
and the Kennedy-Thorndike experiment. Indeed, the latter experiments
prove the following. Marking two points A, B on a solid, the time return
of light T = t + t
AB AB between the two points remains constant if we
BA
thus the measures of the distances between the two points do not change
if we shift the solid.
From the interferometer experiments we conclude that
TAB — T *B*
a
and therefore
R
AB — R
A'B*
jf we take the measure determined interferometrically of a distance to be
r = CT. Thus if two distances appear equal, i.e.
r
AB — PQ> r
when compared with rods, then they will also appear to be equal when
measured with signals of light. If we add to this that the distances of points
along straight lines are additive — we see that (46) will be correct for all
pairs of points ty and ^5, if it is correct for one particular pair. Thus if
k
112
172. The requirements (l)-(3) which lead to (49) contain three distinct
physical statements. These statements reflect upon:
(1) the physical properties óf solid rods,
(2) the physical properties of the propagation of light,
(3) the connection between the physical properties of solids and those
of the propagation of light.
We note that the physical properties of solids mentioned under (1) are
by no means trivial ones. These include the Lorentz contractions suffered
by solids. We cannot, however, determine the elements of the propagation
mátrix g itself from the observations of signals of light.
Thus the question whether or not light is propagated isotropically in a
given region has no particular significance — we can only find out whether
or not light is propagated homogeneously in 9í.
CHAPTER V
1. DEFORMATION OPERATORS
x
l , 2s . . . , X
x
n .
of Q may move relatíve to each other. The motion of the points of Q can
be expressed in a paraméter representation, i.e. putting
x* = MP) = kÍP), Kp)
T
* = 1,2,... n
where i(p) > 0.
Thus choosing various values of the paraméter p we obtain the positions
of tyk various times t(p).
a t
thus obtained can be taken to form a physical system C* and thus (1) can
x
Fig. 15. Scheme of the Lorentz deformation of an orbit
represent the orbits of points relative to one and the same system of refer-
ence K (see Fig. 15).
One finds that the representations of Q and C* relative to a system K'
can be written
*k(p) = Sx (/>) + s 1
fc
x* '(p) = Sxt(p)
k + s j W
(3)
a' = (1 - TO s + Sa
We can thus write
K(%) = T, A and K'<%) = T, A'
T, A and T', A' represent the deformation % in terms of K respectively K'.
2. LORENTZ DEFORMATIONS
AQRA„ = T. (6)
M, = SM S~\q
and thus
M,g'M;= ', g
i.e. the point has at a time t* a coordinate vector r*(/*) and t can be
regarded as an independent paraméter with the help of which r*(í*) and t*
can be evaluated.
So as to avoid misconceptions we may alsó write in place of (8) —using
a little different notation:
non-linear functions of the t and the relations (9b) are non-linear equa-
k
r (f) = r = independent of t.
k k (10)
Let us consider the Lorentz deformations which arise if we use as trans-
formation matrices
and
Yoy
l + ( B - l ) — j - By
v
\Byjc2
B
r
ki¥=r k l .
Thus relation (14) is only valid for particular representations of Q and Q*.
180. A transformation with the mátrix (12) gives the following
with
t = Byr [c k
2
+ Bt k + t0 . (15b)
í<*> = kT k = 0, 1, 2 , . . . .
The clock E* gives thus its signals at intervals T* > T and thus its rate
is slower by a factor yjl —v lc? than that of ©. This slowing down of a
2
clock is of the same kind as that we have found when discussing the
perpendicular Doppler effect and alsó the experiment of Isaak et al.
182. A further type of the Lorentz deformation which is important
rather for the interpretation of ideál experiments than of real experiments
is the following.
Consider as the system D a cylinder rotating around an axis with an
angular velocity co. A transformation A where v is parallel to the axis
T
of SQ leads to a deformed state Q>* which is a cylinder moving with a velocity
v in the direction of its axis and rotating with an angular velocity
to* = co J\ - v^/c .
2
to its axis and it appears to be twisted around its axis. The angle of twist
between two sections which are at a distance a from each other is found
to be equal to
a*vto*
where a* = a ^ / l - v /c .
2 2
183. Taking the above results together — the following principle suggests
itself: the laws of nature are such that provided Q, is a real physical system
then the Lorentz deformed systems
ö* = £(G)
are possible systems obeying the same laws as the system £}.
We shall denote the above principle the Lorentz principle, as it reflects
on ideas put forward already by Lorentz.
We show presently that the Lorentz principle in the above form accounts
for the failures of observing the translational motion of physical systems
relatíve to the ether.
184. The Lorentz principle gives the generál law which explains why
it is impossible to determine experimentally the state of translational motion
of physical systems relatíve to the ether.
So as to illustrate by an example the results which can be derived from
the Lorentz principle consider a Michelson interferometer which rotates
slowly around an axis perpendicular to the pláne of its arms.
The observations of Michelson and Morley showed that the interference
pattern of such a rotating interferometer remains stationary. The above
observations may be interpreted by supposing that the interferometer has
no translational motion relatíve to the ether and thus it rotates like a rigid
body. The observations can, however, alsó be interpreted by supposing
that the interferometer is in a state of translational motion relative to the
ether and in the course of the rotation its arms periodically contract and
relax according to their instantaneous direction relative to the direction
of v.
From the Lorentz principle it follows that if there exists a system £}
which is an interferometer having no translational motion relative to the
ether and which rotates like a rigid body around an axis, then there may
exist alsó a system £1* which is a similar interferometer but which has somé
translational motion relative to the ether and the arms of which contract
and relax in a suitable way to keep the interference pattern stationary in
the course of rotation.
The observer moving together with the interferometer has no means to
decidé whether the object of his observations is a system iQ or a system O*
and therefore he has no means to decidé whether or not together with the
interferometer he is moving relative to the ether.
185. So as to elaborate the above example in a more generál way let
us consider somé physical system ö with a representation
= Q,
with _
Ö* = LjQ) q = tf(q).
We may choose K so that the centre of 0, is at rest relatíve to it; in the
latter case we find that the centre of £}* moves with somé velocity v rela
tíve to K. _
We can choose, however, also a representation K so that
k(Q) = e k(o,*) = e*.
thus we have _
£ (Ö) = Ő
(P)
and _
If we choose
187. Observing the measures of systems C we are not only left in the
dark with respect to the translational velocity of G relative to the ether
but a further ambiguity is left.
We may construct a system of reference K in which the propagation of
light can be expressed by a relation
xgx = 0 (18)
c = ( - det g ) 1/2
as can be seen easily. Thus one of the ten components of g can be deter-
mined if we give nine of them arbitrary values.
189. From the above considerations we see that from measurement we
cannot determine the measure of the velocity v with which our system of
reference K moves relative to the ether. We see also that we cannot deter-
mine the parameters defining the mode of propagation of light. Thus we
can just as little claim that an experiment proves light to be propagated
isotropically relative to the ether as we cannot claim that our system of
reference is at rest relative to the ether.
Sometimes it is claimed that "the Michelson-Morley experiment proves
that light is propagated isotropically relative to any system of reference".
The latter claim is quite unfounded. The Michelson-Morley experiment
proves merely that the time of to and fro fiight of a light signal between
two points of a solid remains unchanged, if we turn the solid round without
changing its translational velocity.
190. The state of the ether in a homogeneous region is described by
ten parameters nine of which cannot be determined by the experimentál
investigation of physical objects. These parameters cannot be determined,
because the laws of nature (as far as they are in accord with the Lorentz
principle) show symmetries, which lead to ambiguities in the interpretation
of the measured data. A phenomenon, if it can be interpreted consistently
using one particular choice of the nine parameters, can equally well be
interpreted using another choice for the parameters.
formation.
In a particular representation (19) might be written
e ( o = £ (,)(e)-
9
/(/) = lj\-v{tflc2
o>(í) = (O J\ - v{tf/c . 2
More precisely the phase q>(t) of the clock at the time t will be
t
<p(t) = co J Jl-vit'Ylfdt'. (20)
o
The relation (20) is supported directly by the experiment of Isaak et al.
and also by the experiments on decay times of elementary particles and the
results concerned with the perpendicular Doppler effect.
3) A rod at rest will not change the measures of its length when turnéd
round, i.e. we find for the coordinates of the ends of a rod which is turning
round with constant angular velocity,
r
^(0 = 0> *(0 = I
r
c o s
"""i / sin co í, 0 .
If the rod apait from rotating alsó moves with a constant velocity v then
the length of the rod will change periodically. The latter deformation ac-
counts for the result of the Michelson-Morley experiment.
The adiabatic principle formulated in 192 can be taken to hold only
approximately and it holds the better the smaller the acceleration to which
the system £} is subjected. To see this we have to discuss the mechanism
of the Lorentz deformations in more detail.
a. RELAXATION PROCESSES
195. The question may be raised what are the physical processes which
produce the Lorentz deformation. e.g., what causes a rod to contract when
accelerated ?
To answer this particular question we remark that a solid rod consists
of atoms which are in a state of dynamical equilibrium. Accelerating the
rod by a small amount, we disturb the equilibrium of the atoms. After
the outside forces producing the acceleration cease the (moving) atoms
have to establish again an equilibrium configuration. This process takes
place quite independently of whether there appear relativistic effects or not.
We note that by accelerating a real rod we produce elastic waves inside
it and the state in which the accelerated rod moves as a whole (a con-
figuration with a constant final velocity v) is attained only after the elastic
waves produced by the acceleration have died down and the new state of
equilibrium has been established.
If we take the forces between the atoms to obey the Lorentz principle
then we expect the equilibrium configuration of the moving atoms to be
slightly different from that of the atoms at rest and therefore, taking the
relativistic effects into consideration, we have to expect that, after the
elastic disturbance produced in the rod by the outside forces dies down,
the system settles down into the equiübrium configuration Q* of the mov-
ing system — which configuration differs from Q, the configuration of the
system at rest.
196. If the system is accelerated slowly and step by step, then the system
has time after each consecutive step to settle down into new and new con-
figurations. If on the other hand the system is accelerated in a continuous
manner, then its configuration will lag behind the continuously changing
configuration Q(t) which corresponds to the changing totál velocity of the
system. Only after the acceleration has come to an end will the configura-
tion eventually catch up with the real equilibrium configuration correspond-
ing to the final velocity of the system. We illustrate this process schematically
in Fig. 16.
Q(t)
Q'
Q
t=o Kf Ft,
Fig. 16. Scheme of the dynamics of a Lorentz contraction
co" = co J1 - t> /c
2 2
I
Fig. 17. The Lorentz deformation in connected and non-connected systems
co*vl*
as can be seen easily subjecting the coordinates of the points of the originál
configuration to a transformation A (see alsó 182).
T
and as the angular velocities are all the time equal no phase shift can
develop.
199. The state of affairs is quite different if we suppose the wheels to
be fixed on the common axle. As follows from the adiabatic principle the
rotating axle when accelerated shows a tendency to twist; the axle thus
infiuences the motion of the rotating wheels and makes them to shift by
the amount q>.
To see the process more clearly, we note that the axle has to overcome
the inertia of the rotating wheels and in the case of symmetry it will turn
the one wheel by an angle — <f\2, the other by an angle + cp/2. If the
wheels have large momentum of inertia then considerable forces have
to act between the axle and the wheels to produce the shift.
It may happen that the forces exceed the cohesive forces of the system;
in the latter case the wheels may break off the axle and proceed to rotate
without further changing their phases. Alternatively it may happen that
the axle breaks in a point between the two wheels if the inner forces are
not sufficient to adjust the phases of the rotating wheels. It may also hap-
pen that the axle deforms permanently under the stress; in the latter case
the permanent deformation manifests itself in the fact that the final state
of the system differs from the Lorentz deformed configuration of the initial
state.
In all the three cases discussed above where the phase shift cp does not
establish itself, we can take that the acceleration of the system was not
adiabatic as the acceleration produced changes which were too large for
the inner forces, so as to maintain the equilibrium configurations correspond-
ing to Lorentz deformation.
In particular in the case of the freely rotating wheels the "inner forces",
i.e. the forces keeping the wheels connected to the axle, are of zero
strength — and in the latter case even the slightest acceleration must be
taken to be non-adiabatic.
acting along the positive x-axis, the masses will obtain at the time / veloc-
ities
The system of the two masses thus accelerated will not contract.
If we connect, however, the two masses by an elastic string, then the
string (being a connected system) will try to contract when it is set to move
into the longitudinal direction. Thus the string produces a tendency for
the system to contract. If the string is strong enough and if there is sufficient
time at their disposal, then the masses will eventually be pulled closer to
each other and their distance will be eventually equal to
where v is the fmal velocity of the system. However, just as in the example
with the two wheels, the string — if it is not strong enough — might alsó
break or deform. In the latter cases the Lorentz contraction does not come
about and the acceleration which produced the permanent deformation of
the string must be taken to exceed the measure of the adiabatic acceleration.
It becomes clear from the above consideration that the limit as to what
acceleration acts adiabatically is determined by the physical properties of the
accelerated systems.
201. We have seen that with the help of the Lorentz principle the results
of many experiments can be interpreted.
The question arises whether it follows from the actual experimentál
evidence that the laws of nature are invariant with respect to the whole
of the Lorentz group ?
The question is not trivial and it can be stated straight away that the
invariance of laws of nature can only be ascertained to the proper Lorentz
group, i.e. to transformations with matrices for which
v < c (21a)
B > 0 (21b)
detA = + l
p (21c)
From the last relation it follows automatically
6 = 1 (21d)
(see 156). The transformations with matrices obeying (21) form a sub-
group of the whole of the Lorentz group; (21a) must be supposed so as
to obtain transformations with real elements; (21b) contains a real physical
assertion, i.e. a transformation with B < 0 would define a system C* in
which the inner motions are reversed in time relative to those of Q. There
is evidence to the effect that not all types of physical motions occur also
in the time reversed form — we come back to this question in 263.
A transformation with B > 0 but det A = — 1 would produce a ö *
p
which is a mirror image of £1. There exists also evidence that the mirror
image of a possible physical system is not always a possible system. The
relation (21d) refers to the trivial fact that changing the measures of a
system by a constant factor 9 # 1, in generál we do not get a possible
system.
202. We note that although we have to restrict the operators A, giving
possible Lorentz deformations to the proper group of Lorentz transforma-
tions obeying (21) we may use for coordinate transformations matrices
A from the whole of the Lorentz group. Indeed writing
(q)
Ap^A^ApAO- , 1
1. ADDITION OF VELOCITIES
dt A
' dt
The distance vector pointing from A to B at the time t is given by
and therefore the rate of increase of the distance vector can be written
(,)
with time with the help of a factor; the change of the distance between A
and B can thus be conveniently calculated.
205. Apart from the relation (1) another definition of the relative velocity
can also be introduced as we show presently.
Consider a point P which moves with a velocity
dx P
dt
relative to a system of reference K. The velocity of P can be expressed
relative to a system K' which system moves with a velocity v relative to K.
In the measures of K' we have
d X p
- w'
dt'
where w' expresses the velocity of P relative to K' in measures of K'.
The transformation of coordinates between K and K' can be written
r = Lr' - c Ut', 2
(3)
t = - \ w ' + Bt'
with
V O V
L=l+—=-(*-!),
(4)
u = - \B, U = - \B/c . 2
Lw' - c U 2
-\mt'+B
cr
Inserting (4) into (5) we have further
c i
where
w' = w{ + w , 2 = v(vw')/i> 2
v+w
vw' (7)
1+ —
c 2
dt
w' + v , c2
VW VW
W = 1+—Ö- ; V = W 1+ —2-
<r c
The above formula gives the measures w and w' of the relative velocity
between O' and P in measures of K and of K'.
2. ADDITION FORMULA A N D LORENTZ DEFORMATIONS
207. The addition formula (6) has also another significance. Consider a
system O and its representation Q relative to K. Suppose £} contains a
point 91 which moves relative to K with a velocity
dt = w.
w* - Y + W l + W 2 / g
(9)
c l
to v.
We see that because of the Lorentz deformation the system £} suffers
when accelerated, the deformed front @* appearingin the deformed system;
£i* will move with a velocity w* # v + w.
The velocity of the front <S* relative to C* can be taken as w* — v which
velocity differs from w by terms of the order of v /c . 2 2
If v and w are parallel we obtain instead of (9) the particular relation
v+w
w* = . (10).
VW K
'
c2
take up when it was at rest. If the rod is made to move with a velocity w
relative to the Earth and if w is parallel to v then the velocity of the rod
relative to the ether is thus increased and its length should further diminish.
If on the other hand the rod is accelerated so as to move with a velocity
w relative to the Earth so that w is in the direction opposite to v, then the
velocity of the rod relative to the ether is diminished and consequently
somé of the originál contraction is relaxed and the length of the rod increases.
Thus giving to a rod velocities into various directions and mapping the
changes of lengths it suffers one might suppose that both the absolute
value of v and its direction could be determined.
Similarly making a clock to move into various directions relative to the
Earth one might expect to be able to determine v from the changes the rate
of the clock suffers.
There is no doubt that the above effects exist, i.e. a rod changes indeed
its length and a clock its rate when made to move relative to the Earth.
Nevertheless the real observation of moving rods or of moving clocks do
not permit to determine the velocity v of the Earth as it can be seen from
the analyses we give presently.
210. Let us consider two clocks A, B. Suppose at first the clocks to be
close to each other and also to be at rest relative to each other. In this
preliminary state the clocks can be synchronized and afterwards accelerated
adiabatically so as to move with specified velocities. Let us consider the
motion of the clocks with respect to a system of reference K which is at 0
rest relative to the ether. (We shall see presently that it is unimportant for
our considerations whether or not we can pick out the system K among 0
x (t) = x (t )
k A k + c(t - t ) = a - (c + v) t + ct.
k k (14)
The signals reach B at times t' so that k
x {t'k) =
B x (t ),
k k
a — (c + v)t k + ct' = b
k + wtk
and
b - a + (c + v)t k
tl = ^—— . (15)
We may write
tl = Ú + kT' A (16)
where T' is the interval between subsequent signals arriving from A into B\
A
T-t' t ' - c + v
T
A — 'k+1 k —
l
1 1
A-
C— W
Comparing the rate of the clock B with that of A as seen from B we can
determine the ratio
Q=T ir B A = ^ ^ - (17)
c+v TA
T^T/Jl-Jjc -, 0
Ts-TjJT^w ]?, 2
(18)
where T is the period of either of the clocks if at rest relative to the ether.
Introducing (18) into (17) we find
I c — v c— w ,„
ő= J • (19)
V c+ v c+ w
Making use of relation (8) of 205 we can alsó write
v +w
Ö = J-—77
C+V With V=
+ 2
c
or
2
1 ö12
F
= c
T r f - ( 2 0 )
212. Observing the rate of A from the position of B we can only de-
termine the ratio Q dermed by (17) of the rate of the clock in B and that
of the rate which A appears to have when seen from B. According
to (20) from the observed value of Q we cannot determine the velocities v
and w separately but only their combination V förmed according to Ein-
stein's additional formula. The observed ratio Q is compatible with the
assumptions that A moves with a velocity v', while B moves with a
velocity w' where
(21)
v'w' ~ ° 1 + Q 'z
+
c2
then the latter experimentál result supports the hypothesis (18) as to the
change of rates of clocks; the check also supports the assumption that there
exists a system K relative to which light is propagated isotropically.
0
214. Relation (19) gives one equation for the two unkown quantities v
and w. If one could obtain another mathematically independent relation
between t' and w then one could determine v and w from the system of
equations thus obtained. However, no such equations can be found.
Trying to attempt to obtain a further relation containing v and w one
can determine the velocity of A relative to B by using the radar method.
To determine the relative velocity between A and B the observer near
A may emit light signals at times t = 0 and t = T towards B. Suppose
the signals are reflected from B; they will arrive at times / = t respectively
x
at t = t back to A.
2
From simple geometry (see Fig. 19) we find
Comparing (17) and (23) we see that both measurements—i.e. that when
we try to compare the rate of the clocks A and B, and that when we try
to measure the relative velocity between A and B—lead to the deter-
mination of the same quantity Q which is a function of Einstein's sum
V of the velocities v and w. Thus the two types of measurement described
in 211 respectively 214 give merely a check of consistency of our
assumptions in so far as we find out whether the two methods do indeed
lead to the same numerical value for Q and thus for V. The com-
parison of the results does not help, however, to determine the measures of v
and w separately.
215. Carrying out similar measurements by emitting signals from B
towards A one finds a quantity
v e = ^fi-
2
•
Because v and w appear in a symmetric form in relation (23) one is led to
expect that
t r '
The latter relation provides a further check upon our hypothesis, but
does not provide further information as to the values of v and w. We see
thus that carrying out various observations with moving clocks we obtain
checks of our assumptions. In particular we can check the law of the
relativistic slowing down of clocks. We are, however, unable to determine
from the measured results the velocities of the clocks relative to the ether.
216. A similar consideration can be carried out for how the length of
two moving rods can be compared with the help of light signals sent to
and fro. The result is similar to that found for the comparison of the clocks.
Considering two rods moving with velocities v and — w we can attempt to
measure the length of the one rod with the help of radar signals sent from
an instrument moving with the other rod. Analysing the results of such
measurements we are led to still another determination of V — provided
we suppose that the rods contract when set to move relative to the ether
by factors y/l — v \c and y/l — w /c respectively.
2 2 2 2
The above experiment provides thus still another test of the consistency
of our hypothesis — it does not provide, however, additional information
as to the values of v and H'.
217. The above considerations repeat partly the arguments of 44 in
connection with the Doppler effect. The present considerations go further
than the former in showing that we may add to the observation of Doppler
frequency still other observations and even this further information does
not provide sufficient information for to determine the individual values
of v and w.
-aW = °-
co V,/l - v /c2 2
where v(t') is the velocity of the clock at the time t'. From (24) we see that
T* < T.
If the velocity v(t) is all the time much smairer than c, then we may
develop in powers of v\i)lc and we find from (24) when neglecting terms
2
AT=T-T* =
2c 2
where
v =
2
j^v(t'fdt'
is the average of the square velocity. Thus the rate of loss of the moving
clock is proportional to the average square velocity on its journey. We may
denote AT the loss of phase of the clock.
That a clock suffers indeed a loss of phase when moving with a high
velocity can be seen experimentally when observing the áecay of fast
^-mesons (see in particular 47).
220. Consider two clocks A, B such that A is moving with a con-
stant velocity v while B is moving with a varying velocity
"Át) = v + w(f).
The losses of phase sufíered by A and B, due to their motions, can be
expressed as
cov t
2
J r
* ( 0 =
I?/ ( v + w)2</í==
o
i
co C cot —x
= 47^(0 + - g - T J W í / Í + —w . 2
We may write
i
have
and
{AT -AT ) =^w\
B A l=t (25)
thus
AT > B AT .A
Thus comparing the readings of the clocks A and B at two subsequent meet-
ings, we find that the clock B which moved non-uniformly, suffers greater
loss of phase than the clock A which moves with a constant velocity Y.
222. Considering the two clocks starting at
r^O) = r (0) = 0, fl
we have
A T , - A T A = ™ + % * . (26)
—~W 2
= -zr^ >0 lf U ~* 00.
2c2
2c\
We see thus that, provided the velocity w with which B moves away
from A is sufficiently small, the second term on the right of (26) can be
neglected and we have in a good approximation
turv
AT - AT K —— • (27)
c"
B a
146
226. The question arises whether a humán being or for that matter
a biological process can or cannot be regarded as a clock in the sense used
in the previous paragraphs.
Remembering that all the biological and physiological processes which
take place in a humán being are based on complex interactions between
atoms, it is not absurd to suppose that the resulting processes are Lorentz
invariant like the simple atomic phenomena.
If we accept such a hypothesis then we can suppose that a humán being
placed into the rockét, when being under the influence of outer accelera-
tion, will slow down indeed. His nerves, his muscles, the rate of beat of
his heart may all thus slow down — the rate of oxygen used by the system
can be reduced as the various chemical processes are slowed down. Thus
the organism behaves like under the influence of a drug and slows down
as a whole. In such a "drugged" state it is conceivable that the organism
ages more slowly than it would when it develops its usual activity.
The problem can be compared with that of a certain bacteria, which
if cooled down sufficiently stop almost completely their inner activity and
are revived as soon as their temperature is raised again. These bacteria do
not age while their temperature is low.
Thus the "twin paradox" has to be understood in terms of a slowing
down of the whole of the organism by an outside influence.
It must be emphasized, however, that the assertion that biological pro-
cesses are exactly Lorentz invariant, contains a far-reaching extrapolation.
Whether this extrapolation leads to correct results could only be decided
by future experiments.
CHAPTER VII
RELATIVISTIC MECHANICS
then the orbits of the points ty* of the deformed system are given by
xt(p) = A x (p)
9 k +\ k =1,2,...
where A, is the mátrix of a proper Lorentz transformation.
Describing a physical system not merely in terms of the orbits of its
points, but using quantities like energy, momentum, charge, mass, etc,
we have to extend our formalism so as to be able to give also the trans-
forms of quantities other than four-coordinates.
2. ELASTIC COLLISIONS
p = ww, K = - i mw2
(3)
the suffix / taking up values 1 and 2 refers to the two particles, the upper
index k refers to the states before and after the collision.
In the classical approximation we can suppose that the masses of the
particles do not change in the collisions, thus we may suppose
m<P = mf> = m t i=L,2.
The law of conservation of energy can be written similarly as
Xjo + K£> = Kf> + (6)
with
Kp=^mpw?* i,k=\,2.
231. Applying the Lorentz principle to the collision we expect that the
Lorentz deform of the elastic collision described by (4) and (6) should
deseribe another collision which alsó obeys the laws of conservation of
energy and momentum. We investigate how the expressions (3) have to be
modified so as to make (4) and (6) Lorentz invariant.
Considering a Lorentz deformation with a deformation mátrix A we y
can calculate the velocities the particles take up in the deformed version.
We find applying the results of 207
all parallel we can write in place of (7) using the notation (8)
VWT>
1+ —T-
vw(k)
where
Introducing (9) and (10) into (5) using the notation (8) we find
_ (fc+2) (fc+2) = i
w< k)
+ v
. B
±!_ ( ]
Pi ~i
w
m -
t _ 5 p+2y (fc+2) •
OT UA>
Introducing (12) into (11) and writing = pf^mf^ we find after multi-
plication with y / l ' — tP/c a relation of the following form:
2
(13)
with
W ( 1 4 )
mf^mfif^ ">> r
Í = h l
(16)
íl-%- ^=1,2,3,4
The terms of (13) containing v cancel provided (16) holds and alsó
the relation
mP + mf> = ni? + n%> (17)
is satisfied.
As the result of a short calculation we find that it follows from (4), (16)
and (17) that
mi > + m > = < > + m > .
3
2
3
2
4
P= r. ^ • (20)
234. We see that the law of conservation of momentum (18) can then
and only then be formulated in a Lorentz invariant manner, if we add to it
the relation (19). The relations (18) and (19) together appear Lorentz
invariant if the transformation of mass and momentum is taken to be in
accord with (21).
The classical law of conservation of energy expressed by (6) is not Lorentz
invariant; we see immediately from our formalism that in generál
+ = + (22)
with
E\ =
k)
' = = = mfi + — m-wf? + terms of higher order.
2
y 1 - w^ jc
k) 2 2
E = ^ L = (23)
where m is the rest mass and w the velocity of the partiele — and we find
that the relativistic energy of particles is exactly conserved in elastic col-
lisions.
235. We have obtained relations (20) and (23) as sufficient conditions
for the invariant formulation of the conservation laws. Regarding, how-
ever, the independent parameters involved in the consideration, one can
see easily that for given values m,{0) these conditions are alsó necessary.
Thus supposing that the momentum vector has a direction parallel to w
energy and momentum have necessarily to be assumed to have the depen-
dence upon w as given in (20) and (23). (In the energy expressions there re-
mains of course always the possibility of adding arbitrary constant values
to the expressions £,-.)
3. INELASTIC COLLISIONS
velocities
and wl = - w. 1)
Suppose the particles to collide and stick together after having touched
each other and to come to rest in the end. We suppose thus
wf> = - w « = w = w<> = 0 2
and we have
m vf
0
><1> 2
= p>= 0
2
W 2
and
PÍ + P
1) ( 1)
2 = PÍ + P .
2) ( 2)
2
2w
wf> = wf> = 0; w^ = w = w,
4) 4)
,2 ' 2
the momenta of the transformed system are found to be
„(3) _ _ 2 W W
0 „(3) _ o
2 C 2
/n w
PÍ = P
4)
2
4) 0
thus we find
, / 1 - w /c 2 2
thus for w > 0 the momentum relation in the transformed system is not
fulfilled. The reason for this discrepancy is that the energy relation is not
fulfilled in the originál version of collision, indeed we have
IT?) = Ef = -j^£==, E? = E? = m c . 0
2
X /1-H' /C '
2 2
If the inelastically colliding particles stick together, then their kinetic energy
is transformed into somé other type of energy e.g. the particles may heat
up. We have thus
Ej» + E?> 2m 0
2m =
0 5 = — — ,
C 2
^/l -w /c 2 2
and thus we nave in the place of (24)
P! + P
( 3)
2
3)
= Pl + P ,
4)
2
4)
where
p(4) = p(4) _ m „
and
C. DISTANT COLLISIONS
239. Two particles acting upon each other at a distance influence mutu-
ally their motion. The problem of the conservation laws in this case is
somewhat more complicated.
SchematicalJy a collision at a distance can be pictured by supposing that
the changes of momentum of the colliding particles occur suddenly. We
may thus suppose that (as shown in Fig. 21) the partiele 1 is deflected in
the point A at a. time t and it changes there its momentum from a value
p W t pC2). At the same instant the partiele 2 changes its momentum p
0 2
1 )
PÍ +P 2 =Pf + P ,
1> ( 1) )
2
2)
However, transforming the motion of the particles we find that in the trans-
formed system the changes of momenta do not take place simultaneously.
The momentum of the one partiele changes at a point A* at somé time
1
'*> while the change of momentum of the second partiele takes place
=
1. EXPERIMENTÁL EVIDENCE
n* = Á - n ,
1
(28)
where A is a Lorentz mátrix. (It is immaterial whether we take (27) and
q
II* = my, - wc 2
(30)
with
m=mB = (>
Thus we see that supposing the energy and momentum of a partiele to form
a four-vector, we obtain the same expressions for energy and momentum
which we derived from the Lorentz principle by direct calculation in 232.
242. Supposing energy and momentum to form a four-vector, we auto-
matically ensure that the formalism thus obtained is in accord with the
conservation laws. Indeed, if we consider the elastic collision between two
particles, we may write instead of (18) and (19) 233
with
TT(*) _ n (*) _ pW '=1,2
Relation (31) contains both the laws of conservation of energy and that of
momentum. The relation (31) expresses automatically a Lorentz invariant
law — indeed applying the operator A ~ to both sides of (31) we obtain
where
H (*+2> = njk)* = x^injfc) i,k = \,i.
m\ mc 2
E=
0 0
are the only possible relativistic generalizations of the non-relativistic laws.
The considerations in terms of four-vectors give merely one possible gen-
eralization of the non-relativistic law and it is not obvious from the four-
vector formalism whether or not other generalizations are possible?
This remark is important because it is a fact that the measures of most
physical quantities can be expressed in terms of four-vectors or four-ten-
sors — nevertheless there is no reason why all physical quantities should
necessarily be expressed by vectors and tensors. This is particularly impor-
tant in connection with certain problems of gravitation.
1. NEWTON'S LAWS
F= f (32)
(33)
245. Introducing
or we can write
F = F 1 + F2, (34)
with
F, = m,y x F =2 m,y2
m = 2 2 m = 2 2 1/2 ( 3 ? )
' (1 v*lc ?' ' ' (1 i; /c ) '
We see thus that a mass is accelerated by a given force to a different
extent if the force acts parallel or if it acts perpendicular to the velocity
of motion.
246. Relations which can be expressed in terms of tensors are obtained
if we consider the mechanics of continua; a partiele e.g. can be regarded
as a small cloud of mass.
Supposing thus a mass distribution representing a partiele, we can sup
pose that inside an element of volume ÖV v/e find őp momentum and öE
energy. The force acting upon the matter inside ő V can be written as
SV = főV (38)
I = . — —— • l.T»/
2 2
' Jl-v lc dt
The latter relation can be taken as the space part of a fourvector relation.
Thus the fourforce density
dU.
<f> = B — Í41)
d*
forms a fourvector.
247. The relation (41) gives the density of force which is needed to máin
tain the state of motion of a matériái system with a momentum density
p(x). Ifthe matériái system possesses internál stress, then the expression (40)
must be extended. One finds that the state of matériái moving and alsó
possessing internál stresses can be expressed by an energy momentum
tensor of the form
a -c^x
-q cru)
the above relations can be taken as the continuity relations for the flow of
momentum and of energy and thus a can also be taken as to represent
the density of flow of momentum.
In such cases where T is symmetric such that
2
p = q/c ,
2
the density of flow of energy is (apart from the factor l/c ) numerically
equal to the density of momentum. The latter relation is describing the
inertia of energy.
CHAPTER VIII
A. MAXWELL'S EQUATIONS
rotE = - ± B (a)
c
divB = 0. (d)
The connection between E, D and H, B can be written as
D = E + 4;tP B = H + 4;rM
where P and M are the electric and magnetic polarizations.
The current density i and charge density q are connected by the continuity
equation
div i + — q = 0 (2)
c
and we may add the expression for the force density acting upon a current
of density i accompanied by a charge density q
f = pE + i x B.
1. ANOTHER FORMULATION
rot E = - — B (a)
c
rotB= — É + 4m (b)
(3)
efr
c
divE= 4nQ ef[ (c)
divB= 0 (d)
where we take
i = i + rot M + — P
c
eff
(4)
Geff=e-divP.
It can be seen that (3a-d) together with (4) are mathematically equiv-
alent to (la-d) and it follows also that provided (2) stands, we have also
and charges flowing through matter together with the currents and charges
flowing inside the atoms; f is the force density including the forces acting
eff
i = <rE + - ^
c
where k , x' — xlP d °
a n
characteristics of matter; very often the case
a r e
thus
1
ff = -J-(ExB)
« = J - ( E + B ).
2 2
T = — ( E o E + BoB) + lw (8)*
253. With the help of equations (3a) and (3b) the distributions E(r, t),
B(r, /) can be calculated for any value of / provided the currents and charge
densities i ff(r, t) and Q (r, t) are known and provided we impose an initial
e efí
* The expression (8) giving Maxwell's tensor differs from the förm it is usually
given. We prefer the form (8) for two reasons: 1) The tensor (8) contains only E
and B and is symmetric; in the usual form the tensor contains also D and H and is
unsymmetric. The tensor in the form (8) expresses the purely electromagnetic momen-
tum densities, while if D and H are also made use of, then we obtain a tensor which
contains also part of the elastic energy and momentum which is produced in the
raattér upon which the field acts. 2) The sign of T as given in (8) is the opposite to
the sign usually used. We prefer this choice of the sign because T as defined by (8)
obeys the continuity relation (9), i.e. the components of T can be taken to represent
the flow of momentum density. In regions, where f = 0, (9) expresses the con-
e[I
fact that the electromagnetic momentum increases or decreases but the increase or
decrease is balanced by the change of mechanical momentum; the latter change
appears in the form of a force acting upon matter.
** L. Jánossy: Acta Phys. Hung., 20, 6 7 - 7 9 , 1966.
condition of the form
V <P -
2
\ & = - 4itQ tfí (b) (12)
div A + — $ = 0. (c)
c
#(r, 0) = <i>(r). 0
and therefore (13) must not include an arbitrary initial condition for
The system (12a) and (12b) together with the initial conditions (13), (14)
determine A, í> uniquely for any value of t # 0. Relation (12c) plays the
role of a supplementary condition. It can be seen that the latter condition
will be fulfilled automatically for any value of t if it is fulfilled for one value
of í, and provided the current and charge densities satisfy the continuity
relation (5).
The solution of (12) thus obtained and introduced into (11) provides a set
of field strengths E(r, t), B(r, r) which for themselves satisfy (3a-d). The
latter field strengths satisfy initial conditions of the form (10), namely
c
B(r, 0) = rot A (r) = B (r).
0 0
The initial conditions (15) fulfil automatically (3c) and (3d) for t = 0 as
it must be required for proper initial conditions.
We see thus that solutions of (12a-c) provide with the help of (11) solu-
tions of (3a-d).
1. G A U G E TRANSFORMATION
255. So as to see also the inverse of this statement we note that the
equations (15) can be reversed. Indeed, relations (15) are satisfied if we
put e.g.
4> (r) = 0
0 Á„(r) = - cE (r) 0
1 C rotB (r') 0
(15a)
dx
4TJ |r-r'|
From (15a) it follows also
div A„(r) = 0 and therefore #o( ) r =
0-
Giving therefore the functions E (r) and B (r) [in accord with (3c-d)]
0 0
we can always construct functions A (r), A (r), <P (r) satisfying (15). Thus
0 0 0
V P -2 ,
X
C"
V = 0
for all values of r and t. Indeed replacing in (11) A and tf> by A' and
the values of E and B will not be affected.
The transformation (16) is the so-called gauge transformation — and we
see that the solutions of Maxwell's equations can be expressed by potentials
differing in gauge.
256. Because of the ambiguity of gauge it is often suggested that the
potentials A and d> do not represent real physical quantities but are only
convenient mathematical expressions with the help of which the solutions
of Maxwell's equations can be obtained. We think, however, this not to
be the case. So as to show why we think that A and $ have their good
physical meaning, we investigate in a little more detail the set of wave
equations.
2. RETARDED POTENTIALS
(17)
(b)
The integrals (17) give indeed solutions of (12a-c) as can be shown insert-
ing them into (12a-c). The latter integrals give, however, only particular
solutions of (12), as the solutions (17) fix already A and <P completely and
therefore they leave no room for initial conditions of the form (13). Besides,
(17) give the potentials in a definite gauge.
The generál solutions of (12) can be obtained by adding to (17) suitable
solutions of the homogeneous wave equations, i.e. denoting by A = (0)
V A 2 ( 0 )
- 4 A ' = O,
( 0
1 ..
(18)
y2$(o) $(0) o,
=
divA<°> + - í » = 0.
c
Equations (18) admit of non-trivial solutions.
It can be seen easily that we can always construct solutions of (12) of
the form
A(R, 0 = A « ( R , t) + A<°\r, t) 1
cP(r, t) = &"Kt, f) + <£ (r, 0 (0)
j
so that A(r, /) and <£(r, /) provide the solutions of Maxwell's equations (3)
with an arbitrary initial condition (10). We note that A and <P are unique-
(r) W
ly defined by (17) and the uncertainty of gauge is felt only when determin-
ing A ^ and $ the solutions of the homogeneous equations (18).
(0 (0)
258. The potentials defined by (17a-c) are the so-called retarded poten-
tials. They have a simple physical significance. The values of potentials in
a point r at a time t are determined by the distribution of currents and
charges in the surrounding at times earlier than t. The current and charges
situated at somé time f in a point P' at a distance R = | r — r | from the 7
point P are felt in P at a time í — t' + R/c. Thus the action of the charges
and currents is propagated isotropically with a velocity c •— at least as far
as the retarded potentials are concerned. In our opinion the forms of
(17a-c), the retarded potentials, express the fact that the electromagnetic
action is propagated isotropically with the velocity c.
This is the result which we referred to in chapt. I when we claimed that it
follows from Maxwell's theory that the electromagnetic action is propagated
isotropically with a velocity c.
3. A D V A N C E D POTENTIALS
259. The wave equations (12a-c) admit also solutions of the form
The latter expressions are called advanced potentials. The kinematic inter-
pretation of (19a-c) is that the charges and currents at times í" > t determine
the potentials at the time t in a point P with the coordinate vector R. Such
an "action from the future" has, of course, no physical meaning; somé
authors are inclined to reject the physical significance of both potentials
not only because of the uncertain gauge, but also because of the possibility
of expressing fields in terms of advanced potentials.
260. In our opinion the retarded potentials have a good physical mean-
ing while the advanced potentials have no such meaning*. The advanced
potentials can be eliminated from the theory in a simple fashion. Let us
write A { i } resp. A {i} for retarded and advanced potentials derived from
w (a)
I e f f =I + Divn, (25)
where
I = i, - CQ
D i v ¥ = 0, K }
— Fa = E, i = l, 2, 3. (28a)
c
Div F = 0, (29)
with
~ 1 (*)
F = y((e-F)) (30)
(see Appendix II 468).
267. The energy and momentum can be expressed by a four-tensor, i.e.
(>
2
1 ~ ~
T = - — ( F - F + F-F). (31)
Separating (31) into space and time components one fmds with the help
of (7a, b) and (8)
T
ik = ik>T
Ti4 = r, 4 = - c§h Tu = c-u. (31a)
1. RETARDED FOUR-POTENTIAL
Y(x) = J Ieff(x
^ +X)
t / R
3
(a)
(33)
T= - Rjc (b) I
X = R, T R = | R |. (c)
The expressions (33) contain explicitly the space and time parts of coordi-
nate vectors and therefore it is not obvious that the relations (33) are
invariant.
We show presently how the invariance of (33) can be proved with respect
to reversible linear transformations.
We consider a linear coordinate transformation
x' = Sx + s. (34)
According to Appendix I we have for any covariant vector field
A'(x') = S-^AÍx).
S - I ( x + X) = i; (x' + SX)
1
eff fr
and therefore
Y'(»') = J l
°"( '+x S X )
d * R ( 3 5 )
with
T=-R/c. (36)
We note that the coordinate transformation (34) does not affect the vari-
ables of integration and therefore in the integrál on the right of (35) is to
be integrated just as (33) into the components of R.
We introduce
X' = SX (37)
G = UU-c BOB, 2
(a)
V = AU - c aB, 2
(b)
2 2
C = c a - A, 2 2
(c)
and also
det S = c/c' with c'=V-detg. (d)
270. Separating the space and time components of (37) we may write
with the help of (40)
R = UR' + AT', (a)
(42)
T = — Rjc = B R ' + aT'; (b)
eliminating T from the first relation with the help of the second we find
AR
R + v - ^ ] r .
ac
Differentiating the above relation into R ' and taking the determinant of
the differentiated expressions of both sides we find as the result of a simple
calculation
ARI 1 , l dR
= det S~7a; (43)
R +- 1 —det
ac } R — 1 ŐR'
expressing R in terms of R ' and 7" with the help of (42) and (41) we find
AR 1
R + = — (- C T' + V R ' ) . 2
(44)
ac ac
However it follows from (38)
2
C T' = VR'- C\/R'GR'
with
V O V
(45)
G = G + - - ^ -
Iefffr' + X')
d R'.
3
(48)
R'
We see thus that in the system K' the four-potential can be expressed by
the same expression as in K. The interesting feature of this result is the
manner in which the retardation is introduced in the system K', Consider-
ing the action of sources from a point P' to a point P we note that the
action which arrives in P at a time t' starts at the time + t' where \T'\ is
the time of flight from P' to P; that this is indeed the case follows from
the fact that 7" is obtained as the suitable solution of (38). Furthermore
according to 163 the expression (47) for R' gives just such a value that
2R'\c' = tp . + t'p.
P P
i.e. 2R'/c' gives the time of the return flight of a signal of light between
/' and P'.
Thus the expression (48) has to be taken supposing that the measures
of three dimensional distances R' in K' are given by the times of return flights
of signals covering the distance.
The latter result shows the consistency of our considerations.
around a time t oscillate for a short period. The oscillation can be described
0
by a source density
Ieff( , 0
r
differing from zero if r ~ r , t ~ í .
0 0
r
-r
t ' = t - ~ — -
The integrand on the right-hand side vanishes unless
t' ~ t 0 r ~ r,
1
0
t ~ t. 0
We see thus that the disturbance in x gives rise to a spherical wave expand-
0
D. MAXWELL'S EQUATIONS
AND THE LORENTZ PRINCIPLE
where F may contain the potentials, the field strength and sources of the
field in the representation K.
According to the Lorentz principle we expect that provided % is a field
moving in accord with Maxwell's equations then
£ ®)=r
f l
M„gM = g. q (58)
The transformation (57) has the form of a coordinate transformation:
therefore if F(x) obeys Maxwell's equation in K, then the deformed field
quantities F*{x*) obey Maxwell's equations in a system of reference K'
the measures of which are obtained as
( 5 , ( < , ) _
x' = M " x + vl . with M '= M .
q
E2 = E — E^.
Splitting into components parallel and perpendicular to v we can alsó
write in the place of (61)
E?(x) = E i ® , BJ(x) = B j ®
and
E*(x) = I ? ( E ( X ) -
2 |(vxB (x)) 2
(62)
The inverse of transformation (60) split into space and time components
can be written
x = r, t
AND
r = B(i - ví) + r
t 2
(63)
1 = B(t - vr/c ) 2
to v.
1. T H E FIELD OF A POINT CHARGE
The transformed field is found with the help of (61) and (63)
Be(r - ví)
E*(r, 0=
Be (64)
B*(r,0= — ( v x r )
cs
with
s = B^
2 2
- \t) + r
2 2
and therefore
Be
Et(r,t) = -^-,
i.e. the field strength is increased by a factor B relative to that of the charge
at rest. The longitudinal field with
r = 0
2 and s — Br
gives an electric field
Thus the field strength is reduced by a factor l/B in the longitudinal direction.
2
In the extrémé relativistic case when Bf>\ the field carried by the charge
is concentrated into a small region in the vicinity of the pláne perpendicular
to v and moving with the charge. The field thus moves with a velocity
v ~ c, it is nearly transversal and the electric and magnetic field strengths
are nearly equal.
Thus the field carried by the fast moving charge resembles strongly an
electromagnetic pláne wave accompanying the charge. The latter result
can be compared with experiments; it is verified indirectly by observing colli-
sions between fast moving charged particles.*
280. The measure of charge e* of the moving partiele may be defined
by Gauss theorem, i.e.
e* = -^-J*E*(r,/>ÍS, (65)
RELATIVISTIC EFFECTS
OF THE ELECTROMAGNETIC FIELD
281. Developing into powers of v/c we can classify effects for velocities
v < c as of the first order, of the second order or of higher orders.
The essentially relativistic effects are of the second (or higher) orders.
Somé effects of the first order have, however, played a great role historically
and it was not always fully realized that the latter effects can all be accounted
for without making use of relativistic concepts. Such misunderstandings
appeared in connection with the Doppler effect (not including the per-
pendicular Doppler effect) the effect of aberration of light and in particular
in connection with Fizeau's result giving the velocity of light in a moving
médium.
It was claimed sometimes incorrectly that "Fizeau's experiment gives
an experimentál proof that the relativistic law of addition of velocities is
the correct law, the classical addition law being only approximately cor-
rect". The latter statement was already criticized in chapter VI208. Discuss-
ing the mechanism of first order effects we shall show in particular in this
chapter that the results of Fizeau can be understood in terms of the classical
theory of propagation of light in refracting média and it is not necessary
to make use in this treatment of relativistic concepts.
282. The claim that the first order effect can be explained without using
relativistic concepts has to be made a little more precise. One finds that the
first order effects can be correctly interpreted supposing
(1) that a closed physical system when made to move with a constant
velocity v does not deform noticeably. This assumption implies that for
the purpose of the first order approximation we can neglect the second
order effects like the length contraction and the slowing down of the rates
of clocks. We suppose e.g. when giving the theory of aberration that the
telescope moving together with the Earth behaves like a rigid body and does
not suffer deformations while the orbital velocity of the Earth changes.
(2) We suppose that an electromagnetic field of strengths E and B acts
upon a physical system Q moving with a velocity v as if Q was at rest and
placed into a field with field strength
E = E + -(vxB),
c
eff
B = B - -(vxE).
c
eff
The expressions (1) describe first order effects only; they can be obtained
considering the Biot-Savart law and the law of induction. From the latter
it follows that an electric charge moving with a velocity v possesses a mag-
netic field
B (r) =
e
E (r) = - •m v x r
m
Supposing the principle of action and reaction to hold (at least up to terms
of the order of v/c) from the above two relations (1) can easily be obtained.
The interpretation of first order effects as the aberration and Fizeau's
result can be obtained in a much more elegant way using the Lorentz trans-
formation then with the help of detailed classical considerations. We give
the latter considerations only so as to show that these effects can indeed
be understood without the use of relativistic concepts. The fact that these
effects can also be interpreted making use of Lorentz transformations
proves the consistency of the theory.
where p is the dipole moment. Similarly we find for the field of a magnetic
dipole
Bír) = - g r a d - ^ , E = 0, (2b)
r
where m is the magnetic dipole moment.
The field of an electric or a magnetic dipole moving with a velocity v
can be obtained as the Lorentz deform of the fields (2a) and (2b).
The moving dipoles of both kinds possess both electric and magnetic
fields. The electric field of a moving electric dipole is found to be — apart
from terms of the order of v jc — to be that of the originál dipole moving
2 2
together with the dipole. The magnetic field has a complicated structure.
The main (non-relativistic) features of the magnetic field of a moving electric
dipole can be obtained, if we take in place of a dipole a thin strip, which
is polarized perpendicular to its surface (see Fig. 22) and which is moving
parallel to its surface. We see thus that the electrically polarized strip, when
set to move, shows a magnetic field corresponding to a magnetic polarization
given by (2b).
+ + ++++ +
The strip behaves as if the electric dipoles when set to move would de-
velop magnetic dipole moment
dm = v x í/p/c.
The above result has to be taken with care; the electrically polarized strip
behaves as if it would contain magnetic dipoles of moment
dm = MdS.
However, the magnetic fields of the single electric dipoles when set to move
differ from those of magnetic dipoles of moment dm. The superposition
of the fields of the moving electric dipoles is, however, equal to that of the
superposition of the equivalent magnetic dipoles.
The strip can be taken to consist of electric dipoles of cross-section dS
with an electric dipole moment
dp = TdS.
I* = í, - CQ,
with
v
í = o Q = Bg .
c
0
with
II = í, ~ CQ, I = 0, CQ,
2
thus
1 = 1,0. (3)
The deformed current density corresponding to (3) can be written
I* = i*, (vi)^
with
i* = Bi 1 + i, 2 (4)
ii, i being the components of i parallel respectively perpendicular to v.
2
Q* = -(VÍ)B/C. (5)
W 1 = w —v =1
vw
1+ —
cT 2
and
w|l-
W2 =w - v= -
2
vw
1*
Wy < W2
thus the particles flowing faster in the section C -» D than in the section
A -+ B. Since the particles are thus moving with a variable velocity through
the conductor their density changes accordingly, the actual number of
particles to be found per unit volume being inversely proportional to the
velocity of progress. (This effect can be compared with the density of
traffic along a road. In sections where the traffic is slowed down by somé
obstacle the density of vehicles increases.) The densities of the moving
charges in the sections AB and CD are thus equal to
ei = -^r-e = ^ - ( e + WO
1
c 2
288. The above effects can be interpreted up to first order terms with
the help of relations (1). However, as the higher order effects are alsó of
somé interest we give here the exact calculations which include Lorentz
deformations also. It can be seen easily that the relation obtained con-
cerning the Doppler effect and aberration can be obtained in a first order
ápproximation without making use of the Lorentz transformation but using
only relations (1); this gives an effective field strength acting upon a
moving system O.
Let us consider a source of monochromatic light in a point A at rest
and at a large distance from the origin of the system of coordinates K.
The waves arriving from A produce in the vicinity of the origin oí K a
field which can be described in a good ápproximation as a pláne wave.
The waves satisfy Maxwell's equations, thus they can be expressed in terms
of a four-potential
•F(x) = «P cos 2K
0 (<XX + 4>) (6)
co = ex, (9)
thus k is the unit vector pointing into the direction of propagation of the
waves.
From (6) it follows
F = - 2rox x W .
0 0
289. Taking the Lorentz deformed configurations of the field considered
in the previous paragraph, we obtain the field of a source A* moving with a
velocity v relative to K.
Using the formuláé of 277 we find
F*(x) = F J sin 27c(aA_^ + <t>) (10)
with
F*, = A L . F . A ^ . (11)
We may write
aA _ = A_ a = o* T
(12)
and in place of (10)
F*(x) = FJ sin 2JI ( O * X + <j>).
From (11) and (12) we see that o and F change in the course of deforma-
0
290. Taking the square of both sides of (14a) and remembering that
x*x* = x*v = 0
we find
5 |cos#*- - |
2
+sin #* 2
c
where we have written
x* \/x*v — cos •&*,
thus d* is the angle between x* and v. Remembering the connection between
B and vjc (15) can also be written
x=x*5|l--^cos#*j . (16)
T • (17)
1 COS0*
c
Starting from the inverse expression of (14) we obtain in place of (17)
similarly
co J — . (18)
1 H — cos 0
c
3. EFFECT OF ABERRATION
l/B =
2
|i + -!icos#j j l - - % o s 0 *
and thus
v
cos •& H
cos#* = — • (19)
V
1 -\— cos V
c
The latter expression can also be written
sin
sin &*
1 + — cos#
c
= # _ &* „ 1 i # S n (20)
c
where 0 is the angle of aberration. We come back to the discussion of
this effect in 293. Equation (17) gives the frequency distribution of the Doppler
effect, the latter expression is identical with that obtained from the purely
phenomenological consideration of 38.
u = -L(E 2
+ B )=^.
2
(22)
ö7t An
The energy density of the deformed wave is given by
A* 1
A
5Jl-yCos#*
The relation gives the inténsity of the deformed wave as the function of
direction of incidence.
In particular we find for the forward and backward directions
for é = 0,
c
A*
1
~A~ for = n.
B\l +
~A 1 (24)
for S* = 0
48*
From (24) we see that in the extrémé relativistic case, i.e. for v ~ c, B > 1
the intensity of the deformed radiation will be very small in all directions
except inside a narrow cone the axis of which points into the direction of v.
D í = S (Oo),
n
Öo* = Qo •
Thus we observe F respectively F' with instruments which appear to have
the same configuration; thus from the point of view of the observer it
appears as if not the instrument had changed from Ci -* ö * but as if the
0
3r-£-«,©) = S*.
We may therefore apply the formuláé obtained in 288 — referring to change
of radiation field if the source is made to move — also to the case, where
the sources have remained stationary, but the instrument has been made
to move.
In particular eq. (20) 291 can thus be taken to give the apparent
change of position of a star while the velocity of the Earth changes.
294. It is important to note that there exists a physical difference
between the processes where the source is made to move and that when
the instrument instead is made to move. In the former process there appears
a particular type of radiation emitted while the source is accelerated. The
radiation field F* reaches the observer only after the radiation emitted in
the transient period has passed away. If the instrument Cl is accelerated
instead, then the deformations take place practically instantaneously and
no transient phase is to be expected.
The above considerations can also be formulated as follows. If an instru
ment of observation is placed into the radiation field of, say, a star then it
indicates the direction of the Pointing vector
4n
§ = (E x B ).
c
If the instrument is set to move with a velocity v then it will react upon the
field as if its field strength had changed to values E and B as giveneff eff
471
e?eff = (E ff x B
e eff );
c
the angle between § and § t í { is (neglecting higher order terms) equal to
the angle of aberration.
Let us consider e.g. a narrow metallic tűbe through which we view a star.
The tűbe acts as a wave guide and the star light can pass through it only
ifit is pointing into the direction of §. If the direction of the tűbe is changed
then the heat losses caused by currents — produced by the electric field strength
in the walls of the tűbe — extinguish the beam. If the tűbe is moved with a
velocity v then E instead of E is responsible for the losses and we have
eff
to adjust the tűbe into the direction of § so as to allow the wave to pass.
e { (
Instead of the metál tűbe we can consider any other instrument, e.g., a
telescope or — as was done by B radley — a telescope fiiled with water.
Any such instrument will indicate the direction of § . e{{
S
Fig. 24. Penetration of waves through a moving médium
T(r , 0 = 0
B for /<—
c
where r is the coordinate vector of B and / = | r — r |.
B B A
V A-X'A
2
= - 4m,eff
V 4>-\cp=-4no „
2
(25)
c
e
div A = 0.
+ — S
c
Supposing the S to be an uncharged dielectric we can take
•eff = r o t M + —P (26)
with
V=clJen e = 1 + 4UK = —. T
1— 4nx
We find thus that the planes of constant phase inside 5 are propagated
with a velocity V < c.
It must be emphasized that the radiation in S consists of the super-
imposed effect of the incident pláne wave and the radiation emitted by
the atoms of S. The sources of the latter radiation can be taken as the inner
atomic currents i given explicitly by (26).
efr
The front of the compound wave penetrates with a velocity V < c into S.
There is thus a region into which the primary wave (travelling with the
velocity c) has penetrated — but into which the compound wave (travelling
with the velocity V) has not penetrated. Inside this region the secondary
waves extinguish fully the primary wave as can be seen from solving (29). The
latter equation gives the behaviour of the totál radiation — i.e. that of the
primary wave superimposed with secondary radiation.
Constructing the solution of Maxwell's equation for the region to the
right of S, we find a delayed pláne wave. The front of the latter wave
starts when the compound wave arrives at the right-hand surface of S.
a. DISPERSION
V = c/n(co)
where n(co) is the geometrical refractive index for radiation of frequency co.
The considerations obtained above are only in agreement with experiment
as long as
n(co) ~ Jhe,
P/co 2
0 + P = x E (30)
the electrons.
Relation (30) was first given by Sommerfeld. A similar expression could
be given for the connection of M and B; for the sake of simplicity —
as we want to give here only a qualitative description of the phenomena
— we consider a non-magnetic médium with ti = 1 or x — 0.
300. Using thus the relation (30) in place of (27) we can write (suppos-
ing M = 0)
and we have
V A--^A = - — P .
2
(31)
c c
l
co |
2
co 2
[t CO | 2
47CXÜ) 2
a>l) c 2
alj
co 2
í « 2
1
c2
1 + 4tcx
K =
2 r
co 2
and thus we find for the velocity of propagation of phase planes
the interference does not completely extinguish the field. Thus we expect
a phenomenon which was sometimes called "Vorláufer", i.e. a wave which
appears before the main front arrives. Although the Vorláufer certainly
exists, we have good reasons to believe that the latter phenomenon cannot
be observed experimentally — we cannot go into details here.
(32)
co = co I 1 -
Relation (32) follows from the assumption that both the electromagnetic
fields and their interactions with the atoms of 5 obeyed the Lorentz prin-
ciple. As there exists somé misunderstanding about the significance of (32)
(see chapt. VI 208) we show that (32) can be obtained from Maxwell's
theory also. Furthermore we point out that (32) describes an effect of the
order of v/c and therefore this effect is not an essentially relativistic effect.
302. The atoms of S* are moving with a velocity v, therefore the atoms
behave as if the field acting upon them had components
E = E + - ( v x B)
c
e r f
(33)
B e f f = B - - ( v x E)
c
Taking the constants of polarization x and x' we have thus for the induced
polarization
P = xE
0 M = Z'B . eff 0(34) eff
We denote with the suffixes " 0 " that P and M refer to the polarization
0 0
of the moving atoms. From the result of 283 we find that the atoms
moving with a velocity v and showing polarization according to (34) are
equivalent to matter at rest with polarization
P = P + -(v x M ) 0 0
c (35)
M= M --(vxP ) 0 0
p = xE + X + X
(v x B) - \ v x (v x E)
c c
M = x'B — * +
* (v x E) — ~ v x (v x B).
c
We may express E and B in terms of the vector potential A and thus we
can express
i eff = rotM + —P,
c
also in terms of A and its derivatives. Using this expression the wave
equation
V A - -^Á = -4m
2
e f f (36)
V - 2 \ \ - \ \ v V - V = 0,
2 2
(38)
v = - +
l-l)r
c
n (39)
303. The force with which a moving electric charge acts upon itself can
be worked out with the help of Maxwell's equation. Let us consider a
charge e at rest at t = 0; the charge distribution may be given by g(r)
so that
|g(r) cPt = e and Q(T) = 0 r> a
thus the charge is contained in a sphere with radius a.
The electric field E(r) can be worked out and the force upon itself can
be taken as
F = Je(r)E(r)rf r. 3
(40)
If the charge is at rest the latter vanishes as can be seen from symmetry.
304. We calculate the field of the partiele if it is accelerated so that its
velocity is zero at t = 0. We can thus write for the velocity and displace-
ment of the partiele
1
• 2
\ — vt vr • a =
2
If we take the partiele to move like a rigid body, then the charge and cur-
rent distribution at the time t can be taken as
i(r, 0 = víg |r - y T í s
Q(T,Í) = Q (r-yví
The vector potential can thus be written according to eq. (17) in 257
A(r, 0= -
.•A|,(, + »-L,|,-i)1
c R
R
where we have put t' = t
c
we find
-ÍÁ(r,0)=-^<P (r) 0
R)
similarly we have
l
e | r + R_^ V j R 2 / c 2
4>(r,0) = | ^ J-d V
L 3
E(r) = E ( r ) - - ^ J
0 l - ^ 2
] ^ L d
3
R
2R R
where
E (r) = grad <í>(r)
0 0
with
Uf/(l^f)Üͱ^rtA. (43,
U is a mátrix the components of which have the dimensions of energy.
306. As can be seen easily the orders of magnitudes of the elements of
U are comparable with
wv = F (ou,)
+ (44)
where F (out)
is the outsidé force and F* the self force with which the
J)
charge acts upon itself. Introducing (43) into (44) we find also
Thus we see that the outside force accelerates the charged partiele less
than the corresponding neutral one. The partiele behaves thus as if its
mass had increased as the result of the charge.
Since U is a tensor, the excess mass of the charged partiele appears to
depend on direction. If we take as an order of magnitude relation
U~ 1U 0
mefí x m-\ f- •
c
Writing m e{{ — m + ám, taking a uniformly charged sphere one finds
and thus
- 1 )
so the apparent increase in the mass caused by the electromagnetic reaction
is 2/3 of that corresponding to the relativistic increase in mass expected
from the Lorentz principle.
2. MASS DEFECT
Am = — — 5 - (1 - cos #).
2
Rc*
We see thus that the attraction between the charges causes a decrease in
the apparent mass. Indeed accelerating a pair of particles with opposite
charges, we find that the field of each of the particles acts upon the other
exerting a force in the direction of the outside force. Thus the inner forces
increase the acceleration and the system as a whole can be accelerated
more easily than the system of the unconnected particles. The latter effect
is connected with the mass defect observed in a system of particles bound
together.
308. From the considerations of 237 one is led to expect that a closed
system, e.g. a partiele, has a mass
E
where E is its totál energy. If E is changed in any way the mass is expected
also to change — e.g. if we charge a neutral system and supply an electro-
static energy to the system the mass is expected to change by
Am =
0 . (46)
gives therefore only a necessary condition for the motion of the médium
to which T refers. We have to add to (47) the equations of motion of
(m)
the latter equation describes the circumstance that the density of force
with which the electromagnetic field acts upon the matériái system provides
exactly the force density — F which according to (48) is necessary to balance
the force density + F which is produced by the mechanical stresses inside
the system.
311. The field of a charged condenser can be worked out and in terms
of the field strengths the ponderomotoric force equals
F (el)
= Div T , (el)
the moment of force acting upon the condenser can be expressed with
the help of the density
M (el) = x x F (el).
the totál moment of force can be expressed by the antisymmetric tensor
of third order
m = $M fdx
kl k *, 7 = 1 , 2 , 3 .
Considering a charged condenser at rest, we find
/?#/> = 0 *, 7 = 1 , 2 , 3 .
Transforming the four-tensor M we obtain the tensor M * describing
( e l ) (el)
the state of the deformed system. In generál the space components of the
latter will not vanish, even if those of the former did vanish, i.e. we have
m{f* # 0.
Thus the electromagnetic forces acting between the plates of the con-
denser produce a non-vanishing moment of force if the condenser is in a
state of translational motion.
The system consisting of two charged condenser plates is, however, not
a closed system. To maintain the system, there is need for a mechanical
support keeping the plates apart from each other and internál forces which
counterbalance the Coulomb repulsion of the charge upon the plates. The
totál system including electromagnetic forces and mechanical stresses can
be described by an energy momentum tensor T = T + T — and
(el) ( m )
F. TRANSIENT PHENOMENA
312. We have seen that Maxwell's equations are consistent with the
Lorentz principle in the following sense. If a field % can be taken to result
from a current distribution 3 , then the Lorentz deformed field
can be taken to result from the Lorentz deformed current distribution
3* = m i
The dynamical part of the Lorentz principle asserts that as the result of
an adiabatic interference a physical system Q changes into a deformed
system with configuration
O* = 1(0).
Applying the latter result to an electromagnetic field, one expects adiabatic
deformations of the type
reaches its final velocity v. We may thus suppose that the velocity of the
partiele is given by
, Í0 í<0,
v(í) =
[v t>h.
The coordinate vector of the partiele is thus
i
r(r)= j" v(t')dt' for 0 < t < ^
0
and
r(í) = r + yt
t for t > r x
ri = rOi) - v í . x
Calculating the field of the moving partiele at somé time t > t we find
t
charge is only felt from positions where the partiele moved already with
its final velocity v. The field in I is thus the deformed field as given in
eq. (64) 278.
ct < \ T\
THEORY OF GRAVITATION
315. In this part of the book we shall deal with the problem of the
generál theory of relativity from the points of view we have dealt with
the problems of the special theory of relativity. It seems to us that the
generál theory of relativity can be taken to deal with the effects of gravita-
tion upon physical processes.
Our view that the generál theory of relativity gives in fact a theory of
gravitation resembles to somé extent the views expressed by V. A. Fock
on this subject.*
A. OBSERVATIONAL FACTS
* -^b) = '
2A A 0 (1)
c(r, t) = c = constant,
and relation (1) becomes identical with the wave equation derived from
Maxwell's equations in their originál form.
Similarly, by introducing into the Schrödinger equations the parameters
characterizing the gravitational field one can get the laws giving the effects
of gravitational fields upon atoms. In particular the frequencies of oscilla-
tions of atoms are found to depend on the gravitational potential; therefore
the Schrödinger equation adapted to regions containing gravitational fields,
must lead to frequencies depending on the parameters of the gravitational
field.
318. From a purely mathematical point of view Maxwell's equations,
the Schrödinger equation and other physical laws could be generalized
for regions containing gravitational fields in a large number of ways.
In the following sections we shall give those generalizations which follow
from the generál theory of relativity.
Sometimes suggestions are made to the effect as if the generalizations
of the laws of nature which lead to the forms of the laws in gravitational
fields could be obtained from a priori considerations. According to such
views the laws thus obtained are logically more or less the only possible
ones — and in a paradox formulation one is led to the assumption —
as if there existed no other possibilities, and "nature has to obey the laws
deduced a priori".
Such considerations are at fault; we shall show in the following that
the relativistic laws are based on well-defined physical hypotheses con-
cerning the structure of matter and gravitation. It is a question of facts
as to what extent these hypotheses give a correct description of real nature.
319. In the special theory of relativity only such regions are considered
in which light is propagated homogeneously. The laws governing the
motion of physical systems inside such regions obey symmetries which can
be expressed by the Lorentz principle. In reality light can nowhere be
assumed to be propagated strictly homogeneously, as we have reason to
believe that the propagation of light is aftected by gravitation and regions
entirely free of gravitation do not exist. The Lorentz principle can therefore
be taken to be valid only to such an approximation as gravitational effects
can be neglected. The question arises how the Lorentz principle should be
generalized so as to apply to regions containing not negligible gravitational
fields.
(c) that the points ty are either at rest or have at most translational
k
(x - Xi)g(x - xO = 0
2 2 (3)
where x and x are the four-coordinates of events
x 2 (5 , i.e. of the depar-
2
ture of the signal of somé point at a time t and its arrival in another
x
point % 2 ti h-
a t a me
Supposing that signals are propagated along straight lines we can describe
the orbit of a signal of light in parametric representation by
x(p) = kp + a.
where the vector x has constant components and obeys the relation
xgx = 0
and a is an arbitrary vector with constant components.
322. If the attempt is successful to determine a straight representation
K of the region 9í containing the points ty and clocks E then we may
k fc
signals and interpreting the observational data with the help of relations
of the form (3). However, the coordinate measures r„ and í„ are obtained
as the solutions of a strongly overdetermined system of equations; we can
therefore conclude that the region in which we observe the signals is indeed
homogeneous provided the overdetermined system admits of solutions. We
thus can obtain an internál check of whether the propagation of light in
9t is homogeneous indeed.
However, considering things more strictly — when using the method
described above — we check the fact whether or not all the conditions
(a)-(d) given in 320 are fulfilled. Physically it is mainly of interest whether
or not (a) is fulfilled — thus it is of interest whether or not the propagation
in 9t is homogeneous indeed.
The conditions (b)-(c) reflect only upon the question how the points
$ move and how the clocks © are adjusted. The latter questions are
v v
relative to K are changing in time. Thus taking into account the non-trans-
lational motion of the points ?$ we can introduce a straight representa-
k
but we drop the conditions (b)-(d) thus we suppose that the points ?$ k
may move relative to each other and also relative to the ether in a more
or less arbitrary fashion. The clocks S moving with the points should
fc
where we suppose that the various values of the paraméter p give the time
measures t(p) at which the signal passes points with coordinate vectors
r(p). We may suppose the representation to be such that
t(p) # 0 for any value of p.
xO>)g(x(/>))xO>) = 0. (4)
More precisely we may find that in the vicinity of any given four-points x
light is propagated homogeneously in a first ápproximation. The propaga-
tion tensor g in the representation K may vary with x.
For given g' (8a, b) represents tett differential equations for the four com-
ponents of f(x). This system is thus in generál overdetermined. We may
therefore conclude that, provided the overdetermined system (8) admits of
solutions, then this fact is not an accidental one, but signifies that the region
9í is homogeneous indeed. The representation K' which is obtained from K
by the transformation (7) can be denoted a straight representation.
329. In the following we give the explicit condition which g(x) has to
obey in a homogeneous region and give an expression for the transforma-
tion function f(x) which leads from a curved representation KXo the straight
representation K' of a homogeneous region.
From the relation (8a) it follows that we can put
where a' and a can be determined from the elements of g' and of g(x)
according to Appendix I; 443 eq. (13). A is a Lorentz mátrix; we note that
(p)
P = P(x)
the six parameters of the Lorentz mátrix may vary with x. The paraméter
p(x) have to be chosen so that (9) should satisfy (8b). However, introduc-
ing (9) into (8b) we get a very complicated system of differential equations
for p(x) and therefore the solution (9) is of no practical use.
330. In Appendix II we have given in detail the solutions of (8a, b)
and have also discussed the conditions g(x) have to fulfil for (8a, b) to
admit of solutions. Because of the importance of these considerations, we
give here a short account of the problem the details of which are found in
Appendix II.
Differentiating (8a) into x we find, remembering g' o = 0 (see eq. (29)
in 471),
(3) (3)
( l + c - ) ( g S - ( x ) S ) = g(x).
3
1 1
(10)
Applying the operator n = 1 — c
3 3
1
+ c3
2
to both sides of (10) we find
(3) (3)
S(x) = S(x)-C(x), (11)
with
(3) 1 (3)
C(x) = "2 rcsg(x).
Thus (11) gives a system of linear differential equations for the determina-
tion of S(x). Whether (11) admits of solutions can, however, not be seen
directly.
331. So as to find the conditions for g(x) which have to be satisfied
if (11) is to admit of solutions we differentiate (11) into x and find
(4) (4) Ö) (3)
S(x) = S(x) • (C(x) - (24)C(x) • C(x)). (12)
Relation (12) gives a differential equation for S(x); if the latter admits
of solutions, then such solutions can be obtained e.g. by giving an initial
condition and integrating step by step.
(4)
Since S(x) is to be a third derivative, it must be symmetric in the last
three suffixes. The sufficient and necessary conditions for the solution of
(12) to possess the required symmetries can be written (see for details
477)
(4)
(1 - ( 2 4 ) ) S ( x ) = 0.
(4)
Expressing S(x) by (12) we find
(4)
S(x)-R(x) = 0
where
(4) 1 (4) (3) (3)
R(x) = - ; r ( g ( x ) + C(x)-C(x))
4
(3 + 0 (3)
(3+0
Taking S (x) thus obtained to be the derivatives of f(x) we can develop
f(x) e.g. around x = 0; writing
1 (3)
f(x) = (jl + Sx + - S x + . . . 2
(15)
with
(3) (3)
S = S(0), S = S(0),
xg(x)x = 0
áS(x)=- y (l + (23))(S-R)
The transformation function <f(x)> obtained from <S(x)> does not lead to a
representation K where g is strictly constant, but one obtains an almost
straight representation K' so that
*'(fl) = S'(x') = g' + <5g'(x')
with
1 (4)
«5g'(x') = - ( l + ( 1 2 ) ) R ' x ' +
2
g 1
R
The above consideration does not give strict definition of the almost
straight representation. Indeed, representations K' and K" obeying the
above criteria may show small differences in the elements of somé of the
5. SIMILAR REGIONS
Sí* (see Fig. 27). The propagation tensor in 9t may be given in a particular
0
representation K as
m = go(x).
g(5) = gofx + 5)
g*(5*) = 8o(x* + \*)
Under exceptional circumstances we find that the regions 9í and 9t* appear
as transforms of each other. More precisely there exist cases where a func-
tion X(x) can be found so that
M © g * ( g * ) M ( S ) = g(§)
with (18)
M © = x(g)on
(3) <nn o)
C(?) = (M(5)C*(§*)) , (2)
(19a)
with
(3) 1 (3)
c*«*) = T ***(§*),
(III) (III)
and M(cj) is produced from the elements of M(x) like S(x) of those
of S(x).
Relation (19) admits then and only then of solutions if
(4) (4)
340. Relation (20) gives twenty differential equations for the four com-
ponents of X(Ej). However, M(£) contains six arbitrary parameters, therefore
choosing M(£) suitably we are left^with fourteen conditions between the
g ( 9 and the g*(§*).
Physically we see thus that similar regions are characterized by fourteen
parameters (corresponding to the fourteen conditions imposed on the
propagation tensor). Further, similar regions can be characterized by a
four-dimensional orientation; the relative orientations of the regions 3t
and dt* are expressed by the parameters of Mfé).
C* = S 0Q) q
js a possible system. Furthermore, to the above principle is added a dy-
namical part, i.e. if £} is interfered with adiabatically, as the result of
the interference, it changes into a Lorentz deformed configuration ö*.
The transformations £ can be defined uniquely for homogeneous
q
four-coordinates
Xfc(/>) = táp), tk(j>) ik(P) ^0 '«. = 1, 2,. . ., n.
The Lorentz transformation may be defined by a reversible transformation
x* = X(x) x = 7,-\x*),
344. Equations (22) and (23) cannot be taken to express the Lorentz
transformation in inhomogeneous regions because the system is over-
determined and in inhomogeneous regions the system admits of solutions
only in exceptional cases. Nevertheless it must be assumed that the Lorentz
transformation is somehow connected with the systems (22) and (23).
Indeed consider a signal of light § passing through somé physical sys-
tem O. The orbit of the signal can be expressed by four-coordinates x (p) s
such that
iÁPtó*(.P)y*ÁP) = 0. (24)
Submitting the system Cl to a deformation, we obtain a new system ö *
which is traversed by a signal 3*. The orbit of g* is expected to be given by
x*(/>) = X(x (/>)).
4
If both (24) and (25) are to be valid, then X(x) has to obey the relation (23).
Thus from the hypothesis that in the case of a light signal § passing
through Cl the Lorentz deformation leads to a light signal §* passing
through Cl* we are led to the condition (23) for the transformation
function X(x). Since (23) is overdetermined we see thus that the above
hypothesis cannot be strictly correct.
a. A PHYSICAL EXAMPLE
our particular case to the hypothesis that the interferometer which was
adjusted in one position remains still adjusted if we move it to another
gravitational surrounding.
The above hypothesis is physically unreasonable. Indeed, the gravita-
tional field produces stresses in a mechanical system and the system will
deform so as to compensate these stresses. It is unfounded to suppose
that the deformations which are produced by changing gravitational stresses
upon the mechanical system should give rise to deformations suitable to
compensate exactly the effects of gravitation upon the propagation of light
signals.
So as to see how absurd conclusions we were led to if we were to main-
tain after all such a hypothesis, let us imagine a Michelson interferometer
mounted on a horizontal axis such that the arms move in a verticai pláne.
Turning the interferometer round its axis can be taken to be a Lorentz
deformation. It is obvious that the interferometer will deform under its
own weight when turnéd round. It cannot be expected that the position
of the fringes remains stationary, while the interferometer is turnéd round
its horizontal axis.
In particular we note that the deformations the interferometer suffers
on account of its own weight depend strongly on the properties of the
matériái of the interferometer. Thus two interferometers built of different
materials will deform differently when turnéd round.
347. Taking, however, a small interferometer built of strong matériái
the internál stresses may become negligible and the interferometer will
behave practically independently of the gravitational stresses. Thus the
small and strongly connected interferometer will show no noticeable fringe
shift even if turnéd round a horizontal axis.
From the above qualitative consideration we may come to conclude
that the Lorentz principle in an inhomogeneous region is valid for sufficiently
small and sufficiently strongly connected systems. A mathematical for-
mulation which takes the above restriction into consideration will be
developed presently.
a. FIRST APPROXIMATION
x (p) = x„ +
fc % (p)-
k
The shifted system £i* has its centre round a four-point with a four-
coordinate
o o + {*>
x =x
«2 = *í + 5ÍO0-
The transformation function X(x) cannot be supposed to obey (23) exactly,
we can, however, require that
where A is an orthogonal Lorentz mátrix and a and oc* are given expücitly
q
in Appendix I.
In the above ápproximation we can write
b. A SECOND ÁPPROXIMATION
[M(5)g*(5*)M(5)-g(?)]on = 0 for 5 = 0 .
From the latter relation it follows according to 339 eq. (19)
(3) (3) (3)
M = M • (C - C)
where C is defined by (19a). We can thus write in this approximation
1 (3) (3) _
A(?) * A >(E.) = M !- + - M • (C - C)%\
2
q q q (28)
x' = f(x)
A q 2)' = f A <2) -l
f
and thus A ' contains higher order terms in x' even if A does not con-
(2) (2)
A. GEODETIC ORBITS
1. DEFINITION
The above relation contains more solutions than the relation (1), as the latter relation
implies a linear connection between t and p . Since, however, the restricted relation (1)
gives already all possible orbits of free particles, there is no need to use the more
generál relation (la).
Differentiating (3) once more, remembering (1) we find
(3)
0 = S(p)x\p) + S(p)x(p).
(3)
i(p) + e(p)*\p) =o
where we have written (see 472)
(3) (3) (3)
S" S = g - C = <2.
1 1
358. Generalizing the above result we may suppose that the equation
of motion of a free partiele in an inhomogeneous region can be written
(3)
x + Sx = 0. 2
(4)
The latter relation is Lorentz invariant as we show presently.
Taking a number of particles in the vicinity of a point x their orbits 0
where
(3) (3) (3) A I U ) (3) V3)
359. The equation of motion (4) has still another aspect. It follows
from the dynamical part of the Lorentz principle. that a system Q subject
to an adiabatic interference changes into a Lorentz deformed system ö * .
If there is no interference at all, then a partiele with somé initial velocity Y
will shift during a time t by an amount vr. The latter shift can also be
considered a "spontaneous" Lorentz deformation.
The analogue to this process is the drift of a system in an inhomogeneous
region. The motion in accord with the equation of motion (4) can be taken
as a sequence of infinitesimal spontaneous Lorentz deformations — the
orbit of the partiele in a gravitational field can thus be regarded as a sequence
of such Lorentz deformations.
1. VARIATIONAL PRINCIPLES
S f x(p)g(p)x(p)dp = 0. (9)
The Eulerian equations which can be derived from the above variational
principle can be written
80 d (39) n
with
9(p) = x(p)g(p)x(p). (11)
As the result of a short calculation one finds that (10) and (11) reduce
to relation (4). Thus the equation of motion (4) can also be replaced by
the variational principle (9).
We note that the variational principle (9) defines orbits in an invariant
way, i.e. independent of the representation. Indeed, the transformation
relations for g(x) are so defined that subjecting x and g(x) simultaneously
to a coordinate transformation, the numerical values of 6(p) remain un-
changed. Thus the solutions of (9) and therefore also those of (4) define
definite orbits independent of representation. Solving (4) and (9) in dif-
ferent representations, we obtain the parametric representation of the same
orbits relative to various systems of reference.
In the above sense the relations (4) appear to be exact relations in spite
of the circumstance that in the course of their derivation higher order terms
had been neglected.
362. The fact that higher order terms have been neglected in the equa-
tion of motion (4) leads nevertheless to ambiguities in the following way.
The geodetic orbits which can be determined in an unambiguous manner
must be supposed to be the orbits of sufficiently small and connected
objects.
If we consider e.g. a planet, then we find that the geodetic orbit gives
an excellent first ápproximation of its real orbit. However, taking the prob-
lem more strictly, we have also to consider deformations the body of
the planet suffers, while it is moving from one gravitational surrounding
to another, i.e. we have to consider the tides of the body.
The tides themselves produce perturbations of the orbit and therefore
the real orbit of the planet will deviate to a small extent from the geodetic
orbit. The amount of this deviation is determined by the mechanical prop-
erties of the planetary body, i.e. from the measure of the distortions
which arise while the planetary body continuously adjusts itself to its
gravitational surrounding. Therefore the deviations of the real orbit from
the geodetical orbit depend on the internál mechanical structure of the
planet.
Considering the motion of the planet as a sequence of Lorentz deforma-
tions in the manner explained in 358 we can say that the higher order
terms of the Lorentz transformation express the deformations of the mov-
ing planet. We see therefore that the theoretically expected deviations of
the orbit of a real planet from the geodetical orbit are connected with the
ambiguous higher order terms of the Lorentz transformation.
therefore we find
A
x i = - = (12)
y/c 2
- (w + v)G(w + v)
A
(we note that for G = 1, v = 0 we have x, = .
The value of A depends on the scale we choose for p. Let us write A = m , 0
where we take m to be the rest mass of £t in suitable units and the scale
0
~2
m= / (13c)*
^ 1 - ( v +w)G(v + w)/c 2
Let us take K to represent the kinetic energy, U the potential energy and
m the mass of Q; we can write K— U = L, where L is the Lagrangian of Ci.
In place of (9) we can also write
<5 j W í = 0.
!
(14)
We see thus that (9) corresponds to the Hamilton principle for the motion
for the centre of O.
* We note as a curious feature of equations (13) that although they have been
derived from the generál relativistic equations of motion, they have a remarkable
similarity to the non-relativistic expression. In particular v + w is the velocity of thé
partiele relative to the ether and thus (13b) gives an expression very similar to the
classical expression for kinetic energy.
We note that the inner structure of the system does not appear in the
derivation and therefore we conclude that the orbit of any small closed
system is expected to obey the same law. This result reflects on the prin-
ciple of the equivalence of gravitational and inertial mass.
364. The variational principle can be written also in a different man-
ner. Making use of (13) and remembering dp = díjki we find xgx dp =
= — A x dt and have
2
4
ö . =0.
JJc 2
- (w + v)G(v + w)
Further writing
Jc 2
- (v + w)G(v + w)
we have thus
x,
<5 J dx = 0. (15)
x,
T can be taken to be the propertime of the system and therefore the latter
relation has the form of Fermat's principle in optics. The path followed
by the system in Q is that along which the propertime is stationary.
365. While (9) can be naturally interpreted as a Hamilton, or as Fermat
principle, the usual interpretation given to this relation, i.e. that the system
moves along a "geodetic line" seems to us very artificial. We may of course
"define" the orbits obeying (9) as four-dimensional geodetic lines —
however, it seems to us that it is a play with words if we suppose the
geodetic line to be a "straight line in four dimensions".
The orbits satisfying (9) have specified variational properties and they
can be taken to be a particular type of orbit — however, whatever we
suppose them to be, they are certainly not "straight". As we shall see
later, the solutions of (9) include among others the Kepler ellipses along
which planets move. — If we call those orbits "straight" then we lose com-
pletely the meaning of what is usually called straight.
The satisfactory procedure seems to suppose that the propagation of a
light signal in vacuum is in a good ápproximation straight and an
orbit deviating strongly from that of a light signal is noticeably curved.
With the help of signals of light we can also construct an almost straight
system of reference and orbits in such a system described by
x'(p) = constant
can be regarded as to define almost straight lines. In the presence of a grav-
(4)
itational field when R # 0 the concept of the exactly straight line is
meaningless. However, we are in no need of such a concept, as we can
construct satisfactory systems of reference without having first introduced
the concept of a "straight line" or of a "geodetic line" as this was shown
in detail in 334.
366. Relation (4) can be used to obtain the orbit of a small closed system,
e.g. a planet in a region, where the propagation tensor g(x) is known.
We supposed from the beginning of our arguments that the propagation
of light becomes inhomogeneous in a region, where there is a gravitational
field. Thus relation (4) gives the equation of motion of a mass in a gravita-
tional field, where the field is characterized by the propagation tensor g(x).
Since the motions of the planets are well known the relation (4) can be
used to determine the propagation tensor g(x) in a known gravitational
field. Indeed, in the vicinity of the Sun using a nearly straight representa-
tion we can take the gravitational potential to be given by
where M is the mass of the Sun and G the gravitational constant. The
values of g(x) in the vicinity of the Sun must be such that (4) reduces
— at least in a good approximation — to Newton's laws of motion.
367. So as to carry out the comparison between (4) and the laws of
planetary motion, it is convenient to eliminate the paraméter p from the
system (4) and to obtain a set of equations with t = x f a ) as independent
variable. For this purpose we remember that
dt
4>
~dt
With the help of the above relations we can eliminate p of the equations
of motion, and we get an equation of the form
(2) (3)
v = A + Av + Av , 2
(17)
(2) (3) (3)
where the coefficients A, A, A can be obtained from the elements of (2.
Returning to the problem of determining g(x) we may suppose that
(1 0 0 0
^
0 1 0 0
g(x) = 0 (18)
0 1 0
-c (r)J
V> 0 0 2
a dAc c(ír(r)
r )^ +, v(vgradc
^ _(r))v
1 2
v = - ^g rgrad A—_ —
2
y
2
vvr + ( 2 0 )
c\ ) = C - ^ -
T
2
(22)
^=-73-+-^ ar 2a(vr)v
(23)
where we have written short c in place of c(r). Neglecting terms in v /c 2 2
A • 1 d
2
vr = rr and vv = - —— v .
2 dt
Thus multiplying (23) by v we obtain
(v)
2 dt K }
a.r (24)
1 - 2v,2/„2
jc 2
„2
, = ^-{l-exp{-4^ + ^c }|.
2 2
(25)
relation, i.e.
v = 2
2
j/í + . (26)
Multiplying (23) by r in a vectorial manner we obtain
d , 2<xr(r x v)
—ka—: (>
— (r x v) = ~
x
dt cr 2 2 27
- j r ( r x v
) =
from the above relation we see that the vector
P = rxv (28)
does not change its direction, therefore we can write in place of (27)
P _ 2<xr
y ~ eV
thus
P = aexpi-^1}, (29)
where a is a constant of integration. From (28) and (29) it follows that the
motion takes place in a pláne perpendicular to a. Introducing polar co-
ordinates in that pláne we find
r = f + rV,
2 2
P = r q>,
2
(30)
where <p is the angle of azimuth in the pláne of the motion. Remembering
(3D
we can write making use of (30)
rfo-fjsa*,.
JJtl'-P'lr'
Introducing P from (29) and v from (25) we have if we introduce further
2
«,) — f t
üdS
a. PERIHELION MOTION
2n • . = 2n 1 + •
7c -4a /c 2
"V ' e V 2 2
ca 2 z
v 2
Aq> = 2K • -2 5 .
c
The latter value seems to be in accord with the observed motion of the
planet Mercury. The result obtained from the assumed form (18) and (21)
gives therefore the correct non-relativistic form of the planetary motion
and gives also a relativistic correction which is at least in a qualitative
agreement with the observed deviation from the classical orbit.
The value (33) for the perihelial motion of a planet could be obtained from
our consideration, if we were to add suitable higher order terms to the
expression (22) giving the connection between gravitational potential and
velocity of propagation of light. Such a procedure appears, however,
completely arbitrary andwe conclude that from the theory, in its form given
so far, only the order of magnitude of the perihelial motion of the planets
can be obtained. For to obtain precise results it is necessary to find the
exact relation between propagation tensor g and gravitational fields.
2a r dx 4a
4J = - (37a)
2
cb J (1 + T ) ' 2 3 4
— (30
(37b)
Thus •& is th e angular distance at wh ich a star appears from th e Sun and
AfS th e apparent sh ift it suffers because of th e deflexion.
The value of th e sh ift obtained from th e generál th eory of relativity,
which value seems to agree with th e observed one, is twice th e value th us
obtained:
(37c)
tk = t + x > k
x
k = kT.
%k = Q, 0, 0, kT.
Shilling the atom adiabatically from r -* r* we may carry out the shift
in such a manner that the atom in its final position should be again at
rest, thus we take the transformed coordinates to be given by
§ t = 0, 0, 0, T* .
However, it follows from the explicit expressions of 471 given for the
Christoffel brackets that
(3) (3)
C ( r ) = :C (r*) = 0.
m 444
Since the space components of are zero we find thus in place of (38^
(l 0 0 0\
0 1 0 0
M = (39)
q
0 0 1 0 c(r*)
0 0
Thus taking the fourth component of (38) we find with the help of (39)
T* T
c
(> r
c(r*) *
375. We have seen that the motion of the planets, and the deviation of
light in a gravitational field, are obtained, at least to a first ápproxima-
tion, if we suppose
fi 0 0
^
1
0
'o 1 0 o (40a)
í° 0
0
1
0
o .
-c(rf)
and
C\T) = c (l + 2í>(r)/c )
2 2
(40b)
2 8x k dx k
(42)
£=1,2,3
all the other components are zero.
The non-vanishing components of the Riemann-Christoffel tensor can
thus be written (see 482 eq. (63))
(4) (4) (4) C4)
1 dg 2
1 Ög Ög
(43)
u u u
2 8x dx k t 4g ti dx k dx t
<> 4
d c(r) 2
(2) (4)
The contraction R of R has thus components
(2) 1
RU = c(r)V c(r) = - V g
2
2
u - — (grad g i i ) 2
(45)
2^44
c(r) ox ox,
k
Taking thus only the highest order terms in c into consideration, the
Laplace-Poisson equation (42) can be replaced by
(2) AnG
(47)
(2)
momentum tensor T of matter; thus we might write
(2) (2)
R = - xT
4nG (49)
x =
R - _ g * = -xT-Ag. (51)
- 8 = T 0 (52)
to be the energy momentum tensor of the ether, which tensor signifies the
state of stress of the ether.*
* The above energy momentum tensor is unusual in the following sense. Provided g
has approximately the form (40a), then it resembles the energy momentum tensor
of a gas. The first three diagonal components represent the hydrostatic pressures,
while the 44-component the energy density. In an ordinary gas one expects that the
sum of the first three diagonal elements, i.e. T + T + T of the energy momentum
n 2i 33
5-u + + gs >
3 -gjc*.
Thus the hydrostatic pressure is larger than in an ordinary gas. The latter circum-
stance seems, however, of no importance, as there is no reason to expect the ether
to have the same mechanical properties as matériái média consisting of atoms. We
come back to this question in more detail further below.
The 44-component of (51) in empty regions can be written with the help
of (45)
V c(r) = M r ) .
2
(53)
A solution of the above relation which is regular at infinity can be written
c(r) = c 0 ^ - ^ . (54)
Thus a A-term with A > 0 corresponds to a shielding off effect of the ether.
379. Relation (51) contains the derivatives of g only up to the second
(* + 2)
order. Adding to (51) terms containing the R k = 3, 4 , . . . we can
construct covariant relations containing higher derivatives of g. There is
no valid reason to suppose that the exact relation between g and its sources
should not contain such higher terms. Indeed, under circumstances where
observational data are available the invariants constructed with the help
of derivatives of g higher than the second appear to be too small to give
observable effects. It seems thus very likely that the real connection be-
tween g and its sources differs from (51) which gives merely the relation
obtained if the exact relation is expanded in terms of the derivatives of
g and terms containing third and higher derivatives are neglected.
One might suppose that the effects of higher order terms could be
noticeable, if more information was available about the properties of stellar
objects with extrémé densities.
Supposing, however, that (51) represents only an approximate relation,
it is very dangerous to extrapolate the results obtained from (51) to too
large distances or for too long intervals of time. The danger is of the same
type as if we used the approximate formula e~ ~ 1 — x valid for small
x
values of x and if we tried to apply this formula for large x values. Somé
of the paradoxes which arise when applying Einstein's equations to cosmical
problems might be explained in the above way.
g(x) = g(r) if r P a
where a is a length of the order of the dimension of the central body.
Furthermore supposing the central body to be at rest, we may suppose
9kÁ*) = 0.
The simplest supposition for g(x) would be that it is of the form given
in eq. (40a) 375. However, a g(x) having this simple form cannot satisfy
Einstein's equations. Schwarzschild has shown that Einstein's equations
can be exactly satisfied by a spherical symmetric g(x) which corresponds
to a locally unisotropic mode of propagation of light.
381. So as to obtain Schwarzschild's solution we may suppose that the
velocity of propagation in the radial direction diflers from that in the
tangential direction. Such a mode of propagation can be written down in
polar coordinates r, <j>. We may thus write for the relation giving the
propagation of light in terms of polar coordinates
A 0 0
0 r
2
0 o
0 0 r sin d
2
o
V0 0 0 B
)
(2)
The components of R can be obtained inserting the above expressions
into eq. (12) of 331 and (63)-(67) of 4 8 2 - 8 4 of Appendix II. Wefind
(2) (2)
that among the non-diagonal elements of R only R tíontains non- 12
vanishing terms, however, the latter cancel each other identically. The
(2)
diagonal elements of R are obtained as
>
(2
2 A' B" B'
* n = - - — + — - 2&A ( ' A B
+ ') AB
> (57a)
(2) A > 2
thus we have
A'B + AB' = 0
or (58)
A = y/B,
where y is a constant of integration. Introducing (58) into (57a) and (57d)
we find
ylA = B=y[\-^ + ^kr''
The above expression is found to satisfy identically (57b) and (57a). It repre-
sents the so-called Schwarzschild solution of the gravitational equations.
The constant of integration y is connected with the velocity of light, we
may therefore also write
y=-\lc\ (59)
Thus relation (55) written explicitly has the form
dr2
+ r\d& 2
+ sm -dd<p )
2 2
1 2 0 1 1
1 2
1 kr 1
r 3
c |l-
2
— -~kr \dt
2 2
= Ü. (60)
r 3
382. The Christoffel brackets can be obtained from (56) and the equa-
tion of motion of a planet in the field of a gravitating mass at r = 0 can
be written
OL { CL \ 2CC
Integrating the above relation in a manner analogous to the procedure
given in 369, we obtain eventually for the differential dtp
ds
dtp =
Jc l2a
2 2
exp {4asjc }
2
- exp j - 4/c 2
2c 2
P
c
where v is the average orbital velocity of the planet and Atp is the angular
shift of perihelion in the course of one revolution.
The above result was obtained by Einstein and it is supposed to be in
agreement with the astronomical data obtained for the motion of the
planet Mercury.
b. DEFLECTION OF LIGHT
where b is the distance of the closest approach of the signal. Thus from the
Schwarzschild solution we obtain twice the deflection which we obtained
from the simple theory.
The reason for the factor two thus obtained is that the Schwarzschild
solution of the gravitational field describes a state of the ether where not
only the density varies with r, but also the ether is in a state of radial stress
giving rise to an anisotropy of propagation. The latter anisotropy increases
the deflection of light in the vicinity of a gravitating mass by a factor two.
384. The red shift of spectral lines in the vicinity of a gravitating centre
retains its value whether one uses for g the simple form (18) of 367 or
eq. (56) of 381. Thus the red shift of spectral lines is not affected by
the anisotropic mode of propagation of light in the vicinity of a gravitat-
ing centre.
1. A N INVARIANT FORMULATION
V¥ = — 47cJ eff
Div T = 0
DivJ,EFL 0 (62)
j = J + DÍV n
eff
F = Rot ¥
Alternatively
(63)
Div F = 0
DivJ eff = 0.
The operators Div, Rot, L have to be taken in accord with Appendix II.
Indeed, the set of equations thus obtained contains Maxwell's equations
in homogeneous regions. In inhomogeneous regions using an almost straight
representation the relations (62) or (63) are very similar to the relations
valid in homogeneous regions, the gravitational effect upon the electro-
(4)
magnetic field being of the order of the elements of R.
x(/0g(x(/0)x(p) = 0 ;
(3)
x( ) + e(x(p))i( y
P P =o.
That the orbits of beams of light obey the above relation could be proved
most convincingly, if one were to succeed in giving in inhomogeneous regions
the solutions of Maxwell's equations in terms of retarded potentials in a
manner resembling that given in homogeneous regions explicitly. The
latter procedure meets with difficulties and therefore one may instead use
a method well known in classical optics.
Making the transition from wave optics to geometrical optics, one can
determine with the help of the eiconal the orbits of beams in unisotropic
média. The generalized Maxwell equations have a form analogous to the
wave equations in an inhomogeneous médium. Therefore, considering the
ether as an optically unisotropic médium — the optical properties of which
are determined by the propagation tensor g — we can determine the orbit
of a beam of light with the help of the methods of classical optics.
Such calculations were given by Laue and they prove that in a first ápprox-
imation beams of light which can be derived from the wave equations
follow indeed geodetic zero orbits. From these results we conclude that
Maxwell's equations as formulated in 386 are consistent indeed with the
generalized Lorentz principle.
The question as to the existence or magnitude of the non-linear effects
caused by the polarization of the ether cannot be decided, as at present
there exists no experimentál evidence which gives information concerning
such effects. In particular the question whether or not X = 0 presently
cannot be decided experimentally.
F. ENERGY AND MOMENTUM RELATIONS
OF THE GRAVITATIONAL FIELD
390. Einstein's equations of the gravitational field given in eq. (50) can
also be regarded in a different manner. Supposing
^ g = T<'\ (65)
DivT<*> = J—
- Div gR
2x
and according to Appendix II
Div T^^-l-Grad-R = f >.
(ff
The latter expression can be taken as a force density acting upon matéria!
systems by a gravitational field — or it can be taken as the rate of energy
and momentum transferred from the gravitational field to other fields.
(We can take T also to be the energy momentum tensor of the field of
(m)
R = 2k = constant
i.e. for regions where fields other than the gravitational field do not exist.
392. The force density f is one which is exerted by a gravitational
w
field upon a matériái system. However, the latter force density is not the
gravitational force appearing in Newton's theory, but it is rather an internál
force, which produces in a first ápproximation an internál stress only.
This force is identically zero, if is given by the expression (66) in accord
with Einstein's hypothesis.
The main part of the gravitational action appears in a different form
and quite independent of whether the tensor is given by (66) or by (67).
goto = S(J, 0)
i.e. the distribution of g(x) at t = 0, then the gravitational equations give
a connection between the time derivatives of g(x) at t = 0 and the com-
ponents of T (r) = T(r, 0) of the energy momentum tensor at t = 0. Since
0
derivatives of g and the g . We can thus determine from (65) the values
44 ki
of the second time derivatives of the g (k, l = 1, 2, 3) and the first time
kl
step the g(x) can be determined for times t > 0 from the initial condition
goto.
The procedure given above is not quite unique, as the equations from
which we determine the first time derivatives of g(x) are quadratic equa-
tions. However, choosing once one particular solution, on account of con-
tinuity considerations, no ambiguity remains in the course of the integra-
tion of the equations. Furthermore, taking T (r) arbitrary, it is not certain,
0
whether the system admits of real solutions for the first derivatives.
It seems thus that the gravitational equations (65) give certain restrictions
as to the T(x) for which the equations can be solved into g(x).
The above restrictions may be real restrictions as to the possible states
in which matter appears.* The restrictions may, however, also be inter-
preted not so much to be real restrictions as to the possible mode of
distributions of matter, but it may be supposed that these apparent restric-
tions show the equations (65) to be incomplete. If we were to include in
the latter equations terms containing derivatives of g(x) of higher order
(4)
than the seconds (say terms including Grad R ) , then the relations thus
generalized may become compatible with arbitrary distributions T(x).
395. The position described in the previous section is not unlike the one
we encounter in the case of Maxwell's equations, where — starting from an
initial condition — we can determine the distribution of the electromagnetic
field — provided the motion of the source densities is known.
Just like in the case of Maxwell's equations, also in the case of the
gravitational equations the field g affects the motion of T. Therefore, the
motion of matter can be derived from (65) only, if apart from the initial
condition (66) we give also initial conditions for the distribution of matter,
e.g. in the form
T(r, 0) = T (r)0 (69)
and besides we have to know the equations of motions for T(x), the latter
may contain g(x) explicitly.
396. The gravitational force upon a closed system can be regarded not
so much as an outside force, but rather as an action of the system by itself
caused by internál force. This can be seen from the following consideration.
So as to see the essential features of how the force with which a system
acts upon itself arises, we may consider an example, an electrically
charged partiele placed into a gravitational field. We consider merely the
part of the gravitational force which acts upon the part of the mass cor-
* This restriction may be of the same type as the suggested restriction of Maxwell's
equations according to which only retarded solutions of the wave equations are
realized in nature (see 263). Or it resembles the restriction according to which wave
functions are always antisymmetric. — See also L. Jánossy, Foundations of Phys.
(in the press).
responding to the energy of the electric charge. Thus we consider the
mechanism of the gravitational force upon
Am =~
c 2
where Am is the mass equivalent of the energy E, the static energy of the
electric charge.
Consider the charged partiele to fali freely. In a nearly straight system
of reference we can represent the freely falling partiele as a partiele which
is practically at rest relative to the origin of the system. Calculating the
force with which the electrical charge acts upon itself, we can apply in a
good ápproximation Maxwell's equations valid in a homogeneous region
— as in the nearly straight representation, considering a small partiele,
the terms caused by inhomogeneity can be neglected. We thus find that
the measure of the electric force with which the partiele acts upon itself
is zero. If, however, the partiele is prevented from falling freely, then it
shows acceleration relative to the origin of the nearly straight representa-
tion, and the force with which the accelerated charge acts upon itself can
be calculated in accord with 305 chapter IX.
The force thus obtained tends to reduce the acceleration, i.e. to establish
the state of free fali. This electric force is equal to the gravitational force
which accelerates a free partiele relative to a system of reference where
gravitational forces appear.
397. A little more generál formulation of the problem can be given as
follows. If we have an arbitrary closed physical system, it is kept
together by internál forces which can be supposed to be propagated with
the velocity of light.
In the absence of gravitation the internál forces are propagated homo-
geneously and, as a rule, they produce no resulting force of the system upon
itself. If, however, the system is brought into a gravitational field, then the
mode of propagation of the internál forces is disturbed (just in the way
as the mode of propagation of light). The perturbation of the propagation
of inner forces disturbs the system in such a way that the forces acting
between the elements do not obey any more exactly the principle of action
and reaction and a resulting force arises. This resulting force tends to
accelerate the system just like the Newtonian gravitational force is expected
to do. We may thus suppose that the gravitational force observed phe-
nomenologically is equal to the self force with which a closed system acts upon
itself, if the propagation of the internál forces is made inhomogeneous by the
gravitational field.
We note that in a freely falling particie the propagation of the inner
forces is nearly homogeneous relative to the partiele itself and therefore in
the state of free fali no resultant self force is present.
-ff
CHAPTER XII
COSMOLOGICAL PROBLEMS
* Invariant form means that the relations can be expressed in terms of tensors
and covariant operators.
1. TENSORS A N D DISTINGUISHED MEASURES
xg(x)x = a (2)
where a is a scalar.
Taking a more generál case a physical law describing properties of a
physical system can be described in a form
4 0 5 . The measures of the elements of a tensor (or tensor field) can have
very different values when given in various representations. For this reason
one meets with opinions according to which the one or the other tensor
is "void of physical significance". Putting this question, one must be
careful. The measures of a physical quantity differ from each other when
taken in various representations, nevertheless these measures refíect an
objective quantity.
Let us e.g. consider the propagation tensor g. In a representation K
we have
K(g) = g(x),
g(x) has for any value of x ten independent elements; applying a coordinate
transformation
x' = f(x)
V = 0 Grad C = 0. (7)
We show presently that for any distribution g representations can be found
satisfying (7).
407. Indeed, consider a representation K where g(x) is given by (6) so
that (5) is not necessarily fulfilled. Changing to a new representation K'
by a reversible transformation (4) we may write
S(X): (9)
where
b,= (10a)
dxk 8xk
and
df,<x) (10b)
a, = dt
Inserting (9) into (8) supposing
ÍG' 0
constant (11)
we find
G =PG'P-(bob)c 2
(12a)
C = ac
2 2 2
-aG'a. (12c)
G' = P _ 1
GP~ 1
+ (bP obP )c.
_ 1 _ 1
(13)
a = a(P, b, c, g),
(15)
a = a(P, b, c, g).
then we can determine a and a for t = 0 thus we can determine the time
derivatives of the components of f(x) for t = 0.
Knowing the time derivatives of f(x) for all values of r at t = 0 we can
determine step by step f(x) for any later time t > 0 — or going back-
wards in time we can also determine f(x) for any time t < 0.
408. Thus prescribing an initial condition of the form (16) we can indeed
construct a representation K' so that
^'(9) = g'(x ) /
and V' = 0, C = c.
The system K' can thus be regarded as a representation relative to which
the ether appears to be at rest. The system AT'(g) is, however, not determined
uniquely.
In the homogeneous case we know that in the systems of inertia which
move with translational velocities relative to each other it is possible to
introduce coordinates such that light appears to be propagated isotropi-
cally in terms of those measures. In the inhomogeneous case we find a similar
ambiguity. Indeed, considering two representations K and K' so that
V = V = 0, C = C c,
we find with the help of relations (10) for the elements of a transformation
which connects two normál representations K and K'
G' = P - ( G + b o b c ) P
1 2 1
a = (1 - b(G + (bo b ) ^ ) - ! * ) - / 1 2 1 2
(17)
a = c ab(G + b o b c ) - P .
2 2 1
We can choose P and b for t = 0 arbitrarily and find their values for t # 0
remembering the relations and integrating step by step.
Since — c a gives the velocity of a point r' = constant relative to one
2
1. STATIONARY REPRESENTATIONS
^ = o. os)
^ = 0. (19)
Since
(19) and (20) together impose ten conditions upon the four components
of the function f(x). The latter system is in generál overdetermined and does
not admit of solutions.
In the exceptional cases where g(x) is such that the systems (19) and (20)
can be solved we must suppose that it is not a mere accident that the
mathematically overdetermined system admits of solutions. In the latter
case we have to suppose that the fact that g admits of representation obey-
ing (19) is of physical significance. One may suppose that configurations
which possess representations satisfying (19) may be taken to be stationary
configurations. We may thus suppose that the ether is in a stationary
state if g admits of representations g'(x') where all its elements are inde-
pendent of t'.
Similarly we may consider physically significant other symmetries or
particular properties of which appear in somé representation g(x) of g
provided these symmetries or restrictions cannot be produced by a suitable
choice of representation of an arbitrary g(x).
(2)
More precisely we may suppose that there exist representations of T =
= K(%) such that (23) is satisfied for them.
Although (23) is not of an invariant form it nevertheless describes a
significant feature of the distribution of matter. Indeed, if we start from
(2)
an arbitrary representation T(x) of % then, in generál, it is impossible
(2)
to find a representation T'(x') of % obeying (23). If we nevertheless find
that applying a suitable coordinate transformation to the originál repre-
(2)
sentation, a representation T'(x') of Z can be found obeying (23), then this
proves that the distribution 2 has particular properties. We may suppose
that matter characterized by í£ is free of inner stress. If we find a representa-
tion which apart from (23) satisfies also
g ki = 0 k = 1, 2, 3
then we may conclude that the matter thus represented is free of stress
and flows together with the ether.
C. COSMOLOGICAL PROBLEMS
411. In the present section we give a very brief account of somé cos-
mological aspects of the theory of relativity. Cosmological theories try to
give an account of the observed features of the distribution of matter in
the universe in terms of distributions in accord with Einstein's gravitational
equation or of somé generalization of the latter equation.
Av
— =-kr, (24)
v
where Av is the difference of frequency of a spectral line when observed
in the laboratory, respectively when observed in the spectrum of a galaxy
at a distance from us.
k « 10 _ 2 7
cm _ 1
x 10" parsec
8 _1
is the Hubble constant; its numerical value can be determined only with
great uncertainty, as the measure r of the distances of nebulae can only
be estimated by rather indirect methods.
Supposing that the Hubble effect is caused by Doppler effect, i.e. suppos-
ing that the distant nebulae are receding with great velocities, we can also
write in place of (24)
v = ckr (25)
where v is the velocity of recession of objects at distances r.
There exists also somé astronomical evidence upon electromagnetic
radiation inside the universe; we do not discuss this complex of problems
here.
2. THE SOLUTION OF F R I E D M A N N
g(*)=L
0 -C 2
(27)
It is supposed that
<2' 3 eA
2
From the gravitational equations (26) it follows with the help of (23)
(2)
R ki =0 k= 1,2,3 (31a)
(2) 1
R it = - — xu + k. (31b)
We find thus
ö \nA2
= 0
n
dx dt k
and therefore
A(x) = a(0/J(r).
Without restricting the form of g(x) further we can include the factor
/?(r) into the mátrix B(r) and thus we may suppose
A(x) = ^(í). (32)
So as to determine A(i) explicitly, we note that
(2)
DivT = 0
and therefore according to (23) and eq. (37) 476 of Appendix II we find
.-£-
c
with c - ( - d e , r - " ' 'A ' " ' g
C C B
(33)
and
dt
Inserting (33) into (31b) and remembering that according to (30b) and (32)
(2)
R i4 depends only upon t, we find
3 dA _
2
p
+ X (34)
A dt ~ 2
A 3
where • / ? = 0.
1 dA
Multiplying (32) with — A —- we obtain a totál differential. Integrating
6 dt
we find
IdA'Y -p-KA + XA ^ 3
dt j 3A
where we have written K for the constant of integration.
Inserting (35) we obtain explicit forms of A. In particular putting K = 0
and X = 0 we find
A(t) = A (t+t ) '0 0
2 3
(36)
thus A\t) increases with time.
We see that according to the initial conditions and the values of K and
X we obtain solutions with increasing A(f) or decreasing A(t). Equation
(35) admits of a singular solution
A = A 0 with - / } - KA + Al = 0.
0
417. Firstly we note that, on taking the Friedmann solution for g(x), the
(3)
components of <2, where two or three suffixes are equal to 4 vanish. There
fore we find from the equations of motion that a partiele which is at rest
relative to the system of reference at t = 0 remains at rest; indeed it fol
lows from the equation of motion
x(p) = 0 if r(p) = 0
therefore (37)
r(p) = r = constant
is a solution of the equations of motions.
We conclude therefore that the matter under influence of its own gravi
tational action may float together with the ether.
418. The journey of a light signal set from a point P with fixed coordinate
x
takes a time 5ty. If the signal is refiected back from P to P the time of to
x 2
The points P and P may be regarded as two masses floating freely. Thus
x 2
419. If we suppose for the moment that the physical structure of a solid
is such that the time of fiight of a return signal between two fixed points of
the solid does not change in time then we can interpret the results of the
previous paragraph by supposing that the measure of the distance between
points P and P which float freely increases in time, when the measure ofthe
1 2
F(x) = x + - C x + . . . 2
From (27) and (28) it follows, however, that the non-vanishing components
(3)
of C in a fixed point x = r , t are 0 0 0
(3) (3)
Since C dt = 0 we find
444 0 0
444
dt'
, = 1 + small terms. (40)
dt
Since the rate of a physical clock must be taken to be constant (apart
from small terms) in an almost straight representation, we expect a clock
to have a nearly constant rate in measures of /' and therefore according
to (40) it must be taken to be nearly constant in terms of the originál meas-
ures t also.
Thus measuring the return times of light signals between the points P X
and P we find the point P to recede if we measure the return time with
2 2
17 x
17 x
+
\ W'oWoÜi' - *°)
wé see thus that the measure of the distance between two points which have
fixed coordinates relative to K increases when taken in the measures of K'.
However, since K' is an almost straight system of reference, we expect
the measures of a solid to remain almost constant when taken relative
to K'. We see thus that the distance between two points P and P at restx 2
b. DOPPLE R E FFE CT
where the times are taken in the measures corresponding to the point Pi
with coordinate vector
The signals arrive in the point P with coordinate vector r = T + <5r
2 2 L
in times
t + n/v + A\t„)
0 (SrBSrp'/c
+ terms of higher order
thus we find
2
n+l /* + -(!+ lAXQAiQ^^lc )
c
Av V* — V.
The radiation of the atom situated in i \ arrives in P with a frequency
2
v* < v. The above result describes the red shift of Nebulae. We see thus
that the configuration obtained from Friedmann's solution of the equa-
tions of motion leads quaütatively to the Hubble effect.
422. The argument can be summarized by stating that the Friedmann
solution gives such distribution of g(x) that the coordinates describing
the vicinity of a fixed four-point x give almost straight representations.
0
x = x + §
0
E. MACH'S PRINCIPLE
* H. Thirring: Phys. ZS., 19, 33, 1918. (See also somé minor corrections: Phys.
ZS., 22, 29, 1921.)
** J. Lense and H. Thirring: Phys. ZS., 19, 156, 1918.
If we replace Tki by, say, and suppose that at t = 0
g*(r,0) = g(r)
then we can obtain g*(x) for / > 0 from Einstein's gravitational equation
just as discussed in 393. We note that at * = 0 though g = g* because
of the change of T we expect the second time derivatives &* and also /44
the first time derivatives g£ to differ from zero. Because of the latter
u
(3) (3)
we find that — even at t = 0 — the C* differ from the C and thus the
equations of motion in the starred configurations differ from those in the
originál configurations.
The latter difference is connected with the Thirring effect. The difference
between our remark here and the quoted calculations of Thirring is that
in the latter stationary solutions of the gravitational equations were given,
which can be found in the inside or the vicinity of a rotating sphere. These
solutions were compared with those valid in the vicinity of the sphere free
of rotation.
We consider here — qualitatively — the dynamic of the process; i.e.
we discuss the effect upon the surrounding, if a massive body is being put
into rotation. It would be of interest to investigate whether as the result
of rotation of the massive body the distribution of g(x) changes indeed
into g*(x). As far as we know this question has not been investigated.
APPENDIX I
TENSOR ANALYSIS
IN HOMOGENEOUS REGIONS
427. In this book we have used notations and conventions which deviate
from the usual ones. We did this in order to simplify mathematical expres-
sions but also we tried to find notations and conventions which are partic-
ularly adequate to express the physical concepts upon which we build up
the theory.
A. SYSTEMS OF REFERENCE
3. PROPAGATION TENSOR 0
431. The mátrix g with the help of which we can express orbits of light
signals may be called the propagation mátrix. Having systems of references
K, K',.. . they can be characterized by the matrices g, g',. . . etc. express-
ing the propagation of light in their respective measures. We may write
q for the propagation mátrix and
K(Q) = g, tf'(8) = g',. . . . (5a)
In particular for a Lorentz system of reference K for various straight
0
representations we have
*o(8) = r . (5b)
432. Consider two straight systems of references K and K' with coordinate
measures x and x' connected by the reversible transformation
x' = Sx + s. (6)
and thus if we require that the relations (a) and (b) given in (7) should
follow mutually one from the other then we have to suppose
g = ÖSg'S 0 > 0.
4. LORENTZ TRANSFORMATION
x' = Mx + u,
is a Lorentz transformation provided M obeys
MgM = g. (10)
Matrices obeying (10) will be denoted Lorentz matrices. If K and K' are
both Lorentz systems of reference then we have in place of (10)
ATA = T (11)
2 0 = O*.
x* = Mx* + (A
where the M are Lorentz matrices. We note that both x and x* are measures
taken relative to the same system of reference K. We can also write
tf(S) = M,u.
i.e. the deformation 2 is represented by a Lorentz mátrix M and a constant
four-vector u.
435. For many purposes it is convenient to split space and time com-
ponents of g. We use the notations
G V
8 ~ l<
V7 -c /"2
c = y j - det g = J { C 2
+ VG- V)detG x
where —c\ — T . 44
g - V = Sg^a; (20)
á = g a _ 1
and á' = g ^ a '
1. TWO-DIMENSIONAL TENSORS
it follows that
ag-'b = a Y - V (22)
where a, b and a', b' are the representations of vectors a, b relative to K
and K'.
We shall use the notation
a b = ag- b; 1
(22a)
the latter quantity is in accord with (22) a scalar and it will be denoted
the scalar product of a and b.
442. Similarly we can form the invariant product between a vector and
a tensor. We shall write
a-A = a g A = a
_ 1
(23a)
and also
A-a = A g " a = p .x
(23b)
We see immediately that the quantity a and also p defined by (23a, b) are
vectors.
Finally we denote the invariant product between two tensors
A - B = Ag" B = C. 1
(24)
aob = A.
a x b = » a o b — boa;
d = a/det S. (26)
measure of the velocity of light in the ordinary sense; the relation (27) is
a reasonable definition valid in any straight system of reference.
The sign of c can be fixed assuming that in a system of reference K we
have c > 0 provided the space part of K is a right-handed system and the
time coordinate is chosen so as to make the measure of time to increase
with passing time.
Using the definition (27) c is a pseudo scalar and any pseudo scalar can
be written
a = ac,
where a is a scalar.
C. FIEL DS
a(x) = *(a).
thus a(x) depends on x in a way different from how a'(x') depends on x'.
However, in a fixed fourpoint j all representations of a have the same
numerical value.
447. Similarly a vector or a tensor field can be defined as follows. Let
us write
#(21) = A(x)
X'(9l) = A'(x')
with
A'(x') = S ^ x ) ;
we can also write
K{Z) = T(x)
where T is a twodimensional mátrix. The transformation between represen
tations can be written as
T(x) = ST'(x')S
and also
T(x) = ST(Sx + s)S.
1. THE 31 OPE RATOR
=•=
oxy dx 2 cx 3 öXi
*'(3Í) = • ' = - £ - , - / T , i -
tfJC 2 0*3 ÖX 4
x' = Sx + s
1 x ~ -
•ű(x) = SD'a'(x').
Thus putting
• a ( x ) = Grad a(x)
A. MORE-DIMENSIONAL MEASURES
1. Ar-DIMENSIONAL MEASURES
#(21) = A
(*)
where A has elements
• (*n (*>
' J v , . . . ^ = ^.',.,...,
A
t v v . . .v = 1,2,3,4.
a 2 fc
(*> O
452. We define the direct product of two matrices A and B as
(*) (/) (/c+o
AoB = C
so that
(fc + 0 (fc) (/)
^"l'f-'k
=
AriVt...Vk -0)1,(1,.. .(!(• (1)
Another type of product can be defined so that we carry out summations
of a pair of adjacent suffixes, thus we shall write
cf( + l) Í/ + 1) ik + l)
A B = C
' here
(*+/) 4 0 + 1)
(i,/i ...fi;
t
=
X A ,^ Vi t Vka B afítfíi w . (2)
o- = l
We shall also use summation over more than one pair of suffixes. Thus we
write 11 A Bj]=C
where the double bracket indicates double summation and in terms of ele-
ments we have
<* + /) + 0 + 2)
<T,t
{A B J = C
where the sign (n) over the bracket indicates summation over « pairs of
suffixes. Explicitly we have
^
V *
-
i
'.
(k+l)
tíliL2...M =
'k+n)
X •Á-r r,...Pi ir «,...a
1 c l
0 + ")
H ^a ...a,<'
n ü
f
W
-
i.i
l -
(
O
O
V/
.I-
.
It is also useful to introduce invariant products in the following way
(fc+i) (/+i) (fc+i) (í+i)
A • B = A g" B 1
thus the fat dot • denotes that a factor g has to be taken between the - 1
Similarly, we introduce
Hk + m) </ + m)\(m) (k + l)
| A • B J = C ;
the above expression stands short for
(* + /) (fc + m) (l + m)
^IV,...! 1 * F»,JI,...|I/~ X A
'i-»'k "i..-am8om*m " " '£<VI B
r,...z m HI.-.FI;
a,...om
TI...Tm
11
(í(.k + 2) YV (fc)
R g" ]] = R 1
(3)
or explicitly
(AT) (k+2)
M FACTORS
B. PERMUTATION OPERATORS
thus the operator P produces from A another mátrix B. The above opera-
tion written down explicitly in terms of elements is
( (*)") (*)
and
P(V!V . . . v ) = uyfr. . .p.
2 fc k
by the symbol P.
454. The product of two operators P and Q can be taken as a new opera-
tor, i.e.
PQ = R. (4)
R is defined so that provided
(fc) (*) (*) (*)
QA = B, PB = C, (5)
then by definition of R
(*) (*)
RA = C. (6)
We can also write
(8)
í (*n <*) w
If we choose
P(Hi lh •••Hk) = V! v . . . 2 vk (9)
then we can also write in place of (8)
(*) (*) (*)
thus the associative law cannot be taken to be valid when applying per-
mutation operators to suffixes of more-dimensional quantities, as seen from
the above argument.
1. CYCLIC PERMUTATIONS
step to the right, with the exception of the last suffix, which moves to the
first place, thus
c-*(viv... ) = v 2 Vk k . . . v_ k v
The powers of c are operators c' which produce shifts of / steps to the right.
k k
In particular
4=i.
We can also introduce negatíve powers, i.e. the operator c ' shifts the k
ik + t) (fc + 1) (í+1)
^>»lf-»t f*l(l/.»W =
a
X A, ,. tV ..,...Vk B j......>
( tfl f (t/
where a stands between, say, v and v in the first factor and between u,
s J + 1
similarly
thus we have
(/+1)
• (*I...(t|ff(l/
D
+
k+
+tt í <,k + i)\( y + m
0 if / is even
(3)
Applying e.g. the operator n to a three-dimensional mátrix g we find
3
8 8 8 8
OX-L 8x 2 8x3 8x t
A(X)OQ= A(x)
In the following we shall, for the sake of brevity, omit the variable x.
460. Sometimes we have to apply the 9í operator to a product of matrices.
It is useful to make use of the following formuláé, which can be verified
easily with the help of the definitions given above
(A B ) o• = ( A B ) + c {[c ^k+l+l k m +l A J B ) .
Further, if we apply the 9Í operator to an invariant product of matrices,
then we have
rtk) CM (/c) (/ + 1) ff (ík + l) (*) (3)\1 (0|
[ A - B J c . Q = A - B + c ,_ k+ 1 iLct+i { A - A - g J J - B j (17)
where
(3)
g(x) = g(x) o g.
(«
We shall apply the 9í operator to the quantities R introduced in 452
eq. (3) so we have
((lk + 2) X\ ((( f(fc + 3) (ft + 2) (3)1) ~i\
Mi —fje
where the S are the elements of the mátrix S giving the coordinate trans-
VJ1
formation
x' = Sx + s,
(fc+D (/+»
is not an invariant one; even if A and B represent tensor quantities,
(fc+/)
C defined by (2) is not a tensor. Furthermore one verifies easily that the
invariant product of two tensors is a tensor and the process, as defined
(fc + 2) (fc)
by (3), produces from a tensor R another tensor R.
2. SYMMETRY PROPERTIES
PA = B (20)
if valid in one representation K, then it is also valid in any other representa-
tion K'. We can thus write in place of (20) also
P9t = 33.
If a tensor is such that for a given permutation P we have
i>2I = 2l
then we can take that 21 is symmetric with respect to P. We may also
write
P s l mod 21
(4)
3. THE ANTISYMMETRIC TENSOR e
(4)
467. Consider a four-dimensional mátrix e with elements
0 Bx cE -E 0 Ex -cB
F =
2
F = • (24)
2
B -Bi 0 cE 3 E -Ex
2 0 -cB
V cBx cB
2 3
D. TENSOR FIELDS
(fc)
S(x) = f(x)oQ
(t)
just in the manner as S is built of S in 462 eq. (19).
470. We may also introduce a quantity
(fc) ni) (fc) y<o
A(x) = [S(x)A'(x')j
and we can say that 91 is a tensor field if we find that
tt) (*)
A(x) = A(x).
(*)
We shall apply the 92 operator to the quantities A so we have
c
*4
U A Js-'s].
A - A = l A - A J o n + E^'IUAJS^S}. (27)
(fc) (fc) (fc)
thus we have
(3) (3) í (3>-\
g - g = (I + c - ) l g S - S J . 3
1 1
(29)
Applying the operator 713/2 to both sides of (29) we find with the help
of (15)
(3) (3) (3)
gS- S = C - C
1
(30)
with
(3) 1 (3) (3) 1 0)
C = — n g and C = y
From (31) the reversed relation
7cg
. 3 3 (31)
(3) (3)
g - O + c ^ C (32)
can be obtained.
In place of (30) we can also write
From the definition (31) it follows that the Christoffel symbols are sym-
metric with respect to the second and third suffixes. One finds
(3) (3)
(23)C = C .
The Christoffel symbols represent three-dimensional field quantities — but
these fields are not of tensor character as can be seen.
A - A = I^-'LUAJS^SJ.
(3) • (••••••• (3) (3)
We can express S S in terms of the Christoffel symbols C and C. Writ-
- 1
ing the quantities without bar on the left and with bar on the right, we
obtain thus the following relation
(fc + 1) k [f (k)\ (3)1 (fc + 1) * [Y (fc)\ Öl]
A - Ic -'LUAJ-Cj= A - X^'LUAJ-CJ.
k
(*)
In the case we apply the Grad operator to the quantities R introduced in
4 5 2 eq. (3) we have
476, We can define also the covariant form of the Div operator. We
may write
DivA^ttJGradAJg- )). 1
(37)
with
S(x) = f ( x ) c . g . (44)
(s) (?)
From (41) it follows that C'(x') = 0 and therefore also C(x) = 0, and we
may write using relation (33)
(3) (3)
S(x) = S(x)-C(x). (45)
Relation (45) is a differential equation with the help of which S(x) and
also f(x) can be determined if the region 9í is homogeneous indeed.
Whether or not (45) possesses solutions can be ascertained by differentiat-
ing (45) into x. We write
(3) (4)
S o g = S
where we omit to write down the variable x explicitly. We find with the
help of (17)
(4) (4) r cm W\ (s)\
S = S-C-t-c (c - lS-S-gj-Cj,
4 s
1
(3) (3)
Inserting S from (45) and g from (32) we find
(4)
(4) (4) r / (3)) (3)1
S = S-C-C4 cí l - 3~ CJ-Cj.
L
1 s c 1
(3)
Using the symmetry of C we can simplify the second term and find as the
result of a short calculation
S = S-ÍC-(24)(C-CJJ. (46)
(4)
If S is to be the third derivative of the transformation function f(x), then
it has to be symmetric in the last three suffixes; we expect thus e.g.
(4)
( 1 - ( 2 4 ) ) S = 0. (47)
(4)
Since it follows from (45) that S is automatícally symmetric in (23) it
follows also that provided (47) is fulfilled it is also symmetric in (34) since
we have
(34) = (23X24X23).
(4)
Thus provided (47) is fulfilled, then S is symmetric in the three last
suffixes. Inserting (46) into (47) we find
(W (3) (3))
( 1 - ( 2 4 ) ) | C + C - C | = 0.
We can also write
(4) 1 (4) 1 (*)
(1 - (24)) C = - (1 - (24))** = - 2 * 4 * .
(4) (4)
The latter is symmetric in all the three suffixes. Since < S > is equal to S
(4)
in a homogeneous region, we can take <S> to define an approximate
solution of (43) and (44).
The difference between the two solutions can be written
(4) (4) (4) 1 (4)
ŐS = <S> S = j [(1 - (24)) + (1 (34))] S ,
however
(the integration can be taken on any path from 0 to x); the above trans
formation function satisfies thus approximately (43) and (44) in the vicinity
of x = 0.
480. We may write
W ) = 0. (53)
Differentiating twice with the help of (51) and (53) we find for x = 0
( U)\ (?)
1
( l + (12))lgS ŐSJ = ő g ,
Applying • to the relation (30) one finds with the help of (16) and (54)
With the help of (30) and (32) one can rewrite the second term on the right-
hand side and find
fí(3) (3V| ((3) a>\\
c {---} = c (12)|lc +
1 4 CJ-lc-Cjj.
(3)
As the C are symmetric in (23) the expression in the { } is symmetric in (34)
and one can write for the operator before the bracket
We have thus
2 + (! ~ (24))lC-cJ = y n a + (1 - (24))C-C,
or using the notation (48)
(4) (4)
R=R
(4)
thus in accord with 463 R is a tensor.
(4)
3 . SYMMETRIES OF THE R TENSOR
(T) (3)
mod C-C
and we find in place of (48) also
(4) 1 /tt) (3) (3A
R = - ( 1 - ( 2 4 ) ) ( 1 - ( 1 3 ) ) U + C-Cj (62)
or
(4) ] ((4) (3) (3)\
R = y JTU + C-Cj.
4 (63)
(62) and (63) are, of course, identical with (48). From the above relations the
(4)
symmetries of R can be obtained in a simple manner. From (62) we see
immediately that
(4) (4) (4)
(24)R = (13)R = - R (64a)
so
(4) (4)
(13)(24)R = R. (64b)
Further, remembering that according to the definitions
(4) (4)
Q7t = -7i
4 4 (1234)R = - R . (64c)
Multiplying (63) from the left by (12)(34)(1234) = (13), and remembering
(64a) and (64c) we find
(4) (4)
(12)(34)R = R, (64d)
thus
(4)
v # x, it # L (66)
There exist 21 configurations of suffixes obeying (66) and there exist 21 types
of components, which do not vanish identically. Indeed, there exist six pairs
of suffixes v T 6 K and fifteen quadruplets obeying (66) with v # ii and six
with v = n , where we count those configurations only once which give
the same absolute value for the element R . We can thus arrange the ele-
VílxX
(4)
ments of R into twenty-one groups, each group containing (not identically
vanishing) elements with equal absolute values.
Relation (65) gives a linear connection between three different elements
(4)
of R. Because of (64a) this relation will be fulfilled identically for elements
(4)
of R expecting those with four different suffixes. For the latter elements
(65) gives the relation
(4) (4) (4)
•^1234 + ^3124 + -^2314 =
0>
which is independent of (64a). The elements occurring in this relation con-
nect three out of the twenty-one groups of elements discussed above. We
see thus that out of the 21 groups given above only 20 contain independent
elements. Thus the Riemann-Christoffel tensor possesses altogether 20 non-
trivial independent elements.
(4)
4. THE R E D U C E D FORM OF THE R TENSOR
(4)
484. The Riemann-Christoffel tensor R can be contracted and thus
we obtain the following tensor quantities
R = ((Rg-i)) = ( ( g - S ) ) . (67)
(2)
The contraction leads to the same tensor R whether we multiply from the
left or from the right by g . Further we have
- 1
J ? = ( ( g - R ) ) = (g(- )R)
1 ) 2 (4>
where we write g = g og .
( _ 2 ) - 1 _ 1
We can reduce (69) to a simple form. With the help of a simple considera-
tion one finds that
(4) i (5)
and also
/(2) i
DivlR-yg* =0.
1 B2/40 00001492
Fiz. Kvt.
This analysis shows, among others,
the experimentál significance of
the Michelson-Morley experi-
ment and why it was necessary to
use the well-known experimentál
arrangement in the form it was
actually used.
The author has written many arti-
cles and gave many lectures both
in Hungary and abroad elaborat-
ing his views which are deviating
in certain respects from those
usually put forward.
The concepts and modes of treat-
ment have given rise to discussions
and aroused great interest. With the
new concepts the author's aim is
to induce constructive discussion
but at the same time his book may
sérve as a handbook for physicists,
scientists and philosophers inter-
ested in the problems. It is an
interesting reading and it could
also be used as a university text-
book.
AKADÉMIAI KIADÓ
Publishing House of the
Hungárián Academy of Sciences
Budapest
Distributors: K U L T Ú R A
Budapest 62. P.O.B. 149
L. Jánossy
THEORY OF RELATIVITY
BASED O N PHYSICAL
REALITY