Академический Документы
Профессиональный Документы
Культура Документы
= h (2.1)
(van der Waerden, 1967, p5)
Bohr now remarks that it has been possible to account for the phenomenon of radiation coming
from the atom by classical electrodynamics, but only in a special limiting case of slow vibrations.
This is the old theory. A new theory which considers the transition between two stationary
states as quantum theory tells us, should however account for the radiation without being a
limiting case. Therefore Bohr writes that we may expect that any theory capable of describing
this phenomenon in accordance with observation will form some sort of natural generalisation of
the ordinary theory of radiation (Bohr, 1918 in van der Waerden, p. 6). Bohr comes up with some
kind of correspondence between the ordinary theory of radiation and a new quantum theory
of radiation. The ordinary theory is only capable to describe the radiation of the hydrogen atom
correctly in a limiting case, a new theory which could describe the radiation in the full range of
observation should than be a natural generalisation of the ordinary theory. In his 1918 paper
Bohr species this:
[T]he conditions which will be used to determine the values of the energy in the sta-
tionary states are of such type that the frequencies calculated by [equation [2.1]], in
the limit where the motions in successive stationary states dier very little from each
other, will tend to coincide with the frequencies to be expected in the ordinary theory
of radiation from the motion of the system in the stationary states. (Bohr, 1918 in
van der Waerden, 1967, p.6)
So what does this mean? Bohr states here that the frequency of the radiation from an electron in
orbit calculated with electrodynamics (classically) corresponds with the frequency of the radiation
coming from a transition between stationary states of an atom (quantum-theoretically)! This is
only valid in the limit where the dierence between the classical orbits of successive stationary
states is very small.
Bohr quanties his principle with the use of Fourier series. He uses the series to describe the
classical orbits of the electron. From this and further considerations Bohr gives a condition for
this transition from stationary state n to a neighbouring state n
quantum
(n, n ) =
classic
(n, )
3
(2.2)
On the one side we have the frequency from the radiation which is due to a transition between
stationary states, from n to n , thus depends on both variables. And on the other side we
have the dependence on the th Fourier component of the frequency of the radiation due to an
electron in a orbit.
This is the Principle of Correspondence
4
and it eventually comes to drive most of the research
on quantum theory in the years to follow.
2.1.4 1919-1925
As shown above, the old quantum theory was characterized by rather ad hoc conditions which
should be satised by quantum theory. Planck used an innovative hypothesis, but Bohr also
came up with assumptions that should govern the hydrogen atom, of which his idea of stationary
orbits is the most noticeable. The research following the work of these physicists is characterized
by a same attitude towards this new born quantum theory. With classical mechanics as their
foundation, they were slowly creating the building blocks of a complete new theory. Eventually
all the work and research on quantum theory combined resulted in a large patchwork of ideas,
hypotheses, principles, theorems and computational rules one could use to make the world turn
quantum. In short; Quantum theory anterior to 1925 was a matter of translating. Classical
solutions of quantum systems could be put into this inscrutable machine which implemented the
right conditions to create the good quantum-theoretical solutions. The main tool used for this
translation was Bohrs correspondence principle. One could even say that at that time quantum
theory may be described as systematic guessing, guided by the Principle of Correspondence(van
der Waerden, 1967,p8).
For realising the signicance of this papers subject, it is important to understand the situation
of quantum theory prior to 1925. It allows you to see the steps which are taken to create a fully
edged scientic theory more clearly. The point Heisenberg and Schrodinger nd themselves is
precisely the point a genuine theory is created and scientists try to pin down the meaning of a
new concept.
I would like to end this short overview with last remark which should be made. The history of
the origin of quantum theory, of course, is much more elaborate than pictured above. However, a
full review is far beyond the scope of this article. It is only to indicate the process which involves
the formation of such a theory and a general introduction into the physics of that time. Also it
gives a good indication of the atmosphere there was in the eld of physics which was present in
the formulation of a quantum mechanics.
5
To describe this formulation we rst need to go into one
subject more elaborately for it is used by Heisenberg in the development of his formalism.
2.2 Old Quantum Theory: Dispersion Theory
Now some results of research on dispersion theory will be considered to get familiarized with
Heisenbergs mathematical background, without going deeper into the actual calculations.
We start with looking at the polarization of an atom caused by a vibrating electrical eld.
Classically there is a simple relation between the polarization and the eld:
P =
E (2.3)
Where is empirically well determined. Note that the polarization becomes high when the
frequency of the vibrating eld is near to the absorption frequency
i
of the atom.
The factor can also be derived by classical theory on the assumption that the atom contains
a certain number of oscillators, whose frequencies are equal to the absorption frequencies
i
(van
3
Valid for large n and small .
4
Korrespondenzprinzip. It would however not get this fancy name until Bohrs paper in 1920.
5
For a very elaborate overview of the chain of events leading to the formation of Quantum Mechanics I advise
you to read the nicely constructed The Conceptual Development of Quantum Mechanics(1966) by Max Jammer.
6
2 Background 2.2 Old Quantum Theory: Dispersion Theory
der Waerden, 1967, p.9). This idea was extended to the quantum-theoretical regime by Ladenburg
in 1921:. . . we may say that Ladenburg replaced the atom, as far its interaction with the radia-
tion eld is concerned, by a set of harmonic oscillators with frequencies equal to the absorption
frequencies
i
of the atom (idem, p.11). Oscillators which Bohr later called virtual harmonic
oscillators. Ladenburg used this in his attempt to connect the experimental results on dispersion
with quantum-theoretical considerations on the probability of transitions.
Using Bohrs correspondence principle, it was Hendrik Anthony Kramers who went beyond
Ladenburgs result to nd a quantum-theoretical expression for . In the context of the cor-
respondence principle this meant that in the region of large quantum numbers the interaction
between the atom and the radiation eld tends to coincide with that expected on classical the-
ory (idem, p.14). This consideration led to an expression which will not be discussed here. His
expression does result in a possible formal substitution, equivalent with that of Ladenburg:
The reaction of the atom against the incident radiation can (. . . ) formally be compared
with the action of a set of virtual harmonic oscillators inside the atom, conjugated with
the dierent possible transitions to other stationary states. (Kramers, 1967[1924],
p.179)
Born extrapolates the method used by Kramers to interactions between two mechanical sys-
tems, instead of between a radiation eld and an atom. Born uses the above idea and assumes
again that an atom in a stationary state n may be replaced, as far as the calculation of emission,
absorption and dispersion is concerned, by a set of virtual oscillators of frequencies
(n, n
) =
1
h
[E(n) E(n
)] . (2.4)
(Van der Waerden, 1967, p.15)
And now we have come across a formula which likely heavily inuenced Heisenbergs way of
thinking. This has to do with the fact that in the above set of virtual oscillators every possible
transition between stationary states is present. That is to say; in this set n and n
take every
possible value, so all of the transition frequencies of an atom are present in equation [2.4]. So
one can formally compare the atom with a set of virtual oscillators which take on every possible
transition frequency. Keep this in mind, for it will return in the next section.
Now for a transition between frequencies one can speak of a virtual resonator. Then, according
to the correspondence principle, every virtual resonator must correspond with the element of the
classically computed Fourier series of the motion in state n. If Born then again looks at this
correspondence he remarks that this formally means replacing a dierential with a dierence.
The classical frequency can be expressed as a dierential of the total energy
6
. And thus a new
correspondence rule is born to translate classical formulas into quantum-theoretical analogues:
(n)
n
(n) (n ) (2.5)
Where (n) can be any arbitrary function. This is an important correspondence which is exten-
sively used by Heisenberg.
Two important results thus appeared in this very short review of quantum-theoretical dis-
persion theory. First we saw that when calculating on mechanical processes of the atom, for
calculating purposes one could replace the system by a set of virtual harmonic oscillators. Sec-
ondly, to go from a classical process to a quantum-theoretical one must replace all the dierential
coecients with dierence quotients. Because Heisenberg collaborated with Kramers on a pa-
per regarding these subjects, this is knowledge important for the perpetuation of Heisenbergs
theoretical research.
6
This is a result of earlier work combined with Plancks E = h. See also section 3.2.
7
3 Heisenberg and Matrix Mechanics 3.0 Heisenberg and Matrix Mechanics
3 Heisenberg and Matrix Mechanics
3.1 Background
Werner Heisenberg was the rst to publish a paper in which a new type of mechanics was introduced
to coherently describe quantum phenomena. His paper was received the 29th of July in the German
Zeitschrift fur Physik. Also publishing work with Kramers on his dispersion theory, Heisenberg
places himself in a tradition with the likes of Bohr, Born, Kramers and Slater, whose papers laid
the foundation for Heisenbergs Matrix Mechanics.
Heisenberg started his search for the basis for a theoretical quantum mechanics with a philo-
sophical presupposition which guided his way through the process. The foremost important thing
according to Heisenberg was, to create a theory which is founded only upon observable entities:
. . . it seems more reasonable to try to establish a theoretical quantum mechanics, analogous to
classical mechanics, but in which only relations between observable quantities occur (Heisenberg,
1967[1925], p262). He emphasizes the the failure of the quantum-theoretical rules as a deviation
from classical mechanics, which is the step that lead to his urge to create a new quantum mechan-
ics from scratch, only using quantities which in principle are observable. Heisenberg thus avoids
quantities like the the orbit of an electron, which can not be observed directly. The observables
Heisenberg worked with are the frequency and amplitude of the radiation emitted by an electron
in orbit. This radiation was both described classically and quantum-theoretically, and both de-
scriptions corresponded to each other through Niels Bohrs correspondence principle, a tool we
saw was of inexhaustible heuristic value for the development of quantum mechanics. Using this
correspondence principle and the concept of a Fourier expansion, Heisenberg wants to inquire in
his paper . . . about the form higher order terms would assume in quantum theory(idem). He
refers to the higher order terms of the electromagnetic eld strengths induced by an electron mov-
ing in an orbit. Classically these terms are easy to calculate, but quantum-theoretically there was
at that time no way to compute this.
Without looking at the cause of the motion, Heisenberg thus engages himself in the kinematics
of quantum theory. To simplify his theoretical investigation in this region of kinematics, Heisenberg
poses a new question which covers his previous interest in higher order terms; If instead of a
classical quantity x(t) we have a quantum-theoretical quantity, what quantum-theoretical quantity
will appear in place of x(t)
2
? (idem, p263).
3.2 Calculation
3.2.1 Deduction
To look for a quantum-theoretical x(t)
2
we start with the observable quantum-theoretical quanti-
ties which are known from earlier work on quantum theory. Heisenberg rst looks how these known
quantities act in quantum phenomena. These quantities are the quantum-theoretical frequency
of the radiation emitted by a transition between stationary states of an atom and the classical
frequency of an electron in an orbit, as discussed above. Like Bohr and Born, Heisenberg looks
at both the classically and quantum-theoretically computed frequency of the radiation. How does
this quantity look from these dierent points of view?
First let us see what the frequency looks like an a classical context. In equation [2.2] we saw
the classical frequency has the following form.
classical
= (n, ) (3.1)
This is the th harmonic frequency (or the frequency of the th Fourier-element) of the fun-
damental frequency (n, 0) = (n). Classically it is possible to write this harmonic as a linear
combination of the fundamental frequency.
(n, ) = (n) (3.2)
8
3 Heisenberg and Matrix Mechanics 3.2 Calculation
This is quite clear. However there is another feature of the classical frequency which is not
immediately clear. It looks like this:
(n, ) = (n) =
h
dH
dn
(3.3)
Where H is the Hamiltonian or the total energy. This is a result I will not discuss, but it is
important to notice the dierential in the above equation.
Now take a look at how the frequency is constructed in a quantum context. We have already
seen how Bohr constructed the frequency, and Heisenberg makes great use of this result. In the
terminology used above we get:
(n, n ) =
1
h
[H(n) H(n )] (3.4)
The frequency depends, after the transformation to quantum mechanics, also on (n ). As
shown, this is according to Bohr the frequency of the radiation coming from a transition of an
electron from stationary state n to stationary state n. In retrospect, however, this could also be
the consequence of Borns correspondence rule treated in subsection 2.2, where for any arbitrary
function the dierential in a classical concept makes place for a dierence in quantum theory.
So if we compare quantum theory with classical theory, we are thus dealing with the following
analogue;
(n) (n, n ) (3.5)
To go from a classical quantity to a quantum-theoretically one only replaces a dependence on (n)
with a dependence on (n ). In quantum theory it is no longer possible to write a frequency
as the th overtone times the base frequency. The quantum-theoretical frequency always depends
on two variables which both are equally important. It is from this point on that we can work our
way to how one must manipulate quantum mechanic observables.
The relation we have for the quantum-theoretic frequency in equation [3.4], results in new
combination rules for this quantity. This is due to the terms of H which are able to cancel each
other out when adding the dierent frequencies.
(n, n ) + (n , n ) = (n, n ) (3.6)
These combination rules will appear to be very important in Heisenbergs reasoning towards a
mechanics of quantum theory. Also the amplitude, like the frequency, has a correspondence
between classical and quantum phenomena, and thus quantum-theoretical amplitudes are also
functions of the two values n and n. Now these features of both quantum-theoretical quantities
are dened, they can be applied in other cases. How could the amplitude and frequency help us
to construct a general quantum-theoretical quantity? Heisenberg looked at the Fourier-series as
representations of a quantity, which consist of exactly these two variables.
We introduce a new quantity x(t), which we assume can be written both classically and
quantum-theoretically. This we will do as Heisenberg suggests in his article, using a Fourier
expansion. Classically this will be just a normal Fourier expansion as still used all over modern
physics
7
;
x(n, t) =
=
A
(n)e
i(n)t
(3.7)
Here we have A
(n) and (n) which are, respectively, the amplitude (or Fourier-coecient) and
the angular frequency of the th Fourier component. So the expansion can be divided in into an
amplitude and a phase part. This is a normal quantity as it exists in classical theory.
7
This paper restricts itself in looking at the the Fourier sum, and will not cover the Fourier integral. This is due
to considerations of length. However, the procedure using Fourier integral can be done exactly analogous to the
procedure done in this paper with a Fourier sum. Which one to use obviously depends on whether the quantity
x(t) is periodic or not.
9
3 Heisenberg and Matrix Mechanics 3.2 Calculation
Is there a possible quantum-theoretic analogue? And if so, how is such an expansion con-
structed quantum-theoretically? In answering this question Heisenberg employs the analogue we
explicated in equation [3.5], but now we have to be careful. Replacing (n) with (n, n ), an
expansion as done above becomes nonsensical. Because both the frequency and the amplitude
depend on n and n, an analogous expansion would give us an expression without any physical
meaning. This is because classically you could say the representation for a quantity x(t) is a sum-
mation over all possible harmonics of the base frequency in the phase, with an amplitude which
also depends on this harmonic. With the introduction of (n, n )
8
this interpretation is no
longer possible, it can no longer be said that the phases of all possible frequencies are added up
since both n and n are evenly important in computing the frequency. Hence physically such
an quantum expansion does not mean anything like the classical Fourier-series. Disregarding this,
Heisenberg makes the following hypothesis as a quantum analogue to equation 3.7
9
;
x(n, t) =
=
A(n, n )e
i(n,n)t
(3.8)
It must be noted again that this quantum representation of x(t) is not a meaningful expression since
the two variables are equally weighted. But when computing x(t)
2
, Heisenberg really neglected
this fact, and thought this representation could just do the trick. This is an important conceptual
step.
It is almost cogent that Heisenberg here got inuenced by early dispersion theory. As we saw
in section 2.2 Born came up with a new way to describe the atom, one could formally replace
the atom with a set of virtual oscillators. This set consisted of oscillators that together possessed
all of the possible transition frequencies in an atom. This is exactly what Heisenberg does here!
Equation [3.8] is equivalent to a sum over a set of oscillators with amplitude A and frequency ,
just like what happened in equation [2.4]. Since (n, n ) is a transition frequency, summing
over means all these transitions come to pass. One could say that here instead of summing
over all types of waves with amplitudes and frequencies, we now sum over all possible transitions
with the accompanying frequencies. Heisenberg worked with Kramers on dispersion so it is most
likely that he got his inspiration for his quantum-theoretical x(t) from this work. The result: The
hypothetical expansion of x(t) is one of the founding elements of Heisenbergs Matrix Mechanics.
With this new expansion the tools are available to try to calculate x(t)
2
. Again, rst the
squared value of x(t) is computed classically. And this we will do again with a Fourier-expansion.
The Fourier-series of x(t)
2
gives us:
x(t)
2
=
=
_
=
A
(n)e
i(n)t
=
A
(n)e
i()(n)t
_
(3.9)
This requires some further explanation. The second term and the summation over the new variable
is due to the fact the Fourier transform of a product has a special feature. If a function can
be decomposed as the product of two square integrable functions (here x(t) and x(t)), then the
Fourier transform of the function is given by the convolution product of the respective Fourier
transforms. This is why the is introduced in the second term, with the new variable over
which is summed.
10
We can write the product [3.9] tidier when we bring terms together.
x(t)
2
=
=
A
(n)A
(n)e
i(n)t
e
i()(n)t
(3.10)
This way we notice that the power of e after multiplication drops out , and only depends on
. The phase having only one variable over which is summed gives rise to a beautiful coherent
8
(n, n ) =
1
(n, n )
9
As in 3.5 the amplitude can also be replaced in an analogous way:A(n) A(n, n )
10
F[p(t)q(t)]() = F[p(t)]() F[q(t)]() =
=
F[p(t)]()F[q(t)]( )
10
3 Heisenberg and Matrix Mechanics 3.2 Calculation
representation of x(t)
2
, since only the amplitude changed.
11
We can write this like Heisenberg
did, in the following way:
x(t)
2
=
=
B
(n)e
i(n)t
(3.11a)
with
B
(n)e
i(n)t
=
=
A
(n)A
(n)e
i(+())(n)t
(3.11b)
It is now possible to notice even better that x(t)
2
only diers from its square root in the amplitude
B, the phase again is a sum over all possible frequencies in a power of e. That the classical x(t)
2
Fourier expansion has a phase which eventually only depends on the the variable will turn out
to be an important fact when trying to create a quantum analogue of x(t)
2
.
The past transformation of x(t) was, within the classical frame of reference, a mathematical
manipulation which did not introduce anything new for physicists of that time. Heisenbergs big
step forward was to copy this same manipulation, and instead of the classical Fourier expansion of
x(t) he wanted to do it for his hypothetical quantum equivalent of this expansion, equation [3.8].
For the quantum-theoretical x(t)
2
Heisenberg suggested the following:
x(t)
2
=
=
_
=
A(n, n )e
i(n,n)t
=
A(n , n )e
i(n,n)t
_
=
=
A(n, n )A(n , n )e
i((n,n)+(n,n))t
(3.12)
Now take a good look at equation [3.9] again and notice the insertion of (n, n) for the classical
dependence on (n) and Fourier-element , but also the insertion (n , n ) for the classical
dependence on n and Fourier-element which came from the convolution product. Using the
combination relation of [3.6] we are able to again get a nice expression for x(t)
2
in which the phase
is the same as that of x(t). With the combination relation it is possible to rewrite in equation
[3.12], to get a phase factor which again only depends on .
(n, n ) + (n , n ) = (n, n ) (3.13)
And thus we get
x(t)
2
=
=
A(n, n )A(n , n )e
i(n,n)t
(3.14)
Which is a really nice expression. Its relation with the quantum-theoretical x(t) is strikingly
analogous to how the classical x(t)
2
and x(t) relate to each other.
3.2.2 Construction and Implications
Equation [3.14] is denitely not the ending point, this is only where Heisenbergs sets his story to
begin. But before we look at what follows on his treatment of quantum-theoretical observables the
expression above needs further explanation. Therefore I will now stop to reconstruct Heisenbergs
article and try to nd a justication for the steps Heisenberg took and look at what exactly he
did.
Without further explication on how he came up with exact form of the second sum over
in [3.12], Heisenberg regarded this relation as the new quantum-theoretical x(t)
2
. He only notes
that this equation ...is an almost necessary consequence of the frequency combination rules
(1967[1925], p265). This is rather strange since it is the crucial step that eventually will lead
11
Multiplying quantities with the same phase should result in a new quantity with again the same phase
11
3 Heisenberg and Matrix Mechanics 3.2 Calculation
to Heisenbergs famous multiplication rule. The rst part of equation [3.12] is clear, here he did
exactly the same as when computing the quantum theoretical x(t). However, the construction of
the second part is not directly visible. So then what could be the reason of this exact substitution
Heisenberg performed?
A
(n)e
i()(n)t
?
=A(n , n )e
i(n,n)t
(3.15)
A short explanation is necessary. Looking at the substitution performed in exactly the same
line as Heisenberg determined x(t) quantum-theoretically from the classical representation, using
equation 3.5, we would also try to compute x(t)
2
quantum-theoretically. But instead of only we
now also have the element . We then get the following transition from classical to quantum
theory:
A
(n)e
i()(n)t
=A(n, n ( ))e
i(n,n())t
(3.16)
Applying the correspondence principle we did the substitution ( )(n) (n, n ( )).
But now, unlike in Heisenbergs procedure, we do not get a phase factor which corresponds with
the phase of the quantum-theoretically constructed x(t), which was the case in the classical x(t)
2
.
We could try to use the combination relations to transform (n, n( )) into something more
convenient, but this appears to be futile. From here there is no possible way to end with the
frequency (n , n ) which Heisenberg so mysteriously introduces.
When looking closer it is however possible to get exactly the relation Heisenberg called so
necessary. It is but a small step we have to do, to get an glimpse of Heisenbergs computation
of x(t)
2
. Starting with rewriting the left-hand side of equation [3.16]:
A
(n)e
i()(n)t
= A
(n)e
i((n)(n))t
(3.17)
Now from the correspondence principle [3.5] we can make the transition to quantum theory and
get:
(n) (n, n )
(n) (n, n )
Our quantum theoretical expression for the phase of the latter part of x(t)
2
, the second sum over
in [3.12] hence becomes:
e
i((n,n)(n,n))t
Following directly from the combination rules stated above, or the relation explicitly stated in
equation [3.13], we get this expression:
(n, n ) (n, n ) = (n , n ) (3.18)
For the phase we then get:
e
i()(n)t
e
i((n,n))t
(3.19)
So it appears that there is at least one justication possible for the expression of the frequency
in equation [3.14], which is a result of the multiplication of the two phases in [3.12] of which
the expression of one of the phases has been justied above. This gives an indication of how
Heisenberg could have come up with his quantum-theoretical x(t)
2
. However, the expression for
the amplitude cannot be justied in the same manner.
If we just extrapolate the result to the amplitudes and using this in the classical expression
for x(t)
2
of equation [3.9] we get a familiar expression. The combination relations can be applied
again to compute the quantum-theoretical x(t)
2
and we get:
x(t)
2
=
=
A(n, n )A(n , n )e
i(n,n)t
(3.20)
Which is exactly the expression Heisenberg states in his article, equation [3.14].
12
3 Heisenberg and Matrix Mechanics 3.3 Contextual
We now have a quantum-theoretical x(t)
2
which is not lacking physical meaning; Heisenberg
notes that the phases have just as a physical signicance as their classical analogues(Heisenberg,
1967[1925], p. 265). One can see this even better in the notation of Heisenberg:
x(t)
2
=
=
B(n, n )e
i(n,n)t
(3.21a)
with
B(n, n )e
i(n,n)t
=
=
A(n, n )A(n , n )e
i(n,n)
(3.21b)
So what can be said reecting on the equation we now have? We can conclude Heisenberg con-
structed a new type of higher order term x(t)
2
, which does not behave anything like classical
higher order terms. It is a new expression which is satised by every observable in the quantum
theory, it is a general form. With this expression, in this new theory, Heisenberg totally changed
what an observable represents in physics. It therefore also a beautiful event in the development
of scientic knowledge, where foundations are laid for a true formalism which can be applied at
all times. Quantum theory at this point was accompanied by a genuine new quantum mechanics.
Taking o from this theoretical hypothesis, it is possible to construct a new type of multiplica-
tion rule in which the amplitude is constructed in for quantum-theoretical observables. This can
be done for any x(t)
n
, but also in a more general sense with two dierent quantum-theoretical
quantities, p(t) and q(t). If we characterize p(t) with amplitude P and q(t) with amplitude Q we
can compute the representations of a simple product; p(t)q(t), which we will call R.
Classical:
R
(n) =
=
P
(n)Q
(n) (3.22)
Quantum-Theoretical:
R(n, n ) =
=
P(n, n )Q(n , n ) (3.23)
Where the last relation is Heisenbergs most (in)famous result. From this we can conclude that
quantum-theoretically, observable quantities do not behave as we are classically used of them. The
product between two observable quantities p(t)q(t) now is non-commutative!
p(t)q(t) = q(t)p(t)
or,
=
P(n, n )Q(n , n ) =
=
Q(n, n )P(n , n )
3.3 Contextual
In the theory of quantum mechanics formalized by Heisenberg a giant conceptual step has been
made. After many years passed where physicists constructed new relations, based up on a hy-
pothesis in which the property of continuous natural quantities was replaced by a system of
discrete values, Heisenberg introduced a radical new approach. An approach in which he deed
the classically determined laws of nature and opened an age with a new style of thinking. The
whole conceptual scheme of classical mechanics was thrown aside by his Quantum-Theoretical
Re-Interpretation, and older quantum conceptions were also disposed of. This departure from
old methods, we could say, was due to the specic style of Werner Heisenberg. His institutional
background, the people he worked with, his collaborations on early dispersion theory and even his
13
3 Heisenberg and Matrix Mechanics 3.3 Contextual
philosophical preferences
12
determine his perception of physics and its problems, and thus also
the way one should approach these problems. In section 5 these consideration will be elaborated
some more.
In his article Heisenberg takes his new kinematics of quantum theory and then turns to solv-
ing dynamical problems with his new mechanics. He applies his expression to the equations of
motion which can now be solved quantum-theoretically and nally gives an example regarding
an anharmonic oscillator. This wont be covered here since this only concerns application. After
the deduction of his multiplication rule Heisenberg also made other contributions to quantum
mechanics and worked on with the general rules he found and collaborated with other physicist
to nd solutions to other, more complicated problems. Before this, Max Born and mathematical
physicist Pascual Jordan worked out the ndings of Heisenberg even more elegantly. Most notice-
ably the multiplication rule following from Heisenbergs formalism they recognised as the same
rule on which matrix multiplication takes place. It is exactly what happens when you multiply
rows times columns. The next equation tries to exemplify this feature.
It is possible to dene two matrices to get the following relation:
_
_
P
1,1
P
1,2
. . . P
1,n
P
2,1
P
2,2
. . . P
2,n
.
.
.
.
.
.
.
.
.
.
.
.
P
n,1
P
n,2
. . . P
n,n
_
_
Q
1,1
Q
1,2
. . . Q
1,n
Q
2,1
Q
2,2
. . . Q
2,n
.
.
.
.
.
.
.
.
.
.
.
.
Q
n,1
Q
n,2
. . . Q
n,n
_
_
=
_
n
P
1,n
Q
n,1
n
P
1,n
Q
n,2
. . .
n
P
1,n
Q
n,n
n
P
2,n
Q
n,2
n
P
1,n
Q
n,2
. . .
n
P
2,n
Q
n,n
.
.
.
.
.
.
.
.
.
.
.
.
n
P
n,n
Q
n,1
n
P
n,n
Q
n,2
. . .
n
P
n,n
Q
n,n
_
_
(3.25)
In this equation is constant, and n is a variable which is chosen constant. The term on the nth
row and (n )th column of the nal matrix, we see, is represented by:
n
P
n,n
Q
n,n
=
P
n,n
Q
n,n
(3.26)
Which is a general expression for all of the elements in the matrix, it is the denition of matrix
multiplication! The multiplication of a n by n matrix times a n by n matrix. Notice
that the summation in the earlier expression deduced by Heisenberg sums from to , which
means the observables are represented by an innite dimensional matrix. The matrix which is
showed above then does not correctly represent the quantum observable, but it they gives a good
indication.
The multiplication of matrices is a rule which had already been formulated in 1855 by a the
French Mathematician Arthur Cayley, but never in the history of physics was this mathematical
tool used before. In a paper by Max Born and Pascual Jordan, which was the rst reaction
on the work of Heisenberg, they explicated this fact. Their paper, On Quantum Mechanics
developed Heisenbergs approach into a systematic theory of quantum mechanics with the aid of
the mathematical matrix methods (Born and Jordan, 1967[1925], p 277). It was from here big
steps were made towards a greater understanding of this new quantum mechanics. The further
development of this specic Matrix Mechanics carries with it the names of Paul Dirac and
Wolfgang Pauli.
But as mentioned before, Heisenberg was not the only one who started a project for a new
mechanics of quantum theory.
12
Heisenberg is said to be impressed by Kants Critique but also by Wittgensteins writings (Jammer, 1966, p.
198).
14
4 Schr odinger and Wave Mechanics 4.0 Schrodinger and Wave Mechanics
4 Schr odinger and Wave Mechanics
4.1 Background
Only a short time after the publication of Heisenbergs classical paper, Erwin Schrodinger also
submitted a paper in which he introduced a new way of looking at quantum theory
13
. In the
79th volume of the Annalen der Physik Schrodinger wrote a paper on the deduction of the energy
spectrum of the hydrogen atom, and in this paper Schrodinger used an idea which is totally
unrelated to Heisenbergs conception of quantum mechanics. Unlike Heisenberg, Schrodinger does
not stand in a tradition connected with the old quantum theory. His earlier works were on the
topics of kinetic theory of gases, statistical mechanics, elasticity and was more generally working
on the physics of continuous media (Jammer,1966, p. 255, 256).
Schr odinger was not part of the development of quantum theory as Heisenberg was, but it is
known that Schrodinger was inuenced by two well-known persons in the eld of quantum physics:
Albert Einstein and Louis de Broglie (idem., p. 257). The works of both physicists had great
implications for the constitution of matter and light. Where, as discussed in subsection 2.1.2,
Einstein introduced us with a conception of light which could both we wave-like and particle-like,
de Broglie did the same for the conception of matter. De Broglie suggested that any moving body
may be accompanied by a wave and it is impossible to disjoin motion of body and propagation of
wave.(De Broglie, 1924 in Jammer, 1966, p. 244). These physicists gave Schrodinger important
clues for his approach.
14
In his paper series Quantisation as a Problem of Proper Values Schrodinger introduces a
mechanics for quantum theory based upon these considerations about the wave-like character of
matter and particle-like character of light. He states that he wants to show that the customary
quantum conditions can be replaced by another postulate(Schrodinger, 1926, p. 1). The notion
of quantum numbers should follow from this physical consideration, and Schrodinger writes that
these whole numbers appear like the node numbers of a vibrating string.
15
(idem.) He uses an
equation called the Hamilton-Jacobi equation to construct a variational principle which gives rise
to a dierential equation which can be solved for a new introduced function . This means that
the earlier quantum conditions and hypotheses are replaced by this variation principle.(idem.,
p.2). After he reached the dierential equation Schrodinger tries to deduce the energy spectrum
of the hydrogen atom, this will however not be treated in this paper.
4.2 Calculation
4.2.1 Part I
In his paper Schrodinger almost immediately starts with his main deduction of a new mechanics,
a formalism which could explain the ad hoc conditions of the old quantum theory. He begins with
the Hamilton-Jacobi dierential equation and this needs some explanation.
The Hamilton-Jacobi equation is a is a reformulation of classical mechanics, just like Lagrangian
or Hamiltonian mechanics. It comes forth out of a specic canonical transformation in Hamiltonian
mechanics. A canonical transformation in Hamiltonian mechanics is a transformation from one set
of co-ordinates (q
i
, p
i
, t) to a new set (Q
i
, P
i
, t), under which the form of the Hamilton equations is
preserved without necessarily preserving the Hamiltonian itself (Goldstein, 1950, p.237,238). This
is done with the use of a, so-called, generating function S;
S (q
1
, . . . , q
n
, t) (4.1)
13
Heisenberg submitted his paper on the 29th of July 1925 whereas Schrodingers rst part was received January
the 27th 1926
14
Schrodinger cites Louis de Broglie explicitely as an important inuence (Schrodinger, 1926, p. 9)
15
This is exactly what happens when you solve the famous wave equation, and gives a beautiful picture of the
use of wave-phenomenon in Schrodingers thinking.
15
4 Schr odinger and Wave Mechanics 4.2 Calculation
In this context the solution S is called Hamiltons Principal Function. It is related to the gener-
alised momentum:
p
i
=
S
q
i
(4.2)
(Goldstein, 1950, p.274)
What is important is that when solving the Hamilton-Jacobi equation we are at the same time
obtaining a solution to a mechanical problem. (Goldstein, 1950, p275). This is due to the fact that
the Hamiltons Principal Function is the generator of a transformation to constant co-ordinates
and momenta. That is the reason the Hamilton-Jacobi formalism is used in physics and, in this
context, by Schrodinger
16
.
Now for the equations. We are only concerned with Hamiltonians which do not involve time
explicitly. In this case the Hamilton-Jacobi equation has the following form:
H
_
q
i
,
S
q
i
_
= E (4.3)
Where E is a constant. The equation is just a partial dierential equation for the function S.
This is Schrodingers starting point.
Now Schrodinger wants to nd a solution to this equation such that the solution can be
represented as the sum of functions. He wants to have a function separable into the independent
variables q
i
. Due to this consideration Schrodinger puts for S a new unknown such that it
will appear as a product of related functions of the single co-ordinates. (Schrodinger, 1926, p.1,
emphasis in original).
S = K log (4.4)
Where has no dimension, and Schrodinger introduces K for dimensional purposes, and has those
of action.
What does this form mean? When S needs to be represented as the sum of functions,
Schrodinger chose a new function such that he can achieve this. With the use of a natural
logarithm of a product of functions this is accomplished fairly easy, so that is where the new func-
tion enters. And that is also thuswhy appears as a product of functions. When we insert this
new S into equation [4.3] we get the following:
H =
_
q
i
,
K
q
i
_
(4.5)
Without looking for the real solution of equation [4.5], we can now rewrite this equation in a
quadratic form equated to zero. This informally means no more then to write out the Hamil-
tonian for a system. In his article Schrodinger looks to describe the hydrogen atom. He thus uses
a potential energy of e
2
/r. Using the Hamiltonian of p
2
i
/2m, equation [4.5] can be written in a
three dimensional Cartesian form:
17
1
2m
K
2
2
_
_
x
_
2
+
_
y
_
2
+
_
z
_
2
_
=
_
E +
e
2
r
_
(4.6a)
Or;
_
x
_
2
+
_
y
_
2
+
_
z
_
2
2m
K
2
_
E +
e
2
r
_
2
= 0 (4.6b)
It is easy to recognise this form when one takes in mind the generalised momentum p
i
=
K
q
.
This equation forms Schrodingers foundation, from here he continues with the main point of
his paper. This is his postulate which eventually drives him towards his new mechanics. It looks
as follows: For the quadratic form Schrodinger comes up with a condition which must be satised
to deduce all the old quantum conditions:
16
When interested in the formal aspects of the Hamilton-Jacobi theory I recommend to you the book on advanced
Classical Mechanics by Herbert Goldstein (1950)
17
The Hamiltonian in question Schrodinger calls the Hamilton function for Keplerian motion
16
4 Schr odinger and Wave Mechanics 4.2 Calculation
We now seek a function , such that for any arbitrary variation of it the integral
of the said quadratic form, taken over the whole co-ordinate space, is stationary,
being real, single-valued, nite, and continuously dierentiable up to the second order.
(Schrodinger, 1926, p.2)
As quoted above, Schrodinger thus was looking for [4.6b] to be stationary when integrated over the
whole of co-ordinate space. But for what purpose would one state this? So reecting, he takes the
integral over all space of H E, because this is essentially what is expressed by [4.6b], and states
this integral should be stationary. You could say Schrodinger comes up with a new and stronger
condition for the function , replacing the the quadratic form of his Hamilton-Jacobi equation
[4.6b]. According to Schrodinger another condition needs to be satised to describe phenomena
correctly in the quantum regime. But why it should exactly be this condition stays unclear.
For now we could say this is an incomprehensible step in Schrodingers deduction of a quantum
mechanics, a step for which in his rst article a explicit drive is missing. So in this section I will
just work on with this statement. In the second part Schrodingers interpretation of this step will
be discussed. First we look at the results of these conditions.
Schr odinger is stating that the equation must be stationary. This corresponds with equating
the variation of this equation to zero. From the discussed step thus the following appears:
J =
___
all space
dxdydz
_
_
x
_
2
+
_
y
_
2
+
_
z
_
2
2m
K
2
_
E +
e
2
r
_
2
_
= 0 (4.7)
Now we have an expression that is the mathematical equivalent of the mentioned quote. This
is a variational problem which can be solved fairly easy
18
. We will now briey dive into this
mathematical procedure.
First the variation can be distributed over the terms in the integral:
J =
___
all space
dxdydz
_
_
x
_
2
+
_
y
_
2
+
_
z
_
2
2m
K
2
_
E +
e
2
r
_
2
_
= 0 (4.8a)
=
___
all space
dxdydz
_
x
_
2
+
_
y
_
2
+
_
z
_
2
2m
K
2
_
E +
e
2
r
_
2
_
= 0(4.8b)
To insert the variation into the squared of the partial derivatives, one must proceed in the following
way:
q
i
_
2
= 2
_
q
i
__
q
i
_
= 2
q
i
_
q
i
_
(4.9)
= 2
q
i
_
q
i
_
2
q
2
i
1
2
q
i
_
2
=
q
i
_
q
i
q
2
i
(4.10)
Using this procedure in [4.8b] and computing the variation of
2
gives:
J = 2
___
all space
dxdydz
_
_
x
_
x
_
x
2
_
+
_
y
_
y
_
y
2
_
+
_
z
_
z
_
z
2
_
2m
K
2
_
E +
e
2
r
_
_
= 0
(4.11)
18
In his paper at this point Schrodinger only mentions that one can nd the solution in the usual way, no
derivation is given and the following then also is totally my own construction. This type of calculus was quite
omnipresent at that time.
17
4 Schr odinger and Wave Mechanics 4.2 Calculation
Note the factor of 2 in front of the integral, which besides in equation [4.10] also appeared in the
variation of
2
;
2
= 2. From here on we can collect terms to get a nice expression we can
further manipulate.
J = 2
___
all space
dxdydz
_
x
_
x
_
+
y
_
y
_
+
z
_
z
_
x
2
+
2
y
2
+
2
z
2
_
2m
K
2
_
E +
e
2
r
_
_
= 0
(4.12)
It is now evident that we can rewrite the newly formed terms inside the integral. First notice
the partials of x, y and z which together with the partial derivatives of form a divergence of a
gradient. Secondly notice the formation of a Laplacian operator working on times the variation
of . All together this forms:
J = 2
___
all space
dxdydz
_
2m
K
2
_
E +
e
2
r
_
_
= 0 (4.13a)
or
1
2
J =
___
all space
dxdydz
___
all space
dxdydz
_
2
+
2m
K
2
_
E +
e
2
r
_
_
= 0 (4.13b)
In this form we can easily apply the divergence theorem on the rst integral to get a nal expression
which gives insight in what kind of conditions the variational principle of Schrodinger lead to.
1
2
J =
_
ds
_ _ _
dxdydz
_
2
+
2m
K
2
_
E +
e
2
r
_
2
_
= 0 (4.14)
Where ds is an innitesimal element of the innite closed surface over which the integral is taken.
The dot product of the gradient with the normal vector of the integration surface ds
equates
to a single dierential since of course the dot product of perpendicular vectors results in zero.
1
2
J =
_
ds
n
_ _ _
dxdydz
_
2
+
2m
K
2
_
E +
e
2
r
_
_
= 0 (4.15)
Here ends our mathematical venture, and what a ne result we have. Now a reection is needed.
From this expression Schrodinger famously concludes a condition which at that time drastically
changed the eld of physics; Schrodingers Wave Equation. Since the terms in previous expression
are independent they both must equate to zero, and therefore we can formulate the following
conditions:
2
+
2m
K
2
_
E +
e
2
r
_
= 0 (4.16a)
_
ds
n
= 0 (4.16b)
First note that equation [4.16b] can be ignored since it only says something which is already
known; should become zero at innity (remember the integration is taken over an innite
closed surface). The important result is thus reached with [4.16a]. From this point in his article
Schrodinger deduces the energy-levels of the hydrogen atom by solving this equation, and gets a
result which perfectly matches with Bohrs calculation. Today it is known to every undergraduate
of Physics how to solve this equation, the Schrodinger time-independent wave equation for the
hydrogen atom. I will hence not go into the calculation of the energy relation as Schrodinger does
in his article. It is not this that leads to a new mechanics.
18
4 Schr odinger and Wave Mechanics 4.2 Calculation
More important is to reect upon the way Schrodinger took to get to the point of a governing
wave equation. Starting with the standard Hamilton-Jacobi dierential equation, we now have
a condition for a new function from which the energy-levels of the hydrogen atom can be
perfectly calculated. The essential thing of this procedure, which is a start of a mechanics of
quantum theory, Schrodinger says to be that the postulations of whole numbers no longer
enters into the quantum rules mysteriously, but that we have traced the matter a step further
back, and found the integralness to have its origin in the niteness and single-valuedness of
a certain space function.(idem, p9, quotation marks in original). By this he means that the
quantum-numbers, and thus the discreteness of observables, are no postulates but really follow
from a more fundamental condition. These quantum numbers come into play in solving [4.16a]
and implementing Schrodingers conditions for (e.g., single-valuedness).
But now remember what Schrodinger did at the start of his paper. He made a step in which
he states an incomprehensible condition for a newly introduced function . We could be sceptical
and conclude Schrodinger with this procedure only replaces the old postulates with a new one in
the form of this condition, from which the old one can be deduced on a rather elegant way. He
delays the inevitable.
Such interpretations are of course unnecessary and totally unwanted in a historical reection
like this one. But it is in this sense interesting, and historically relevant, because of the second
paper Schrodinger published on this topic (which ended up in the same volume of the Annalen
der Physik of 1926). In this paper he himself takes on this point of view and recties the steps
taken in the paper we just discussed.
4.2.2 Part II
In Schrodingers second paper, properly named Part II, he proceeds on a manner totally unlike
his rst part. Part I was a very chaotic paper, one in which there was no signicant structural
approach towards the problem he wanted to solve. Conditions are thrown around and many
solutions are almost postulated without any derivation given at all. It gave rise to a feeling of
a sudden idea which he came up with and worked out as fast he could. This is stressed again
by the fact a correction by Schrodinger himself on his rst article was submitted ve days after
he submitted his second part.
19
This chaotic feeling does not at all appear in his second part.
In Part II Schrodinger writes again three paragraphs on an new mechanics for quantum theory,
but now starts with a conceptual idea. He wants to use an old analogy, found by Hamilton, to
conceptually make clear where this quantum mechanics is coming from and how the equations that
must govern this mechanics arise. He starts with a reection on his unintelligble transformation
of equation [4.4] in the rst part and the equally incomprehensible transition from the equating
to zero of a certain expression to the postulation that the space integral of the said expression shall
be stationary (Schrodinger, 1926, p. 13), a daring but almost necessary statement since his rst
part missed an important drive.
In the rst paragraph of part II he explains this old analogy where after he concludes one
could expand this analogy further. This he does in the second paragraph and comes up with a
construction of a new mechanics. In the third paragraph he applies the vocabulary of the analogy
on some examples. We shall not reproduce all of his work in this paper, and focus on the fascinating
analogy Schrodinger used. The derivations will be brief, but it is the motivation which interests
us and which is also the main point of his second communication.
To understand his motivation we rst must distinguish between two forms of optics, undulatory
optics and geometrical optics. Undulatory optics, or wave optics, is the optics generally is worked
with in physics. Also called physical optics, undulatory optics deals with phenomena of dispersion,
diraction, interference and polarization, it is the optics of the propagation of waves and waves
fronts. In geometrical optics wave phenomena are approximated by rays and ray phenomena. One
here speaks of rays which determine the propagation of light, which can quite correctly describe
reection and refraction in optics. Fermats principle is a most famous description in geometrical
19
The rst part was submitted 27 January, the second communication 23 February and his correction on the rst
part the 28th of February.
19
4 Schr odinger and Wave Mechanics 4.2 Calculation
optics. Formally geometrical optics is the short wavelengths limit of undulatory optics, i.e. with
very short wavelengths wave propagation can be approximated by rays. Hamilton found out that
he could describe both geometrical optics and mechanics with the same formalism, the Hamilton-
Jacobi formalism; With the help of his characteristic function he discovered that the actual motion
of a point mass in a eld of force is governed by the same formal law as the propagation of the rays
of light. (Jammer, 1966, p. 236). This analogy is what Schrodinger expounds in paragraph one.
In a somewhat more modern interpretation the analogy, and the rst paragraph of Schrodingers
second part, may to be summarized as follows.
20
We start with Hamiltons variational principle;
_
L dt = 0. From this relation one can derive
Maupertuis principle of least action for systems with a constant energy:
_
2T dt = 0 (4.17a)
Or:
_
_
2m(E V ) ds = 0 (4.17b)
With this we have a formulation of the equations of motion for a physical system, not as dierential
equations but as an integral equation. This we can compare with Fermats idea on rays of light,
the basis of geometrical optics. Fermats principle of least time can formally also be expressed as
a variation principle:
_
n
c
ds = 0 (4.18)
With n the refractive index. It immediately becomes clear that there is a similarity between [4.17b]
and [4.18]. One is a variation for mechanical problems, the other for problems in geometrical optics
and still they have much in common. If we in optics formulate the phase velocity, u = c/n, we
see that the expression C[2m(EV )]
1/2
plays in mechanics the same role as this phase velocity!
This can be represented as follows:
u =
c
n
C[2m(E V )]
1/2
(4.19)
Where C denotes a constant. This is an analogy which was the result of the theoretical research by
Sir William Rowan Hamilton, a result which was published between 1828 and 1837. Hamiltons
optical-mechanical analogy, as we will call it, can be extended further. First, one can say the
action function S denes an action surface, S(q
i
, t)=constant for which we have p = S and
S/t = L pv = E. Second, in the optic region we can express the phase in terms of the
frequency and the wave vector k, (, t) = t + kr. From we can construct the following
relations: k = and /t = . This suces to see a complete formal analogy between the
surface of constant action of a mechanical system and the surface of constant phase in optics. The
wave vector corresponds to the momentum and the frequency to the energy of the particle.
Schr odinger ends the paragraph with a discussion on what to do with this analogy. Maybe one
cannot draw the analogy any further, and should a real parallel between optics and mechanics not
be made. He continues:
But even the rst attempt at the development of the analogy to the wave theory leads
to such striking results, that a quite dierent suspicion arises: we know to-day, in fact,
that our classical mechanics fails for very small dimensions of the path and for very
great curvatures. Perhaps this failure is in strict analogy with the failure of geometrical
optics, i.e. the optics of innitely small wave lengths, that becomes evident as soon
as the obstacles or apertures are no longer great compared with the real, nite, wave
length. Perhaps our classical mechanics is the complete analogy of geometrical optics
and as such is wrong and not in agreement with reality; it fails whenever the radii of
curvature and dimensions of the path are no longer great compared with a certain wave
20
In this summary I will briey follow Max Jammers formulation (Jammer, 1966, pp 236,237).
20
4 Schr odinger and Wave Mechanics 4.3 Contextual
length, to which in, q-space, a real meaning is attached. Then it becomes a question of
searching for an undulatory mechanics, and the most obvious way is the working out
of the Hamiltonian analogy on the lines of undulatory optics.
(Schrodinger, 1926, p. 18. Emphasis in original.)
Further considerations presented in Schrodingers second part will not be discussed. In his
second paragraph Schrodinger does so, but these are compared with the main argument elaborated
above not noteworthy. The most important consideration forms out of this quite naturally: ...we
must treat the matter strictly on the wave theory, i.e. we must proceed from the wave equation
and not from the fundamental equations of mechanics, in order to form a picture of the manifold
of the possible processes. These latter equations are just as useless for the elucidation of the
micro-structure of mechanical processes as geometrical optics is for explaining the phenomena of
diraction. (idem., p. 25). Analogous to undulatory optics there must also exist a undulatory
mechanics of which classical mechanics is a limiting case.
We have thus seen his wave equation appear in part I, and in Part II a most extraordinary
justication of the use of the wave equation came forth. It strikes really deep into a physical
understanding of the world. It is a theory in which dierent phenomena can be explained from
the same principles, a venture which modern string theory wishes to accomplish on a larger scale
even today.
4.3 Contextual
Schrodinger eventual submitted four communications in which his ndings were more and more
elaborated. In the third communication he goes into in perturbation theory, with applications to
the stark eect of the Balmer lines very elaborately. And in the fourth a series of independent
subjects is treated on all of which his new mechanics is applicable. In the end Schrodinger manages
to elaborate an old existing formal system, lost and forgotten, and give in new life.
It seems this day however the work of Schrodinger can, in a tragic way, be compared with
the work of his predecessor, Sir Hamilton. Hamiltons famous mechanics was only elaborated
disjointed from his optics, Hamiltons optics reappeared later in the theory of the eikonal
21
, again
disjointed. His proposition of a synthesis of a formalism for mechanics and geometrical optics was
ignored. When we today look in our highly synthetic and didactic graduate books on Quantum
Mechanics one can nd no trace of Schrodingers initial drive towards a wave equation of micro-
mechanics. If this is a good or a bad thing for physics one must decide for oneself, but I think it
could be said a beautiful work of physical apprehension, useful or not, is to be forgotten.
In his approach of quantum theory Schrodinger did not do away with the old physics. He saw
in the old quantum conceptions no need for a new beginning but the need for a sequel, a sequel of
the old theories of optics and mechanics. No new rmaments needed to be laid down, it was all
part of a bigger whole. One can with this observation notice fundamental dierences in the style
of Heisenberg compared with that of Schrodinger. But it also appeared that both formalism, how
dierent their origins may be, formally have much in common. Consideration of this type will be
discussed in the following section.
21
The eikonal equation is a partial dierential equation useful in problems of wave propagation. It provides a
link between undulatory optics and geometric optics.
21
5 Discussion 5.0 Discussion
5 Discussion
Ever-newer waters ow on those who step into the same rivers.
Heraclitus
In previous sections we have seen the intriguing mathematical and physical steps both Heisenberg
and Schrodinger undertook to create a new formalism for quantum theory. A formalism from
which all old quantum conditions naturally appear. Using concepts which dier radically from
each other, both physicists managed to formulate a theory which eventually could explain the
quantum nature of our experience. Notice it is very extraordinary that these two theories were
formulated parallel to each other, independently. We are speaking of the parallel development of
two totally dierent explanations to describe the very same range of experience.
It is most interesting to reect upon this fact. I will therefore now consider some topics for
which this development I think has, or could have, profound implications.
5.1 Style and Stylistic Dierences
The development in question exemplies a fundamental aspect of scientic endeavour; the concept
of style. I like to say Heisenberg and Schrodinger came up with dierent theories because they both
had a very dierent background in physics, institutions but also in life as a whole. A dierence
which ends up in a dierent style of thinking.
Heisenberg formulated a real new mathematical calculus to cope with the existing problems.
A calculus where special computational rules were involved and quantities turned out to be non-
commutative. Where the latter aspect was something rarely accounted before. The basis for
the theory was Bohrs work on spectral lines, where discreteness was introduced in postulation
and formed the foundation of quantum theory. Therefore this element of discontinuity was also
deeply embedded in the Heisenbergs mechanics. Looking chiey at the mathematical properties
of quantum-theoretical quantities made it also a real algebraic approach. This together made the
theory of Heisenberg conceptually very hard to understand, and resisted any possible graphical
interpretation. Finally we can also state that matrix mechanics is, in the end, based upon the
concept of a particle (or corpuscle), a discrete entity who governs the micro world in his interactions
and possible transitions (Jammer, 1966, p. 271). He had an abstract mathematical style of
approach. Why does the theory of Heisenberg have all these characteristics? Is this a physical
truth, or rather a consequence of other elements?
Totally unlike Heisenberg, Schrodinger worked with a mathematical apparatus which had great
familiarity among physicists; dierential equations. Further characterized by a more analytical
approach with the help of the discussed analogy, Schrodinger manages to get a conceptually very
clear theory. This analogy also stresses his dependence on the element of continuity instead of
discontinuity, in principle working with an extension of the classical laws of motion. And again,
the analogy naturally gives also the concept of wave a prominent place in Schrodingers mechanics,
depending on only the wave equation as both classical mechanics and geometrical optics depend on
the Hamilton-Jacobi equation (Jammer, 1966, pp. 271,272). Schrodinger had a more Anschaulich
physical style. We again see specic elements of this theory which are used by Schrodinger, why
are these and not others incorporated in the theory?
Looking at these consideration one could say that we are dealing with two radically dierent
thought styles
22
. Due to the dierent background, in physics but also in general, of both physicists
they have a dierent directed perception of the world they investigate. This background could be
dened as them being a member of a specic thought-collective, a community of persons mutually
exchanging ideas or maintaining intellectual interaction (Fleck, 1935, p. 39). In such collective
22
A term which is introduced by Ludwik Fleck in his magnum opus Genesis of a Scientic Fact (1935). The
philosophical notes on style in this paper are largely inuenced by his work on the philosophy and sociology of
science.
22
5 Discussion 5.2 Scientic Knowledge
one shares the meaning of dierent concepts, and they are together in control of the meaning
assigned to concepts. Scientists possess a style and are member of one or more collectives
23
, where
of course both style and collective are intimately related to each other. With this conceptual
toolbox we can thus more elegantly describe the above development.
Heisenbergs work on dispersion theory was a key inuence on the development of his thought
style. As an apprentice of Born and having collaborated with Kramers and Bohr (van der Waerden,
1967, p. 17) he became a member of their thought collective, and shared the problems they were
working on and the way they treated those problems. His perception became directed in this
particular way. Schrodingers early work on continuous media and the applications of eigenvalue
problems made that Schrodinger had a special readiness for directed perception, directed towards
a continuous interpretation of quantum theory governed by dierential equations. He could see
the wave mechanics where Heisenberg saw matrix mechanics. They both saw something dierent,
and assigned a dierent meaning to it because of their dierent readiness of perception.
It is thus with studying such a strange development, where two dierent theories were formed
at the same time, to cover the same range of experience, one can clearly see that science is an
enterprise in which dierent styles exist which direct toward a dierent perception of the eld and
it problems. And that real true descriptions of our experience do not come forth, just because
we have a dierent experience coupled to styles and collectives. Even physics is no rigid structure
where one has a well determined and true concept. It is an ever-changing system of interacting
thought-styles and collectives.
Really interesting is the fact that Erwin Schrodinger also wrote a paper on the relation between
his and Heisenbergs mechanics. He concludes in his paper a formal equivalence of both theories.
We will not consider the formal equivalence itself, but it does show in physics unity is needed
in the concepts they use. The thought collective of the eld of physicists must be as smooth as
possible, but uctuations of course always exist; uctuations in the collective but also in the style of
individuals, of which the papers subject is great example. It is here that in quantum mechanics,
after this paper about the formal equivalence, the styles of Heisenberg and Schrodinger have
merged. To clarify: In physics we all use, for example, the same Maxwell-equations. But this does
not mean that this is just a simple fact, these equations were also constructed by physicists which
each a dierent styles of thought, and also has dynamic history. And even now these equations are
determined they are still not rigid and mean dierent things, concerning modern developments,
but also concerning individuals.
5.2 Scientic Knowledge
There are two more point which I would like to make concerning a discussion on what the parallel
development of wave and matrix mechanics can learn us about science. The rst point has to
do with how a theory like quantum mechanics, and then particularly the discussed formalisms, is
formed and discovered. I think namely that it gives a beautiful example of how should see this
concept of discovery. We have seen a very complicated process in which many steps occurred in
nding a formalism for quantum theory. This process was guided by the dierent style of the
physicists. And it is this process I want to emphasize. In the process of working their way towards
a fully edged theory from out their own style, they created knowledge of how one genuinely
could describe this world. Could one thus say it is here were it really is discovered that Quantum
Mechanics exists? I think this is a bad interpretation. A discovery is not a matter of uncovering
reality, uncovering the facts. We have seen people actively participate and create this mechanics,
using their own style to construct it. Facts are not passively discovered, they are actively created.
How they are created depend on the dierent styles and collectives in question.
The second point concerns a reection upon our own place in physics. In todays physi-
cal endeavours and teachings the process of the discussed development is of course no longer
present. The whole process in which quantum theory has developed, from Planck to Heisenberg
23
One is of course a member of a thought collective implicitly, just like most are not aware of their own thought
style.
23
5 Discussion 5.2 Scientic Knowledge
and Schrodinger, will no longer be represented with all details of construction and many steps of
the process will never be looked at again. I think that eventually the actual course of the develop-
ment becomes rationalized and schematized. We no longer see the moment which physicists were
constructing the now determined facts of quantum mechanics. The long path which came before
the construction of the formalism, where there was no dened quantum theory and strange steps
were taken to acquire workable equations, is rationalized and one again begins thinking in the
static accumulation of facts, false and true. Preventing this whiggish type of history of science,
I think reections of such types presented here could act to reect upon the scientic enterprise
in which one places themselves. And I would even say that reections of the kind presented in
this paper are essential for a conceptual understanding of a physical theory, looking back at the
process and trying to understand in a less rationalized way. But even more to learn how science
works. We can learn that we also possess a style of thought and we also are member of thought
collectives which directs our perception of science.
24
6 Conclusion 6.0 Conclusion
6 Conclusion
In this paper the development of a formalism for quantum theory by Erwin Schrodinger and
Werner Heisenberg has been discussed. It became clear that from completely dierent conditions
and tenets, both physicists independently and parallel to each other, created a mechanics which
could describe the exact same range of experience and account for the concept of line spectra.
Heisenbergs work builds upon the old quantum mechanics. He looked what the characteristics
were of quantum-theoretical observables and tried to compute a generalised quantum-theoretical
quantity. For this he used an analogous form of the classic Fourier expansion, which he extended
to the region of quantum theory. With this new type of expansion he tried to compute a quantum-
theoretical x(t)
2
for which he deduced a new type of multiplication rule. This was later recognised
as the multiplication of matrices, and observables in quantum theory turned into innite dimen-
sional matrices. Schrodinger approached quantum theory in a totally dierent fashion. It appeared
in the second part of his paper Schrodinger got inspired by an old analogy worked out by Hamil-
ton. Since geometrical optics was a small wavelength limiting case of wave optics, and geometrical
optics had through the Hamilton-Jacobi equations a formal analogy with classical mechanics, this
new quantum mechanics of which classical mechanics was a limiting case should be governed by
the same formalism as wave optics; the wave equation. But in his rst paper Schrodinger used
an unintelligible variational principle to come up with the same equation, and no analogy was
used to justify his steps.
The dierent formalisations of both physicist I tried elegantly to describe by their dierence
in style. I tried to the determine dierent styles and collectives in which these physicists held
place. Both their style and the collective they communicated with directed their perception in
specic ways, ways in which they could see both a dierent quantum theory. Realising this, such
an historical reection as presented above could become an important tool for reection upon the
scientists own discipline and their place in it. One sees the ux of scientic enterprise where there
are no rigid conceptions available, and not the static discoveries in history which today seem so
natural. It could also be discussed regarding the above historical reection, that such reections
are in some way essential for the conceptual understanding of modern theories
25
7 Acknowledgements 7.0 Acknowledgements
7 Acknowledgements
The author would like to thank Prof. Dr. A.J. Kox for his professional guidance into the world of
the History of Physics. Under his supervision it became clear how important it sometimes is to
reect upon the history of your own discipline, to not only gain consciousness of your position in
the eld but also to learn a lot about your discipline and even science itself.
The author would also like to thank Prof. Dr. E.P. Verlinde for a short informal discussion on
his view of the origin of Heisenbergs matrix mechanics.
26
8 Bibliography 8.0 Bibliography
8 Bibliography
Bohr, N. (1918). On the Quantum Theory of Line-Spectra, Kgl. Danske Vid. Selsk.
Skr.,nat.-math. Afd., 8
Born, M. and Jordan, P. (1925). Zur Quantenmechanik, Z. Phys. 34, p. 879
Fleck, L. (1935). Genesis and Development of a Scientic Fact, University of Chicago Press,
Chicago
Goldstein, H. (1950). Classical Mechanics, Addison-Wesley, Tokyo, Japan
Heilbron , J.L. (1977). Lectures On The History Of Atomic Physics 1900-1922, In: History
of Twentieth Century Physics, Academic Press, pp 40-108
Heisenberg, W. (1925).