On The Theory of Metals - H. Bethe

On the Theory of Metals.
I. Eigenvalues and eigenfunctions of a linear chain of atoms
H. Bethe in Rome
Zeitschrift für Physik 71 (1931) 205–226.
17 June 1931
Original title:
Zur Theorie der Metalle.
I. Eigenwerte und Eigenfunktionen der linearen Atomkette.
(Translation by T. C. Dorlas, DIAS, 9 November 2009.)
Abstract
A method is described for calculating, to zeroth and first order
respectively (in the sense of London and Heitler), the eigenfunctions
and eigenvalues for a “one-dimensional metal” consisting of a linear
chain of very many atoms, each of which has a single s-electron with
spin. Besides Bloch’s “spin waves”, there exist eigenfunctions where
the spins pointing in one direction tend to reside on neighbouring
atoms. These eigenfunctions might be of importance for the theory of
ferromagnetism.
§1. In the theory of metals, one has so far considered only the movement
of single conduction electrons in the field of the metal atoms (Sommerfeld,
Bloch). The interaction between the electrons has been ignored, at least
to the extent that it could not be included in the potential acting on the
electrons. This approach has been very fruitful in treating the problem of
metallic conduction (with the exception of superconduction). However, it
has not allowed a deeper understanding of the problem of ferromagnetism
for example1 , and it has made the calculation of cohesion forces in metals
a virtually hopeless task: The exchange forces between the electrons, which
are relevant for the size of the first-order terms in the perturbation series
1
F. Bloch, Zs. f. Phys. 57, 545 (1929) showed that under certain circumstances, ‘free
electrons’ can also exhibit ferromagnetism.
1
for the energy, are of the same order of magnitude as the zero-point energy
of the electron gas, and one can estimate that the second order is again of
this magnitude, etc. This behaviour makes one somewhat sceptical about
the entire approximation, in which the movement of the individual electrons
(the kinetic zero-point energy) is considered to be of greater importance than
the interaction (or exchange) energy.
Therefore, Slater2 and Bloch3 have recently attempted to approximate the
problem from the opposite side, i.e. assuming that the atoms are fixed and
considering the interaction as a perturbation, just as in the London-Heitler
approximation for molecules. Slater was especially interested in the cohesion
forces in non-ferromagnetic materials, where the London-Heitler exchange
integral J is in general negative4 , and he developed an interesting method
for the approximate calculation of the ground state energy of such metals, for
which the total spin obviously vanishes. Bloch, for the purpose of ferromag-
netism, calculated approximately the lowest terms in a systematic fashion in
the opposite case where J > 0. However, in his method, he obtained too
many eigenvalues. It is the aim of the present work to demonstrate, at first
in the case of a linear chain of atoms, a method for calculating all eigenvalues
of a one-dimensional crystal to arbitrary precision. This arbitrary precision,
of course, has to be understood in the framework of the first approximation
of the London-Heitler method, so that the problem has the same status as
the H2 molecule in the work of London and Heitler. We obtain, in addition
to modified versions of the solutions found by Bloch, a set of solutions of a
different type, in such a way that the total number of eigenvalues is exactly
the correct one.
§2. Our problem is therefore as follows: Given is a linear chain of very
many (N ) identical atoms. Each atom has, apart from closed shells, one con-
duction electron in an s-orbit. We assume that the eigenfunction of the free
atoms is known. What are the eigenfunctions to zeroth, and the eigenval-
ues to first-order approximation of the total system including the interaction
between the atoms?
Disregarding the interaction, each atoms has two states with equal energy:
the spin of the conduction electron can point to the right or to the left. To
zeroth order, the eigenvalue of the chain is therefore 2N -fold degenerate. A
state of the chain can be fixed by indicating for which atoms the spin points to
the right. Assume that this is the case at the atoms with numbers m1 , . . . , mr .
Let the corresponding eigenfunction of the chain be ψ(m1 , . . . , mr ). The
2
J. C. Slater, Phys. Rev. 35, 509 (1930).
3
F. Bloch, Zs. f. Phys. 61, 206 (1930) (in the following referred to as l.c.).
4
W. Heisenberg, Zs. f. Phys. 49, 619 (1928).
2
correct eigenfunction, in zeroth-order approximation, can then be written in
the form X
Ψ= a(m1 , m2 , . . . , mr ) ψ(m1 , . . . , mr ),
m1 ,m2 ,...,mr
where the numbers m1 , . . . , mr run through the values 1 to N , and we will

assume that
m 1 < m 2 < · · · < mr .
If we calculate the matrix elements of the interaction energy w.r.t. the
states defined by the spin distributions m1 , . . . , mr , then we obtain the fol-
lowing5 :
Diagonal elements: If the spin distribution m1 , . . . , mr contains N 0 neigh-
bouring pairs of parallel spins, then
Wm1 ,...mr , m1 ...mr = E0 − N 0 J.
E0 is the interaction due to the charged clouds of the atoms; J is the

London-Heitler exchange integral between neighbouring atoms. For non-
nearest neighbour atoms we neglect the exchange integral because it decreases
exponentially in the distance.
Non-diagonal elements: These occur between two states which can be
obtained from one another by exchanging two nearest-neighbour spins with
opposite spins. For example, in our notation, the states m1 . . . mi . . . mr , and
m1 . . . mi + 1 . . . mr , where it is assumed that mi + 1 is indeed a left-pointing
spin, i.e. mi+1 6= mi + 1. All such off-diagonal elements of the interaction
energy are equal to −J.
Using these matrix elements for the interaction energy, we obtain the fol-
lowing equations between the coefficients a(m1 . . . mr ) of the required eigen-
function Ψ:
X
2²a(m1 . . . mr ) + [a(m01 . . . m0r ) − a(m1 . . . mr )] = 0. (1)
m01 ...m0r
Here
2²J = e − E0 + N J, (2)
e being the total perturbed energy to first approximation. The sum runs over
all distributions m01 . . . m0r that can be obtained from m1 . . . mr by exchanging
neighbouring opposite spins6 .
5
F. Bloch, l.c.
6
The equation (1) is due to Bloch, l.c., more accurately derived [there Eqn. (5)]; the
atoms are numbered with fi there instead of mi , and instead of 2²J, he writes simply ².
3
Apart from the equation (1), the coefficients a should also satisfy the
periodicity condition
a(m1 . . . mi . . . mr ) = a(m1 . . . mi + N . . . mr ). (3)
§3. For r = 1, the solution of (1) reads, as is well-known,
a(m) = eikm ,
² = 1 − cos k,
2π
k = λ, λ = integer.
N
For r = 2 we have to distinguish two cases: Either the two right-spins are
separate from one another, in which case
−2 ² a(m1 m2 ) = a(m1 + 1, m2 ) + a(m1 − 1, m2 ) + a(m1 , m2 + 1)

+ a(m1 , m2 − 1) − 4a(m1 , m2 ) (m2 6= m1 + 1), (4-a)
or they are nearest neighbours:
−2 ² a(m1 , m1 + 1) = a(m1 − 1, m1 + 1) + a(m1 , m1 + 2)

−2a(m1 , m1 + 1). (4-b)
The first group of equations is solved rigorously by setting
a(m1 m2 ) = c1 ei(f1 m1 +f2 m2 ) + c2 ei(f2 m1 +f1 m2 ) ,

(5)
² = 1 − cos f1 + 1 − cos f2 ,
where the constants c1 , c2 , f1 , f2 are for the moment undetermined.
The second group can be satisfied by choosing c1 and c2 such that
0 = a(m1 m2 ) + a(m1 + 1, m1 + 1) − 2a(m1 , m1 + 1). (6)
Here, a(m1 , m1 ) has no physical meaning, but is defined by (5). Clearly, by

adding (6) to (4-b), it becomes of the form (4-a), which has already been
solved. Inserting (5) into (6) gives
ei(f1 +f2 )m1 [c1 (1 + ei(f1 +f2 ) − 2eif2 ) + c2 (1 + ei(f1 +f2 ) − 2eif1 )] = 0, (7)
f −f
f1 −f2
¡ f1 +f2 f1 −f2
¢
c1 cos f1 +f
2
2
− e i 12 2
sin 2
+ i cos 2
− cos 2
=− f −f = f1 −f2
¡ f1 +f2 f1 −f2
¢.
c2 cos f1 +f2
−e−i 1 2 2 sin 2
− i cos 2
− cos 2
2
4
We set c1 = eiϕ/2 and c2 = e−iϕ/2 , so that
ϕ sin f1 −f
2
2
cot = ,
2 cos f1 +f
2
2
− cos f1 −f
2
2
ϕ f1 f2
2 cot = cot − cot . (8)
2 2 2
Correspondingly,
1 1
a(m1 , m2 ) = ei(f1 m1 +f2 m2 + 2 ϕ) + ei(f2 m1 +f1 m2 − 2 ϕ) . (9)
Here it is assumed that m1 and m2 are in the basic part of the chain, i.e.
1 ≤ m1 < m2 ≤ N . The periodicity condition requires:
a(m1 , m2 ) = a(m2 , m1 + N ). (10)
[The notation a(m2 , m1 + N ) is suggested by the fact that we want to order

the mi according to their size; cf. above.] Inserting (9) into (10),
1 1
ei(f1 m1 +f2 m2 + 2 ϕ) + ei(f2 m1 +f1 m2 − 2 ϕ)
1 1
= ei(f1 m2 +f2 (m1 +N )+ 2 ϕ) + ei(f2 m2 +f1 (m1 +N )− 2 ϕ) .
Since this has to hold for all m1 and m2 , the first summand on the left must
equal the second on the right and vice versa, so that
N f1 − ϕ = 2πλ1 ,
N f2 + ϕ = 2πλ2 , (11)
λ1 , λ2 = 0, 1, 2, . . . , N − 1.
2π
The numbers f1 and f2 do not have the usual form N
λ, but their sum does:
2π
k = f1 + f2 = (λ1 + λ2 ). (12)
N
k is a true integration constant of the problem; the coefficient a(m1 , m2 ) is
multiplied by eik when both right-spins are moved one place to the right,
which obviously does not affect the physics of the system.
We presently discuss the behaviour of the phase ϕ as a function of f1 and
f2 , where we take the convention that
−π ≤ ϕ ≤ π. (13)
Interchanging f1 and f2 obviously leads to a change of sign for ϕ, while the

coefficients a remain unchanged according to (9). If we keep f2 fixed and
5
let f1 increase starting at zero, then cot(ϕ/2) decreases from +∞ to smaller
positive values, reaching 0 when f1 = f2 ; ϕ therefore increases from 0 to π.
As f1 becomes slightly bigger than f2 , ϕ jumps from +π to −π, and then
increases steadily back to 0 when f1 increases further to 2π. If f1 = f2 then
either
N f1 1
ϕ = +π, λ1 = λ2 − 1 = − ,
2π 2
or
N f1 1
ϕ = −π, λ1 = λ2 + 1 = + .
2π 2
In both cases, we have, according to (9), for all m1 , m2 ,
¡ π π¢
a(m1 , m2 ) = eif1 (m1 +m2 ) ei 2 + e−i 2 = 0.
Conclusion: f1 = f2 does not lead to a meaningful solution of the problem,

and for a given λ2 , λ1 can only take the values
λ1 = 0, 1, . . . , λ2 − 2, λ2 + 2, . . . , N − 1.
Moreover, since f1 , f2 and f2 , f1 yield the same solution, we may assume

f1 < f2 . Thus, for a given λ2 , there are λ2 − 1 solutions λ1 = 0, 1, . . . , λ2 − 2,
and λ2 runs from 2 to N − 1, so the total number of solutions is
N
X −1 µ ¶
N −1
(λ2 − 1) = .
2
λ2 =2
However, there
¡N ¢ must clearly be as many solutions as there are spin distribu-
tions, i.e. 2 . With our more accurate discussion, we have have therefore
come to the conclusion that the usual spin waves do not yield sufficiently
many solutions, whereas¡Bloch ¢ (l.c.) suggested that the method yielded too
N +1
many solutions, namely 2 .
§4. There must therefore be a further N − 1 solutions. These can be
obtained if one allows the wave numbers f1 and f2 to have conjugate complex
values. Indeed, we will find that, for every arbitrary value of f1 + f2 = k, one
can find exactly one pair of conjugate complex solutions of (8) and (11). Let
¾
f1 = u + i v,
(14)
f2 = u − i v;
then
f1 cos u2 cosh v2 − i sin u2 sinh v2 sin u − i sinh v
cot = u v u v = . (15)
2 sin 2 cosh 2 + i cos 2 sinh 2 cosh u − cos u
6
Now, by (11),
N (f1 − f2 ) = 2N i v = 2π(λ1 − λ2 ) + 2ϕ,


ϕ = ψ + i χ, 
ψ = π(λ2 − λ1 ), (16)

χ = N v.
If v should take a finite value (i.e v > 0), then χ must be very large, and
hence
ϕ sin ψ − 12 ieχ
cot ≈ 1 χ ≈ −i + 2e−χ (sin ψ − i cos ψ),
2 2
e − cos ψ
ϕ
cot ≈ −i(1 + 2e−χ+iψ ). (17)
2
To first approximation, therefore,
ϕ f1 f2
2 cot = cot − cot = −2 i
2 2 2
sin u − i sinh v sin u + i sinh v
= − ,
cosh v − cos u cosh v − cos u
sinh v = cosh v − cos u,
e−v = cos u (18)
and
² = 2 − cos(u + iv) − cos(u − iv)

µ ¶
1
= 2 − 2 cos u cosh v = 2 − cos u cos u + ,
cos u
² = sin2 u = 21 (1 − cos 2u). (19)

Apparently, we must have cos u ≥ 0, i.e. − π2 ≤ u ≤ π
2
. Therefore, if k =
2u + 2nπ (n integer) is given, and
k
between 0 and π, thenu = ,
2
k
between π and 2π, then u = − π.
2
In the second approximation, we set
v = v0 + ², (20)
7
where v0 is the value just obtained to first approximation. Then,
ϕ sinh v
2 cot = −2i − 4ie−χ+iψ = −2i
2 · coshµv − cos u ¶¸
sinh v0 cosh v0 sinh v0
= −2i 1+² −
cosh v0 − cos u sinh v0 cosh v0 − cos u
· µ 2
¶¸
1 + cos u
= −2i 1 + ² −1
1 − cos2 u
= −2i(1 + 2² cot2 u),
² = tan2 u e−χ+iψ .
Since ² is in general very small, we can replace χ by N v0 . ψ is then deter-
mined by the given value of k:
N
If 2π k = λ1 + λ2 = λ is even, and smaller than N2 , then we can put
1
λ1 = λ2 = λ, ψ = 0.
2
N
Similarly, for λ ≥ 2
and N + λ even,
1
λ1 = λ2 = (−N + λ), ψ = 0.
2
If λ resp. N + λ is odd, we must write
λ2 = λ1 + 1, ψ = π.
Correspondingly, we have
² = ± tan2 u e−N v0 . (21)
For even λ (resp. N + λ, positive sign), therefore, v > v0 ; hence, if in the

next approximation v0 is replaced by v then the resulting ² is smaller than
that in the second approximation; the scheme for calculating v therefore
certainly always converges, and even very rapidly. However, if the negative
sign has to be taken (for odd λ or N + λ), then v < v0 and ² increases (in
absolute value) in subsequent higher approximations. This does not of course
make a difference as long as v0 is finite (i.e. v0 > 0 (ed.)), since ² is then only
a tiny correction. But if u becomes small, and hence cos u ≈ 1, then v0 also
becomes small, and in fact we have, to reasonable accuracy,
1
v0 = − log cos u = 1 − cos u = u2 .
2
8
Now, if u is small of order √1 , then N v0 is finite, and
N
² = −u2 e−N v0
is larger in absolute value than v0 when
1.4
N v0 < log 2 ≈ 0.7, u2 < .
N
q
For u < 1.4N
and odd λ, therefore, v1 = v0 + ² becomes negative, and the
scheme diverges. As a result, there is no solution with two conjugate complex
wave numbers7 .
[In its place there appears an additional solution with two real wave num-
bers. Again, we consider k as given, so that f2 = k − f1 . Previously, in the
discussion about real solutions, we tacidly assumed that with increasing f1 ,
F = N f1 − ϕ also increases monotonically. For, this is rather probable since
f1 is multiplied by such a large factor N , and ϕ only by 1. Nevertheless, for
small k, this increase does not happen. Indeed, we have, using (8),
1 1
dF 2 sin2 f1
· 21 + 14 2 1k−f1
sin
=N −2 ¡1
2 2
¢ ,
df1 f 1 k−f1 2
1 + 2 cot 2 − 2 cot 2
1
and if we put f1 = f2 = k2 , then

dF 1
=N− .
df1 sin2 f21
This is obviously only positive as long as
f1 1
sin >√ .
2 N
For k < 4 arcsin √1N ≈ √4N , the rise of F = N f1 − ϕ as a function of f1 is
interrupted by a drop in the neighbourhood of f1 = k2 . If N2πk = λ is odd, then
k λ−1
N − π = 2πλ1 = 2π ,
2 2
where λ1 is an integer, and for λ1 = λ−1 2
, λ2 = λ+1
2
there are two solutions
of the equations (8) and (11): Apart from f1 = f2 , ϕ = π, there is a so-
lution f1 < f2 , ϕ 6= π, for which, in contradistinction from the former, the
coefficients (9) remain finite.
7
If actual fact, this case already occurs for u < √2N : although the second approximation
for v is then still positive, the higher approximations push it into the negative.
9
To actually obtain this solution, we set f1 = f − 2²
N
, and replace sin f = f ,
cos f = 1 and cot f = f1 because of the small size of f . Then we have
ϕ 2 2 8²
2 cot = 2² − 2² = , (8a)
2 f− N
f+ N
Nf2
2ϕ = 2π(λ2 − λ1 ) − N (f2 − f1 ) = 2π − 4², (11a)

ϕ
cot = tan ²,
2
tan ² 4
= . (22)
² Nf2
This determines ², where ² < π2 , and also ϕ > 0 and N f1 > 2πλ1 .]
We have now determined, for each value of λ, an additional solution with
real or complex wave numbers. The highest allowed value for λ is clearly
N − 2 with λ1 = λ2 = N − 1; for λ = N − 1 we would have λ2 = N which is
outside the range of values for λi 8 .
The coefficients a(m1 , m2 ) become, for our complex solutions, according
to (9), (12) and (16a),
³ ´
iu(m1 +m2 ) v(m1 −m2 + 12 N ) v(m2 −m1 − 12 N )
a(m1 , m2 ) = e e ±e ,
cosh £ ¡ 1 ¢¤
a(m1 , m2 ) = eiu(m1 +m2 ) v 2 N − (m2 − m1 ) . (23)
sinh
Here cosh resp. sinh applies depending on whether λ (resp. N + λ for
λ > N/2) is even or odd. It is thus for these solutions most probable that the
two right-spins are close together, as the probability |a(m1 , m2 )|2 decreases
exponentially in the distance m2 − m1 . The most extreme case occurs for the
solution λ = 12 N , u = 21 π, v = ∞: Here, upon appropriate normalisation,
(
0 if m2 6= m1 + 1,
a(m1 , m2 ) = m1
(−1) if m2 = m1 + 1,
so the right-spins are always direct neighbours.

Each eigenvalue ² with two complex conjugate wave numbers is, as we
shall see shortly, smaller than all eigenvalues with the same wave number
k and real wave numbers. By (2), the corresponding energy to first order
lies deeper than all solutions with real wave numbers in case the exchange
8
The solution λ1 = N − 1, λ2 = N has already been counted as λ2 = 0, λ1 = N − 1.
10
integral J is positive (ferromagnetic case), but higher when J is negative
(normal case).
Indeed, for the complex solutions,
²k = sin2 u, (19)
and for real solutions
²k = 1 − cos f1 + 1 − cos(k − f1 ). (5)
(5) is minimal for (

f1 = 12 k, if 0 ≤ k ≤ π,
f1 = 12 k + π if π ≤ k ≤ 2π,
so in general for f1 = f2 = u. The minimum is
²min = 2(1 − cos u),
so that
²k 1
= (1 + cos u) ≤ 1, (24)
²min 2
where the equality sign only applies if u = 09 . But this was exactly our
claim.
§5. We now turn to the general case of r right-oriented spins. The
equations (1) again split into different types:
If among the r spins m1 , . . . , mr there are no neighbours, then
r
X
−2 ² a(m1 , . . . , mr ) = [a(m1 , . . . , mi + 1, . . . , mr )
i=1
+ a(m1 , . . . , mi − 1, . . . , mr ) − 2a(m1 , . . . , mi , . . . , mr )].
(24-a)
If there are two neighbours, say mi+1 = mi + 1, then we have instead,
−2 ² a(m1 , . . . , mi , . . . , mk , mk + 1, . . . , mr ) = a(. . . mk − 1, mk + 1, . . . )
+ a(. . . mk mk + 2 . . . ) − 2a(. . . mk mk + 1 . . . )
X
+ [a(. . . mi + 1 . . . ) + a(. . . mi − 1 . . . ) − 2a(. . . mi . . . )] (24-b)
i6=k,k+1
9
In fact f1 can never be exactly equal to f2 (cf. §3), so in truth the <-sign always
applies.
11
and analogously for the case of more neighbouring spins. We now make the
“Ansatz” (hypothesis)
r!
" r #
X X 1 X
a(m1 . . . mr ) = exp i fP k m k + i ϕP k,P l , (25)
P =1 k=1
2 k<l
r
X
²= [1 − cos fk ]. (26)
k=1
P is some permutation of the numbers 1, 2, . . . , r and P k denotes the number

that this permutation puts in place of k. This hypothesis clearly satisfies the
first set of equations (24-a). The remaining equations will be satisfied by
requiring that
2a(m1 , . . . , mk mk + 1, . . . , mr ) = a(. . . mk mk . . . ) + a(. . . mk + 1 mk + 1 . . . ),

(27)
where the terms on the right-hand side are defined by (25). (27) must hold
for arbitrary multiplets of values m1 , . . . , mr , where arbitrary many mi are
neighbours – provided only that m1 < m2 < · · · < mr . Thus all equations
(24-b) and those in which the distributions m1 , . . . , mr have more than two
neighbouring right-spins are satisfied simultaneously. Indeed, these equations
are reduced (by the substitution (27)) to the type (24-a), which has already
been solved. The equations (27) themselves are satisfied by determining the
phases ϕ from
ϕkl fk fl
2 cot = cot − cot ; −π ≤ ϕkl ≤ +π. (28)
2 2 2
That this is the case can be seen completely analogously to §3.
It remains to consider the periodicity condition
a(m1 , m2 , . . . , mr ) = a(m2 , . . . , mr , m1 + N )
" r
#
X X 1 X
exp i fP k m k + i ϕP k,P l
P k=1
2 k<l
" Ã r !#
X X X
= exp i fP 0 (k−1) mk + fP 0 r (m1 + N ) + ϕP 0 k,P 0 l .
P0 k=2 k<l
This must hold for all m1 , . . . , mr ; therefore each pair of terms on the left and
right, which depend on the mk in the same way, must be equal individually.
12
For example, the term P on the left and the term P 00 on the right, where P 00
is defined by
P 00 (k − 1) = P k (k = 2, . . . , r), P 00 r = P 1.
This yields
1X 1X
N fP 00 r + ϕP 00 k,P 00 l − ϕP k,P l = 2πλ
2 k<l 2 k<l
r−1
1 X 1X
= N fP 1 + ϕP (k+1),P (l+1) + ϕP (k+1),P 1
2 k<l≤r−1 2 k=1
r
1 X 1X
− ϕP k,P l − ϕP 1,P k
2 2≤k<l 2 k=2
r
X
= N fP 1 − ϕP 1,P k ,
k=2
where we used the fact that ϕkl = −ϕlk . Since this holds for all P it follows
that X
N fi = 2πλi + ϕik (29)
k
for all i.
Completely analogously to §3, one can also show that any two fi can
never be equal, as otherwise all coefficients a vanish, and hence that for real
fi two subsequent λi must differ by at least two. The number of solutions
with real fi thus becomes µ ¶
N −r+1
r
¡ ¢
and is therefore much smaller than the required number of solutions Nr .
§6. If fk = uk + i vk is a complex wave number then it follows from
X
N fk = 2πλk + ϕkl
l
that at least one of the ϕkl must have a very large imaginary part of the
order N . This implies in first approximation (§4)
2 cot 12 ϕkl = cot 12 fk − cot 21 fl = −2i.
There must therefore be a fl such that the real part of cot 12 fl is the same as
that of cot 12 fk , while their imaginary parts differ by 2i (up to a quantity of
13
order e−N ). This leads to the following solution, which we shall refer to as a
wave complex:
n wave numbers are determined by the identities
cot 12 fκ = a − i κ; κ = −(n − 1), −(n − 3), . . . , n − 1, (30)
where a is a constant which is the same for each of the n wave numbers. We
then clearly have
ϕκ,κ±2 = ψ ∓ i ∞,
while the other ϕκ,λ have a finite imaginary part. ψ remains undetermined.
Using (15) we obtain
sin uκ
= a,
cosh vκ − cos uκ
sinh vκ
= κ,
cosh vκ − cos uκ
the solutions of which are
(
2a a a
uκ = arctan a2 +κ 2 −1 = arc cot κ+1 − arc cot κ−1 ,
2κ
(31)
arctanh vκ = a2 +κ 2 +1 ,
(κ + 1)2 + a2
e−2vκ = . (32)
(κ − 1)2 + a2
Here sin u always has the same sign as a.
We now claim that a can be expressed in terms of the total wave number
of the wave complex
n−1
X X
k= fκ = uκ (33)
κ=−(n−1) κ
in the simple form

a = n cot 12 k. (34)
For n = 1 this is evident, and for n = 2 it follows by substitution of the
previously obtained solution (18) (cf. §4):
e−v = cos u, u = 12 k resp. 12 k + π,
sin u sin u
a= = 1
¡ 1
¢ = 2 cot u = 2 cot 21 k.
cosh v − cos u 2 cos u
+ cos u − cos u
14
On the other hand, at constant a, the wave numbers uκ for a complex of n
waves are the same as those for n − 2 waves; there are simply two additional
wave numbers un−1 = u−(n−1) , so that
1
k
2 n
= 12 kn−2 + un−1 (33a)
If we now assume that (34) has been proven for n − 2, then we have
1 a a a
k
2 n
= arc cot + arc cot − arc cot
n−2 n n−2
a
= arc cot .
n
We also claim: The eigenvalue corresponding to our wave complex is
1
²n = (1 − cos k). (35)
n
Again, this is obvious for n = 1, while it was proved in (19) for n = 2. In
general, we have
n−1
X
²n = [1 − cos(uκ + i vκ )]
κ=−(n−1)
= ²n−2 + 2 − cos(un−1 + i vn−1 ) − cos(un−1 − i vn−1 )

= ²n−2 + 2 (1 − cos un−1 cosh vn−1 )
µ ¶
[a2 + (n − 1)2 − 1][a2 + (n − 1)2 + 1]
= ²n−2 + 2 1 −
[a2 + (n − 1)2 + 1]2 − 4(n − 1)2
a2 − n(n − 2)
= ²n−2 + 4 2 ,
(a + n2 )(a2 + (n − 2)2 )
where we have used (31) and (32). If we now assume that (35) is valid for
n − 2, then we get, with the help of (34),
1 1 a2 − n(n − 2)
²
2 n
= ³ ´ +2
(n − 2) 1 + a2 (a2 + n2 )(a2 + (n − 2)2 )
(n−2)2
(n − 2)(a2 + n2 ) + 2(a2 − n(n − 2))

=
(a2 + n2 )(a2 + (n − 2)2 )
n 1
= 2 2
= (1 − cos k).
a +n n
Finally, we want to prove, analogously to §4: Given the number of right-
oriented spins r and the total wave number k of all spin waves, the smallest
15
eigenvalue ² is obtained when all r spin waves are combined in a single wave
complex. The eigenvalue is then given by
1
²r = (1 − cos k).
r
If, instead, there are two wave complexes with n and p = r − n wave resp.,
then
1 1
²p+n = (1 − cos k1 ) + (1 − cos(k − k1 )).
n p
The minimum of this expression is attained when
1 1
sin k1 = = sin(k − k1 ),
n p
n sin k
sin k1 = p .
n2 + 2np cos k + p2
It is p
n+P − n2 + p2 + 2np cos k
²min = .
np
Now, p
(n + p) n2 + p2 + 2np cos k < (n + p)2 − np(1 − cos k),
as can be verified easily by squaring. This implies immediately that
²r < ²min . (36)
If the spin waves constitute more than two wave complexes then ² lies even
higher of course. The state of lowest energy in case of r right-spins is there-
fore, when J > 0 (ferromagnetic case): a single wave complex of r spins;
when J < 0 (normal case): r separate waves with real wave numbers. Of
course, in the latter case, this does not fully determine the lowest energy
solution. It is easy to compute the second approximation for the wave num-
bers of a wave complex, where the u and v in the formulas (31) and (32)
are slightly modified to satisfy the true periodicity condition (29). The cal-
culation proceeds in an analogous fashion to §4, and one finds, in general,
that for finite k, there is always a solution in the immediate neighbourhood
of (31) and (32), whereas for small k of the order √1N , the solution changes
its character when N2πk is not divisible by n. Instead of a complex of three
spin waves one then finds, for example, a pair of complex conjugate waves
as described in §4 (with even λ), together with a single wave with a real
wavenumber in a very close neighbourhood. The total number of solutions
is not affected by this transformation in appearance: There is precisely one
16
solution for each λ = 0, 1, 2, . . . , N − n; the latter value corresponding to
λ−(n−1) = λ−(n−3) = · · · = λn−1 = N − 1. For higher values of λ, one or more
λκ are equal to N , which is not allowed.
From now on, however, we want to also exclude λi = 0 in general. First of
all, we gain in symmetry as a result. Moreover, this automatically separates
the solutions for which the total spin equals m = 12 N − r from those in which
only the left-oriented component of the spin has this value, while the total
spin itself has a higher value. The latter states, namely, are given precisely
by those solutions for which one or more among the r wave numbers equals
zero. Thus there remain only N − 2n + 1 solutions with a wave complex of
n spin waves: λ = n, n + 1, . . . , N − n.
§7. We now assume that there are two wave complexes with n and p(> n)
spin waves respectively, and investigate what number of solutions can be
obtained this way. For this, we need to discuss the phases ϕ.
Let the wavenumbers of the first complex be given by
cot 12 fκ = a − i κ, κ = −(n − 1), −(n − 3), . . . , n − 1

X
a = n cot 12 k1 , k1 = fκ , (37-a)
κ
and the second by
cot 12 fµ = b − i µ, µ = −(p − 1), −(p − 3), . . . , p − 1

X
b = p cot 12 k2 , k2 = fµ . (37-b)
µ
Then we have, by (28), (29) and (31),

XX
N k1 = 2πλ1 + ϕκ,µ ,
κ µ
XX
N k2 = 2πλ2 − ϕκ,µ ,
κ µ
(38)
1 1
cot ϕκ,µ = cot(ψκ,µ + i χκ,µ ) = (a − b) − i(κ − µ),
2 2
a−b
tan ψκ,µ = 1 .
4
(a − b) + 41 (κ − µ)2 − 1
2
P P
Sign of ψκ,µ = sign of a−b, κ µ χκ,µ = 0 since the fκ and fµ are arranged in
complex-conjugate pairs. The ψκ,µ are zero when k1 is very small, a large, and
they become positive with increasing k1 as long as a > b. We are particularly
interested in its value when a approaches b and eventually becomes smaller
17
than b, so that we can determine how many values the integers λ1 , λ2 cannot
have. To this end, we keep λ0 = N2πk2 fixed10 and define λ0 by
πλ0 πλ0 π(λ0 + 1)

n cot > p cot > n cot . (39)
N N N
For N k1 = 2πλ0 , a − b is then clearly still positive, and small of order 1/N .
Hence tan ψκ,µ is small and positive when |κ − µ| > 2, small and negative
when |κ − µ| < 2, and very large and positive when |κ − µ| ≈ 2.
The latter follows from the fact that κ and µ only differ from integers by
amount of order e−N ¿ 1/N 2 (cf. §4), so that 14 (κ − µ)2 − 1 ¿ (a − b)2
whenever |κ − µ| is close to 2. We conclude that, up to quantities of order
1/N , 

0 for |κ − µ| > 2,
ψκ,µ = π for |κ − µ| < 2, (40)

1
2
π for |κ − µ| = 2.
If we first assume that p − n is odd, then, for a given κ, there are exactly
two values µ = κ + 1 and µ = κ − 1, for which ψκ,µ does not vanish, but has
the value π. Hence, in this case
XX
ψκ,µ = 2πn. (41)
κ µ
On the other hand, if p − n is even, then, for each κ there exist three µ for
which ψκ,µ 6= 0:
µ = κ, ψκ,µ = π,
1
µ = κ + 2, ψκ,µ = π,
2
1
µ = κ − 2, ψκ,µ = π.
2
In total, we have again XX
ψκ,µ = 2πn.
κ µ
Thus we have
N k1
λ1 = − n = λ0 − n,
2π
λ2 = λ0 + n. (42-a)
10 2π
We shall see in the following that for a ≈ b, k1 and k2 are indeed of the form N times
an integer, so that λ0 = an integer.
18
Similarly, for N k1 = 2π(λ0 + 1),
XX
ψκ,µ = −2πn,
κ µ
(42-b)
λ1 = λ0 + 1 + n,
λ2 = λ0 − n.
The possible values for λ1 are therefore
λ1 = n, n + 1, . . . λ0 − n, λ0 + n + 1, . . . , N − n. (42-c)
The 2n values λ0 − n + 1, . . . , λ0 + n are forbidden due to the presence of the

other spin complex. In the same way one can see that if λ0 is small then,
in general, b > a and hence λ2 = λ0 − n. But λ2 must be at least p (see
the end of the previous section), so λ0 > p + n. Similarly, λ0 ≤ N − p − n,
and it follows that λ0 can take exactly N − 2n − 2p + 1 values. Once again,
this is 2n values fewer than if the other complex were not present. Here it is
important that in both cases it is the number n of waves in the smaller of
the two complexes which appears. The total number of solutions is therefore
(N − 2n − 2p + 1)(N − 4n + 1).
It remains to consider the case n = p. In this case, for κ = n − 1, one

of the partners µ = κ + 2 is absent, which yielded a contribution ψκ,µ = 12 π
previously, and similarly, for κ = −(n − 1), the partner µ = κ − 2 is absent.
In this case, therefore,
XX
ψκ,µ = (2n − 1)π. (43)
κ µ
0
Moreover, there is now just one λ0 such that n cot πλN
= p cot πλ
N
0
, namely
0
λ = λ0 . This does not lead to a solution, however, since fκ = fµ whenever
κ = µ, and we know that the eigenfunction vanishes when two wave numbers
are equal. At most, therefore, N k1 = 2π(λ0 − 1), and that yields
1
λ1 = λ0 − − n,
2
1
λ2 = λ0 − + n.
2
19
Similarly, for N k1 = 2π(λ0 + 1),
XX
ψκ,µ = −(2n − 1)π,
κ µ
1
λ1 = λ0 + + n,
2
1
λ2 = λ0 + − n.
2
λ0 is in this case clearly a half-integer number. In the collection of values
for λ1 , 2n numbers are again missing: λ0 − n + 21 , . . . , λ0 + n − 12 , but in the
collection of possible values for λ0 only 2n − 1: λ0 must be at least 2n − 12
(λ2 = n), and can be at most N − 2n + 21 (λ2 = N − n), i.e. N − 4n + 2 values
instead of the N − 2n + 1 possible values if there were only a single complex
with n waves. Interchanging λ1 and λ2 does not change the solution, so the
total number of solutions is
1
(N − 4n + 2)(N − 4n + 1).
2
[The behaviour becomes perhaps even clearer if, for the moment, we nor-
malise ψ differently: Let ψ 0 be defined in such a way that it agrees with
ψ for large a, but remainsP continuous
P 0 at a = b. Then, in the case of two
complexes with n waves, κ µ ψκ,µ increases from zero to (2n − 1)2π when
k1 increases from 2π
N
n to 2π
N
(N − n) at constant k2 . Now, if
XX
2πλ01 = N k1 − 0
ψκ,µ
κ µ
then λ01 obviously takes all values n to N − 3n + 1, i.e. N − 4n + 2 values,

whatever the value of λ2 . For λ2 , on the contrary, one more value has to be
excluded, namely that which would lead to k1 = k2 .]
We now consider the general case, in which there are qn complexes, each
with n waves, i.e. q1 single waves with real wave numbers, q2 pairs with
conjugate complex wave numbers, etc. The constant λ1 of the first complex
with n waves would have the possible values n, n + 1, . . . , N − n, a total of
N −2n+1 possible values, namely when no other wave complex were present.
But, for every complex with p > n waves, this collection of values is reduced,
as we have seen, by 2n numbers, and for each complex with p > n waves
by 2p, and finally, for the qn − 1 complexes with n waves, by 2n − 1. There
remain
X X
Q0n = N − 2n + 1 − 2 p qp − 2 n qp − (2n − 1)(qn − 1)
p<n p>n
20
possible values for λ1 . The constant λ2 of the second complex of n waves,
must in addition avoid k2 being equal to k1 and hence has one possibility
less. For the last of the complexes of n waves, λqn can take just
Q0n − (qn − 1) = Qn + 1
values, where
X X
Qn (N, q1 , q2 , . . . ) = N − 2 p qp − 2 n qp . (44)
p<n p≥n
We finally have to take into account that interchanging λ’s for different wave
complexes with equal numbers n of waves does not lead to a new solution.
The total number of solutions therefore becomes
Y∞ µ ¶
(Qn + qn ) . . . (Qn + 1) Y Qn + qn
z(N, q1 , q2 , . . . ) = = . (45)
n=1
qn ! n
qn
§8. We shall now prove that we have found the right number of solutions.
It is well-know that the number z(r) of eigenvalues with given total spin
s = 21 N − r is equal to the number of eigenvalues with total spin-component
of left-oriented spins m = s minus those with m = s + 1, i.e.
µ ¶ µ ¶ µ ¶
N N N − 2r + 1 N
z(N, r) = − = . (46)
r r−1 N −r+1 r
We must therefore have

X
z(N, q1 , q2 , . . . ) = z(N, r), (47)
q1 ,q2 ,...
where the sum on the right-hand side runs over all values q1 , q2 , . . . for which
the total number of spin waves equals r, i.e.
X
q1 + 2q2 + 3q3 + · · · = n qn = r.
In other words, we have to sum over all “partitions” of the number r; qn

indicates how often the summand n occurs in the partition.
We introduce the total number of spin complexes:
X
q= qn (48)
n
21
and rewrite (44):
X X
Qn (N, q1 , q2 , . . . ) = N − 2q − 2 (p − 1)qp − 2 (n − 1)qp
p<n p≥n
= Qn−1 (N − 2q, q2 , q3 , . . . ). (49)
In particular,
Q1 (N, q1 , q2 , . . . ) = N − 2q. (49a)
Hence, using (45),
µ ¶
N − 2q + q1
z(N, q1 , q2 , . . . ) = · z(N − 2q, q2 , q3 , . . . ). (50)
q1
In the right-hand side is, apart from the binomial coefficient, the number
of solutions with q2 single-spin waves, and in general qn complexes of n − 1
waves, in a chain of N − 2q atoms. These solutions obviously contain a total
X
r0 = qn (n − 1) = r − q (50-a)
n
of right-spins, arranged in
∞
X
0
q = qn = q − q1 (50-b)
n=2
wave complexes. We now also introduce a notation for the number of solu-
tions in which r right-spins are organised in q wave complexes, irrespective
of how many waves each complex contains:
X
z(N, r, q) = z(N, q1 , q2 , . . . ). (51)
q1 +q2 +q3 +···=q
q1 +2q2 +3q3 +···=r
Then we have, from (50), (50a) and (50b),

q−1 µ ¶
X N − 2q + q1
z(N, r, q) = z(N − 2q, r − q, q − q1 ), (52)
q =0
q1
1
and r
X
z(N, r) = z(N, r, q). (53)
q=1
22
From this point, we treat the problem using mathematical induction. We
make the conjecture
µ ¶µ ¶
N − 2r + 1 N − r + 1 r − 1
z(N, r, q) = . (54)
N −r+1 q q−1
For q = 1 this is certainly correct: We then have a single complex of r waves,
whose wave number can take N − 2r + 1 values. It is also correct for q = r:
then q1 = r and qn = 0 for n > 1, and (54) is the same as (45). We now
assume that (54) has been proven for N −2q, r−q, q−q1 , and have, according
to (52),
q−1 µ ¶µ ¶
X N − 2q + q1 N −r−q+1
z(N, r, q) =
q1 =0
q1 q − q1
µ ¶
r − q − 1 N − 2r + 1
× .
q − q1 − 1 N − r + 1
Now,
µ ¶ Xq1 µ ¶µ ¶
N − 2q + q1 r − 1 N − 2q + q1 + 1 − r
= ,
q1 s=0
s q1 − s
so
q−1 q1
X X (N − r − q + 1)! (Nr − 2q + q1 + 1)!
z(N, r, q) =
q1 =0 s=0
(q − q1 )!(N − r − 2q + q1 + 1)! (q1 − s)!(Nr − 2q + s + 1)!
µ ¶µ ¶
N − 2r + 1 r − 1 r−q−1
×
Nr + 1 s q − q1 − 1
q−1 µ ¶µ ¶
N − 2r + 1 X r − 1 N − r − q + 1
=
N − r − q + 1 s=0 s q−s
q−1 µ ¶µ ¶
X q−s r−q−1
×
q s
q1 − s q − q1 − 1
=
N − 2r + 1 X (r − 1)! (r − s − 1)!
=
N − r − q + 1 s s!(r − s − 1)! (q − s − 1)!(r − q)!
µ ¶
N −r−q+1
×
q−s
µ ¶Xµ ¶µ ¶
N − 2r + 1 r − 1 q−1 N −r−q+1
=
N −r−q+1 q−1 s s q−s
µ ¶µ ¶
N − 2r + 1 r − 1 N − r
= ,
N −r−q+1 q−1 q
23
which is identical to (54). Since (54) holds for q = 1 and for q = 2, r = 2,
and trivially also for q = 2, r = 1, its validity can also be proved for q = 2,
r = 3 or 4, then for q = 2 and larger values of r, finally for q = 3, 4, etc.
Inserting (54) into (53) gives
r µ ¶µ ¶
N − 2r + 1 X N − r + 1 r − 1
z(N, r) =
N − r + 1 q=1 q r−q
µ ¶
N − 2r + 1 N
= ,
N −r+1 r
which corresponds with the required number of solutions (46). Our method
therefore yields all solutions of the problem.
In future work this method will be extended to spatial lattices, and the
physical consequences vis-a-vis cohesion, ferromagnetism and conductivity
will be derived.
Acknowledgement. I thank Prof. Fermi very much for the many
lively discussions. I also thank the Rockefeller Foundation for providing
a stipendium, which enabled my visit to Rome.
Rome, 13 June 1931.
24

On The Theory of Metals - H. Bethe

Загружено:

Сведения о документе

Оригинальное название

Авторское право

Доступные форматы

Поделиться этим документом

Поделиться или встроить документ

Параметры публикации

Этот документ был вам полезен?

Это неприемлемый материал?

Авторское право:

Доступные форматы

On The Theory of Metals - H. Bethe

Загружено:

Авторское право:

Доступные форматы

On the Theory of Metals.

I. Eigenvalues and eigenfunctions of a linear chain of atoms

where the numbers m1 , . . . , mr run through the values 1 to N , and we will

Wm1 ,...mr , m1 ...mr = E0 − N 0 J.

E0 is the interaction due to the charged clouds of the atoms; J is the

a(m1 . . . mi . . . mr ) = a(m1 . . . mi + N . . . mr ). (3)

§3. For r = 1, the solution of (1) reads, as is well-known,

−2 ² a(m1 m2 ) = a(m1 + 1, m2 ) + a(m1 − 1, m2 ) + a(m1 , m2 + 1)

or they are nearest neighbours:

−2 ² a(m1 , m1 + 1) = a(m1 − 1, m1 + 1) + a(m1 , m1 + 2)

The first group of equations is solved rigorously by setting

a(m1 m2 ) = c1 ei(f1 m1 +f2 m2 ) + c2 ei(f2 m1 +f1 m2 ) ,

0 = a(m1 m2 ) + a(m1 + 1, m1 + 1) − 2a(m1 , m1 + 1). (6)

Here, a(m1 , m1 ) has no physical meaning, but is defined by (5). Clearly, by

a(m1 , m2 ) = a(m2 , m1 + N ). (10)

[The notation a(m2 , m1 + N ) is suggested by the fact that we want to order

Interchanging f1 and f2 obviously leads to a change of sign for ϕ, while the

Conclusion: f1 = f2 does not lead to a meaningful solution of the problem,

Moreover, since f1 , f2 and f2 , f1 yield the same solution, we may assume

N (f1 − f2 ) = 2N i v = 2π(λ1 − λ2 ) + 2ϕ,

² = 2 − cos(u + iv) − cos(u − iv)

² = sin2 u = 21 (1 − cos 2u). (19)

² = ± tan2 u e−N v0 . (21)

For even λ (resp. N + λ, positive sign), therefore, v > v0 ; hence, if in the

and if we put f1 = f2 = k2 , then

2ϕ = 2π(λ2 − λ1 ) − N (f2 − f1 ) = 2π − 4², (11a)

so the right-spins are always direct neighbours.

and for real solutions

²k = 1 − cos f1 + 1 − cos(k − f1 ). (5)

(5) is minimal for (

²min = 2(1 − cos u),

If there are two neighbours, say mi+1 = mi + 1, then we have instead,

P is some permutation of the numbers 1, 2, . . . , r and P k denotes the number

2a(m1 , . . . , mk mk + 1, . . . , mr ) = a(. . . mk mk . . . ) + a(. . . mk + 1 mk + 1 . . . ),

2 cot 12 ϕkl = cot 12 fk − cot 21 fl = −2i.

cot 12 fκ = a − i κ; κ = −(n − 1), −(n − 3), . . . , n − 1, (30)

in the simple form

e−v = cos u, u = 12 k resp. 12 k + π,

= ²n−2 + 2 − cos(un−1 + i vn−1 ) − cos(un−1 − i vn−1 )

(n − 2)(a2 + n2 ) + 2(a2 − n(n − 2))

²r < ²min . (36)

cot 12 fκ = a − i κ, κ = −(n − 1), −(n − 3), . . . , n − 1

and the second by

cot 12 fµ = b − i µ, µ = −(p − 1), −(p − 3), . . . , p − 1

Then we have, by (28), (29) and (31),

πλ0 πλ0 π(λ0 + 1)

The 2n values λ0 − n + 1, . . . , λ0 + n are forbidden due to the presence of the

It remains to consider the case n = p. In this case, for κ = n − 1, one

then λ01 obviously takes all values n to N − 3n + 1, i.e. N − 4n + 2 values,

We must therefore have

In other words, we have to sum over all “partitions” of the number r; qn

Then we have, from (50), (50a) and (50b),

Вам также может понравиться