Академический Документы
Профессиональный Документы
Культура Документы
Volume 108
Editors
J.E. Marsden L. Sirovich
Advisors
s. Antman J.K. Hale P. Holmes
T. Kambe J. Keller K. Kirchgiissner
B.J. Matkowksy C.S. Peskin
With 56 Illustrations
Springer
Eberhard Zeidler
Max-Planck-Institut fur Mathematik
in den Naturwissenschaften
InselstraBe 22-26
D-04103 Leipzig
Germany
Editors
J.E. Marsden L. Sirovich
Control and Division of
Dynamical Systems, 107-81 Applied Mathematics
California Institute of Technology Brown U niversity
Pasadena, CA 91125 Providence, RI 02912
USA USA
I am not able to learn any mathematics unless I can see some problem
I am going to solve with mathematics, and I don't understand how
anyone can teach mathematics without having a battery of problems
that the student is going to be inspired to want to solve and then
see that he or she can use the tools for solving them.
Steven Weinberg
(Winner of the Nobel Prize in physics in 1979)
The more I have learned about physics, the more convinced I am that
physics provides, in a sense, the deepest applications of mathematics.
The mathematical problems that have been solved, or techniques
that have arisen out of physics in the past, have been the lifeblood
of mathematics .... The really deep questions are still in the physical
sciences. For the health of mathematics at its research level, I think
it is very important to maintain that link as much as possible.
Albert Einstein
I think that time is ripe for such an approach. From a general point of view,
functional analysis is based on an assimilation of analysis, geometry, alge-
bra, and topology. The applications to be considered concern the following
topics:
ordinary differential equations (initial-value problems, boundary-eigen-
value problems, and bifurcation);
linear and nonlinear integral equations;
variational problems, partial differential equations, and Sobolev spaces;
optimization (e.g., Cebysev approximation, control of rockets, game the-
ory, and dual problems);
Fourier series and generalized Fourier series;
the Fourier transformation;
generalized functions (distributions) and the role of the Green function;
partial differential equations of mathematical physics (e.g., the Laplace
equation, the heat equation, the wave equation, and the Schrodinger equa-
tion);
time evolution and semigroups;
the N-body problem in celestial mechanics;
capillary surfaces;
minimal surfaces and harmonic maps;
superfluids, superconductors, and phase transition (the Landau-Ginz-
burg model);
viscous fluids (the Navier-Stokes equations);
Preface ix
(i) translating the given concrete problem into the language of functional
analysis;
x Preface
(Au)(x) := u(x) -l b
A(x, y)¢(u(y), y)dy for all x E [a, b]. (Op)
(Au)(x) =1 -l b
A(x, y)¢(l, y)dy for all x E [a, b].
We callilull the length of the vector (function) u. This way, era, b] becomes
a so-called Banach space. In the special case where ¢(u, y) == u, the integral
equation (E int ) is said to be linear. Generally, the importance of nonlinear
problems stems from the fact that they describe processes in nature with
interactions.
In contrast to the integral equation (E int ), the operator equation (E) may
also correspond to the following boundary-value problem for a differential
equation:
u"(x) + c(x)u(x) = f(x), a S x S b,
corresponds to the original operator equation (E), too. Here we define the
operator A through
for all u E X,
lb L(x,u(x),u'(x))dx = min!,
(Mvar)
u(a) = u(b) = 0 (boundary condition),
provided we set
represent basic problems in game theory and duality theory. This will be
shown in Chapter 2 of AMS Vol. 109.
Functional analysis also establishes a calculus for linear operators. For
example, let us consider the abstract differential equation
u(t) = etAuo.
It is the goal of the theory of semigroups to give the formal symbol etA a
rigorous meaning. Equation (D) describes many time-dependent processes
in nature. It turns out that if (D) corresponds to an irreversible process in
nature (e.g., diffusion or heat conduction), then the symbol etA only makes
sense for time t 2: o.
Let us briefly discuss the contents of the present AMS Vol. 108 and of
AMS Vol. 109.
Chapter 1 concerns Banach spaces. For the convenience of the reader,
the most important notions of functional analysis are explained in terms of
the simple space era, b] of continuous functions without using the Lebesgue
integral. This way, the first chapter may serve as a quite elementary intro-
duction to functional analysis. The applications to be studied in Chapter
1 concern existence proofs for ordinary differential equations as well as for
linear and nonlinear integral equations. Here, we will use the two most im-
portant fixed-point theorems due to Banach and Schauder. We also justify
the following fundamental principle in mathematics:
A priori estimates yield existence.
In an abstract functional analytic setting, this principle was established by
Leray and Schauder in 1934.
Riemann's famous Dirichlet principle stands at the beginning of Chapter
2, which is devoted to Hilbert space theory. We give an elegant functional
Preface xiii
and
I: J(x)8(x)dx = J(O) . (mass at 0) = J(O). (III)
I:
Using the substitution x := z - y and g(z) := J(z - y), we also get
I:
Set u(x) := 8(x - y). Applying (IV) to the Fourier transformation
e- iky = 100
e- ikX 8(x - y)dx for all k, y E lR (V)
I:
-00
and
8(x - y) = (27r)-1 eik(x-y)dk for all x, y E R (VI)
ysis and Its Applications by the same author. In particular, Vols. 4 and 5
contain a detailed motivation of the basic equations in classical and modern
mathematical physics along with both abstract existence proofs and inter-
esting applications to concrete problems in physics, chemistry, biology, and
economics.
The representation takes into account that in general no book is read
completely from beginning to end. We hope that even a quick skimming of
the text will suffice to grasp the essential contents. To this end, we recom-
mend reading the introductions to the individual chapters, the definitions,
the "theorems" (without proofs), and the examples (without proofs) as well
as the motivations and comments in the text, which point out the meaning
of the specific results. The proofs are worked out in great detail. Grasping
the individual steps in the proofs as well as their essential ideas is made
easier by the careful organization. It is a truism that only a precise study
of the proofs enables one to penetrate more deeply into a mathematical
theory.
Readers have the following two options:
(i) Those who want to become acquainted as quickly as possible with the
Hilbert space approach to mathematical physics and numerical anal-
ysis can immediately begin with Chapter 2 after glancing at the last
section of Chapter 1, which summarizes important notions concerning
Banach spaces.
Preface vii
Prologue xix
Epilogue 425
Appendix 429
xxvi Contents
References 443
Preface
5 Fredholm Operators
5.1 Duality for Linear Compact Operators
5.2 The Riesz-Schauder Theory On Hilbert Spaces
5.3 Applications to Integral Equations
5.4 Linear Fredholm Operators
5.5 The Riesz-Schauder Theory on Banach Spaces
5.6 Applications to the Spectrum of Linear Compact
Operators
5.7 The Parametrix
5.8 Applications to the Perturbation of Fredholm Operators
5.9 Applications to the Product Index Theorem
5.10 Fredholm Alternatives via Dual Pairs
5.11 Applications to Integral Equations and Boundary-Value
Problems
5.12 Bifurcation Theory
5.13 Applications to Nonlinear Integral Equations
5.14 Applications to Nonlinear Boundary-Value Problems
5.15 Nonlinear Fredholm Operators
5.16 Interpolation Inequalities
5.17 Applications to the Navier-Stokes Equations
References
Subject Index
1
Banach Spaces and Fixed-Point
Theorems
lim
n->oo
Un =U
by means of
lim
n->oo
Ilu n - ull = o.
2 1. Banach Spaces and Fixed-Point Theorems
]RN era, b]
"'/
I Banach space I - Cauchy convergence criterion
normed space
_ convergence and boundedness
(norm iiuii)
~
linear space
- dimension and convexity
(linear combination au + (3v)
FIGURE 1.1.
C[a, b],
which consists of all continuous functions u: [a, b] -+ lR along with the norm
Ilull := max
a::;x::;b
lu(x)l, where - 00 < a < b< 00.
Figure 1.1 shows the relations between Banach spaces and other impor-
tant notions. For example, Figure 1.1 tells us that each Banach space is
also a normed space, etc. In this chapter we will prove the two fundamental
norm
k - contraction A
compact operator A
(11Au - Avll ::; kllu - vii, 0::; k < 1)
~
Banach fixed-point theorem Schauder fixed-point theorem
(Au = u) (Au = u)
FIGURE 1.2.
convex set
/
Brower fixed-point theorem in ]RN
1,"mpoctn~'
+
FIGURE 1.3.
u+v, u,v E X
o:u, 0: E OC, u E X,
are defined elements of X such that, for all u, v, W E X and a, {3 E lK, the
following are true:
u+v =v+u, (u + v) + W = u + (v + w),
(a + (3)u = au + (3u, a (u + v) = au + av,
a({3u) = (a{3)u, au = u if a = 1.
Furthermore, there exists exactly one element () in X such that
(2)
() := 0 and v:= -u
in (1) and (2), respectively. The following proposition shows that this con-
vention makes sense. We also write w - u instead of w + (-u).
au = a( u + ()) = au + a()
and the unique solvability of the equation au = au + v that v = a() = ().
Ad (ii). Since
au = (a + O)u = au + Ou,
we get Ou = ().
Ad (iii). It follows from
() = Ou = (a + (-a))u = au + (-a)u
Example 4. Let X := ]I{N, where N = 1,2, ... ; that is, the set X consists
of all the N-tuples
Define
where -00 < a < b < 00. For u, v E C[a, b] and a E JR., let
u+v and au
(u + v)(x) = u(x) + v(x) and (au)(x) = au(x) for all x E [a, b].
Since the sum and the product of two continuous functions are again con-
tinuous, we get
Definition 6. Let X be a linear space over oc. The elements Ul, ... , UN of
X are called linearly independent iff
dimX=N
6 1. Banach Spaces and Fixed-Point Theorems
au =0 implies a=O.
The following result is well known from the course on linear algebra.
Proof. Set
in Ora, bj
that
for all x E [a, bj.
1.2 Normed Spaces and Convergence 7
Definition 10. If A and B are subsets of a linear space over lK, then we
set
II(u + v) + wll :::; Ilu + vii + Ilwll :::; Ilull + Ilvll + Ilwll·
Analogously, by induction, we get
N N
LUj : :; Lllujll for all Ul, ... , UN EX, N = 1,2, ....
j=l j=l
8 1. Banach Spaces and Fixed-Point Theorems
In these two examples, the triangle inequality (iv) from Definition 1 cor-
responds to the classical triangle inequality for real and complex numbers.
as n ---+ 00.
as n ---+ 00.
Proof. Let Un ----t U as n ----t 00. Hence Ilu n - ull ----t 0 as n ----t 00, i.e., for
each s > 0, there is a number no(s) such that
s
Ilun - ull < "2 for all n ~ no(s).
This implies
Example 3. Let N = 1,2, .... The space X :=]KN is a Banach space over
]Kwith the norm Ilxll := Ixl=, where
lim IXn -
n-->=
xl= = 0 iff lim
n-->=
~kn = ~k for all k = 1, ... , N. (7)
implies
Thus, the sequence (~kn) is also Cauchy. The classical Cauchy convergence
criterion implies the convergence
Ixl := (t~;)! ,
3=1
Moreover,
lim IX n
n-+oo
- xl = 0 iff lim ~kn = ~k
n-+oo
for all k = 1, ... ,N. (8)
Proof. Statement (8) follows from (7) by using the following inequality:
(10)
r(t,qJr
N N
for all a, b E R
. ._ / ('\'N 2)
Choosmg a .- ~j L.j=l ~j ! and b .-
._ 'T/j
/ ('\'N 2) !
L.j=l 'T/j and summing
over j, it follows that
that (xn) is also a Cauchy sequence with respect to the norm I . 100' By
Example 3, we get the convergence
lim ~kn
n---+oo
= ~k for all k,
Standard Example 6. Let -00 < a < b < 00. Then, X := C[a, b] is a
real Banach space with the norm
Ilull := max
a$x$b
lu(x)l·
The convergence Un ----+ U in X as n ----+ 00 means
i.e., the sequence (un) of continuous functions Un: [a, b] ----+ JR converges
uniformly on [a, b] to the continuous function u: [a, b] ----+ R
Moreover, from Iu(x) +v(x)1 ~ lu(x)1 + Iv(x) I we get the triangle inequality
Ilu + vii ~ Ilull + Ilvll·
Finally, we have to show that X = era, b] is a Banach space. Let (un)
be a Cauchy sequence in X, i.e.,
Thus, the convergence in (12) is uniform on the interval [a, b]. By a classical
result, this implies the continuity of the limit function u: [a, b] ---> R Hence
u E X and
Un ---> U in X as n ---> 00. D
in X as n ---> 00.
Since (un') converges to u, there exists some fixed index m such that
Uc(Uo) ~ M
(cf. Figure 1.4). The subset M of X is called closed iff the set X - M is
open.
(i) M is closed.
(ii) It follows from Un E M for all nand
as n ---- 00
that U E M.
Proof. (i) =} (ii). Let Un ---- U as n ---- 00 and Un E M for all n. We have
to show that U E M. If this is not true, then U E X - M. Since the set
X - M is open, there is some c:-neighborhood Uc(u) such that
From Ilun - ull ---- 0 as n ---- 00 we get Ilurn - ull < c: for some index m, and
hence
16 1. Banach Spaces and Fixed-Point Theorems
/,""
....------ ....... .....................
I
I ./
/
...... --.....
~U,,(uo),
"\
\ \ . Uo I I
\
" "./ /
..... _...... ./
-------
..... /
.............. "./
FIGURE 1.4.
uEX-M
Hence
1
Ilun - ull ~ -n -+ 0 as n -+ 00.
Then, B is closed.
The set B is called a closed ball of radius r around the point v.
for all n.
1.5 Operators
Definition 1. Let M and Y be sets. An operator
A:M-+Y
1.5 Operators 17
defined through
iff Au=v.
This definition makes sense, since for each given v E Y, there exists exactly
one u E M such that Au = v. The set
A- 1 (N) := {u E M:Au E N}
is called the preimage of the set N. Operators are also called functions.
Example 3. Let M := [a, b], Y := [c, d], and X := JR., where -00 < a<
b < 00 and -00 < c < d < 00. The operator
A:M ~ X ---+ Y
pictured in Figures 1.5(a), (b), and (c) is injective, surjective, and bijective,
respectively. In addition, A is not injective in Figure 1.5(b).
Example 4. Let -00 < a < b< 00, and let the function
A
d
a b
(a) injective (b) surjective (c) bijective
FIGURE 1.5.
be continuous. We set
(Au)(x) := lx
F(x, y, u(y))dy for all x E [a, b],
lb
and
(Bu)(x) := F(x, y, u(y))dy for all x E [a, b].
u: [a, b] --+ lR
(a) M is a closed nonempty set in the Banach space X over lK, and
(i) Existence and uniqueness. The original equation (13) has exactly one
solution u, i.e., the operator A has exactly one fixed point U on the
set M.
(iii) Error estimates. For all n = 0,1, ... we have the so-called a priori
error estimate
(16)
(17)
Proof. Ad (i), (ii). Step 1: We show first that (Un) is a Cauchy sequence.
Let n = 1,2, .... Using (15) we get
as n ~ 00.
UEM,
and hence Au E M. By (15),
u=Au.
Step 3: We show the uniqueness of the solution U of (13). It follows from
Au = U and Av = v with u, v E M that
Ilu - vii = IIAu - Avll ::; kllu - vii·
1.6. The Banach Fixed-Point Theorem and the Iteration Method 21
°
Since ~ k < 1, this implies Ilu - vii = 0, and hence
Ad (iii). Letting m -> 00 it follows from
U = v.
Example 1. Let -00 < a < b < 00. Suppose that we are given the
differentiable function
A: [a, b] -> [a, b]
such that
a u b
FIGURE 1.6.
(d) Let the real number A be given such that (b - a)IAI.c < l.
(iii) For all n = 0,1,2, ... we get the following error estimates:
u=Au.
Hence
IIAu - Avll ::; kllu - vii for all u, vEX.
Letting M := X, the assertions follow now from the Banach fixed-point
theorem (Theorem l.A in Section l.6). D
£ = max IK(x,y)l.
a<::x,y<::b
Therefore, all the statements of Proposition 1 are true for the integral
equation (19) with (20).
In the special case (20), the original problem (19) is called a linear integral
equation.
24 1. Banach Spaces and Fixed-Point Theorems
u = u(x)
uo \ ~ '-
.",,----_-8
L-----~--*---~----_x
FIGURE 1.7.
Un+l(x) = Uo + r F(y,un(y))dy,
lxo
Xo - h ::; x ::; Xo + h, n = 0, 1, ... ,
(23)
where uo(x) == uo.
is also continuous.
1.8 Applications to Ordinary Differential Equations 25
(b) We set
(Au)(x) := Uo + lxo
x
F(y, u(y))dy for all x E [xo - h,xo + h].
X f---+ F(x,u(x))
is also continuous on the interval [xo - h, Xo + h]. This implies the continuity
of the function
Au: [xo - h, Xo + h] ----t R
This way we get the operator
A:M----tX.
(a) A(M) ~ M.
26 1. Banach Spaces and Fixed-Point Theorems
11Xo
x
F(y, u(Y))dyl :::; Ix - xol max IF(y, u)1
(y,u)ES
:::;hM :::;r for all x E [xo - h,xo + h],
and hence
i.e., Au E M.
Ad (b). By the classical mean value theorem,
The following three sections serve as a preparation for the proof of the
fundamental Schauder fixed-point theorem, which allows a generalization of
the Picard-Lindelof theorem. The basic notions to be considered are:
1.9 Continuity
Definition 1. Let X and Y be normed spaces over K The operator
A:M~X----tY (24)
1.9 Continuity 27
lim
n-.oo
Un = U with U E M implies lim
n-.oo
AU n = Au.
The operator A in (24) is called continuous iff, for each point u E M and
each number 10 > 0, there is a number 15(10, u) > 0 such that
Ilv - ull < b(E,U) and v E M imply IIAv - Aull < E. (25)
Example 2. Let X and Y be normed spaces over lK. The operator A: M <;;;
X ---+ Y is called Lipschitz continuous iff there is a number L > 0 such that
(i) A is continuous.
(ii) A is sequentially continuous.
(ii) =} (i). Suppose that the operator A is not continuous. Then, there
exists a number EO > 0 such that condition (25) is violated for each number
15 > O. In particular, when 15 = ; there is a point Un EM such that
1
Ilun - ull < -n and IIAu n - Aull :::: EO for all n= 1,2, ....
28 1. Banach Spaces and Fixed-Point Theorems
as n ----+ 00.
Proof. Let Un, U E M for all n. Then, Aun , Au E A(M) for all n. It follows
from Un ----+ u as n ----+ 00 that
as n ----+ 00,
along with the inverse operator A- 1(C TJ) = r(a-1~, b- 1TJ) (see Figure 1.8).
TJ
A
=>
D E
FIGURE 1.8.
1.10 Convexity
Definition 1. The set M in a linear space is called convex iff
Intuitively, the convexity of the set M means that if the two points u
and v belong to M, then the segment joining them also belongs to M (see
Figure 1.9(a)).
The convexity of the real function f: [a, b] ---- lR. means that the chords
always lie above the graph of f (see Figure 1.9(b)).
Proof. It follows from Un ---- u as n ---- 00 that Ilunll ---- Iluli. Hence f is
continuous.
30 1. Banach Spaces and Fixed-Point Theorems
a u v b
FIGURE 1.9.
Example 5. Let X := era, b], -00 < a< b < 00, and set
(27)
(28)
where UI, ... ,Un E M and 0 ~ aI, ... ,an ~ 1 with al + ... + an = 1.
Proof. Ad (i). Let L be the set of all the linear combinations of the form
(27). Then
L ~.c.
Thus, L is the smallest linear subspace of X that contains the set M, i.e.,
L = span M.
Ad (ii). Use a similar argument as in the proof of (i). In this connection,
observe that it follows from
We want to show that the set C is closed. To this end, let (v n ) be a sequence
in C such that
Vn -+ V as n -+ 00.
C~C.
Therefore, C is the smallest closed subset that contains the set M, i.e.,
C=M. 0
1.11 Compactness 33
I convergence I
closed set
-- ~
compact set
~
compact operator
bounded set
-- relatively compact set
FIGURE 1.10.
continuous operator
1.11 Compactness
We want to study both compact sets and compact operators. The notion
of a compact set generalizes the classical Bolzano-Weierstrass convergence
theorem.
Compact operators allow us to generalize classical results for opera-
tor equations in finite-dimensional normed spaces to infinite-dimensional
normed spaces via approximation and a limiting process. For example, this
method will be used in the proof of the Schauder fixed-point theorem.
Compactness plays a key role in functional analysis.
Figure 1.10 tells us, for example, that each compact set is closed, and so
on.
Proof. Let the set M be relatively compact and suppose that M is not
bounded. Then, there exists a sequence (un) in M such that
Vn := 6n + i6nf
Since M is bounded,
(i) M is bounded, i.e., Ilull :::; r for all u E M and fixed r 2': o.
(ii) Mis equicontinuous, i.e., by definition, for each c > 0, there is a {j >0
such that
Proof. Suppose that we are given a sequence (un) in M, i.e., the functions
Again by (i), the sequence (u~l) (T2)) is bounded in R Hence there exists a
subsequence (U~2)) of (U~l)) such that
V n .=
.
urn)
n' n = 1,2, ....
Then
for all j = 1, 2, . .. . (32)
Step 2: Cauchy sequence in [a, b]. Let E > 0 be given. We choose the
number 8 > 0 as in (ii). Then, there exists a finite number of points
Xl, ... ,Xs E IQ such that, for each X E [a, b], there is some Xj such that
(33)
By (32), for each j = 1, ... ,8, the sequence (vn(Xj)) is convergent, and
hence it is a Cauchy sequence. Thus, there is a number nO(E) such that
Finally, for each X E [a, b], it follows from assumption (ii) and (33) that
This implies
f:M --dR
Proof. Set
a:= inf f(u).
uEM
M:= {f(u):u E M}
as n ---+ 00.
Since the set M is compact, there exists a convergent subsequence Un' ---+ V
as n' ---+ 00. By the continuity of the function f,
A:M ~ X ---+ Y
Proof. Recall that the uniform continuity of A means that, for each 10 > 0,
there is a number 8 (c) > 0 such that
Ilu - vii < 8(10) and U,V E M imply IIAu - Avil < c. (34)
Suppose that A is not uniformly continuous. Then, there exist a number
co > 0 and two sequences (un) and (v n ) in M such that
1
Ilun - vnll ::; -n and IIAun - Avnll ~ co for all n. (35)
un -+ U as n -+ 00 and u E M.
This implies Ilvn - ull ::; IIVn - unll + Ilun - ull -+ 0 as n -+ 00, and hence
Vn -+ U as n -+ 00.
min
l'5j"5,J
Ilu - Vj II ::; c for all u EM.
Proof. (i) =* (ii). Let M be relatively compact. Suppose that (ii) is not
true. Then, there is a number co > 0 such that M has no finite co-net.
Choose a fixed point UI EM. Then, there exists a point U2 EM such that
and
Consequently, each subsequence of (un) is not Cauchy, i.e., (un) does not
contain any convergent subsequence. This is a contradiction to the relative
compactness of M.
(ii) =? (i). Suppose that condition (ii) is satisfied. Let (un) be a sequence
in M. Fix c = 1. By (ii), there is some Vj EM such that
for all n.
Hence
for all n, m.
with the following properties. For each k = 1,2, ... , (u~k+l)) is a subse-
quence of (u~k)) and
By (36),
for all n, m with m ~ n.
Therefore, (v n ) is a Cauchy subsequence of (un). Since X is a Banach space,
the sequence (v n ) is convergent. This proves the relative compactness of the
set M. D
implies
IIAu - Avll = a~;~b lib [F(x, y, u(y)) - F(x, y, V(Y))]dyl ::; (b - a)c, (38)
A:M~X-Y
be a compact opemtor, where X and Yare Banach spaces over lK, and M
is a bounded nonempty subset of X.
Then, for every n = 1,2, ... there exists a continuous opemtor
such that
1
sup JJAu - AnuJJ :::; - and dim (spanAn(M)) < 00,
uEM n
where
where f3k -I- 0 for some k. Since f30 = 0 implies f31 = ... = f3N = 0, we get
f30 -I- O. Letting aj = - ~~ we obtain (39).
Finally, it follows from (39) and u = ai e1 + ... + a~eN that
(a1 - a~)e1 + ... + (aN - a~)eN = 0,
and hence aj -aj = 0 for all j. This yields the uniqueness of the components
aj. D
Definition 3. The two norms II . II and II . 111 on the normed space X are
1.12. Finite-Dimensional Banach Spaces and Equivalent Norms 43
called equivalent iff there are positive numbers a and (3 such that
One checks easily that I ·1100 is a norm on X. We want to show that there
exist positive numbers a and b such that
N
11(0:1, ... , O:N) - 1(,81, ... ,,8N)1 = L O:jej
j=l
N N
< L(O:j - ,8j)ej ~ 10: - ,8100 L Ilejll for all 0:,,8 E lKN.
j=l j=l
where v := L:f=l,8jej. Note that Ilvll oo = 1 implies ,8j -=I- 0 for some j.
Hence a > O. For given u -=I- 0, set v := Ilull~lu. Hence
and fixed positive numbers a1 and bt. The desired inequality (40) follows
now from (41) and (41*). 0
iff the corresponding components with respect to any fixed basis converge to
each other.
By (41),
allun - ull::; 15cj5cN
max lajn - aj I ::; bllun - ull·
Relation (42) means that Ilun -ull ---t 0 as n ---t 00. In turn, this is equivalent
to
as n ---t 00 for all j. (42*)
o
Corollary 6. Each finite-dimensional normed space is a Banach space.
Proof. Let Un ---t U as n ---t 00 with Un E L for all n. Then, (un) is Cauchy,
and hence u E L, by Corollary 6. 0
Definition 1. Let N = 1,2, .... The points uo, ... , UN in the linear space
X over lK are called to be in general position iff
46 1. Banach Spaces and Fixed-Point Theorems
This definition does not depend on the numbering of the points. For
example, if uo, ... , UN are in general position, then so are U1, UO, U2, ... , UN.
In fact, it follows from
ao(uo - U1) + a2(u2 - U1) + ... + aN(uN - U1) = 0 with aj ElK for all j
that
(ao + a2 + ... + aN)(uO - U1) + a2(u2 - uo) + ... + aN(uN - uo) = 0,
and hence ao +a2 + ... +aN = 0, a2 = ... = aN = O. This implies aj = 0
for all j.
Proposition 2. Let N = 1,2, .... Suppose that the points Uo, ... ,UN are
in general position, and suppose that
U - Uo (j. span{U1 - Uo, ... ,UN - uo}, (43)
then the points Uo, ... ,UN, U are also in general position.
Proof. Let
a1(U1-uO)+" ·+aN(UN-UO)+a(u-uo) = 0 with aj, a E lK for all j.
By (43), d: = O. Hence aj = 0 for all j. o
Definition 3. Let N = 1,2, ... , and let X be a linear space over lK. By an
N-simplex we understand the set
S := co{uo, ... ,UN}, (44)
where the points Uo, ... , UN E X are in general position.
By a O-simplex S, we understand a single point of X, i.e., S = {uo}.
•
Uo
/
So
•
Ul
~ ~
Uo Ul Uo
.b /
Ul
FIGURE 1.11.
The point
1 N
b:= N+1 LUj
j=O
is called the barycenter of S (see Figure 1. 11 (c)).
Let N = 1,2, .... The (N - 1)-simplices
The number
dist(u, M):= inf Ilu - wll
wEM
is called the distance of the point U E X from the set M.
Instead of dist(u, M), we also write distx(u, M).
where
N
U = L,6j{Uj - uo} with ,6j E OC for all j.
j=1
The norm
lIulloo:= max 1,6·1 for all U E Y
l~j~N 3
Let v E Uo +Y and
(3j(v - uo) and (3j(b - uo) of the points v - Uo and b - uo, respectively,
satisfy
j= 1, ... ,N.
for all j.
N
Ilu - vII = ~)Oj(u) - OJ(v))(Uj - uo)
j=l
N
::; max lIuj - uoll
l<j<N
- - j=l
L
IOj(u) - oj(v)1 ::; 2 max lIuj - uoll. D
l<j<N
- -
Uo
• /
(a)
3
•
u\ Uo
•
So
/ • /.b
3\
U\
&
Uo
(b)
U\
FIGURE 1.12.
FIGURE 1.13.
(i) allull :::; p(u) :::; bllull for all u E X and fixed a, b > O.
(ii) p(au) = ap(u) for all a::::: O.
(iii) p(u + v) :::; p(u) + p(v) for all u, v E X (triangle inequality).
(iv) p: X ~ lR. is continuous.
Ilull:::; r implies u E M.
Un - U as n - 00.
MC;;;;L,
SC;;;;M.
A:M -'> M
has at least one fixed point when M is a compact, convex, nonempty set in
a finite-dimensional normed space over lK.
B:K -'> K
has at least one fixed point when K is a subset of a normed space that is
homeomorphic to a set M as considered in Theorem LB.
54 1. Banach Spaces and Fixed-Point Theorems
A
b
a u b
FIGURE 1.14.
B 0 C, i.e.,
C- 1 (B(Cu)) = u, uEM.
Letting v = Cu, this implies Bv = v, v E K, i.e., B has a fixed point. 0
Example 2. Let M = [a, b], where -00 < a< b< 00. Then, each contin-
uous function
A: [a, b] ---+ [a, b]
has a fixed point u (see Figure 1.14).
This is the simplest special case of the Brouwer fixed-point theorem (The-
orem 1.B). Let us give a direct proof. To this end, we set
Since A(a), A(b) E [a, b], we get A(a) ~ a and A(b) ::; b. Hence
FIGURE 1.15.
R:M~8M (47)
as follows. For each point u E M follow the directed line segment from
the point Au through the point u to its intersection with the boundary
8M, and let the intersection point be Ru, as in Figure 1.15. Obviously, the
operator R in (47) is a so-called retraction, that is, R is continuous and
Intuitively, such a retraction does not exist. This is the desired contradic-
tion.
However, a rigorous proof for the nonexistence of a retraction of the form
(4 7) is highly nontrivial. Such a proof can be found in Zeidler (1986), Vol. 1,
p. 51, by means of the mapping degree, which represents an important tool
from topology.
At this place we want to give a completely different proof, one that uses
only elementary facts about simplices. This elegant proof was discovered
by Knaster, Kuratowski, and Mazurkiewicz in 1929.
Remark 3. Intuitively, all the closed sets M pictured in Figure 1.16 are
homeomorphic to a 2-simplex, and hence by Corollary 1 the Brouwer fixed-
point theorem applies to these sets.
Homeomorphic sets
FIGURE 1.16.
'"
"" ""
""
"" ""
"" ""
""",,""""r-
(a) (b)
FIGURE 1.17.
For our proof of the Brouwer fixed-point theorem we need the prepara-
tions that we consider in Sections 1.14.2 and 1.14.3.
FIGURE 1.18.
Lemma 5. Let one of the numbers 0, 1, ... ,N be associated with each vertex
v of the simplices Sj in (48). Suppose that if
Condition (49) means the following. Each vertex Uj of the original sim-
plex S carries the number j. Moreover, let :F be that lowest-dimensional
face of the original simplex S that contains the point v. Then, the number
associated with v is equal to one of the numbers of the vertices of :F (cf.
Figure 1.19).
But since the distinguished O-faces occur twice in the interior and once
on the boundary, the total number of distinguished O-faces is odd. Hence
the number of Sperner simplices is odd.
58 1. Banach Spaces and Fixed-Point Theorems
•
Uo
•
U, •0 •1 •
0
•1
(a)
D
2
Spemer
simplex
Uo U, 0
(b)
FIGURE 1.19.
for all possible systems of indices {io, ... , ik} and all k = 0, ... , N.
Then, there exists a point v in S such that v E C j for all j = 0, ... ,N.
v E Ck ·
We associate the number k with the vertex v. It follows from the Sperner
lemma (Lemma 5) that there is a Sperner simplex Sj whose vertices carry
the numbers 0, ... ,N. Hence the vertices Va, ... ,VN of Sj satisfy the con-
dition
for all k = 0, ... ,N.
Step 2: We now consider a sequence of triangulations of the simplex S
such that the diameters of the simplices of the triangulation go to zero. For
example, one can choose a sequence of barycentric subdivisions of S.
By Step 1, there are points
such that
· d·lam co { Va(n) , ... ,vN
11m
n-oo
(n)} -- ° . (51)
as n ...... 00 and V E S.
By (51),
as n ...... 00 for all k = 0, ... , N.
Since the set C k is closed, this implies
where
and (52)
60 1. Banach Spaces and Fixed-Point Theorems
With
u - Uo = a1(U)(U1 - UO) + a2(U)(U2 - uo)
and ao (u) = 1 - a1 (u) - a2 (u), it follows from the linear independency of
U1 - Uo, U2 - Uo that the barycentric coordinates ao(u), a1(u), and a2(u)
of the points u are uniquely determined by u and depend continuously on
u, by Proposition 5 in Section 1.12.
We set
j = 0,1,2.
Since aj(') and A are continuous on S, the set Cj is closed. Further-
more, the crucial condition (50) of the lemma of Knaster, Kuratowski, and
Mazurkiewicz is satisfied, i.e.,
k
co{ Uio' ... ,Uik} <:;;: U Cim , k = 0,1,2.
m=O
In fact, if this is not true, then there exists a point U E co{ Uio' ... ,Uik}
such that U ~ U:=o Cim , i.e.,
or
A:M-M
has at least one fixed point when M is a bounded, closed, convex, nonempty
subset of a Banach space X over lK.
This theorem was proved by Schauder in 1930. If dim X < 00, then
Theorem 1.C coincides with the Brouwer fixed-point theorem (Theorem
1.B in Section 1.14).
such that
for all u E M. (54)
Define
Mn :=XnnM.
Then, Mn is a bounded, closed, convex subset of Xn with 0 E Mn and
An(M) ~ co A(M) ~ M, since M is convex.
By the Brouwer fixed-point theorem (Theorem 1.B in Section 1.14), the
operator An: Mn - Mn has a fixed point Un, i.e.,
By (54),
for all n = 1,2, .... (56)
62 1. Banach Spaces and Fixed-Point Theorems
Since AUn E M for all n and the set M is closed, we get v E M. Finally,
since the operator A: M --+ M is continuous, it follows that
Av=v, vEM. D
u(x) = Uo + r F(y,u(y))dy,
lxo
Xo - h ::; x ::; Xo + h. (59)
(i) A: M ~ X is continuous.
-i:
1.11. In this connection, observe that
Au=u, UEM,
Observe that we do not assume that equation (61) has a solution. Con-
dition (A) is satisfied trivially if the set A(X) is bounded, i.e., there is a
number r > 0 such that JJAuJJ :::; r for all u E X.
This theorem was proved by Leray and Schauder in 1934. Roughly speak-
ing, Theorem 1.C corresponds to the following important principle in math-
ematics:
A priori estimates yield existence.
Au if JJAuJJ ::; 2r
Bu:= { 2rAu .
IIAul1 If JJAuJJ > 2r.
2rAu
if JJAuJJ = 2r,
Au= JJAuJJ
In case (a), the boundedness of the set M and the compactness of the
operator A imply that there is a subsequence (w n ) of (v n ) such that BW n =
AWn ---+ z as n ---+ 00.
In case (b), one can choose a subsequence (w n ) of (v n ) so that
1
IIAwnl1 ---+ a and AWn ---+ Z as n ---+ 00
for suitable a and z, since the sequence CIA~n II) is bounded and the op-
erator A is compact. Hence
= Bu = tAu .h 2r 1 (62)
u WIt t:= IIAul1 < .
This forces Ilull = Itl·IIAull = 2r. On the other hand, equation (62) implies
Ilull : : ; r, by (A), which is a contradiction. D
u::::; v,
which generalizes the corresponding relation for real numbers.
The order cone X + is called normal iff there is a number c > 0 such that
Example 2. The Banach space X := JR. with the norm Ilull := lui is an
ordered Banach space with the order cone X+ := JR.+ := {u E X: u ~ O}.
Here, the order relation u :::; v in X coincides with the corresponding
classical relation. Since 0 :::; u :::; v implies lui:::; lvi, the order cone X+ is
normal.
Example 3. The Banach space X := JR.N, N = 1,2, ... , with the Euclidean
norm I . I is an ordered Banach space with the order cone
Here,
(6,··· ,eN) :::; ('fJ1> ... ,TIN) iff ej:::; 'fJj for all j. (63)
By (63), 0 :::; x :::; y implies 0 :::; ej :::; 'fJj for all j, and hence Ixl :::; Iyl.
Thus, the order cone X+ is normal. The order cone JR.! in JR.2 is pictured
in Figure 1.20.
Standard Example 4. The Banach space X := C[a, b], -00 < a < b < 00,
with the usual norm Ilull := maxa$x$b lu(x)l, is an ordered Banach space
with the normal order cone
Here,
u:::;vonX iff u(x):::; v(x) on [a,b].
It follows from 0 :::; u :::; v in X that 0 :::; u(x) :::; v(x) on [a,b], and hence
lIull :::; Ilvll· Thus, the order cone X+ is normal.
68 1. Banach Spaces and Fixed-Point Theorems
FIGURE 1.20.
The following proposition shows that the relation u :::; v has the usual
properties.
(iii) u:::; v implies u +W :::; v + wand o:u :::; o:v for all 0: 2': O.
(iv) Un :::; Vn for all n and Un ---+ u and Vn ---+ v as n ---+ 00 imply u :::; v.
(v) If the order cone X+ is normal, then u :::; v :::; w implies Ilv - ull :::;
cllw - ulland Ilw - vii:::; cllw - ull·
In fact, Uo ::; Auo implies Uo ::; UI. Since A is monotone increasing, Uo ::; UI
yields Auo :S AUI, i.e., UI :S U2. Moreover, Uo ::; Vo implies Auo ::; Avo. By
hypothesis, Avo::; Vo. Hence UI ::; VI ::; Vo. Relation (66*) follows now by
induction.
Step 2: Convergence of (un). By Proposition 5(v), it follows from (66*)
that
Ilvo - unll :S cllvo - uoll for all n,
i.e., the sequence (un) is bounded. Since the operator A is compact, there
exists a subsequence (un') such that
Let c > 0 be given. Since un+! = Au n , there is a number no(c) such that
Hence Ilu - unll ::; cllu - uno II < cc for all n ?: no(€), i.e.,
u=Au.
Step 3: Similarly, one proves that Vn -> v as n -> 00 and v = Av. Letting
n -> 00 in (66*), we obtain (66).
An application to integral equations will be considered in Problem 1.1n.
(i) A is continuous.
(ii) There is a number c > 0 such that IIAul1 ::; cllull for all u EX.
since
IIA(u - v)11 ::; cllu - vii < E. o
(69)
where
N
"lm = Lamn~n, m=l, ... ,M. (69*)
n=l
Proof. Suppose that A is linear. Then Aen E Y, and hence there are
numbers amn in ][{ such that
M
Since A is linear,
where
N
max "" lamnl·
Aoo:= l<m<M~
- - n=l
Hence
IIAlloo:= sup IIAulloo ~ Aoo. (70)
II'U II 9
00
Ilull := as;xS;b
max lu(x)l,
where -00 < a < b< 00. Suppose that the function
K: [a, b] x [a, b] -+ lR
Letting n ......., 00, it follows from An(au + (3v) = aAnu + (3Anv that
i.e., A is continuous.
Finally, letting m ......., 00 in (71), we get
Hence
IIAn - All::::: c: for all n 2: no(C:),
i.e., An ......., A in L(X, Y) as n"""" 00.
This proves that each Cauchy sequence in L(X, Y) is convergent, i.e.,
L(X, Y) is a Banach space. 0
f:X......., lK.
1.21 The Dual Space 75
The set of all linear continuous functionals on X is called the dual space
X* of X.
Proposition 2. Let X be a normed space over JK. Then the dual space X*
is a Banach space over JK with respect to the norm Ilfll.
Example 3. Let X := Ora, b], -00 < a< b< 00, and let vEX. Define
A complete description of the dual space Ora, b]* will be given in Section
2.3 of AMS Vol. 109, along with applications to the famous classical moment
problem.
Then, f E X* iff there exist numbers an E JK, n = 1, ... ,N, such that
N
provided this limit exists. This infinite series is called absolutely convergent
iff
00
Proof. Set Sn := 'L.?=o Uj. By (73*), for each € > 0, there is an no(€) such
that
n+k
Ilsn+k - snll:::; L Ilujll < € for all n ~ no(€) and all k = 1,2, ....
j=n+l
Hence the sequence (sn) is Cauchy, i.e., the limit (73) exists. o
Proof. Ad (i). Use IIAm+ll1 = IIAm All ~ IIAmlillAl1 for m = 1,2, ....
Ad (ii). Since the sequences (An) and (Bn) are bounded, we get
where
00
L lajllzl
00
j < 00 for all z E C with Izl < r and some fixed r > O. (75*)
j=O
78 1. Banach Spaces and Fixed-Point Theorems
Recall that lK = lR or lK = C.
IIAII < r,
formula (74) defines an opemtor F(A) E L(X, X).
The following proof shows that this proposition remains valid if we re-
place L(X, X) with a Banach algebra B. Then, F(A) E B.
j=O j=O
Thus, the infinite series 2:;:0 ajAj from (74) is absolutely convergent and
hence convergent in L(X, X). 0
e A := ~Aj
j=O J.
converges absolutely for all A E L(X, X).
(ii) For each A E L(X, X) and all t, s E lK,
etAe sA = e(t+s)A. (76)
Proof. Ad (i). Observe that 2:;:0 I;~I < 00 for all Z E C, by a well-known
property of the classical exponential function
00 1
eZ = """'
L...- -., zj for all z E C.
j=O J.
Ad (ii). As for the classical exponential function, we get
tt ~ (~)
j=O k=O r=O j+k=r r=O j=O
=
r=Oj=O
r. J r .
tjsr-jA r = t
r=O
(t+,s)j Ar.
1.23 Banach Algebras and Operator Functions 79
L zj
00
(1 - z) -1 =
j=O
converges absolutely for all z E C with Izl < 1. By Proposition 4, for each
operator A E L(X, X) with IIAII < 1, the infinite series
00
Proof. Obviously,
Proof. Let A E Linv(X, Y). It follows from AA- l = I that A-I I- 0, and
hence IIA- 1 11 1-0.
If A E Linv(X, Y) and C E Linv(X, X), then AC E Linv(X, Y) and
(AC)-l = C- 1 A-I. (77)
yields
u(to + h) ---+ u(to) as h ---+ O. o
Proposition 3. Let X be a Banach space over IK, and let the operator
A E L(X, X) be given.
Then the initial-value problem (78) has a unique solution given by
for all t E R
j = 1, ... ,N.
1.24 Applications to Linear Differential Equations in Banach Spaces 81
ehA = 1+ hA + h2 A2 + ...
2!
that
00 Ihl j - 1
:::; Ihl ~ -.,-IIAll j :::; constlhl ---> 0 as h ---> O.
j=2 J.
Since
for all t, h E JR,
we get
Ilh-l(U(t + h) - u(t)) - Au(t)11 = II(h-l(e hA - I) - A)etAuoll
:::; ~lletAlilluoll---> 0 as h ---> O.
This implies u'(t) = Au(t) for all t ERIn addition, u(t) = etAuo = Uo for
t = o.
Step 2: For the uniqueness proof, we need the following result, which will
be proved in Section 1.1 of AMS Vol. 109 as an easy consequence of the
Hahn-Banach theorem: For all v EX,
This implies
d
dt (j,w(t)) = (j,Aw(t)) for all t E R (80)
Hence
max Ilw(t)II ::; h11A11 max Ilw(t)ll·
Itl:<S:h Itl9
This yields w(t) = 0 for all t E IR provided A = o.
If A -I- 0, then we choose the number h := 211~1I. Hence
Au = AU, U E X, A E C. (81)
In this case, the operator (>.I - A)-l is called the resolvent of A at >..
The spectrum a(A) of A is defined through a(A) := C - p(A).
Proof. Let>. E p(A) and J.L E C. Hence >.1 - A E Linv(X, X). Since
Recall that a set M is called countable iff there exists a bijective map
A: M --+ N, where N denotes the set of natural numbers n = 1,2 ....
The set M is called at most countable iff it is either finite or countable.
It is well-known that the set Q of rational numbers is countable.
Proposition 2. Let X := era, b], where -00 <a <b< 00. Then the set
of all the polynomials
Proof of Corollary 3. For each real number aj and each e > 0, there is
a rational number r j such that
(82)
Letting
q(x) := ro + rlX + ... + rnx n ,
it follows from (82) that
Thus, the set M of all the polynomials q with rational coefficients is dense
in Ora, b]. But, the set M is countable, since the set of rational numbers is
countable, and the union of a countable number of countable sets is again
countable. 0
a °
Proof of Proposition 2. Let ~ := E~=o' We only consider the case where
= and b = 1. The general case can be reduced to this special case by
using the coordinate transformation x = a + (b - a)y, which transforms
[0,1] into [a, b].
Step 1: Two identities. For bk(X) := (~) xk(1- x)n-k, we have
(83a)
and
By (83a),
10: + i/3 - (r + i8)1 ~ 10: -,I + 1/3 - 81 for all 0:, /3",8 E JR.
1.27. Summary of Important Notions 87
N
U = L~jej, where ~j E lK for all j.
j=l
and
implies a1 = ... = am = 0.
The maximal number of linearly independent points is called the dimension
dim X of X. In particular, we write dim X = 00 iff there is no finite
maximal number of linearly independent points.
Let X, Y be linear spaces over lK. A subset M of X is called a linear
subspace of X iff
An operator
A:M~X -> Y
1.27 Summary of Important Notions 89
A normed space X over lK is called a Banach space over lK iff each Cauchy
sequence in X is convergent.
Let M be a subset of the normed space X. Then, M is called open iff,
for each point u EM, there is a number c > 0 such that the set
The set M is called dense in X iff for each point u E X there exists a
sequence (un) in M such that Un -+ u as n -+ 00.
The set M is called relatively sequentially compact iff each sequence (un)
in M has a convergent subsequence. If, in addition, the limits of these con-
vergent subsequences belong to M, then M is called sequentially compact.
The set M is called compact iff each family of open sets, that covers
M, possesses a finite subfamily that already covers M. The set M is called
relatively compact iff the closure M of M is compact. We have the following
equivalences: 2
°
over K Then, A is called continuous at the point u E M iff, for each 6> 0,
there is a 8(6) > such that
2This will be proved in Problem 1.12 of AMS Vol. 109. In the following
chapters, we only need the concepts of relative sequential compactness and se-
quential compactness. However, to simplify notation, we will use "compact" and
"relatively compact" instead of "sequentially compact" and "relatively sequen-
tially compact." By the equivalences above, this convention cannot cause any
misunderstandings.
Problems 91
The smallest of these numbers d is called the norm IIAII of the operator A,
i.e.,
IIAII:= sup IIAull·
Ilull:5l
Hence, IIAul1 ~ IIAliliuli for all u E X.
The space f: X ~ ][( of all linear continuous functionals on the normed
spaces X over ][( becomes a Banach space X* with respect to the norm
Ilfll, i.e., we define the linear combination of + f3g through
Au = AU, uEX
Problems
1.1. Simple examples. Let X := era, b], where -00 < a< b < 00 and
Ilull := maxa$x$b lu(x)l· Show that
1.1a. {u E X:
is not dense in X.
I:
u(x)dx = O} is a closed linear subspace of X. This set
92 1. Banach Spaces and Fixed-Point Theorems
lb
1.Ii. If we set
IIul11 := lu(x)ldx,
as n -; 00.
Show that (un) is Cauchy with respect to 11·111. Suppose that Ilu n -ulh -; 0
as n -; 00, where u E X. Then,
as n -; 00.
Hence u(x) = w(x) on [a, b], contradicting the continuity of the function u.
1.Ij. The operators A: X -; X and B: X -; X defined through
1.11. Let a E JR with lal(b-a) < 1. For each given Uo E X, the iteration
method
LIm. Let a E JR with lal < 1. For each given Uo E JR, the iteration
method
Un+l = a sin Un + 1, n=O,l, ... ,
converges to the unique solution U E JR of the equation U = a sin U + 1.
for all x, y E [a, b]. Let 2(b - a)d :S 1 along with u(x) == and vo(x) == 2.
Then, the two iteration methods
°°
LIn. Let K(x, y): [a, b] x [a, b] - t JR be continuous with :S K(x, y) :S d
1.3. The spectrum. Let u(A) denote the spectrum of the linear operator
A: X --+ X. Show that
1.3a. u(A) = {2} provided X := e and Au = 2u.
1.3b. If X = eN, N ;::: 1, then the spectrum cr(A) of the matrix operator
A: X --+ X given through
Show that
1.4a. r(A) :::; IIAII, by Proposition 2 in Section 1.25.
l.4c. Volterra integral operator. Let X := Ora, b]c, where -00 < a <
b< 00 (cf. Problem 1.6e). Define the operator A: X --> X through
(Au)(x) := l x
K(x,y)u(y)dy for all x E [a, b],
1.5. The Banach space l~. Let ][(00 denote the space of all sequences
(Un)n>l, where Un E ][( for all n E N. Moreover, let l~ denote the set
of all (un) E ][(00 such that
Define
for all (x, j3 E IT(.
Show that
1.5a. ][(00 is an infinite-dimensional linear space over IT(.
1.6. Classical function spaces on [a, b]. Let -00 < a < b < 00. Show that
the following function spaces are Banach spaces.
Observe that Holder continuous functions play a fundamental role in
the theory of linear and nonlinear elliptic and parabolic partial differential
equations3 as well as in classical potential theory.4
3Cf. Zeidler (1986), Vol. 1, Chapter 6, and Gilbarg and Trudinger (1977).
4Cf. Kellogg, Foundations of Potential Theory, Springer~Verlag, 1929.
96 1. Banach Spaces and Fixed-Point Theorems
1.6a. Let B[a, b] denote the set of all bounded functions u: [a, b] ---+ lR and
set
Ilull:= sup lu(x)l·
a:'Ox:'Ob
1.6b. For 0 < a :s; 1, let cO,a [a, b] denote the set of all the so-called
Holder continuous functions u: [a, b] ---+ lR, i.e., by definition,
lu(x) - u(y)1 :s; constlx - yla for all x, y E [a, b]. (85)
Let
H ( ) ._ lu(x) - u(y)1
a U . - sup Ix_ya I '
where the supremum is taken over all x, y E [a, b] with x =1= y, i.e., the so-
called Holder constant H a (u) of u is the smallest constant such that (85)
holds. In particular,
Set
1.6c. Let Ck[a, b] with k = 1,2, ... denote the set of all continuous
functions u: [a, b] ---+ lR that have continuous derivatives on [a, b] up to order
k. Set
k
Ilull := L a<x<b
max lu(j)(x)l,
j=O - -
where u(j) denotes the jth derivative.
1.6d. For 0 < a :s; 1 and k = 1,2, ... , let Ck,a[a, b] denote the set of all
functions u E Ck[a,b] with u(k) E CO,a[a,b]. Set
1.6e. Let C[a, bk denote the set of all complex continuous functions
u: [a, b] ---+ C. Define
Ilull := max lu(x)l·
a:'Ox:'Ob
Set
Ilull := m~ lu(x)1
xEG
+ Ha(u),
where
lu(x) - u(y)1
Ha(u) := sup
x,YEG, x#y Ix - yla
1.8d. For k = 1,2, ... , let Ck (G) denote the set of all functions u E C( G)
which have continuous partial derivatives on G up to order k. Moreover,
suppose that all these partial derivatives can be extended continuously to
the closure G. Set
Ilull := L m~ 18!1 u(x) I,
1!1I:5k xEG
where we sum over the function and all their partial derivatives up to order
k.
98 1. Banach Spaces and Fixed-Point Theorems
1.8e. For 0 < a S; 1 and k = 1,2, ... , let Ck,Q(G) denote the set of all
functions u E C k (G) such that all the partial derivatives of u of order k
belong to CO,Q(G). We set
Show that
1.9a. Xc forms a complex linear space with respect to the following
operations:
(u,v) + (w,z) = (u+w,v+z), (86)
(a + i(3)(u, v) = (au - f3v, av + f3u), a,f3 E~. (87)
Instead of (u, v) we also write u + iv. Then, (87) corresponds to the
following formal multiplication rule:
implies u* = O.
Problems 99
u*(v) = n-+oo
lim u*(u n ) = 0 for all v E X.
Therefore, u* = O.
and suppose that X is a Banach space with respect to both the norms II . II
and II . IiI· Then, II . II is equivalent to I . IiI on X.
Solution: Let X and Xl denote the linear space X equipped with the
norm II . II and II . Ih, respectively. Define the operator
In a famous paper from 1857, Riemann used the Dirichlet principle for
the foundation of the theory of complex analytic functions. In 1870 Weier-
strass showed that there are variational problems that do not have any
solution. l This way the justification of the Dirichlet principle became an
important open problem, which Hilbert solved in 1900.
In his Paris lecture, Hilbert formulated 23 open problems. In connection
with the twentieth problem, he said
q
L
(resp., u: G --+ with
lu(xWdx < 00.
!
pre-Hilbert space _
!
normed space ~ linear space
(inner product (u I v)) (norm lIull = (u I U)2) (em + ,l3v)
FIGURE 2.1.
2. Hilbert Spaces, Orthogonality, and the Dirichlet Principle 103
partial differential
equations of mathematical
physics (Chapter 5)
Fourier transfonnation
(Section 3.7)
FIGURE 2.2.
(u I v):= l u(x)v(x)dx
on JRN and (CN, where u = (Ul, ... ,UN) and v = (VI> ... ,VN). Observe that
the integral fa ... is to be understood in the sense of Lebesgue.
The theory of Hilbert spaces forces the use of the Lebesgue integral.
The deeper reason for this is the fact that in the case of the classical
Riemann integral the limiting relation
lim
n---HXl
1
a un(x)dx = a u(x)dx 1
2The bar denotes the conjugate complex number. In the case of the real space
L2(G), u(x) is real, and hence
(u I v) = fa u(x)v(x)dx.
104 2. Hilbert Spaces, Orthogonality, and the Dirichlet Principle
generalized functions
(distributions)
!
variational problems
Sobolev spaces
existence of a perpendicular
I
justification of the
I
Dirichlet principle
I
(finite elements)
t
nonlinear Lipschitz continuous,
strongly monotone operators
Iorthogonality principle I
t
Banach fixed-point theorem
FIGURE 2.3.
2.1 Hilbert Spaces 105
(UIV)E][{
such that the following hold for all u, v, w E X and a, (3 E ][{;
(av + (3w I u) = a(v I u) + .B(w I u) for all u,v,w EX, a,(3 EK (1)
(u I v) = o. (2)
Using the norm 11·11 introduced in Proposition 3 ahead, we can write the
Schwarz inequality in the following form:
with a .- (vlu)
. - (vlv)· o
Proposition 3. Each pre-Hilbert X space over K is also a normed space
over K with respect to the norm
Proof. We have Ilull : : : 0 for all u EX, and Ilull = 0 iff u = O. Furthermore,
Ilaull = (au I au)! = (aa)! (u I u)! = lailiull for all u EX, a E K.
follows from the Schwarz inequality (3*). In fact, for all u,v E X,
Ilu + vll 2 = (u + v I u + v) = (u I u) + (u I v) + (u I v) + (v I v)
= IIul1 2 + 2Re(u I v) + IIvl1 2
::; IIul1 2 + 211ullllvil + IIvl1 2 = (Ilull + Ilvll)2,
where Re z denotes the real part of the complex number z. o
From Proposition 3 we obtain the following:
All the notions and theorems for normed spaces 3 remain valid for pre-
Hilbert spaces with respect to the norm Ilull from (4).
In particular, the convergence
asn-'oo
as n -. 00.
and vn -. v as n -. 00
3The basic notions concerning normed spaces and Banach spaces are summa-
rized in Section 1.27.
2.1 Hilbert Spaces 107
then u = o.
for all n
that (u I u) = o. Hence u = O. D
Proof. Set ¢(t) := (u(t) I v(t)). Letting h -+ 0, the assertion follows from
In other words, a linear space X over ][{ is a Hilbert space iff the following
hold:
(ii) each Cauchy sequence with respect to the norm Ilull from (4) is con-
vergent.
108 2. Hilbert Spaces, Orthogonality, and the Dirichlet Principle
Proposition 10. Let X be a Hilbert space (resp., Banach space) over lK,
and let L be a linear subspace of X.
Then, the closure L of L is also a Hilbert space (resp., Banach space)
with respect to the restriction of the inner product (resp., norm) on X to
1.
Proof. We first prove that L is a linear space over lK. In fact, let u, vEL
and a, j3 E lK. Then, there are sequences (un) and (v n ) in L such that
and vn -7 V in X as n -7 00.
that au + j3v E L.
Restrict the inner product (resp., norm) on X to the subset L of X.
Then, L is a pre-Hilbert space (resp., normed space).
Finally, let (un) be a Cauchy sequence in 1. Then,
in X as n -7 00.
in L as n -7 00. D
2.2 Standard Examples 109
where x = (6, ... , ~N) and y = ('TI1, ... , 'TIN). The corresponding norm is
given through
(t, l~jI2)
1
Example 2. Let -00 <a < b< 00. For all u, v E C[a, b], we define
(u I v) := lb uvdx. (5)
u:la,b[ -IR
such that
Then
(i) L 2 (a, b) is a real Hilbert space with respect to the following inner
product:
Proof. In the following we will essentially make use of the Fatou lemma
for the Lebesgue integral (cf. the appendix).
J:
Ad (i). Step 1: The classic Schwarz inequality. We want to show that if
u, v E L2 (a, b), then the integral uv dx exists and
(6)
respectively. Hence J:
uv dx = 0, i.e., (6) is true.
Suppose now that Ilull -=1= 0 and Ilvll -=1= O. Replacing u with II~II and v
with II~II' if necessary, we may assume that lIull = 1 and Ilvll = 1. By (6),
and
implies the existence of the integral J: luvldx, and hence the existence of
J: uv dx. Furthermore, it follows from (8) that
lib uVdxl ~ ib luvldx ~ Tl (i b ib lul 2dx + IVI2dX)
= 1 = Ilullllvll.
This is the desired inequality (6).
Step 2: We show that L 2( a, b) is a linear space. In fact, for all a, (3 E IR
and all x E la, b[,
Let u, v E L 2(a, b). Then, the integrals lul 2dx andJ: Ivl 2dx exist. ByJ:
Step 1, this implies the existence of J:
luvldx. Hence the integral
By the identification principle (I) given earlier, we obtain that the function
u = u(x) corresponds to the zero element u = 0 in L 2 (a, b).
Conversely, let u = 0 be the zero element in L 2 (a, b). By (I), this element
corresponds to the class of all the functions u with u(x) = 0 for almost all
x E la,b[. Hence (u I u) = O.
Furthermore, it follows from
m
sm(x) := L IVk+l(X) - vk(x)l·
k=l
Since the sequence (sm(x)) is monotone increasing, the limit
2.2 Standard Examples 113
exists for all x E la,b[, where 0:::; S(x):::; 00. Since L 2(a,b) is a pre-Hilbert
space, the triangle inequality holds. Hence
m
Ilsmll :::; L Ilvk+! - vkll :::; TI + 2- 2 + ... :::; 1 for all m ;::: 1.
k=l
This implies
for all m ;::: 1.
Thus, by the Fatou lemma, the function S is integrable over la, b[ with
In particular, S(x) is finite for almost all x E la, b[. In the remaining points
let us redefine S by setting S(x) := O.
Letting s(x) := S(x)4, we get
s(x) = m->oo
lim sm(x) for almost all x E la, b[, (9)
and J:
s2dx :::; 1, i.e., s E L 2(a, b).
We now use the identity
m-l
L
00
s(x) = IVk+!(X) - vk(x)1 < 00 for almost all x E la, b[. (11)
k=l
Thus, the finite limit
V(X):= lim vm(x)
m->oo
exists for almost all x E la, b[. In the remaining points x of the interval la, b[
we set V(X) := O. As the limit of measurable functions Vm, the function v
is also measurable on the interval la, b[. According to (10) and (11),
Iv(x)1 :::; IVI(X)I + s(x) for almost all x E la, b[. (12)
Since VI E L 2(a, b), we get !vI I, s E L 2(a, b), and hence IVII + s E L 2(a, b),
by Step 2. It follows from (12) that
lar m~CXJ
b
= lim Ivn(x) - v m (x)1 2 dx
Then
(i) L~(G) is a Hilbert space with respect to the following inner product:
un(x).-
._ {~l° if x E C
if]RN - C, (17)
where x = (6, ... '~N)' Then, Un E Llf(G) for all n = 0,1,2, ....
It follows as in the proof of Example 9 in Section 1.1 that the functions
Uo, ... , Un are linearly independent for all n. Hence dim Llf(G) = 00. 0
u: G -+]R
FIGURE 2.4.
(b) Ck(G) is the set of all u E Ck(G) for which all partial derivatives of
order m = 0, ... , k can be extended continuously to the closure G of
G.
(c) lf u E Ck(G) (resp., U E Ck(G)) for all k = 0,1,2, ... , then we write
U E COO(G) (resp., u E COO(G)).
(d) CO'(G) is the set of all functions u E COO(G) that vanish outside
a compact subset C of G that depends on u, i.e., u(x) =
x E G - C (see Figure 2.4).
for all °
Instead of CO(G) (resp., CO(G)) we write briefly C(G) (resp., C(G)).
That is, C( G) consists of all continuous functions u: G --+ IR, and C(G)
consists of all continuous functions u: G --+ IR.
The set CO' (G)c consists of all functions u: G --+ C for which both the
real and the imaginary parts of U belong to CO' (G). Similarly, we define
Ck(G)c, and so on.
lf u E C k (G), then we say "u is C k on G." In the one-dimensional special
case where G = la, b[, we write briefly
and
and so forth.
b
lb u'vdx = - l uv' dx. (21 *)
Proposition 11. Let -00 <a < b< 00. Then, the following are met:
u,v E C1[a,b].
l
b
(U'v+uv')dx= lb(uv)'dx=uvl~.
Ad (ii). Since the function v vanishes in a neighborhood of the two bound-
ary points x = a and x = b, we can choose a subinterval [c, d] of la, b[ such
that v E CO'(c, d). Furthermore, u E C1(a,b) implies u E C1[c,d]. Hence
where x = (6, ... '~N) and 8j u := 8u/8~j. In addition, the outer unit
normal vector to the boundary 8C is denoted by n = (nl' ... ,nN). In the
special two-dimensional case (N = 2), the surface integral fac ... dO is to
be understood in the sense of fac ... ds, where s denotes arclength, and
the boundary curve 8C is oriented in such a way that the set C lies on the
left-hand side of 8C (see Figure 2.5(a)).
2.2 Standard Examples 119
(a) (b)
G
FIGURE 2.5.
In this book, we only need the special case (22*). Let us sketch the proof
of Proposition 12.
Ad (i). The generalization of the fundamental theorem of calculus
{b b
Ja w'dx = wla
Hence
k
r ((8j u)v + u8j v)dx = kr 8j (uv)dx = kG
r uvnjdO.
This is (22).
Ad (ii). Let v E C(f(G). Then, the function v vanishes outside a compact
subset of G, i.e., v vanishes on a "boundary strip" of 8G. Thus, it is possible
to construct a nonempty bounded open subset H of G such that v = 0
outside H and the boundary 8H is sufficiently smooth (see Figure 2.5(b)).
Hence
since v = 0 on 8H.
a: X x X -+ II{
Finally, a(·,·) is called strongly positive iff there is a constant c > 0 such
that
cllul1 2 :::; a(u,u) for all u E X.
2.4 The Main Theorem on Quadratic Variational Problems 121
Un -+ u and Vn -+ v as n -+ 00
Note that a(v,v) 2 cllvl1 2 > 0 for all v E X with v =f. O. Thus, the original
problem (23),
F(u) = min!, UEX,
has a solution u iff the real quadratic function ¢ = ¢(t) has a minimum at
the point t = 0 for each fixed v EX, i.e.,
¢'(O) = O. (25)
This is (23*).
Step 2: Uniqueness. Let u and w be solutions of the original problem
F(u) = min!, u E X. By Step 1,
a(u, v) = b(v)
for all v E X,
a(w,v) = b(v).
Letting v := u - w, we get
Since
F(u) = T1a(u, u) - b(u) 2 T 1 cllul1 2 - Ilbllllull,
we obtain F(u) ~ +00 if Ilull ~ +00. Hence 0 > -00.
By the definition of 0, there is a sequence (un) such that
as n ~ 00.
Hence
as n ---> 00.
This implies
F(u) = a,
i.e., u is a solution of the original problem F(u) = min !, U E X. 0
Proposition 1. Let X be a pre-Hilbert space over IK. Then, for all u, v EX!
we have the so-called parallelogram identity
(27)
Proof. From
(u ± v I u ± v) = (u I u) ± (u I v) ± (v I u) + (v I v)
we get Ilu ± vl1 2 = IIul1 2 ± (u I v) ± (v I u) + Ilv11 2 . o
Remark 2 (The geometrical meaning of the Dirichlet principle). Figure
2.6(a) shows the geometrical meaning of the parallelogram identity (26).
By Figure 2.6(b), it is obvious that (27) generalizes the classic Pythagorean
theorem.
The proof of Theorem 2.A has been based on the identity (26). If we
introduce the energetic inner product
o u u
FIGURE 2.6.
----e<---/l------+----U L
o
FIGURE 2.7.
It seems that this idea has deep roots in our real world, since the Hilbert
space theory is the right mathematical tool for describing quantum physics.
This will be discussed in Section 5.14.
2.5. The Functional Analytic Justification of the Dirichlet Principle 125
F(u) :=
N
r1 12)Oju)2dX
G j =1
-1 G
fudx = min!, u = g on oG. (28)
This problem is also called the Dirichlet problem. Here, we assume that
(H) G is a nonempty bounded open set in IR N , N = 1,2, ....
In addition, we set x = (6, ... , ~N) and OjU := ou/o~j.
The Dirichlet principle says that problem (28) has a solution u. After
the necessary preparations, the final existence theorem will be proved at
the end of this section.
1 LOjuOjvdx =
G j=1
1G
fvdx for all v E Co(G). (28b)
flu:= LoJu.
j=1
w:= u +tv
w =9 on DG and
Explicitly,
Since u is a solution of (28*), the function 1;: JR -> JR has a minimum at the
point t = O. Hence
1;'(0) = O.
Explicitly,
1;'(0) = 1I::
N
Gj=l
DjUDjVdx -1 G
fvdx =0 for all v E cgo(G). (29a)
that is,
-fa (/j,.u + J)v dx = 0 for all v E Cgo(G).
2.5. The Functional Analytic Justification of the Dirichlet Principle 127
6.u + f =0 on G, (29c)
i.e., u is a solution to the boundary-value problem (28a).
Ad (ii). By Step 3, equation (29a) implies (29c). Conversely, integration
by parts tells us that (29c) implies (29a). 0
u E Wi(G)
in the Sobolev space Wi(G), which will be introduced ahead. Such func-
tions only possess generalized derivatives of first order. Summarizing:
The introduction of Sobolev spaces corresponds to the introduction of
real numbers by completion of the set of rational numbers via irrational
numbers.
Our program is now the following:
128 2. Hilbert Spaces, Orthogonality, and the Dirichlet Principle
(n) One proves the existence of generalized solutions by using the meth-
ods of functional analysis.
((3) One uses sophisticated analytical methods in order to prove that
generalized solutions are also classic solutions, provided the situation
is sufficiently regular (e.g., the boundary oG and the functions f and
g in (28) are sufficiently smooth).
The point is that this formula remains valid for certain nonsmooth func-
tions u and w.
Proof. Suppose that (31*) holds for w,w E L2(G). This yields
] 1 =]0
-1
uv'dx
-1
uv'dx + r uv'dx
io
1
-1 io
=_]0 wvdx- ior1wvdx+u(O)v(O)-u(-I)v(-I)
-1
+ u(1 )v(l) - u(O)v(O)
= _]1 -1
wvdx,
130 2. Hilbert Spaces, Orthogonality, and the Dirichlet Principle
since v(±l) = O. More precisely, for small c: > 0, note that integration by
l- -1-
parts yields
e
uv'dx =
e
u'v dx + uv [~ ,
-1 -1
1 -1
0 uv'dx = -1 0
-1
u'v dx + UVI~l'
since uv is continuous on [-1, OJ.
Similarly, applying c: ---> +0 to Ie1 ... , we get the corresponding formula
Io
1 ... =.... 0
u E L2(G)
that have generalized derivatives
for all j = 1, ... , N.
(u I Vh,2:= 1 +t
G
(uv
J=l
8 j U8j V) dx,
1
and Ilu111,2 := (u I u)i,2'
Proposition 7. The space Wi(G) together with the inner product (. I
·h,2 becomes a real Hilbert space, provided we identify two functions whose
values differ only on a set of N-dimensional measure zero.
Hence (un) and (8j u n ) are Cauchy sequences in L2(G). Since L2(G) is a
Hilbert space, there are functions u, Wj E L2(G) such that, as n ---> 00,
we obtain
using the continuity of the inner product on the Hilbert space L2(G) (see
Proposition 4 in Section 2.1).
Equation (33) tells us that the function u has the genemlized derivatives
Wj = 8j u on G for all j.
Since 8ju E L2(G) for all j, we get u E WJ(G).
Finally, it follows from (32) that
Ilun - ulll,2 -+ 0 as n -+ 00,
o
2.5.4 The Sobolev Space W~(G)
o
Definition 8. Let W ~(G) denote the closure of the set C(f(G) in the
Hilbert space WJ(G).
We will later discuss that it makes sense to say that all of the functions
o
u EW~(G) satisfy the boundary condition
u=o on8G
in some generalized sense.
o
Proposition 9. The space W~(G) is a real Hilbert space.
Proof. Note that C8"(G) is a linear subspace of the Hilbert space WJ(G).
Now use Proposition 10 from Section 2.1. 0
In the special case where N = 1 and G = la, b[, let us briefly write
wi (a, b) and
o
instead of WJ(G) and W~(G), respectively. The following example shows
o
that the functions u EW~(a, b) possess a simple structure.
o
Example 10. Let -00 < a < b < 00. If u EW~(a, b), then there exists a
unique continuous function v: [a, b] -+ lR such that u(x) = v(x) for almost
132 2. Hilbert Spaces, Orthogonality, and the Dirichlet Principle
0
1
for all u, v EW2 (a, b).
w(x) = l x
w'dy for all x E [a, b].
that (v n ) is also a Cauchy sequence in the Banach space C[a, b]. Thus, there
is a function v E C[a, b] such that
as n --+ 00.
2.5. The Functional Analytic Justification of the Dirichlet Principle 133
Standard Example 11. Let -00 <a < b< 00, and let the function
u: [a, b] --+ lR
be continuous and piecewise continuously differentiable. Denote by C the
set of points x where the classic derivative exists. Define the real function
. _ {u' (x) if x E C
w(x).- arbitrary otherwise.
(i) The function w is the generalized derivative of u on la, b[, i.e., w = u'
on ]a,b[.
(ii) u E Wi(a, b).
o
(iii) u EW§(a, b) iff u(a) = u(b) = O.
Proof. Ad (i). Divide the interval [a, b] into the subintervals raj, aj+1] and
use integration by parts as in Example 5.
Ad (ii). Since u is continuous and w = u' is piecewise continuous and
bounded, we get
and
134 2. Hilbert Spaces, Orthogonality, and the Dirichlet Principle
a b
(a) uEWi(a, b)
a b
•
a
~. b
FIGURE 2.8.
it follows from the density proof in Problem 2.13a that Vc; E CO'(a, b), for
sufficiently small c > 0, and
as c -+ +0.
Hence
as c -+ +0.
2.5. The Functional Analytic Justification of the Dirichlet Principle 135
Summarizing, we get
o
where Un E C(f(a, b) for all n. Hence U EW~(a, b). o
(a) Let G ~ ]RN with N = 1 and G = la, b[. Then, Example 10 tells us
that (34) holds true in the "classic sense."
(b) Let G ~ ]RN with N ;::: 2, and suppose that the boundary 8G of the
nonempty bounded open set G is sufficiently regular. Then it can be
proved that
(B)
o
This implies the following: If U EWHG), then there exists a sequence (un)
in C(f(G) such that Un -+ U in wd-(G) as n -+ 00. By (B),
r (u -
JaG u n )2dO ::::; constllu - u n lli,2 -+ 0 as n -+ 00.
and hence
u(x) = 0 for almost all x E 8G, (34*)
in the sense of the surface measure on 8G.
The elementary proof of the boundary inequality (B) can be found in
Zeidler (1986), Vol. 2A, p. 247. This proof is based on the Schwarz inequal-
ity.
Example 15. We consider first the special case where N = 1 and G = la, b[
with -00 < a < b < 00.
Step 1: Let u E CO'(a, b). Then
u(x) =1 x
u'(y)dy for all x E [a, bl.
u(x)2:::; ( 1 b
1. lu'ldy
)2 :::; 1 b
dy 1 b
u,2dy,
and hence
b
1b u 2dx :::; (b - a)21 u,2dx.
Step 2: Let u EW~(a, b). Then, there is a sequence (un) in CO'(a, b) such
that Ilu - un lll,2 -+ 0 as n -+ 00. Hence
By Step I,
'fJ
~----~----------~-----e
a b
FIGURE 2.9.
o
Proof of Proposition 14. Let N = 2. The general case proceeds analo-
gously.
Step 1: Let u E Co(G). As in Figure 2.9 consider a rectangle R :=
[a, b] x [c, d] with G ~ int R. Note that u vanishes outside G. Then
un -? U in L2(G) and
Faust
(36)
u-9EW~(G). (37)
o
As defined in (34), the condition "u - g EW~(G)" corresponds to the
boundary condition
u-g=O on8G,
in the generalized sense. We only consider such boundary functions g on
8G which can be continued to a function g on G such that g E Wi (G).
(ii) This is also the unique solution u E wi(G) of the generalized bound-
ary-value problem (37).
o
Proof. We set X :=W~(G) and
2.5. The Functional Analytic Justification of the Dirichlet Principle 139
where we set
N
~ L 1I0jvl12110j wll2 ~ Nllvlh,2I1wlh.2, (38)
j=l
Hence
for all v E X,
i.e., a(·, .) is strongly positive.
Step 2: The functional b: X --+ R By the Schwarz inequality,
By (38),
for all v E X.
Hence
Ib(v)1 ::; constllvlk2 for all v E X. (39)
Obviously, b(·) is linear. By (39), b: X --+ lR is a linear continuous functional.
Thus, the assumptions of Theorem 2.A are satisfied. Consequently, using
Theorem 2.A, we obtain that problem (36*) has a unique solution wE X,
and w is also the unique solution of (37*).
Finally, set
u = g+w.
Then U E Wi(G), and u is the unique solution of both (36) and (37). 0
(41)
where
Xn is a finite-dimensional subspace of the real Hilbert space X,
2.6. The Convergence of the Ritz Method 141
i.e., 0 < dim Xn < 00. Comparing (40) with (41), it turns out that the
space X of the original problem is replaced with X n . Thus, the Ritz method
is based on a quite natural idea. Let {el n , ... ,eNn} be a basis of X n . Then,
the elements Un and Vn of Xn allow the following simple representation:
N N
Un = LDinein and Vn = L f3in ein.
i=l i=l
lim distx(u,Xn)
n--+oo
=0
for all u E X.
(H2) The bilinear form a: X x X --+ ~ is bounded, symmetric, and strongly
positive, i.e., there exist positive constants c and d such that
la(u,v)1 ::; dilullllvil and cllul1 2 ::; a(u,u) for all u,v E X.
Theorem 2.C (The Ritz method). Assume (HI) through (H3). Then, the
following hold true:
(i) Existence and uniqueness. The original variational problem (40) has
a unique solution u. This is also the unique solution of the variational
equation (40*). We have the a priori estimate
(42)
(ii) The Ritz equation. For all n = 1,2, ... , the Ritz problem (41) has
a unique solution Un. This is also the unique solution of the Ritz
equation (41 *).
(43)
(v) Error estimates. Suppose that we know a lower bound (3 for the min-
imal value of the original problem (40), i.e., F(u) 2:: {3. Then,
Furthermore:
i.e., G(v) is a lower bound and F(w) is an upper bound for the minimal
value of (V).
Such dual problems will be considered in Section 2.12.
(c) The golden rule for the rate of convergence. Using information from
approximation theory, it is possible to obtain estimates for the distance
a(u,u) = b(u).
Hence cllul1 2 ::; a(u,u) = b(u) ::; Ilbllllull. This implies cllull ::; IIbll·
Ad (ii). This follows from Theorem 2.A applied to the Hilbert space X n .
Note that each finite-dimensional linear subspace of a Hilbert space is again
a Hilbert space.
Ad (iii), (iv). The key to the proof is relation (43). Subtracting the Ritz
equation (41*) from the variational equation (40*), we obtain the so-called
orthogonality relation
for all v E X n .
144 2. Hilbert Spaces, Orthogonality, and the Dirichlet Principle
Hence
Definition 2. We set
and call (. I ·)E the energetic inner product to a(·, .). The energetic norm
is defined through
1
IluliE := (u I u)l: for all u E X.
The linear space X equipped with (. I ·)E is called the energetic space
X E to a(·, .).
Corollary 3. Let X be a real Hilbert space, and assume that the bilinear
form a: X x X -+ lR satisfies assumption (H2) of Theorem 2.C.
Then, the following are met:
2.7. Applications to Boundary-Value Problems 145
(iii) A set is dense in the energetic space X E iff it is dense in the original
Hilbert space X.
Proof. The inequality (45) follows from (H2).
Obviously, (. I ')E is an inner product on X, since a(·,·) is bilinear and
IluliE = 0 implies Ilull = 0, i.e., u = 0, by (45).
Let (un) be a Cauchy sequence in X E . By (45), this is also a Cauchy
sequence in X, and hence it is convergent in X. Furthermore, (45) tells us
that (un) is also convergent in X E . Thus, X E is a Hilbert space.
Assertion (iii) follows immediately from (45). 0
Corollary 4. Assume (HI) through (H3) of Theorem 2.C. Then, we get the
following crucial error relations for the Ritz method:
Proof. Apply Theorem 2.C to the energetic space X E . This situation cor-
responds to c = d = 1. By Theorem 2.C(iv),
f(xo)
u
u = u(x)
Xo (3 x
FIGURE 2.10.
This problem allows the following physical interpretation5 (cf. Figure 2.10):
J:
F (u) = total potential energy of the string;
2- 1 u '2 dx = elastic energy of the string;
5That is, the force (J: f(x)dx) e acts on the part of the string over the
interval b,6].
2.7. Applications to Boundary-Value Problems 147
Furthermore, we set
where (. I ')E and I . liE are called the energetic inner product and the
energetic norm, respectively. Obviously,
dF(u + tv) I
dt
= 1(3 (u'v' -
a
fv)dx = 0 for all v EW~(a,,8). (47**)
t=O
where
«(3-y)(x-a)
ifa~x~y~,8
{
Q(x, y):= «(3-~~~-a)
(3-a ifa~y<x~,8
u: U'dx) ,
We set
and Ilull> .~
6For historical reasons, the principle of virtual power is frequently called the
principle of virtual work.
148 2. Hilbert Spaces, Orthogonality, and the Dirichlet Principle
Recall that
Proposition 1. Let the force density f E L2 (0:, (3) be given. Then, the
following hold true:
o
(i) The variational problem (47*) has a unique solution u EW~(o:,f3).
This is also the unique solution of the variational equation (47**).
We obtain
(ii) If f E C[o:, f3], then the original boundary-value problem (46) has a
unique solution u E C2 [0:, f3], which is given by the integral formula
(48). This function u is identical to the unique solution of the two
variational problems (47) and (47*).
o
Corollary 2. Set X :=W Ho:,(3). Let X E denote the linear space X
equipped with the energetic inner product (. I .) E. Then, X E is a real Hilbert
space.
Roughly speaking:
Integral operators are frequently inverse operators to differential opera-
tors.
In Section 4.5, we will use the integral formula (48) in order to reduce
boundary-eigenvalue problems to integral equations and to solve them. In
this connection, it is important that the Green function is symmetric, i.e.,
g(x, y) = g(y, x) for all x, y E [0:, f3].
2.7. Applications to Boundary-Value Problems 149
It was one of the main goals of functional analysis in the twentieth cen-
tury to generalize the results above to more general elliptic partial differen-
tial equations.
The Ritz method for solving numerically the original problem (46) will
be studied ahead.
o
Proof of Proposition 1. We set X :=W~(a, {3) along with
w(a) = w({3) = 0,
= ({3 - x) i X
(y - a)f(y)dy + (x - a) 1{3 ({3 - y)f(y)dy,
and hence
({3 - a)u'(x) = - i X
(y - a)f(y)dy + ({3 - x)(x - a)f(x)
which yields
and hence
J: (u 2 + u '2 )dy :::; ((/3 - a)2 + 1) J: u '2 dy,
wEX.
by means of the so-called method of finite elements, let us divide the interval
[a, (3] into n + 1 equal subintervals, i.e.,
ao = a < a1 < ... < an < a n+1 = (3, (54)
i = 1, ... ,n,
is a piecewise linear function with
for all j =I i
(cf. Figure 2.1l(a)). We also set
Then, Un E Xn iff
n
Un = Lainein with a1n, ... ,ann E JR.
i=l
(a) (b)
FIGURE 2.11.
Each basic function ein satisfies the boundary condition ein (a)
ein(3) = 0 of the original problem (53). Hence
F(u) := 1{3 (T u
l '2 - uf)dx = min !, (56)
(57)
j = 1, ... ,no
j = 1, ... ,n,
2.7. Applications to Boundary-Value Problems 153
More precisely, for all n = 1,2, ... , we get the following error estimates:
(1:
1
Lemma 4.
(ii) For all u E C 2 [0:,,8] with u(o:) = u(,8) = 0 and all n = 1,2, ... ,
Then the function Vn is piecewise linear and v( aj) = 0 for all j = 0, ... , n +
1. Consider a fixed subinterval [aj, aj+l], j = 0, ... , n. Since v( aj)
v(aj+l) = 0, it follows from the mean value theorem of calculus that
v'(~) =0
Hence we get the key formula
v'(X) = lX vl/(y)dy
154 2. Hilbert Spaces, Orthogonality, and the Dirichlet Principle
This implies
Iv'(xW ~ l aH1
dy l aH1
Iv"1 2 dy
l
J J
aj
= hn +
1
Iv"1 2 dy.
J
Hence
J: J:
over j, we get
o
Ad (iii). The set 00'(0:,(3) is dense in X =W§(o:, (3). By Corollary 3 in
Section 2.6,00'(0:, (3) is also dense in X E . Thus, assertion (ii) implies (iii),
since h n ---+ 0 as n ---+ 00. D
b(u) := J: fudx.
Problems (V), (V n ), and (En) correspond to (56), (57), and (58), respec-
tively. By (50),
Ib(u)1 = If: IUdxl ::; 1III1211ul12 ::; ((3 - a)11111211u11E for all uE X E .
n = 1,2, ... ,
by Lemma 4. It follows from Proposition 1 that the solution u of (V) is
also a solution of the original boundary-value problem (53), i.e., -u" = 1
on [a, (3J. Hence
(61)
By Example 10 in Section 2.5,
Hence
Ilu - unll~ = (u - Un IW - Wn)E.
YE ]a,;3[.
That is, let us study the following boundary-value problem:
-u"(x) = fy,E(X), a < x < (3,
(63)
u(a) = u((3) = 0,
where fy,E: [a, (3] -+ lR. is continuous with
2: 0 if x E [y - c, y + c]
fy,E(X)= { 0 ifxrj.[y-c,y+c],
lf3 fy,E(x)dx = 1
(cf. Figure 2.12). By (48), the solution u = Uy,E of (63) is given through
a y-c y y + c (3
FIGURE 2.12.
U1L~-+----t~~(.,y)
I~I'
e
a y (3
(a) (b)
FIGURE 2.13.
(65)
€ ...... +0
FIGURE 2.14.
158 2. Hilbert Spaces, Orthogonality, and the Dirichlet Principle
Along with (65) we get the following physical interpretation of the Green
function g:
For fixed y E la, ,8[, the function
xl---+g(x,y)
F=e, (67)
+oo if x = y
8y (x):= { 0 (68a)
if x # y,
along with
1 'Y
8
8y (x)dx:=
{I if Y E b, 8]
0 if y ~ b,8], (68b)
lim
e--->+O
1'Y
8
fy,e(x)dx = 188Y(X)dX,
'Y
i.e.,
ol"'lu
o"'u = O~fl ... o~ft .
For a = (0, ... ,0), we set o"'u := u.
Proposition 2 (Integration by parts). For all u, v E CO" (G) and all mul-
tiindices a,
(71)
iff
160 2. Hilbert Spaces, Orthogonality, and the Dirichlet Principle
(i) there exists a compact subset K of the open set G such that
i.e.,
max lao: ¢n (x) - ao: ¢( x) I ---+ 0 as n ---+ 00 for all multiindices 0:.
xEK
¢n ---+ ¢ in V( G) implies
(73)
i.e., the Hilbert space L2 (G) can be identified with a linear subspace of the
linear space V' (G) .
Then
(i) U is a generalized function, i.e., U E V'(G).
(ii) If u =v in the Hilbert space L 2 (G), then U =V in V'(G).
2.8 Generalized Functions and Linear Functionals 161
¢n -+ ¢ in V(G) as n -+ 00
This implies
Hence V = 8C<u.
Proof. The functiona18C<U: V( G) ---+ JR. is linear. In fact, for all ¢, 'lj; E V( G)
and all a, b E JR.,
in V(G)
Standard Example 9. For fixed y E JR., define the function u: JR. ---+ JR.
through
if x < y
u(x):= if x;::: y. {~
Then
U' = by in V' (JR.),
where U denotes that generalized function that corresponds to u.
I:
Proof. For all ¢ E V(JR.),
U(¢) = u(x)¢(x)dx.
2.8 Generalized Functions and Linear Functionals 163
Hence
U'(¢) = -U(¢') = _ ~oo ¢'(x)dx = ¢(y) = 8y(¢),
as n -+ 00 in V'(G) (77)
Un -+ U
Standard Example 12. Let -00 < 0: < {3 < 00 and y E ]0:, {3[. Suppose
that we are given a family (fy,fJ of continuous functions fy,c: ]0:, {3[-+ lR as
considered in Section 2.8.1. Then,
as c -+ +0 in V'(o:,{3),
where Fy,c denotes the generalized function that corresponds to fy,c. This
is a rigorous formulation of (68c).
Proof. Let ¢ E V(G). By the mean value theorem, there is ayE [y-c, y+c]
such that
This yields
lim Fy,c(¢)
c-->+o
= ¢(y). o
Applications to electrostatics can be found in Problem 2.9.
164 2. Hilbert Spaces, Orthogonality, and the Dirichlet Principle
By (48),
= 1(1 -
Y
y)x¢>"(x)dx + 11(1 - x)y¢>"(x)dx.
With ¢>(O) = ¢>'(O) = ¢>(1) = ¢>'(1) = 0, integration by parts yields
- (1 - y)y¢>'(y) + 11 y¢>'(x)dx
= -(1- y)¢>(y) - y¢>(y) = -¢>(y) = -8y (¢». D
Theorem 2.D (The perpendicular principle). Assume (H). Then, the min-
imum problem (80) has a unique solution v, and u - v E MJ..
u=V+w,
166 2. Hilbert Spaces, Orthogonality, and the Dirichlet Principle
FIGURE 2.15.
where
Again by Theorem 2.A, problem (80*) has a unique solution v, and hence
(80) has a unique solution v.
Finally, let X be a Hilbert space over lK. We want to show that u - v E
Ml... Since v is a solution of (80), we get
0= (v - vd + (w - WI),
dim N(f)l. = 1.
2.12 Duality for Quadratic Variational Problems 169
J:X -+ X*
Finally, we set
Theorem 2.F (Duality). The original minimum problem (85) has a unique
solution Ua.
The element Ua is also the unique solution of the dual maximum problem
(85*), and the extremal values of (85) and (85*) are the same, i.e.,
F(ua) = F*(ua).
Moreover, we have
and
cllua - u111::::: a(ua - U,Ua - u) ::::: F(u) - F*(v). (88)
Proof. By Theorem 2.A in Section 2.4, problem (85) has a unique solution
Ua that satisfies the variational equation
By construction of Y,
Hence
This implies
F*(v) :=::: F(ua) for all v E Va + y.
Furthermore, we shall show that Ua E Va +Y and
uaEX, Va-UaEX~.
xJ..
Vo - Uo
"---......----...-----x
Uo
FIGURE 2.16.
It remains to show that (iii) =? (i). To this end, we consider the minimum
problem
T1a(u,u) - b(u) = min!, UEX, (93)
as in Section 2.4. Let us introduce the energetic inner product on X through
for all u E X.
Theorem 2.G. For each given z E X, problem (94) has a unique solution
u.
Proof. We will use the Banach fixed-point theorem. The idea of our proof is
to replace the original equation (94) by the equivalent fixed-point problem
where
m:=1-2tc+t2L2.
By (96), m;:::: 0. If t °
= or t = i~, then m = 1. This implies
In addition, it follows from the Banach fixed-point theorem that, for each
given Uo E X and each fixed t E ]0, i~ the iteration method[,
Un+l = Un - t(Au n - z), n= 0,1, ...
converges to the unique solution u of the original problem (94). Moreover,
we have the error estimates
for n = 1,2, ....
Proof. By (H2) and the Riesz theorem in Section 2.10, for each w E X,
there is an element called Aw such that
Hence
IIAu - Avll = sup I(Au - Av I w)1 ::::: Lllu - vii for all u, vEX.
Ilwll:::;l
It follows now from Theorem 2.G in the preceding section that equation
(98) has a unique solution u. 0
assumptions (H2) and (H3) are satisfied. Then, Theorem 2.H is called the
linear Lax-Milgram theorem. If, in addition, a(·, .) is symmetric, then prob-
lem (97) is identical to the variational equation from Theorem 2.A. By
Section 2.13, it is motivated that
Problems
In Problems 2.9ff the importance of generalized functions for mathematical
physics will be studied on an elementary level. Moreover, in Problems 2.12ff
we will study in detail a smoothing technique that plays a fundamental role
in modern analysis (Friedrichs' mollification).
F(u) := /1
~1
(xu'(x))2dx = min !, u E G 1 [-I, 1], u( -1) = 0, u(l) = 1.
(P)
Use the sequence
._ 1 1 arctan nx
Un ( X ) .- - +- , n = 1,2, ... ,
2 2 arctan n
in order to show that this variational problem has no solution. Recall
that G 1 [-1, 1] denotes the space of continuously differentiable functions
u: [-1,1] -dR.
Solution: Set M := {u E G 1 [-I, 1]:u(-I) = 0 and u(l) = I}. Then,
problem (P) can be written in the following form:
F(u n ) = 1
2n 2 . arctan n
/1 ~1
n2x2
(1 + (nx)2)2
ndx
= 1
2n 2 . arctan n
/n y2
-n (1 + y2)2
dy <
1
- 2n 2 . arctan n
/00
~OO (1
y2
+ y2)2
dy.
Hence F(u n ) ---+ 0 as n ---+ 00. Since F(u) ~ 0 for all u E M, this implies
inf F(u) = 0,
uEM
F(u) = 0, uEM,
and hence
xu'(x) =0 for all x E [-1,1].
This implies u'(x) = 0 on [-1,1]' i.e., u(x) = const. But, this contradicts
the side condition u( -1) = 0 and u( 1) = 1.
Problems 177
We shall discover in Chapter 2 of AMS Vol. 109 that the reason for this
bad structure of (P) is related to the fact that the Banach space C 1 [-1, 1]
is not reflexive.
A detailed historical discussion can be found in Zeidler (1986), Vol. 2A,
Sections 18.7 through 18.9.
2.2. The classical Hilbert space l~. By definition, the space l~ consists of
all the sequences (U n )n>l with Un ElK for all n E fir and
The linear operations are defined as in Problem 1.5 for lK oo • Show that
1~ is an infinite-dimensional Hilbert space over lK equipped with the inner
product
00
(u I v) := L UnVn '
n=l
2.3. Simple identities. Let X be a pre-Hilbert space over lK with the inner
product (. I .). Show that the following hold true:
(ii) If IK = C, then
4(u I v) = Ilu+vI12_llu-vI12_illu+ivI12+illu-ivI12 for all u,v E X.
(99b)
(iii) Appolonius' identity. If IK = JR, C, then
2.4. The Banach space Ora, b]. Let -00 < a < b < 00. Show that the
Banach space C[a, b] equipped with the usual maximum norm
Ilull = maxa$x$b lu(x)1 is not a Hilbert space.
Hint: Prove that the parallelogram identity is violated.
Use the same method in order to show that the Banach space C of
complex numbers equipped with the norm
2.5. * The role of the parallelogram identity. Let X be a normed space over
K Show that X is a pre-Hilbert space iff the parallelogram identity holds,
i.e.,
for all u, vEX.
Hint: Use (99a) and (99b). Cf. Jordan and v. Neumann, On inner prod-
ucts in linear metric spaces, Annals of Math. (1935), 719-723.
Show that
(i) Xc becomes a complex pre-Hilbert space with the inner product
(a) L = Ll..l...
(b) L is closed iff L = Ll..l...
Determine the coefficients Cll ... , C2n. Show that (U2n) converges uniformly
on [0,7f] to the solution u of (V).
Hint: Cf. Zeidler (1986), Vol. 2A, p. 94.
u(x) = {(~~-l~! if x ~ 0,
o if x < O.
Hint: Use a similar argument as in Standard Example 9 from Section
2.8.
180 2. Hilbert Spaces, Orthogonality, and the Dirichlet Principle
2.9b. The electric field of a charged point. Let us first consider the classic
basic equation of electrostatics in a vacuum:
Here, p denotes the charge density and co denotes the so-called dielectricity
constant in a vacuum. The function u is called the electrostatic potential
caused by the charge density p. If there is a particle P at the point x of
charge Q, then the force K acts on P, where
Here, the vector E(x) is called the electric field at the point x.
Suppose now that p corresponds to a point of charge q at the origin
x = O. Then, formally we get
p(x) = {O ~f x ~ 0
00 If x = 0
and flR 3 p(x)dx = q. Thus, p corresponds to the Dirac distribution q8.
Consequently, let us replace the original equation (100) with the equation
-coLlU = q8 (101)
E(x) _ qx (103)
- 471'colxl 3
This is the classic electric field caused by a charged point at the origin with
charge q. The corresponding force K = QE(x) is the Coulomb force.
Solution: Ad (101). To simplify notation, set q = 471' and co = 1. Let
U(¢) = f ¢(x)dx
JlR 3 Ixl
By (101), we have to show that -(LlU)(¢) = 471'8(¢) for all ¢ E CQ'(JR3 ).
Since Ll = or+ o~ + o§, the definition of derivatives of generalized functions
yields
-U(Ll¢) = 471'8(¢)
Explicitly, this means that
(104)
Problems 181
To prove this, set G := {x E JR3: 0 < r < Ixl < R}. Choose R so large
that ¢(x) = 0 if Ixl ~ ~. Using spherical coordinates, we get dx =
r2 sin iJ dr diJ d¢. Hence
1 IR3
/l¢(x)
- II- dx
x
= 1271" 171"
0 0
lRr=O
/l¢rsiniJdrdiJd¢.
Thus,
r /l¢(x) dx = lim r /l¢(x) dx.
lIR3 Ixl r~O le Ixl
Integration by parts yields
+ 1 (Ixl1
8e
a¢- ¢ -
-- a -
an an
(1))
Ixl
dO
'
where tn denotes the derivative in the direction of the outer unit normal
vector, i.e., tn = - tr for Ixl = r. Since /l (1;1) = 0on Gand ¢ vanishes
in a neighborhood of the set {x: Ixl = R}, we get
J = 1Ixl=r (--1- a¢
r ar
+ ¢-
a
ar
(1))
-
r
r 2 smiJdiJd¢.
•
By the mean value theorem, there are points y and z with Iyl = Izl = r
such that
This is (104).
Ad (103). By the definition of the derivative of generalized functions, we
have to show that
= r grad
lIR3
¢(x) dx
47rcolxl
for all ¢ E CO'(JR3 ),
where E(x) = 471"!oixI 3 ' However, this follows as above by using integration
by parts.
Table 2.1
1
-/:lU = 8 on JR3 u(x) = 47flxl
(potential equation) (potential)
e-lxI2(4t
F(cjJ) = ( f(x)cjJ(x)dx
JJRN
For all generalized functions V E V'(JRN) and all f E L1,o(JR N ), we define
the convolution U * F through
(i) U * FE V'(JRN).
(ii) 8 * F = F.
Convince yourself that this follows simply from the corresponding defi-
nitions.
Problems 183
L:= L aa Da , (105)
lal<;m
LV=F
Solution: By Problem 2.9d,
v(x) = r 47rcop(y)dy
JlR 3 Ix - yl '
(106)
-co~V = p (107)
The function v from (105) is called the classic volume potential. Recall
from classic analysis that v is a classic solution of (107) provided p is
sufficiently smooth (e.g., p E C 1(JR3)).
Solution: Using spherical coordinates, it follows as in Problem 2.9b that
sup
xEG
1
B
I
X
dy
- Y
12 < 00,
11 lR 3
p(Y)dYI2
-I- I ::;
X - Y
1
B
p(y) dy
2 1IB X -
dy
Y
12 < 00,
provided p vanishes outside the ball B. Thus, the function v is well defined
on JR3 and bounded on each ball. Let f:= flR 3 ' For all ¢ E C(f'(JR3), set
By the Tonelli theorem (cf. the appendix) and by the substitution Z := x-y,
we get
Thus, W = v.
2.9g. Generalized plane waves. Let x := (6,6,6) E JR3, t E JR, and
consider the wave operator
for all functions X = x(x, t) with X E CO'(JR N + 1 ). Here, 8 acts on the time
variable t.
To motivate this definition, let us formally consider the product
(u 129 8)(x, t) = u(x)8(t).
If U denotes the generalized function to u, then formally we get
(U 0 V)(X) = U(V(X(x,'))
LU=8
(112)
(i) ¢ E CO'(lR,N).
(ii) ¢ 2: 0 on lR,N.
Hence:
-€ €
FIGURE 2.17.
{ lu(y)ldy = ( 1· lu(y)ldy
lBe lBe
Observe now the identity 1>u = 1>! (1)!u). Thus, by the Schwarz inequality
1
(16), we get
lu g(x)12 :::; 1>(z)lu(x - cz)1 2dz,
and hence
Ug(X) - u(x) = L(u(x - cz) - u(x))1>(z)dz.
fa
that
Iu(x - cz) - u(xWdx < 'f/,
Problems 189
for all z E B and all c: 0 < c ~ co. Thus, it follows from the Fubini-Tonelli
theorem (see the appendix) that
This is b).
( ) .= {u(x)
v x. 0
on C
on G - C.
Then
l G
lu - Vl2dx = 1G-C
lul 2 dx.
By the absolute continuity of the integral (see the appendix), the right-
hand integral is arbitrarily small provided the measure of the set G - C is
sufficiently small. Thus, for each given 'fJ, we can choose the set C in such
a way that
G
'y
/ ' - - ------------
H
-- //
FIGURE 2.18.
la-H
r lul 2 dx < 'Tl 2 ,
BIn fact, for each point x, there exists an open ball B around x such that
B ~ BeG. Since C is compact, a finite set of such balls already covers C. Call
the union of these balls H.
Problems 191
2.14. Show that both COO(G)c and C(G)c are dense in L2(G)c.
Hint: Use the same arguments as above.
Hence Ilv - qll* :::; IIv - pll* + lip - qll* < 2c. This implies
(l
1
Ilv - qll = b
Iv - qI2dX) "2 :::; (b - a)! Ilv - qll* < (b - a)!2c.
That is, the set M is dense in L2 (G). Since the set M is countable, the
space L2 (G) is separable.
2.15b. Let G be an unbounded open interval in JR, e.g., G = R Show
that L2 (G) is separable.
192 2. Hilbert Spaces, Orthogonality, and the Dirichlet Principle
Define
1 if x E G n
Xn(x):= { 0 if]R - G n ,
and
Moo := {Xnq:q E M and n = 1,2, ... }.
1 G-J
luI2dx < E;2,
1 G-G n
lul2dx < E;2.
Hence
Consequently, the countable set Moo is dense in L2(G), i.e., L2(G) is sep-
arable.
2.17. Let G be a nonempty open set in ]RN, N:::::: 1. Show that L2(G) and
L~ (G) are separable.
Hint: Use the same arguments as above. Replace the one-dimensional
Weierstrass approximation theorem with the N-dimensional Weierstrass
approximation theorem (cf. Problem 1.19b in AMS Vol. 109).
= L¢(gj(O))lgj(O)1 = L¢(Xj)I!'(Xj)I- I .
j j
3
Hilbert Spaces and Generalized
Fourier Series
f(x) (1)
i:
k=l
with the so-called Fourier coefficients
i:
ak := 1T- 1 f(x) cos kx dx, k = 0, 1,2, ... ,
From the physical point of view, relation (1) tells us that the 21l"-periodic
"oscillation" f can be represented as a superposition of simple "harmonic
oscillations"
h(x) := coskx, sinkx, (2)
of period ~, where k = 1,2, .... The Fourier coefficients ak and bk cor-
respond to the amplitudes of the harmonic oscillations (2). For example,
if lak I is large for some k, then the harmonic oscillation x ....... cos kx con-
tributes strongly to the oscillation f. Therefore, physicists are interested
in such periods 2;:
for which lak I or Ib k I are large with respect to the other
Fourier coefficients.
For example if f: JR -+ JR has the period 21l" with
then
cos3x cos5x
f( x) =cosx+ - 2- + - - + ... for all x E JR.
3 52
Figure 3.1 represents schematically the corresponding superposition.
In the nineteenth century, many mathematicians studied in detail the
convergence of Fourier series. In 1876, du Bois-Reymond obtained the sur-
prising result that there are continuous functions f for which the Fourier
series f does not converge at each point x E JR. This counterexample shows
that the classical convergence of infinite series is not the right concept for
solving the fundamental convergence problem for Fourier series.
In 1907, Fischer and Riesz proved independently that
A natural answer to the convergence problem for (1) can be given in
terms of the Hilbert space L2 (-1l", 1l").
In this chapter we will show that this is a special case of an abstract
result on complete orthonormal systems in Hilbert spaces. In particular, it
turns out that, for each function
the Fourier series (1) converges in the Hilbert space L 2 ( -1l", 1l"), i.e.,
lim Ilf -
n-+oo
snll = 0,
where
n
sn(x) := 2- 1ao+ 2::::ak coskx + bksinkx,
k=l
(a)
00
JVV~ k 0
(b) (c)
FIGURE 3.1.
We shall also show that this corresponds to the convergence of the Gauss
method of least squares.
In the nineteenth century, mathematicians and physicists also used more
general series expansions than (1) of the following form:
L Ckik(X),
<Xl
where G is a nonempty open set in JRN, and the functions fo, iI, ... satisfy
the so-called orthogonality relation
°
Here, 8km = if k =I- m and 8km = 1 if k = m. Observe that (. I .) represents
the inner product in the Hilbert space L2(G).
Using (4), it is easy to compute the unknown coefficients Ck formally.
Namely, multiplying (3) with fm(x) and integrating over G, it follows for-
mally from (4) that
Our abstract results will show that the infinite series (3) converges in the
Hilbert space L2(G) provided the set of all the real linear combinations
with
-00 < k < 00. (5*)
dfd(x)
x
= (21f)-~ Joo ika(k)eikxdx.
-00
I:
ample, if f(x) == 1, then
But this integral does not exist. In Section 3.7 we shall show that
The extension of the Fourier transformation to generalized functions of
the class 8' (lR N) overcomes the classical difficulties.
In order to describe this formally, choose
f(x) := 8y(x),
Thus, in the formal language of physicists, the "Dirac function" 8y has the
Fourier transform k f-> (21r)-!e- iky . In particular, the "Dirac function" 80
has the constant Fourier transform
(H) Let X be a Hilbert space over II{ = JR, C, and let {UO,Ul, ... } be a
finite or countable orthonormal system in X, i.e., by definition,
~)Un I u)un.
Sm := (9*)
n=O
The numbers (Un I u) are called the Fourier coefficients of u.
Definition 1. Assume (H). The finite orthonormal system {UO, ... , UN} is
called complete in X iff
N
U = I)un I U)Un for all U E X. (10)
n=O
The countable orthonormal system {UO, Ul, ... } is called complete in X
iff the infinite series (9) converges for all U EX, i.e.,
U = m--->oo
lim sm for all U E X.
(Uk I u) = m~oo
lim (Uk I~
~
lim ~ Cn(Uk I un) = Ck·
cnun) = m~oo~ 0
n=O n=O
This motivates the ansatz (9). Let us give a second motivation for (9).
In order to get a good approximation of u by a linear combination COUo +
... + CmU m ' we set
Proof. By (8),
n=O n=O
The smallest value of f is attained for Cn = (un I u), n = 0, ... , m. 0
Ilu - sml1 2 = IIul1 2 - 2: I(u n I u)1 2 for all u E X and all m. (14)
n=O
202 3. Hilbert Spaces and Generalized Fourier Series
cn E OC for all n,
m+k
L Ic n l 2 , (16)
n=m+l
It follows from Proposition 5 and the Bessel inequality (15) that for each
U E X the Fourier series is convergent, i.e., there is some v E X such that
00
v = L(un I u)un .
n=O
However, it is possible that v i= u. But if the orthonormal system {un} is
complete, then v = U for all U EX.
L
00
(ii) For all u E X, the Bessel inequality is replaced with the so-called
special Parseval equation
IIul1 2= L
00
I:
Recall that the inner product in the Hilbert space L 2 ( -7f, 7f) is given
through
(u I v) = u(x)v(x)dx.
204 3. Hilbert Spaces and Generalized Fourier Series
Proposition 1. The set {Uo, Ul, ... } forms a complete orthonormal system
in the Hilbert space L2 ( -7r, 7r).
This proposition tells us that for each U E L2 ( -7r, 7r) the Fourier series
00
I:
where
I:
ak := 7r- 1 u(x) cos kx dx,
Corollary 2. For each U E L 2( -7r, 7r), the classic Fourier series converges
in L 2( -7r, 7r), i.e.,
where m = 0,1, ... , and all the coefficients an, f3n are real numbers. It
follows from the classical addition theorems for sin(·) and cos(·) that
We also set
IIJIIC[a,bj := max IJ(x)l·
a::; x::; b
Lemma 3. For each Junction J E C[-7r,7r] with J( -7r) = J(7r) and each
c > 0, there exists a Junction pET such that
Proof. Step 1: Let J be even, i.e., J( -x) = J(x) for all x E [-7r, 7r]. The
function
ep(x) := cos x
is strictly decreasing on [0, 7r]. Since y ~ J(ep-l(y)) is continuous on [-1,1],
it follows from the Weierstrass approximation theorem (Proposition 2 in
Section 1.26) that there exists a polynomial p(y) = Co + elY + ... + Cnyn
such that
max IJ(ep-l(y)) - p(y)1 < C.
-1::;y::;I
where q(x) := p(cosx), and hence q E T. Since J and q are even, we also
get
Step 2: Let J be odd, i.e., J( -x) = - J(x) for all x E [-7r,7r], and let
J(O) = J(7r) = O. Choose 8> O. Set
Finally, let g(x) := -g( -x) if -7r :::; x:::; O. Since J is uniformly continuous
on [-7r,7r], we get
C
-~~7r IJ(x) - g(x)1 < :2
for sufficiently small 8 > O. Applying Step 1 to the even continuous function
x ~ ;i~~ on [-7r, 7r], it follows that there exists a q E T such that
- -q(x) I <-.
g(x) c
-7r::;X::;7r Ism x
max -.
2
Changing continuously the function f near the point x = 7l', we may assume
that f( -7l') = f(7l'). By Lemma 3, there exists a function q E T such that
Using
e±inx = (cos nx ± i sin nx), n = 1,2, ... ,
relation (20) follows from
einx _ e- inx
cosnx = sin nx = --::-:---
2 2i
and
m=±1,±2,... ,
by a simple computation.
By Corollary 4, the set T = span {uo, Ul, ... } is dense in L2 ( -7l', 7l'). It
follows from Theorem 3.A that the orthonormal system {un} is complete
in L 2 ( -7l', 7l'). D
3.3 The Schmidt Orthogonalization Method 207
Un+1 := (21*)
°
Thus, (U m I un+d = for m = 0, ... , n, and (UnH I un+d = 1.
If WnH = 0, then we use Vn+2, and so forth. This way we obtain an
orthonormal system {u m }.
By induction, it follows that all the Vm are finite linear combinations of
the Un. Hence the linear hull of {Um } is dense in X. If {u m } is countable,
then Theorem 3.A tells us that {u m } is complete.
If {u m } is finite, then span {u m } = X, since each finite-dimensional
linear subspace of a Hilbert space is closed, by Corollary 7 in Section 1.12.
Thus, {un} is again complete. D
Method (21) for constructing the orthonormal system {un} is called the
Schmidt orthogonalization method. The following two results will be used
critically in the next section.
Proof. Ad (a). It follows from (22) that 000 = I vIa II ' and hence
Vo
Uo = Ilvoli'
Let n 2:: 1. By (23),
Hence
n
Un+I = an+IVn+I + 2..= 13m u m,
m=O
where 130, ... ,13n E lK are appropriate coefficients. Using (Uk I un+r) = 0
for k = 0, ... ,n, we get 13k = -an+I(uk I v n+!). By (21),
Corollary 3. Let {VO, VI, ... } be a sequence in the Hilbert space X over lK.
Suppose that
Proof. Let S := span{vo, VI," .}. By (24), (S)-L = {O}. Thus, Corollary 1
from Section 2.9 tells us that X = S. 0
-1 S x S 1.
3.4 Applications to Polynomials 209
()
unx-
_(2n+1)~ dn(2
1
)n
-x-1, n= 0,1, .... (25)
n! 2n+2 dxn
The polynomials
n = 0,1, ... ,
Proof. Step 1: We show that the system {un} defined by (25) represents
an orthonormal system in L2 (-1, 1), i.e.,
n = 0,1, ....
= (2m)! J1
-1
(m,)222m+1
(1 - x2)mdx = -'--::.-'----:--
2m + 1
Step 2: The set span {v n } is dense in L2 ( -1, 1). This follows from the
density of the set C[-l,l] in L2(-1,1) and from the fact that span {v n }
is dense in the Banach space C[-l,l] by the Weierstrass approximation
theorem (cf. the proof of Corollary 4 in Section 3.2).
210 3. Hilbert Spaces and Generalized Fourier Series
n = 0, 1, ... ,
Proof. Step 1: We show that the system {un} defined by (26) forms an
orthonormal system in L~( -00,(0) with IK. = lR or IK. = C, i.e.,
U~(X) + (2n + 1 - X2)u n (x) = 0, -00 <x< 00, n = 0,1, ... , (28)
u~(x) + (2m + 1- X2)U m(x) = 0, -00 <x < 00, m = 0,1, .... (29)
Observe that, for all a > ° and k = 0, 1, ... ,
lim e- ax2 xk = 0. (30)
x-+±oo
N- jN un(x)u:n(x)dx
I:
= lim Un(X)Um(x)I lim
N-+oo -N N-+oo_N
=- un(x)u:n(x)dx, n, m = 0, 1, ....
3.4 Applications to Polynomials 211
Similarly,
g(k):= Joo
-00
2
e-"'2 u(x)e-ikXdx for all k E M.
Formally,
n = 0, 1, .... (33)
Thus, the majorant condition (33) justifies formula (32) (cf. Parameter
Integrals in the appendix). Consequently, the function 9 is analytic on the
strip M. By (31) and (32),
°
Hence g(k) = for all k E M. By Step 2, u(x) = for almost all x E R°
Step 4: The assertion follows now from Proposition 2 in Section 3.3. 0
(a) X and Yare Banach spaces over lK. The linear operator A: D <;;: X -->
Y satisfies
IIAul1 ~ ellull for all u E D, (36)
where e 2': 0 is a constant.
(b) The set D is a linear dense subset of X.
Proof. Ad (i).
Step 1: Existence. Let U E X - D. Since D is dense in X, there exists a
sequence (un) in D such that Un --> U as n --> 00. In particular, (un) is a
Cauchy sequence. By (36),
by means of
S s
Un ---+ U implies AU n ---+ Au.
3.7 Applications to the Fourier Transformation 215
Obviously, u E S implies
Therefore, the functions u from the linear space S are called rapidly de-
creasing at infinity. In particular, if u, v E S, then
i:
N ..... ±= -N-N
= - u(x)v'(x)dx. (42)
Proof. Set
i:
Formally,
!,(k) = 1-<Xl<Xl x2 .
e-"2e- tkX (-ix)dx =
1<Xl de -
.,2
2
_ _ ie-tkxdx
-<Xl ~
.
<Xl 2
=- 1-<Xl e-"2 ke-ikxdx = -kf(k) for all k E R
Hence
k2
f(k) = e-"2 f(O) for all k E R
By (44), f(O) = (21f)-!. o
Let us now study the so-called Fourier transformation
We set Fu:= a.
and
a(ql(k) = (-i)qF[xqu(x)](k), (48b)
where p, q = 0, 1, ....
3.7 Applications to the Fourier Transformation 217
for all k E JR, q = 0,1, .... This can be justified rigorously by using the
majorant condition
for all k E JR
kPa(q)(k) = (21f)-2 1 1 00
u(x)( -ix)q( -i)-P dPe-
ikx
p
dx
1
-00 dx
This implies
Iiallp,q ~ const(p, q) Ilull q+2,p.
Recall that a = Fu. Since F: S ----+ S is linear, we get
S
Un ---+ U implies
i: i:
Step 3: Let us prove the following key formula:
= 1
00
-00
2
Observe that
(51)
Instead of (51), let us write
Ga:=u.
This way we obtain the linear sequentially continuous operator G: S ~ S.
Observe that G is obtained from F by replacing e- ikx with e ikx , and use
the same argument as in Step 2.
Step 5: We want to prove that the linear operator F: S ~ S is bijective.
By (51) with a = Fu, we get
I: I:
Step 6: Let us prove (47). By (49) with y = 0,
Set b = w. Then
for all k E R
I: I:
Hence
This is (47). o
Proposition 5. The Fourier transformation F: S ----+ S can be uniquely
extended to a unitary operator
Since the set CO' (JR k is dense in L~ (JR), so is the set S. By (47),
where (. I .) denotes the inner product in L~(JR). The assertion follows now
from Standard Example 2 in Section 3.6. 0
and, as n - t 00,
S
Un ----+ U implies TUn -t Tu.
The elements T of S' are called tempered generalized functions (or tem-
pered distributions).
Consequently, FT E S'.
Let S E S'. Define
for all u E S.
Then,
(FT)(u) = S(F- 1 Fu) = S(u) for all u E S.
Hence FT = S, i.e., F: S' - t S' is surjective.
Moreover, let FT = O. Then T(Fu) = 0 for all u E S, and hence Tv = 0
for all v E S, i.e., T = O. Thus, F: S' - t S' is bijective.
Let a, (3 E C and T, S E S'. Then
Then, T E S'.
Hence
-00
Iv(x)~ (1
1+x
+ x2)lu n (x) - u(x)ldx
(i) 8y E Sf.
for all u E S.
Hence
8y(u n ) --+ 8y(u) as n --+ 00.
i:
Ad (ii). For all u E S,
Remark 6 (The language of physicists). Instead of (ii) and (iii) from Stan-
dard Example 5, physicists formally write
and
for all x,y E:K (55)
Problems
3.1. Density. Let M be a subset of a Hilbert space X over K Show that
the set span M is dense in X iff
(u I v) = 0 for all v E M implies u = O.
e- ikx U a ,(3(x)dx
k2
where w(k) := e-4;J.
1:
To prove that span F(D) is dense in X, by Problem 3.1, we have to show
that
1:
implies v(k) = 0 for almost all k E JR. In fact, from (57) we get
Since V,W E L~(JR), we get J~oo Iv(k)w(k)ldk < 00. Thus, Problem 3.5
implies that v(k)w(k) = 0 for almost all k E JR, and hence v(k) = 0 for
almost all k E R
3.7. * The fundamental Payley- Wiener theorem. Let us consider the Fourier
transform
224 3. Hilbert Spaces and Generalized Fourier Series
Then, for each fixed R > 0, the following two statements are equivalent:
=
°
(i) The function F: C ---t C is holomorphic and, for each N 1,2, ... ,
there is a constant C N > such that
for all z E C.
(ii) The function f belongs to CO'(lR.)c and vanishes outside the interval
[-R,R].
3.8. The tensor product X I8i Y. Let X and Y be linear spaces over lK, and
let X* denote the space of all linear functionals u*: X ---t lK. Define
(ii) The symbol "®" behaves like a product, i.e., for all u, v EX, w, z E Y,
and a, (3 E lK, we get the following distributive laws:
and
L ajk(Uj ® Vk) = 0
j,k
226 3. Hilbert Spaces and Generalized Fourier Series
(a I a) = "L!3jk!3jk.
By (iii), a = 0 iff !3jk = 0 for all j, k. This yields (62).
"L Uj 0Vk 0w m,
j,k,m
(ii) The symbol "0" behaves like a product. That is, for all u, v E X,
Y E Y, Z E Z, and a, f3 ElK, we get
the seal of his spirit, and which is now sinking below the horizon,
achieved a more perfect balance than has prevailed before or since,
between the mastering of single concrete problems and the formation
of general abstract concepts.
as n ----7 00.
Since X =I=- {O}, this implies A =I=- 0 and hence IIAII =I=- O.
Step 1 is decisive.
Step 1: Variational problem for constructing an eigensolution. We con-
sider the maximum problem
as n ----7 00.
Set an := (Av n I v n ). Since the real sequence (an) is bounded, there exists
a convergent subsequence of (an). Consequently, there exist a subsequence,
again denoted by (v n ), and a real number A1 such that
as n ----7 00.
Therefore,
IA11 = IIAII > O.
This implies IIAvnl1 :::; IIAllllvnl1 = IA11, and hence
as n ---+ 00,
as n ---+ 00.
This implies
AU1 - A1U1 = 0, U1 EX, IIu111 = 1,
i.e., (U1' A1) is an eigensolution of A.
Step 2: Induction. Let
Y := {u E X: (u 1 U1) = O}.
Then, Y is a closed linear subspace of X. The key to our induction argument
is the relation
A(Y) ~ Y, (4)
i.e., the Hilbert space Y is invariant with respect to the operator A. In fact,
let u E Y. Then
By Step 1,
IA11 = IIAII = sup IIAvll,
Ilvll=l,VEX
and hence
IA11?: IA21 > O.
We now set Z := {u E Y: (u 1 U2) = O} and continue this procedure.
This way we obtain a countable system {un, An} of eigensolutions, i.e.,
234 4. Eigenvalue Problems for Linear Compact Symmetric Operators
as n ---* 00.
Otherwise, IAnl ?: const > 0 for all n, and hence the sequence (A~lun) is
bounded. The operator A is compact. Since
n = 1,2, ... ,
Since
m
IIwml1 2= IIuI1 2- L I(u n I u)12 ~ IIuI1 2 for all m,
n=l
we get
as m ---* 00.
Hence
m
v = L(un I u)un .
n=1
Hence
00 00
n=1 n=1
By assumption (A), it follows from A(v - u) = 0 that v - U = o.
Step 6: We show that each eigenvalue A i=- 0 of A has finite multiplicity.
Since {un} forms a complete orthonormal system in X, A is identical to
some Am, by Proposition 2 in Section 4.l.
For simplifying notation, assume first that A = AI. Since An i=- 0 and
An ~ 0 as n ~ 00, there exists a number N such that Al = ... = AN and
Aj i=- Al for all j > N. Let
Au = AU, U i=- O.
By (5),
00 N
n= 1, ... ,M,
where S:= {Ul, ... ,UM} is an orthonormal system. Let M be the largest
possible number. The construction from Step 2 shows that M = N. Hence
the system S with M = N is complete.
236 4. Eigenvalue Problems for Linear Compact Symmetric Operators
This finishes the proof of Theorem 4.A under the additional assumption
(B). D
Proof of Theorem 4.A. Finally, let us consider the general case. We may
assume that >. = 0 is an eigenvalue of A. Otherwise, we meet assumption
(A) or (B). Set
N(A):= {u E X:Au = a}.
Then, N(A) -I {a}. Since the operator A is continuous, N(A) is a closed
linear subspace of X. In fact, if AUn = 0 for all n and Un - t U as n - t 00,
then Au = O.
By Proposition 1 in Section 3.3, there exists a complete orthonormal
system {wd in N(A). Hence
for all k.
Recall that
Consequently, {WI, Ul, W2, U2, ... } represents a complete orthonormal sys-
tem of eigenvectors of A.
The proof of Theorem 4.A is complete. D
AU - Au = b, uEX, (8)
along with the homogeneous problem
AV - Av = 0, vEX. (8*)
This corollary tells us that the following important principle holds true
for the original problem (8):
Uniqueness implies existence.
(i) If dim X < 00, then the spectrum a(A) of the operator A consists
precisely of all the eigenvalues of the operator A.
(ii) If dim X = 00, then the spectrum a(A) consists of all the eigenvalues
of A together with the point A = O.
Proof of Theorem 4.B. Suppose first that the operator A has a countable
system of nonzero eigenvalues.
By (5), there exists an orthonormal system {un} in X such that
L
00
along with AUn = Anun for all n. The system {An} contains all the nonzero
eigenvalues of A, and
Furthermore,
An
where an := A _ An' (14)
along with An --+ 0 as n --+ 00 that Ian I ::::; const for all n, and hence
L
00
lan(un I bW : : ; constllbl1 2 .
n=l
4.3 The Fredholm Alternative 239
AU = b + an(u n I b)un.
n=1
Hence
AU - Au = b,
i.e., u is a solution of (8). Since this solution is unique, the inverse operator
(AI - A)-I: X ---+ X exists. In addition, (14) tells us that
(16)
AU - Au = AV - Av implies u = v.
240 4. Eigenvalue Problems for Linear Compact Symmetric Operators
Then, Au = band Ilull ::; const Ilbll. Thus, the operator A- 1 : X ----+ X is
continuous, i.e., A = 0 belongs to the resolvent set of A.
Ad (ii). Suppose first that the operator A has only a finite number of
nonzero eigenvalues. Since all these eigenvalues have finite multiplicity and
dim X = 00, it follows from Theorem 4.A that there is some v -I- 0 for
which Av = 0, i.e., A = 0 belongs to the spectrum O'(A).
Suppose now that the operator A has a countable set {An} of nonzero
eigenvalues. Then, An ----+ 0 as n ----+ 00. Since the spectrum O'(A) is compact,
the limit point A = 0 belongs to O'(A). 0
Let -00 < a < b < 00. We are looking for eigensolutions A E IR and
u E L2 (a, b) with u -I- O. To this end, we assume the following:
(HI) The function A: [a, b] x [a, b] ----+ IR is continuous.
(H2) The function A is symmetric, i.e.,
A(x,y) = A(y,x) for all X,y E [a,b].
(u I v):= 1 b
u(x)v(x)dx.
Then, the original equation (17) can be written in the following form:
(i) The original integral equation (17) has a countable system of eigen-
functions {Ul' U2, ... }, which forms a complete orthonormal system
in the Hilbert space L2 (a, b).
(v) For each eigenvalue A -I=- 0 of (17), the eigenfunctions u are continuous
on [a,b].
~~oo 1 (m~(Un
b
u(x) - I u)un(x)
)2 dx = o.
Lemma 3. Assume (HI) above and let X := L2 (a, b). Then, the following
hold true:
Proof. Ad (a), (b). Let u E X, and let lIull denote the norm of u in X. By
the Schwarz inequality,
b
11'lu(y)ldY:S
(
1 )! (
b
dy
b
l1U(YWdy
)! = (b - a)! lIull·
Since the set [a, bj x [a, bj is compact, the function A is uniformly continuous
on [a, bj x [a, b]. Thus, for each c > 0, there is a 8 > 0 such that
x, z E [a, b] and Ix - zl < 8 implies a:= max IA(x, y) - A(z, y)1 < c.
w$y::;b
Hence
Iv(x) - v(z)1 :S a lb lu(y)ldy:S c(b - a)! Ilull, (19)
for all x, z E [a, bj with Ix - zl < 8. This proves the continuity of the
function v on [a, b].
Moreover, we also get
This implies
(l IV(XWdX)
1
IIAul1 =
b
"2 :S const Ilull· (21)
4.4 Applications to Integral Equations 243
(l
1
IIVn - vii =
b
(vn(x) _ v(x))2 dX) 2
lObserve that
f
n=k n=k
< (~I(Un Iu)I' (~IAn"n(x) I') 1 for all x E [a, bj. (23)
By the Bessel inequality (15) from Chapter 3, it follows that for all x E [a, bj
and k, m ?:: 1,
k+=
= 2: I(A(x,') 1 unW ~ IIA(x, ')11 2
n=k
= lb
a
IA(x, y)1 2 dy ~ (b _ a) ( sup
a::;x,y::;b
IA(x, Y)I) 2 ~ canst.
Since the Fourier series L:n (un 1 U)u n converges, it follows from Proposition
5 in Section 3.1 that the series
00
is also convergent. Thus, for each c > 0, there exists a number no(c) such
that
k+=
2: I(u n 1U)12 < c for all k ?:: no(c), m?:: 1.
n=k
By (23),
k+=
2:
I(un Au)un(x)1 ~ A . c ~ canst· c,
1
n=k
for all k ?:: no(c), m ?:: 1, and x E [a, bj. This proves the absolute and
uniform convergence of the series L::'=l(Un 1 Au)un(x) on [a,bj. D
(iii) If h E C[a, b], then each solution u of (24) is continuous on [a, b].
This follows from Theorem 4.B along with Lemma 3. Observe that equa-
tion (24) can be written in the following form:
Au - AU = b, u E X, A E lR,
u(x) = fJ LT
' Q(X, y)u(y)dy, (26)
where
(7I"-Y)X
{
if 0 ::; x ::; y ::; 11"
Q(x,y):= (7I"~x)y
if 0 ::; y < x ::; 11".
Observe the following:
The Green function Q is continuous and symmetric on [0,11"] X [0,11"].
We set X := L 2 (0, 11") and
Lemma 1.
r
= Jo u'v'dx = uv'l~ - Jor uv" dx
(27)
Hence fJ > O.
4.5 Applications to Boundary-Eigenvalue Problems 247
Proposition 2.
(i) The original problem (25) has precisely the eigenvalues
n = 1,2, ... ,
which are simple.
(ii) The normalized eigenfunction Un to f-tn with (un I un) = 1 is given by
n = 1,2, ....
Proof. Ad (i), (ii). By Lemma 1, each eigenvalue f-t of (25) is positive. The
general solution of the differential equation -u" = f-tu, f-t > 0, is given
through
u(x) = Csinf-t~x+Dcosf-t~x,
248 4. Eigenvalue Problems for Linear Compact Symmetric Operators
°
°
where C and D are real constants. From u(O) = we get D = 0. Moreover,
the second boundary condition u(7f) = along with U ¢. implies °
1
J-l2 = n, n = 1,2, ....
since !Jx is piecewise continuous and bounded on [0,7f] X [0,7f]. From the
symmetry condition !J(x,y) = !J(y,x) for all x,y E [0,7f], we get
Since u" E D(A), relation (28) remains true if we replace u with u". Using
u~ = -J-lnUn, we get
00 00
n=l n=l
The set CO'(O, 7r) is dense in L 2 (0, 7r). Let v E L 2 (0, 7r), and let c >
given. Then, there is a function u E CO'(O, 7r) such that
°be
1
-u"(x) = J.Lu(x) ,
(29*)
u(O) = u(7r) = 0,
Recall that (29*) has the following eigensolutions:
n = 1,2, ....
(i) If J.L is not an eigenvalue of (29*), then the original equation (29) has
a unique solution u.
(ii) If J.L is an eigenvalue of (29*), i.e., J.L = J.Ln for some n = 1,2, ... ,
then equation (29) has a solution u iff the so-called non-resonance
condition
10" f(x)un(x)dx =0
is satisfied.
By (31),
since g(x,y) = g(y,x) for all X,y E [a,b]. Therefore, (b I un) = 0 iff
(f I Un) = O.
Observe that U E D(A). By (iii), there exists a function w E
span{ Ul, U2, .. .} such that
Hence
Problems
Let -00 < a < b < 00.
4.1. Integral equations with degenerate kernels. Consider the integral equa-
tion
~b }((x,y)u(y)dy = AU(X) for all x E [a, b]. (32)
Suppose that
N
}((x,y):= "Lh(x)gj(y) for all x, y E [a, b],
j=l
where the nonzero functions h,gj: [a, b]-+ lR are continuous for all j.
Compute the eigenvalues and eigenfunctions of (32). Consider first the
special case where N = 1.
(i) Q is the Green function to (33), i.e., if we set v(x):= Q(x,y) for fixed
y E [a, b], then v is a solution of (33) with
4.3. A special problem. Compute the Green function to the following bound-
ary-value problem:
-u" = f on [0,1],
u(O) = u'(I) = O.
5
Self-Adjoint Operators,
the Friedrichs Extension,
and the Partial Differential
Equations of Mathematical
Physics
In the fall of 1926, the young John von Neumann (1903-1957) arrived
at Gottingen to take up his duties as Hilbert's assistant. These were
the hectic years during which quantum mechanics was developing at
breakneck speed, with a new idea popping up every few weeks from
all over the horizon.
Jean Dieudonne
History of Functional Analysis, 1981
quadratic variational
problem (Chapter 2)
t
orthogonal projection
t
IRiesz theorem I
l
duality map
energetic space l
energetic extension
Hilbert-Schmidt theory
for symmetric compact
operators (Chapter 4)
(abstract Sobolev space)
l l
self-adjoint
l
eigenvalue problem
abstract Dirichlet problem _
Friedrichs extension and Fredholm alternative
l
elasticity l l
complete orthonormal
functional calculus for
system of eigenvectors
/ self-adjoi1operators (abstract Fourier series)
l l equation
semigroup one-parameter group
l
l
irreversible process
l
reversible process
one-parameter
unitary group
in nature in nature
l
quantum physics
FIGURE 5.1.
The simplest method for constructing such operator functions is the fol-
lowing. Suppose that the operator A has a
This yields
= ~)Un I Au)un = L
00 00
L F(An){Un I u)un.
00
F(A)u := (7)
n=l
L
00
L
00
=
Au = L An(Un I u)u n for all U E D(A), (10*)
n=l
where
= =
n=l n=l
The preceding considerations motivate the appearance of the Friedrichs
extension in a quite natural way. However, the general theory of the Fried-
richs extension is independent of Fourier series expansions. The basic idea is
the following. We are given a linear, symmetric, strongly monotone operator
B: D(B) ~ X ---7 X on the real Hilbert space X, i.e.,
A: D(A) ~ X ---7 X
iff
Au=Bu for all u E D(B) and D(B) S; D(A).
5.1 Extensions and Embeddings 261
l):, -1
B<:;;A
FIGURE 5.2.
(i) We say that the embedding "X <:;; Y" is continuous iff there exists
an operator
j:X -7 Y (12)
that is linear, continuous, and injective.
(ii) The embedding "X <;;;; Y" is called compact iff the operator j from
(12) is linear, compact, and injective.
Let X be a subset ofY, i.e., X <;;;; Y. Then we set j(u) := u for all u E X.
In terms of sequences, we have the following:
un -7 U in X implies un -7 U in Y.
Proof. Ad (i). We are given u E C(G). Let j(u) denote the element of
L2(G) that corresponds to u. Then
Ilj(u)IIL2(G) = 2 2 ~Ea5Iu(x)1
~ constllullc(G)'
Since u and v are continuous, this implies u(x) = v(x) for all x E G.
Ad (ii). This is the famous Rellich embedding theorem, which will be
proved in Section 5.7. 0
Definition 5. Let X, Y, and Z be linear spaces over lK, and let A: D(A) S;;;
X ----+ Y and B: D(B) S;;; X ----+ Y be linear operators. For each a E lK, we
define the operators
and
(A+ B)u := Au + Bu for all u E D(A) n D(B),
i.e., D(aA) := D(A) and D(A + B) := D(A) n D(B).
Let C: D(C) S;;; Y ----+ Z be a linear operator. We set
The following examples show that, roughly speaking, we have the follow-
ing situation:
v E D(A*)
We have to show that this definition makes sense. In fact, suppose that
relation (14) also holds if we replace w with WI. Then
Proposition 2. Let A: D(A) <;;; X ---+ X and B: D(B) <;;; X ---+ X be linear
operators, where D(A) and D(B) are dense in the Hilbert space X over lK.
Then, the following hold true:
Proof. Ad (i). Let aI, a2 E K If (Au I Vj) = (u I Wj) for all u E D(A),
j= 1,2, then
(Au I alvl + a2v2) = al(Au I vd + a2(Au I V2)
= (u I alWl + a2w2) for all u E D(A).
that (Au I v) = (u I B*v) for all u E D(A), and hence A*v = B*v for all
v E D(B*). This implies B* C;;; A*. 0
(i) A is called symmetric iff A C;;; A*, i.e., (Au I v) = (u I Av) for all
u,v E D(A).
A*:X--7X
5.2 Self-Adjoint Operators 265
A*v = w,
(iii) If A is symmetric, i.e., A(x, y) = A(y, x) for all x, y E [a, b], then the
operator A: X ~ X is sell-adjoint.
266 5. Self-Adjoint Operators, the Friedrichs Extension, etc.
As in the proof of Lemma 3(c) in Section 4.4, it follows from the Tonelli
theorem that
(Au I v) = lb (l b
A(x, Y)U(Y)dY ) v(x)dx
and
A = -A* {:::> aiA = (aiA)*. o
Corollary 7.
A~8
To prove this we will use the same arguments as in the proof of Propo-
sition 7 from Section 2.2. To this end, we set
Vn(X):=
JRr(/J1. (x -
n
y)v(y)dy, n= 1,2, ...
(see (115) in Chapter 2). Then, Vn E CO'(JR)c for all n, and we get Vn ---- v
in L~(JR) as n - t 00.
Since for fixed x E JR, the function y 1-+ (/J1. (x - y) belongs to the space
CO'(JR), it follows from the definition of the generalized derivative that
Hence
(Av I u) = ,-(v I Au) for all u, v E D(A).
Step 3: We prove that the operator A is skew-adjoint. In fact, it follows
from
(Av I u) = -(v I w) for all v E D(A) and fixed u, w E X
that
l (v') udx = - l vwdx for all v E CO' (JR)c.
The following example shows that the notion of the generalized derivative
is quite natural from the operator theory viewpoint.
Au=w.
Hence M is symmetric.
Moreover, it follows from
that
o Pu
FIGURE 5.3.
Pu:=v
(i) The orthogonal projection P: X --+ M from X onto the closed linear
subspace M of X is linear, continuous, and self-adjoint and p 2 = P.
If M =1= {O}, then IIPII = 1.
(ii) Conversely, let P: X --+ X be a linear continuous self-adjoint operator
with p 2 = P. Then, P is the orthogonal projection from X onto the
closed linear subspace P(X).
This implies
U = lim Un = n-+oo
lim PUn = Pu,
n~oo
that u = o.
Consequently, the inverse operators A-I and (A*)-I exist. Since
D(A- I ) = R(A) and R(A) is dense in X, the adjoint operator (A-I)*
exists.
272 5. Self-Adjoint Operators, the Friedrichs Extension, etc.
and
un - u and AUn - v in X
imply u E D(A) and Au = v (cf. Problem 5.3).
Proof. Letting n - 00, it follows from
Proof. (i) ::::} (ii). It follows from (Uv I Uw) = (v I w) for all v, wE X that
(i) {:} (iv). Observe that the inner product can be expressed by norms
(cf.. Problem 2.2). For example, if X is a real Hilbert space, then
for all u,v E X.
Let (. I .) and 11·11 denote the inner product and the norm on X, respec-
tively.
Let us also introduce the eneryetic inner product
where (un) and (v n ) are admissible for u and v, respectively. We shall show
below that this limit exists and is independent of the chosen admissible
sequences.
(i) The energetic space X E becomes a real Hilbert space with respect to
the energetic inner product (. I ·)E. The set D(B) is dense in X E .
for all u E X E .
(iii) There exists a continuous embedding "X <:; ; X'E" given by the operator
j: X --+ X'E, where
where we set
IluIIE = n->oo
lim IlvnllE.
In fact, since the sequence (un -Vn ) is admissible for w = 0, we obtain that
as n ---+ 00,
by Step 1.
Step 3: For each Un, Vn E D(B), we have the identity
lim
n-+oo
Ilu - unllE = O.
In fact, since (Un) is Cauchy with respect to II· liE,
for all n, m :::: no(c:).
For each fixed m, the sequence (un - um) is admissible for U - Um. Letting
n ---t 00, we get
by Step 2.
Step 5: Let (un) be admissible for U E X E . By (28),
for all n.
as n ---t 00.
5.3 The Energetic Space 277
Therefore,
as n -+ 00.
This is (34).
Ad (iii). Let 1 E x. Set
By (33),
Hence j(f) E X'E and Ilj(f)llxi; :::; c-! 11111. Thus, the operator
j:X -+ X'E
Hence B is symmetric.
For all u, v E D(B), integration by parts yields
N
Ilun - umll~ = L IIDjUn - Djuml1 2 ,
j=1
that
for all W E Cff(G).
o
Hence Vj = DjU in the generalized sense. By (36) and (37), U EW§(G).
o
Conversely, let U EW§(G). Then there exists a sequence (un) in D(B)
such that (36) and (37) hold true with Vj := DjU. Thus, (un) is an admissible
sequence for u, and hence U E X E .
Ad (iii). Let u, v E X E , and let (un) and (v n ) be an admissible sequence
for U and v, respectively. Letting n ---+ 00, it follows from
5.4 The Energetic Extension 279
for all n
that
Proof. Let u E D(B) be given. It follows from (29) and (38) that
Hence B EU = Bu. o
280 5. Self-Adjoint Operators, the Friedrichs Extension, etc.
B <;;;A<;;; BE.
(i) The operator A: D(A) <;;; X ---- X is self-adjoint and bijective, and
(iii) If, in addition, the embedding X E <;;; X is compact, then A-I: X ---- X
is compact.
by (33).
The operator Bi/: Xi: -+ X E is linear and continuous. Since the embed-
ding X ~ Xi: is continuous, the restriction
Bi/: X -+ XE
is also continuous. 1 This operator is identical to A-I. Hence the operator
(39)
is linear and continuous. In turn, since the embedding XE ~ X is contin-
uous, the operator
(40)
is linear and continuous. Moreover, if the embedding X E ~ X is compact,
then it follows from (39) that the operator (40) is compact.
Let f,g E X. By (29) and (38),
(A-If I A- 1 g)E = (B E (A- 1 j),A- 1 g)E = (f,A- 1 g)E
= (f I A- 1 g).
Since (A-l f I A-lg)E = (A-l g I A-I j)E, we get
for all f,g E X.
and
Tl(Au I u) - (f I u) = min!, u E D(A). (41*)
Proof. Ad (41). By (29) and (38), it follows that, for all u E XE,
(f I u) = (f,U)E = (AUO,U)E
= (BEUO, u) E = (uo I U)E.
Hence
uEX E ·
Because of (Av I v) 2': cllvl1 2 for all v E D(A), this problem has the unique
solution u = uo. 0
where the operator B satisfies condition (H). Let A be the Friedrichs exten-
sion of B. Note that D(B) ~ D(A) ~ X and Bu = Au for all u E D(B).
Consequently, each solution of (42) is also a solution to the following equa-
tion:
Au=j, u E D(A). (43)
Let us also consider the problem
Proof. By Theorem 5.A and Proposition 2, problems (43) and (43**) have
the unique solution uo.
Equation (43*) is identical to
(Av I u) = (f I v) for fixed u E X E and all v E D(B).
Since (Av I u) = (BEV,U}E = (v I U)E and (v I U)E = (u I V)E for all
U E XE and v E D(B), equation (43*) is equivalent to
(HI) Let X be a real separable Hilbert space with dim X = 00. We are
given the operator B: D(B) ~ X -; X as in (H), Le., B is linear and
symmetric, along with (Bu I u) ~ cllul1 2 for all u E D(B) and fixed
c> O. Let
A:D(A) ~ X -; X
be the Friedrichs extension of B. In addition, we assume that the
embedding XE ~ X is compact.
284 5. Self-Adjoint Operators, the Friedrichs Extension, etc.
(iii) All the eigenvalues JLn have finite multiplicity. Furthermore, we have
o < c S JL1 S JL2 S ... and JLn ----+ +00 as n ----+ 00.
Proof. According to Theorem 5.A, (Au I u) ;::: cllul1 2 for all u E D(A). If
u is a solution of (44*) with f = 0 and u i- 0, then
(flv)=O
for all eigenvectors v of A corresponding to JL.
UEX, (46)
5.6. Applications to Boundary-Eigenvalue Problems 285
where A := f-l- 1 .
Ad (i). This follows from Theorem 4.B in Section 4.3 applied to A -1.
Ad (ii). By Theorem 4.B, equation (46) has a solution u iff
(47)
Let G be a nonempty bounded open set in JRN. For f-l = 0, (48) is called
the Poisson equation.
This means that equation (48) is satisfied in the sense of the theory of
generalized functions. Formally, we obtain (48*) by multiplying (48) with
v and subsequent integration by parts.
(i) The generalized problem (48*) has a countable system {un' f-ln} of
eigensolutions that contain all the possible eigenvalues.
286 5. Self-Adjoint Operators, the Friedrichs Extension, etc.
(u I V)E = 1 N
'LojuOjvdx
G j=1
for all u, v E xE .
Let A be the Friedrichs extension of B. Then, problem (48*) corresponds
to (44**). The assertion follows now from Theorem 5.C. 0
for all u, v E X E .
This means
ifVdX = 0
o
for all eigenfunctions v EW~(G) corresponding to p,.
(49)
Proof. Problems (48*) and (49) correspond to (43*) and (43**), respec-
tively. Thus, the assertion follows immediately from Theorem 5.B. 0
is compact.
This proposition was proved by Rellich in 1930. Our proof will be based
on the following special Poincare inequality:
288 5. Self-Adjoint Operators, the Friedrichs Extension, etc.
u(y) - u(x) = l Y
u'(t)dt for all x, y E C.
u(x) - u(y) = it; Ut;(t, {3)dt + iT/ UT/(e, t)dt for all x, y E C.
Applying the integral fcxc'" dxdy and observing that fcxc dxdy = 1, we
get
This is (51).
Step 3: The proof proceeds analogously for ]R.N with N 2 3. 0
is continuous because
o
Since the set C8"(G) is dense in W~(G), the compactness of j follows from
the compactness of the operator
(52)
by the extension principle from Section 3.6.
Let 8 be an open ball in ]R. N such that G ~ 8. Since each function
u E C8"(G) vanishes on a compact subset of G, we get C8"(G) ~ C8"(8),
and the compactness of the operator
(53)
implies the compactness of j from (52). Consequently, Proposition 1 follows
from Lemma 3, given next. 0
Lemma 3. Let 8 be a closed ball in ]R.N. Set j (u) := u. Then, the operator
j from (53) is compact.
Proof of Lemma 3 for N = 1. Let 8 := Ja, b[, where -00 < a < b< 00.
We are given u E C8"(8). Then,
u(x) = l x
u'(t)dt for all x E [a, bJ.
290 5. Self-Adjoint Operators, the Friedrichs Extension, etc.
where
(l
1
Ilulll,2 := b
(u 2 + U I2 )dX) "2
as n --+ 00.
Define
M:= {u E OO'(B): Ilulll,2 :::; I}.
We want to prove that
Since the operator j is linear, j(aM) = aj(M) for each a > O. Thus,
assertion (A) implies that j sends bounded sets from W~(B) n Oo(B) to
relatively compact sets in L2(B). Consequently, (A) implies the assertion
of Lemma 3.
By Proposition 10 in Section 1.11, the set j(M) is relatively compact iff,
for each c > 0, the set j(M) has a finite c-net. Therefore, it remains to
prove the following:
5.7. The Poincare Inequality 291
(a) (b)
FIGURE 5.4.
(B) For each c > 0, there exist functions Ui, ... , U r EM such that
Step 1: Boundary strip. We first prove the following: For each () > 0,
there exists an open subset 1i of the open ball B such that R <;;; Band
r
Jf3 - H
u 2 dx < () (57)
(cf. Figure 5.4(a)). To this end, we choose a local (/1, ()-coordinate system
as pictured in Figure 5.4(b). More precisely, we assume that the boundary
8B has a local representation of the form
( = g(/1), /1 E J,
{3
is,
r u2dx::::: c is{3r (2{32u2 + 2u2)dx
::::: c . constllulli,2'
There exists a number n independent of c such that n local boundary
strips cover a boundary strip of the ball 8. Thus, choosing the number c
sufficiently small, we obtain (57).
Step 2: The inequality of Poincare on H. We choose closed cubes Cl , ... ,
Cs of edge length 2R that cover the set H such that Cj <:;;; 8 for all j. Let
s
1
We set
where Yj := udx.
Cj
Relation (58) yields
for all j. Thus, the set F(M) IS bounded in ]Rs and is hence relatively
compact. Consequently, for each 'f) > 0, the set F(M) has a finite 'f)-set, i.e.,
there exist functions Ul, ... ,Ur E M such that
min IF(u) - F(Uk)1 < 'f) for all u EM.
l~k~r
From (57) and (59) along with (u - Uk)2 ::::: 2u 2 + 2u~, we get the key
formula
r (u - Uk)2dx is-f{
is
r (u - Uk)2dx + if{r (u - Uk)2dx
=
(i) U E D(A).
(ii) {::> (iii). This equivalence represents the convergence criterion for
Fourier series (Proposition 5 in Section 3.1).
(ii) => (i). We construct the linear operator C: D(C) ~ X ---. X through
Proposition 2. Assume (H). Then, for each real function F: JR --+ JR, the
operator F(A) is self-adjoint.
n n
Hence U E D(G). o
With a view to applications in the next section, we now consider functions
of the form
n
5.8 Functions of Self-Adjoint Operators 295
depending on a real parameter t, which will play the role of time. Set
n
In order to justify this we need the following two majorant conditions:
and
where dn := sup 1Ft (An' t)l. (65)
n tEJ
(i) Assume (64) for u EX. If t f--t F(A, t) is continuous on J for each
A E IR, then t f--t x(t) is continuous on J.
(ii) Assume (64) and (65) for fixed u E X. 1ft f--t F(A, t) is continuously
differentiable on J for each A E IR, then t f--t x(t) is continuously
differentiable on J and relation (63) holds true for all t E J.
Proof. Ad (i). We set Fn(t) := F(An' t) and an := (un I u). Since la+bl 2 ::::;
21al 2 + 21bl 2 for all a, b E lK, we get
~ := IIF(A, t)u - F(A, s)u11 2 = L I(Fn(t) - Fn(s))an I2
n
Let E: > 0 be given. By (64), there is a number N such that 2 2:n>N Ic n a n l 2 <
E:. Since Fn is continuous on J, this implies
~ < E:+E:,
=
Fn(S~ ~n(t) _ F~(t) = F~(t + 1J(s - t)) - F~(t), 0< 1J < 1,
296 5. Self-Adjoint Operators, the Friedrichs Extension, etc.
Proof. By Theorems 5.A and 5.C, the operator A satisfies condition (H),
where An ~ C > 0 for all n. Then
Observe that U E D(A) iffLnA;I(un I U)12 < 00. Since 0 < An ~ const·A;
for all n,
D(A) S;;; D(A!).
In addition, note that D(B) S;;; D(A). Furthermore, it follows from (60) and
(68) that
n n
By (69*),
for all U E Y. (70)
We want to show that Y becomes a real Hilbert space equipped with the
inner product (. I .)y.
In fact, if Ilully = 0, then u = 0, by (70). Now let (un) be a Cauchy
sequence in Y. Then, (A~un) is a Cauchy sequence in X. Hence
It follows from (70) that (un) is also a Cauchy sequence in X, and hence
Um := ~)Un I u)un.
n=l
along with
as n ---> 00.
298 5. Self-Adjoint Operators, the Friedrichs Extension, etc.
Hence u = w, i.e., u E Y.
Step 4: Let us prove (67), i.e.,
Ifu,v E X E , then there exist admissible sequences (un) and (v n ) for u and
v, respectively. Step 3 yields
Un -+ u and Vn -+ v in Y as n -+ 00.
By (69), (un I Vn)E = (Un I vn)y for all n. Letting n -+ 00, we get (72).
Step 5: Since X E is a Hilbert space, X E is closed with respect to the
norm II ·IIE. By Step 4, IluliE = Ilully for all u E X E . Thus, X E is a closed
linear subspace of Y. Since D(A) ~ X E ~ Y and D(A) is dense in Y, we
get X E = Y. 0
Then
(i) S+ is called strongly continuous iff the function u: [0, oo[ -+ IR is con-
tinuous for each Uo EX.
5.9. Semigroups, One-Parameter Groups, and Their Physical Relevance 299
(iii) S+ is called linear iff 8(t): X ---- X is linear and continuous for all
tER
(iii) S is called uniformly continuous iff S is linear and the function t 1---+
A:D(A) ~ X -+ X
by the extension principle from Section 3.6. Such linear operators are called
unbounded.
This is the group property (77). Statement (iii) follows from Proposition 3
in Section 1.24. In particular, we get u'(O) = Auo for each Uo E X, i.e., the
operator A is the generator of {S(t)}.
Furthermore,
Hence Ile hA - III :::; IlhA11 + Ilh~1I2 + ... :::; ellhAl1 - 1. For each t E JR, this
implies
( L
m
n=O
(iA)n ) (
-,-u I v = u
n.
IL
n=O
m (-iA)n )
n.
,v for all u, vEX.
t = time.
302 5. Self-Adjoint Operators, the Friedrichs Extension, etc.
(C) Strong causality. The state of the system u(to) at a fixed time to
determines uniquely all the states of the system in the future t > to
and in the past t < to.
(H) Homogeneity in time. 1ft f--+ u(t) is a possible process of the physical
system, then the process t f--+ u(t + to) is also possible for each fixed
to E JR.
(R) Reversibility. If t f--+ u(t) is a possible process of the system, then the
reverse process t f--+ u( -t) is also possible.
(a) (b)
FIGURE 5.5.
u(t + to) = S(t)u(t o ) for all t:::=:O and fixed to :::=: O. (85)
(C) Causality. The state of the system at time to :::=: 0 uniquely determines
all the states u( t) of the system in the future t > to.
(H) Homogeneity in time. If t f--+ u(t) is a possible process for all t :::=: 0,
then so is the process t f--+ u( t + to) for all t:::=:O and fixed to :::=: O.
K = mx", (86)
304 5. Self-Adjoint Operators, the Friedrichs Extension, etc.
(a)
o
:lJ
• ••
K -
(b)
e
~K
FIGURE 5.6.
where x = ue and K = Ke. Here, e denotes a unit vector (cf. Figure 5.6).
For small lxi, the Taylor expansion yields
u' =v,
(87*)
v' = -Au, u(O) = uo, v(O) = va.
where we set w := (u, v) and Aw := (v, -Au). The space X := JR.2 with
the inner product
u(t) = + C- 1 (sintC)vo,
(costC)uo
v(t) = -C(sintC)uo + (costC)vo,
5.10 Applications to the Heat Equation 305
E := 2- 1 mv 2 + 2- 1 mAu 2
of the harmonic oscillator. We want to show that
The energy E is constant along each possible motion of the harmonic
oscillator.
In fact, it follows from (87*) that
In Section 5.11 we shall show that the same argument applies to the wave
equation. Then, the original equation (87) represents the wave equation,
where the self-adjoint operator A is the Friedrichs extension of -~.
Theorem 5.E (The abstract heat equation). For each given initial value
Uo E D(A), the original equation (89) has a unique CI-solution u: [0, oo[ -
X. This solution is given by
for all t ~ O. (90)
By Standard Example 4 in Section 5.3, conditions (HI) and (H2) are sat-
isfied with the energetic space
Theorem 5.E tells us that for each given initial temperature Uo E L2(G) the
original classic problem (92) has a uniquely determined mild (generalized)
solution in the sense of (90).
The semigroup property of {e- tA } reflects the fact that
Heat conduction represents an irreversible process in nature.
d
dt (w(t) I w(t)) = 2(w'(t) I w(t)) = -2(Aw(t) I w(t)) :::; 0
for all t E lR+ that (w(t) I w(t)) = 0 for all t E lR+, and hence w(t) = 0 for
all t E lR+.
The semigroup. By Theorem 5.C, the operator A has a complete or-
thonormal system {u }n> 1 of eigenvectors with
for all n = 1, 2, . .. .
n n
Hence
n
= Le-(t+S)An(Un I u)un = e-(t+s)A U.
n
We now set
for all t E lR+.
For all Uo E X and t E lR+, we have the decisive majorant condition
n n
n n
exists for each t E lR+, and the function u' is continuous on lR+. Further-
more,
where W E D(C) iff this limit exists. We want to show that C = -A. In
fact, differentiation of the relation
This is equal to
E(t) = Tl{llu'(t)112 + Ilu(t)II~}. (98)
Theorem 5.F (The abstract wave equation). Assume (HI) and (H2).
Then, for given initial values
the original problem (94) has a unique classic solution. This solution is
given by (95).
The energy E(t) is constant along this solution.
However, we will show that the operator S(t) is still defined for all
3General product spaces will be studied in Section 3.6 of AMS Vol. 109.
5.11 Applications to the Wave Equation 311
The proof will be given later. The original equation (94) describes wave
processes. Corollary 3 reflects the fact that these wave processes are re-
versible.
(Lt,(8
1
Thus, we may apply the results to (101). In particular, for given initial
values
and
we get a uniquely determined mild (generalized) solution for (101). An
application to the vibrating string will be considered in the next section.
According to (98), the energy of the wave process at time t is given
through
Proof of Theorem 5.F. We will use the functional calculus from Section
5.8. In fact, the assertions of Theorem 5.F follow by means of simple formal
312 5. Self-Adjoint Operators, the Friedrichs Extension, etc.
This implies
n n
(105)
n n
and If.Lnl ::; constlf.LnI 2. Thus, using (103), the necessary majorant conditions
from Proposition 3 in Section 5.8 are satisfied. This justifies the preceding
formal differentiation. Hence the function u: IR ---+ X from (104) is C 2 and
u(O) = Ua, u'(O) = Va.
4For brevity of notation, we write cos f.Lnt( Un I uo) instead of (cos J1n t )(Un I uo),
and so on.
314 5. Self-Adjoint Operators, the Friedrichs Extension, etc.
U(t) =L OnUn,
n
where
On := (Un I u(t)) = (Un I uo) COSJ1.nt + (Un I VO)J1.;;:l sinJ1.nt,
by (104). To prove that u(t) E D(A), we need
However, this follows from (103) along with la + W ::; lal 2 + IW for all a, bE C.
5.12. Applications to the Vibrating String 315
U=U(X,t)
~--------------*-------X ~------~~----*-------X
FIGURE 5.7.
The proof ahead shows that, under the assumption (ii), series (109) rep-
resents the mild (generalized) solution of the original problem (108), in the
sense of Theorem 5.F in Section 5.11.
Remark 2 (The classic Fourier method). Let us recall the famous classic
motivation of the ansatz (109). First we are looking for special solutions of
the string equation of the form
u(x, t) = ¢(x)7jJ(t).
If u satisfies the equation Utt - U xx = 0, then
¢(x)7jJ"(t) - ¢"(x)7jJ(t) = O. (110)
Suppose that there exist points Xo and to such that ¢(xo) =I 0 and 7jJ(to) =I
O. Then, the boundary condition "u(O, t) = u(7r, t) = 0 for all times t E ~"
implies ¢(O) = ¢(7r) = O. Letting either t = to or x = Xo in (110), we obtain
the following boundary-eigenvalue problem:
¢"(x) = -A¢(X), o < x < 7r, AE ~,
(111)
¢(O) = ¢(7r) = 0,
along with the differential equation
Thus, multiplying equation (113*) by um(x) and integrating over the in-
terval [0,11"], we get
m=1,2, ....
where
Hence
b~(t) = 17'> Un(X)Ut(X, t)dx.
Let n = 1,2, .... Integration by parts yields
00
00
n= 1,2, ... ,
we obtain the boundary relation u(O, t) = u(n, t) = 0 for all t E JR. Since
n= 1,2, ... ,
Utt - Uxx = 0
That is,
(Un I Bv) = An(Un I v) for all v E D(B).
By (44**), this means that
n = 1,2, ....
By Proposition 2 in Section 4.5, the orthonormal system {un} is complete in
L2(0, 7r), by Section 4.1. Thus, the self-adjoint operator A has no eigenval-
ues different from {An}. According to (104), the mild (generalized) solution
to the original problem (108) is given through
u(t) =
n=l
o
1
where f.Ln := A~ = n. This is precisely the series from (109).
v = Ut(xo, t).
Thus, we obtain
for the kinetic energy of P. Finally, we assume that the potential energy
Epot(P) of P is proportional to the extension of P, i.e.,
£pot(P) = a(.6.s - .6.x), where a = const > O.
5.12. Applications to the Vibrating String 321
Then, the total kinetic energy Ekin and the total potential energy Epot of
the string are given by summing over all small pieces of the string, i.e.,
Observe that
D.s = vII + ux(x, t)2 D.x.
More precisely, the final definition of Ekin and Epot at time t will be
based on the following integrals:
Suppose that the deflection of the string is small, i.e., luxl is small. By
Taylor series expansion,
Hence the total energy E(t) of the string at time t is s, given approximately
through
Here, we vary over those states of the system that are fixed at both the
initial time to and the terminal time tl and those that satisfy the boundary
322 5. Self-Adjoint Operators, the Friedrichs Extension, etc.
L-----------------------------.x
FIGURE 5.8.
conditions. For the string with fixed end points, this means the following
for the action A( u):
A( u) := TI p 1: (1'' (
1
U; - c2 U;)dX) dt = stationary! , (116a)
where
u(O, t) = U(1T, t) =
u(x, to) = fixed
° for all t E [to, tIl
(boundary condition),
for all x E [0,1T]
(initial condition),
u(x, td = fixed for all x E [0,71']
(terminal condition).
(116b)
The meaning of "stationary!" will be explained in (117*). Define
o := la, 71'[ x lto, h [.
Let U = u(x, t) be a sufficiently smooth solution of (116). We set
w(x, t) := u(x, t) + TV(X, t),
where T is a real number. The sufficiently smooth function v is called ad-
missible iff
V=o on8W.
This guarantees that both wand u satisfy the same side conditions (116b).
Set
¢>V(T) := A(u + TV).
By definition, A is stationary at u iff
10 (Utt - c u
2 xx )vdxdt = ° for all v E Co(O).
5.13 Applications to the SchrOdinger Equation 323
u u
---.~ c
--~----~--+---~-------x
FIGURE 5.9.
is a solution of the string equation (117) for all (x, t) E 1R2 . Here, x r-t
a(x - ct) corresponds to a wave that moves with velocity c from left to
right (cf. Figure 5.9), whereas x r-t b(x + ct) moves with velocity c from
right to left. In each textbook on partial differential equations one proves
that (118) represents the most general C 2-so1ution of the string equation
(117). This justifies the designation "one-dimensional wave" equation for
the string equation.
For each Uo E X, the continuous function u: JR. ---> X from (120) is called
a mild solution of (119).
d
-d (S(t)uo) = lim S(t)h-l(S(h) - I)uo = S(t)Cuo,
t h--+O
since {Set)} is strongly continuous. On the other hand, it follows from (121)
that
lim h-1(S(h) - J)S(t)uo = S(t)Cuo.
h--+O
d
ds Set - s)v(s) = -Set - s)Cv(s) + Set - s)v'(s)
(123)
= -Set - s)Cv(s) + Set - s)Cv(s) = 0
is a solution of (122), and hence wet) = u(t) for all t E R This implies
(T(t) - S(t»uo = 0
for all Uo from the dense subset D(A) of the Hilbert space X. By the
extension principle from Section 3.6,
This proves that the operator C cannot generate two different one-para-
meter unitary groups.
Obviously,
Moreover,
~2 -t Set - s)v'(s) as h -t O.
for all n.
326 5. Self-Adjoint Operators, the Friedrichs Extension, etc.
where u E D(e- itA ) iff Ln le-itAn(Un I uW < 00. Since lei'" I = 1 for all
real numbers 0:, we obtain D(e- itA ) = X. Moreover,
n n
(125)
As in the proof of Theorem 5.E, it follows from
that
for all t, s E R
That is, {e- itA } represents a linear one-parameter group. Hence the oper-
ator S(t): X -+ X is bijective, by Section 5.9. Moreover, relation (125) tells
us that S(t) is unitary on X for each t.
Set
for all t E R
First let Uo E X. According to (124), it follows from the majorant criterion
(Proposition 3 in Section 5.8) that u = u(t) is continuous on R Thus,
{e- itA } is strongly continuous. Summarizing, we obtain that {e- itA } is a
one-parameter unitary group.
Now let Uo E D(A). Formal differentiation of (124) with u = Uo yields
00
L le- iAnt .An (un I uo)1 2 ::; L l.An(un I uo)12 < 00, (127)
n=l n=l
formula (126) holds true for all t E lEt, and u' is continuous on R By (124)
with u = Uo and (127), we get u(t) E D(A) for all t E lEt and
00
Hence
u'(t) = -iAu(t) for all t E JEt. (128)
5.14 Applications to Quantum Mechanics 327
in the sense of Remark 4 in the upcoming Section 5.14, and the proof
proceeds analogously to Case 1. The details can be found in Zeidler (1986),
Vol. 2A, p. 186. 0
81t was generally believed until 1952 that there exists a one-to-one correspon-
dence between "states" and "physical states." However, in quantum field theory
328 5. Self-Adjoint Operators, the Friedrichs Extension, etc.
t E JR, (129)
there exist so-called supers election rules, e.g., for charge and baryon number.
These superselection rules say that there exist "states" (resp., "observables")
that do not correspond to "physical states" (resp., "physical quantities"). For
example, suppose that the two states 'lj;l and 'lj;2 correspond to a charged par-
ticle, with the charge el and e2, respectively, where el ::I e2. Then, the state
Cil 'lj;l + Ci2'1j;2 with Cil ::I 0 and Ci2 ::I 0 does not correspond to a "physical state."
9To motivate the definition of ~A, assume that 'Ij; E D(A2), i.e., 'Ij; E D(A)
and A'Ij; E D(A). Then
This is the Schriidinger equation, where Ii := ;". and h denotes the Planck
quantum action. If we measure length, time, and mass in meter, second,
and kilogram, respectively, then
2
h = 6.626 . 10-34 kg ~.
s
Since the operator U(t) := e-¥ is unitary for each t E JR, we get
Then
A = ('IjJ I A'IjJ) = a:
and
LlA = IIA'IjJ - A'ljJ1l = O.
In this case we say that the observable A has the "sharp" value a: in the
eigenstate 'IjJ.
(130)
n
(131)
n
and
n n n
Hence
n
This is (131). Moreover,
If U:X ~ X is a unitary operator, then (U'IjJ I Ucp) = ('IjJ I cp), and hence
Then
LlALlB 2: ICI, (133)
where LlA and LlB correspond to the state 'l/J, and
((BA - AB)'l/J I 'l/J) = ((B - BI)(A - AI)'l/J I 'l/J) - ((A - AI)(B - BI)'l/J I 'l/J)
= ((B - BI)(A - AI)'l/J I 'l/J) - ('l/J I (B - BI)(A - AI)'l/J)
= 2i Im((B - BI)(A - AI)'l/J I'l/J).
With A:= ('l/J I A'l/J) and B =: ('l/J I B'l/J), the Schwarz inequality yields
1:
Explicitly, this means that
and
I:
This way it is possible to define functions of the operator A through
I:
for the given function F: lR. --t C. Explicitly, this means that
where
u E F(A) iff I: 1F(>'Wd(u I E)..u) < 00.
In addition,
I:
Moreover, we get D(F(A)*) = D(F(A)), and
I:
i.e.,
Using probability theory, the corresponding mean value A and the disper-
I:
sion (~A)2 are given through
I: (). -
A = )"'d('I/; I E)..'I/;), 'I/; E D(A),
(i) For each)", E JR, the operator E)..: X -+ X is linear, continuous, and
self-adjoint with E~ = E).., i.e., E).. is an orthogonal projection.
(ii) For each u E X, the function
is nondecreasing on JR.
Af---+(V I EAu)
is of bounded variation. That is, the integrals J ... d(v I EAu) from Remark
4 are to be understood as Stieltjes integrals (see the appendix).
I:
and v EX, we get
I:
Here, U E D(F(A)) iff
Thus, the functional calculus from Section 5.8 represents a special case of
the general functional calculus from Remark 4.
where u E D(A) iff u E X and J~oo Ixu(x)1 2dx < 00. By Example 10 in
Section 5.2, the operator A: D(A) ~ X ~ X is self-adjoint.
if x:::; A
(EW)(x) := { ~(x) if x> A,
for all u E X and each A E R
5.14 Applications to Quantum Mechanics 335
(ii) For all 'ljJ E X and -00 ::; a < b ::; 00,
(136)
In addition,
Thus, the operator E).,: X ----+ X is linear, continuous, and self-adjoint. Ob-
viously, E~ = E)., for all ,X E R
By (137), the function ,X f---+ (u I E).,u) is nondecreasing on R Let -00 <
,X < P, < 00. Then, for each u E X,
and hence
Similarly, we get
Let -00 ::; a < b ::; 00, and let F: JR. ----+ C be a measurable function. It
follows from (137) and from (5) in the appendix that
(138)
i: i:
For all u E D(A) and v E X,
In this connection, observe that u E D(A) iff J~oo ,X2Iu(x)1 2dx < 00. Thus,
the integral J~oo v('x)'xu('x)d'x exists, by the Schwarz inequality.
Ad (ii). Use (138) with F == 1. 0
336 5. Self-Adjoint Operators, the Friedrichs Extension, etc.
Here, x denotes the vector of position. Let us suppose that the force field
K = K(x) possesses a potential U, i.e.,
where p(t) := mx'(t) denotes the momentum vector at time t. Recall that
r~ is called the kinetic energy and U is called the potential energy of the
particle.
If x = x(t) is a solution to (139), then
in'I/Jt = --
n2 D..'I/J + U'I/J, (141)
2m
which Schrodinger formulated in 1926. We are looking for solutions 'I/J =
'I/J(x, t) of (141) such that
r I'I/J(x, tWdx =
JJR3
1 for all times t E R (142)
The function 'I/J describes the physical state of the particle. More precisely,
'I/J allows the following interpretation:
(i) Probability. Let G be a nonempty open set in 1R 3 . Then,
i.e., 'l/J(t) is a unit vector in X for each t. The differential operator 1-l:
D(1-l) ~ X ~ X given by D(1-l) := CO'(1R3 ) and
n2
1-lrp := - - D.rp + U(x)rp
2m
J~3
r (b.¢)'IjJ dx = r
J~3
¢(b.'IjJ)dx for all ¢, 'IjJ E D(H),
and hence
(¢ I H'IjJ) = r
J~3
¢H'ljJdx = r
J~3
(H¢)'ljJdx = (H¢ I 'IjJ) for all ¢,'IjJ E D(H),
provided the real function U = U(x) is sufficiently regular. Thus, the formal
Hamiltonian H is a linear symmetric operator on the Hilbert space X.
One of the main tasks of a rigorous mathematical approach to quantum
mechanics consists in extending the formal Hamiltonian H to a self-adjoint
operator H: D(H) <;;; X --+ X, which is called the Hamiltonian of the parti-
cle. Then, the spectrum of H corresponds to the possible energy values of
the particle. An important special case will be studied in the next section.
In Problem 5.10 we will show that for the electron of the hydrogen atom
the Hamiltonian H can be obtained as the Friedrichs extensiorl of H.
fi2 mw 2x 2
ifi'IjJt = - 2m 'ljJxx + --2- 'IjJ. (145)
p =}
. a
-zfi ax and E =}
a
ifi at· (146)
5.14 Applications to Quantum Mechanics 339
i:
We are looking for solutions 't/J = 't/J(x, t) of (145) with
X:= L~(lR)
i:
with the inner product
(¢ I 't/J) = ¢(x)'t/J(x)dx.
lb
a
1¢(x)1 2 dx:= probability of finding the particle
in the interval [a, b].
(148)
where D(1i) := S.
The space S has been introduced in Section 3.7. Recall from Section 3.4
that the Hermitean functions Un are defined through
where
1
an == nIl'
227r 4(n!)2
340 5. Self-Adjoint Operators, the Friedrichs Extension, etc.
Proposition 10.
Proof of Proposition 10. Ad (i). For all <p, 'lj; E S, integration by parts
yields
1 -1
i: i:
N
¢"'lj;dx =
00
00 lim ¢'(X)'lj;(X)I ¢''lj;'dx
-00 N ..... +oo -N-oo
=- ¢''lj;'dx = ¢'lj;"dx.
Hence
(<p 11i'lj;) = (1i<p I 'lj;) for all <p, 'lj; E D(1i).
2
Ad (ii). By (149), un(x) = e-"2 x polynomial (x). Hence we obtain
<Pn E S. A fairly simple computation yields (150).
5.14 Applications to Quantum Mechanics 341
Proposition 12.
(i) The Hamiltonian H: D(H) C;;; X -+ X is self-adjoint.
According to Example 1, we say that the particle has the sharp energy En
in the state ¢n'
Suppose that the particle is in the state ¢. By Example 2,
!(¢n ! ¢W = probability of having the sharp energy En·
Explicitly,
. d
(A¢)(x) := -zn dx ¢(x) for all x E lR
with (¢ I ¢) = 1.
(154)
and
5.15 Generalized Eigenfunctions 343
P = (¢ I A¢) and
Obviously,
AB¢ - BA¢ = ili(x¢' - (x¢)') = -ili¢.
Thus, it follows from Proposition 3 that
(156)
(i) If we localize sharply the particle (i.e., tlX is small), then the velocity
of the particle is highly uncertain (i.e., tlp is large).
(ii) Conversely, if we determine sharply the velocity of the particle (i.e.,
tlP is small), then the position of the particle is highly uncertain
(i.e., tlX is large).
lba
d(¢ I E>.¢) = probability of measuring the position x of
the particle in the interval [a, b],
. d ~ ~
-zli - e" = pe " for all x E lR. (157)
dx
344 5. Self-Adjoint Operators, the Friedrichs Extension, etc.
• • • • . .. Y-E Y Y+E
FIGURE 5.10.
and each fixed p E IE.. That is, the function ¢p(x) := e i~X is an eigenfunc-
lx
tion of the differential operator -in that corresponds to the momentum
operator A from Section 5.14.4. The point is that
The function ¢p does not live in the Hilbert space X = L~(IR).
In fact,
i: l¢p(x)1 2 dx = 00.
That is,
implies ¢ == o.
i:
Lemma 2. We set
(ii) The corresponding map 1/; f-+ T is linear and bijective from the space
X onto S'.
for all ¢ E S
Proof. Let A1/; = >"1/; with 1/; i- 0 and>" E R For each ¢ E S, it follows
from the symmetry of A that
ipx
where ¢p(x) := eT". By Standard Example 4 in Section 3.8, Tp E S'.
346 5. Self-Adjoint Operators, the Friedrichs Extension, etc.
Tp(A¢) = i:
= pTp(¢).
ili¢~(x) ¢(x)dx = i: p¢p(x) ¢(x)dx
-00
.
'px
e-"¢(x)dx =0 for all p E lR..
Using the Fourier transformation F:S -- S from Section 3.7, this implies
F¢ = 0, and hence ¢ = O. D
as c -- +0. D
5.16 Trace Class Operators 347
(i) The linear operator A: X -+ X is said to be of the trace class iff the
series
n
converges for each complete orthonormal system {v n } of X and the
value of the series is independent of {v n }. The number tr A is called
the trace of A.
(ii) The linear continuous operator A: X -+ X is called a Hilbert-Schmidt
operator iff A * A is of trace class.
for all n.
(i) If An 2: 0 for all nand Ln An < 00, then the operator A is of trace
class and
for all U E X.
n
Let (): > O. Since the sequence (An) is bounded,ll it follows that
for all U E X.
n
n n n m
n m
Since this is a convergent double series with nonnegative terms, the sum-
mation can be interchanged. Hence
n m n m m
Folklore
m
5.17 Applications to Quantum Statistics 349
and
(iii) Entropy. By definition, the entropy S of the statistical state \]I from
(162) is given through
That is, the time evolution of the state '¢mO is identical to the time evolution
of quantum states in Section 5.14.1.
12We assume tacitly that tPm E D(A) for all m. Furthermore, note that (~A)2
is the mean value of (A - AI? provided tPm E D(A2) for all m. In fact,
350 5. Self-Adjoint Operators, the Friedrichs Extension, etc.
for all m.
tr p = 2..::>m = 1.
m
A = tr(pA).
In fact,
m m m
Since 'lj;m(t) = U(t)'lj;mD and hence ('lj;m(t) I 'lj;) = ('lj;mo I U(t)*'lj;), we get
Noting that U(t) is unitary and hence U(t)* = U(t)-l, we obtain (164). 0
(165)
Pm ;= I:~=1 e(/J-N 7n -E7n )/kT·
Here, the positive numbers Em and the nonnegative integers N m are given.
A motivation for the choice of Pm will be given in Remark 8.
By Section 5.17.1,
M
£ = mean value of energy of r = L Pm(J1" T)Em'
m=l
M
N = mean value of the particle number of r = L Pm(J1" T)Nm.
m=l
This relates T and J1, to £ and N. In this connection, note that
M M
£ = tr(pH) = L ('lj;m 1 pH'lj;m) = L PmEm
m=l m=l
and
M
N = tr(pN) = L PmNm·
m=l
Since IIH'lj;m - £'lj;mll = IEm - £1 and IIN'lj;m -N'lj;mll = INm -Nml, we
obtain that
M
(~£)2 = dispersion of the energy of r= LPm(J1"T)IEm _£12;
m=l
M
(M)2 = dispersion of the particle number of r= LPm(J1"T)INm _NI2.
m=l
5.17 Applications to Quantum Statistics 353
Finally,
M
S = entropy of r = -k L Pm(P" T) lnpm(p" T).
m=l
L
M
Z(p" T) := e"N"t<:;.Em
m=l
is called the partition function of r, and
N=_8n
8p,'
m=l
M
=k L Pm In Pm = -So o
m=l
Definition 7. Suppose that for each m the energy Em and the particle
number N m depend on the volume V of the system r. Then, the pressure
P of r is defined through 13
and fixed mean particle number. That is, let us consider the following max-
imum problem:
M
entropy S(p) := -k L Pm lnpm = max!, (166)
m=l
along with the side conditions
M
£(p):= L PmEm = const,
m=l
M
N(p):= L PmNm = const, (167a)
m=l
M
W(p):= LPm = 1,
m=l
and
o :::;Pm:::; 1, m=l, ... ,M. (167b)
We are given the real numbers Em, N m , m = 1, ... , M, such that
E1
EM) =3.
rank ( ~1 .. , NM (168)
... 1
We are also given the mean energy £ and the mean particle number N of
the system r.
Suppose that P = (P1, ... ,PM) is a solution of (166), (167) with 0 < Pm <
1 for all m. By the Lagrange multiplier rule,14 there exist real parameters
a, f3, and 'Y such that
Hence
exEm +(3Nm
Pm = const e k
and p, = -f3T.
(170)
This means that nj particles of r have the energy C j, j = 1, ... , J. For each
such state 'lj;, the particle number N and the energy E are given through
J
and E = Lnjcj.
j=l
Thus, the partition function Z and the statistical potential 0 of r are equal
to
J
Z(p"T) = Lel'~TE = rrLe"ni~;i<i
r j=l nj
and
O(p"T) = -kTlnZ(p"T) = t
j=l
-kTlnL
nj
(e"~;jrj. (171)
Nj =e~. (173*)
(i) (A*)* = A;
Proof. Properties (i)-(iii) follow easily from the definition of the adjoint
operator. Let us prove (iv). Assume that X i- {O}. Since
A state w is called mixed iff there exist two different states WI and W2 of
Qt such that
Let us now define von Neumann algebras based on the algebraic relation
SJ3 = SJ3/1. (174)
Wt := w 0 <Pt
360 5. Self-Adjoint Operators, the Friedrichs Extension, etc.
for all A, B E ~ and for all continuous functions f: JR. ----+ JR. whose Fourier
transform belongs to CD (JR.).
Let
m=l, ... ,M,
then w is a state.
In fact, it follows from
M
w(A) = L Pm('ljJm I A'ljJm)
m=l
that
M
w(A* A) = L Pm('ljJm I A* A'ljJm)
m=l
M
= L Pm (A'ljJm I A'ljJm) ;::: 0 for all A E 21
m=l
and w(I) = 1.
The state w from (176) is mixed provided we have Pj > 0 and Pk > 0 for
two different indices j and k.
(iii) Dynamics. Define
.+. (A) itHA _itH for all A E 21 and all t E R (177)
'l't := e" e"
Proof. For simplifying notation, let us use such units of time that Ii = l.
Observing that tr(C) := L~=l Cl/Jm I C'lj;m) for all C E Qt, we get
tr(CD)=tr(DC) for all C,DEQt (179)
(cf. Problem 5.12). This implies (178). In fact, we have
w(C) = tr(pC) = Z- l tr(e-{3HC),
where Z:= tr e-{3H. By (179),
Variable particle number. Let us now consider the more general case
where J-l =f. O. Here, the state w from (176) corresponds to a physical state
where the particle number N m and the total energy Em are realized with
probability Pm. We expect again that this represents a thermodynamic
equilibrium state. Introducing
'" iHC",)t iHC",)t
cPt (A) := e-"-Ae--"- for all A E Qt,
where H(J-l) := H - J-lN, we obtain the following generalization of Proposi-
tion 9.
Proof. Replace H with H(J-l) and use the same argument as in the proof
of Proposition 9. D
5.19. The Fock Space in Quantum Field Theory 363
(i) bosons (i.e., particles with integer spin like photons) and
where AA means that the two electrons are both in state A, and so on.
In quantum statistics, the number of different states is of fundamental im-
portance. A different counting of states yields completely different physical
results. In fact, the two preceding principles were discovered by physicists
via quantum statistics.
The completely elementary proofs of the following propositions are left
to the reader as exercises.
Example 4. Let 1/Jn E Xn with II1/Jnll = 1 for fixed n = 0,1, .... Set
1/J := (0, ... ,0, 1/Jn, 0, ... ),
where '¢In stands at the nth place. Then, II'¢III = 1 and
N1/J = n1/J.
We say that the state '¢I corresponds to n identical bosons (e.g., n photons).
The state
0:= (1,0,0, ... )
is called the vacuum (or the ground state). Obviously, NO = 0.
and
(183)
(183) that '¢ remains unchanged under a permutation of h, ... , fm. This
reflects the principle of indistinguishability of identical particles. Observe
that
N'¢=m'¢.
We say that the operator b+ (f) creates one particle in the state f from the
vacuum. Furthermore, by Example 8,
Definition 10. The fermionic Pock space Y consists of all the sequences
('¢n)n=O,I, ... such that
00
n=O
Here, we assume that each function '¢n = '¢n(XI, ... ,xn ) is antisymmetric
with respect to all arguments Xl, ... ,Xn E lR.3 .
N('¢n) = (n'¢n)
in the following way. For all Xl, ... ,Xn E lR.3 and all n = 1,2, ... , let
n
(a+(f)'¢)n(XI, ... , xn) := n-~ ~) _1)j-1 f(xj)
j=l
5.19. The Fock Space in Quantum Field Theory 367
Furthermore,
and
(187)
Physical Interpretation 15. Let fI, ... , f m E VI with II fJ III = 1 for all
j. Then, we regard fJ as the state of one particle. Suppose that 'lj; =1= 0,
where
'lj; := ama+(fI)a+(h)··· a+Um)o'.
Choose am E C in such a way that 11'lj;11 = 1. Then, we regard 'lj; as a state
of m identical particles (fermions) in the states fI, ... , f m. It follows as in
(187) that 'lj; passes to 'lj; (resp., -'lj;) if we perform an even (resp., odd)
368 5. Self-Adjoint Operators, the Friedrichs Extension, etc.
Remark 16. Physicists use a heuristic machinery (e.g., the path integral)
in order to compute physical effects in particle accelerators with a high
accuracy. Unfortunately, there is no rigorous mathematical justification of
the arguments of physicists. It is one of the most important challenges of
mathematics to construct a rigorous quantum field theory which describes
realistic physical situations. Rigorous application of Fock spaces to quan-
tum statistics can be found in Bratteli and Robinson (1979), Vol. 2. Math-
ematical models of quantum field theory are studied rigorously in Glimm
and Jaffe (1981), and in Grosse (1995).
<
FIGURE 5.11.
lim
t->+oo
11'Ij!(t) - 'lj!o(t) I = 0 and
t-+-oo
lim 11'Ij!(t) - 'lj!o(t) II = O.
Let us also define the wave operators W±: D(W±) C X --+ X in the
following way. We set
(188)
is asymptotically free as t --+ +00 (resp., t --+ -(0) iff 'Ij!(0) E R(W+)
(resp., 'Ij!(0) E R(W_)).
More precisely, if 'Ij!(0) = W±'Ij!±, then
Proof. Since the operator e it"H : X --+ X is unitary for each t E lR,
370 5. Self-Adjoint Operators, the Friedrichs Extension, etc.
o
At the same time, relation (191) tells us the following.
Corollary 3. Let 7jJ(0) EX. Then, the motion (189) is asymptotically free
iff7jJ(O) E R(W+) nR(W_).
d
dt (2- 1 mx'(t? + U(x(t)) = [mxl/(t) - K(x(t))]x'(t) = 0,
and hence
x(t);:::
2E)1 t +
(~ 2 Xo for all t ;::: O.
5.20 A Look at Scattering Theory 371
--------E
• x .x
u u
(a) bound motion (E = energy) (b) asymptotically free motion
FIGURE 5.12.
the momentum operator p'l/J := -in'l/J', and HI'l/J := U'l/J. Let X := L~(lR)
along with D(H) := {'l/J E X: 'l/J','l/J" E X}. Here, the prime denotes the
derivative with respect to x. Assume (A). Then, the following hold true:
(ii) Let E(H) denote the linear hull of the eigenvectors of H. Then
and
but ¢p does not live in the Hilbert space X = L~(~). However, for each
p E~, ¢p is a generalized eigenfunction of Ho in the sense of Section 5.15.
In fact, if we set
for all ¢ E S,
p2
Tp(Ho¢) = 2m Tp(¢) for all ¢ E S and all pER
5.21. The Language of Physicists 373
"I think this is so," says Cicha, "in the fight for new insights, the
breaking brigades are marching in the front row. The vanguard that
does not look to left nor to right, but simply forges ahead-those
are the physicists. And behind them there are following the vari-
ous canteen men, all kinds of stretcher bearers, who clear the dead
bodies away or, simply put, get things in order. Well, those are the
mathematicians. "
and
(orthogonality relation). (195)
Uj = 2::(Vk I Uj)Vk,
k
and from
j j,k
we obtain
(198*)
k,j
noting that (au Iv) = a( U Iv) for all a E OC. This is (198).
5.21. The Language of Physicists 375
Physicists set
for all k E lR
and
L"':= IR.( ... dk.
k
and
and
(v I u) = L(v I k)(k I u) for all u,v E X. (202)
k
This coincides with (202). Thus, the orthogonality relation (200) guarantees
that the computation of (v I u) can be based on the multiplication of (vi
by lu).
for all x E R
(u I v) = l (k I u)(k I v)dk.
can be written as
and
u(x) = (x I u) = L
u(y)6(y - x)dx.
Hence (u I av) = a(u I v). However, the Dirac calculus used in all physics
text books forces the convention
The beauty and elegance of the Dirac calculus will become clear in the
next section. The relation between the general Dirac calculus and rigorous
mathematics (namely, the general spectral theorem due to von Neumann),
is discussed in Zeidler (1986), Vol. 5, Chapter 89.
ld
Hence
u(x, t)dx = mass on the interval [c, d] at time t.
Proposition 1. Let the bounded continuous function Uo: JR. - t JR. be given.
Then, the initial-value problem (209) has a unique classical bounded solu-
tion given by
1JOO
{ (41l"at)-2
(x_y)2
u(x, t):= -00 e-~uo(y)dy if x E JR., t > 0,
uo(x) if x E JR., t = O.
These N particles spread over the real line during the time interval [0, t].
Let M of them be in the small interval [x, x + Ax] at time t. By (P),
This corresponds to the partial mass density ~;: at the point x at time t.
The total mass in the interval [x,x+Ax] at time t is obtained by summing
5.22 The Euclidean Strategy in Quantum Physics 381
over all the contributions coming from all the possible intervals [y, y + Lly].
Letting Llx -+ 0 and Lly -+ 0, we get the formula from Proposition 1 for
the mass density u(x, t) at the point x at time t.
i:
a := 2":n, from Proposition 1 we obtain the following formal solution of
(209*):
(A1/;o) (x, t) := C::nt) ~ e- m(;i-;;~)2 1/;0 (y)dy,
for all x E JR and t > O. Here we choose d := elf. To justify this, let us
write the Schri::idinger equation (209*) in the usual operator form
for all t E R
for all x E JR
and define D:= {u a ,,i3:a: E JR,,8 > O}. It has been proved in Problem 3.6
that span D is dense in X. We shall show ahead that (A1/;o) (x, t) can be
continued to all times t E JR if 1/;0 E span D.
Bya classic solution of the Schri::idinger equation (209*), we understand
a COO-function 1/; = 1/;(x, t) on JR2 that satisfies (209*).
Proposition 3. Let 1/;0 E span D be given. Then, the following hold true:
for all t E R
More precisely, we have -~ < arg(1 + 4if3t) < ~ for all t E JR, and we
choose that branch of the square root where
2if3 . _ 13("_0)2
Bt(X t) = (2f3(x - a)2 - 1 - 4zf3t)e 1+ 4i 13 t •
, (1 + 4if3t)2J1 + 4i!3t
for all x, t E R
i.e., 'IjJ(t) represents the function x ~ B(x, t) on R Let us show the follow-
ing:
(b) The time derivative 'IjJ'(t) exists in the Hilbert space X for all t E R
Explicitly, 'IjJ'(t) = BtL t) for all t E JR.
This follows easily from the explicit expression for B(x, t) by using ma-
jorants for parameter integrals based on
(M)
1 (arB(X,t))2d
00
-00
a xr
x < 00, r = 0,1, ... ,
for all t E JR. Thus, the functions B(·, t), Bx(-, t), and B xx (·, t) belong to
X = L~(JR). Hence 'IjJ E D(Ho).
384 5. Self-Adjoint Operators, the Friedrichs Extension, etc.
with the unitary operator U(t) := e- i ' : . Hence IIU(t)II = 1 for all t E R
Since the set span D is dense in the Hilbert space X, there exists a
sequence (?fon) in span D such that ?fon ----+ ?fo in X as n ----+ 00. This
implies
?f(t) = lim (A?fon)(-, t) in X as n ----+ 00,
n--->oo
for all t ERIn fact,
In this section, we only use purely formal arguments in the spirit of the
two great physicists Dirac and Feynman. We hope that our detailed pre-
386 5. Self-Adjoint Operators, the Friedrichs Extension, etc.
1:
for each initial state u(x, s) = uo(x) at the initial time s, namely,
This will be shown later. Therefore, the Green function plays a fundamental
role in physics. It is decisive that we will compute the Green function
without referring to the results from Section 5.22. To this end, we will use
the Dirac calculus and the Euclidean strategy. Recall from Section 5.21 that
and fixed initial time s. Here, u(x, t) denotes the mass density at the point
x at time t, and a > 0 is the so-called diffusion coefficient. Introducing the
operator
where S(t, s) := e-(t-s)H. Here, u(t) stands for the function x I--> u(x, t)
onR
is called the Green function (or the propagator) of the diffusion equation
(212).
Formal Proposition 2. For given uo, equation (210) yields the solution
u = u(x, t) of the initial-value problem (212) for the genemlized diffusion
equation.
i:
u(x,t)
y
= G(x, t; y, s)uo(y)dy. o
i:
damental relation:
if t ~ T ~ s. (214)
Hence
This is (213). o
The proof shows that (213) can be regarded as a localized version of the
semigroup property (214).
i:
Formal Proposition 4. Let 6.t > 0 be small. Then, for all x, y, t E R,
1:
Thus, up to terms of order (t:.t)2,
This is (215). o
Formal Proposition 5. If we set t:.t := ~+~, Xn+1 := x, Xo := y, and
._
S .- ~ zn+l{ ( ) Pj - aPj - U(Xj-l) }(-Zt:.t) ,
. Xj - Xj-I
t:.t
2 .
(216)
then
Formal Proof. Let tk := kt:.t + s. Then, s := to < tl < ... < tn+l := t.
1:
By (213),
i:
Formal Proof. By Example 2 in Section 3.7, the Fourier transformation
yields the following rigorous formula:
i:
for all a > 0 and f3 E JR. Furthermore, we will use the formal relation 15
By (216),
n n+1
is = iPn+1xn+1 - iPIXO + i LXj(Pj - PHI) - L ap]Llt,
j=1 j=1
where (n + I)Llt = t - s. Hence
= (27r)-n-l j 00
ei(Pn+lX-PlY)
n+1
II e-ap~Atdpj II eiXr(Pr-Pr+ddxr .
n
A = (27r)-1 j 00
ei(Pn+lX-PlY)
n+l
II e-ap~Atdpj II 8(Pr+l - PrJ.
n
i:
Observing that J~oo 8(p-q)f(p)dp = f(q), integration over Pn+b Pn, ... ,P2
yields
A = (27r)-1 eiPl(X-Y)e-ap~(n+1)Atdpl
1 (x(y)2
= (47ra(t - s))-"2 e- 4a .-s).
Definition 7. We are given Llx > 0 and Llp > O. By a discrete path, we
understand a curve
x = xnCr), P = Pn(r), s:::; r:::; t,
where Xn,Pn : [s, tl ---> lR are piecewise linear, continuous functions such
that
Furthermore, we postulate that Xn (.) connects the fixed initial point y with
the fixed end point x, i.e., we set
Let P n denote the set of all these paths. According to (216), we define
S(Xn,Pn) := L
n+1 { Z.
Xn,j(
~t
)
- Xn,j-1 2 } .
Pn,j - aPn,j - U(xn,j-d (-z~t).
J=1
(219)
Here, Xn,j := xn(tj) and Pn,j := Pn(tj).
By an admissible path, we understand a curve
s ~ T ~ t,
which connects the fixed initial point y and the fixed end point x, i.e.,
xes) := y and x(t) := x. Furthermore, let pes) := O. In addition, we demand
that the integral
Here, the function S has to be taken at all the possible node points, i.e.,
where we sum over all discrete paths (xn' Pn). Therefore, from (217) we get
the following fundamental formula:
(221)
where the symbol "lim" stands for the following formal limiting process:
Here, the "integral" has to be taken over all paths (x(-),p(·)) E P and
DxDp denotes a "measure" on the path space P.
for the fixed initial time s. Here, a := ::. According to (141), this equation
describes the motion of a particle of mass m on the real line, under the
action of the force -U'(x) at the point x E JR. in direction of the positive
x-axis. Recall that
provided J~oo lu(x, tWdx = 1. The Hamiltonian (i.e., the energy operator)
is given by
p2
H·--+U
.- 2m '
392 5. Self-Adjoint Operators, the Friedrichs Extension, etc.
First choose such physical units that fi = 1. Then, all the formulas from
Section 5.23.1 can be applied to (222) if we replace the real time t by the
imaginary time it.
In order to get formulas that display the role of Planck's quantum of
*
action fi, forget about the convention fi = 1. Then, the rescaling t ~
and p ~ yields the following basic formulas:
*
(i) The Green function G of the Schrodinger equation (222) is defined
through
i:
equation is given by
(226)
which connect the fixed initial point y and the fixed end point x, i.e.,
x(s) := y and x(t) := x. Furthermore, we postulate that p(s) = 0 and that
the integral
(227)
exists.
(iv) In terms of classical mechanics, the integral (227) represents the
action 16 of the admissible path (x(·),p(·)). Here, x(t) := position at time
p(t)2
1-l(t):= 2m + U(x(t)) = energy at time t.
(v) The Green function G can be expressed by the following Feynman
path integral:
G(x,t;y,s)
r
= }p e
is(xC),p(·))
Ii VxVp, (F)
all the admissible paths x = x(T), p = p(T) in the (x, p) -phase space.
Then, by (F), the Green function is
is(x(·),pC))
the mean value over all the numbers e n ,
where S denotes the classical action along the admissible path x = x( T),
p = p(T) in the (x,p)-phase space. This action is measured in units of Ti,
which corresponds to
27rTii(t-S))-1/2 im(x-y)2
Go(x, t; y, s) =( m e 2n(' 8)
where
S,,(t, s):=
where
H(t) := Ho + U(·, t),
and Ho := - 2'::
d~2. Observe that the potential U = U(x, t) may depend on
time t. Let u = u(t) be the solution of (228). Then, the propagator S(t, s)
is defined through
u(t) = S(t, s)uo. (229)
(230)
if tl ;::: t2
if t2 ;::: h.
More generally,
where tl', ... , tn' denotes a permutation of tl, ... , tn with h, ;::: t2' ;::: ... ;:::
t n ,. Instead of (230), physicists formally write
n= 0,1, ... ,
396 5. Self-Adjoint Operators, the Friedrichs Extension, etc.
we obtain
u(t) = Uo + f(ili)-n
n=l
J H(tl)··· H(tn)uo,
where J := J: dtl J:1 dt2 ... J:n-1 dt n. This yields (230). In fact, for exam-
ple, we get
J:= it it1
dt 1 dt2H(tt}H(t2) = it it H(tl)H(t2)(J(tt - t2)dt1dt2,
~ 1 i(t?lH
S(t, s) = I + ~ (ili)nn! (t - st H n = e- .
Recall that by definition the state ¢ is called a state of sharp energy E iff
H¢ = E¢, i.e.,
iIiSt(O, O)¢ = E¢. (231)
In terms of the Green function G(x, tj y, s), this means that
iii L
y
:t (x I S(t,O) I y}(y I ¢}It=o = E(x I ¢},
5.24. The Importance of the Propagator in Quantum Physics 397
I:
i.e., equation (231) is equivalent to
1:1:
Formal Proposition 4. We have
if s ~ a ~ t.
Set
Suppose that
(<p/3 1 S(t, s) 1 <Pa) = ~::::>j8/3ja.
j
By (232),
W/3 ,a = { 0la "12
J
if {3 = {3J"
if {3 # {3j for all j.
(233)
Then, <Pa is a "generalized state" and physicists assume that (233) remains
valid. This is motivated by the philosophy that 8(a - {3) represents a con-
tinuous version of the Kronecker symbol 8a /3'
This argument will be used later in scattering theory (cf. Remark 10).
•
P
(a)
• ~
scattering process
• •q
•
P
J U(td
•q
s :S t1 :S t
•
P
J U(t1)
P1
J U(t2)
••
q
s :S t1 :S t2 :S t
FIGURE 5.13.
Recall that {¢p} forms a complete orthonormal system in the sense of the
Dirac calculus, i.e.,
J
and
(q I S2(t, s) I p) = -~ T(q I U(td I Pl)(Pl I U(t2) I p),
Then
v'(t) = eitHoiHou(t) + eitHou'(t) = eitHO (-iU)u(t).
Hence
Letting Uo := CisH0</J and observing that v(s) = </J, it follows from the
Formal Theorem 1 that
v(t) = eitHoS(t, s)CisH0</J
(235)
i:
By the Dirac calculus, for example,
Furthermore,
where
402 5. Self-Adjoint Operators, the Friedrichs Extension, etc.
The operator
S¢:= lim eitHOS(t,s)e-isHO¢
s---+-c::x:>,t-++CX)
- P ......
1--......., ~,
P
• c d
• P
FIGURE 5.14.
Pin = 1.
For large -t and -x, this stream is described by the function
These particles either pass the potential barrier U or are reflected (cf.
Figure 5.14). More precisely, we get two different particle streams near
time t = +00, namely:
(i) a particle stream from left to right of velocity v';ut = Vin with the
particle density
(ii) and a reflected particle stream from right to left of velocity V';-ut
-Vin with the particle density
For large t and x, the particle streams in (i) and (ii) are described by the
functions
Observe that
(237)
In fact, since U(x) = 0 for large lxi, the incoming and outgoing particles
have the energy Ein = 2- 1mv?n and Eout = 2-1mv~ut, respectively. It
follows from Ein = Eout that Vout = ±Vin·
Observe that the structure of quantum scattering processes differs com-
pletely from the behavior of classical particles. For example, in classical
mechanics, it follows from (237) and U(x) = 0 for all x ~ [c, d] that a scat-
tered particle P of initial velocity Vin has the energy E = 2- 1mv?n. Thus,
by (237), the particle P cannot pass the potential barrier if U(x) > 0 for
some point x.
i:
where ¢~~t(x) := eipx . This way we get the first approximation
and
i:
In particular, for the Born approximation (239) we get
i:
Recall that Ho = - 2~~. Thus, it follows from (238) that
= -U(x)<Pout(x).
Hence H<pout = E(p)'Ij;out. This implies i'lj;t = H'Ij;. D
Remark 13 (Physical interpretation). The solution 'Ij; from (241) has the
following asymptotic behavior:
where
1/Jin(X,t):= e-itE(p)eiPx, 1/J;ut(X,t):= a±(p)e-itE(p)e±ipx,
for fixed p > O. Here, 1/Jin and 1/J';ut correspond to an incoming and outgoing
particle stream of velocity v := !n from left to right and particle density
Pin = 1 and P';ut = la+ (p) 12, respectively. Furthermore, 1/J;;ut corresponds
to a reflected particle stream from right to left of velocity v = -!n and
particle density P;;ut = la_(p)12 (cf. Figure 5.14).
If we use the Born approximation (240) for a±(p), then this result coin-
cides with Remark 10.
5.25.1 Solitons
The Korteweg-de Vries equation (KdVequation) is given by
Ut + 6uu x + Uxxx = 0, -00 < x,t < 00. (242)
5.25 A Look at Solitons and Inverse Scattering Theory 407
,....,
C2 C2
A .. f\~ .. ..
c] c]
\
,
"
I \
,
I I \
\~
' .. ,'-
I
.- .;
FIGURE 5.15.
An explicit calculation shows that this equation has the following solution: 17
where
along with
Cj = 4k;, j = 1,2,
and A3 := (~~+~~) 2 A I A 2. Here, the numbers kl' k2 > 0 and AI, A2 > 0
are given.
j = 1,2. (245)
17R
eca11 t h at sechx.-
' - cosh
1
x
_
-
2
eX+e-X'
408 5. Self-Adjoint Operators, the Friedrichs Extension, etc.
This shows that the solution (244) behaves like two solitons at time
t =-00 and time t = +00 (cf. Figure 5.15(b)). It is quite remarkable
that the two solitons are stable, i.e., they do not change their shape and
velocity after collision. There appears only a phase shift In summary, xTo.
we observe the following:
Solitons behave like particles under collisions.
This important fact was discovered by Kruskal and Zabusky via computer
experiments in 1963.
Ix - cltl < a
for fixed a > O. Then, as t -+ +00,
t -+ +00.
This yields (245) for j = 1. Now let (x, t) E ]R2 be given such that
Then, as t -+ +00,
Hence
This implies
(j2
u(x, t) = 2 8x 2 lncp(x, t)
_ 2 8 2 I (1 A3 21)2)
- 8x 2 n + Al e ,
Additionally, we obtain
X - t -00. (249)
410 5. Self-Adjoint Operators, the Friedrichs Extension, etc.
Additionally, we have
x --+ +00,
where Cj is real.
The mapping
(250)
with k E IR and j = 1, ... ,N, is called the spectral transform.
In terms of quantum mechanics, (i) and (ii) correspond to scattered par-
ticles and to bound states of particles, respectively.
Let all the scattering data a(k), b(k), qj, and Cj be given. The main
task of inverse scattering theory consists in constructing the corresponding
potential u. To this end, we set
K(x, y) + F(x + y) + 1 00
K(x, z)F(z + y)dz = O. (251)
d
u(x) = -2 dxK(x,x).
for fixed, but otherwise arbitrary, time t. Then, we have the following im-
portant result:
a(k, t) = 0,
(253)
qj = 0,
(a) For given initial values u = uo(x), we compute the spectral transform
a(k, 0), b(k, 0), qj(O), Cj(O).
(c) We obtain the solution of the original problem (252) by using the
inverse spectral transform
d2
L(t)'IjJ := - dx 2 'IjJ(x) + u(x, t)'IjJ(x) ,
A(t)'IjJ
d3
:= 4 dx 3 'IjJ(x)
d a
- 3u(x, t) dx 'IjJ(x) - ax (u(x, t)'IjJ(x)),
where the fixed function u satisfies the KdV equation (252). From (252) we
obtain the key relation
L t = LA - AL. (254)
Hence
Consequently,
Thus, the operator L(t) has the same eigenvalues as the operator L(O), i.e.,
we obtain the crucial relation
for all t,
and hence qj = O. The pair {L, A} with (254) is called a Lax pair. Let
't/J = 't/J(x, t) and let
(255)
for fixed t, where q = qj. By Section 5.25.2, for fixed t, the eigenfunction
't/J can be characterized by the following asymptotic behavior:
x --+ -00. (256a)
In addition,
x --+ -00.
(257)
In this connection, note that u is rapidly vanishing as Ixl --+ 00, i.e., we
may put A = 4-1l:s as Ixl --+ 00. Finally, it follows from (256b) and (257)
that
c(t) = 8q3 c(t).
Step 3: Continuous spectrum. Let k > 0 be fixed, and let 't/J denote the
eigenfunction of the equation
(258)
't/J(x, t) = a(k, t)e- ikx + b(k, t)e ikx + 0(1), x --+ +00. (259b)
Problems 413
Ut + U xxx = o.
I:
U sing the Fourier transformation
we get
This implies
Problems
5.1. A special fundamental solution. Let pER Show that
I:
Solution: Let
v(x) := f oo
-00
ieiplx-yl
2p
f(y)dy.
onR
Hint: Use the decomposition v(x) = Jxoo + ... +J~oo··· and an analogous
argument as in the proof of Proposition 1 in Section 2.7.
5.3. Graph closed operators. Let A: D(A) <:;;; X ---+ X be a linear operator
on the Hilbert space X over lK such that D(A) is dense in X. The set
is called the graph of A. The operator A is called graph closed iff G(A) is
closed in X x X, i.e.,
imply Au = v. The linear operator B: D(B) <:;;; X ---+ X is called the closure
of A iff A <:;;; B and 19
G(A) = G(B).
We write A instead of B. Show the following:
(i) The adjoint operator A* is graph closed.
190bserve that (u,v) E G(A) iff there exists a sequence {(Un,V n )} in G(A)
such that Un -> U and Vn -> v in X as n -> 00.
Problems 415
(ii) The closure A exists iff it follows from Un E D(A) for all n along with
AU n -+ v and Un -+ 0 asn-+oo
that v = o.
(iii) If there exists a linear graph closed operator C: D( C) <;;; X -+ X such
that A <;;; C, then the closure A exists and
A<;;;c.
Hence the closure A is the smallest graph closed extension of A. In
particular, A is uniquely determined by A.
(vii) The operator A is graph closed iff D(A) is a Hilbert space over lK
equipped with the inner product
(u I V)A := (u I v) + (u I Av).
Hence
~~~=~ ~
and hence v E R(A - >.I) . Conversely, if v E R(A - >.I) ,then
as n -+ 00.
In fact, if Av - AV = 0, then
(1m A)llvl1 2 = Im{A(v I v)} = Im(v I AV)
= Im(v I Av) = 0,
since (Av I v) = (v I Av) = (Av I v). Thus, the operator A- AI: D(A) ----+ X
is injective. By Problem 5.4,
and
R(A - AI) = X.
In this connection, observe that A = A*, and hence A is graph closed.
Thus, A E p(A).
Ad (iii). Use (i) and (ii).
Ad (iv). If A is self-adjoint, then A = A *. Hence A is graph closed and
where a and b are fixed real numbers with 0 :::; a < 1 and b 2': o.
Show that A + B is self-adjoint.
Solution: Let a E lR with a # O. Since ia E p(A), the operator (A -
iaI)-I: X ---+ X is linear and continuous. We shall show ahead that
IIB(A - iaI)-III < 1 for all a E lR: lal 2': ao, (262)
for all u E X.
Furthermore,
for all u E X.
H:=Ho+U,
(i) X). is a complex Hilbert space equipped with the inner product
(264)
5.10. The hydrogen atom and the Friedrichs extension. The Schrodinger
equation for the motion of the electron in the hydrogen atom is given
through
in'IjJ = 1i'IjJ,
where 1i := - : : ~ + U with the Coulomb potential for the electron
1 e2
U(x) := --4-
7fco
-I
x
I on R3 - {a}.
Hence
Problems 421
(Hu I u) = 1{1R3
li2 e2 u 2
-(u~ +u~ +u~) - - - -
2m 4m::o r
}
dx.
5.11 **. The spectrum of the hydrogen atom. Show that the spectrum (J(H)
of the Hamiltonian H from Problem 5.10 consists of the eigenvalues
n = 1,2, ... ,
(Jess(H) = [0,00[.
Hint: Study the proof in Triebel (1972). In terms of a classical picture,
the eigenvalues En and E E (Jess(H) correspond to the energy of bounded
orbits (Figure 5.16(a)) and unbounded orbits (Figure 5.16(b)), respectively.
Solution: Ad (i). Let {Uj} and {Vj} be two complete orthonormal sys-
tems in X. Then, by the Dirac calculus from Section 5.21,
r
422 5. Self-Adjoint Operators, the Friedrichs Extension, etc.
electron
FIGURE 5.16.
and hence
m m,k,r
k,r m
where "..l" denotes the orthogonal complement. Show that the following
two statements are equivalent:
Problems 423
CA := (A - if) (A + if)-l
is called the Cayley transform of A. Show that
(vi) If A is graph closed, then D(CA) and R(CA) are closed linear sub-
spaces of X.
Hint: Use Problems 5.3 through 5.5. Cf. Riesz and Nagy (1955), Section
123.
n± := dim(A* ± if)
If one does not sometimes think the illogical, one will never discover
new ideas in science.
Mathematics is an ancient art, and from the outset it has been both
the most highly esoteric and the most intensely practical of human
endeavors. As long ago as 1800 B.C., the Babylonians investigated
the abstract properties of numbers; and in Athenian Greece, geome-
try attained the highest intellectual status. Alongside this theoretical
understanding, mathematics blossomed as a day-to-day tool for sur-
veying lands, for navigation, and for the engineering of public works.
The practical problems and the theoretical pursuits stimulated one
another; it would be impossible to disentangle these two strands.
Much the same is true today. In the twentieth century, mathemat-
ics has burgeoned in scope and in diversity and has been deepened
in its complexity and abstraction. So profound has this explosion
of research been that entire areas of mathematics may seem unin-
telligible to laymen-and frequently to mathematicians working in
other subfields. Despite this trend towards-indeed because of it-
mathematics has become more concrete and vital than ever before.
In the past quarter of a century, mathematics and mathematical
techniques have become an integral, pervasive, and essential compo-
nent of science, technology, and business. In our technically oriented
society, "innumeracy" has replaced illiteracy as our principal edu-
cational gap. One could compare the contributions of mathematics
to our society with the necessity of air and food for life. In fact, we
could say that we live in the age of mathematics-that our culture
has been "mathematized." No reflection of mathematics around us
is more striking than the omnipresent computer ....
There is an exciting development taking place right now, reuni-
fication of mathematics with theoretical physics ....
In the last ten or fifteen years mathematicians and physicists re-
alized that modern geometry is in fact the natural framework for
gauge theory (cf. Sections 2.20ff in AMS Vol. 109). The gauge po-
tential or gauge theory is the connection of mathematics. The gauge
Epilogue 427
Almost all concepts, which relate to the modern measure and inte-
gration theory, go back to the works of Henri Lebesgue (1875-1941).
The introduction of these concepts was the turning point in the tran-
sition from mathematics of the nineteenth century to mathematics
of the twentieth century.
Naum Jakovlevic Vilenkin, 1975
UAn E A nAn
00 00
and E A.
n=l n=l
J1,(0) = vol(O).
(vii) The subset A of]RN has the N-dimensional measure zero, i.e., A E A
and
J1,(A) = 0
iff, for each c > 0, there is a countable number of N-cuboids 01, O2 ,
... such that
00 00
j=l j=l
Appendix 431
(viii) If the set A has the N-dimensional measure zero and B ~ A, then
the set B also has the N-dimensional measure zero.
u(x) = n--+oo
lim un(x) for almost all x E M
iff this limiting relation holds for all x E M - Z, where the set Z has the
N-dimensional measure zero.
where the infimum is taken over all the open subsets G of]RN with M ~ G.
In particular,
meas(]RN) = +00 and meas(0) = O.
432 Appendix
---c
. u
/.
L-~r--------r----_X
a b
FIGURE A.I.
Step Functions
Recall that lK = ]R or lK = C. A function
u: M ~ ]RN -+ lK
Measurable Functions
The function
(ii) There exists a sequence (un) of step functions Un: M -+ lK such that
u(x) = n--+oo
lim un(x) for almost all x E M.
Appendix 433
are measurable for all n and the limit (L) exists for all x EM. Then, the
functions
F, G, H: M ~ JRN -+ II(
are also measurable.
u(x) = n--->oo
lim un(x) for almost all x E M (B)
and
for all n, m 2: no(E). (C)
(i) There is a sequence (un) of step functions Un: M ->JK such that (B)
holds.
(ii) For each E > 0, there is a number nO(E) such that (C) holds.
u: M ->JK
Standard Example 3. Let the function f: M <::; ]RN --> ]R be almost every-
where continuous on the bounded measurable set M (e.g., M is bounded
and open or M is compact). Suppose that there is a point Xo in M such
that
lu(x)l:S; const for all x E M with Xo =I- x (H)
Ix - xol,6
and fixed (3: 0 :s; (3 < N. Then, u is integrable over M.
Condition (H) controls the growth of the function u as x --> Xo.
Measure. Let M be a measurable subset of]RN with meas(M) < 00. Then
1M dx = meas(M),
where we write JM dx instead of JM u dx with u == 1.
Here, det l' (y) denotes the determinant of the first partial derivatives of
the function f at the point y.
r
JKUM
udx= r udx+ JMr udx.
JK
Convergence with respect to domains. Let u: M C ]RN ----+ lK be a
function. Suppose that
CXJ
and M= UMn.
n=l
lim r udx.
r udx = n~ooJMn
1M
Absolute Continuity. Let u: M ~ ]RN ----+ lK be integrable. Then, for each
c > 0, there is a 8 > 0 such that
Ii udxl < c
Appendix 437
I r
JM-H
udxl::; r
JM-H
luldx < c,
for fixed p 2': 1. Set u(x) := 0 outside M. Then, for each c > 0, there is a
8(c) > 0 such that
1M lu(x + h) - u(x)IPdx < c for all h E JRN with Ihl < 8(c).
lim
n~CXJ }
r undx = r lim un(x)dx,
M } M n---+oo
where all the integrals and limits exist, provided the following two conditions
are satisfied:
(i) The functions Un: M ~ JRN ---; OC are measurable for all n and the
limit
lim un(x) exists for almost all x E M.
n ..... oo
and
1M undx::; C for all n and fixed C > o.
and
Then
More precisely,
If we set u(x) := 0 for all the points x of M with limn--> 00 Un (x) = 00, then
the function u: M ----> JR. is integrable and
r u dx::; n-->oo
JM
lim r undx::; C.
JM
Iterated Integration
Our goal is the following fundamental formula:
Special Case. Let M := {(x,y) E ]R.2:a < x < b,c < y < d}, where
-00 :::; a < b :::; 00 and -00 :::; c < d :::; 00.
Then, N = L = 1 and formula (I) reads as follows:
1M u(x,y)dxdy = lb (l dU(X,Y)dY) dx = ld (l b
U(X,Y)dX) dy.
Parameter Integrals
We consider the function
F(p):= 1M f(x,p)dx,
for all parameters pEP. We are- given the function
f:M x P ....... K,
where the infimum is taken over all the possible finite decompositions V of
the interval [a, b], i.e.,
(4)
Appendix 441
1 00
-00
f(x)dg(x):= lim
b-->+oo
a---+-oo
Ib f(x)dg(x),
a
Standard Example. Let -00 ::; a < b::; 00. Then, the formula
(i) u:S;uforalluEC.
Zorn's Lemma. Let C be a nonempty ordered set which has the property
that each totally ordered subset T of C has an upper bound, i. e., there is an
element b of C such that
for all u E T,
where b depends on T.
Then, there exists a maximal element in C.
Example 1. Let S be a set, and let C be the collection of all the subsets
of S. For u, v E C, we write
iff u t:;; v.
Example 2. The set ffi. of real numbers is totally ordered, but ffi. does not
have any maximal element.
Abraham, R., Marsden, J., and Ratiu, T. (1983): Manifolds, Tensor Anal-
ysis, and Applications. Addison-Wesley, Reading, MA.
Albers, D., Alexanderson, G., and Reid, C. (1987): International Mathe-
matical Congresses: An Illustrated History 1893-1986. Springer-Verlag,
New York.
Albeverio, S. and H~egh-Kron, R. (1975): Mathematical Theory of Feyn-
man Path Integrals. Lecture Notes in Mathematics, Vol. 523, Springer-
Verlag, Berlin, Heidelberg.
Albeverio, S. and Brezniak, Z. (1993): Finite-Dimensional Approximation
Approach to Oscillatory Integrals and Stationary Phase in Infinite Di-
mensions. J. Funct. Anal. 113, 177-244.
Allgower, E. and Georg, K. (1990): Numerical Continuation Methods.
Springer-Verlag, New York.
Alt, H. (1992): Lineare Funktionalanalysis: eine anwendungsorientierte
Einfiihrung. 2nd edition. Springer-Verlag, Berlin, Heidelberg.
Amann, H. (1990): Ordinary Differential Equations: An Introduction to
Nonlinear Analysis. De Gruyter, Berlin.
Amann, H. (1995): Linear and Quasilinear Parabolic Problems, Vol. 1.
Birkhauser, Basel.
Ambrosetti, A. (1993): A Primer of Nonlinear Analysis. Cambridge Uni-
versity Press, Cambridge, UK.
Ambrosetti, A. and Coti-Zelati, V. (1993): Periodic Solutions of Singular
Lagrangian Systems. Birkhauser, Basel.
Antman, S. (1995): Nonlinear Elasticity. Springer-Verlag, New York.
Appell, J. and Zabrejko, P. (1990): Nonlinear Superposition Operators.
Cambridge University Press, Cambridge, UK.
444 References
Methods for Science and Technology; Vol. 1: Physical Origins and Clas-
sical Methods; Vol. 2: Functional and Variational Methods; Vol. 3: Spec-
tral Theory and Applications; Vol. 4: Integral Equations and Numerical
Methods; Vol. 5: Evolution Problems I; Vol. 6: Evolution Problems II -
the Navier-Stokes Equations, the Transport Equations, and Numerical
Methods. Springer-Verlag, Berlin, Heidelberg. (Translated from French.)
Davies, P. (ed.) (1989): The New Physics. Cambridge University Press,
Cambridge, UK.
Deimling, K. (1985): Nonlinear Functional Analysis. Springer-Verlag, New
York.
Deimling, K. (1992): Multivalued Differential Equations. De Gruyter,
Berlin.
Deufihard, P. and Hohmann, A. (1993): Numerische Mathematik I. De
Gruyter, Berlin. (English edition: Numerical Analysis: A First Course
in Scientific Computation. De Gruyter, Berlin, 1994.)
Deufihard, P. and Bornemann, F. (1994): Numerische Mathematik II. Inte-
gration gewohnlicher Differentialgleichungen. De Gruyter, Berlin. (En-
glish edition in preparation.)
DeVito, C. (1990): Functional Analysis and Linear Operator Theory. Addi-
son-Wesley, Reading, MA.
Diekman, 0., van Gils, S., Verduyn Lunel, S., and Walther, H.-O.
(1995): Delay Equations: Functional-, Complex-, and Nonlinear Analy-
sis. Springer-Verlag, New York.
Dierkes, U., Hildebrandt, S., Kuster, A., and Wohlrab, O. (1992): Minimal
Surfaces, Vols. 1, 2. Springer-Verlag, Berlin, Heidelberg.
Dieudonne, J. (1969): Foundations of Modern Analysis. Academic Press,
New York.
Dieudonne, J. (1981): History of Functional Analysis. North-Holland, Am-
sterdam.
Dieudonne, J. (1992): Mathematics-the Music of Reason. Springer-Verlag,
Berlin, Heidelberg.
Di Francesco, P., Mathieu, P., and Senechal, D. (1997): Conformal Field
Theory. Springer-Verlag, New York.
Dittrich, W. and Reutter, M. (1994): Classical and Quantum Dynamics
from Classical Paths to Path Integrals. Springer-Verlag, Berlin, Heidel-
berg.
Diu, B., Guthmann, C., Lederer, D., and Roulet, B. (1989): Elements de
Physique Statistique. Hermann, Paris. (German edition: Grundlagen der
Statistischen, Physik, de Gruyter, Berlin, 1369 pages.)
Donoghue, J., Golowich, E., and Holstein, B. (1992): The Dynamics of the
Standard Model. Cambridge University Press, Cambridge, UK.
Dubin, D. (1974). Solvable Models in Algebraic Statistical Mechanics.
Clarendon Press, Oxford.
Dunham, W. (1991): Journey Through Genius: The Great Theorems of
Mathematics. Penguin Books, New York.
References 447
Penrose, R. (1994): Shadows of the Mind: The Search for the Missing Sci-
ence of Conciousness. Oxford University Press, Oxford.
Petrina, D. (1995): Mathematical Foundations of Quantum Statistical Me-
chanics. Kluwer, Dordrecht.
Polak, E. (1997): Optimization. Springer-Verlag, New York.
Polchinski, J. (1998): String Theory, Vols. 1,2. Cambridge University Press,
Cambridge, UK.
Polianin, A. and Manzhirov, A. (1998): Handbook of Integral Equations.
CRC Press, Boca Raton, FLA.
Polianin, A. and Zaitsev, V. (1995): Handbook of Exact Solutions for Or-
dinary Differential Equations. CRC Press, Boca Raton, FLA.
Polyakov, A. (1987): Gauge Fields and Strings. Academic Publishers, Har-
wood, NJ.
Prugovecki, E. (1981): Quantum Mechanics in Hilbert Space. Academic
Press, New York.
Quarteroni, A. and Valli, A. (1994): Numerical Approximation of Partial
Differential Equations. Springer-Verlag, Berlin, Heidelberg.
Rabinowitz, P. (1986): Methods in Critical Point Theory with Applications.
Amer. Math. Soc., Providence, RI.
Racke, R. (1992): Lectures on Evolution Equations. Vieweg, Braunschweig.
Rauch, J. (1991). Partial Differential Equations. Springer-Verlag, New
York.
Reed, M. and Simon, B. (1972): Methods of Modern Mathematical Physics.
Vol. 1: FUnctional Analysis; Vol. 2: Fourier Analysis, Self-Adjointness;
Vol. 3: Scattering Theory; Vol. 4: Analysis of Operators. Academic
Press, New York.
Reid, C. (1970): Hilbert. Springer-Verlag, New York.
Reid, C. (1976): Courant in Gottingen and New York. Springer-Verlag, New
York.
Renardy, M. and Rogers, R. (1993): Introduction to Partial Differential
Equations. Springer-Verlag, New York.
Riesz, F. and Nagy, B. (1955): Ler;ons d'analyse fonctionelle. (English edi-
tion: FUnctional Analysis, Frederick Ungar, New York 1978.)
Rivers, R. (1990): path Integral Methods in Quantum Field Theory. Cam-
bridge University Press, Cambridge, UK.
Rolnick, W. (1994): FUndamental Particles and Their Interactions. Addi-
son-Wesley, Reading, MA.
Roubicek, T. (1997): Relaxation in Optimization Theory. De Gruyter,
Berlin, New York.
Royden, H. (1988): Real Analysis. Macmillan, New York.
Rudin, W. (1966): Real and Complex Analysis. McGraw-Hill, New York.
Rudin, W. (1973): FUnctional Analysis. McGraw-Hill, New York.
Ruelle, D. (1993): Chance and Chaos. Princeton University Press, Prince-
ton, NJ.
Sakai, A. (1991): Operator Algebras. Cambridge University Press, Cam-
bridge, UK.
References 455
Sattinger, D. and Weaver, O. (1993): Lie Groups, Lie Algebras, and Their
Representations. Springer-Verlag, New York.
Scharf, G. (1995): Finite Quantum Electrodynamics. Springer-Verlag,
Berlin, Heidelberg.
Schechter, M. (1971): Principles of Functional Analysis. Wiley, New York.
Schechter, M. (1982): Operator Methods in Quantum Mechanics. North-
Holland, Amsterdam.
Schechter, M. (1986): Spectra of Partial Differential Operators. North-
Holland, Amsterdam.
Schmutzer, E. (1989): Grundlagen der theoretischen Physik, Vols. 1, 2.
Deutscher Verlag der Wissenschaften, Berlin.
Schwabl, F. (1995): Quantum Mechanics. 2nd edition. Springer-Verlag,
Berlin, Heidelberg.
Schwabl, F. (1997): Quantenmechanik fur Fortgeschrittene. Springer-
Verlag, Berlin, Heidelberg.
Schwarz, A. (1993): Quantum Field Theory and Topology. Springer-Verlag,
Berlin, Heidelberg.
Schwarz, A. (1994): Topology for Physicists. Springer-Verlag, Berlin, Hei-
delberg.
Schweber, S. (1994): QED (Quantum Electrodynamics) and the Men Who
Made It: Dyson, Feynman, Schwinger, and Tomonaga. Princeton Uni-
versity Press, Princeton, NJ.
Scott, G. and Davidson, K. (1994): Wrinkles in Time. Morrow, New York.
Simon, B. (1993): The Statistical Mechanics of Lattice Gases. Princeton
University Press, Princeton, NJ.
Smoller, J. (1994): Shock Waves and Reaction-Diffusion Equations. 2nd
enlarged edition. Springer-Verlag, New York.
Spohn, H. (1991): Large Scale Dynamics of Interacting Particles. Springer-
Verlag, Berlin, Heidelberg.
Sterman, G. (1993): An Introduction to Quantum Field Theory. Cambridge
University Press, Cambridge, UK.
Stoer, J. and Bulirsch, R. (1993): Introduction to Numerical Analysis.
Springer-Verlag, New York. (Translated from German.)
Strang, G. and Fix. G. (1973): An Analysis of the Finite Element Method.
Prentice-Hall, Englewood Cliffs, NJ.
Stroke, H. (ed.) (1995): The Physical Review: The First Hundred Years-A
Selection of Seminal Papers and Commentaries. American Institute of
Physics, New York.
Struwe, M. (1988): Plateau's Problem and the Calculus of Variations.
Princeton University Press, Princeton, NJ.
Struwe, M. (1996): Variational Methods, 2nd edition. Springer-Verlag, New
York.
Sunder, V. (1987): An Invitation to von Neumann Algebras. Springer-
Verlag, New York.
Szabo, I. (1987): Geschichte der mechanischen Prinzipien und ihrer wich-
tigsten Anwendungen. Birkhauser, Basel.
456 References
Quantum field theory: Glimm and Jaffe (1981), Reed and Simon
(1972), Vol. 2 (the Garding-Wightman axioms), Bogoljubov and Shirkov
(1983), Mandl and Shaw (1989), Bogoljubov, et al. (1990), Chang (1990),
Haag (1993), Kaku (1993), Weinberg (1995/96), Vols. 1,2, Scharf (1995),
Zinn-Justin (1996), Greiner and Reinhardt (1996).
Scattering theory: Reed and Simon (1972), Vol. 3, Newton (1988),
Colton and Kress (1992), Iagolnitzer (1993).
Elementary particles: Rolnick (1994).
Standard model of elementary particles: Nachtmann (1990),
Donoghue, Golowich, and Holstein (1992), Kaku (1993), Weinberg (1996),
Vol. 2.
Noncommutative geometry and the standard model of elemen-
tary particles: Connes (1994).
The Feynman path integral: Albeverio (1975) as well as Albeverio
and Brezniak (1993) (rigorous theory), Das (1993), Dittrich and Reutter
(1994), Weinberg (1995/96), Vols. 1,2, Greiner and Reinhardt (1996), Zinn-
Justin (1996), Grosche and Steiner (1998).
Cosmology: Zeidler (1986), Vol. 4, Peebles (1993).
Quantum cosmology: Esposito (1993).
Supersymmetry: Berezin (1987), Wess and Bagger (1991).
Superstring theory: Green, Schwarz, and Witten (1987), Kaku (1987),
Lust and Theissen (1989), Hatfield (1992).
Quantum groups: Lusztig (1993), Kassel (1995).
Conformal field theory: Kaku (1991), Di Francesco et al. (1997).
Topology and physics: Nakahara (1990), Marathe and Martucci
(1992), Monastirsky (1993), Schwarz (1994).
Topology, partial differential equations, pseudo differential op-
erators, and the Atiyah-Singer index theorem: Gilkey (1984).
Textbooks in physics: Feynman, Leighton, and Sands (1963), Schmut-
zer (1989), Vols. 1, 2, Greiner (1993), Vols. 1-7, Honerkamp and Romer
(1993).
A survey on modern physics: Davies (1989).
Seminal papers in physics: Stroke (1995).
Essays on modern physics: Kaku and Trainer (1987), Heisenberg
(1989), Weinberg (1992), Gell-Mann (1994), Schweber (1994), Scott and
Davidson (1994), Thorne (1994).
Essays on modern mathematics: Cascuberta and Castellet (1992)
(viewpoints of seven Fields medalists), Penrose (1992), (1994), Kac, Rota,
and Schwartz (1994).
List of Symbols
General Notation
A=}B A implies B
iff if and only if
A{:}B A iff B (i.e., A =} Band B =} A)
f(x) := 2x f(x) = 2x by definition
xES x is an element of the set S
x¢S x is not an element of the set S
{x: ... } set of all elements x with the property ...
S~T the set S is contained in the set T
SeT S ~ T and S =I- T (the set S is properly contained in T)
SUT the union of the sets Sand T (the set of all
elements that live in S or T)
SnT the intersection of the sets Sand T (the set of
all elements that live in Sand T)
S-T the difference set (the set of all elements that
live in S and not in T)
empty set
set of all subsets of S (the power set of S)
462 List of Symbols
(t, Ixnl2)
1
(fa lu(xWdX)
1
(u I Vh,2 := 1G
(uv + t8jU8jV)
3=1
dx
Operators
A:S ~ X ---+ Y operator from the set S into 17
the set Y, where S ~ Y
D(A) (or dom A) domain of definition of the operator A 17
R(A) (or im A) range (or image) of the operator A 17
N(A) (or ker A) null space (or kernel) of the operator A, 70
N(A) := {x: Ax = O}
I (or id) identical operator, Ix := x for all x 76
A(S) image of the set S, A(S) := {Ax: XES}
464 List of Symbols
Special Sets
S closure of the set S 31
int S interior of the set S 31
ext S exterior of the set S 31
as boundary of the set S 31
Ue(p) E-neighborhood of the point p in a 15
normed space, Ue(p) := {x E X: Ilx - pil < d
U(p) neighborhood of the point P 15
dim X dimension of the linear space X 6
Xc complexification of the linear space X 98
X/L factor space (see Section 3.9 of AMS Vol. 109)
codim L co dimension of the linear subspace L,
codim L:= dim(X/L) (see Section 3.9 of
AMS Vol. 109)
List of Symbols 465
Derivatives
u' (t) derivative of an operator function 80
u = u(t) at time t
8j f partial derivative %1J
8C1.f 8r'8~2 ... 8'ft f, where a = (al, ... , aN) 159
(the classical symbols are also used for the
derivatives of generalized functions)
lal the sum al + ... + aN 159
8
derivative in the direction of 181
8n
the exterior normal
N
!::.f Laplacian, !::.f :=L 8} f 125
n=l
8F(x; h) variation of the functional F at the
point x in direction of h (see
Section 2.1 of AMS Vol. 109)
8n F(x; h) nth variation of the functional F
at the point x in the direction of h
(see Section 2.1 of AMS Vol. 109)
A'(x) (or dA(x)) Frechet-derivative of the operator A
at the point x (see Section 4.2 of
AMS Vol. 109)
dn A(x)(h l , ... , h n ) nth Frcchet-differential of the operator A
at the point x in the directions of
hI, ... ,hn (see Section 4.2 of
AMS Vol. 109)
466 List of Symbols
Sobolev Spaces
131, 132
Spaces of Sequences
lK oo , l~, l~ (b := l~ if lK = JR) 95, 177
Spaces of Distributions
V'(G), S' 160, 219
List of Theorems
A good memory does not recall everything, but forgets the unimpor-
tant.
Folklore
Theorem 1.A
(The Banach fixed-point theorem) 19
Theorem loB
(The Brouwer fixed-point theorem) 53
Theorem I.C
(The Schauder fixed-point theorem) 61
Theorem I.D
(The Leray-Schauder principle) 65
Theorem I.E
(The method of sub- and supersolutions) 69
Theorem 2.A
(Main theorem on quadratic minimum problems) 121
Theorem 2.B
(The Dirichlet principle) 138
Theorem 2.C
(The Ritz method) 141
Theorem 2.D
(The perpendicular principle) 165
Theorem 2.E
(The Riesz theorem) 167
Theorem 2.F
(Dual quadratic variational problems) 170
Theorem 2.G
(Nonlinear monotone operators) 173
Theorem 2.H
(The nonlinear Lax-Milgram theorem) 175
Theorem 3.A
(Complete orthonormal systems) 202
Theorem 4.A
(Eigenvalues and eigenvectors of linear,
symmetric, compact operators) 232
Theorem 4.B (The Fredholm alternative for linear, symmetric,
compact operators) 237
Theorem 5.A (The Friedrichs extension of symmetric operators) 280
Theorem 5.B (The abstract Dirichlet problem) 282
468 List of Theorems
Spaces
linear space 7
dimension 7
linear subspace 30
Banach space 10
norm 7
separable 84
reflexive (see Section 2.8 of AMS Vol. 109)
Hilbert space 107
inner product 105
orthogonal elements 105
orthogonal projection 165
complete orthonormal system 200
Fock space (bosons or fermions) 364
Lebesgue space 114
Sobolev space 273
energetic space 273
dual space 74
470 List of the Most Important Definitions
metric space and topological space (see Chapter 1 of AMS Vol. 109)
Convergence
norm convergence 8
Cauchy sequence 10
weak convergence (see Section 2.4 of AMS Vol. 109)
sequentially continuous 27
sequentially compact 33
relatively sequentially compact 33
Operators
domain of definition 17
range and preimage 17
injective 17
surjective 17
bijective 17
inverse operator 17
linear 70
symmetric 264
the Friedrichs extension 280
adjoint 263
dual (cf. Section 3.10 of AMS Vol. 109)
self-adjoint 264
Hamiltonian 328
orthogonal projection operator 270
skew-adjoint 264
unitary 212
Fourier transformation 216
trace class 347
statistical state 348
statistical operator 350
Hilbert-Schmidt operator 347
continuous 26
k-contraction 19
Lipschitz continuous 27
Holder continuous 97
homeomorphism 28
diffeomorphism 436
compact 39
strongly monotone 273
monotone or coercive (see Section 2.18 of AMS Vol. 109)
semigroup 298
Green function (propagator) 386
List of the Most Important Definitions 471
Functional
nonlinear 17
linear 74
COnvex 29
bilinear form 120
bounded 120
symmetric 120
distribution (generalized function) 160
tempered distribution 219
Fourier transformation 220
generalized eigenfunction 344
Dirac delta distribution 161
Green function 160
fundamental solution 183
Palais~Smale condition (see Section 2.16 of AMS Vol. 109)
Embedding
continuous 261
compact 261
Spectrum
Set
open 15
neighborhood 15
interior 30
472 List of the Most Important Definitions
closed 15
closure 30
boundary 31
compact or relatively compact 33
dense 83
convex 29
bounded 33
countable 84
Point
fixed point 18
critical point (see Section 2.1 of AMS Vol. 109)
saddle point (see Section 2.2 of AMS Vol. 109)
bifurcation point (see Section 5.12 of AMS Vol. 109)
Operator Algebras
Banach algebra 76
von Neumann algebra 359
C*-algebra 357
observable 359
state 358
pure 358
mixed 358
KMS-state (thermodynamic equilibrium) 360
*-automorphism 358
dynamics of a quantum system 359
Derivative
time derivative 80
generalized derivative of a function 129
derivative of a distribution 162
nth variation (see Section 2.1 of AMS Vol. 109)
Frechet derivative (see Section 4.2 of AMS Vol. 109)
Integral