
Lecture Notes on

Numerical Analysis of
Partial Differential Equations
Adérito Araújo
Coimbra, January 2019
These lecture notes follow to a large extent Endre Süli's notes¹, but with
things reordered and often expanded. The point of these notes is just to
serve as an outline of the actual lectures which I will give and should not
be used outside that context.

Cover image: “Elipse Lace”, by Susan McBurney, 2005.


Source: www.bridgesmathart.org/art-exhibits/jmm09/mcburney.html.

¹E. Süli, Lecture Notes on Finite Element Methods for Partial Differential Equations, Mathematical Institute, University of Oxford, 2012; E. Süli, An Introduction to the Numerical Analysis of Partial Differential Equations, Mathematical Institute, University of Oxford, 2005.
Chapter 1

Basic functional analysis

Numerical solution of PDEs is a rich and active field of modern applied mathematics. The
steady growth of the subject is stimulated by ever-increasing demands from the natural
sciences, engineering and economics to provide accurate and reliable approximations to
mathematical models involving partial differential equations (PDEs) whose exact solutions
are either too complicated to determine in closed form or, in many cases, are not known
to exist.
While the history of numerical solution of ordinary differential equations is firmly rooted
in 18th and 19th century mathematics, the mathematical foundations of the field of numer-
ical solution of PDEs are much more recent: they were first formulated in the landmark
paper Über die partiellen Differenzengleichungen der mathematischen Physik (On the par-
tial difference equations of mathematical physics) by Richard Courant, Karl Friedrichs, and
Hans Lewy, published in 1928. There is a vast array of powerful numerical techniques for
specific PDEs: level set and fast-marching methods for front-tracking and interface prob-
lems; numerical methods for PDEs on, possibly evolving, manifolds; immersed boundary
methods; mesh-free methods; particle methods; vortex methods; various numerical homog-
enization methods and specialized numerical techniques for multiscale problems; wavelet-
based multiresolution methods; sparse finite difference/finite element methods, greedy algo-
rithms and tensorial methods for high-dimensional PDEs; domain-decomposition methods
for geometrically complex problems, and numerical methods for PDEs with stochastic co-
efficients that feature in a number of applications, including uncertainty quantification
problems.
These notes cannot do justice to this huge and rapidly evolving subject. We shall therefore
confine ourselves to the most standard and well-established techniques for the numerical
solution of PDEs: finite difference methods, finite element methods and a small reference
to finite volume methods. Before embarking on our survey, it is appropriate to take a brief
excursion into the theory of PDEs in order to fix the relevant notational conventions and
to describe some typical model problems.

1.1 Mathematical background and notation


Let us introduce some basic notation that will be used throughout this course. We denote
by R the set of real numbers and
\[
\mathbb{R}^n = \{x = (x_1, \dots, x_n) : x_i \in \mathbb{R}, \ i = 1, \dots, n\}, \qquad \mathbb{R}_+ = \{t \in \mathbb{R} : t > 0\}.
\]
A subset $\Omega \subset \mathbb{R}^n$ is called a domain if it is open and connected. If $\Omega$ is bounded we usually denote its boundary by $\partial\Omega$ (or $\Gamma$). We assume that $\partial\Omega$ is either smooth or a polygon


(if $n = 2$) or a polyhedron (if $n = 3$). By $\bar\Omega$ we denote the closure of $\Omega$, i.e., $\bar\Omega = \Omega \cup \partial\Omega$. The (length, area, or) volume of $\Omega$ is denoted by $|\Omega|$, the volume element in $\mathbb{R}^n$ is denoted by $dx = dx_1 \cdots dx_n$, and $ds$ denotes the arc length (if $n = 2$) or surface area (if $n = 3$) on $\partial\Omega$. For vectors in $\mathbb{R}^n$ we use the Euclidean inner product $x \cdot y = \sum_{i=1}^n x_i y_i$ and the norm $|x| = \sqrt{x \cdot x}$.
Let $u, v$ be scalar functions and $w = (w_1, \dots, w_n)$ a vector-valued function of $x \in \mathbb{R}^n$. We define the gradient, the divergence, and the Laplace operator (Laplacian), respectively, by
\[
\nabla u = \operatorname{grad} u = \left( \frac{\partial u}{\partial x_1}, \dots, \frac{\partial u}{\partial x_n} \right), \qquad
\nabla \cdot w = \operatorname{div} w = \sum_{i=1}^n \frac{\partial w_i}{\partial x_i}, \qquad
\Delta u = \nabla \cdot \nabla u = \sum_{i=1}^n \frac{\partial^2 u}{\partial x_i^2}.
\]
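To make the notation concrete, here is a small symbolic sketch (added for illustration; the sample functions are arbitrary choices, not taken from the notes) that computes these operators with sympy and checks that the Laplacian is the divergence of the gradient:

```python
# Illustration only: gradient, divergence and Laplacian of sample functions in two variables.
import sympy as sp

x1, x2 = sp.symbols('x1 x2')
u = x1**2 * sp.sin(x2)                   # a sample scalar function
w = sp.Matrix([x1 * x2, sp.exp(x1)])     # a sample vector field w = (w1, w2)

grad_u = sp.Matrix([sp.diff(u, x1), sp.diff(u, x2)])    # grad u
div_w  = sp.diff(w[0], x1) + sp.diff(w[1], x2)          # div w
lap_u  = sp.diff(u, x1, 2) + sp.diff(u, x2, 2)          # Laplacian of u

div_grad_u = sp.diff(grad_u[0], x1) + sp.diff(grad_u[1], x2)
print(sp.simplify(lap_u - div_grad_u))   # prints 0: the Laplacian equals div(grad u)
print(div_w)                             # prints x2 for this sample field
```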

The divergence theorem (or Gauss' theorem) says that the integral of the divergence of a vector field over a domain is equal to the integral of the normal component of the field along the boundary:
\[
\int_\Omega \nabla \cdot w \, dx = \int_{\partial\Omega} w \cdot \nu \, ds,
\]
where $\nu = (\nu_1, \dots, \nu_n)$ is the outward unit normal on $\partial\Omega$. This theorem holds for functions $w$ and boundaries $\partial\Omega$ that are sufficiently smooth (to be specified in the forthcoming sections). Applying it to the product $wv$ we obtain Green's formula
\[
\int_\Omega w \cdot \nabla v \, dx = \int_{\partial\Omega} (w \cdot \nu) v \, ds - \int_\Omega (\nabla \cdot w) v \, dx.
\]

When applied with $w = \nabla u$ the formula becomes
\[
\int_\Omega \nabla u \cdot \nabla v \, dx = \int_{\partial\Omega} \frac{\partial u}{\partial \nu} v \, ds - \int_\Omega (\Delta u) v \, dx,
\]
where $\partial u / \partial \nu = \nu \cdot \nabla u$ is the exterior normal derivative of $u$ on $\partial\Omega$. From this, we see that Green's formula is nothing else than a generalisation of the integration-by-parts formula to higher dimensions. If $a(\cdot)$ is a sufficiently smooth real-valued function, we may conclude from Green's formula that
\[
\int_\Omega \nabla \cdot (a(x) \nabla u)\, v \, dx = \int_{\partial\Omega} a(x) \frac{\partial u}{\partial \nu} v \, ds - \int_\Omega a(x) \nabla u \cdot \nabla v \, dx.
\]
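As a quick sanity check of Green's formula, the following sketch (added here for illustration; the polynomial field $w$, the function $v$ and the unit square are arbitrary choices, not data from these notes) verifies the identity symbolically:

```python
# Illustration only: verify Green's formula on the unit square (0,1)^2 for an
# arbitrarily chosen polynomial vector field w = (w1, w2) and scalar function v.
import sympy as sp

x, y = sp.symbols('x y')
w1, w2 = x**2 * y, x * y + y**3        # sample vector field
v = x * y**2 + 1                       # sample scalar function

# left-hand side: integral over Omega of w . grad v
lhs = sp.integrate(w1 * sp.diff(v, x) + w2 * sp.diff(v, y), (x, 0, 1), (y, 0, 1))

# boundary term: sum over the four sides of (w . nu) v ds, with nu the outward unit normal
bnd  = sp.integrate((w1 * v).subs(x, 1), (y, 0, 1))    # right side,  nu = ( 1, 0)
bnd -= sp.integrate((w1 * v).subs(x, 0), (y, 0, 1))    # left side,   nu = (-1, 0)
bnd += sp.integrate((w2 * v).subs(y, 1), (x, 0, 1))    # top side,    nu = ( 0, 1)
bnd -= sp.integrate((w2 * v).subs(y, 0), (x, 0, 1))    # bottom side, nu = ( 0,-1)

# volume term: integral of (div w) v
vol = sp.integrate((sp.diff(w1, x) + sp.diff(w2, y)) * v, (x, 0, 1), (y, 0, 1))

print(sp.simplify(lhs - (bnd - vol)))   # prints 0, confirming the identity
```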

Let $\mathbb{N}$ denote the set of non-negative integers. An $n$-tuple
\[
\alpha = (\alpha_1, \dots, \alpha_n) \in \mathbb{N}^n
\]
is called a multi-index. The non-negative integer $|\alpha| = \alpha_1 + \cdots + \alpha_n$ is referred to as the length of the multi-index $\alpha = (\alpha_1, \dots, \alpha_n)$. We denote $(0, \dots, 0)$ by $0$; clearly $|0| = 0$. Given a function $u : \mathbb{R}^n \to \mathbb{R}$ we denote its partial derivative of order $|\alpha|$ by
\[
\partial^\alpha u = \frac{\partial^{|\alpha|} u}{\partial x_1^{\alpha_1} \cdots \partial x_n^{\alpha_n}}.
\]

For the sake of simplicity we also use subscripts to denote partial derivatives, i.e., $\partial_{x_i} = \partial/\partial x_i$, $i = 1, \dots, n$, and $\partial^\alpha u = \partial_{x_1}^{\alpha_1} \cdots \partial_{x_n}^{\alpha_n} u$. For example,
\[
u_t = \partial_t u = \frac{\partial u}{\partial t}, \qquad
u_{xx} = \partial_x \partial_x u = \partial_x^2 u = \frac{\partial^2 u}{\partial x^2}, \qquad
u_{xy} = \partial_x \partial_y u = \partial^2_{xy} u = \frac{\partial^2 u}{\partial x \partial y}.
\]

Example 1.1. Suppose that $n = 3$ and $\alpha = (\alpha_1, \alpha_2, \alpha_3)$, $\alpha_j \in \mathbb{N}$, $j = 1, 2, 3$. Then, for $u$ a function of the three variables $x_1, x_2, x_3$,
\[
\sum_{|\alpha| = 3} \partial^\alpha u
= \frac{\partial^3 u}{\partial x_1^3} + \frac{\partial^3 u}{\partial x_1^2 \partial x_2} + \frac{\partial^3 u}{\partial x_1^2 \partial x_3}
+ \frac{\partial^3 u}{\partial x_1 \partial x_2^2} + \frac{\partial^3 u}{\partial x_1 \partial x_3^2} + \frac{\partial^3 u}{\partial x_2^3}
+ \frac{\partial^3 u}{\partial x_1 \partial x_2 \partial x_3} + \frac{\partial^3 u}{\partial x_2^2 \partial x_3} + \frac{\partial^3 u}{\partial x_2 \partial x_3^2} + \frac{\partial^3 u}{\partial x_3^3}.
\]

This example highlights the importance of multi-index notation: instead of laboriously writing out in detail the ten terms on the right-hand side of the last identity, we can compress the information into a single entity shown on the left.
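Multi-indices of a given length are also easy to enumerate mechanically. The following short sketch (added for illustration; not part of the original notes) lists the ten multi-indices with $|\alpha| = 3$ for $n = 3$, one for each term in Example 1.1:

```python
# Illustration only: enumerate the multi-indices alpha with |alpha| = 3 in n = 3 variables.
from itertools import product

n, k = 3, 3
multi_indices = [alpha for alpha in product(range(k + 1), repeat=n) if sum(alpha) == k]

print(len(multi_indices))   # 10, matching the ten terms of Example 1.1
for alpha in multi_indices:
    # alpha = (a1, a2, a3) stands for the derivative d^3 u / (dx1^a1 dx2^a2 dx3^a3)
    print(alpha)
```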

1.2 Linear partial differential equations


A linear partial differential operator $L$ of order $k$ with real-valued coefficients $a_\alpha = a_\alpha(x)$, $|\alpha| \le k$, on a domain $\Omega \subset \mathbb{R}^n$, is defined by
\[
L = \sum_{|\alpha| \le k} a_\alpha(x)\, \partial^\alpha, \qquad x \in \Omega,
\]
and a linear partial differential equation (PDE) of order $k$ is defined by
\[
L(u) = f(x).
\]
The linear operator $L$ is called elliptic if, for every $x = (x_1, \dots, x_n) \in \Omega$ and every nonzero $\xi = (\xi_1, \dots, \xi_n) \in \mathbb{R}^n$,
\[
Q_k(x, \xi) = \sum_{|\alpha| = k} a_\alpha(x)\, \xi^\alpha \ne 0.
\]

The archetypal linear second-order uniformly elliptic PDE is $-\Delta u + c(x) u = f(x)$, $x \in \Omega$, where $c$ and $f$ are real-valued functions defined on $\Omega$. When $c < 0$ the equation is called the Helmholtz equation. In the special case when $c(x) = 0$ the equation is referred to as Poisson's equation, and when $c(x) = 0$ and $f(x) = 0$ as Laplace's equation. Elliptic PDEs arise in a range of mathematical models in continuum mechanics, physics, chemistry, biology, economics and finance. For example, in a two-dimensional flow of an incompressible fluid with flow-velocity $u = (u_1, u_2, 0)$ the stream-function $\psi$, related to $u$ by $u = \nabla \times (0, 0, \psi)$, satisfies Laplace's equation. The potential $\varphi$ of a gravitational field, due to an attracting massive object of density $\rho$, satisfies Poisson's equation $\Delta \varphi = 4\pi G \rho$, where $G$ is the universal gravitational constant.

Parabolic and hyperbolic PDEs typically arise in mathematical models where one of the independent physical variables is time, $t$. For example,
\[
\partial_t u + Lu = f \qquad \text{and} \qquad \partial_t^2 u + Lu = f,
\]
where $L$ is a uniformly elliptic partial differential operator of order $2m$ and $u$ and $f$ are functions of $(t, x_1, \dots, x_n)$, are uniformly parabolic and uniformly hyperbolic PDEs, respectively. The simplest examples are the (uniformly parabolic) unsteady heat equation and the (uniformly hyperbolic) second-order wave equation, where
\[
Lu = -\sum_{i,j=1}^n \partial_{x_j}\bigl(a_{ij}(t, x)\, \partial_{x_i} u\bigr),
\]
and $a_{ij}(t, x) = a_{ij}(t, x_1, \dots, x_n)$, $i, j = 1, \dots, n$, are the entries of an $n \times n$ matrix, which is positive definite, uniformly with respect to $(t, x_1, \dots, x_n)$.
Not all PDEs are of one fixed type. For example, the following PDEs are mixed elliptic-hyperbolic; they are elliptic for $x > 0$ and hyperbolic for $x < 0$:
\[
\partial_x^2 u + \operatorname{sgn}(x)\, \partial_y^2 u = 0 \quad \text{(Lavrentiev equation)},
\]
\[
\partial_x^2 u + x\, \partial_y^2 u = 0 \quad \text{(Tricomi equation)},
\]
\[
x\, \partial_x^2 u + \partial_y^2 u = 0 \quad \text{(Keldysh equation)}.
\]
PDEs are rarely considered in isolation: additional information is typically supplied in the form of boundary conditions, imposed on the boundary $\partial\Omega$ of the domain $\Omega \subset \mathbb{R}^n$ in which the PDE is studied, or, in the case of parabolic and hyperbolic equations, also as initial conditions at $t = 0$. The PDE in tandem with the boundary/initial conditions is referred to as a boundary-value problem/initial-value problem or, when both boundary and initial data are supplied, as an initial-boundary-value problem.

1.3 Course overview


In this course we will study linear second-order PDEs. To start with an example, let us consider the mathematical model which describes the vibration of a drum. Let $\Omega \subset \mathbb{R}^2$ be a thin membrane fixed to the brim of a hollow wooden structure. Given an external force $f$, we are interested in finding the displacement $u$ at any point $(x, y)$ in the domain $\Omega$. The above model gives rise to the following problem: find $u(x, y)$ satisfying
\[
-\Delta u(x, y) + u(x, y) = f(x, y), \quad (x, y) \in \Omega, \qquad (1.1)
\]
\[
u(x, y) = 0, \quad (x, y) \in \partial\Omega, \qquad (1.2)
\]
where $\partial\Omega$ is the boundary of $\Omega$. Since the boundary $\partial\Omega$ is fixed, there is no displacement there and hence $u(x, y) = 0$ on the boundary $\partial\Omega$.
By a (classical) solution $u$ of (1.1)–(1.2) we mean a twice continuously differentiable function $u$ which satisfies the partial differential equation (1.1) at each point $(x, y) \in \Omega$ and also the boundary condition (1.2). Note that $f$ has to be continuous if we are looking for classical solutions of (1.1)–(1.2). For the present problem, however, the external force $f$ may not be imparted continuously. In fact $f$ can be a point force, i.e., a force applied at a single point of the domain. Thus a physically more relevant problem is to allow more general $f$, that is, $f$ may have some discontinuities. Therefore, there is a need to generalize the concept of solution.

In the early 1930s, S.L. Sobolev came across a similar situation while dealing with the following first-order hyperbolic equation:
\[
\partial_t u + a\, \partial_x u = 0, \quad t > 0, \ -\infty < x < \infty, \qquad (1.3)
\]
\[
u(x, 0) = u_0(x), \quad -\infty < x < \infty. \qquad (1.4)
\]
Here, $a$ is a real, positive constant and $u_0$ is the initial profile. The exact solution is well known: $u(x, t) = u_0(x - at)$ (this equation is called the first-order advection equation). In this case, the solution preserves the shape of the initial profile. However, if $u_0$ is not differentiable (say it has a kink) the solution $u$ is still meaningful physically, but not in the sense of a classical solution. These observations were instrumental in the development of the modern theory of partial differential equations.
Again coming back to our vibrating drum problem (1.1)–(1.2), we note that in mechanics or in physics the same model is described through the minimisation of the total energy (Dirichlet principle), say $J(v)$, i.e., the minimisation of $J(v)$ over the set $V$ of all possible admissible displacements, where
\[
J(v) = \underbrace{\frac{1}{2} \int_\Omega |\nabla v|^2 \, dx\, dy}_{\text{Kinetic Energy (unit mass)}} \; - \; \underbrace{\int_\Omega f v \, dx\, dy}_{\text{Potential Energy}} \qquad (1.5)
\]
and $V$ is a set of all possible displacements $v$ such that the above integrals are meaningful and $v = 0$ on $\partial\Omega$. More precisely, we cast the above problem as: find $u \in V$ such that $u = 0$ on $\partial\Omega$ and $u$ minimizes $J$, that is,
\[
J(u) \le J(v), \quad \forall v \in V. \qquad (1.6)
\]

The advantage of the second formulation is that the displacement $u$ need only be once continuously differentiable, and the external force $f$ may be of a more general form; e.g., $f$ may be square integrable and may have discontinuities. Further, it is observed that every solution $u$ of (1.1)–(1.2) satisfies (1.6). However, the converse need not be true, since in (1.6) the solution $u$ is only required to be once continuously differentiable. We shall see subsequently that the variational formulation (1.6) is equivalent to a weak formulation of (1.1)–(1.2), and it allows physically more relevant external forces, even a point force.
The main concern of the next sections will be the choice of the admissible space $V$. At this stage, it is worth analysing the space $V$, which will motivate the introduction of Sobolev spaces in Section 1.5.3. With $f \in L^2(\Omega)$ (the space of all square integrable functions), $V$ may be considered as
\[
V = \Bigl\{ v \in C^1(\Omega) \cap C(\bar\Omega) : \int_\Omega |v|^2 \, dx\, dy < \infty, \ \int_\Omega |\nabla v|^2 \, dx\, dy < \infty \ \text{and} \ v = 0 \text{ on } \partial\Omega \Bigr\},
\]
where $C^1(\Omega)$, the space of once continuously differentiable functions in $\Omega$, is such that $C^1(\Omega) = \{v : v, \partial_x v, \partial_y v \in C(\Omega)\}$, and $C(\bar\Omega)$ is the space of continuous functions defined on $\bar\Omega$, with $\bar\Omega = \Omega \cup \partial\Omega$. Hence, $u|_{\partial\Omega}$ is properly defined for $u \in C(\bar\Omega)$. Unfortunately, this $V$ is not complete with respect to the norm
\[
\|u\|_1 = \left( \int_\Omega |u|^2 + |\nabla u|^2 \, dx\, dy \right)^{1/2}.
\]

Roughly speaking, completeness means that all possible Cauchy sequences should find
their limits inside that space (we will specify these terms in the next section). In fact,

if $V$ is not complete, we add the limits to make it complete. One is naturally curious to know why we require completeness. In practice, we need to solve the above problem by using some approximation scheme, i.e., we would like to approximate $u$ by a sequence of approximate solutions $\{u_h\}$. Many times $\{u_h\}$ forms a Cauchy sequence (that is part of the convergence analysis). Unless the space is complete, the limit may not be inside that space. Therefore, a more desirable space is the completion of $V$. Subsequently, we shall see that the completion of $V$ is $H^1_0(\Omega)$. This is a Hilbert Sobolev space, stated as
\[
H^1_0(\Omega) = \{v \in L^2(\Omega) : \partial_x v, \partial_y v \in L^2(\Omega) \ \text{and} \ v = 0 \text{ on } \partial\Omega\}.
\]
Obviously, a square integrable function may not have partial derivatives in the usual sense. In order to attach a meaning to them, we shall generalize the concept of differentiation; this we shall discuss prior to the introduction of Sobolev spaces. One should note that the statement that the $H^1$-function $v$ satisfies $v = 0$ on $\partial\Omega$ has to be understood in a generalized sense.
If we accept (1.6) as a more general formulation, with equation (1.1)–(1.2) as its Euler form, then it is natural to ask: does every PDE have a variational form of the type (1.6)? The answer is negative. For instance, a flow problem with a transport or convective term,
\[
-\Delta u + b \cdot \nabla u = f,
\]
does not have an energy formulation like (1.6). So, the next question would be: under what conditions on the PDE does such a minimisation form exist? Moreover, is it possible to have a more general weak formulation which, in particular situations, coincides with the minimisation of the energy? This is what we shall explore in the course of these lectures.
If we formally multiply (1.1) by $v \in V$ (the space of admissible displacements) and apply the Gauss divergence theorem, the contribution of the boundary terms vanishes since $v = 0$ on $\partial\Omega$. We then obtain the weak formulation of (1.1)–(1.2): find $u \in V$ such that
\[
\int_\Omega \nabla u \cdot \nabla v \, dx\, dy = \int_\Omega f v \, dx\, dy, \quad \forall v \in V. \qquad (1.7)
\]
For a flow problem with a transport term $b \cdot \nabla u$ we have an extra term $\int_\Omega (b \cdot \nabla u)\, v \, dx\, dy$ added to the left-hand side of (1.7). This is a more general weak formulation, and we shall also examine the relation between (1.6) and (1.7). Given such a weak formulation, is it possible to establish its well-posedness¹? Subsequently, in the next sections, we shall settle this issue by using the Lax-Milgram theorem.
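To give a first concrete taste of how a weak formulation such as (1.7) is used computationally, the sketch below (an illustration with assumed data; it is not taken from these notes) solves the one-dimensional analogue $-u'' = f$ on $(0,1)$ with $u(0) = u(1) = 0$: find $u_h$ in the space of continuous piecewise linear functions vanishing at the endpoints such that $\int_0^1 u_h' v_h' \, dx = \int_0^1 f v_h \, dx$ for all such $v_h$.

```python
# Illustration only: piecewise linear Galerkin approximation of -u'' = f, u(0) = u(1) = 0.
# The right-hand side is chosen so that the exact solution is u(x) = sin(pi x).
import numpy as np

N = 50                                   # number of subintervals
h = 1.0 / N
nodes = np.linspace(0.0, 1.0, N + 1)
f = lambda x: np.pi**2 * np.sin(np.pi * x)

# stiffness matrix for the interior hat functions: tridiagonal (2, -1, -1) / h
A = (np.diag(2.0 * np.ones(N - 1))
     - np.diag(np.ones(N - 2), 1)
     - np.diag(np.ones(N - 2), -1)) / h

# load vector, approximating int f * phi_i by the simple quadrature h * f(x_i)
b = h * f(nodes[1:-1])

u = np.zeros(N + 1)                      # boundary values stay zero
u[1:-1] = np.linalg.solve(A, b)

print(np.max(np.abs(u - np.sin(np.pi * nodes))))   # small: the error is O(h^2)
```

The matrix assembled here is exactly the Galerkin matrix $\int_0^1 \varphi_j' \varphi_i' \, dx$ for the interior hat functions on a uniform mesh; this is precisely the kind of construction made systematic when finite element methods are treated later in these notes.
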
Very often problems like (1.1)–(1.2) do not admit exact or analytic solutions. For the problem (1.1)–(1.2), if the boundary is irregular, i.e., $\Omega$ is not, say, a square or a circle, it is difficult to obtain an analytic solution. Even when an analytic solution is known, it may contain complicated terms or may be an infinite series. In both cases, one resorts to numerical approximations. One of the objectives of numerical procedures for solving differential equations is to reduce the number of degrees of freedom (the solutions lie in infinite-dimensional spaces, like the Hilbert Sobolev spaces) to a finite number, so that the discrete problem can be solved using computers.
In these notes we will consider only deterministic linear PDEs in the real-valued case. The background material from linear functional analysis and the theory of function spaces discussed herein is intentionally sketchy, in order to enable understanding of some of the key concepts, such as stability and convergence of finite difference and finite element methods, with the bare minimum of analytical prerequisites.
¹The problem is said to be well-posed (in the sense of Hadamard) if it has a solution, the solution is unique, and it depends continuously on the data.

1.4 Abstract linear spaces


In this section we give a short survey of results, essentially without proofs, from mathematical, in particular functional, analysis which are needed in our treatment of partial differential equations. We follow to a large extent the book of Stig Larsson and Vidar Thomée, Partial Differential Equations with Numerical Methods, TAM 45, Springer, 2009.

1.4.1 Linear and bilinear forms


Let $V$ be a linear space (or vector space) with real scalars, i.e., a set such that if $u, v \in V$ and $\alpha, \beta \in \mathbb{R}$, then $\alpha u + \beta v \in V$. A linear functional (or linear form) $l$ on $V$ is a function $l : V \to \mathbb{R}$ such that
\[
l(\alpha u + \beta v) = \alpha\, l(u) + \beta\, l(v), \quad \forall u, v \in V, \ \alpha, \beta \in \mathbb{R}.
\]
A bilinear form $a(\cdot,\cdot)$ on $V$ is a function $a : V \times V \to \mathbb{R}$ which is linear in each argument separately, i.e., such that, for all $u, v, w \in V$ and $\alpha, \beta \in \mathbb{R}$,
\[
a(\alpha u + \beta v, w) = \alpha\, a(u, w) + \beta\, a(v, w),
\]
\[
a(w, \alpha u + \beta v) = \alpha\, a(w, u) + \beta\, a(w, v).
\]
The bilinear form $a(\cdot,\cdot)$ is said to be symmetric if
\[
a(v, w) = a(w, v), \quad \forall v, w \in V,
\]
and positive definite if
\[
a(v, v) > 0, \quad \forall v \in V, \ v \ne 0.
\]
A positive definite, symmetric bilinear form on $V$ is also called an inner product (or scalar product) on $V$. A linear space $V$ with an inner product is called an inner product space. If $V$ is an inner product space and $(\cdot,\cdot)$ is an inner product on $V$, then we define the corresponding norm by
\[
\|v\| = (v, v)^{1/2}, \quad v \in V. \qquad (1.8)
\]
When we want to emphasise that an inner product or a norm is associated with a specific space $V$, we write $(\cdot,\cdot)_V$ and $\|\cdot\|_V$.

Lemma 1.2 (The Cauchy-Schwarz inequality). Let $w, v \in V$; then
\[
|(w, v)| \le \|w\| \, \|v\|,
\]
with equality if and only if $w = \alpha v$ or $v = \alpha w$ for some $\alpha \in \mathbb{R}$.

Proof: Let $\lambda \in \mathbb{R}$; then
\[
0 \le \|w + \lambda v\|^2 = (w + \lambda v, w + \lambda v) = (w, w) + \lambda (w, v) + \lambda (v, w) + \lambda^2 (v, v) = \|w\|^2 + 2\lambda (w, v) + \lambda^2 \|v\|^2.
\]
The right-hand side is a quadratic polynomial in $\lambda$ with real coefficients, and it is non-negative for all $\lambda \in \mathbb{R}$; therefore its discriminant is non-positive, i.e.
\[
|2(w, v)|^2 - 4 \|w\|^2 \|v\|^2 \le 0,
\]
and hence the desired inequality follows.

Corollary 1.3 (The triangle inequality). Let $w, v \in V$; then
\[
\|w + v\| \le \|w\| + \|v\|.
\]

Proof: This is a straightforward consequence of the Cauchy-Schwarz inequality:
\[
\|w + v\|^2 = (w + v, w + v) = \|w\|^2 + 2(w, v) + \|v\|^2 \le \|w\|^2 + 2\|w\|\|v\| + \|v\|^2 = (\|w\| + \|v\|)^2.
\]
Taking the square root of both sides completes the proof.

Two elements $w, v \in V$ for which $(w, v) = 0$ are said to be orthogonal. In that case, we have the equality $\|w + v\|^2 = \|w\|^2 + \|v\|^2$, also called the Pythagorean theorem.
A sequence $\{v_i\}_{i=1}^\infty$ in $V$ is said to converge to $v \in V$, also written $v_i \to v$ as $i \to \infty$ or $v = \lim_{i\to\infty} v_i$, if
\[
\|v - v_i\|_V \to 0, \quad \text{as } i \to \infty.
\]
The sequence $\{v_i\}_{i=1}^\infty$ is called a Cauchy sequence in $V$ if
\[
\|v_i - v_j\|_V \to 0, \quad \text{as } i, j \to \infty.
\]
The inner product space $V$ is said to be complete if every Cauchy sequence in $V$ is convergent, i.e., if every Cauchy sequence $\{v_i\}_{i=1}^\infty$ has a limit $v = \lim_{i\to\infty} v_i \in V$. A complete inner product space is called a Hilbert space.
More generally, a norm on a linear space $V$ is a function $\|\cdot\| : V \to \mathbb{R}_+$ such that (we consider only the real case)
\[
\|v\| \ge 0, \quad \forall v \in V, \quad \text{(positivity)}
\]
\[
\|v\| = 0 \ \text{ if and only if } \ v = 0, \quad \text{(definiteness)}
\]
\[
\|\alpha v\| = |\alpha| \, \|v\|, \quad \forall \alpha \in \mathbb{R}, \ v \in V, \quad \text{(homogeneity)}
\]
\[
\|v + w\| \le \|v\| + \|w\|, \quad \forall v, w \in V. \quad \text{(triangle inequality)}
\]
A function $|\cdot|$ is called a seminorm on $V$ if these conditions hold with the exception of the second one, i.e., if it is only positive semidefinite, and thus can vanish for some $v \ne 0$. A linear space with a norm is called a normed linear space. As we have seen, an inner product space is a normed space, but not all normed linear spaces are inner product spaces. A complete normed space is called a Banach space.

Remark 1.4. Note that from the triangle inequality we have
\[
\|v\| = \|v - w + w\| \le \|v - w\| + \|w\|,
\]
and so $\|v\| - \|w\| \le \|v - w\|$. Analogously, by interchanging the roles of $v$ and $w$, we have $\|w\| - \|v\| \le \|w - v\|$. We may then conclude that every norm satisfies the second triangle inequality
\[
\bigl|\, \|v\| - \|w\| \,\bigr| \le \|v - w\|, \quad \forall v, w \in V.
\]

Remark 1.5. For two elements $v, w$ in a normed space $V$, the norm $\|v - w\|$ is called the distance between $v$ and $w$.

Two norms on a linear space are called equivalent if they have the same convergent
sequences.

Exercise 1.6. Prove that two norms $\|\cdot\|$ and $|||\cdot|||$ on a linear space $V$ are equivalent if and only if there exist positive constants $c$ and $C$ such that
\[
c\|v\| \le |||v||| \le C\|v\|, \quad \forall v \in V.
\]
In particular, the limits with respect to the two norms coincide.

Exercise 1.7. Prove that on a finite-dimensional linear space all norms are equivalent.

1.4.2 Bounded operators


Let $V$ and $W$ be two normed spaces. An operator $A : U \subset V \to W$ is said to be continuous at $u \in U$ if for every sequence $\{u_i\}_{i=1}^\infty \subset U$ with $\lim_{i\to\infty} u_i = u$ we have $\lim_{i\to\infty} A u_i = A u$. The operator is called continuous if it is continuous at every $u \in U$.
An equivalent definition is the following: an operator $A : U \subset V \to W$ is continuous at $u \in U$ if for every $\epsilon > 0$ there exists $\delta > 0$ such that $\|Au - Av\|_W \le \epsilon$ for all $v \in U$ with $\|u - v\|_V < \delta$. Note that, by the second triangle inequality of Remark 1.4, the norm itself is a continuous function/operator.

Exercise 1.8. Prove that a linear operator is continuous if it is continuous at one element.

Let $V$ and $W$ be two normed spaces. A linear operator $A : U \subset V \to W$ is said to be bounded if there is a positive constant $C$ such that
\[
\|Av\|_W \le C \|v\|_V, \quad \forall v \in V. \qquad (1.9)
\]
The norm of a bounded operator $A$ is
\[
\|A\| = \sup_{v \in V \setminus \{0\}} \frac{\|Av\|_W}{\|v\|_V}. \qquad (1.10)
\]
Thus
\[
\|Av\|_W \le \|A\| \, \|v\|_V, \quad \forall v \in V,
\]
and, by definition, $\|A\|$ is the smallest constant $C$ such that (1.9) holds.

Exercise 1.9. Prove that a linear operator is continuous if and only if it is bounded.

In the special case $W = \mathbb{R}$, the definition of a linear operator reduces to that of a linear functional. The set of all bounded linear functionals on $V$ is called the dual space of $V$, denoted by $V^*$. By (1.10) the norm of a linear functional $l \in V^*$ is
\[
\|l\|_{V^*} = \sup_{v \in V \setminus \{0\}} \frac{|l(v)|}{\|v\|_V}. \qquad (1.11)
\]
Note that $V^*$ is itself a linear space and, with the norm defined by (1.11), $V^*$ is a normed linear space. It can be proved that $V^*$ with the norm defined by (1.11) is complete, i.e., it is a Banach space.

1.4.3 Best approximation


Let $V_0 \subset V$ be a subset of a normed space $V$ and let $v \in V$. An element $v_0 \in V_0$ is called a best approximation of $v$ with respect to $V_0$ if
\[
\|v - v_0\| = \inf_{u \in V_0} \|v - u\|,
\]
i.e., if $v_0 \in V_0$ has the smallest distance from $v$.


It can be proved that if $V_0$ is a finite-dimensional subspace of a normed space $V$ then, for every element of $V$, there exists a best approximation with respect to $V_0$. The problem is that the proof is not constructive and there is no general algorithm to compute the best approximation in this case. The scenario is completely different in the context of Hilbert spaces.
Let $V$ be a Hilbert space and let $V_0 \subset V$ be a linear subspace. Such a subspace is said to be a closed subspace if it contains all limits of sequences in $V_0$, i.e., if $\{v_i\}_{i=1}^\infty \subset V_0$ and $v_i \to v$ as $i \to \infty$ imply $v \in V_0$. In that case, $V_0$ is itself a Hilbert space, with the same inner product as $V$.
Let $V_0$ be a closed subspace of $V$. Then any $v \in V$ may be written uniquely as $v = v_0 + w$, where $v_0 \in V_0$ and $w$ is orthogonal to all the elements of $V_0$, i.e., $w = v - v_0 \perp V_0$. The element $v_0$ may be characterised as the unique element of $V_0$ which is closest to $v$, i.e.,
\[
\|v - v_0\| = \min_{u \in V_0} \|v - u\|.
\]
In fact, the following theorem may be proved.

Theorem 1.10 (Projection theorem). For any closed subspace $V_0$ of a Hilbert space $V$, a necessary and sufficient condition for $v_0 \in V_0$ to be the best approximation of $v \in V$ with respect to $V_0$ is that
\[
(v - v_0, u) = 0, \quad \forall u \in V_0, \qquad (1.12)
\]
i.e., if and only if $v - v_0 \perp V_0$.

The projection theorem is a basic result in Hilbert space theory. One useful consequence of the projection theorem is that if the closed linear subspace $V_0$ is not equal to the whole of $V$, then it has a normal vector, i.e., there exists a nonzero vector $w \in V$ which is orthogonal to $V_0$.

Exercise 1.11. Prove that the operator $P_{V_0} : V \to V_0$ mapping $v \in V$ onto its best approximation, i.e., such that $P_{V_0} v = v_0$, is a bounded linear operator with the properties
\[
P_{V_0}^2 = P_{V_0} \qquad \text{and} \qquad \|P_{V_0}\| = 1.
\]
It is called the orthogonal projection from $V$ onto $V_0$, and the best approximation $v_0$ is called the orthogonal projection of $v$ onto $V_0$.

The following result follows immediately from the projection theorem.



Corollary 1.12. Let $V_0$ be a finite-dimensional linear subspace of the Hilbert space $V$ with basis $\{\varphi_1, \dots, \varphi_N\}$. The linear combination
\[
v_0 = \sum_{i=1}^N \alpha_i \varphi_i
\]
is the best approximation to $v \in V$ with respect to $V_0$ if and only if the coefficients $\alpha_1, \dots, \alpha_N$ satisfy the normal equations
\[
\sum_{i=1}^N \alpha_i (\varphi_i, \varphi_j) = (v, \varphi_j), \quad j = 1, \dots, N. \qquad (1.13)
\]

The normal equations for the best approximation in Hilbert spaces provide an example of a system of linear equations. Its solution becomes trivial if the basis $\{\varphi_1, \dots, \varphi_N\}$ is orthonormal. In fact, from the previous corollary, if $\{\varphi_1, \dots, \varphi_N\}$ is orthonormal, the orthogonal projection is given by
\[
P_{V_0} v = \sum_{i=1}^N (v, \varphi_i)\, \varphi_i, \quad v \in V,
\]
i.e., the coordinates of the orthogonal projection in the orthonormal basis $\{\varphi_1, \dots, \varphi_N\}$ are $((v, \varphi_1), \dots, (v, \varphi_N))$.
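As a small illustration of the normal equations (1.13) (a sketch with data chosen here, not taken from the notes), take $V = L^2(0, 1)$ with the inner product $(u, v) = \int_0^1 u(x) v(x) \, dx$ and approximate $v(x) = e^x$ from the subspace spanned by the non-orthonormal basis $\{1, x, x^2\}$; the Gram matrix $(\varphi_i, \varphi_j)$ and the right-hand side $(v, \varphi_j)$ are assembled numerically and (1.13) is solved:

```python
# Illustration only: best L^2(0,1) approximation of exp(x) by a quadratic polynomial,
# computed by assembling and solving the normal equations (1.13).
import numpy as np
from scipy.integrate import quad

basis = [lambda x: 1.0, lambda x: x, lambda x: x**2]   # non-orthonormal basis of V_0
v = np.exp                                             # element of V to be approximated

N = len(basis)
G = np.array([[quad(lambda x, i=i, j=j: basis[i](x) * basis[j](x), 0, 1)[0]
               for j in range(N)] for i in range(N)])          # Gram matrix (phi_i, phi_j)
rhs = np.array([quad(lambda x, j=j: v(x) * basis[j](x), 0, 1)[0] for j in range(N)])

alpha = np.linalg.solve(G, rhs)   # coefficients of the best approximation
print(alpha)                      # approximately [1.013, 0.851, 0.839]
```

If the basis were orthonormal, the Gram matrix would be the identity and the coefficients would simply be $\alpha_j = (v, \varphi_j)$, as noted above.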

1.5 Elements of function spaces


As will become apparent in subsequent chapters, the accuracy of finite element approx-
imations to partial differential equations very much depends on the smoothness of the
analytical solution to the equation under consideration, and this in turn hinges on the
smoothness of the data. Precise assumptions about the regularity of the solution and the
data can be conveniently formulated by considering classes of functions with specific differ-
entiability and integrability properties, called function spaces. In this section we present
a brief overview of basic definitions and simple results from the theory of function spaces.
For future reference, we remark here that all functions that appear in these notes will be
assumed to be real-valued.

1.5.1 Spaces of continuous functions


In this section, we describe some simple function spaces which consist of continuously differentiable functions.
Let $\Omega$ be an open set in $\mathbb{R}^n$ and let $k \in \mathbb{N}$. We denote by $C^k(\Omega)$ the set of all continuous real-valued functions defined on $\Omega$ such that $\partial^\alpha u$ is continuous on $\Omega$ for all $\alpha = (\alpha_1, \dots, \alpha_n)$ with $|\alpha| \le k$. Assuming that $\Omega$ is a bounded open set, $C^k(\bar\Omega)$ will denote the set of all $u$ in $C^k(\Omega)$ such that $\partial^\alpha u$ can be extended from $\Omega$ to a continuous function on $\bar\Omega$, the closure of the set $\Omega$, for all $\alpha = (\alpha_1, \dots, \alpha_n)$, $|\alpha| \le k$. $C^k(\bar\Omega)$ can be equipped with the norm
\[
\|u\|_{C^k(\bar\Omega)} = \sum_{|\alpha| \le k} \sup_{x \in \Omega} |\partial^\alpha u(x)|.
\]

In particular, when $k = 0$ we shall write $C(\bar\Omega)$ instead of $C^0(\bar\Omega)$ to denote the set of all continuous functions defined on $\bar\Omega$; in this case,
\[
\|u\|_{C(\bar\Omega)} = \sup_{x \in \Omega} |u(x)| = \max_{x \in \bar\Omega} |u(x)|.
\]
Similarly, if $k = 1$,
\[
\|u\|_{C^1(\bar\Omega)} = \sum_{|\alpha| \le 1} \sup_{x \in \Omega} |\partial^\alpha u(x)| = \sup_{x \in \Omega} |u(x)| + \sum_{j=1}^n \sup_{x \in \Omega} |\partial_{x_j} u(x)|.
\]

Example 1.13. Consider the open interval $\Omega = (0, 1) \subset \mathbb{R}$. The function $u(x) = 1/x$ belongs to $C^k(\Omega)$ for each $k \ge 0$. As $\bar\Omega = [0, 1]$ and $\lim_{x \to 0} u(x) = \infty$, it is clear that $u$ is not continuous on $\bar\Omega$; the same is true of its derivatives. Therefore $u \notin C^k(\bar\Omega)$ for any $k \ge 0$.

The support of a continuous function $u$ defined on an open set $\Omega \subset \mathbb{R}^n$ is defined as the closure in $\Omega$ of the set $\{x \in \Omega : u(x) \ne 0\}$. We shall write $\operatorname{supp} u$ for the support of $u$. Thus, $\operatorname{supp} u$ is the smallest closed subset of $\Omega$ such that $u = 0$ in $\Omega \setminus \operatorname{supp} u$.

Example 1.14. Let $w$ be the function defined on $\mathbb{R}^n$ by
\[
w(x) = \begin{cases} e^{-\frac{1}{1 - |x|^2}}, & |x| < 1, \\ 0, & \text{otherwise}; \end{cases}
\]
where $|x| = (x_1^2 + \cdots + x_n^2)^{1/2}$. Clearly, the support of $w$ is the closed unit ball $\{x \in \mathbb{R}^n : |x| \le 1\}$.

We denote by $C_0^k(\Omega)$ the set of all $u$ contained in $C^k(\Omega)$ whose support is a bounded subset of $\Omega$. Let
\[
C_0^\infty(\Omega) = \bigcap_{k \ge 0} C_0^k(\Omega).
\]

Example 1.15. The function $w$ defined in the previous example belongs to the space $C_0^\infty(\mathbb{R}^n)$. In fact, it is enough to check the continuity and differentiability properties at the points where $|x| = 1$. For $n = 1$, apply L'Hôpital's rule to conclude the result; for $n > 1$, since the function is radial, the result can be proved in a similar way.
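A quick numerical look at the one-dimensional case (a small sketch added for illustration) shows how fast $w$ decays as $|x| \to 1$, which is the reason why the function and all its derivatives extend continuously by zero:

```python
# Illustration only: evaluate the bump function w(x) = exp(-1/(1 - x^2)) near |x| = 1 (n = 1).
import numpy as np

def bump(x):
    x = np.asarray(x, dtype=float)
    out = np.zeros_like(x)               # zero outside the open unit interval
    inside = np.abs(x) < 1
    out[inside] = np.exp(-1.0 / (1.0 - x[inside]**2))
    return out

xs = np.array([0.0, 0.9, 0.99, 0.999, 1.0, 1.5])
print(bump(xs))   # values decay to 0 extremely fast as |x| -> 1, and vanish for |x| >= 1
```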

1.5.2 Spaces of integrable functions


Next we consider a class of spaces that consist of (Lebesgue-)integrable functions. Let $p$ be a real number, $p \ge 1$; we denote by $L^p(\Omega)$ the set of all real-valued functions defined on an open subset $\Omega$ of $\mathbb{R}^n$ such that
\[
\int_\Omega |u(x)|^p \, dx < \infty.
\]

Any two functions which are equal almost everywhere (i.e. equal, except on a set of measure zero) on $\Omega$ are identified with each other. Thus, strictly speaking, $L^p(\Omega)$ consists of equivalence classes of functions; still, we shall not insist on this technicality. $L^p(\Omega)$ is equipped with the norm
\[
\|u\|_{L^p(\Omega)} = \left( \int_\Omega |u(x)|^p \, dx \right)^{1/p}.
\]
We shall also consider the space $L^\infty(\Omega)$ consisting of functions $u$ defined on $\Omega$ such that $|u|$ has finite essential supremum on $\Omega$ (namely, there exists a positive constant $M$ such that $|u(x)| \le M$ for almost every² $x$ in $\Omega$; the smallest such number $M$ is called the essential supremum of $|u|$, and we write $M = \operatorname{ess\,sup}_{x \in \Omega} |u(x)|$). $L^\infty(\Omega)$ is equipped with the norm
\[
\|u\|_{L^\infty(\Omega)} = \operatorname{ess\,sup}_{x \in \Omega} |u(x)|.
\]

A particularly important case corresponds to taking $p = 2$; then
\[
\|u\|_{L^2(\Omega)} = \left( \int_\Omega |u(x)|^2 \, dx \right)^{1/2}.
\]
The space $L^2(\Omega)$ can be equipped with the inner product
\[
(u, v) = \int_\Omega u(x) v(x) \, dx.
\]
Clearly $\|u\|_{L^2(\Omega)} = (u, u)^{1/2}$.

Remark 1.16. The space $L^p(\Omega)$ with $p \in [1, \infty]$ is a Banach space. In particular, $L^2(\Omega)$ is a Hilbert space: it has an inner product $(\cdot,\cdot)$ and, when equipped with the associated norm $\|\cdot\|_{L^2(\Omega)}$, defined by $\|u\|_{L^2(\Omega)} = (u, u)^{1/2}$, it is a Banach space.

To conclude this section, we note the following generalisation of the Cauchy-Schwarz inequality, known as Hölder's inequality, valid for any two functions $u \in L^p(\Omega)$ and $v \in L^q(\Omega)$ with $1/p + 1/q = 1$:
\[
\int_\Omega |u(x) v(x)| \, dx \le \|u\|_{L^p(\Omega)} \, \|v\|_{L^q(\Omega)}.
\]

1.5.3 Sobolev spaces


In this section we introduce a class of spaces, called Sobolev spaces (after the Russian mathematician S.L. Sobolev), which play an important role in modern differential equation theory. Before we give the precise definition of a Sobolev space, we introduce the concept of weak derivative.
Suppose that $u$ is a smooth function, say $u \in C^k(\Omega)$, with $\Omega$ an open subset of $\mathbb{R}^n$, and let $v \in C_0^\infty(\Omega)$; then the following integration-by-parts formula holds:
\[
\int_\Omega \partial^\alpha u(x)\, v(x) \, dx = (-1)^{|\alpha|} \int_\Omega u(x)\, \partial^\alpha v(x) \, dx, \quad |\alpha| \le k, \ \forall v \in C_0^\infty(\Omega).
\]

Note that all terms involving integrals over the boundary of $\Omega$, which arise in the course of integrating by parts, have disappeared because $v$ and all of its derivatives are identically zero on the boundary of $\Omega$. This identity represents the starting point for defining the concept of weak derivative.

²We shall say that a property $P(x)$ is true for almost every $x$ in $\Omega$ if $P(x)$ is true for all $x \in \Omega \setminus \omega$, where $\omega$ is a subset of $\Omega$ with zero Lebesgue measure.
Now suppose that $u$ is a locally integrable function defined on $\Omega$ (i.e. $u \in L^1(\omega)$ for each bounded open set $\omega$ with $\bar\omega \subset \Omega$). Suppose also that there exists a function $w_\alpha$, locally integrable on $\Omega$, such that
\[
\int_\Omega w_\alpha(x)\, v(x) \, dx = (-1)^{|\alpha|} \int_\Omega u(x)\, \partial^\alpha v(x) \, dx, \quad \forall v \in C_0^\infty(\Omega);
\]
then we say that $w_\alpha$ is a weak derivative of the function $u$ of order $|\alpha| = \alpha_1 + \cdots + \alpha_n$, and we write $w_\alpha = \partial^\alpha u$. In order to see that this definition is correct, it has to be shown that if a locally integrable function has a weak derivative then it must be unique; we remark that this is a straightforward consequence of DuBois Reymond's lemma³. Clearly, if $u$ is a sufficiently smooth function, say $u \in C^k(\Omega)$, then its weak derivative $\partial^\alpha u$ of order $|\alpha| \le k$ coincides with the corresponding partial derivative in the classical pointwise sense.

Example 1.17. Let $\Omega = \mathbb{R}$, and suppose that we wish to determine the weak first derivative of the function $u(x) = (1 - |x|)_+$ defined on $\Omega$. Clearly $u$ is not differentiable at the points $0$ and $\pm 1$. However, because $u$ is locally integrable on $\Omega$, it may, nevertheless, have a weak derivative. Indeed, for any $v \in C_0^\infty(\Omega)$,
\[
\int_{-\infty}^{+\infty} u(x) v'(x) \, dx = \int_{-\infty}^{+\infty} (1 - |x|)_+ v'(x) \, dx = \int_{-1}^{1} (1 - |x|) v'(x) \, dx
\]
\[
= \int_{-1}^{0} (1 + x) v'(x) \, dx + \int_{0}^{1} (1 - x) v'(x) \, dx
\]
\[
= -\int_{-1}^{0} v(x) \, dx + \bigl[(1 + x) v(x)\bigr]_{-1}^{0} + \int_{0}^{1} v(x) \, dx + \bigl[(1 - x) v(x)\bigr]_{0}^{1}
\]
\[
= -\int_{-\infty}^{+\infty} w(x) v(x) \, dx,
\]
where
\[
w(x) = \begin{cases} 0, & x < -1, \\ 1, & x \in (-1, 0), \\ -1, & x \in (0, 1), \\ 0, & x > 1. \end{cases}
\]
Thus, the piecewise constant function $w$ is the first (weak) derivative of the continuous piecewise linear function $u$, i.e. $w = u'$.
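The defining identity of the weak derivative can also be checked numerically (a sketch added for illustration; the smooth test function is an arbitrary choice, and since $u$ and $w$ vanish outside $[-1, 1]$ it suffices to integrate there):

```python
# Illustration only: check that int u v' dx = - int w v dx for u(x) = (1 - |x|)_+
# and its weak derivative w from Example 1.17, using an arbitrary smooth test function v.
import numpy as np
from scipy.integrate import quad

u = lambda x: max(1.0 - abs(x), 0.0)
w = lambda x: 0.0 if abs(x) > 1 else (1.0 if x < 0 else -1.0)

v  = lambda x: np.exp(-x**2) * np.cos(3 * x)                                   # smooth test function
dv = lambda x: np.exp(-x**2) * (-2 * x * np.cos(3 * x) - 3 * np.sin(3 * x))    # its derivative

# split the integrals at 0, where u has a kink and w jumps
lhs = quad(lambda x: u(x) * dv(x), -1, 0)[0] + quad(lambda x: u(x) * dv(x), 0, 1)[0]
rhs = -(quad(lambda x: w(x) * v(x), -1, 0)[0] + quad(lambda x: w(x) * v(x), 0, 1)[0])

print(lhs, rhs)   # the two values agree up to quadrature error
```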

Now we are ready to give a precise definition of a Sobolev space. Let $k$ be a non-negative integer and suppose that $p \in [1, \infty]$. We define (with $\partial^\alpha$ denoting a weak derivative of order $|\alpha|$)
\[
W_p^k(\Omega) = \{u \in L^p(\Omega) : \partial^\alpha u \in L^p(\Omega), \ |\alpha| \le k\}.
\]

³DuBois Reymond's lemma: Suppose that $w$ is a locally integrable function defined on an open set $\Omega \subset \mathbb{R}^n$. If
\[
\int_\Omega w(x) v(x) \, dx = 0, \quad \forall v \in C_0^\infty(\Omega),
\]
then $w(x) = 0$ for almost every $x \in \Omega$.

The space $W_p^k(\Omega)$ is called a Sobolev space of order $k$; it is equipped with the (Sobolev) norm
\[
\|u\|_{W_p^k(\Omega)} = \Bigl( \sum_{|\alpha| \le k} \|\partial^\alpha u\|_{L^p(\Omega)}^p \Bigr)^{1/p}, \quad \text{when } 1 \le p < \infty,
\]
and
\[
\|u\|_{W_\infty^k(\Omega)} = \sum_{|\alpha| \le k} \|\partial^\alpha u\|_{L^\infty(\Omega)}, \quad \text{when } p = \infty.
\]
Letting
\[
|u|_{W_p^k(\Omega)} = \Bigl( \sum_{|\alpha| = k} \|\partial^\alpha u\|_{L^p(\Omega)}^p \Bigr)^{1/p}, \quad \text{when } 1 \le p < \infty,
\]
we can write
\[
\|u\|_{W_p^k(\Omega)} = \Bigl( \sum_{j=0}^k |u|_{W_p^j(\Omega)}^p \Bigr)^{1/p}, \quad \text{when } 1 \le p < \infty.
\]
Similarly, letting
\[
|u|_{W_\infty^k(\Omega)} = \sum_{|\alpha| = k} \|\partial^\alpha u\|_{L^\infty(\Omega)},
\]
we have that
\[
\|u\|_{W_\infty^k(\Omega)} = \sum_{j=0}^k |u|_{W_\infty^j(\Omega)}.
\]
When $k \ge 1$, $|\cdot|_{W_p^k(\Omega)}$ is called the Sobolev semi-norm⁴ on $W_p^k(\Omega)$.


An important special case corresponds to taking p = 2; the space W2k (⌦) is then a
Hilbert space with the inner product
X
(u, v)W k (⌦) = (@ ↵ u, @ ↵ v).
2
|↵|k

For this reason, we shall usually write H k (⌦) instead of W2k (⌦).
Throughout these notes we shall frequently refer to the Hilbert Sobolev spaces H 1 (⌦)
and H 2 (⌦). Our definitions of Wpk (⌦) and its norm and semi-norm, for p = 2, k = 1, give:

H 1 (⌦) = u 2 L2 (⌦) : @xj u 2 L2 (⌦), j = 1, ..., n ,


0 11/2
n
X
kukH 1 (⌦) = @kukL2 (⌦) + k@xj ukL2 (⌦) A ,
j=1
0 11/2
Xn
|u|H 1 (⌦) =@ k@xj ukL 2 (⌦)
A .
j=1

⁴When $k \ge 1$, $|\cdot|_{W_p^k(\Omega)}$ is only a semi-norm rather than a norm because if $|u|_{W_p^k(\Omega)} = 0$ for $u \in W_p^k(\Omega)$ it does not necessarily follow that $u(x) = 0$ for almost every $x \in \Omega$ (all that is known is that $\partial^\alpha u(x) = 0$ for almost every $x \in \Omega$, $|\alpha| = k$); so $|\cdot|_{W_p^k(\Omega)}$ does not satisfy the definiteness axiom of a norm.

Similarly, for $p = 2$ and $k = 2$,
\[
H^2(\Omega) = \bigl\{ u \in L^2(\Omega) : \partial_{x_j} u \in L^2(\Omega), \ j = 1, \dots, n, \ \partial^2_{x_i x_j} u \in L^2(\Omega), \ i, j = 1, \dots, n \bigr\},
\]
\[
\|u\|_{H^2(\Omega)} = \Bigl( \|u\|_{L^2(\Omega)}^2 + \sum_{j=1}^n \|\partial_{x_j} u\|_{L^2(\Omega)}^2 + \sum_{i,j=1}^n \|\partial^2_{x_i x_j} u\|_{L^2(\Omega)}^2 \Bigr)^{1/2},
\]
\[
|u|_{H^2(\Omega)} = \Bigl( \sum_{i,j=1}^n \|\partial^2_{x_i x_j} u\|_{L^2(\Omega)}^2 \Bigr)^{1/2}.
\]

Exercise 1.18. Given that $L^2(\Omega)$ is complete, prove that $H^1(\Omega)$ is complete.
Hint: Assume that $\|v_i - v_j\|_{H^1(\Omega)} \to 0$ as $i, j \to \infty$. Show that there are $v, w_k$ such that $\|v_j - v\|_{L^2(\Omega)} \to 0$, $\|\partial_{x_k} v_j - w_k\|_{L^2(\Omega)} \to 0$, and that $w_k = \partial_{x_k} v$ in the sense of weak derivatives.

It may be shown that $C^m(\bar\Omega)$ is dense in $H^k(\Omega)$ for any $m \ge k$, if $\partial\Omega$ is sufficiently smooth, say, if $\Omega$ is a polygonal domain in $\mathbb{R}^2$ or a polyhedron in $\mathbb{R}^3$. This is useful because it allows us to obtain certain results for $H^k(\Omega)$ by carrying out the proof for functions in $C^k(\bar\Omega)$, which may be technically easier, and then extending the result to all $v \in H^k(\Omega)$ by density.

Example 1.19. Contrary to what happens in dimension 1, in dimension 2 or higher there are functions in $H^1(\Omega)$ that are not continuous. For example, the function $u(x) = |\ln |x||^\alpha$, where $|x| = (x_1^2 + x_2^2)^{1/2}$, belongs to $L^2(B)$ and, if $\alpha < \frac{1}{2}$, belongs to $H^1(B)$, with $B = \{x \in \mathbb{R}^2 : |x| \le e^{-1}\}$. Yet it is quite clear that $u$ is not continuous at the point $x = 0$. Since the function tends to infinity at this point, there is no continuous function that can be a representative of its equivalence class.
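The claim of Example 1.19 can be made quantitative by the radial substitution $s = -\ln |x|$ (a computational sketch added here; the values of $\alpha$ and the cutoffs are arbitrary choices): the squared $H^1(B)$ seminorm of $u$ reduces to $2\pi\alpha^2 \int_1^\infty s^{2\alpha - 2}\, ds$, which is finite precisely when $\alpha < 1/2$.

```python
# Illustration only: truncated radial integral int_1^{s_max} s^(2*alpha - 2) ds, which
# (up to the factor 2*pi*alpha^2) gives the squared H^1 seminorm of |ln|x||^alpha on B.
import numpy as np

def grad_integral(alpha, s_max):
    p = 2 * alpha - 1
    return (s_max**p - 1.0) / p if p != 0 else np.log(s_max)

for s_max in [1e2, 1e4, 1e6]:
    print(grad_integral(0.4, s_max), grad_integral(0.6, s_max))
# the first column approaches 5 (alpha = 0.4 < 1/2: u is in H^1(B)),
# while the second column grows without bound (alpha = 0.6 >= 1/2: u is not in H^1(B))
```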

Finally, we define the special Sobolev space $H^1_0(\Omega)$ as the closure of $C_0^\infty(\Omega)$ in the norm $\|\cdot\|_{H^1(\Omega)}$; in other words, $H^1_0(\Omega)$ is the set of all $u \in H^1(\Omega)$ such that $u$ is the limit in $H^1(\Omega)$ of a sequence $\{u_m\}_{m=1}^\infty$ with $u_m \in C_0^\infty(\Omega)$. It can be shown (assuming that $\partial\Omega$ is sufficiently smooth) that
\[
H^1_0(\Omega) = \bigl\{ u \in H^1(\Omega) : u = 0 \text{ on } \partial\Omega \bigr\};
\]
i.e. $H^1_0(\Omega)$ is, in fact, the set of all functions $u$ in $H^1(\Omega)$ such that $u = 0$ almost everywhere on $\partial\Omega$, the boundary of the set $\Omega$. We shall use this space when considering a partial differential equation that is coupled with a homogeneous (Dirichlet) boundary condition: $u = 0$ on $\partial\Omega$. We note here that $H^1_0(\Omega)$ is also a Hilbert space, with the same norm and inner product as $H^1(\Omega)$.

Remark 1.20 (The mathematics behind). Even a half-trained mathematician should be wondering what we mean by the partial derivatives in the definition of $H^1(\Omega)$, since one cannot think of taking the gradient of an arbitrary function of $L^2(\Omega)$, or at least of taking the gradient and finding something reasonable. What we mean by restriction to the boundary in the definition of $H^1_0(\Omega)$ is not clear either, since elements of $L^2(\Omega)$ are not really functions, but classes of functions, for which values at particular points or even on lines are not relevant. To make this completely precise there are several ways:
• Define a weak derivative for elements of $L^2(\Omega)$ and what we understand by saying that that derivative is again in $L^2(\Omega)$. Then move on to give a meaning to the restriction of a function in $H^1(\Omega)$ to (part of) its boundary.
• Go deeper and take time to browse a book on distribution theory and Sobolev spaces. It takes a while, but you end up with a pretty good intuition of what this is all about.
• Take a shortcut. First consider the space of functions $C^1(\bar\Omega)$ and then close it with the norm $\|\cdot\|_{H^1(\Omega)}$. To do that you have to know what closing or completing a space is. Then you have to prove that restricting to the boundary still makes sense after this completion procedure.
My recommendation at this point is simply to go on. You can later take some time with a good, simple book on elliptic PDEs and you will see that it is not that complicated. Nevertheless, if you keep on doing research related to finite elements, you should really know something more about this. In due time you will have to pick up one of the dozens of books on PDEs and read the details. But this is only an opinion.

We conclude the section with the following useful result.

Lemma 1.21 (Poincaré-Friedrichs inequality). Suppose that $\Omega$ is a bounded open set in $\mathbb{R}^n$ (with a sufficiently smooth boundary $\partial\Omega$) and let $u \in H^1_0(\Omega)$; then there exists a constant $c_\star(\Omega)$, independent of $u$, such that
\[
\int_\Omega |u(x)|^2 \, dx \le c_\star \sum_{i=1}^n \int_\Omega |\partial_{x_i} u(x)|^2 \, dx. \qquad (1.14)
\]

Proof: As any function $u \in H^1_0(\Omega)$ is the limit in $H^1(\Omega)$ of a sequence $\{u_m\}_{m=1}^\infty \subset C_0^\infty(\Omega)$, it is sufficient to prove this inequality for $u \in C_0^\infty(\Omega)$.
In fact, to simplify matters, we shall restrict ourselves to the special case of a rectangular domain $\Omega = (a, b) \times (c, d)$ in $\mathbb{R}^2$. The proof for general $\Omega$ is analogous. Evidently,
\[
u(x, y) = u(a, y) + \int_a^x \partial_x u(\xi, y) \, d\xi = \int_a^x \partial_x u(\xi, y) \, d\xi, \quad c < y < d.
\]
Thence, by the Cauchy-Schwarz inequality,
\[
\int_\Omega |u(x, y)|^2 \, dx\, dy = \int_a^b \int_c^d \left| \int_a^x \partial_x u(\xi, y) \, d\xi \right|^2 dy\, dx
\le \int_a^b \int_c^d (x - a) \left( \int_a^x |\partial_x u(\xi, y)|^2 \, d\xi \right) dy\, dx
\]
\[
\le \left( \int_a^b (x - a) \, dx \right) \left( \int_c^d \int_a^b |\partial_x u(\xi, y)|^2 \, d\xi\, dy \right)
= \frac{1}{2} (b - a)^2 \int_\Omega |\partial_x u(x, y)|^2 \, dx\, dy.
\]
Analogously,
\[
\int_\Omega |u(x, y)|^2 \, dx\, dy \le \frac{1}{2} (d - c)^2 \int_\Omega |\partial_y u(x, y)|^2 \, dx\, dy.
\]
Dividing each of the two inequalities by the constant on its right-hand side and adding, we obtain
\[
\int_\Omega |u(x, y)|^2 \, dx\, dy \le c_\star \int_\Omega \bigl( |\partial_x u(x, y)|^2 + |\partial_y u(x, y)|^2 \bigr) \, dx\, dy,
\]
where $c_\star = \left( \dfrac{2}{(b - a)^2} + \dfrac{2}{(d - c)^2} \right)^{-1}$.
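For a concrete feel for (1.14), the following symbolic check (added here for illustration; the function and the unit square are arbitrary choices) compares both sides for $u(x, y) = \sin(\pi x)\sin(\pi y)$, which vanishes on the boundary of $\Omega = (0, 1)^2$:

```python
# Illustration only: check the Poincare-Friedrichs inequality (1.14) for one sample function.
import sympy as sp

x, y = sp.symbols('x y')
u = sp.sin(sp.pi * x) * sp.sin(sp.pi * y)       # vanishes on the boundary of (0,1)^2

l2_sq   = sp.integrate(u**2, (x, 0, 1), (y, 0, 1))
grad_sq = sp.integrate(sp.diff(u, x)**2 + sp.diff(u, y)**2, (x, 0, 1), (y, 0, 1))
c_star  = sp.Rational(1, 4)                     # constant from the proof with a = c = 0, b = d = 1

print(l2_sq)                        # 1/4
print(grad_sq)                      # pi**2/2
print((c_star * grad_sq).evalf())   # about 1.2337, which is >= 1/4, so (1.14) holds here
```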

The Poincaré-Friedrichs inequality may be written in the form
\[
\|v\|_{L^2(\Omega)} \le \sqrt{c_\star}\, \|\nabla v\|_{L^2(\Omega)}, \quad \forall v \in H^1_0(\Omega), \qquad (1.15)
\]
and, according to this result, the equivalence of the norms $|\cdot|_{H^1(\Omega)}$ and $\|\cdot\|_{H^1(\Omega)}$ on $H^1_0(\Omega)$ follows from (recall that $|v|_{H^1(\Omega)} = \|\nabla v\|_{L^2(\Omega)}$)
\[
\|v\|_{L^2(\Omega)}^2 \le \|v\|_{H^1(\Omega)}^2 = \|v\|_{L^2(\Omega)}^2 + \|\nabla v\|_{L^2(\Omega)}^2 \le (1 + c_\star) \|\nabla v\|_{L^2(\Omega)}^2.
\]

Remark 1.22. Note that the extension of the proof of the Poincaré-Friedrichs inequality to $v \in H^1_0(\Omega)$ may be done using density. In fact, by assumption there exists a sequence of functions $v_m \in C_0^\infty(\Omega)$ such that $\|v_m - v\|_{H^1(\Omega)} \to 0$. Then, since (1.14) holds for all $v_m \in C_0^\infty(\Omega)$, it follows that for $v \in H^1_0(\Omega)$
\[
\|v\|_{L^2(\Omega)} = \lim_{m \to \infty} \|v_m\|_{L^2(\Omega)} \le \sqrt{c_\star}\, \lim_{m \to \infty} \|\nabla v_m\|_{L^2(\Omega)} = \sqrt{c_\star}\, \|\nabla v\|_{L^2(\Omega)}.
\]
