Академический Документы
Профессиональный Документы
Культура Документы
of Real Analysis
Robert G. Bartle
Department of Mothematics, University of Illinois
vii
vm PREFACE
ROBERT G. BARTLE
Urbana, Illinois
May 5, 1964
-----------------------------------------------------
Chapter Summaries
xi
xu CHAPTER SUMMARIES
III. Convergence 98
V. Differentiation 206
19. The Derivative in R, 206
Definition, Interior Maximum Theorem, Rolle's Theorem,
Mean Value Theorem, Taylor's Theorem, applications, inter-
change of limit and derivative
20. The Derivative in Rp, 224
Directional derivative, partial derivatives, the derivative, the
Chain Rule, the Mean Value Theorem, interchange of the order
of differentiation, Taylor's Theorem
21. Mapping Theorems and Extremum Problems, 249
Class G', Approximation Lemma, Locally One-one Mapping
Theorem, Weak Inversion Theorem, Local Solvability Theorem,
Open Mapping Theorem, Inversion Theorem, Implicit Function
Theorem, extremum problems, location of extrema, second
derivative test, extremum problems with constraints, Lagrange's
Method
.'
The Elements of Real Analysis
Introduction: A Glimpse at Set Theory
1
INTRODUCTION: A GLIMPSE AT SET THEORY
(4)
Figure 1.1
SEC. 1 THE ALGEBRA OF SETS
Set Operations
A UB IIIIIIlIIlll
We have tacitly assumed that the intersection and the union of two
sets is again a set. Among other things this requires that there must
exist a set which has no elements at all (for if A and B have no common
elements, their intersection has no elements).
1.4 DEFINITION. The set which has no elements is called the empty
or the void set and will be denoted by the symbol 0. If A and B are sets
with no common elements (that is, if A (\, B = 0), then we say that A
and B are disjoint or that they are non-intersecting.
The next result gives some of the algebraic properties of the operations
on sets that we have just defined. Since the proofs of these assertions
are routine, we shall leave most of them to the reader as exercises.
6 INTRODUCTION: A GLIMPSE AT SET THEORY
Figure 1.3
Cartesian Product
We now define the Cartesiant product of two sets.
1.9 DEFINITION. If A and B are two non-void sets, then the Car-
tesian product A X B of A and B is the set of all ordered pairs (0" b)
with a E A and b E B. (See Figure 1.5.)
(The definition just given is somewhat informal as we have not defined
what is meant by an "ordered pair." We shall not examine the matter
except to mention that the ordered pair (a, b) could be defined to be
AxB
B
b - - - -1 (a, b)
I
I
I
:
a
A
Figure 1.5
the set whose sole elements are {a l, {a, b}. It can then be shown that
the ordered pairs (a, b) and (a', b') are equal if and only if a = a' and
b = b'. This is the fundamental property of ordered pairs.)
Thus if A = {I, 2, 31 and B = {4, 5}, then the set A X B is the set
whose elements are the ordered pairs
(1,4), (1,5), (2,4), (2,5), (3,4), (3, 5).
We shall often visualize the set A X B as the set of six points in the
plane with the coordinates which we have just listed.
We shall often draw a diagram> such as Figure 1.5, to indicate the
Cartesian product of two sets A, B. However, it should be realized that
this diagram may be somewhat of a simplification. For example, if
A = {x E R: 1 < x < 2} and B = Ix E R: 0 < x < 1 or 2 < x < 3},
then instead of a rectangle, we should have a drawing like Figure 1.6.
2
AxB
1 2
Exercises
1.A. Establish statement (d) of Theorem 1.5.
l.B. Prove that A C B if and only if A (\ B = A.
l.e. Show that the set D of elements which belong either to A or B but not
to both is given by
D = (A\B) V (B\A).
This set D is often called the symmetric difference of A and B.
l.D. Show that the symmetric difference D of A and B is also given by
D = (A V B)\(A (\ B).
l.E. If B ; A, show that B = A\(A\B).
SEC. 2 FUNCTIONS II
Note that if E\A; is denoted by e(A i ), these relations take the form
e (n i=1
Ai) = Ue(A i ),
j=1
e (U
... 1
Ai) = rl e(A;).
i=1
l.J. Let J be any set and, for each j E J, let A i be contained in X. Show that
Section 2 Functions
do not give rise to real numbers for all real values of x was, of course,
well-known but was not regarded as sufficient grounds to require an
extension of the notion of function. Probably one could arouse contro-
versy among those mathematicians as to whether the absolute value
h(x) = Ixl
of a real number is an honest "function" or not. For, after all, the defi-
nition of Ixl is given "in pieces" by
Ixl = { x, ~ x > 0,
-x, If x < O.
As mathematics developed, it became increasingly clear that the
requirement that a function be a formula was unduly restrictive and
that a more general definition would be useful. It also became evident
that it is important to make a clear distinction between the function
itself and the values of the function. The reader probably finds himself
~n the position of the mathematician of a century ago in these two re-
spects and due to no fault of his own. We propose to bring him up to
date with the current usage, but we shall do so in two steps. Our first
revised definition of a function would be:
A function f from a set A to a set B is a rule of correspondence that
assigns to each x in a certain subset 5) of A, a uniquely determined
element f(x) of B.
Certainly, the explicit formulas of the type mentioned above are
included in this tentative definition. The proposed definition allows
the possibility that the function might not be defined for certain elements
of A and also allows the consideration of functions for which the sets A
and B are not necessarily real numbers (but might even be desks and
chairs-or blondes and brunettes).
However suggestive the proposed definition may be, it has a signifi-
cant and fatal defect: it is not clear. It merely shifts the difficulty to
that of interpreting the phrase" rule of correspondence." Doubtless the
reader can think of phrases that will satisfy him better than the above
one, but it is not likely that he can dispel the fog entirely. The most
satisfactory solution seems to be to define "function" entirely in terms
of sets and the notions introduced in the preceding section. This has the
disadvantage of being more artificial and loses some of the intuitive
content of the earlier description, but the gain in clarity outweighs
these disadvantages.
The key idea is to think of the graph of the function: that is, a collec-
tion of ordered pairs. We notice that an arbitrary collection of ordered
SEC. 2 FUNCTIONS 13
B
r----------I
m(f) f (a, b)~
~ =---=1-=---==-=--- j :
I ! I
I ~ I
l I. : :(f) J J
Figure 2.1. A function as a graph.
pairs cannot be the graph of a function, for, once the first member of
the ordered pair is named, the second is uniquely determined.
After these preliminaries, we take the plunge and make our formal
definition.
2.1 DEFINITION. Let A and B be two sets which are not necessarily
distinct. A function from A to B is a set j of ordered pairs in A )( B
with the property that if Ca, b) and Ca, b' ) are elements of j, then b = b' .
The set of all elements of A that can occur as first members of elements
in j is called the domain of j and will be denoted by ~(f). (See Figure
2.1.) Similarly, the set of all elements of B that can occur as second
members of elements in j is called the range of f (or the set of values of
j) and will be denoted by CR (f). In case ~ (f) = A, we often say tha.t f
is a mapping of A into B and write f : A --t B. If, in addition, the range
CR(j) coincides with all of B, we sometimes say that f maps A onto B.
b = I(a)
Tabular Representation
TABLE 2.1
Transformations
There is another way of visualizing a function: as a transformation of
part of the set A into part of B. In this phraseology, when (a, b) E f: we
think of 1 as taking the element a from the subset D(j) of A and "trans-
forming" or "mapping" it into an element b = f(a) in the subset (Jl(j)
of B. We often draw a diagram such as Figure 2.2. We frequently use
this geometrical representation of a function even when the sets A and
B are not subsets of the plane.
There is another way of visualiz- x
ing a function: namely, as a machine
which will accept elements of ~(f)
t
as inputs and yield corresponding
elements of CR(f) as outputs. (See
Figure 2.3.) If we take an element x
from ~(f) and drop it into I, then f
out comes the corresponding value
f(x). If we drop a different element
y of 5) (f) into f, we get f(y) (which
mayor may not differ from f(x).
If we try to insert something which
f
f(x)
does not belong to 5) (f) into j, we
find that it is not accepted, for f can Figure 2.3. A function as a ma-
operate only on elements belonging chine.
to D(j).
This last visualization makes clear the distinction between .f and
f(x): the first is the machine, the second is the output of the machine
when we put x into it. Certainly it is useful to distinguish between a
machine and its outputs. Only a fool would confuse a sausage-grinder
16 INTRODUCTION: A GLIMPSE AT SET THEORY
Composition of Functions
We now want to "compose" two functions by first applyingf to each
x in 'n(f) and then applying g to f(x) whenever possible (that is, when
f(x) belongs to :D(g. In doing so, some care needs to be exercised
concerning the domain of the resulting function. For example, if f is
defined on R by f(x) = x 3 and if g is defined for x > 0 by g(x) = Vi,
then the composition go f can be defined only for x > 0, and for these
real numbers it is to have the value ~.
2.2 DEFINITION. Let f be a function with domain :D(j) in A and
range <R (j) in B and let g be a function with domain :D (g) in Band
range <R(g) in C. (See Figure 2.4.) The composition go f (note the
order I) is the function from A to C given by
g 0 f = {(a, c) E A X C: there exists an element b E B such that
(a, b) E f and (b, c) E g}.
The fact that go f is actually a function (and not an arbitrary subset
of AX C) is a consequence of the fact that f and g are functions (see
Exercise 2.A).
2.3 THEOREM. If f and g are fundions, the composition g 0 f is a
function with
'D(g 0 f) = {x E 'D(j) : f(x) E 'n(g)},
m(g 0 f) = {g(j(x : x E 'n(g 0 f)}.
<R(f)n ~(g)
~ (F 0 G){x E c.D (G) :G (x) E ~ (F) }. This last set is void as G (x) < 0
=
for all x E ~(G). Hence the function FoG is not defined at any point,
so FoG is the" void function."
f -1
of ffi (j) is the image under f of a unique element a in 'JJ (1). The inverse
function f- 1 maps the element b into this unique element a.
For the sake of brevity, we sometimes denote the direct image of a set
E under f by the notation f(E). (See Figure 2.6 on the next page.)
It will be observed that if E n 'J) (f) = 0, then f(E) = 0. If E consists
of the single point p in 'JJ (j), then the set fCE) consists of the single
point j(p). Certain properties of sets are preserved under the direct
image, as we now show.
20 INTRODUCTION: A GLIMPSE AT SET THEORY
J-l(G\H) Cf-I(G)\f-I(H).
Exercises
2.A. Prove that Definition 2.2 actually yields a function and not just a subset.
2.B. Let A = B = R and consider the subset C = {(x, y) : x 2 + y2 = I} of
A X B. Is this set a function with domain in R and range in R?
2.C. Consider the subset of R X R defined by D = {(x, y) : Ixl + Iyl = I}.
Describe this set in words. Is it a function?
2.D. Give an example of two functions I, g on R to R such that I g, but
such that 1 0 g = gO I.
2.E. Prove that if I is a one-one function from A to B, then 1-1 =
{(b,a): (a,b) Efl is a function.
2.F. Suppose I is a one-one function. Show that /-1 0 J(x) = x for all x in
~(f) and f 0 f-l(y) = y for all y in m(f).
2.0. Letf and g be functions and suppose that g 0 f(x) = x for all x in f)(f).
Show that f is one-one and that en (f) C f) (g).
2.H. Let f, g be functions such that
t See the books of Halmos and Hamilton-Landin which are cited in the References.
SEC. 3 FINITE AND INFINITE SETS 25
An = {~ , - ~, ~ , - ~,~, - ~, ... } ,
n n n n n n
Note that each of the sets An is countable and that their union is all of
Q. Hence Theorem 3.2 asserts that Q is countable. In fact, we can
enumerate Q by the diagonal procedure:
1 112 11
0,T, - T, 2, T, - 2, 31 .
t GEORG CANTOR (1845-1918) was born in St. Petersburg, studied in Berlin with
Weierstrass, and taught at Halle. He is best known for his work on set theory, which
he developed during the years 1874-1895.
26 INTRODUCTION: A GLIMPSE AT SET THEORY
Xl = 0.ala2aa . ..
X2 = 0.b 1b2ba .
Xa = O.CIC2Ca
Now let Yl be a digit different from 0) al, and 9; let Y2 be a digit different
from 0, b2, and 9; let Y3 be a digit different from 0, Ca, and 9, etc. Consider
the number Y with decimal representation
Y = 0YIY2Ya . .. ;
clearly Y satisfies 0 < Y < 1. The number Y is not one of the numbers
with two decimal representations, since Yn 0, 9. At the same time
y ~ X n for any n since the nth digit in the decimal representations for
y and X n are different. Therefore, any denumerable collection of real
numbers in this interval will omit at least one real number belonging
to this interval., Therefore, this interval is not a countable set.
We have seen that any set that can be put into one-one correspondence
with an initial segment of N is called a finite set and all other sets are
said to be infinite. Suppose that a set A is infinite; we suppose (rather
than prove) that there is a one-one correspondence with a subset of A
and all of N. In other words, we assume that every infinite set contains a
denumerable subset. The proof of this assertion is based on the so-called
Ii Axiom of Choice," which is one of the axioms of set theory. After the
reader has digested the contents of this book, he may turn to an axio-
matic treatment of the foundations which we have been discussing in a
somewhat informal fashion. However, for the moment he would do well
to take the above statement as a temporary axiom. It can be replaced
later by a more far-reaching axiom of set theory.
Exercises
3.A. Exhibit a one-one correspondence between the set E of even natural
numbers and all of N. Exhibit a one-one correspondence between the set 0 of
odd natural numbers and all of N.
3.B. Exhibit a one-one correspondence between all of N and a proper subset
ofN.
3.C. Show that every infinite set can be put into one-one correspondence with
a proper subset of itself. (Hint: every infinite set has a denumerable subset.)
3.D. Show that a finite set does not have any infinite subset.
3.E. Give an example of a denumerable collection of finite sets whose union
is not finite.
3.F. Show that if A can be put into one-one correspondence with Band B
with C, then A can be put into one-one correspondence with C.
I
27
----------------------
28 eH. I THE REAL NUMBERS
Section A Fields
As we have mentioned, in this section we shall examine the II algebraic"
structure of the real number system. Briefly expressed, the real numbers
form a field" in the sense of abstract algebra. In this section we shall
/I
introduce the notion of a field and examine those properties that will
be of particular importance for later study.
In formulating the next definition, we shall follow a convention that
is familiar to the reader from elementary courses and which is also used
in modern algebra. By a binary operation in a set F we mean a function
B with domain F X F and range in F. Instead of using the notation
B (a, b) to denote the value of the binary operation B at the point (a, b)
in F X F, we shall employ symbols such as a + b or a' b. Although this
notation is at variance with the general notation used for functions, it
is much more suggestive and is almost universally employed in such
a situation.
4.1 DEFINITION. A set F is called a field if there are two binary
operations (denoted by + and and called addition and multiplication,
respectively) satisfying the properties
to check directly that they satisfy all the stated properties. Hence they
show that systems with the required properties do exist. The final
example is familiar in quality, but will be seen to be substantially differ-
ent in character from the real and rational fields.
(d) Let F 2 consist of two distinct elements fJ, e and define addition and
multiplication as in Tables 4.1 and 4.2.
TABLE 4.1 TABLE 4.2
+ () e () e
() () e o () ()
e e 0 e () e
For example, the first column after the vertical bar in Table 4.1 indicates
that 8 + 0 = 8, and e + 8 = e. We leave it to the reader to check that
the properties required in Definition 4.1 are satisfied and that 8 and e
have the properties required. In particular, '8 = 0, e = e, e' = e. (What
about O'?)
(e) Let Fa consist of three distinct elements 0, e, t where we define
addition and multiplication as in Tables 4.3 and 4.4. Check to see that
TABLE 4.3 TABLE 4.4
-+() - 8 e t e t
()
(J e t () () () ()
e e t (J e () e t
t t () e t 0 t e
so CH. I THE REAL NUMBERS
(f) Let Q(t) denote the system of all rational functions with rational
numbers as coefficients. Hence an element of Q(t) is a function f of
the form
f(t) = pet) ,
q(t)
where p and q are polynomials in t with rational coefficients and q is not
the zero polynomial. The operations of addition and multi{:!ication are
the usual ones employed when dealing with rational functions.
Properties of Fields
Theorem 4.3 shows that the hypothesis that 0 and e are unique, which
was made in (A3) and (M3), was not essential and can be proved from
the remaining assumptions. We now prove that the elements a and a'
(when a ~ 0) are unique.
4.4 THEOREM. If a and b are elements of F and a + b = 0, then
b = a. Similarly, if a ~ 8 and b are elements of F and a b = e, then b = a'.
PROOF. If a + b = 0, add a to both sides to obtain a + (a + b) =
a + O. Now use (A2) on the left and (A3) on the right to obtain (a + a)
+ b = a. By using (A4) and (A3) on the left side, we obtain b = a.
The second assertion is proved similarly.
Q.E.D.
for x, and Theorem 4.4 yields the uniqueness of the solutions. We now
show that the right-hand sides of these equations can be arbitrary ele-
ments of F and are not required to be 8, e, respectively.
4.5 THEOREM. (a) Let a, b be arbitrary elements of F. Then the
equat~'on a + x = b has the unique solution x = a + b.
(b) Let a ~ 0 and b be arbitrary elements of F. Then the equation
a'x = b has the unique solution x = a'b.
PROOF. Observe that a + (a + b) = (a + a) + b = 0 + b = b so
that x = a + b is a solution of the equation a + x = b. To show the
uniqueness, let Xl be any solution of this equation and add a to both
sides of a + Xl = b to obtain
a + (a + Xl) = a + b.
Employing (A3) , (A4) , (A2) and this relation, we get
Xl = 0 + Xl = (li + a) + Xl = a + (a + Xl) = a + b.
The proof of part (b) of the theorem is similar.
Q.E.D.
Exercises
4.A. Why must a field contain at least two elements?
4.B. Verify that the system C of complex numbers, as defined in Example
4.2(c), forms a field.
4.C. Does the collection of polynomials with rational coefficients form a field?
4.D. Restate Theorem 4.6 employing the usual notation; that is, using 0, 1,
-a, a-I instead of 8, e, 71, a'.
4.E. Restate Theorem 4.7 employing the usual notation.
4.F. If F4 = {O, 1, a, b} consists of four distinct elements, show that F4 forms
a field with the operations given by Tables 4.5 and 4.6.
TABLE 4.5 TABLE 4.6
+ 0 1 a b 0 1 a b
0 0 1 a b 0 0 0 0 0
1 1 0 b a 1 0 1 a b
a a b 1 a 0 a b 1
b b a
1 0 b 0 b 1 a
Show that, with these operations, if x "e 0, then x3 = 1 and if y is any element,
then y4 = Y and 2y = O.
------------------------------------
CR. I THE REAL NUMBERS
+-
-
0 1 b a 0 1 a b
0 0 1 a b 0 0 0 0 0
1 1 a b 0 1 0 1 a b
a a b 0 1 a 0 a 0 a
b b 0 1 a b 0 b a 1
(b) Let R be the field of real numbers (which has not been formally
defined, but may be regarded as familiar). Let P be the subset in R
consisting of all elements x in R for which x > 0 (or, in geometrical
terms, those x which lie to the right of the origin). This subset P forms
a positive class in R.
(c) Let Q(t) be the field of rational functions with rational numbers
as coefficients. Hence an element of Q(t) is a quotient p/q, where p and q
are polynomials with rational coefficients and not all of the coefficients
of q are zero. Let P be the subset of Q(t) consisting of all quotients p/q
such that the coefficient of the highest power of t in the product p(t)q(t)
is a positive rational number (in the sense of Example (a)). This set P
forms a positive class in Q(t), as may be demonstrated.
(d) Let F be the field consisting of the two elements 0, 1. If Pi = {O},
then the subset Pi satisfies properties (i), (ii) of Definition 5.1 but not
property (iii). Further, the subset P 2 = {1} satisfies (ii) but not (i) or
(iii). Hence neither Pl nor P 2 forms a positive class for this field. (It will
be seen from Theorem 5.5, that there is no positive class for this field.)
5.3 DEFINITION. If P is a positive class of elements in a field F, we
say that F is ordered by P and that F is an ordered field. If a belongs to
P, we say that a is a positive element of F and write a > O. If a is either
in P or is 0, we say that a is non-negative and write a > O. If the differ-
ence a - b belongs to P, we write a > b and if a - b either belongs to
P or equals 0, we write a > b.
As usual, it is often convenient to turn the signs around and write
o < a, 0 < a, b < a, and b < a, respectively. In addition, if both a < b
and b < c, then we write a < b < c or c > b > a; if a < band b < c,
then we write a < b < cor c > b > a.
If a-I < 0, then property (c') with c = a-I implies that aa- l < from
which it follows that 1 < 0, contradicting Theorem 5.5 (D). Invoking
5.1 (iii) we conclude that a-I> 0, since the other two possibilities have
been excluded.
(d') This part can be proved by an argument analogous to that used
in (d). Alternatively, we can observe that (-a)-1 = -a- 1 and use
(d) directly.
Q.E.D.
The corollary just proved with b = implies that given any positive
number a, there is a smaller positive number, namely a/2. Expressed
differently, in an ordered field there is no smallest positive number.
It follows from Theorem 5.6(c) with b = 0 that if a > () and c > 0,
then ae > 0. Similarly, from 5.6(c') with a = it follQl,vS that if b <
and c < 0, then be > 0. We now establish the converse statement.
5.8 THEOREM. If ab > 0, then we either have a
we have a < 0 and b < 0.
> and b > 0 or
Absolute Value
The trichotomy property 5.1 (iii) assures that if a 0, then either a
or -a is a positive element. The absolute value of an element a is defined
to be the positive one of the pair {a, - a} ; for completeness, the absolute
value of 0 is defined to be O.
5.9 If F is a field with positive class P, we define the
DEFINITION.
absolute value function by
lal = { a,
if a >0,
-a, if a < 0.
Thus the domain of the absolute value function is all of F, its range
is P V to}, and it maps the elements a, -a into the same element. We
now obtain the basic properties of the absolute value function.
5.10 THEOREM. (a) lal = 0 if and only if a = O.
(b) I-al = lal for all a in F.
(c) jabl = lallbl for all a, b in F.
(d) If c > 0, then lal < c if and only if -c < a < c.
(e) -lal < a < lal for all a in F.
PROOF. (a) By definition, 101 = 0. If a ~ 0, then -a ~ 0 so that
lal ~ 0.
(b) If a > 0, then lal = a = I-al; if a < 0, then lal = -a = I-ali
and if a = 0, then \01 = = 1-01.
(c) If a > 0 and b > 0, then ab > 0 sO that lab\ = ab = lallbl. If
a < and b > 0, then ab < 0 so that jabl = - (ab) = (-a)b = lallbl.
The other cases are handled similarly.
(d) If laj < c, then both a < c and -a < c. From the latter and
Theorem 5.6(c') we have -c < a so that -c < a < c. Conversely, if
this latter relation holds, then we have both a < c and -a < c, whence
lal < c.
(e) Since lal > 0, this part follows from (d).
Q.E.D.
The next result is commonly called the Triangle Inequality and will
be used frequently in the sequel.
5.11 THEOREM. Let a, b be any elements of an ordered field F, then
Iial - Ibl! < la + bl < lal + Ihl
PROOF.According to Theorem 5.10(e), we obtain -Ial < a < lal
and since Ibl = I-bl, we also have -Ibl < + b < Ibl. Employing 5.6(b)
we infer that
- Clal + jb\) = -lal - Ibl < a + b < lal + Ibl
SEC. 5 ORDERED FIELDS 39
Intervals
t This term is named for ARCHIMEDES (287-212 B.C.), who has been called lithe
greatest intellect of antiquity," and was one of the founders of the scientific method.
SEC. 5 ORDERED FIELDS 41
then there are arbitrarily small irrational elements. \Ve first note that
if ~ is an irrational element of F, then either ~ or - ~ is a positive irrational
element of F.
5.15 THEOREM. Let F be an Archimedean field containing a positive
irrational element ~. If z is a positive element of F, then there is a natural
number m such that the positive irrational element Um satisfies 0 < ~/m < z.
PROOF. Since ~ > 0, Z > 0 it follows from Theorem 5.6(d) and
5.6(c) that ~/z > O. Since F is AI' chimedean, there exists a natural
number m such that 0 < Uz < m. By using Theorem 5.6 again, we
obtain the conclusion.
Q.E.D.
We now show that in any Archimedeun field F the mtional elements
are" dense" in the sense that between any two elements of F there is a
rational element of F. OncE: again, we shall use the Well-ordering
Property of N.
5.16 THEOREM. If y, z a're elements of an Archimedean field F and
if y < z, then there is a rational element T of F such that y < T < Z.
PROOF. It is no loss of generality to assume that 0 < Y < z. (Why?)
Since y > 0 and z - y > 0, it follows from Theorem 5.14(b) that there
is a natural number m such that 0 < l/m < y and 0 < lim. < Z - y.
From Theorem 5.14(a) there is a natural number k such that kim =
k(l/m) > y and we let n be the smallest such natural number. There-
fore, (n - l)jm < y < njm, and we shaH now show that n/m < z.
If this latter relation does not hold, then z < n/m and we have
n-l n
--<y<z<_
m m
It follows from this (as in Exercise 5.D) that z - y < 11m, contradicting
the fact that 11m < z - y.
Q.E.D.
If F is an Archimedean field with at least one irrational element ~,
then the irrational elements of F are also dense in the sense that between
any two elements of F there is an irrational element of F.
5.17 THEOREM. If the A rchimedean field F contains an irrational
element ~ and if y < z, then there is a rational number T such that the
irrational element r~ satisfies y < r~ < z.
The proof of the result is very close to that of Theorem 5.16 except
that it is based on Theorem 5.15 rather than 5.14(b). We leave it as an
exercise for the reader.
42 CH. I THE REAL NUMBERS
Nested Intervals
The next result provides a theoretical basis for the binary (= base 2)
expansion of the fractional part of an element in an Archimedean field.
A similar result can be obtained for any base.
5.18 THEOREM. Let x be an element of an Archimedean field F. For
each integer n = 0, 1,2, ... , there is a closed interval
If the point x belongs to the first of these two intervals, we put a2 = al;
otherwise, we put
1, = [a" ad ;1
By continuing in this manner, we obtain intervals In for n = 0, 1, 2, ...
each containing x. (See Figure .5.].) lVloreover J the end points of these
intervals are rational elements of F and 1 n+l C In for each n.
Q.E.D.
SEC. 5 ORDERED FIELDS 43
I~ II ~
IE 12 )1
IE 13 :-I
~14~
15
~
16
I-l
Ixl I
The reader should convince himself that the nested sequence (K,,) of
non-empty closed intervals in Q does not have any common point,
since ~ is not an element of Q. Thus not every nested sequence of closed
intervals in Q has a common point in Q, although the corresponding
sequence will have a common point in a larger Archimedean field. The
essential distinction between the real number system R and any other
Archimedean field F is that every nested sequence of closed intervals in
R has a common point. It is this property that assures that there are
no "gaps" in the real number system.
Exercises
5.A. No ordererl field contains only a finite number of elements.
5.B. In an ordered field, if a? + b'l = 0, then a = b = O.
5.C. Show that it is not possible to make the complex numbers into an ordered
field.
5.D. If 0::; x ~ band 0 ~ y ::; b, then [x - yl ~ b. More generally, if
a ::; x ~ b and a ~ y :s; b, then Ix - yl < b - a.
5.E. If 1 + a > 0 and n E N, then (1 + a)n > 1 + na. (Hint: use mathe-
matical induction.) This inequality is sometimes called Bernoulli's Inequality.t
5.F. Suppose e > 1. If n E N, then en > c. More generally, if m, n E Nand
m> n, then em> e". (Hint: e = 1 + a with a > 0.)
5.G. Suppose 0 < c < 1. If n E N then 0 < en < c. More generally, if
m, n E Nand m> n, then em::; e".
5.H. If n E N, then n < 2".
5.L If a, b are positive real numbers and n E N, then an < b71 if and only if
a < b.
5.J. Show that the rational numbers form an Archimedean field with the
order given in Example 5.2(a).
5.K. Show that the ordered field Q(t) is not Archimedean with the order
given in Example 5.2(c).
5.L. Show that an ordered field is Archimedean if and only if for each element
z > 0 there is a natural number n such that
1
0<-n < z.
2
t JACOB BERNOULLI (1654-1705) was a member of a Swiss family that produced
severalrnathematicians who played an important role in the development of calculus.
SEc.6 THE REAL NUMBER SYSTEM 45
5.M. Show that statements (i) and (ii) after Theorem 5.14 do not hold in an
Archimedean field.
5.N. Give the details of the proof of Theorem 5.17.
5.0. Explain how Theorem 5.18 provides a basis for the binary expansion of
the fractional part of an element in an Archimedean field.
S.P. Modify Theorem 5.18 to provide a basis for the decimal expansion of a
fraction.
5.Q. Prove that the intervals in Theorem 5.18 have x as the only common
point.
infS\ / 7 S ~ {SUPS
sup S, inf S.
whereas the other does not. Thus when we say that a set has a supremum
we are making no statement as to whether the set contains the supremum as
an element or not.
Since the supremum of a set S is a special upper bound, it is plain that
only sets which are bounded above can have a supremum. The empty
set is bounded above by any real number; hence it does not have a
supremum. However, it is a deep and fundamental property of the real
number system that every non-empty subset of R which is bounded
above does have a supremum. We now establish this result.
~ SUPREMUM PRINCIPLE. Every non-empty subset of real numbers
~ has an upper bound also has a supremum.
PROOF. Let a be some real number which is not an upper bound of a
non-empty set S and let b be an upper bound of S. Then a < h, and we
let II be the closed interval [a, b]. If the point (a + b)/2 of II is an
upper bound of S, we let 12 = [a, (a + b)/2]; otherwise, we let 12 =
[(a + b) /2, b]. In either case, we relabel the left and right end point of
12 to be a2 and b2, respectively. If the midpoint (a2 + b2)/2 of 12 is an
upper bound of S, we let 1 3 = [a2, (a2 + b2)/2]; otherwise, we let 13 =
[(a2 + b2)/2, b2]. We then relabel the end points, bisect the interval, and
so on. In this way we obtain a nested sequence (In) of non-empty closed
intervals such that the length of In is (b - a) /2n~l, the left end point
an of In is not an upper bound of S, but the right end point bn of In is an
upper bound of the set S. According to the completeness (cL Definition
6.1) of the real numbers, there is a real number x which belongs to all
of the intervals In. We shall now show, using Lemma 6.5, that x is the
supremum of S.
Suppose there exists an element s in S such that x < s. Then s - x > 0
and there exists a natural number n such that
b- a
length (In) = bn - an =
2n - 1
< s - x.
Since x belongs to In, we have an < x < bn < 8, which contradicts the
fact that bn is an upper bound of S. Hence x is an upper bound of S.
Now suppose that v < x; since x - v > 0, there exists a natural
number m such that
b- a
length (1 m) = bm - am =
2 m- 1
< x-v.
Since x Elm, then v < am < x <b m By construction, am is not an
upper bound of S, so there exists an element s' in S such that v < am < s'.
According to Lemma 6.5, the point x is the supremum of S.
Q.E.D.
------------------------------------
The reader should note where the completeness of the real number
system was used in the proof of the Supremum Principle. It is a fact of
some interest and importance that if F is an ordered field in which every
non-empty set which has an upper bound also has a supremum, then the
ordering is necessarily Archimedcan and the completeness property
stated in Definition 6.1 also holds (see Exercises 6.J, 6.K). Hence we
could characterize the real number system as an ordered field in which
the Supremum Principle holds, and this means of introducing the real
number system is often used. We chose the approach used here because
it seems more intuitive to us and brings out all the needed properties
in a reasonably natural way.
Dedekind Cuts
----~1'-_------..J,,..-------lJ.~------
F2 ------------~------------
Fa - - - - - - -
F,i
Figure 6.3. The Cantor set.
of elements of F and that the points of F can be put into one-one corre-
spondence with the points of I. Hence the set F contains a large number
of elements.
We now give two senses in which F is "thin." First we observe that
F does not contain any non-void interval. For if x belongs to F and
(a, b) is an open interval containing x, then (a, b) contains some middle
thirds that were removed to obtain F. (Why?) Hence (a, b) is not a
subset of the Cantor set, but contains infinitely many points in its
complement e (F).
A second sense in which F is thin refers to "length." While it is not
possible to define length for arbitrary subsets of R, it is easy to convince
oneself that F cannot have positive length. For, the length of F I is i,
that of F 2 is -l, and, in general, the length of F n is (i) n. Since F is a sub-
set of Fn, it cannot have length exceeding that of Fn' Since this must
be true for each n in N, we conclude that F, although uncountable,
cannot have positive length.
As strange as the Cantor set may seem, it is relatively well behaved
in many respects. It provides us with a bit of insight into how compli-
cated subsets of R can be and how little our intuition guides us. It also
serves as a test for the concepts that we will introduce in later sections
and whose import are not fully grasped in terms of intervals and other
very elementary subsets.
Exercises
6.A. Show that the open intervals I n = (0, lin), n E N, do not have a
common point.
6.B. Show that the unbounded sets
6.J. Prove that if F is an ordered field in which every non-empty set which
has an upper bound also has a supremum, then F is an Archimedean field.
6.K. If (In) is a nested sequence of closed intervals, In = [an, bn], n E N,
show that the numbers
a = sup {an \, b = inf {b,,},
belong to all of the In. In fact, show that the intersection of the In, n E N, con-
sists of the interval [a, b]. Conclude, therefore, that in an ordered field in which
every non-empty set which has an upper bound also has a supremum, the
Completeness Property of Definition 6.1 also holds.
6.L. Let A = {x E Q: x :s; 0 or x'l. < 2} and B = {x E Q: x > 0 and x'l. > 21.
Prove that the pair A, B forms a cut in Q. Show that there does not exist -a
rational number c which is both an upper bound for A and a lower bound for B.
Hence there is no rational number determining this cut.
6.M. Show that every element in the Cantor set F has a ternary (= base 3)
expansion using the digits 0, 2.
6.N. Show that the Cantor set F is a non-denumerable subset of I. [Hint: if
the denumberable collection of "right-hand" end points of F is deleted, then
what remains can be put into one-one correspondence with all of the non-
denumberable subset [0, 1) of R.]
6.0. Show that every open interval (a, b) containing a point of F also contains
an entire "middle third" set, which belongs to e(F). Hence the Cantor set F
does not contain any non-void open interval.
6.P. By removing sets with ever decreasing length, show that we can construct
a "Cantor-like" set which has positive length. How large can we make the length?
6.Q. Show that F is not the union of a countable collection of closed intervals.
6.R. If S is a bounded set of real numbers and if So is a non-empty subset of S,
then
inf S ::;; inf So < sup So ::; sup S.
(Sometimes it is more convenient to express this relation in another form. Let f
be defined on a non-empty set D and have bounded range in R. If Do is a non-
empty subset of D, then
inf If(x) : x E Dl < inf If(x) : x E Do}
~ sup U(x) : x E Do} ~ sup If(x) : XED}.)
6.S. Let X and Y be non-empty sets and let f be defined on X X Y to a
bounded subset of R. Let
ft(x) = sup {I(x, y) : y E Y},
fJ.(Y) = sup {I(x, y) : x E Xl.
Establish the Principle of Iterated Suprema:
sup {f(x, y) : x E X, Y E Y} = sup {fleX) : x E Xl
= sup {f2(Y) : y E Y}.
SEC. 6 THE REAL XUMBER SYSTEM 55
(We sometimes express this in symbols by
supf(x, y) = sup supf(x, y) = sup supf(x, y).)
x, y x y y x
Projects
6.a. If a and b are positive real numbers and if n E N, we have defined an and
bn It follows by mathematical induction that if m, n E N, then
(c) Show that the value of aT given in part (a) does not depend on the repre-
sentation of r in the form min. Also show that if r is an integer, then the new
definition of aT gives the same value as the old one.
(d) Show that if r, 8 E Q, then aTa' = ar+, and (aT)' = an.
(e) Show that aTbT = (ab)T.
(f) If r E Q, then a < b if and only if aT < bT.
(g) If r, 8 E Q, then r < 8 if and only if aT < a'l.
(h) Ii c is a real number satisfying 0 < c < 1, we define cT = (l/c)-r. Show
that parts (d) and (e) hold and that a result similar to (g), but with the in-
equality reversed, holds.
6.{3. Now that a z has been defined for rational numbers x, we wish to define
it for real x. In doing so, make free use of the results of the preceding project. As
before, let a and b be a real numbers exceeding 1. If u E R, let
aU = sup Tu(a).
Prove that this definition yields the same result as the previous one when u is
rationaL Establish the properties that correspond to the statements given in
parts (d)-(g) of the preceding project. The very important function which has
been defined on R in this project is called the exponential function (to the
base a). Some alternative definitions will be given in later sections. Sometimes
it is convenient to denote this function by the symbol
II
58
SEC. 7 ('ARTESIAN SPACES 59
by the single letter x and use s lmilar notations for other p-tuples. The
real numbers El' E2, ..., Ep will te called the first, second, .. 0' pth coor-
dinates (or components) respec';ively, of Xo Sometimes we refer to x as
a vector or sometimes merely as a point or element of Rp. Particular
mention should be made of the zero vector or origin which is the element
e of RP, all of whose coordinatns are the real number zero.
The Algebra of Vectors
The reader will note that the product defined by equation (7.1) is
a function with domain R X Rp and range Rp. We shall now define
a function with domain R P X R P and range R that will be useful.
SEC. 7 JARTESIAN SPACES 61
7.4 DEFINITION. If X and l' are elements of Rp, we define the inner
product, sometimes called the d(~t or scalar product, of x = (~l, ~2, .. " ~p)
and y = (7]1, 7]2, , 7]p) to be tl.e real number
x y = ~17]l + ~27]2 + ... + ~p7]p.
The norm (or the length) of x is defined to be the real number
Ix! = vx:x = (~12 + ... + ~p2)1/2.
7.5 INNER PRODUCT PROPEHTIES. If x, y, z belong to Rp and c 1,8
a real number, then
(i) xx > 0;
(ii) xx = 0 if and only if x = 0;
(iii) x y = y. x ;
(iv) x (y + z) = xy + xz and (x + y)z = xz + yz;
(v) (ex)'Y = c(xy) = x (C1').
PARTIAL PROOF. For example, the first equality in (iv) states that
x (y + z) = h(7]l + tl) + ~2(7]2 + t2) + ... + ~p(7]p + t p)
Icxl 2= i=l
L lc~il2 = L
Icl i=l
2
l~il2 = Icl 2Ix1 2
To prove (iv) , we first observe that
!X+YI2= (x+y)(x+y) =xx+2xy+yy.
According to Corollary 7.7, IX'YI < IxllYI, so that
Ix + y]2 < Ixl 2 + 21xllYI + 1yl2 = (Ixl + lyi)2,
which yields the second part of (iv). In addition, we have the
inequalities
Ixl < \x + Y - yl < Ix + yl + Iy]' Iyl < Ix + yl + Ixl
Therefore it is seen that both Ixl - Iyl and Iy] - Ixl, and hence
Ilxl - Iyll, are at most equal to Ix + yl. Consequently, we have
Ilxl - Iyll < Ix + yl < Ixl + Iyl
Replace y by -y and use the fact that Iyl = l-yl to obtain (iv).
Q.E.D.
The real number Ixl can be thought of as being either the length of x
or as the distance from x to e. More generally, we often interpret the
real number Ix - yl as the distance from x to y. With this interpretation,
property 7.8 (i) implies that the distance from x to y is a non-negative
real number. Property 7.8(ii) asserts that the distance from x to y is
zero if and only if x = y. Property 7.8(iii), with c = -1, implies that
Ix - yl = Iy - xl which means that the distance frorll x to y is equal
y
(I
Figure 7.1
CR. II THE TOPOLOGY OF CARTESIAN SPACES
Ix - yl < Ix - zl + Iz - yl,
which means that the distance from x to y is no greater than the sum of
the distance from x to z and the distance from z to y.
7.9 DEFINITION. Let x E Rp and let r > O. Then the set {y E Rp:
Ix - yl < r l is called the open ball with center x and radius r and the
set {y E Rp: Ix - yl < r} is called the closed ball with center x and
radius r. The set {y E Rp:lx - yl = r} is called the sphere in Rp with
center x and radius r. (See Figure 7.2.)
Note that the open ball with center x and radius r consists of all
points in Rp whose distance from x is less than r.
7.10 PARALLELOGRAM IDENTITY. If x and yare any two vectors in
Rp, then
(7.6)
Q.E.D.
Exercises
holds if and only if xy = O. In this case, one says that x and yare orthogonal
or perpendicular.
7.K. A subset K of Rp is said to be convex if, whenever x, y belong to K and
t is a real number such that 0 < t < 1, then the point
tx + (1 - t)y
also belongs to K. Interpret this condition geometrically and show that the
subsets
K 1 = {xE R2: Ixl < 11,
K2 = I (E, 1)) E R2 : 0 < ~ < 1) J,
Ka = {(~, 1)) E R2 : 0 < 1) < ~ < 1 J,
K4 = Ix E R2 : Ixl = 11
is not convex.
7.1. The intersection of any collection of convex subsets of Rp is convex. The
union of two convex subsets of Rp may not be convex.
7.M. If K is a subset of Rp, a point z in K is said to be an extreme point of
K if there do not exist points x, yin K with x ~ z and y ~ z and a real number t
with 0 < t < 1 such that z = Lx + (1 - t)y. Find the extreme points of the
sets K t , K 2, K a in Exercise 7.K.
SEC. 7 CARTESIAN SPACES 67
7.N. If M is a set, then a real-valued function d on M X M is called a metric
on M if it satisfies:
(i) d(x, y) > 0 for all x, y in M;
(ii) d(x, y) = 0 if and only if x = y;
(iii) d(x, y) = dey, x) for all X, yin M;
(iv) d(x, y) < d(x, z) + d(z, y) for all x, y, z in M.
It has been observed that the Norm Properties 7.8 imply that if the function d2
is defined by d2 (x l y) = Ix - YI, then d2 is a metric on Rp.
Use Exercise 7.B and show that if d1is defined by d1(x, y) = Ix - yh for x, yin
Rp, then d1 is a metric on Rp. Similarly, if doo is defined by doo (x, y) = Ix - Yloo,
then den is a metric on Rp. (Therefore, the same set can have more than one
metric.)
7.0. Suppose that d is a metric on a set M. By employing Definition 7.9 as a
model, use the metric to define an open ball with center x and radius r. Interpret
the sets 8 1, 8 2, and 8m in Exercise 7.D as open balls with center 8 in RZ relative
to three different metrics. Interpret Exercise 7.E as saying that a ball with center
0, relative to the metric dz, contains and is contained in balls with center 0,
relative to the metric dl Make similar interpretations of Exercise 7.F and
Theorem 7.11.
7.P. Let M be any set and let d be defined on M X M by the requirement that
O, if x = y,
{
d(x, y) = 1, if x rf: y.
Show that d gives a metric on M in the sense defined in Exercise 7.N. If x is any
point in M, then the open ball with center x and radius 1 (relative to the metric
d) consists of precisely one point. However, the open ball with center x and
radius 2 (relative to d) consists of all of M. This metric d, is sometimes called
the discrete metric on the set M.
Projects
ab < (a 2 + b2 )/2,
and that the equality holds if and only if a = b. (Hint: consider (a - b)2.)
(b) Let al and a2 be positive real numbers. Show that
~ < (al + G2)/2
and that the equality holds if and only if al = a2.
(c) Let at, a2, ..., am be m = 2n positive real numbers. Show that
(ll<) (ala2 . . a.,,)1/m < (al + az + + am)/m
and that the equality holds if and only if al = = am.
68 CH. II THE TOPOLOGY OF CARTESIAN SPACES
(d) Show that the inequality (*) between the geometric mean and the arith~
metic mean holds even when m is not a power of 2. (Hint: if 2n - 1 < m < 2n , let
bi = ai for j = 1, ..., m and let
Show that the equality holds if and only if the ordered sets {a1, a2, , ~} and
{bi , b2, , bn J are proportional.
(g) Use part (f) and establish the Triangle Inequality
{i
1=1
(aj + bi)2l / < J
12
i
L=1
arll12 + {i
;=1
b;11/2.
7.{3. In this project, let {aI, a2, .., an}, and so forth be sets of n non~negative
real numbers and let r > 1.
(a) It can be proved (for example, by using the Mean Value Theorem) that
if a and b are non-negative and 0 < a < 1, then
aab i - a < aa + (1 - a)b
and that the equality holds if and only if a = b. Assuming this, let r > 1 and
let 8 satisfy
1
-r + 81- = 1,
(so that 8> 1 and r +8 = r8). Show that if A and B are non-negative, then
Ar B.
AB<-+-,
r 8
(Hint: Let A = {L a/p/l' and B = I L b/pls and apply part (a) to ail A and
btlB.)
(c) Using Holder's Inequality, establish the Minkowski Inequalityt
)1/1' {n n b( lIlT ,
lIlT + { ~
(ai + biY
11
L <.~ a/
1J=l J-1 ;-1
(Hint: (a + b)r =
(a + b)(a + b)r/a = a(a + b)r/a + b(a + bYI. )
(d) Using Holder's Inequality, prove that
Show that if al < a2 < ... < an and bl < b2 ~ . < bn , then
(f) Suppose that 0 < at < a2 ~ . ~ an and 0 < bl < b2 < ... <bn and
r > 1. Establish the Chebyshev Inequalityt
Show that this inequality must be reversed if I ail is increasing and I bi} IS
decreasing.
Closed Sets
8.6 CLOSED SET PROPERTIES. (a) The empty set 0 and the entire
space R P are closed in R p.
(b) The union of any two closed sets is closed in Rp.
(c) The intersection of any collection of closed sets is closed in Rp.
SEC. 8 ELEMENTARY TOPOLOGICAL CONCEPTS 73
Neighborhoods
We now introduce some additional topological notions that will be
useful later and which will permit us to characterize open and closed
sets in other terms.
8.7 DEFINITION. If X is a point in Rl', then any set which contains
an open set containing x is called a neighborhood of x in Rv. A point x
is said to be an interior point of a set A in case A is a neighborhood of
the point x. A point x is said to be a cluster point of a set A (or a point
of accumulation of A) in case every neighborhood of x contains at least
one point of A distinct from x.
Before we proceed any further, it will be useful to consider some
reformulations and examples of these new concepts.
8.8 EXAMPLES. (a) A set N is a neighborhood of a point x if and
only if there exists an open ball with center x contained in N.
(b) A point x is an interior point of a set A if and only if there exists
an open ball with center x contained in A.
(c) A point x is a cluster point of a set A if and only if for every
natural number n there exists an element l'n belonging to A such that
o < Ix - xnl < lin.
(d) Every point of the unit interval I of R is a cluster point of I.
Every point in the open interval (0, 1) is an interior point of I in R,
but 0 and 1 are not interior points of I.
(e) Let A be the open interval (0, 1) in R. Then every point of A
is both a cluster and an interior point of A. However, the points x = 0
and x = 1 are also cluster points of A. (Hence, a cluster point of a set
does not need to belong to the se.1.-)--
(f) Let B = I n Q be the set of all rational points in the unit interval.
Every point of I is a cluster point of B in R, but there are no interior
points of B.
(g) A finite subset of Rp has no cluster points. (Why?) A finite sub-
set of Rv has no interior points. (Why?)
We now characterize open sets in terms of neighborhoods and interior
points.
8.9 THEOREM. Let B be a subset of Rp, then the following statements
are equivalent:
(a) B is open.
(b) Every point of B is an interior point of B.
(c) B is a neighborhood of each of its points.
PROOF. If (a) holds and x E B then B, which is open, is a neighbor
hood of x and x is an interior point of B.
CH. II THE TOPOLOGY OF CARTESIAN SPACES
Intervals
The reader will recall from Section 6 that the crucial completeness
property of the real number system hinged on the fact that every nested
sequence of non-empty closed intervals in R has a common point. We
shall now prove that this property carries over to the space Rp.
8.12 NESTED INTERVALS THEOREM. Let (h) be a sequence of non-
empty closed intervals in Rp which is nested in the sense that 1 1 ;;;2
12 ~ ~ I k ~ '. Then there exists a point in Rp which belongs to all
of the intervals.
PROOF. Suppose that I k is the interval
I le = {(~l, ..., ~p) = alel < tl < ble1 , , akp < tp < blep }.
It is easy to see that the intervals {[akl, bk1 ] : kEN} form a nested
sequence of non-empty closed intervals of real numbers and hence by the
completeness of the real number system H, there is a real number 111
which belongs to all of these intervals. Applying this argument to each
coordinate, we obtain a point y = (111, ., 11p) of Rp such that if }
satisfies} = 1, 2, ..., p, then 11j belongs to all the intervals {[akj, bkj ]:
kEN}. Hence the point y belongs to all of the intervals (I le).
Q.E.D.
~---
I
I
t
\
\
" ...... ......... ~
--
.........~2:l:lZ?:~
---_.--
Figure 8.4
Al = {t E R: + t(Zk -
Zk-l Zk-l) E A nG},
BI = {t E R : Zk-l + t(Zk - Zk-l) EB n G},
then it is easily seen that Al and B 1 are disjoint non-empty open subsets
of R. Hence the pair AI, B I form a disconnection for the unit interval
I, contradicting Theorem 8.16. Therefore, if G is not connected, there
exist two points in G which cannot be joined by a polygonal curve in G.
SEC. 8 ELEMENTARY TOPOLOGICAL CONCEPTS 81
/'
,.,---............. '-../
,.------- ............
! A \ "-
I \ B \\
\ \
). ) I
I
(
%1;</
, ~ y
I
I
\ / ----'\ \
\ / \ /
'--_/ \
',-_/
/
Figure 8.6
Exercises
8.A. Justify the assertion about the sets G, F made in Example 8.2(b).
8.B. Justify the assertions made in Example 8.2(c).
8.C. Prove that the intersection of a finite collection of open sets is open in
Rp. (Hint: use Theorem 8.3(b) and induction.)
8.D. Verify in detail that the sets Gn , defined in equation (8.1), are open and
that their intersection is not open. (Compare with Exercise 8.A.)
82 CH. 11 THE TOPOLOGY OF CARTESIAN SPACES
8.E. Show in detail that the set in equation (8.2) is neither open nor closed
inR.
8.F. Give an example of a subset of R2 which is neither open nor closed.
8.G. Write out the details of the proof of Theorem 8.6.
8.H. The union of a collection of closed subsets of Rp may fail to be closed.
8.1. Every open subset of Rp is the union of a countable collection of sets
which are closed. (Hint: the set of points all of whose coordinates are rational is
countable.)
8.J. Every closed subset in Rp is the intersection of a countable collection of
sets which are open.
8.K. If A is a subset of Rp, let A - denote the intersection of all closed sets
which contain A. The set A-is called the closure of A. Prove that A-is a closed
set and that
A cA-
- , (A-)- = A-,
0- = 0.
Observe that A - is the smallest closed set containing A.
8.1. If A, B are any subsets of Rp, then is
8.M. If A is a subset of Rp, let A 0 denote the union of all open sets which are
contained in A. The set A 0 is called the interior of A. Prove that A 0 is an open
set and that
- ,
A O cA (AOt = AO,
(A (\ Bt = A 0 (\ BO , (Rpt = Rp.
8.0. Show that an open interval in Rp, as in Definition 8.11, is an open set.
Prove that a closed interval in Rp is a closed set.
8.P. An open subset G of R is the union of a countable collection of open
intervals. (Hint: The set of points in G with rational coordinates is countable.)
The same result holds in Rp, p > 1.
8.Q. If A and B are open sets in R, show that their Cartesian product A X B
is open in R2.
8.R. Let A, B be subsets of R. The Cartesian product A X B is closed in R2
if and only if A and B are closed in R.
8.S. If A is any subset of Rp, then there exists a countable subset C of A such
that if x E A and f > 0, then there is an element z in C such that Ix - zl < E.
Hence every element of A is either in C or is a cluster point of C.
8.T. If A is a subset of Rp, then x is a cluster point of A if and only if every
neighborhood of x contains infinitely many points in A.
8.U. A finite subset of Rp has no cluster points. An unbounded subset of RF
may not have any cluster point.
SEC. 8 ELEMENTARY TOPOLOGICAL CONCEPTS 83
is connected in R2, but it is not true that every pair of points in A can be joined
by a polygonal curve lying entirely in A.
8.Z. Show that the set
Projects
8.0'. Let 111 be a set and d be a metric on M as defined in Exercise 7.N. Re-
examine the definitions and theorems of Section 8, in order to determine which
rarry over for sets that have a metric. It will be seen, for example, that the
notions of open, closed, and bounded set carryover. The Bolzano-Weierstrass
fails for suitable M and d, however. Whenever possible, either show that the
theorem extends or give a counterexample to show that it may faiL
8.{3. Let J be a family of subsets of a set X which (i) contains 0 and X, (ii)
contains the intersection of any finite family of sets in J, and (iii) contains the
union of any family of sets in J. We call J a topology for X, and refer to the
sets in 3 as the open sets. Reexamine the definitions and theorems of Section 8,
trying to determine which carryover for sets X which have a topology J.
84 CR. II THE TOPOLOGY OF CARTESIAN SPACES
U Gni
j=l
It is readily seen that each set Gm , mEN, is open in Rp. Also, the union
of all the sets Gm , mEN, consists of all points of Rp except x. Since
x ~ K, each point of K belongs to some set Om. In view of the compact-
t EDUARD HEINE (1821-1881) studied at Berlin under Weierstrass find later taught
at Bonn and Halle. In 1872 he proved that a continuous function on a closed interval
is uniformly continuous.
(F. E. J.) EMILE BOREL (1871-1938), a student of Hermite's, \vas professor at
Paris and one of the most influential mathematicians of his day. He made numerous
and deep contributions to analysis and probability. In 1895 he proved that if a count-
able collection of open intervals cover a closed interval, then they have a finite sub-
covering.
86 CH. II THE TOPOLOGY OF CARTESIAN SPACES
...----------.-----------,- - - - - - - - -
12
~::I~J~II3
16
1....-------'-;----1- - _ _ _ _ _
L- ---J- _
Figure 9.2
88 CH. II THE TOPOLOGY OF CARTESIAN SPACES
bisection of the sides of the interval II = {(~I, ..., ~p): I~jl < r} so the
length of the side of h is r/2k-2. It follows from Theorem 7.11 that if
w Elk, then
ryp
wi -< -2 - .
Iy - k- 1
Q.E.D.
t HENRI LEBESGUE (1875-1g41) is best known for his pioneering work on the modem
theory of the integral which is named for him and which is basic to present-day
analysis.
90 CH. II THE TOPOLOGY OF CARTESIAN SPACES
Figure 9.3
t RENE LOUIS BAIRE (1874-1932) was a professor at Dijon. He worked in set theor~'
and real analysis.
SEc.9 THE THEOREMS OF HEINE-BOREL AND BAIRE 91
/_/G
/ "f.... . . . . _
/. \
/ I
/' '\
I \
I \
,)
l 1
)
I
./
~'\ /
\ ;'
'-""" "'-- .... / - ./
Figure 9.4
Exercises
9.A. Show directly from the definition (Le., without using the Heine-Borel
Theorem) that the open ball given by {(x, y): x2 + y2 < 11 is not compact in R2.
9.B. Show directly that the entire space R2 is not compact.
9.C. Prove directly that if K is compact in Rp and F is a closed subset of K,
then F is compact in Rp.
9.D. Prove that if K is a compact subset of R, then K is compact when re-
garded as a subset of R2.
9.E. By modifying the argument in Example 9.2(d), prove that the interval
J = {(x, y) : 0 < x < 1,0 < y < Ij is compact in R2.
9.F. Locate the places where the hypotheses that the set K is bounded and
that it is closed were used in the proof of the Heine-Borel Theorem.
9.G. Prove the Cantor Intersection Theorem by selecting a point Xn from Fn
and then applying the Bolzano-Weierstrass Theorem 8.13 to the set {x n : n EN}.
9.H. If F is closed in Rp and if
d(x, F) = inf {Ix - zl : Z E FI = 0,
then x belongs to F.
9.!. Does the Nearest Point Theorem in R imply that there is a positive real
number nearest zero?
9.J. If F is a non-empty closed set in Rp and if x ~ F, is there a unique point
of F that is nearest to x?
9.K. If K is a compact subset of Rp and x is a point of Rp, then the set K x =
{x + y: y E K I is also compact. (This set K% is sometimes called the translation
of the set K by x.)
9.L. The intersection of two open sets is compact if and only if it is empty.
Can the intersection of an infinite collection of open sets be a non-empty com-
pact set?
9.M. If F is a compact subset of R2 and G is an open set which contains F,
then there exists a closed polygonal curve C lying entirely in G which surrounds F.
9.N. Prove that a line in R2 is a closed subset of R2 and contains no non-void
open sets.
9.0. If the set A does not contain a non-void open set, can the closure A-
(see Exercise 8.K) contain a non-void open subset of Rp?
9.P. If a set B contains a non-void closed set, must the interior BO (see Exercise
8.M) contain a non-void closed subset of Rp?
9.Q. The set of rational numbers Q is not the intersection of a countable
collection of open sets.
9.R. A closed set F does not contain any non-void open set of Rp, if and only if
every point in Rp is a cluster point of its complement.
9.S. A set D is said to be dense in Rp, if every point in Rp is a cluster point
of D. Prove that D is dense in Rp, if and only if its closure D- (see Exercise 8.K)
coincides with Rp.
9.T. Give an example of an open subset of Rp which is dense in Rp. Can you
give an example of a closed subset of Rp which is dense in Rp?
CH. II THE TOPOLOGY OF CARTESIAN SPACES
9.D. If DI and Dz are open sets which are dense in Rp, then DI U D2 and
D I n Dz are also dense open sets in Rp.
9.V. If D1 and Dz are dense subsets of Rp, are DI V D z and D1 () D2 also dense
subsets of Rp?
9.W. If {D... :n E N} is a countable collection of dense open subsets of Rp,
then their intersection is a dense subset of Rp.
10.2 THEOREM. The complex number system C forms a field with the
operations defined in Definition 10.1.
PARTIAL PROOF. We shall leave most of the details to the reader
and mention only that the zero element of C is the complex number
(0,0) and the identity element is (1,0). Furthermore, if z = (x, y) =;6
(0,0), then the inverse to z is given by
z =
I (x+
x2 y2 ' x2
-y )
+ y2
Q.E.D.
Sometimes it is convenient to adopt part of the notation of Section 7
and write
az = a(x, y) = (ax, ay),
when a is a real number and z = (x, y) is in C. With this notation, it is
clear that each element in C has a unique representation in the form of
a sum of a product of a real number with (1,0) and of the product of a
real number with (0, 1). Thus we can write
x=Rez=--, y=Imz=--
z+z z-z
2 2i
By Definition 10.1, (0, 1) (0, 1) = (-1,0) which can be written as
i = - 1. Thus in C the quadratic equation
2
Z2 +1= 0,
has a solution. The historical reason for the development of the complex
number system was to obtain a system of "numbers" in which every
quadratic equation has a solution. It was realized that not every equa-
tion with real coefficients has a real solution, and so complex numbers
were invented to remedy this defect. It. is a well-known fact that not
only do the complex numbers suffice to produce solutions for every
quadratic equation with real coefficients, but they also suffice to guar-
96 CH. II THE TOPOLOGY OF CAHTESIAN SPACES
t CARL FRIEDRICH GAUSS (1777-1855), the prodigious son of a day laborer, was
one of the greatest of all mathematicians, but is also remembered for his work in
astronomy, physics, and geodesy. He became professor and director of the Observa-
tory at Gottingen.
SEC. 10 THE COMPLEX NUMBER SYSTEM 97
Exercises
Convergence
98
SEC. 11 INTRODUCTION TO SEQUENCES 99
In other words, a sequence assigns to each natural number n = 1, 2,
..., a uniquely determined element of Rp. Traditionally, the element of
Rp which is assigned to a natural number n is denoted by a symbol such
as X n and, although this notation is at variance with that employed for
most functions, we shall adhere to the conventional symbolism. To be
consistent with earlier notation, if X:N -) Rp is a sequence, the value of
X at n E N should be symbolized by X(n), rather than by x n While we
accept the traditional notation, we also wish to distinguish between the
function X and its values X(n) = x n . Hence when the elements of the
sequence (that is, the values of the function) are denoted by X n , we shall
denote the function by the notation X = (x n ) or by X = (x n : n EN).
We use the parentheses to indicate that the ordering in the range of X,
induced by that in N, is a matter of importance. Thus we are distinguish-
ing notationally between the sequence X = (x n : n E N) and the set
{x n : n E N} of values of this sequence.
In defining sequences we often list in order the elements of the se-
quence, stopping when the rule of formation seems evident. Thus we
may write
(2,4, 6,8, ... )
for the sequence of even integers. A more satisfactory method IS to
specify a formula for the general term of the sequence, such as
(2n : n EN).
Xl = 2, Xn+l = Xn + 2, n > 1.
or by the (apparently more complicated) definition
Xl = 2,
n
- 1 , ...) ,
converge to respectively.
f}l, f}2, , f}p
We shall now present some examples. For the sake of simplicity in the
calculations, we consider examples in R.
SEC. 11 INTRODUCTION TO SEQUENCES 103
whence it follows that IX n - 0\ < e for n > K(e). This proves that
lim (lin) = o.
1
o < 1 + an < e,
showing that lim (1/(1 + an = O.
(c) Let b be a real number satisfying 0 < b < 1 and consider the
sequence (b n ). We shall show that lim (b n ) = O. To do this, we note that
we can write b in the form
1
b = ,
l+a
where a is a positive real number. Furthermore, using the Binomial
Theorem, we have Bernoulli's Inequality (1 + a)n > 1 + na for n > 1.
Hence
1 1
o < bn =
(1 + a)n <
1 + na for n > 1.
Since the term on the right side can be dominated by e for n > K(e),
then so can bn From this we infer that bn -~ 0 whenever 0 < b < 1.
(d) Let c be a positive real number and eonsider the sequence (cl/ n ).
It will be seen that lim (cl/ n ) = 1. We shall carry out the details only
for the case 0 < c < 1, leaving the similar but slightly easier case where
1 < c as an exercise for the reader. We note that if e > 0, then since
o < c, there exists a natural number K(e) such that if n > K(e) then
104 CH. III CONVERGENC
so that
E
lelfn - 11 < 1 + < f E '
whenever n > K(f). This establishes the fact that el/ n ~ 1, when
O<e<1.
11.9 DEFINITION. If X (X'TJ is a sequence in Rp and if rl < T2 <
=
... < r n < ... is a strictly increasing sequence of natural numbers,
then the sequence X' in Rp given by
is called a subsequence of X.
It may be helpful to connect the notion of a subsequence with that of
the compof:ition of two functions. Let g be a function with domain N
and range in N and let g be strictly increasing in the sense that if n < m,
then g(n) < gem). Then g defines a subsequence of X = (x..) by the
formula
X0g= (Xg(n) : n E N).
Since r" > n, then r" > K v and hence X rn belongs to V. This proves
that X' also converges to x.
Q.E.D.
The preceding results have been mostly directed towards proving that
a sequence converges to a given point. It is also important to knmv pre-
cisely what it means to say that a sequence X does not converge to x.
The next result is elementary but not trivial and its verification is an
important part of everyone's education. Therefore, we leave its detailed
proof to the reader.
11.12 THEOREM. If X = (xrJ is a sequence in Rp, then the following
statements are equivalent:
(a) X does not converge to x.
(b) There exists a neighborhood V of x such that if n is any natural
number, then there is a natural number m = 1n(n) > n such that X m does
not belong to V.
(c) There exists a neighborhood V of x and a subsequence X' of X sueh
that none of the elements of X' belong to V.
11.13 EXAMPLES. (a) Let X be the sequence in R consisting of the
natural numbers
x= (1,2, ... , n, ...).
Let x be any real number and consider the neighborhood 11 of x consist-
ing of the open interval (x - 1, x + 1). According to Theorem 5.14
there exists a natural number k o such that x + 1 < k o; hence, if n > k o,
it follows that X n = n does not belong to Y. Therefore the subsequence
X' = (k o, k o + 1, ... )
of X has no points in V, showing that X does not converge to .t.
(b) Let Y = (Yn) be the sequence in R consisting of Y = ( - 1, 1, ... ,
(_l)n, ... ). We leave it to the reader to show that no point y, exc.ept
possibly y = 1, can be a limit of Y. \Ve shall show that the point
?J = - 1 is not a limit of Y; the consideration for y = + 1 is entirely
similar. Let V be the neighborhood of y = - 1 eonsisting of the open
interval (-2, 0). Then, if n is even, the element Yn = (-1)11. = +1
108 CH. III CONVERGENCE
Since this can be done for arbitrary > 0, we infer that X + Y con-
verges to x + y. Precisely the same argument can be used to show that
X - Y converges to x - y.
To prove that X Y converges to X'y, we make the estimate
IXn'Yn - ,rYI = !(Xn'Yn - xn,y) + (xn,y - x'Y)1
< IXn'(Yn - y)1 + I(x n - x)YI.
Using the C.-B.-S. Inequality, we obtain
(11.3) \Xn'Yn - x'yl < IxnllYn - yl + \x n - xllyl
SEC. 11 INTRODUCTION 'fO SEQUENCES 107
Exercises
11.A. If (x n) and (Yn) are convergent sequences of real numbers and if X n .::; Yn
for all n E N, then lim(x n) .:::; lim(Yn).
'lI 11.:8. If X = (x n) and Y = (Yn) are sequences of real numbers which both
converge to c and if Z = (zn) is a sequence such that XI' < Zn < Yn for n E N,
then .~ also converges to c.
~ 11.C. For X n given by the following formulas, either establish the convergence
or th~ divergence of the sequence X = (x n ):
( _l)nn
(a) X =--,
n
(b) Xn = ,
n n + 1 n+I
(c) Xn =
2n
, (d) Xn =
2n 2 +3,
2
3n + I 3n 2
+I
(e) Xn = n2 - n, (f) Xn = sin(n).
with < r < 1 and some A > 0, then < X n < Arn for large n.)
I1.R. If, in Exercise I1.Q, lim (x n l / n ) > 1, then the sequence X is divergent.
11.S. Give an example of a sequence X = (x n ) of positive numbers with
lim (x n l / n ) = 1 and such that lim X = O. Also, give an example of a divergent
sequence X such that lim (x n l / n ) = 1.
n.T. Re-examine the convergence of Exercises I1.M, N, 0 in the light of
Exercises 11.P, Q, R.
l1.U. If < b < a and if X n = (an + bn)l/n, then lim (x n) = a.
H.V. If x = lim (x n ) and if IX n - cl < E for all n E N, then is it true that
Ix - ci < e?
Projects
n.a. Let d be a metric on a set M in the sense of Exercise 7.N. If X = (x n )
is a sequence in M, then an element x in M is said to be a limit of X if, for each
positive number E there exists a natural number K(e) such that if n > K(E),
then d(xn , x) < E. Use this definition and show that Theorems 11.5, 11.6, 11.10,
11.11, 11.12 can be extended to metric spaces. Show that the metrics dl , d2, do:)
in Rp give rise to the same convergent sequences. Show that if d is the discrete
metric on a set, then the only sequences which converge relative to d are those
sequences which are constant after some point.
11.{:l. Let 8 denote the collection of all sequences in R, let m denote the collec-
tion of all bounded sequences in R, let c denote the collection of all convergent
sequences in R, and let Co denote the collection of all sequences in R which
converge to zero.
(a) With the definition of sum given in Definition 11.2 and the definition of
product of a sequence and real number given by a(Xn) = (ax,.,), show that each
of these collections has the properties of Theorem 7.3. In each case the zero
element is the sequence 8 == (0,0, ...,0, ...). (We sometimes say that these
collections are linear spaces or vector spaces.)
(b) If X = (x n ) belongs to one of the collections m, c, Co, define the norm of
X by IXI = supllxnl:n E N}. Show that this norm function has the properties
of Theorem 7.8. (For this reason, we sometimes say that these collections are
normed linear spaces.)
(c) If X and Y belong to one of these three collections, then the product XY
also belongs to it and IXYI < IXII YI. Give an example to show that equality
may hold and one to show that it may fail.
(d) Suppose that X = (x n) and Y = (Yn) belong to m, C, or Co, and de-
fine d(X, Y) = sup {Ixn - Ynl: n E N I. Show that d yields a metric on each
of these collections.
SEC. 12 CRITERIA FOR THE CONVERGENCE OF SEQUENCES 111
Until now the main method available for showing that a sequence is
convergent is to identify it as a subsequence or an algebraic combination
of convergent sequences. When this can be done, \ve are able to calculate
the limit using the results of the preceding section. However, when this
cannot be done, we have to fall back on Definition 11.3 or Theorem 11.4
in order to establish the existence of the limit. The use of these latter
tools has the noteworthy disadvantage that we must already know (or
at least suspect) the correct value of the limit and we merely verify that
our suspicion is correct.
There are many cases, however, where there is no obvious candidate
for the limit of a given sequence, even though a preliminary analysis has
led to the belief that convergence does take place. In this section we
give some results which are deeper than those in the preceding section
and which can be used to establish the convergence of a sequence when
no particular element presents itself as a candidate for the limit. In fact
we do not even determine the exact value of the limit of the sequence.
The first result in this direction is very important. Although it can be
generalized to Rp, it is convenient to restrict its statement to the case of
sequences in R.
x* - f ~ X* {X * + f
I I I I 1111111 111 1111 I
X2 X3
Figure 12.1
whence it follows that Ix - sup {X n } I < t. Since this holds for all > 0,
we infer that lim (x n ) = x = sup {x n }.
Conversely, suppose that X = (x n ) is a bounded monotone increasing
sequence of real numbers. According to the Supremum Principle 6.6, the
supremum x* = sup {x n } exists; we shall show that it is the limit of X.
Since x* is an upper bound of the elements in X, then X n < x* for n E N.
Since x* is the supremum of X, if > 0 the number x* - E is not an
upper bound of X and exists a natural number K such that
x* - E < XK.
Y = (2y + 3)/4.
Theref Dre, we conclude that Y = t-
(c) Let Z = (zn) be the sequence in R defined by
Zl = 1, Zn+1 = ~ for n E N.
It is clear that ZI < Z2 < 2. If Zn < Zn+l < 2,
then 2z n < 2z n+1 < 4 and
Zn+l = V2Z: < Zn+2 = V 2Z n+1 < 2 = 0. (Why?) This shows that
Z is a monotone increasing sequence which is bounded above by 2; hence
Z converges to a number z. It may be shown directly that 2 = sup {.Zn}
so tha'~ the limit z = 2. Alternatively, we can use the method of the
precedlng example. Knowing that the sequence has a limit z, we con-
clude from the relation Zn+l = V2zn that z must satisfy z = y'2Z. To
find the roots of this last equation, we square to obtain Z2 = 2z, which
has roots 0, 2. Evidently 0 cannot be the limit; hence this limit must
equal 2.
(d) Let U = (Un) be the sequence of real numbers defined by Un =
(1 + l/n)n for n E N. Applying the Binomial Theorem, we can write
n 1 n(n - 1) 1 n(n ~ l)(n - 2) 1
Un = 1 + -1 -n + 2'. --;
n + 3'. n +
"3
+ n(n - 1) .. , 21 ~.
n! nn
111,. CH. III CONVERGENCE
Un = 1+ 1+~
2!
(1 - !)n + ~ (1 - !)n (1 - n~)
31
Note that the expression for Un contains n + 1 terms and that for Un+!
contains n + 2 terms. An elementary examination shows that each term
in Un is no greater than the corresponding term in Un+l and the latter has
one more positive term. Therefore, we have
Ul < U2 < ... < Un < Un+l < ....
To show that the sequence is bounded, we observe that if p = 1, 2, ..., n,
then (1 - pin) < 1. Moreover, 2P-l < p! (why?) so that lip! < 1/2p-l.
From the above expression for Un, these estimates yield
1 1 1
2 < U
n
< 1 + 1 + -2 + -22 + ... + -2n-1'
<3 n > 2.
In fact, this is what the second part of the proof of 12.4 established.
'Ye now introduce the important notion of a Cauchy sequence in Rp.
It will turn out later that a sequence in Rp is convergent if and only if
it is a Cauchy sequence.
12.6 DEFINITION. A sequence X = (x n ) in Rp is said to be a Cauchy
sequence in case for every positive real number E there is a natural num-
ber M(E) such that if m, n > M(E), then IX m - xnl < E.
116 CR. III CONVERGENCE
Now let n be any natural number such that n > M(f). It follows that
(*) holds for this value of n and for m = K. Thus
=-+
2
1 ...
10-2
1
+-=-12
1( 1
1+-+",+
2 2 m- 2 n- 1
1) <-.
1
m - n- 1 2 10- 2
111
= 1 xs
x1 ,
= 1 + -2 , Xli = 1 + -2 + -23 ,...,
111
X2n+1 = 1 + -2 + 23 + ... + 2--,
21'1-1
...
It follows that
X2n.H = 1 + -2 1(1
+ - + ... + -1)
1
4 4n - 1
1 1 - 1/41'1
=1+-
2 1 - 1/4
=1+-32 ( 1 -41-) n
.
1
Xl = - , X2 = -
I!
1
l!
-
1
- , ..., x
2! n
= -
1
- -
I! 2!
1
+ ... + ( -1)
n!
nH
' ...
Since this sequence is not monotone, a direct application of the Mono-
tone Convergence Theorem is not possible. Observe that if m > n, then
( - 1) nH (- 1) 1'1+3 (- 1) m
X
n
- X
m
= (n +
I)! + (n + 2)! + ... + m! .
1 1 1
Xn = 1" + 2 + ... + ;; fornEN,
Exercises
12.A. Give the details of the proof of Corollary 12.2.
12.B. Let Xl be any real number satisfying Xl > 1 and let X2 = 2 - Ilxl,
, Xn+l = 2 - l/x n , Using induction, show that the sequence X = (x n )
is monot.one and bounded. What is the limit of this sequence?
12.C. Let Xl = 1, X2 = y!2 + Xl , .. , Xn+l = y!2 +
X n , Show that the
sequence (x n ) is monotone increasing and bounded. What is the limit of this
sequence?
12.D. If a satisfies 0 < a < 1, show that the real sequence X = (an) con-
verges. Since Y = (a2n ) is a subsequence of X, it is also convergent and
lim X = lim Y = (lim X)2.
Use this to show that lim X = O.
12.E. Show that a sequence of real numbers has either a monotone increasing
subsequence or a monotone decreasing subsequence. Give an example of a
sequence with both a monotone increasing subsequence and a monotone decreas-
ing subsequence.
12.F. Use Exercise 12.E. to prove the Bolzano-Weierstrass Theorem for
sequences in R.
12.G. Give an example of a sequence in Rp which has no convergent sub-
sequence.
12.H. Consider the convergence of the sequence X = (x n ), where
1
Xn = - - + -1- + ... + -1 for n E N.
n +1 n +2 2n
12.1. Let X = (x n) and Y = (Yn) be sequences in Rp and let Z = (Zn) be the
"shuffled" sequence in Rp defined by
Is it true that Z is convergent if and only if X and Yare convergent and lim X =
lim Y?
12.J. Show directly that the sequences
lim (x;:') ~ L,
and if E > 0, then there exist positive numbers A, B and a number K such that
A(L - E)n < Xn < B(L + E)n for n> K.
Use Example 11.8(d) and prove that lim (x n 1In ) = L.
12.M. Apply Example 12.3(d) and the preceding exercise to the sequence
(nn/n!) to show that
. (
I1m n ) = e.
(n!)l!n
12.N. If X = (x n ) is a sequence in R and x > 0, then is it true that lim X = x
if and only if
lim (xn +- xx)
Xn
= O?
Prove that 0,2 < b2 and, by induction, that an < bn . Show that the sequences
(an) and (b n ) converge to the same limit.
12.R. If X = (x n ) is not a Cauchy sequence in Rp, then does there exist an
unbounded subsequence of X?
12.S. If Xn belongs to a subset A of Rp, and X n -:;z!:. x for all n E N and if x =
lim(x n ), then x is a cluster point of A.
12.T. If x is a cluster point of a subset A of Rp, then does there exist a sequence
(Xn) of elements of A which converges to x?
12.U. Prove the Cantor Intersection Theorem 9.4 by taking a point Xn in F n
and applying the Bolzano-Weierstrass Theorem 12.4.
12.V. Prove the Nearest Point Theorem 9.6 by applying the Bolzano-Weier-
strass Theorem 12.4.
12.W. Prove that if K 1 and K 2 are compact subsets of Rp, then there exist
points Xl in K 1 and X2 in K 2 such that if Zl E K l and Z2 E K 2 , then IZI - z21 >
IXI - x21
12.X. If K I and K2 are compact subsets of Rp, then the set K = Ix + y:
x E K 1, Y E KzI is compact.
SEC. 13 SEQUENCES OF FUNCTIONB 121
Projects
II
/
_
:=-------- '3
-=~~~~~:=======.:..fk~ 1--
Figure 13.1
Figure 13.2
cn. IIt CONVERGENCE
il / / k f
Figure 13.3
Figure 13.4
Figure 13.5
preceding lemma is useful to shnw that Examples 13.2 (a~c) do not con-
verge uniformly on the given sets Do.
13.6 EXAMPLES. (a) We consider Example 13.2 (a). If nk = k and
Xk = k, then fk (Xk) = 1 so that
Ifk(Xk) - f(Xk) I = 11 - 01 = 1.
This shows that the sequence (fn) does not converge uniformly on R
to f.
(b) We consider Example 13.2(b). If nk = k and Xk = (!)1I\ then
Ifk(xk) - f(Xk) I = Ifk(Xk) I = !.
Therefore, we infer that the sequence (In) does not converge uniformly
on {x E R : 0 < x < 1} to f.
(c) We consider Example 13.2(c). If nk = k and Xk = k, then
Ih(xk) - f(Xk) I = k,
showing that (fk) does not converge uniformly on R to f.
(d) We consider Example 13.2(d). Then, since
Ifn(x) - f(x)1 < lin
for all x in R, the sequence (fn) converges uniformly on R to f. However,
if we restrict our attention to D = [0, 1] and shuffle (fn) with (gn),
where Yn(x) = xn, the resulting sequence (h n) converges on D to the
zero function. That the convergence of (h n ) is not uniform can be seen
by looking at the subsequence (gn) = (h 2n ) of (h n).
In order to establish uniform convergence it is often convenient to
make use of the notion of the norm of a function.
13.7 DEFINITION. If f is a bounded function defined on a subset D
of R P and with values in R q, the D-norm of f is the real number given by
(13.5) IlfilD = sup {If(x)1 : xED}.
'Vhen the subset D is understood, we can safely omit the subscript on
the left side of (13.5) and denote the D-norm of f by Ilfll.
13.8 LEMMA. If f and g are bounded functions defined on D c R P
to Rq, then the D-norm satisfies:
(a) Ilfll = 0 if and only if f(x) = () for all XED.
(b)llefll = leillfil for any real number c.
(c)Illfll - Jlglll < Ilf + gil < Ilfll + Ilgll
PROOF. (a) If f(x) = e for all xED, then If(x) I = lei = 0 for all
xED so that Ilfll = sup {If(x)l:x E D} = O. Conversely, if there exists
128 CR. III CONVERGENCE
an element Xo E D with f(xo) =;t. (), then If(xo) I > 0 and hence Ilfll >
If(xo) I > O.
(b) This follows since lef(x) I = Iellf(x)l.
(c) According to the Triangle Inequality 7.8(iv),
If(x) + g(x)] < Ifex) I + Ig(x)l,
and by Definition 13.7 the right-hand side is dominated, for each.x E D,
by I[fll + llgl1. Therefore, this last number is an upper bound for the
set {If(x) + g(x)\ : xED}, so we conclude that
The reader will have noted that the set of bounded functions on D
to R q admits a function which possesses some of the same properties
as the distance function in R q. The fact that the D-norm, as defined in
Definition 13.7, satisfies the Norm Properties 7.8 is sometimes summa-
rized by saying that the set of bounded functions on D c Rp to Hq is a
normed linear space. Although such ideas are of considerable interest
and importance, we shall not pursue this line of thought any further,
but content ourselves with the connection between the D-norm and
uniform convergence on the set D.
13.9 LEMMA. A sequence (jn) of bounded functions on D c Rl' to Rq
converges uniformly on D to a function f if and only if
Ilfn - fll ~ o.
PROOF. If the sequence (fn) converges to f uniformly on D, then for
f > 0 there is a natural number K(f) such that if n > K(f) and.r E D,
then Ifn(x) - f(x)! < f. This implies that if n > K(f), then
Ilfn - fll = sup {Ifn(x) - f(x) I: xED} < E.
Hence Ilfn - fl \ converges to zero.
Conversely, if Ilfn - fll converges to zero, then for e > 0 and xED
we have
Ifn(x) - f(x) I < IIfn - fll < E,
provided that n > K(f). Therefore, if xED and n > K(f), then
Ifn(x) -f(x)1 < E.
This shows that the sequence Un) converges uniformly on D to the
function f.
Q.E.D.
SEC. 13 SEQUENCES OF FUNCTIONS 129
III
1
x\ 0<- x < I}
11fn - = sup {
, x = 1
= 1 for n E N.
Since this Do-norm does not converge to zero, we infer that the sequence
(fn) does not converge uniformly on Do = [0, 1] to f. This bears out our
earlier considerations.
(c) We consider Example 13.2(c). Once again we cannot apply
Lemma 13.9, since the functions are not bounded. Again, we choose a
smaller domain, taking D = [0, a] with a > O. Since
Ifn(x) - I(x) I =
X
Z
+ nx - x
x2
= -,
n n
the D-norm of f n - f is
Ilfn - fll = sup {l/n(x) - fex) I: 0 < x < a} = -.
n
Hence the sequence converges uniformly to f on the interval [0, aJ.
(Why does this not contradict the result obtained in Exercises 13.6(c)?)
130 CR. III CONVERGENCE
Applying Lemma 11.16, it follows that for m > M(E) and XED, then
Ifm(x) - j(x)1 < E.
Therefore, the sequence (jm) converges Wliformly on D to the fWlction f.
Q.E.D.
Exercises
In these exercises you may make use of the elementary properties of the trig-
onometric and exponential functions from earlier courses.
l3.A. For each n E N, let fn be defined for x > 0 by fn(x) = 1/(nx). For
what values of x does lim Un (x)) exist?
13.B. For each n E N, let g.. be defined for x > 0 by the formula
gn(X) = nx, 0 < x < lin,
I
=-,
nx
lin < x,
Show that lim (g.. (x)) = 0 for all x > O.
l3.C. Show that lim cos 71'X)2n) exists for all values of x. What ~s its limit?
I3.D. Show that, if we define fn on R by
I3.H. Show that lim (e-nZ) exists for x 2: O. Also consider the existence of
lim (xe-nx).
13.1. Suppose that (xn ) is a convergent sequence of points which lies, together
with its limit x, in a set D c Rp. Suppose that Un) converges on D to the function
j. Is it true thatf(x) = lim (f,,(Xn)?
13.J. Consider the preceding exercise with the additional hypothesis that the
convergence of the (fn) is uniform on D.
13.K. Prove that the convergence in Exercise 13.A is not uniform on the
entire set of convergence, but that it is uniform for x ~ 1.
I3.L. Show that the convergence in Exercise 13.B is not uniform on the
domain x ~ 0, but that it is uniform on the set x 2: c, where c > O.
13.M. Is the convergence in Exercise 13.D uniform on R?
l3.N. Is the convergence in Exercise 13.E uniform on I?
13.0. Is the convergence in Exercise 13.F uniform on I? Is it uniform on
[e, 1] for c > O?
I3.P. Does the sequence (xe-nz) converge uniformly for x ~ O?
13.Q. Does the sequence (x 2e-nx ) converge uniformly for x ~ O?
13.R. Let (fn) be a sequence of functions which converges on D to a functionf.
If A and B are subsets of D and it is known that the convergence is uniform on
A and also on B, show that the convergence is uniform on A VB.
13.S. Let M be the set of all bounded functions on a subset D of }lp with
values in R9. If I, g belong to M, define d(f, g) = Iii - gil. Show that d is a
metric on M and that convergence relative to d is uniform convergence on D.
Give an example of a sequence of elements of M which is bounded relative to d
but which does not have a subsequence which converges relative to d.
Figure 14.1
number N 2(E) such that if n > N 2(E), then x - E < x n . Let N(e) =
sup {NI(e), N 2(E)}; if n > N(e), then IX n - xl < E, showing that
x = limX.
Q.E.D.
.
III case
1
1m
(kd)
IYnl = 1.
Cesaro Summation
t ERNESTO CESARO (1859-1906) studied in Rome and taught at Naples. He did work
in geometry and algebra as well as analysis.
138 CR. III CONVERGENCE
(14.2) rT n - X =
Xl + X2 + ... + X - n
x
n
1
= - I (Xl - x)
n
+ (Xz - X) + ... + (X n - X)}.
NA
jrT n - xl <-
n
+ n-N
n
for n > N (e).
We shall not pursue the theory of summability any further, but refer
the reader to books on divergent series and summability. For example,
see the books of E. Knopp, G. H. Hardy, and P. Dienes listed in the
References. One of the most interesting and elementary applications of
Cesaro summability is the celebrated theorem of Fejert which asserts
that a continuous function can be recovered from its Fourier series by
(14.3) X=
Xml X m2 X mn
Observe that, in this array, the first index refers to the row in which
the element X mn appears and the second index refers to the column.
14.9 DEFINITION. If X (X mn ) is a double sequence in Rp, then
=
an element x is said to be a limit, or a double limit, of X if for each
positive number e there is a natural number N (e) such that if m, n >
N(e) then
IX mn - xl < E.
In this case we say that the double sequence converges to x and write
x = lim (X mn ), or x = lim X.
mn
We shall not pursue in any more detail that part of the theory of
double sequences which is parallel to the theory of (single) sequences.
Rather, we propose to look briefly at the relation between the limit as
defined in 14.9 and the iterated limits.
To begin with, we note that a double sequence can be regarded, in
at least two ways, as giving a sequence of sequences! On one hand, we
can regard each row in the array given in (14.3) as a sequence in Rp.
Thus the first row in (14.3) yields the sequence Y 1 = (xln:n E N) =
(Xll, Xl2., . , Xln, . ); the second row in (14.3) yields the sequence
Y2. = (x2n:n E N); etc. It makes perfectly good sense to consider the
limits of the row sequences Y 1, Y2, ..., Y 71>, . (when these limits exist).
Supposing that these limits exist and denoting them by Yl, Y2, ..., Ym, ...,
we obtain a sequence of elements in Rp which might well be examined
for convergence. Thus we are considering the existence of y = lim (Ym).
Since the elements Ym are given by Ym = lim Y m where Y m = (X7I>n:
n EN), we are led to denote the limit y = lim (Ym) (when it exists) by
the expression
y = lim lim (x mn ).
m n
exists, and if for each natural number m the limit Ym = lim (X mn ) exists,
then the iterated limit lim lim (X mn ) exists and equals x. n
m n
PROOF. By hypothesis, given E > 0 there is a natural number N(E)
such that if m, n > N(E), then
IX mn - xl < E.
Again by hypothesis, the limits Ym = lim (X mn ) exist, and from the above
n
inequality and Lemma 11.16 it follows that
[Ym - xl < E, m > N(E).
Therefore, we conclude that x = lim (Ym).
Q.E.D.
The preceding result shows that if the double limit exists, then the
only thing that can prevent the iterated limits from existing and being
equal to the double limit is that the" inner" limits may not exist. More
precisely, we have the following result.
14.12 COROLLARY. Suppose the double limit exists and that the limits
Ym = lim (X mn ), Zn = lim (x mn )
n m
X = { I , m n, I
mn
0, m - n.
Here both iterated limits exist and are equal, but the double limit does
not exist. However, under some additional conditions, we can establish
the existence of the double limit from the existence of one of the
iterated limits.
14.13 DEFINITION. For each natural number m, let Y m = (X mn ) be
a sequence in Rp which converges to Ym' We say that the sequences
{Y m : mEN} are uniformly convergent if, for each e > 0 there is a
natural number N(E) such that if n > N(e), then IX mn - Yml < E for
all natural numbers m.
The reader will do well to compare this definition with Definition
13.4 and observe that they are of the same character. Partly in order
to motivate Theorem 14.15 to follow, we show that if each of the
sequences Y m is convergent, then the existence of the double limit
implies that the sequences {Y m : mEN 1 are uniformly convergent.
14.14 LEMMA. If the double limit of the double sequence X = (X mn )
exists and if, for each natural number m, the sequence Y m = (Xmn: n E N)
is convergent, then this collection is uniformly convergent.
PROOF. Since the double limit exists, given e > 0 there is a natural
number N(e) such that if m, n > N(e), then IX mn - xl < e. By hypothe-
sis, the sequence Y m = (X mn : n E N) converges to an element Ym and,
applying Lemma 11.16, we find that if m > N(e), then IYm - xl < e.
Thus if m, n > N(e), we infer that
IXmn - Yml < IXmn - xl + Ix - Yml < 2E.
In addition, for m = 1,2, ..., N(e) - 1 the sequence Y m converges to
Ym; hence there is a natural number K(f) such that if n > K(e), then
IX mn - Yml <
m = 1, 2, ..., N(e) - l.
E,
Letting M(e) = sup {N(e), K(e) L we conclude that if n > M(e), then
for any value of m we have
IXmn - Yml < 2e.
This establishes the uniformity of the convergence of the sequences
{Ym:mEN}.
Q.E.D.
SEC. 14 SOME EXTENSIONS AND APPLICATIONS
The preceding lemma shows that, under the hypothesis that the
sequences Y m converge, then the uniform convergence of this collection
of sequences is a necessary condition for the existence of the double
limit. We now establish a result in the reverse direction.
14.15 ITERATED LIMIT THEOREM. Suppose that the single limits
Ym = lim (X mn ), Zn = lim (X mn ), m, n E N,
n m
exist and that the convergence of one of these collections is uniform. Then
both iterated limits and the double limit exist and all three are equal.
PROOF. Suppose that the convergence of the collection 1Y m : mEN 1
is uniform. Hence given e > 0, there is a natural number N(e) such
that if n > N(e), then
(14.4) IXmn - Ym! < e
for all natural numbers m. To show that lim (Ym) exists, take a fixed
number q > N(e). Since Zq = lim (x rq : r E N) exists, we know that if
r, s > R(e, q), then
We now show that the double limit exists. Since Y = lim (Ym), given
e> 0 there is an M(e) such that if m > M(e), then IYm - yl < E.
I;etting K(e) = sup {N(e), M(e)}, we again use (14.4) to conclude that
if m, n > K(e), then
Exercises
14.A. Find the limit superior and the limit inferior (when they exist) of the
following sequences:
(a) (_1)n), (b) (1 + (-l)n), (c) (-l) nn)
H.B. Show that if lim (x,,) exists, then lim sup (x,,) = lim (x n ).
14.C. Show that if X = (x n ) is a bounded sequence in R, then there exists a
subsequence of X which converges to lim inf X.
14.D. Give the details of the proof of Theorem 14.3.
14.E. Formulate the theorem corresponding to Theorem 14.3 for the limit
inferior.
14.F. Give the direct proof of the part (c) of Theorem 14.4 and a proof using
the other parts of this theorem.
14.G. Prove part Cd) of Theorem 14.4 by using property (b) in Theorem 14.3 as
the definition of the limit superior. Do the same using property (d). Property (e).
14.H. If X is a sequence of positive elements, show that
14.W. Which of the double sequences in Exercise 14.T are such that the
collection { Y m = lim (Xmn):m E N} is uniformly convergent?
14.X. Let X = (Xmn) be a bounded double sequence in R with the property
that for each mEN the sequence
Y m = (xmn:n E N)
Continuous Functions
J46
SEC. 15 LOCAL PR01>ERTIE8 OF CON'l'INUOUS FUNCTIONS 11,.7
Figure 15.1
(Xl") to a, there exists a natural number N (6(E such that if n > N (6(E,
then IX n - al < 6(E). Since each Xn E:O, it follows from (b) that
If(x n ) - f(a) I < E, proving that (c) holds.
Finally, we shall argue indirectly and show that if condition (a) does
not hold, then condition (c) does Dot hold. If (a) fails, then there exists
a neighborhood V o of f(a) such that for any neighborhood U of a, there
is aD element Xu belonging to :0 n U but such that f(xu) does not belong
to Vo. For each natural number n consider the neighborhood Un of a
defined by Un = {x E R p : IX - a I < 1/n} ; from the preceding sentence,
for each n in N there is an element XI" belonging to :D nUn but such that
f(x n ) does not belong to Vo. The sequence (XI") just constructed belongs
to :D and converges to a, yet none of the elements of the sequence (f(x n
belong to the neighborhood Vo of f(a). Hence we have constructed a
sequence for which the condition (c) does not hold. This shows that
part (c) implies (a).
Q.E.D.
Before we push the theory any further, we shfJ,ll pause to give some
examples. For simplicity, most of the examples are for the case where
Rp = Rq = R.
SEC. 15 LOCAL PROPERTIES OF CONTINUOUS FUNCTIONS 149
15.5 EXAMPLES. (a) Let :D = R and let f be the "constant"
function defined to be equal to the real number c for all real numbers x.
Thenfis continuous at every point of R; in fact, we can take the neigh-
borhood U of Definition 15.1 to be equal to R for any point a in :D.
Similarly, the function g defined by
g(x) = 1, 0 < x < 1,
= 2, 2 < x <3,
is continuous at each point in its domain.
(b) Let :D = R and let f be the "identity" function defined by
f(x) = x, x E R. (See Figure 15.2.) If a is a given real number, let
E > 0 and let O(E) = E. Then, if Ix - zl < 6(e), we have If(x) - f(a) I =
\x - a\ < E.
f(a) +E
f(a)
f(a) - E
a-o a a+o
Figure 15.2
(d) We consider the same function as in (c) but use a slightly different
technique. Instead of factoring x 2 - a21 we write it as a polynomial in
x-a. Thus
x2 - a2 = (x 2 - 2ax + a2) + (2ax - 2a2 ) = (x - a)2 + 2a(x - a).
Using the Triangle Inequality, we obtain
If(x) - f(a) I < Ix - al 2 + 2lallx - al
If 5 < 1 and lx - a\ < 5,
then Ix -
a\2 < 52 < 5 and the term on the
right side is dominated by 5 + 2\a\5 = 5(1 + 2Ia\). Hence we are led to
choose
0(') 1
= in! 1, 1 +'zlal}'
(e) Consider ~ = {x E R:x r! O} and letjbe defined by f(x) = l/x,
x E :D. If a E ~1 then
f(x) = 1, if x is rational,
= 0, if x is irrationaL
I I I I
3 1 1 1 1
n 3" "2
Figure 15.3
CombinatiollS of Functions
have domain 5)(h) = Ix E 5)(j) :f(x) E 5) (g) I and for x in 5)(h) we set
hex) = g[f(x)]. Thus h = go f is a function mapping 5)(h), which is a
subset of 5)(f) c Rp, into R~. We now establish the continuity of this
function.
15.8 THEOREM. Iff is continuous at a and 9 is continuous at b = f(a),
then the compositian 9 a f is continuous at a.
PROOF. Let W be a neighborhood of the point c = g(b). Since 9 is
continuous at b, there is a neighborhood V of b such that if y belongs to
V (\ 5)(g), then gCy) E W. Since f is continuous at a, there exists a
neighborhood U of a such that if x belongs to U (\ 5)(j), thenf(x) is in V.
f g
Figure 15.4
linear Functions
The preceding discussion pertained to general functions defined on a
part of Rp into Rq. vVe now mention a simple but extremely important
special kind of function, namely the linear functions. In most applica-
tions, the domain of such functions is all of R P, and so we shall restrict
our attention to this case.
15,9 DEFINITION. A function f with domain Rp and range in Rq is
said to be linear if, for all vectors x, y in R P and real numbers c,
(15.3) f(x + y) = f(x) + fey), f(cx) = cf(x).
Often linear functions are called linear transformations.
It is readily seen that the functions in Examples 15.6(b) and 15.6(i)
are linear functions for the case p = q = 1 and p = q = 2, respectively.
In fact, it is not difficult to characterize the most general linear function
from Rp to Rq.
SEC. 15 LOCAL PROPERTIES OF CONTINUOUS FUNCTIONS 155
(15.4)
. . . . . .. . . . . . .. . , .
l1q = Cql~l + Cq2~2 + + Cq~p. 0 0 0
the images of these vectors under the linear function 1. Suppose that
feel) - (Cn, C21, 0 0 0' Cql),
f(e2) (Cl2, C22, 0 0 ., Cq2),
(15.5)
. . . .. . . . . . . .
f(e p ) = (CIP, C2p, 0 0 ., Cqp)o
Thus the real number Cij is the ith coordinate of the point f(ei)'
An arbitrary element x = (~1, ~2, ~p) of Rp can be expressed simply 0 0 .,
This shows that the coordinates of f(x) are given by the relations
(15.4), as asserted.
Conversely, it is easily verified by direct calculation that if the rela-
tions (15.4) are used to obtain the coordinates rtf of y from the coordinates
h of x, then the resulting function satisfies the relations (15.3) and so is
linear. We shall omit this calculation, since it is straight-forward.
Q.E.D.
(15.6)
Cq l C q2 .,. C qp
Exercises
15.A. Prove that if j is defined for x >
0 by j(x) = yx, then j is continuous
at every point of its domain.
I5.B. Show that a polynomial function j, given by
j(x) = anzn + an_lXn- 1 + ... + a\x + ao, x E R,
is continuous at every point of R.
15.C. Show that a rational function (that is, the quotient of two polynomials)
is continuous at every point where it is defined.
15.D. Using the C.-B.-S. Inequality, show that one can take a(e) = E/-VI5
in Example 15.5(i).
158 CR. IV CONTINUOUS FUNCTIONS
15.S. If f denotes the linear function of Exercise 15.R, show that not every
vector in R3 is the image under f of a vector in R2.
15.T. Let g be any linear function on R2 to Rs, Show that not every element
of R s is the image under g of a vector in R 2,
15.D. Let h be any linear function on R3 to R2. Show that there exist non-
~,ero vectors in R 3 which are mapped into the zero vector of R 2 by h.
15.V. Letf be a linear function on R2 to R2 and let the matrix representation
of! be given by
d/.6 -b/.6]
[ -e/.6 a/.6
15.X. Let g be a linear function from Rp to Rq. Show that g is one-one if and
only if g(x) = e implies that x = 8.
15.Y. If h is a one-one linear function from Rp onto Rp, show that the inverse
h- is a linear function from Rp onto Rp.
1
15.Z. Show that the sum and the composition of two linear functions are
Unear functions. Calculate the corresponding matrices for p = 2, q = 3, r = 2.
160 CR, IV CONTINUOUS FUNCTIONS
Preservation of Connectedness
We recall from Definition 8.14 that a set H in Rv is disconnected if
there exist open sets A, Bin Rv such that A n Hand B n H are disjoint
non-empty sets whose union is H. A set is connected if it is not
disconnected.
16.3 PRESERVATION OF CONNECTEDNESS. If H is connected and f is
continuous on H, then f(H) is connected.
PROOF. Assume that feR) is disconnected in Rq, so that there exist
open sets A, B in Rq such that A nf(H) and B nf(H) are disjoint
non-empty sets whose union isf(H). By the Global Continuity Theorem
16.1, there exist open sets AI, B I in Rv such that
Al n H :::= I-I (A), B I n H = f-I(B).
These intersections are non-empty and their disjointness follows from
the disjointness of the sets A nf(H) and B nf(H). The assumption
that the union of A nf(H) and B nf(H) isf(H) implies that the umon
of Al n Hand B I n H is H. Therefore, the disconnectedness of feR)
implies the disconnectedness of H.
Q.E.D.
Preservation of Compactness
We now demonstrate that the important property of compactness is
preserved under continuous mapping. In the discussion to follow, we
do not assume a close familiarity with Section 9, and we shall offer two
proofs of the main results to be presented here. We recall that it is a
consequence of the important Heine-Borel Theorem 9.3 that a subset
K of RP is compact if and only if it is both closed and bounded in Rp.
Thus the next result could be rephrased by saying that if K is closed
and bounded in Rp and if f is continuous on K and with range in Rq,
then f(K) is closed and bounded in R q.
16.5 PRESERVATION OF COMPACTNESS. If K is compact and f is
clmtinuous on K, then f(K) is compact.
FIRST PROOF. We assume that K is closed and bounded in Rp and
shall show thatf(K) is closed and bounded in Rq. If f(K) is not bounded,
for each n E N there exists a point X n in K with If(x n ) I > n. Since K is
bounded, the sequence X = (x n ) is bounded; hence it follows from the
Bolzano-Weierstrass Theorem 12.4 that there is a subsequence of X
which converges to an element x. Since X n E K for n E N, the point x
belongs to the closed set K. Hence f is continuous at x, so f is bounded
by If(x) I + 1 on a neighborhood of x. Since this contradicts the assump-
ti.on that I/(x n ) I > n, the set f(K) is bounded.
We shall prove that f(K) is closed by showing that any cluster point
y of f(K) must be contained in this set. In fact, if n is a natural number,
there is a point Zn in K such that
If(z.. ) - yl < lin.
By the Bolzano-Weierstrass Theorem 12.4, the sequence Z = (Zn) has
a subsequence Z' = (Z.. (k) which converges to an element z. Since K
iB closed, then Z E K and f is continuous at z. Therefore,
fez) = lim (f(Zn(k)) = y,
k
silnce the norm of the vector x/lxl is 1, it follows that if M and mare
the supremum and the infimum of If(u) I for u in S, then
mlxl < If(x) I < Mlxl,
for all x in Rp. We have already seen in Theorem 15.11 that if f is linear
on Rp to Rq, then there exists a positive constant M such that
If(x) I <Mlx\, x E RP,
and this provides an alternative proof. However, it is not always true
that there is a positive constant m such that
all of Rq). One could modify the argument in Theorem 15.11 to show
that this inverse function is continuous. Such modification is not neces-
sary, however] as the continuity of f- l follows from Corollary 16.8.
For, if YI, ys belong to the domain of f- 1 (= the range of f), then there
exist unique elements Xl, X2 in Rp such that f(XI) = YI and f(X2) = V2;
hence f(XI - X2) = f(Xl) - f(x2) = Yl - Y2. From Corollary 16.8, we
infer that
mixi - X2! < If(xi - X2) 1= IYI - Y21.
Since Xl = f-l(YI) and X2 = f- l (Y2), then
From the Global Continuity Theorem 16.1 (c), we infer that g = i-I
is continuous.
Q.E.D.
Uniform Continuity
inf U2
:u > 0) = 0,
11 + eu
we cannot obtain a positive a(E, u) which is independent of the choice
o~t u for all u > O.
We shall now restrict g to a smaller domain. In fact, let a > 0 and
define hex) = l/x for x > a. Then the analysis just made shows that
we can use the same value of a(, u). However, this time the domain
is smaller and
EU2 ) ta2
>a
inf
11 + tU
:u
-
=
1 + ea > O.
Hence if we define a(e) = ea 2 /(1 + ea), then we can use this number for
all points u > a.
168 CR. IV CONTINUOUS FUNC'l'IONS
In order to help fix these ideas, the reader should look over Examples
15.5 and determine in which examples the 0 was chosen to depend on
the point in question and in which ones it was chosen independently
of the point.
With these preliminaries we now introduce the formal definition.
16.10 DEFINITION. Let f have domain ~ in Rp and range in Rq.
We say that f is uniformly continuous on ~ if for each positive real
number Ethere is a positive number O(E) such that if x and u belong to
~ and Ix - ul < a(E), then If(x) - feu) I < f.
(16.5) n EN.
We shall show that the sequence (x n ) converges to a unique fixed point
u of f and estimate the rapidity of the convergence.
To do this, we observe that
(16.7)
(16.8)
equation (16.5), then X converges to the unique fixed point u of f with the
rapidity estimated by (16.8).
In case the function f is not defined on all of Rp, then somewhat
more care needs to be exercised to assure that the iterative definition
(16.5) of the sequence can be carried out and that the points remain in
the domain of f. Although some other formulations are possible, we
shall content ourselves with the following one.
16.16 THEOREM. Suppose that f is a contraction with constant C
which is defined for:D = {x E Rp = Ixl < B} and that If(O) 1< B(l - C).
Then the sequence
Xl = 0, X2 = f(x), ..., X n+l = f(x n ), .
converges to the unique fixed point of f which lies in the set :D.
PROOF. We shall check only that the sequence (x n ) remains in 5:>.
By hypothesis, IX21 = If(O) I < B (1 - C) < B whence X2 E 5:>. Thus
X3 = f(X2) can be defined and
jxa - X2! = If(x2) - f(xd I < C jX2 - 81 = C IX21
172 CR. IV CONTINUOUS FUNCTIONS
Therefore,
IX31 < IX21 + c lx21 = (1 + C)IX21 < B(l - C2).
This argument can be continued inductively to prove that IXn+11 <
B(l - en). Hence the sequence (x n) lies in the set 1> and the result
follows as before.
Q.E.D.
Exercises
16.A. Interpret the Global Continuity Theorem 16.1 for the real functions
f(x) = x2 and g(x) = l/x, x =;t. O. Take various open and closed sets and con-
sider their inverse images under f and g.
16.B. Let h be defined on R by
Exhibit an open set G such that h-1(G) is not open, and a closed set F such that
h-1(F) is not closed.
16.0. If f is defined and continuous on Rp to R and if f(xo) > 0, then is f
positive on some neighborhood of the point xo? Does the same conclusion follow
if f is merely continuous at the point xo?
16.J. Show that every polynomial of odd degree and real coefficients has a
real root. Show that the polynomial p(x) = x 4 + 7x3 - 9 has at least two
real roots.
I6.K. If c > 0 and n is a natural number, there exists a unique positive num~
ber b such that bn = c.
16.L. Let f be continuous on I to R with f(O) < 0 and f(l) > O. If N =
Ix E I: f(x) < 0), and if c = sup N, show that f(c) = O.
16.M. Let f be a continuous function on R to R which is strictly increasing in
the sense that if x' < x" thenf(x') < f(x"). Prove thatf is one-one and that its
inverse function f- 1 is continuous and strictly increasing.
16.N. Letf be a continuous function on R to R which does not take on any
of its values twice. Is it true that f must either be strictly increasing or strictly
decreasing?
16.0. Let {I be a function on I to R. Prove that if g takes on each of its values
exactly twice, then g cannot be continuous at every point of I.
16.P. Let f be continuous on the interval [0,211"] to R and such that f(O) =
+
f(211"). Prove that there exists a point c in this interval such thatf(c) = fCc 11").
(Hint: consider g(x) = f(x) - f(x + 11").) Conclude that there are, at any time,
antipodal points on the equator of the earth which have the same temperature.
16.Q. Consider each of the functions given in Example 15.5 and either show
that the function is uniformly continuous or that it is not.
16.R. Give a proof of the Uniform Continuity Theorem 16.12 by using the
Lebesgue Covering Theorem 9.5.
174 CR. IV CONTINUOUS FUNCTIONS
16.S. If fis uniformly continuous on a bounded subset B of RJ' and has values
in R11, then must f be bounded on B? .
16.T. A function g on R to Rq is periodic if there exists a positive number p
such that g(x + p) = g(x) for all x in R. Show that a continuous periodic func-
t tion is bounded and uniformly continuous on all of R.
16.U. Suppose that f is uniformly continuous on (0, 1) to R. Can J be defined
at x = 0 and x = 1 in such a way that it becomes continuous on [0, 1]1
16.V. Let 5) = {x E Rp: Ixl < 11. Is it true that a continuous function f on
:D to RIl can be extended to a continuous function on:D1 = {x E RJ': Ixl < 1}
if and only if f is uniformly continuous on~?
Projects
16.a. The purpose of this project is to show that many of the theorems of
this section hold for continuous functions whose domains and ranges are con-
tained in metric spaces. In establishing these results, we must either observe that
earlier definitions apply to metric spaces or can be reformulated to do so.
(a) Show that Theorem 15.2 can be reformulated for a function from one
metric space to another.
(b) Show that the Global Continuity Theorem 16.1 holds without change.
(c) Prove that the Preservation of Connectedness Theorem 16.3 holds.
(d) Show that the Preservation of Compactness Theorem 16.5 holds.
(e) Show that the Uniform Continuity Theorem 16.12 can be reformulated.
l6.~. Let g be a function on R to R which is not identically zero and which
satisfies the functional equation
g(x + y) = g(x)g(y) for x, y E R.
The purpose of this project is to show that g must be an exponential function.
(a) Show that g is continuous at every point of R if and only if it is continuous
at one point of R.
(b) Show that g does not vanish at any point.
(c) Prove that g(O) = 1. If a = g(l), then a > 0 and g(r) = ar for r E Q.
(d) If (l(x) > 1, 0 < x < 0, for some positive 0, then g is positive on all of R.
In this case, g is strictly increasing and continuous on R.
(e) If g is continuous on R, then g is positive on all of R.
(f) Show that there exists at most one continuous function satisfying this
functional equation and such that gel) = a for a > O.
(g) Referring to Project 6.{3, show that there exists a unique continuous func-
tion satisfying this functional equation and ~ch that g(l) = a for a> O.
z
16.-y. Let h be a function on P = (x E R: > 0) to R which is not identically
~ero and which satisfies the functional equation
(x, I(x,)
(a, {(a')
f+f./3 __~
"
f-E/3_~
Figure 17.1
SEC. 17 SEQUENCES OF CONTINUOUS FUNCTIONS 177
which converges to a continuous function I, then it does not follow that
the convergence is uniform (see Exercise 17.A).
Approximation Theorems
Here we have used the 5)-norm which was introduced in Definition 13.7.
We say that the function f can be uniformly approximated on ~ by
functions in a class 9 if, for each positive number E there is a function
gf in 9 such that Iluf - fll~ < E; or, equivalently, if there exists a se-
quence of functions in 9 which converges uniformly on 5) to f.
17.2 DEFINITION. A function U with domain Rp and range in Rq is
called a step function if it assumes only a finite number of distinct
values in Rq, each non-zero value being taken on an interval in Rp.
For example, if p = q = 1, then the function g defined explicitly by
g(x) = 0, x< - 2
= 1, -2 < x < 0,
= 3, o < x < 1,
= -5, 1 < x < 3,
= 0, x>3
is a step function. (See Figure 17.2 on the next page.)
We now show that a continuous function whose domain is a compact
interval can be uniformly approximated by step functions.
178 CR. IV COXTIXUOUS FUNCTIONS
( )
( ]
] (
-2 o 1 3
[ 3
Figure 17.2. A step function.
f+ E
f- E
We shall now prove a deeper, more useful, and more interesting result
concerning the approximation by polynomials. First, we prove the Weier-
strass Approximation Theorem for p = q = 1, by using the polynomials
of S. Bernsteln.t Next, we shall establish the Rv case of M. H. Stone's
generalization of the Weierstrass Theorem. We shall then be able to
obtain easily the general case of polynomial approximation.
(17.2)
(17.5) n -
(k - 2
2) (n - 2) I
= (k - 2) l(n - k)!
k(k - 1)
= n(n - 1)
(n)
k
(17.6) 1= t (n) x
k""O k
k(1 - x)n-k
1 = L:
n-l (
n ~ 1) xi(1 - x)n-l- i .
1""0 J
Multiply this last relation by x and apply the identity (17.4) to obtain
x = ni:
j""O
j + 1 (.
n J
n
+1
) Xi+1(l _ x)n-(f+I>.
(17.7) x =
k=O
Ln -k
n
(n)k x k (1 - x)n-k.
(n2 _ n)x2 = f
k=O
(k2 - k) (n)
k
xk(l - x)n-k.
f(x) = t
k=O
f(x) (n) x k(1 - x)n-k.
k
Therefore, we obtain the relation
and break (17.10) into two sums. The sum taken over those k for which
Ix - kin I < n- 1/ 4 < o(e) yields the estimate
L:Ie e (n)
k
xk(1 - x)n-k <E t (n)
k =1 k
x k(1 - x)n-k = e.
The sum taken over those k for which Ix - kln\ > n- 1/4, that is,
(x - klm)2 > n-1/ 2, can be estimated by using formula (17.9). For this
part of the sum in (17.10) we obtain the upper bound
SEC. 17 SEQUENCES OF CONTINUOUS FUNCTIONS 183
~ 2M (~) x'(l - xl n- .
= 2~f L: (x - kJn)2 (n) xk(l _ x)n-k
k (x - kjn)'l k
< 2M vn tk=l
(x - kjn)2 (n) xk(1 -
If;
x)n-k
(('(x) = d(x, A)
d(x, A) + d(x, B)
17.11 TIETZEt EXTENSION THEOREM. Let f be a bounded continuous
function defined on a closed subset:D of Rp and with values in R. Then there
exists a continuous funetion g on Rp to R such that g(x) = f(x) for x in 1)
and such that sup {Ig(x)j : x E Rp} = sup {I[(x)l : x E :D}.
Having done this, we set fa = f2 - CP2 and note that fa = 1 - CPt - fP2 is
continuous on ~ and that sup {lh(x)l:x E ~} < (j)2M.
By proceeding in this manner, we obtain a sequence (CPn) of functions
defined on Rp to R such that, for each n,
Equicontinuity
l' CESARE ARZELA (1847-1912) was a professor at Bologna. He gave necessary and
wfficient conditions for the limit of a sequence of continuous functions on a closed
interval to be continuous, and he studied related topics.
GIULIO ASCOLI (1843-1896), a professor at Milan, formulated the definition of
equicontinuity in a geometrical setting. He also made contributions to Fourier
~leries.
192 CH. IV CONTINUOUS FUNCTIONS
PROOF. First, we shall show that if (a) fails, then so does (b). If the
family;Y is not bounded, there is a sequence (fn) of functions in ;Y such
that for each natural number n we have Ilfnll > n. In view of Lemma
17.13 no subsequence of (jn) can converge uniformly on K. Also, if the
set ;Y is not equicontinuous, then for some fO > 0 there is a sequence
Un) from ;Y such that o(fO, in) > lin. If this sequence Un) has a uni-
formly convergent subsequence, we obtain a contradiction to Lemma
17.15.
We now show that, if the set ;Y satisfies (a), then given any sequence
(in) in ;Y there is a subsequence which converges uniformly on K. To do
this we notice that it follows from Exercise 8.5 that there exists a count-
able set C in K such that if Y E K and f > 0, then there exists an element
x in C such that Ix - yl < f. If C = {Xl, X2, ... }, then the sequence
(!n(Xl)) is bounded in Rq. It follows from the Bolzano-Weierstrass
Theorem 12.4 that there is a subsequence
(!ll (Xl), 112 (Xl), ... , lin (Xl), ...)
of (!n(XI)) which is convergent. Next we note that the sequence (ilk (X2) :
kEN) is bounded in Rq; hence it has a subsequence
(f21 (X2) , ! 22 (X2), ...,!2n (X2), ...)
which is convergent. Again, the sequence (!2n (Xa) :n E N) is bounded
in Rq, so some subsequence
(fal (Xa) , la2 (Xa), ... , fan (Xa), ...)
is convergent. We proceed in this way and then set On = inn so that Yn is
the nth function in the nth subsequence. It is clear from the construction
that the sequence (Yn) converges at each point of C.
We shall now prove that the sequence (On) converges at each point of
K and that the convergence is uniform. To do this, let e: > 0 and let
o(e:) be as in the definition of equicontinuity. Let C1 = {YI, . . 0' yd be a
finite subset of C such that every point in K is within O(f) of some point
in C1 Since the sequences
converge, there exists a natural number M such that if m, n > ill, then
Igm(Yi) - gn(Yi) I < E for i = 1, 2, ..., k.
Given x E K, there exists a Yi E C1 such that Ix - Yil < O(f). Hence, by
the equicontinuity, we have
SEC. 17 SEQUENCES OF CONTINUOUS FUNCTIONS 193
for all n EN; in particular, this inequality holds for n > 1~1. Therefore.
we have
Ign(X) - gm(x)1 < Ign(x) - !In(YJ)! + !gn(Yi) - !lm(Yi)!
+ Igm(Y1) - gm(x)1 < E + + E E = 3e,
provided m, n > M. This shows that
II!ln - gmllK < 3E for
m, n > M,
so the uniform convergence of the sequence (gn) on K follows from the
Cauchy Criterion for uniform convergence, given in 13.11.
Q.E.D.
Exercises
If (fn(C) converges to 0 and e > 0, then there exists a natural number 1'l1 and
a neighborhood U of C such that if n > .M and x E U (\:0 thenin(x) < e.
17.R. Using the preceding exercise, establish the following result of U. Dini. t
If (fn) is a monotone sequence of real-valued continuous functions which con-
verges at each point of a compact subset K of Rp to a continuous function i,
then the convergence is uniform on K.
17.1. Can Dini's Theorem fail if the hypothesis that K is compact is dropped?
Can it fail if the hypothesis that i is continuous is dropped? Can it fail if the
hypothesis that the sequence is monotone is dropped?
17.J. Prove the following result of G. P6lya.t If for each n E N, in is a mono-
tone increasing function on I to R, if i is continuous on I to R, and if f(x) =
lim (jn(x) for all x E I, then the convergence is uniform on I. Observe that it
need not be assumed that the fn are continuous.
17.K. Let (jn) be a sequence of continuous functions on ~ c Rp to Rq and
let f(x) = lim (fn(X) for x E D. Show that i is continuous at a point c in D if
and only if for each e > there exists a natural number m = m(e) and a neigh-
borhood U = U (E) of C such that if x E :0 (\ U, then
17.L. Consider the weight factors CPk that appear in the nth BernsteIn poly-
nomials. By using elementary calculus or other means, show that CPk takes its
supremum on I at the point kin. Write out explicitly the functions CPk, k =
0, 1, 2, when n = 2 and the functions corresponding to n = 3, and note that
L CPn(X) = 1, for x E I. Draw graphs of some of these functions.
17.M. Carry out the details in the derivation of equation (17.8) and the
equation immediately preceding this equation.
17.N. Differentiate equation (17.3) twice with respect to s and then sub-
stitute s = x, t = 1 - x. From what is obtained, give another derivation of
equations (17.7) and (17.8).
17.0. Let K be the circumference of the unit circle in R2; hence K is the set
{ (x, y): x2 + y2 = I}. Note that K can be parametrized by the angle 9, where
tan 0 = ylx. A trigonometric polynomial is a function p on K to R of the form
statement, it may seem odd to the reader that we have waited this long
before inserting a section dealing with limits. There are several reasons
for this delay, the main one being that elementary analysis deals with
several different types of limit operations. We have already discussed
the convergence of sequences and the limiting implicit in the study of
continuity. In the next chapters, we shall bring in the limiting operations
connected with the derivative and the integral. Although all of these
limit notions are special cases of a more general one, the general notion
is of a rather abstract character. For that reason, we prefer to introduce
and discuss the notions separately, rather than to develop the general
limiting idea first and then specialize. Once the special cases are well
understood it is not difficult to comprehend the abstract notion. For an
excellent exposition of this abstract limit, see the expository article of
E. J. McShane cited in the References.
In this section we shall be concerned with the limit of a function at a
point and some slight extensions of this idea. Often this idea is studied
before continuity; in fact, the very definition of a continuous function is
sometimes expressed in terms of this limit instead of using the definition
we have given in Section 15. One of the reasons why we have chosen to
study continuity separately from the limit is that we shall introduce
two slightly different definitions of the limit of a function at a point.
Since both definitions are widely used, we shall present them both and
attempt to relate them to each other.
Unless there is specific mention to the contrary, we shall let f be a
function with domain ~ contained in Rp and values in R q and we shall
consider the limiting character of f at a cluster point c of ~. Therefore,
every neighborhood of c contains infinitely many points of X>.
18.1 DEFINITION. (i) An element b of Rq is said to be the deleted
limit of fat c if for every neighborhood Vof b there is a neighborhood U
of c such that if x belongs to U n ~ and x ~ c, then fCx) belongs to V.
In this case we write
(18.1) b = limf or b = limf(x).
c x~c
limf, Limf,
c c
limf = Limf.
e c
(c) If c does not belong to the domain :D of j, then the deleted limit exists
if and only if the non-deleted limit exists.
Part (b) of the lemma just stated shows that the notion of the non-
deleted limit is somewhat more restrictive than that of the deleted limit.
Part (c) shows that they can be different only in the case where c belongs
to:D. To give an example where these notions differ, consider the function
f on R to R defined by
(18.3) f(x) = 0, x ~ 0,
= 1, x = o.
If c = 0, then the deleted limit of f at c = 0 exists and equals 0, while
the non-deleted limit does not exist.
We now state some necessary and sufficient conditions for the exist-
ence of the limits, leaving their proof to the reader. It should be realized
that in part (c) of both results the limit refers to the limit of a sequence,
which was discussed in Section 11.
a = lim g 0 f.
c
b = Limf, a = Limg
o b
a = Lim go f.
c
Note that the non-deleted limits do not exist for these functions.
200 CH. IV CONTINUOUS FUNCTIONB
For the remainder of the present section, we shall consider the case
where q = 1. Thus f is a function with domain 5) in Rp and values in R
and the point c in Rp is a cluster point of :D. We shall define the limit
superior or the upper limit of f at c. Again there are two possibilities
depending on whether deleted or non-deleted neighborhoods are con-
sidered, and we shall discuss both possibilities. It is clear that we can
define the limit inferior in a similar fashion. One thing to be noted here
is that, although the existence of the limit (deleted or not) is a relatively
delicate matter, the limits superior to be defined have the virtue that (at
least if f is bounded) their existence is guaranteed.
The ideas in this part are parallel to the notion of the limit superior of
a sequence in Rp which was introduced in Section 14. However, we shall
not assume familiarity with what was done there, except in some of
the exercises.
18.7 DEFINITION. Suppose that f is bounded on a neighborhood of
the point c. If r > 0, define cp(r) and cp(r) by
(18.4a) cp(r) = sup {f(x) : 0 < Ix - cl < r, x E :oJ,
(18.4b) 4>(r) = sup {f(x) : Ix - cl < r, x E :O}.
and set
(18.5a) lim sup f = inf {cp (r) : r > O},
x-+c
These quantities are called the deleted limit superior and the non-
deleted limit superior of fat c, respectively.
Since these quantities are defined as the infima of the image under f of
ever-decreasing neighborhoods of c, it is probably not clear that they
deserve the terms "limit superior." The next lemma indicates a justifica-
tion for the terminology.
18.8 LEMMA. If 'P, cI> are as defined in equations (18.4), then
(18.6a) lim sup f = lim cp(r),
x--->c r--->O
which proves (l8.6a). The proof of (18.6b) is similar and will be omitted.
Q.E.D.
18.9 LEMMA. (a) If M > lim sup j, then there exists a neighborhood
x~c
U of c such that
(b) If M > Lim sup f, then there exists a neighborhood U of c such that
x~c
Hence there exists a real number r1 > 0 such that ep(r1) < M and we can
take U = {x E Rp : Ix - cl < rtl. The proof of (b) is similar.
Q.E.D.
One of the keys to the importance and the utility of upper semi-continu-
ous functions is suggested by the following lemma, which may be com-
pared with the Global Continuity Theorem 16.1.
18.12 LEMMA. Let f be an upper serni-continuous function with
domain ~ in Rp and let k be an arbitrary real nwnber. Then there exists an
open set G and a closed set F such that
It is possible to show, using the lemma just proved, (cL Exercise 18.M)
that if K is a compact subset of Rp and f is upper semi-continuous on K,
then f is bounded above on K and there exists a point in K where f attains
its supremum. Thus upper semi-continuous functions on compact sets
possess some of the properties we have established for continuous func-
tions, even though an upper semi-continuous function can have many
points of discontinuity.
SEC. 18 LIMITS OF FUNCTIONS ~03
Exercises
18.A. Discuss the existence of both the deleted and the non-deleted limits of
the following functions at the point x = O.
(a) f(x) == lxi,
(b) f(x) = 1/x, x 0,
(c) f(x) = x sin (l/x), x ~ 0,
X sin (l/x), x ~ 0,
(d) f(x) = {1 , x = 0,
(e) f(x) = sin (l/x), x ~ 0,
O, x <0,
(f) f(x) = { I,
x> O.
Under the same hypotheses, the inner product fg has a deleted limit at c and
in case for each positive number M there exists a neighborhood U of c such that
if x E U n :0 (f), x ~ c, then j(x) > M. Formulate and establish a result
analogous to Lemma 18.3 for this limit.
18.J. In view of Exercises 18.H and 18.1, give a definition of what is meant
by the expressions
lim f = + co, limf = - c:e.
x-++ oo x-+c
18.K. Establish Lemma 18.8 for the non-deleted limit superior. Give the
proof of Lemma 18.9(b).
18.L. Define what is meant by
lim supf = L, lim inf f = - CXl.
x-++ oo x-++ oo
uniformly for y in a set :3)2. Formulate and prove a result analogous to Theorem
14.15.
18.V. Let f be as in Definition 18.1 and suppose that the deleted limit at c
exists and that for some element A in R q and r > 0 the inequality If(x) - A I < r
holds on some neighborhood of c. Prove that
llimf - AI < r.
x~c
206
SEC. 19 THE DERIVATIVE IN R 207
Q.E.D.
208 CR. V DIFFERENTIATION
can be proved to have this property. We shall not go through the details,
but refer the reader to the books of Titchmarsh and Boas for further
details and references.)
19.3 LEMMA. (a) If f has a derivative at c and f' (c) > 0, there
exists a positive number 0 such that if x E :D and c < x < c + 0, then
f(c) < f(x).
(b) If l' (c) < 0, there exists a positive number 0 such that if x E ~
and e - 0 < x < c, then f(c) < f(x).
PROOF. (a) Let eo be such that 0 < eo < 1'(c) and let 0 = o(eo) cor-
respond to eo as in Definition 19.1. If x E :D and c < x < e + 0, then
we have
-EO < f(x) - f(c) - f'ee).
x-c
Since x - e > 0, this relation implies that
o < [f'(c) - fO](X - c) < f(x) - f(c),
which proves the assertion in (a). The proof of (b) is similar.
Q.E.D.
we shall state this result only for relative maxima, leaving the formu-
lation of the corresponding result for relative minima to the reader.
19.4 INTERIOR MAXIMUM THEOREM. Let c be an interior point of
~ at which f has a relative maximum. If the derivative of fat c exists, then
it must be equal to zero.
PROOF. H1'(c) > 0, then from Lemma 19.3(a) there is a 0> 0 such
that if c < x < c + 0 and x E 1>, then f(c) < f(x). This contradicts
the assumption that f has a relative maximum at c. If f' (c) < 0, we use
Lemma 19.3(b).
Q.E.D.
Figure 19.1
(See Figure 1g.1.) By hypothesis l' (c) exists and, since f has a relative
maximum point at c, the Interior Maximum Theorem implies that
f'(c) =0.
Q.E.D.
x b
[It is easily seen that I(J is the difference of f and the function whose
graph consist of the line segment passing through the points (a, f(a
and (b, f(b; see Figure 19.2.] It follows from the hypotheses that rp
is continuous on J = [a, b] and it is easily checked that rp has a derivative
in (a, b). Furthermore, we have rp(a) = rp(b) = O. Applying Rolle's
Theorem, there exists a point c inside J such that
o = rp'(c) = f'Cc) _ feb) - f(a)
b-a
from which the result follows.
Q.E.D.
19.7 COROLLARY. Iff has a derivative on J = [a, b], then there exists
a point c in (a, b) such that
feb) - f(a) = f'(c)(b - a).
Sometimes it is convenient to have a more general version of the
Mean Value Theorem involving two functions.
19.8 CAUCHY MEAN V AL"VE THEOREM. Let f, g be continuous on
J = [a, b] and have derivatives inside (a, b). Then there exists a point c
in (a, b) such that
f'(c)[g(b) - g(a)] = g'(c)[f(b) - f(a)].
BEC. 19 THE DERIVATIVE IN R 111
p }
- (3 - x)n-l =P - fCn)(x)
({3 - x)n-l.
(n-I)! (n-I)!
Since ep' ('Y) = 0, then P = j<n) (')'), proving the assertion.
Q.E.D.
REMARK. The remainder term
fen) (')')
(19.4) Rn = 1 ((3 - a)n
n.
given above is often called the Lagrange form of the remainder. There
are many other expressions for the remainder, but for the present, we
mention only the Cauchy form which asserts that for some number fJ
with 0 < fJ < 1, then
j (n)l - fJ)a + fJ~)
( 19.5) R = (1 - fJ)n-l IJ (~_ )n
1'1 (n _ I)! tJ a.
This form can be established as above, except that on the left side of
equation (19.3) we put (3 - a)Q/ (n - I)! and we define ep as above
except its last term is ({3 - x)Q/ (n - I)! We leave the details as an
exercise. (In Section 23 we shall obtain another form involving use of
the integral to evaluate the remainder term.)
19.10 CONSEQUENCES. We now mention some elementary conse-
quences of the Mean Value Theorem which are frequently of use. As
before, we assume that f is continuous on J = [a, b] and its derivative
exists in (a, b).
SEC. 19 THE DERIVATIVE IN R 213
Our improved estimate is 10.243 < .y105 < 10.250 and more accurate
estimates can be obtained in this way.
(c) The Mean Value Theorem and its corollaries can be used to
establish inequalities and to extend inequalities that are known for
integral or rational values to real values.
For example, we recall that Bernoulli's Inequality 5.E asserts that
if 1 + x > 0 and n E N, then (1 + x)n > 1 + nx. We shall show that
this inequality holds for any real exponent r > 1. To do so, let
f(x) = (1 + x)r,
so that
f'(x) = r(l + X)r-l.
If -1 < x < 0, then f'ex) < r, while if x > 0, then f'ex) > r. If we
apply the Mean Value Theorem to both of these cases, we obtain the
result
(1 + x)r > 1 + rx,
when 1 + x > 0 and r > 1. Moreover, if r > 1, then the equality occurs
if and only if x = o.
As a similar result, let a be a real number satisfying 0 < a < 1 and let
g(x) = ax - XIX for x > o.
SEC. 19 THE DERIVATIVE IN R 215
Then
g' (x) = a(l - Xa-l),
so that g'(x) < 0 for 0 < x < 1 and g'(x:) > 0 for x > 1. Consequently,
if x > 0, then g(x) > g(l) and g(x) = g(l) if and only if x = 1. There-
fore, if x > and 0 < a < 1, then we have
xa < ax + (1 - a).
exists, then
lim f(x) = lim [(x)
x->a g (x) x-+a g' (x)
in the form f(x)jg(x) where f(x) = log x and g(x) = l/x, x > O. It
is seen that
1
f'
--
(x)
= -
x
= -x~O, as x~o.
g' (x) -1
x2
Let > 0 and choose
E a fixed positive number Xl < 1 such that if
o < x < Xl, then
l' (x)
g' (x) < E.
Applying the Cauchy Mean Value Theorem, we have
f(x) - f(Xl) l' (X2)
= ~- <t:
g (x) - g (Xl) g' (X2) ,
with satisfying 0 < x < X2 < Xl. Since f(x) ~ 0 and g(x) ~ 0 for
X2
o < x < Xl, we can write the quantity appearing on the left side in the
more convenient form
1 f(XI)
f(x) f(x).
J
g (x) [1 _g (Xl)
g(x)
Holding Xl fixed, we let x ~ O. Since the quantity in braces converges
to I, it exceeds! for X sufficiently small. We infer from the above that
f(x)
Ih(x)1 = g(x) < 2t:,
x-C x-c
In virtue of the uniform convergence of the sequence Urn, the right hand
ilide is dominated bye when m, n > M(e). Taking the limit with respect
to m, we infer from Lemma 11.16 that
fex) - f(c)
x-c x-c
218 CR. V DIFFERENTIATION
when n > M (E). Since g(e) :::; lim (j,a' (c), there exists an N (e) such that
if n > N(e), then
Iftl'(c) - g(e)1 < e.
Now let K = sup {M(e), N(e)}. In view of the existence of !K'(C), if
o < Ix - el < oK(e), then
fK(X) - fK(C) _ !K'(C) < e.
x-c
Therefore, it follows that if 0 < Ix - cl < oK(e), then
f(x) - f(c) _ gee) < 3e.
x-c
This shows that l' (c) exists and equals g(c).
Q.E.D.
Exercises
19.A. Using the definition, calculate the derivative (when it exists) of the
functions given by the expressions:
(a) f(x) = x2, (b) g(x) = x",
(c) hex) = VX, x> 0 (d) F(x) = l/x, x oF- 0,
(e) G(x) = lxI, (f) H(x) = 1/x2 , x oF- o.
19.B. If f and g are real-valued functions defined on an interval J, and if they
are differentiable at a point c, show that their product h, defined by hex) =
j(x)g(x), x E J, is differentiable at C and
h'(c) = j'(c)g(c) + f(c)g'(c).
19.C. Show that the function defined for x oF- 0 by
f(x) = sin (l/x)
is differentiable at each non-zero real number. Show that its derivative is not
bounded on a neighborhood of x = O. (You may make use of trigonometric
identities, the continuity of the sine and cosine functions, and the elementary
limiting relation
smu
--~1 as u~O.)
u
19.D. Show that the function defined by
g(x) = x 2 sin (l/x), x. 0,
= 0, x = 0,
is differentiable for all real numbers, but that g' is not continuous at x = O.
SEC. 19 THE DERIVATIVE IN l~ S19
xn
R" = - j<n) (8",x) , 0 < 8" < 1.
n!
Show that if 0 < x < 1, then lim(Rn ) = O.
19.T. In the preceding exercise, use Cauchy's form of the remainder to obtain
&=
m(m - 1) .. (m - n + 1) (1 - 6,,)"-IX"
,
12 ... (n - 1) (1 + 8",x)"-
220 CH. V DIFFERENTIATION
Proiects
19.a. In this project we consider the exponential function from the point of
view of differential calculus.
(a) Suppose that a function E on J = (a, b) to R has a derivative at every
point of J and that E'(x) = E(x) for all x E J. Observe that E has derivatives
of all orders on J and they all equal E.
(b) If E(a) = 0 for some a E J, apply Taylor's Theorem 19.9 and Exercise
11.N to show that E(x) = 0 for all x E J.
(c) Show that there exists at most one function E on R to R which satisfies
E' (x) = E(x) for x E R, E(O) = 1.
(d) Prove that if E satisfies the conditions in part (c), then it also satisfies the
functional equation
E(x+ y) = E(x)E(y) for x, y E R.
(Hint: if f(x) = E(x + y)jE(y), then j'(x)= f(x) and f(O) = 1.)
(e) Let (En) be the sequence of functions defined on R by
El(x) = 1 + x, En(x) = En-l(X) + xn/n!.
SEC. 19 THE DERIVATIVE IN R ~21
Let A be any positive number; if Ixl < A and if m ~ n > 2A, then
IEm(x) - En (x) I < A [A
1 + - + ... + (A)m-nJ
n+l
-
(n + 1)! n n
2An+l
< .
(n + 1)!
Hence the sequence (En) converges uniformly for Ixl < A.
(f) If (En) is the sequence of functions defined in part (e), then
En'(x) = En-l(X), x E R.
Show that the sequence (En) converges on R to a function E with the properties
displayed in part (c). Therefore, E is the unique function with these properties.
(g) Let E be the function with E' = E and E(O) = 1. If we define e to be
the number
e = E(l),
then e lies between 2! and 2{. (Hint: 1 +1+t +t < e < 1+1+t +t +
-h. More precisely, we can show that
2.708 < 2 + H < e < 2 + H- < 2.723.)
19.{j. In this project, you may use the results of the preceding one. Let E
denote the unique function on R such that
(Hint: evaluate L'(l) by using the sequence (1 + lIn) and the continuity
of E.)
222 CR. V DIFFERENTIATION
Show that (8n ) converges uniformly for lxl < A to the unique function S with
the properties in part (b).
(e) Prove that S' = C and C' = -8.
(f) Establish the Pythagorean Identity 8 2 + C2 = 1. (Hint: calculate the
derivative of 8 2 + (J2.)
19.0. This project continues the discussion of the sine and cosine functions.
Free use may be made of the properties established in the preceding project.
(a) Suppose that h is a function on R which satisfies the equation
h" +h = O.
Show that there exist constants a, {3 such that h = aC + {38. (Hint: a = h(O),
(3 = h'(O).)
(b) The function 0 is even and S is odd in the sense that
C( -x) = C(x), S( -x) = -Sex), ior all x in R.
SEC. 19 THE DERIvATIVE IN R
Therefore, the smallest positive root 'Y of C lies between the positive root of
x2 - 2 = 0 and the smallest positive root of x 4 - 12x2 + 24 = O. Using this,
prove that y2 < 'Y < y'3.
(f) We define 11" to be the smallest positive root of S. Prove that 11" = 21' and
hence that 2y2 < 11" < 2 V3.
(g) Prove that both C and S are periodic functions with period 211" in the
sense that C(x + 211") = C(x) and Sex + 211") = Sex) for all x in R. Also show
that
Sex) ~ c (~ - x) ~ -c (x +~) ,
C(x) = S (~ - x) ~ S (x+~) ,
for all x in R.
19.E. Following the model of the preceding two exercises, introduce the hyper-
bolic cosine and sine as functions satisfying
respectively. Establish the existence and the uniqueness of these functions and
show that
C2 _S2 =1.
Prove results similar to (a)-Cd) of Project 19.0 and show that, if the exponential
function is denoted by E, then
'" (
Xl + X2 +n ... + xn) ::;;;1 (q;(XI) + ... + ",(x n ) )
(b) If n <2 m
and if Xl, ..., X n belong to J, let Xi for j = n + 1, .. .,2 11'
be
equal to
_x= (Xl + X2 + .. ,+ xn) .
n
Show that the same inequality holds as in part (a).
(c) Since", is continuous, show that if x, y belong to J and tEl, then
q;(1 - t)x + ty) < (1 - t)q;(x) + tq;(y).
(In geometrical terms: the entire chord lies above or on the curve.)
(d) Suppose that q; has a second derivative on J. Then a necessary and
sufficient condition that q; be convex on J is that if" (x) ~ 0 for X E J. (Hint: to
prove the necessity, use Exercise 19.U. To prove the sufficiency, use Taylor's
Theorem and expand about x = (x + y)/2.)
(e) If '" is a continuous convex function on J and if X < y z belong to J, s
show that
q;(y) - q;(x) < q;(z) - q;(x)
y-x - z-x
Therefore, if w < x < Y < z belong to J, then
q;(x) - ",(w) < q;(z) - ",(y)
x-w - z-y
(f) Prove that a continuous convex function", on J has a left-hand derivative
and a right-hand derivative at every point. Furthermore, the subset where q;'
does not exist is countable.
in the Cartesian space Rq. Of course, in this case L is a vector in Rq. The
only change required for this extension is to replace the absolute value
in equation (19.1) by the norm in the space Rq. Except for this, Defini-
tion 19.1 applies verbatim to this more general situation. That this
situation is worthy of study should be clear when it is realized that a
flmction f on J to R q can be regarded as being a curve in the space R q
and that the derivative (when it exists) of this function at the point
x = c yields a tangent vector to the curve at the point fCc). Alternatively,
if we think of x as denoting time, then the function f is the trajectory of
a point in Rq and the derivative l' (c) denotes the velocity vector of the
point at time x = c.
A fuller investigation of these lines of thought would take us farther
into differential geometry and dynamics than is desirable at present.
Our aims are more modest: we wish to organize the analytical machinery
that would make a satisfactory investigation possible and to remove the
restriction that the domain is in a one-dimensional space and allow the
domain to belong to the Cartesian space Rp. We shall now proceed to
do this.
An analysis of Definition 19.1 shows that the only place where it is
necessary for the domain to consist of a subset of R is in equation (19.1),
where a quotient appears. Since we have no meaning for the quotient of
a vector in Rq by a vector in Rp, we cannot interpret equation (19.1) as
it stands. We are led, therefore, to find reformulations of this equation.
One possibility which is of considerable interest is to take one-dimensional
"slices" passing through the point c in the domain. For simplicity it will
be supposed that c is an interior point of the domain :D of the function;
then for any u in RP, the point c + tu belongs to 1) for sufficiently small
real numbers t.
20.1 DEFINITION. Let f be defined on a subset :D of Rp and have
values in Rq, let c be an interior point of 5), and let u be any point in Rp.
A vector L u in R q is said to be the directional derivative of f at c in the
direction of u if for each positive real number there is a positive number
O(E) such that if 0 < It I < O(E), then
In the same way, taking e2 = (0, 1, ... ,0), ... , ep = (0,0, ... , 1), we
obtain the partial derivatives with respect to b, ..., ~P' denoted by
af af
f~2 = a~2' ... , f~p = a~l'
The Derivative
~3
Figure 20.1
Let a be a non-zero real number with lal < ~(E) and set x = c + aZ.
It follows that
which is a contradiction.
Q.E.D.
has no other significance. When this is done and the Leibnizt notation
for the derivative is used, the formula (20.4) becomes
df
Df(e) (dx) = dx (e) ax.
(b) Let p = 1, q > 1, and let D be a subset of R. A functionf, defined
on D to Rq, can be represented by the "coordinate functions":
(20.5) f(x) = (!I(X),f2(X), ...,fq(x), x E~.
It can be verified that the function f is differentiable at an interior point
e in D if and only if each of the real-valued coordinate functions fl,
12, ..., fq has a derivative at e. In this case, the derivative Df(e) is the
linear function of R into Rq which sends the real number u into the vector
(20.6) (j/ (e)u, f2' (e)u, ..., fq' (e)u)
of Rq. It may be noted that Df(e) sends a real number u into the product
of u and a fixed vector in R q.
(c) Let p > 1, q = 1, and let D be a subset of Rp. Then for x =
(~l, ..., ~p) in D, we often write
(d) Lt us consider the case p > 1, q > 1" but restrict our attention
first to a linear function j on Rp to Rq. Then j(x) - fee) = f(x - c),
and hence
'/(x) - fee) - f(x - e)1 = o.
This shows that when f is linear, then j is differentiable at every point
and Df(e) = f for any point e in Rp.
(e) We now consider the case p > 1, q > 1, and do not restrict the
function j, defined on D in Rp to Rq to be linear. In this case we can
represent y = f(x) by system
are given by
(20.9)
....
afq (c) afq (c)
a~1 a~2
232 eH. V DIFFERENTIATION
The next result shows that if f is differentiable at c, then all the direc-
tional derivatives of fat c exist and can be calculated by a very simple
method.
20.6 THEOREM. Let f be defined on ~ in R1' and have range in R q. If
f is differentiable at the point c in 5) and u is any point in R1', then the direc-
tional derivative of fat c in the direction u exists and equals Df(c) (u).
PROOF. Applying Definition 20.2 with x = e + lu, we have
t CARL (G. J.) JACOBI (1804-1851) was professor at Konigsberg and Berlin. His main
work was concerned with elliptic functions, but he is also known for his work in
determinants.
SEC. 20 THE DERIVATIVE IN RP 233
:~ (0, 0)
iJf
- (0,0),
a~
both exist and equal zero and every directional derivative exists. How-
ever, the function f is not even continuous at (J = (0,0), so that f does
not have a derivative at o.
Although the existence of the partial derivatives is not a sufficient
condition for the existence of the derivative, the continuity of these
partial derivatives is a sufficient condition.
20.7 THEOREM. If the partial derivatives of f exist in a neighborhood
of c and are continuous at c, then f is differentiable at c.
PROOF. We shall treat the case q = 1 in detail. If e > 0, let liCE) > 0
be such that if Iy - cl < O(E) and} = 1, 2, ..., p, then
If x = (~1, b ..., ~p) and c = ("YI, "Y2, .., "Yp), let Xl, X2, ., Xp-l denote
the points
Xl = ("Yll ~2, 0 .., ~p), X2 = ("YI, "Y2, ~3, .. 0' ~p),
o , X p-l = ("YI, "Y2, . 0 'J "Yp-l, ~p)
and let Xo = x and X p = c. If Ix - cl < o(e), then it is easily seen that
IXj - cl < o(e) for j = 0, 1, .. 'J p. We write the difference f(x) - fCc)
in the telescoping sum
p
f(x) - fCc) = L
j=1
{f(Xj-l) - j(Xj)}.
Applying the Mean Value Theorem 19.6 to the J"th term of this sum, we
obtain a point Xj, lying on the line segment joining Xj-l and XiJ such that
f(x) - fCc) - t
J =1
(~J - "Y j) af (c) =
a~j
t
p-1
(~j - "Y j){ af (Xj) - af. (c)
a~j a~)
.
Employing the inequality (20.11), each quantity appearing in braces in
the last formula is dominated bye. Applying the C.-B.-S. Inequality to
this last sum, we obtain the estimate
IfCx) - fCc) - t
j =1
(~j - "Yj) af (c)[
a~j
< Ix - cl(e yp),
whenever Ix - cl < o(e).
CH. v DIFfERENTIATION
The next result asserts that the derivative of the composition of two
f\IDctions is the composition of their derivatives.
20.9 CHAIN RULE. Let f be a function with domain XJ(J) in Rp and
range in Rq and let g have domain XJ(g) in Rq and range in Rr. Suppose
that f is differentiable at c and that g is differentiable at b = f(c). Then the
composition h = go f is differentiable at c and
If Ix - cl < inf {1', (I/K)a(e, g) L then (20.14) implies that If(x) - f(c)\
:::; aCE, g), which means that
hl(e)wl+'" +fEp(C)W p
and so the derivative of gO f at c takes this point of Rp into the real
number
(20.17) g'(bHfEl(e)wl + 0'0 + fEp(e)w p ].
(20.19)
ag dfl + ... + ag dfq
a'1Jl dx ar]Q dx
The derivative Df(e) sends (~, 1') into (w, r) according to the formulas
(20.20)
w = Wx(c)~ + W%/(ch,
t = Zx(c)~ + Z%/(C)l1
Also the derivative Dg(b) sends (w, t) into (p, u, r) according to the
relations
p = Rw(b)w + Rz(b)t,
(20.21) u = Sw(b)w + Sz(b)r,
T = Tw(b)w + Tz(b)r.
dz =
az az
- dx + - dy;
ax ay
similarly, (20.21) becomes
ar ar
dr = -dw
aw
+ -dz,
az
ds = -
as dw + -as dz,
ow az
at
dt = -
aw
dw + -azat dz;
iS8 CH. V DIFFERENTIATION
aw (e) aw (e)
W x(c) WII (c) ] ax ay
(20.23) [
ZxCe) ZII(C) = az (e) az (e)
ax ay
Similarly, (20.21) asserts that the mapping Dg(b) of (w, r) into (p, (1, T)
is given by the 3 X 2 matrix
~ (b) ar (b)
aw az
[RW(bl R'(b l ]
(20.24) Sw(b) S,Cb) = ~ (b) as (b) .
aw iJz
Tw(b) T.(b)
~ (b) at (b)
aw az
Finally, relation (20.22) asserts that the mapping D(g 0 f) (c) of (t, 'YJ)
into (p, (1, T) is given by the 3 X 2 matrix
it is easily seen that the desired linear function L has the matrix repre-
sentation
VV"e remark that the proof yields more information about L than was
announced in the statement. Each of the q rows of the matrix for L is
obtained by evaluating the partial derivatives of fi = f Vi, i = 1,2, ... , q,
at some point Ci lying on the line segment joining a and b. However, as
we have already seen, it is not always possible to use the same point c
for different rows in this matrix.
or
af i = 1,2, ..., p.
a~/
Each of the partial derivatives is a function with domain in Rp and
range in R and so each of these p functions may have p partial deriva-
tives. Following the accepted American notation, we shall refer to the
resulting p2 functions (or to such ones that exist) as the second partial
derivatives of f and we shall denote them by
or
a'lf
-- ~,J = 1, 2, ..., p.
a~ja~i'
It should be observed that the partial derivative intended by either of
the latter symbols is the partial derivative with respect to ~ j of the
partial derivative of f with respect to t. (In other words: first ~i, then
~ j; however, note the difference in the order in the two symbols!)
In like manner, we can inquire into the existence of the third partial
derivatives and those of still higher order. In principle, a function on
Rp to R can have as many as pn nth partial derivatives. However, it is
a considerable convenience that if the resulting derivatives are con-
tinuous, then the order of differentiation is not significant. In addition
to decreasing the number of (potentially distinct) higher partial deriva-
tives, this result largely removes the danger from the rather subtle
notational distinction employed for different orders of differentiation.
It is enough to consider the interchange of order for second derivatives.
By holding all the other coordinates constant, we see that it is no loss of
generality to consider a function on R2 to R. In order to simplify our
notation we let (x, y) denote a point in R2 and we shall show that if
fx, fy, and fxy exist and if fxy is continuous at a point, then the partial
derivativefyx exists at this point and equals/xy. It will be seen in Exercise
20.U that it is possible that bothfxy andfyx exists at a point and yet are
not equal.
CH. V DIFFERENTIATION
The device that will be used in this proof is to show that both of these
mixed partial derivatives at the point (0,0) are the limit of the quotient
f(h, k) - f(h, 0) - f(O, k) + f(O, 0)
hk
as (h, k) approaches (0, 0).
20.14 LEMMA. Suppose that 1 is defined on a neighborhood U of the
origin in R 2 with values in R, that the partial derivatives f x and f xt/ exist in
U, and that fxt/ is continuous at (0,0). If A is the mixed difference
(20.26) A (h, k) = f(h, k) - f(h, 0) - f(O, k) + f(O, 0),
then we have
A (h, k)
fXt/(O, 0) = lim
(h.k)~(O,O) hk
PROOF.Let E > 0 and let ~ > 0 be so small that if Ihl < ~ and Ikl < 0,
then the point (h, k) belongs to U and
(20.27) 11x1/(h, k) - lx1/(O, 0)1 < E.
If Ikl < 0, we define B for Ihl < 0 by
B (h) = f(h, k) - l(h, 0),
from which it follows that A(h, k) = B(h) - B(O). By hypothesis, the
partial derivative Ix exists in U and hence B has a derivative. Applying
the Mean Value Theorem 19.6 to B, there exists a number ho with
o < Ihol < lhl such that
(20.28) A(h, k) = B(h) - B(O) = hB'(ho).
(It is noted that the value of ho depends on the value of k, but this will
not cause any difficulty.) Referring to the definition of B, we have
where 0 < [hoi < Ihl, < [k o! < Ikl. It follows from inequality (20.27)
and the preceding expression
A (h, k)
hk - !xu(O, 0) <e
If > 0, there exists a number a(e) > 0 such that if 0 < Ihl < a(e)
I: and
o < Ikl < 5(e), then
A(h, k)
hk - f:&1/(O, 0) < e.
By taking the limit in this inequality with respect to k and using (20.31),
we obtain
for all h satisfying 0 < lhj < a(e). Therefore, fl/x(O, 0) exists and equals
f XI/(O, 0).
Q.E.D.
eH. V DIFFERENTIATION
Higher Derivatives
In discussing the third derivative, we shall assume that all of the third
partial derivatives of j exist and are continuous in a neighborhood of c.
By now the method of formation of the higher differentials should
be clear. (In view of our preceding remarks concerning the interchange
of order in differentiation, if the resulting mixed partial derivatives are
SEC. 20 THE DERIVATIVE IN R"
Exercises
20.A. If J is defined for a,1'], r) in Ra to R by the formula
J(~, 1'], r) = 2~2 - 1'] + 6~1'] - r3 + 3r,
calculate the directional derivative of f at the origin 8 = (0, 0, 0) in the direction
of the points
x = (1,2,0), y = (2, 1, -3).
2O.B.. Let.! be defined for (~, 11) in R2 to R by
f(~, 1']) = Vl1, 1'] ~ 0,
= 0, 11 = 0,
Show that the partial derivatives fE, f1/ exist for 8 = (0,0) but that if u = (a, (J)
with afJ ~ 0, then the directional derivative of fat 8 in the direction of u does
not exist. Show also that f is not continuous at 8; in fact, f is not even bounded
at 8.
20.0. If J is defined on R2 to R by
I(t, 11) = 0, if ~11 = 0,
= 1, otherwise,
then f has partial derivatives IE, f." at 8 = (0, 0), but I does not have directional
derivatives in the direction u = (a, fJ) if afJ ~ O. The functionfis not continuous
at 8, but it is bounded.
20.D. Letfbe defined on R2 to R by
Thenfis continuous and has partial derivatives at (J = (0,0), butfiB not differ-
entiable at 8.
20.G. Let! be defined on R2 to R by
f(~, 1/) = r + 1/2, both ~, 1/ rational,
= 0, otherwise.
Then! is continuous only at the point 8 = (0,0), but it is differentiable there.
20.H. Let! be defined on R2 to R by
f(~, 1) = (E 2 + 1)2) sin 1/(e2 + 1)2), (~, 1) -:;e (0,0),
= 0, (~, 1/) = (0,0).
Then! is differentiable at 6, but its partial derivatives are not continuous (or
even bounded) on a neighborhood of (J.
20.1. Suppose the real-valued function f has a derivative at a point c in Rp.
Express the directional derivative of f at c in the direction of a unit vector
w = (WI, ., w p ). Using the C.-E.-S. Inequality, show that there is a direction
in which the derivative is maximum and this direction is uniquely determined if
at least one of the partial derivatives is not zero. This direction is called the
gradient direction of fat c. Show that there exists a unique vector vc such that
Df(c)(w) = Vc'W for all unit vectors w. This vector Vc is called the gradient of
fat c and is often denoted by Vei or grad f(c).
20.J. Suppose that f and g are real-valued functions which are differentiable
at a point c in Rp and that ex is a real number. Show that the gradient of fat c
is given by
and that
Vc(af) = a Vei,
VcU + g) = Vei + Vcg,
Vc(fg) = (VcJ)g(c) + fCc) (Vcg).
20.K. If f is differentiable on an open subset j) of Rp and has values in R<l
such that If(x)1 = 1 for x E j), then
f(x) .Df(x) (u) = 0 for x E j), u E Rp.
If p = 1, give a physical interpretation of this equation.
20.L. Suppose that f is defined for x = (6, ~2) in R2 to R by the formula
f(x) "'" f(6, ~2) = Ah 2 + Bhb + C~22.
Calculate Df at the point y = (7)1,7)2). Show that
(i) f(tx) = (2f(x) for t E R, ;E E R2;
(ii) Df(x)(y) = Df(y)(x);
(iii) Df(x) (x) = 2f(x);
(iv) f(x +
y) = f(x) + Df(x) (y) + f(1/).
CR. V DIFFERENTIATION
20.M. Letfbe defined on an open set ~ of Rp into Rq and satisfy the relation
(20.33) for t E R, x E ~.
In this case we say that f is homogeneous of degree k. If this function f is
differentiable at x, show that
(20.34) Df(x) (x) = kf(x).
(Hint: differentiate equation (20.33) with respect to t and set t = 1.) Conclude
that Euler'st Relation (20.34) holds even when j is positively homogeneous
>
in the sense that (20.33) holds only for t O. :f q = 1 and x = (el, ..., tp),
then Euler's Relation becomes
of
(c) F(~, '1) = f(~2 + 7]2), then '1 ~ = eof
07] ;
o2F o2F
Cd) F(~, '1) = jCt + C'1) + jCt - C'1), then c2 -
o~2
= -.
01J'l
20.0. If f is defined on an open subset ~ of R2 to R and if the partial deriva
tives f~, fl) exist on ~, then is it true that j is continuous on :D?
20.P. Letf be defined on a neighborhood of a point c in R2 to R. Suppose that
f~ exists and is continuous on a neighborhood of c and that ft'J exists at c. Then
is f differentiable at c?
20.Q. Letf be defined on a subset:D of Rp with values in Rq and suppose that
f is differentiable at every point of a line segment L joining two points a, b in:D.
If IDfCc) Cu) I < M lui for all u in Rp and for all points c on this line segment L,
then
If(b) - f(a)1 < M Ib - al
(This result can often be used as a replacement for the Mean Value Theorem
when q > 1.)
= 0, (~, 7) = (0,0).
Show that the second partial derivatives hrJ' frJ~ exist at f) = (0,0) but that they
are not equal.
If F is defined on I to R by
F(t) = {I[t(Xl - X2) + X2] - Df(e) (Xl - X2)} 'W,
Local Solvability
The next main result, the Local Solvability Theorem, is a companion
to the Local One-One Mapping Theorem. It says that if f is in Class C'
on a neighborhood of c and if Df(c) maps Rp onto all of Rq, then f maps
a neighborhood of c onto a neighborhood of fCc). Expressed differently,
every point of Rq which is sufficiently close to fCc) is the image under f
of a point close to c. In order to establish this result for the general case
we first establish it for linear functions and then prove that it holds for
functions that can be approximated closely enough by linear functions.
21.6 LEMMA. If L is a linear function of UP onto all of Rq, then there
exists a positive constant m such that every element y in R q is the image
under L of an element x in Rp such that Ixl < rn Iyl.
PROOF. Consider the following vectors in Rq:
(21.5)
q
In view of the linearity of L, the vector x = L 11/Uj is mapped into
j~l
the vector
q
Q.E.D.
(21.6)
for IXil < a. Then any vector y in R q satisfying [yl < 13 = a/2m is the
image under g of an element in 5) (g).
164 OR. V DIFFERENTIATION
(21.7)
Since all the hard work has been done, we can derive the next result
by a translation.
21.8 LOCAL SOLVABILITY THEOREM. Suppose that f is in Class C'
on a neighborhood of c and that the derivative DfCe) maps Rp onto all of
R q. There are positive numbers a, {J sueh that if y E R q and Iy - f (e) I < {J,
then there is an element x in Rp w'z'th Ix - cl < a sueh that f(x) = y.
PROOF. By hypothesis, the linear function L = Df(e) maps onto Rq
and we let m be as in Lemma 21.6. By the Approximation Lemma 21.3
there exists a number a > 0 such that if IXi - el < a, i = 1,2, then
1
(21.10) If(XI) - f(x2) - L(xI - X2) I < 2m IXI - x21
is not zero.
Ig(y,) - g(y) - M.(y, - Y)I < ~ IXI - xllu(x,)1 < {; lu(x,) I} IY' - yl
Therefore, g is differentiable at Y = f(x) and its derivative Dg(y) is the
linear function M x, which is the inverse of Df(x).
It remains to show that g is in Class Of on V. Let Z be any element
of Rp and let x, Xl, Y, Yl be as before; then it is seen directly from the
fact that the linear function Dg is the inverse of the linear function
Df that
Dg(y)(z) - Dg(YIHz) = Dg(y) 0 [Dj(XI) - Dj(x)] 0 Dg(YI)(Z).
258 CR. V DIFFERENTIATION
E
IDg(y) (z) - Dg(YI) (z) I <zlzl for z E Rp,
r
Implicit Functions
F[<p(y), y] = 0,
(/)3(Y) = y, y rational,
a:; -y, y irrational.
BEC. 21 MAPPING THEOREMS AND EXTREMUM PROBLEMS S69
(21.12)
fp (~1, .., ~p, 771, .., 77q) = O.
Here it is Wlderstood that the system of equations is satisfied for
h = 0, ..., 77q = 0, and it is desired to solve for the ~, in terms of the
77 j, at least when the latter are sufficiently small. The hypotheses to be
made amount to assuming that the partial derivatives of the functions
ii, with respect to the p + q arguments, are continuuos near zero, and
that the Jacobian of the f, with respect to the ~i is not zero when ~i = 0,
260 CH. V DIFFERENTIATION
(21.13)
~P = rpp('T'fl' 0 0 ., "fJq),
Extremum Problems
The use of the derivative to determine the relative maximum and
relative minimum points of a function on R to R is well-known to
students of calculus. In the Interior Maximum Theorem 19.4, we have
presented the main tool in the case where the relative extreme is taken
at an interior point. The question as to whether a critical point (that is,
a point at which the derivative vanishes) is actually an extreme point
is not always easily settled, but can often be handled by use of Taylor's
Theorem 19.9. The discussion of extreme points which belong to the
boundary, often yields to application of the Mean Value Theorem 19.6.
In the case of a function with domain in Rp, p > 1, and range in R,
the situation is more complicated and each function needs to be examined
in its own right since there are few general statements that can be made.
However, the next result is a familiar and very useful necessary condition.
21.13 THEOREM. Let f be a function with domain 5) in R p and with
range in R. If c is an interior point of 5) at which f is differentiable and has
a relative extremum, then Df(e) = O.
PROOF. By hypothesis, the restriction of f to any line passing through
e will have an extremum at c. Therefore, by the Interior Maximum
Theorem 19.4, any directional derivative of f must vanish at c. In
particular,
af af
(21.17) - (c) = 0, ..., - (c) = 0,
ah a~p
It is clear that if one of these partial derivatives of fat c is not zero, then
Df(c) maps Rp onto all of R. According to the Local Solvability Theorem
SEC. 21 MAPPING THEOREMS AND EXTREMUM PROBLEMS 263
Figure 21.1
CH. v DIFFERENTIATION
(21.18)
evaluated at W = (Wi" . , W p ).
(a) If D2f(c)(w) 2 > ofor all w ~ ()inRp,thenfhasarelativeminimum
at c.
(b) If D2f(c)(w) 2 < 0 for all w ~ () in Rp, thenf has a relative maximum
at c.
(c) If D2f(c)(W)2 takes on both positive and negative values for w in
R P, then c is a saddle point of f.
PROOF. (a) If D2f(c) (W)2 > 0 for points in the compact set {w E Rp:
Iwl = I}, then there exists a constant m > 0 such that
D2f(c) (W)2 > m for Iwl = 1.
The preceding result indicates that the nature of the critical point c
is determined by the quadratic function given in (21.18). In particular,
it is of importance to know whether this function can take on both
positive and negative values or whether it is always of one sign. An
SEC. 21 MAPPIKG THEOREMS AND EXTREMUM PROBLEMS 265
Hence the sign of Q is the same as the sign of A. On the other hand, if
Ll = AC - B2 < 0, then we shall see that Q has both positive and
negative values. This is obvious if A = C = 0. If A ~ 0, we can com-
plete the square in Q as above and observe that the quadratic function
Q has opposite signs at the two points (~, '17) = (1,0) and (B, -A). If
A = 0 but C ~ 0, a similar argument can be given.
\Ve collect these remarks pertaining to a function on R2 in a formal
statement.
Until now we have been discussing the case where the extrema of the
real-valued function f belong to the interior of its domain ~ in Rp. None
of our remarks apply to the location of the extrema on the boundary.
However, if the function is defined on the boundary of ~ and if this
boundary of X> can be parametrized by a function cp, then the extremum
problem is reduced to an examination of the extrema of the composition
f 0 cpo
There is a related problem which leads to an interesting and elegant
procedure. Suppose that S is a surface contained in the domain :D of
the real-valued function f. It is often desired to find the values of f that
are maximum or minimum among all those attained on S. For example,
if :D = 'Rp and f(x) = lxi, then the problem we have posed is concerned
with finding the points on the surface S which are closest to (or farthest
from) the origin. If the surface S is given parametrically, then we can
treat this problem by considering the composition of f with the para-
metric representation of S. However, it frequently is not convenient to
express S in this fashion and another procedure is often more desirable.
Suppose S can be given as the points x in 5.) satisfying a relation of
the form
g(x) = 0,
for a function g defined on X> to R. We are attempting to find the relative
extreme values of j for those points x in X> satisfying the constraint
(or side condition) g (x) = 0. If we assume that j and g are in Class C'
in a neighborhood of a point c in :D and that Dg (c) ~ 0, then a necessary
condition that c be an extreme point of j relative to points X satisfying
g(x) = 0, is that the derivative Dg(c) is a multiple of Dj(c). In terms of
partial derivatives, this condition is that there exists a real number A
such that
af ag
- (c) = A- (c),
ah a~l
8j (c) = A 8g (c).
a~p a~p
are then solved for the p + 1 unkno"n quantities, of which the co-
ordinates of e are of primary interest.
We shall now establish this result.
21.16 LAGRANGE'S METHOD. Let f and g be in Class C' on a neigh-
borhood of a point e in Rp and with values in R. Suppose that there exists a
neighborhood of e such that fex) > fCc) or fex) < fCc) for all points x in
this neighborhood which also satisfy the constraint g Cx) = O. If Dg (c) : 0,
then there exists a real number). such that
Df(c) = ADaee).
PROOF. Let F be defined on 5) to R2 by
F(x) = (J(x), g(x.
It is readily seen that F is in Class C' on a neighborhood of c and that
DF ex)(w) = (Df(x) (w), Dg(x)(w
for each x in this neighborhood and for 1.0 in Rp. Moreover, an element x
satisfies the constraint g(x) = 0 if and only if F(x) = (f(x),O).
Now suppose that c satisfies the constraint and is a relative extremum
among such points. To be explicit, assume that f(x) < fee) for all points
x in a neighborhood of e which also satisfy g(x) = O. Then the derivative
DfCe) does not map Rp onto all of R2. For, if so, then the Local Solva-
bility Theorem 21.8 implies that for some E > 0 the points (~, 0) with
. f(c) < ~ < f(c) + E are images of points in a neighborhood of c, contrary
to hypothesis. Therefore, DF(c) maps Rpinto a line in R2. By hypothesis
Dg(e) 0, so that DF(c) maps Rp into a line R2 which passes through
a point (A, 1). Therefore, we have DfCe) = A Dg(c).
Q.E.D.
-
of (c) WI + ... + -of (c) Wp =
[d
g
A -- (c) WI + ... + -Og]
ee)Wp
oh o~p a~l iJ~p
Exercises
21.A. Let! be the mapping of R2 into R2 which sends the point (x, y) into the
point (u, v) given by
u =x + y, v = 2x + ay.
Calculate the derivative Df. Show that D! is one-one if and only if it maps R2
onto R2, and that this is the case if and only if a ~ 2. Examine the image of the
unit square 1(x, y):O ~ x ~ 1, 0 ~ y:S; 11 in the three cases a = 1, a = 2,
a = 3.
SEC. 21 MAPPING THEOREMS AND EXTREMUM PROBLEMS 269
21.B. Let f be the mapping of R2 into R2 which sends the point (x, y) into
the point (u, tJ) given by
u = x, v == xy.
Draw some curves u = constant, v = constant in the (x, y)-plane and some
curves x = constant, Y = constant in the (u, v)-plane. Is this mapping one-one?
Does f map onto all of R2? Show that if x 7'~ 0, thenf maps some neighborhood
of (x, y) in a one-one fashion onto a neighborhood of (x, xy). Into what region
in the (u, v)-plane does f map the rectangle {(x, y): 1 < x < 2, 0 < Y < 21?
What points in the (x, y)-plane map under f into the rectangle { (u, v) : 1 < u < 2,
0<v<2]?
21.C. Letfbe the mapping of R2 into R2 which sends the point (x, y) into the
point (u, v) given by
1~ = 2xy.
What curves in the (x, y)-plane map under f into the lines u = constant, v =
constant? Into what curves in the (u, v)-plane do the lines x = constant,
y = constant map? Show that each non-zero point (u, v) is the image under f
of two points. Into what region does f map the square {(x, y):O < x < 1,
o < Y < 1 J? What region is mapped by f imto the square I (u, v): 0 < u < 1,
O<v<l\?
21.D. Let f be the mapping in the preceding exercise. Show that f is locally
one-one at every point except e = (0,0), but f is not one-one on R2.
21.E. Letfbe in Class C' on Rp onto Rq and suppose thatfhas an inverse. Is
it true that for each x in Rp, then Df(x) is a one-olle linear function which maps
R" onto Rq?
21.F. Let f be defined on R to R by
f(x) = x + 2x 2
sin (l/.x), x ~ 0,
= 0, x = O.
Then Df(O) is one-one but f has no inverse near x = O.
21.G. Letfbe a function on Rp to Rp which is differentiable on a neighborhood
of a point e and such that Df(e) has an inverse. Then is it true thatf has an
inverse on a neighborhood of e?
21.H. Let f be a function on Rp to Rp. If f is differentiable at e and has a
differentiable inverse, then is it true that Df(e) is one-one?
21.1. Suppose that f is differentiable on a neighborhood of a point e and that
if f: > 0 then there exists o(e) > 0 such that if Ix - el < o(e), then JDf(x) (z) -
DfCc)(z) I :S elzl for all z in Rp. Prove that the partial derivatives of f exist and
are continuous at c.
21.J. Suppose that Lo is a one-one linear function on R" to Rq. Show that
there exists a positive number a such that if L is a linear function on Rp to Rp
satisfying
21.K. Suppose that Lo is a linear function on Rp with range all of Rq. Show
that there exists a positive number (3 such that if L is a linear function on Rp
into Rq satisfying
IL(z) - Lo(z)1 < ~Izl for
then the range of L is Rq.
21.L. Letj be in Class C' on a neighborhood of a point e in Rp and with values
in Rp. If Df(e) is one-one and has range equal to Rp, then there exists a positive
number 0 such that if Ix - el < 0, then Dj(x) is one-one and has range equal
to Rp.
21.M. Let f be defined on R2 to R2 by j(x, y) = (x cos y, x sin y). Show that
if Xo > 0, then there exists a neighborhood of (xo, Yo) on which! is one-one, but
that there are infinitely many points which are mapped into j(Xo, Yo).
21.N. Let F be defined on R X R to R by F(x, y) = x 2 - y. Show that F is
in Class C' on a neighborhood of (0,0) but there does not exist a continuous
function cp defined on a neighborhood of 0 such that F [cp(y), y] = O.
21.0. Suppose that, in addition to the hypotheses of the Implicit Function
Theorem 21.11, the function F has continuous partial derivatives of order n.
Show that the solution function cp has continuous partial derivatives of order n.
21.P. Let F be the function on R2 X R2 to R2 defined for x = (~1, ~2) and
y = C'% 112) by the formula
F(x, y) = (~13 + ~2111 + 112, ~1112 + ~23 - 111),
At what points (x, y) can one solve the equation F(x, y) = (J for x in terms of y.
Calculate the derivative of this solution function, when it exists. In particular,
calculate the partial derivatives of the coordinate functions of cp with respect to
111, 112.
21.Q. Let f be defined and continuous on the set ~ = {x E Rp: Ixl < 1} with
values in R. Suppose thatj is differentiable at every interior point of ~ and that
J(x) = 0 for alllxl = 1. Prove that there exists an interior point e of ~ and that
Df(e) = 0 (This result may be regarded as a generalization of Rolle's Theorem.)
21.R. If we define f on R 2 to R by
f(~, 11) = P + 4~11 + 11 2
,
then the origin is not a relative extreme point but a saddle point of J.
21.S. (a) Let f1 be defined on R2 to R by
fl(~, 11) = ~4 + 114,
then the origin (J = (0, 0) is a relative minimum of jl and .i = 0 at 8. (Here
.i = !U!T/., - h.,2.)
(b) If f2 = -fl' then the origin is a relative maximum of /2 and .i = 0 at (J.
(c) If fa is defined on R2 to R by
f3(~' 11) = E4 - 114,
then the origin (J = (0,0) is a saddle point of hand .i = 0 at O. (The moral of
this exercise is that if .i = 0, then anything can happen.)
SEC, 21 MAPPING THEOREMS AND EXTREMUM PROBLEMS 271
21.T. Letfbedefinedon~ = {(~, 71) E R~:t > 0, 7} > 0l toR by the formula
1
f(~, 71) = - + -1 + C~'7/.
~ '7/
Locate the critical points of f and determine whether they yield relative maxima,
relative minima, or saddle points. If c > Hnd we set
~l = {(E, TJ): > 0, TJ > 0, + 7} < c},
then locate the relative extrema of f on ~l.
21.U. Suppose we are given n points (~j, 71 i) in R2 and desire to find the linear
function F(x) = Ax + B for which the quantity
n
L [F(~j) - 71;]2
;'=1
n n
A L
i =1
~j + nB = L
}=1
TJiJ
for the numbers A, B. This linear function is referred to as the linear function
which best fits the given n points in the sense of least squares.
21.V. Let f be defined and continuous on the set!) = {x E Rp : Ixl < 11
with values in R. If f is differentiable at every interior point of !) and if
p
L fWi(X)
i=1
==
for alllxl < 1, thenf is said to be harmonic in ~. Suppose that f is not constan t
and that f does not attain its supremum on C = I x : Ixl = I} but at a point c
interior to !), Then, if E > is sufficiently small, the function g defined by
y(x) = f(x) + fix - cl%
does not attain its supremum on C but at some interior point c'. Since
(b) Show that the geometric mean of a collection of non-negative real numbers
{aI, a2, ..., an I does not exceed their arithmetic mean; that is,
21.Z. (a) Let p > 1, q > 1, and ~ +! = 1. Show that the minimum of
p q
[Hint: let A = (:E alY/P, B = (:L bjq)l/q and apply the inequality in (b)
to a= ai/A, b = bJB.]
Cd) Kate that
\a + bl p
= la + blla + bl p/q < lalla + b!p/q + Iblla + bl p/q
SEC. 21 MAPPING THEOREMS AND EXTREMUM PROBI,EMS 2,3
Project
21.a. This project yields a more direct proof of the Inversion Theorem 21.10
(and hence of the Implicit Function Theorem) than given in the text. It uses
ideas related to the Fixed Point Theorem for contractions given in 16.14.
(a) If F is a contraction in Rp with constant C and if F (0) = 0, then for each
element y in Rp there exists a unique element x in Rp such that x + F(x) = y.
Moreover, x can be obtained as the limit of the sequence (x,,) defined by
Xl = Y, X"+l = Y - F(xn ), n E N.
(b) Let F be a contraction on {x E Rp: Ixl < B I with constant C and let
F(e) = o. If Iyl :s; B(l - C), then there exists a unique solution of the equation
x + F(x) = y with Ixl < B.
(c) Iff is in Class C' on a neighborhood of 11 and if L = Df(e), use the Approxi-
mation Lemma 21.3 to prove that the function H defined by H(x) ,= f(x) - Lex)
is a contraction on a neighborhood of O.
(d) Suppose that f is in Class C' on a neighborhood of e, that f(O) = 0, and
that L = Df(O) is a one-one map of Rp onto all of Rp. If 111 = L-l, show that the
function F defined by F(x) = M[f(x) - L(x)] is a contraction on a neighborhood
of O. Show also that the equationf(x) = y is equivalent to the equation x + F(x)
= M(y).
(e) Show that, under the hypotheses in (d), there is a neighborhood U of 0
such that V = feU) is a neighborhood of 0 == fee), f is a one-one mapping of U
onto V, and f has a continuous inverse function g defined on V to U. (This is the
first assertion of Theorem 21.10.)
VI
Integration
274
BEC. 22 RIEMANN-STIELTJES INTEGRAL 175
I
I I I
I I I
I I I
I I I I
I I
~ I I
I I I I
I I I I
~2 ~3 tA ~n
X3 Xk Xn - 1 xn =b
val, then the sum S(Q;j, g) yields the same value as the sum in (22.1).
We could always assume that the partition divides -the interval into an
even number of subintervals and the intermediate points are altenw.tely
the right and left end points of these subintervals. However, we shall
not find it necessary to require this" standard" partitioning process, nor
shall we find it necessary to display these intermediate points.
22.2 DEFINITION. We say thatjis integrable with respect to g on J
if there exists a real number I such that for every positive number E
there is a partition PI of J such that if P is any refinement of PI and
S(P;j,g) is any Riemann-Stieltjes sum corresponding to P, then
I = f f dg = f f(t) dg(t);
g(x) = 0, x = a,
= 1, a < x <b.
We leave it as an exercise to show that a function j is integrable with
respect to g if and only if j is continuous at a and that in this case the
value of the integral is f(a).
By taking more points we can obtain a sum involving the values of fat
points in J, weighted by the values of the jumps of g at these points.
(g) Let the function f be Dirichlet's discontinuous function [of.
Example 15.5 (g)] defined by
f(x) = 1, if x is rational,
= 0, if x is irrational,
and let g(x) = x. Consider these functions on I = [0, 1]. If a partition
P consists of n equal subintervals, then by selecting k of the intermediate
points in the sum S (P; f, g) to be rational and the remaining to be irra-
tional, S(P;f, g) = kin. It follows that f is not Riemann integrable.
(h) Let f be the function defined on I by f(O) = 1, f(x) = 0 for x
irrational, and f(mln) = lin when m and n are natural numbers with
SEC. 22 RIEMANN-STIELTJES INTEGRAL 279
no common factors except 1. It was seen in Example l5.5(h) that j is
continuous at every irrational number and discontinuous at every rn-
tional number. If g(x) = x, then it is an exercise to show that f is inte-
grable with respect to g and that the value of the integral is 0.
22.4 CAUCHY CRITERION FOR INTEGRABILITY. The function f ~is
integrable with respect to g over J = [a, b] if and only if for each positive
real number there is a partition QE of J such that if P and Qare refinements
o! Qf and ij S(P;j, g) and S(Q;j, g) are any corresponding Riemann-
Stieltjes sums, then
(22.4) IS(P;!, g) - SeQ;!, g)1 < E.
PROOF. If f is integrable, there is a partition P E such that if P, Q are
refinements of PE' then any corresponding Riemann-Stieltjes sums
satisfy IS(P;j, g) - II < E/2 and IS(Q;!, g) - II < E/2. By using the
Triangle Inequality, we obtain (22.4).
Conversely, suppose the criterion is satisfied. To show that j is inte-
grable with respect to g, we need to produce the value of its integral and
use Definition 22.2. Let Q1 be a partition of J such that if P and Q are
refinements of Q1, then IS(P;j, g) - S(Q;f, g)1 < 1. Inductively, we
choose Qn to be a refinement of Qn-1 such that if P and Q are refinements
of Qn, then
C22.5) IS(Pif, g) - S(Q;j, g)1 < lin.
Consider a sequence (S(Qn;j, g) of real numbers obtained in this
way. Since Qn is a refinement of Qm when n > m, this sequence of sums
is a Cauchy sequence of real numbers, regardless of how the intermediate
points are chosen. By Theorem 12.10, the sequence converges to some
real number L. Hence, if E > 0, there is an integer N such that 2/N < E
and
\S(QN;!, g) - LI < E/2.
If P is a refinement of QN, then it follows from the construction of QN
that
IS(P;j, g) - SCQN;!, g)[ < liN < E/2.
Hence, for any refinement P of QN and any corresponding Riemann-
Stieltjes sum, we have
(22.6) IS(Pi!, g) - LI < ,
This shows that f is integrable with respect to g over J and that the
value of this integral is L.
Q.E.D.
280 CH. VI INTEGRATION
(b) Iff is integrable with respect to and (/2 on J and a, 13 are real num-
(Jl
bers, then f is integrable with respect to g = a(/l + {3g2 on J and
still hold. When the same intermediate points are used, we evidently
have
S (Q; afl + f3f2, g) = as (Q ; h, g) + f3S (Q ; f2, g).
It follows from this and the preceding inequalities that
laI l + (31 2 - SeQ; afl + (3fZJ (J)! =
la{It - S(Q;fl, g) l + (3{Iz - S(Q;f2, g) II < (Ia[ + 1f3I)E.
This proves that all + {31 '1 is the integral of aft + {3Jz with respect to g.
This establishes part (a); the proof of part (b) is similar and will be left
to the reader.
Q.E.D.
There is another useful additivity property possessed by the Riemann-
Stieltjes integral; namely, with respect to the interval over which the
integral is extended. (It is in order to obtain the next result that we
employed the type of limiting introduced in Definition 22.2. A more
restrictive type of limiting would be to require inequality (22.3) for any
Riemann-Stieltjes sum corresponding to a partition P = (xo, Xl, .. 0' x n )
which is such that
IIPII = sup {Xt - XO J X2 - Xt, 0' Xn - xn-d < O(E).
SEC. 22 RIEMANN-STIELTJES INTEGRAL 28l
of [a, bI formed by using the partition points in both P/ and P/', and if
P is a refinement of PEl then
B(P;f, g) = B(P';f, g) + B(P";/, g),
where P', P" denote the partitions of [a, c], [c, b] induced by P and where
the corresponding intermediate points are used. Therefore, we have
f.' f dg + t f dg - S(P;f, g)
(b) We shall use the Cauchy Criterion 22.4 to prove thatjis inte-
grable over [a, cI. Since f is integrable over [a, bI, given E > 0 there is a
partition Qe of [a, b] such that if P, Q are refinements of QE, then relation
(22.4) holds for any corresponding Riemann-Stieltjes sums. It is clear
that we may suppose that the point c belongs to QE, and we let Q/ be
the partition of [a, c] consisting of those points of Qe which belong to
la, c]. Suppose that P' and Q' are partitions of la, c] which are refine-
282 CR. VI INTEGRATION
Thus far we have not interchanged the roles of the integrand f and
the integrator g, and it may not have occurred to the reader that it
might be possible to do so. Although the next result is not exactly the
same as the "integration by parts formula" of calculus, the relation is
close and this result is usually referred to by that name.
22.7 INTEGRATION BY PARTS. A function f is integrable with respect
to g over [a, b] if and only if g is integrable with respect to j over [a, b].
In this case,
S(P; g, f) given by
n
S(P; g,f) :r= L g(hHf(Xk)
k=l
- f(Xk-l)},
where Xk-l < ~k < Let Q = (Yo, Yl, ..., Y2n) be the partition of
Xk.
sup {Xk - xk-d < aCE) and let Q = (Yo, Yl, ..., Ym) be a refinement of
P,; we shall estimate the difference Sept;!, g) - SeQ;!, g). Since every
point in P, appears in Q, we can express these Riemann-Stieltjes sums
in the form
m
S(Pf;!, g) = 2:!(~k){g(Yk) - g(Yk-l) L
k =1
m
However, this need not be the case even for very nice functions.
286 CH. VI INTEGRATION
22.11 EXAMPLE. Let J = [0, 1], let g(x) = x, and let fn be defined
for n > 2 by
fn(X) = n 2x, 0< x < lin,
= -n 2 (x - 2/n), l/n <x< 2/n,
= 0, 2/n <x< 1.
n > 2.
1. 1
f(x) dx = O.
Therefore, equation (22.15) does not hold in this case even though both
sides have a meaning.
Since equation (22.15) is very convenient, we inquire if there are any
simple additional conditions that will imply it. We now show that, if
the convergence is uniform, then this relation holds.
22.12 THEOREM. Let g be a morwto-ne increasing function on J and
let (In) be a sequence of junctions which are integrable with respect to g over
SEC. 22 RIEMANN-STIELTJES INTEGRAL 287
Then the set E = {x E J:j (x) > a/31 contains a finite number of intervals
of total length exceeding a/ (31 if Ii).
PROOF. Let P be a partition of J = [0, 1] such that if S (P; f) is any
Riemann sum corresponding to P, then JS(P;f) - al < a/3. Hence
2a/3 < S(P;j). Now select the intermediate points to makej(Ej) < a/3
whenever possible and break S (P ;f) into a sum over (i) subintervals
contained in E, and (ii) subintervals which are not contained in E. Let
L denote the sum of the lengths of the subintervals (i) contained in E.
Since the contribution to the Riemann sum made by subintervals (ii) is
less than a/3, it follows that the contribution to the Riemann sum made
by subintervals (i) is bounded below by a/3 and above by Ilfll L. There-
fore, L > a/(31Ifl!), as asserted.
Q.E.D.
PROOF. It follows from Theorem 22.5 (a) and Theorem 22.8 that Gis
a linear function on CR(J) and from Lemma 22.10 that G is bounded by
M = g(b) - g(a). If f belongs to C R (J) and f(x) > 0 for x E J, then
taking m = 0 in formula (22.14) we conclude that GU) > O.
Q.E.D.
We shall now show that, conversely, every bounded positive linear
functional on CR (J) is generated by the Riemann-Stieltjes integral with
respect to some monotone increasing function g. This is a form of the
celebrated "Riesz Representation Theorem," which is one of the key-
stones for the subject of "functional analysis" and has many far-reaching
generalizations and applications. The theorem was proved by the great
Hungarian mathematician Frederic Riesz.t
22.17 RIESZ REPRESENTATION THEOREM. If G is a bounded positive
linear functional on CR (J), then there exists a monotone increasing function
g on J such that
(22.17)
t FREDERIC Rmsz (1880-1955), [\, brilliant Hungarian mathematician, was one of the
founders of topology and functional analysis. He also made beautiful contributions
to potential, ergodic, and integration theory.
SEC. 22 RIEMANN-8TIELTJES INTEGRAL 291
1----_
a t t + lIn b
whence it follows that get) < y(s). We define g(a) = 0 and if f1Jb,n denotes
the function ~.n(X) = 1, x E J, then we set g(b) = G(fIJb,n). If a < t < b
and n is sufficiently large, then for all x in J we have
so that g(a) = 0 < G(fI't.n) < G(~.n) = g(b). This shows that g(a) <
get) < g(b) and completes the construction of the monotone increasing
function g.
If f is continuous on J and E > 0, there is a aCE) > 0 such that if
Ix - yl < aCE) and x, y E J, then I/(x) - f(y)1 < E. Sincefis integrable
with respect to g, there exists a partition p. of J such that if Q is a refine-
ment of P e, then for any Riemann-Stieltjes sum, we have
a tk tk+1/n b
t"_1 + lin
f*
I
f'
I
I
I
I
I
I
I
t
I
t
r
I
I
I
I
I
I
I
I
I
I
But the second term on the left side is a Riemann-Stieltjes sum S (P;!, g)
for f with respect to g corresponding to the partition P which is a refine-
ment of Pe. Hence we have
Since E is an arbitrary positive number and the left side of (22.23) does
not depend on it, we conclude that
G(f) ~ t fdg.
Q.E.D.
For some purposes it is important to know that there is a one-one
correspondence between bounded positive linear functionals on CR (J)
and certain normalized monotone increasing functions. Our construction
can be checked to show that it yields an increasing function g such that
g(a) = 0 and g is continuous from the right at every interior point of J.
With these additional conditions, there is a one-one correspondence
between positive functionals and increasing functions. (In some applica-
tions it is useful to employ other normalizations, however.)
Exercises
10' fdg = fm
22.B Show that the function !, given in Example 22.3(h) is Riemann inte-
grable on I and that the value of its integral is O.
22.C. Show that the function f, defined on I by
f(x) = X, X rational,
:=:: 0, x irrational,
is not Riemann integrable on I.
22.D. If P = (Xo, Xl, , Xn) is a partition of J = [a, b], let IIPII be defined
to be
IIP]1 = sup {Xi - Xi-I: j = 1,2, ..., n};
we call IIPII the nonn of the partition P. Define I to be (*)-integrahle with
respect to g on J in case there exists a number A with the property: if e > 0
then there is a o(e) > 0 such that if IIPII < o(e) and if S(P; I, g) is any corre-
sponding Riemann-Stieltjes sum, then IS(P; f, g) - Al < e. If this is satisfied
the number A is called the (*)-integral of I with respect to g on J. Show that if
I is (*)-integrable with respect to g on J, then I is integrable with respect to g
(in the sense of Definition 22.2) and that the values of these integrals are equal.
SEC. 22 RIEMANN-STIELTJES INTEGRAL ~95
22.N. Let f be Riemann integrable on J and let f(x) > 0 for x E J. Show that
296 CR. VI INTEGRATION
(Hint: for each n E N, let H n be the closure of the set of points x in J such that
f(x) > lin and apply Baire's Theorem 9.8.)
22.0. If f is Riemann integrable on I and if
1 n
an = - L f(kln) for n E N,
n k=l
then the sequence (an) converges and
(The notation here is the same as in the preceding exercise.) Show that if (j,,)
converges in mean to f, then
f.n(X) = 1, x E In,
= 0,
Prove that the sequence (fn) converges in mean ["'lith respect to g(x) = x] to
the zero function on J, but that the sequence (fn) does not converge uniformly.
Indeed, the sequence (fn) does not converge at any point!
22.U. Let g be monotone increasing on J = [a, b]. If i and h are integrable with
respect to g on J to R, we define the inner product (f, h) of 1 and h by the
formula
(see Theorems 7.6 and 7.7). Show that the Norm Properties 7.8 hold, except
that 1I/IIz = 0 does not imply that I(x) = 0 for all x in J. Show that IIfll1 ~;
19(b) - g(a) P1211f112.
22."'~. Let 1 and in, n EN, oe integrable on J with respect to an increasing
function g. We say that the sequence (in) converges in mean square (l\Tith
respect to g on J) to f if lifn - 11\2 -+ o.
(a) Show that if the sequence is uniformly cOllvergent on J, then it also
converges in mean square to the same function.
(b) Show that if the sequence converges in mean square, then it converges
in mean to the same function.
(c) Show that Exercise 22.T proves that convergence in mean square does
not imply convergence at any point of J.
(d) If, in Exercise 22.T, we take Into have length 1/n 2 and if we set hn = nf11'
then the sequence (h n ) converges in mean, but does not converge in mean square,
to the zero function.
298 CH. VI INTEGRATION
Projects
22.a. The following outline is sometimes used as an approach to the Riemann-
Stieltjes inte:!;ral when the integrator function g is monotone increasing on the
interval J. [This development has the advantage that it permits the definition
of upper and lower integrals which always exists for a bounded functionj. How-
ever, it has the disadvantage that it puts an additional restriction on g and tends
to blemish somewhat the symmetry of the Riemann-Stieltjes integral given by
the Integration of Parts Theorem 22.7.] If P = (xo, Xl, .., x n ) is a partition of
J = [a, b] and j is a bounded function on J, let mil 111 i be defined to be the in-
fimum and the supremum of If(x) : Xi-l S; x < Xi), respectively. Corresponding
to the partition P, define the lower and the upper sums of j with respect to g
to be
n
L(P;j, g) = L mj(g(xj) - g(Xi-l)},
;"=1
n
U(P;f, g) = L .ilfj(g(Xj) - g(Xi-l)}.
j =1
(e) Define the lower and the upper integral of 1 with respect to g to be,
respectively
L(I, g) = sup {L(Pil, g)},
U(j, g) = inf {U(Pil, g)};
here the supremum and the infimum are taken over all partitions P of J. Show
that L(j, g) ~ U(j, g).
(f) Prove that I is integrable with respect to the increasing function g if and
only if the lower and upper integrals introduced in (e) are equal. In this oase the
common value of thlilse integrals equals
{fdg,
(g) If 11 and 12 are bounded on J, then the lower and upper integrals of i1 + It
satisfy
L(iJ + 12, g) > L(jl, g) + L(l2' g),
U(fl + 12, g) < U(jl, g) + U(h, g).
Show that strict inequality can hold in these relations.
22.~. This project develops the well-known Wallist product formula. Through-
out it we shall let
1r/ 2
S.. =
1.0 (sin x) .. dx.
t JOHN WALLIS (1616-1703), the Savilian professor of geometry at Oxford for sixty
years, was a precurser of Newton. He helped to lay the groundwork for the develop-
ment of calculmJ.
300 CH. VI INTEGRATION
22..". This project develops the important Stirlingt formula, which estiIll3tes
the magnitude of n!
(a) By comparing the area under the hyperbola y = l/x and the area of a
trapezoid inscribed in it, show that
2n
2
+1
< log (1 + !).
n
From this, show that
e < (1 + 1/n)n+1/2
(b) Show that
Un = (n/e)n, vn < 1, n E N.
n.
(d) Show that the sequence (Un) is monotone increasing. (Hint: consider
Un+I/Un.)
(e) By considering Un 2/U2n and making use of the result of part (e) of the
preceding project, show that lim (Un) = (211")-1/2.
(f) Obtain Stirling's formula
If g(b) = g(a), then the relation (23.1) is trivial; if g(b) > g(a), then it
follows from Bolzano's Intermediate Value Theorem 16.4 that there
exists a number c in J such that
F (c + h) - F (e) = f.
a
C+h
f dg -
f.c f dg
a
C+h
= f.C f dg = f(Cl){g(C + h) - g(c)},
for some Cl with C < Cl < c + h. A similar relation holds if h < O. Since
f is continuous and g has a derivative at c, then F' (c) exists and equals
j(c)g' (c).
Q.E.D.
F a(X) = f.z f.
The preceding theorem asserts that Fa' = f on J. If F is such that F' = f,
then it follows from the Mean Value Theorem 19.6 (in particular,
Consequence 19.10(ii) that there exists a constant 0 such that
F(x) = Fa(x) + 0, x E J.
Since h(x) > 0, it is seen that g is increasing and it follows from the
Differentiation Theorem 23.2 that g' = h. By Theorem 23.4, we conclude
that
{Id {fh' g=
\
and from the First Mean Value Theorem 23.1, we infer that for some c
in J, then
{ I dg = I(c) { h.
Q.E.D.
{fh =/(b) t h.
f g dl = g(cllf(b) - f(a)}.
SEC. 23 THE MAIN THEOREMS OF INTEGRAL CALCULUS 305
g(x) = !.:t: h,
so that g' = h. The conclusion then follows from part (a) by usinl~
Theorem 23.4. To prove (c) define F to be equal to f for x in (a, b] and
define F(a) = O. We now apply part (b) to F.
Q.E.D.
Part (c) of the preceding theorem is frequently called the Bonnett form
of the Second Mean Value Theorem. It is evident that there is a corre-
sponding result for a decreasing function.
Change of Variable
We shall now establish a theorem justifying the familiar formula re-
lating to the" change of variable" in a Riemann integral.
23.8 CHANGE OF VARIABLE THEOREM. Let l{J be rlefined on an interval
[a, (3] to R with a continuous derivative and suppose that a = ~(a) < l)
= l{Je/3). If f is continuous on the range of l{J, then
Q.E.D.
F(t) - F (to)
-..;...:.---....:..-..;.. -
f.b f ( to ) dx
t X,
t - to a
(/)'(t) = HuU3(t), aCt), t]{3/(t) + H v [{3(t), aCt), tla'(t) + H t [{3(t), aCt), t].
According to the Differentiation Theorem 23.2,
We can write the integral on the left of (23.14) in the form of the sum
t t 1t._1[tk t( JX/_I(%'
k-l J"=l
f(x, t) dX} dt.
Applying the First Mean Value Theorem 23.1 twice, we infer that there
exists a number x / in [x i-I, x j] and a number tk' in [lk-l, tk] such that
Hence we have
j,d{f.b I(x, t) dX} dt tl tl f(x;', t/) (x i-X ;-1) (tk - tk- 1).
=
fb { (d f(x, t) dt} dx =
Ja Jc
t t
k=1 ;=1
f(x/" t/')(Xj - Xj-l)(t k - tk-r).
Since both X;' and X/' belong to [x ;-1, Xi] and tk', tt/' belong to [tk-l, tk],
we conclude from the uniform continuity of f that the two double sums,
and therefore the two iterated integrals, differ by at most e(b - a) (d - c).
Since e is an arbitrary positive number, the equality of these integrals is
confirmed.
Q.E.D.
The reader will recall Taylor's Theorem 19.9, which enables one to
calculate the value f(b) in terms of the values f(a),!' (a), ... , j<n-O (a)
and a remainder term which involves fC n ) evaluated at a point between
a and b. For many applications it is more convenient to be able to express
the remainder term as an integral involving j<n).
23.13 TAYLOR'S THEOREM. Suppose that f and its derivatives 1',
/", ...,fC n
) are continuous on [a, b] to R. Then
i' (a) j<n-l) (a)
feb) = f(a) + 11 (b - a) + ... + (n _ l)! (b - a)n-l + Rn,
where the remainder is given by
1 (b
(23.15) R.. =
(n - 1).
! L
wa
(b - t)n-l fC nJ (t) dt.
910 CH. VI INTEGRATION
Rn = 1 (b - t)n-1j(n-D (t)
(n - I)! { t... a
= -
f(n-1) (a) (b - a)n-l
(n - 1)!
+ 1
(n - 2)!
1
a
(b - t)n-2f(n-1l(t) dt.
Exercises
23.A. Does the First Mean Value Theorem hold if f is not assumed to be
continuous?
23.B. Show that the Differentiation Theorem 23.2 holds if it is assumed that
f is integrable on J with respect to an increasing function g, that f is continuous
at C, and that g is differentiable at c.
23.C. Suppose that f is integrable with respect to function g on J = [a, b]
and let F be defined for x E J by
F(x) = f.2: I,
does not have a derivative at some points of J. Can you find an integrable func-
tion I such that F is not continuous on J?
SEC. 23 THE MAIN THEOREMS OF INTEGRAL CALCULUS 311
F(x) ~ t f
is continuous on J. Moreover, F' (x) exists and equals f(x) except for at most a
finite number of points in J. Show that F' may exist at a point where f is dis-
continuous.
23.J. In the First Mean Value Theorem 23.5, assume that h is Riemann inte-
grable (instead of that h is continuous). Show that the conclusion holds.
23.K. Use the Fundamental Theorem 23.3 to show that if a sequence (fn) of
functions converges on J to a functionf and if the derivatives Un') are continuous
and converge uniformly on J to a function g, then i' exil!its and equals g. (This
result is less general than Theorem 19.12, but it is easier to establish.)
91~ CH. VI INTEGRATION
23.L. Let I be continuous on I = [0, 1], let fo = I, and let in+! be defined by
where M = sup flf(x)!:x E I}. It follows that the sequence (in) converges
uniformly on I to the zero function.
23.M. Let {TI' T2, . , Tn, .. } be an enumeration of the rational numbers in I.
Let In be defined to be 1 if x E {TI, ..., Tnl and to be 0 othen1:ise. Then in is
Riemann integrable on I and the sequence (fn) conyerges monotonely to the
Dirichlet discontinuous function I (which equals 1 on I (\ Q and equals 0 on
I\Q). Hence the monotone limit of a sequence of Riemann integrable functions
does not need to be Riemann integrable.
23.N. Let i be a non-negative continuous function on J = [a, b] and let
M = sup If(x):x E J}. Prove that if M n is defined by
(b }lln
Mn = {Ja [j(x)]ndx for nE N,
then M = lim (M n ).
23.0. If I is integrable with respect to g on J = [a, b], if ({J is continuous and
strictly increasing on [C, d], and if ",,(e) = a, ",,(d) = b, then fo "" is integrable
wi th respect to go"" and
is continuous on J 2.
23.Q. Let g be an increasing function on J 1 = [a, b] to R and for each fixed t
in J2 = [e, d], suppose that the integral
23.R. Let J 1 = [a, b] and J 2 = [c, d]. Assume that the real valued function g
is monotone on J 1 , that h is monotone on J 2 , and that! is continuous on J 1 X J 2
Define G on J2 and H on J 1by
23.S. Show that, if the nth derivative fen) is continuous on [a, b], then the
Integral Form of Taylor's Theorem 23.13 and the First Mean Value Theorem
23.5 can be used to obtain the Lagrange form of the remainder given in 19.9.
23.T. Let f be continuous on I = [0, 1] to R and define in on I to R by
Show that the nth derivative of fn exists and equals f. By induction, show that
the number of changes in sign of j on I is not ess than the number of changes
of sign in the ordered set
T(,,)(t) - 1.
b
f(x, t),,(x) dx.
23.V. Continuing the notation of the preceding exercise, show that if r > 0,
then T sends the collection
Projects
L(x) = !X ~ dt.
Hence L(1) = O. Prove that L is differentiable and that L'(x) = 1/x.
(b) Show that L(x) < 0 for 0 < x < 1 and L(x) > 0 for x > 1. In fact,
11111
:2 + "3 + ... +; < L(n) < 1 + :2 + .,. + n - 1 .
(e) Prove that L is a one-one function mapping P onto all of R. Letting e
denote the unique number such that L(e) = I, and using the fact that 1'(1) ::a: 1,
ahow that
SEC. 23 THE MAIN THEOREMS OF INTEGRAL CALCULUS 815
(f) Let r be any positive rational number, then
lim L(x) = o.
x-o+m x'
(g) Observe that
fox - dt
/,
l~dt
L(l + x) =
1
- =
t 0 1 +t
Write (1 + 0- 1
as a finite geometric series to obtain
n~l (_l)k-l
L(1 + x) = El k xk + R,,(x).
Show that IR,,(x) I < l/(n + 1) for 0 < x < 1 and
Ixl n +1
IR,,(x)1 < (n + 1)(1 + x)
for -1 < x < O.
23.{1. This project develops the trigonometric functions starting with an
integraL
(a) Let A be defined for x in R by
x dt
A(x) = foo -1 +-t. 2
T' = 1 + T2.
T
S = (1 + T2)1/2
Hence C is even and S is odd on (-1r/2, 1r/2). Show that C(O) = 1 and S(O) = 0
and C(x) --+ 0 and Sex) --+ 1 as x --+ 1r/2.
(d) Prove that C'(x) = -SCx) and S'(x) = C(x) for x in (-1r/2, 1r/2).
Therefore, both C and S satisfy the differential equation
h" +h = 0
on the interval (-1r /2, .../2).
316 CH. VI INTEGRATION
(e) Define C(1I'/2) = 0 and S(1l/2) = 0 and define C, S, T outside the in-
terval (-11'/2,11'"/2) by the equations
C(x + 11") = -C(x), sex + 11'") = -Sex),
T(x + 11'") = T(x).
If this is done successively, then C and S are defined for all R and have period 211'".
Similarly, T is defined except at odd ill ultiples of 11"/2 and has period 11'".
(f) Show that the functions C and S, as defined on R in the preceding part,
are differentiable at every point of R and that they continue to satisfy the
relations
C' = -8, S' = C
everywhere on R.
(c) In R~, the segment S = {(~, 0):0 < ~ < I} has zero content. In
fact, if E > 0, the single interval
J. = [0, 1] X [-E/2, E/2]
has content E and contains S.
(d) In the space RZ, the diamond-shaped set S = {(~,,,): I~I + 1,,1 = I}
is seen to have zero content. For, if we introduce intervals (here squares)
with diagonals along S and vertices at the points I~l = 1,,1 = kin, where
k = 0, 1, ..., n, then we easily see that we can enclose S in 4n closed
Figure 24.1.
intervals, each having content l/n2 , (See Figure 24.1.) Hence the total
content of these intervals is 4/n which can be made arbitrarily small.
(e) The circle S = {(~, "7) : ~2 + 7/2 <= 11 in R2 is seen to have zero
content. This can be proved by means of a modification of the argument
in (d).
(f) Let! be a continuo'ls function on J = la, b] to R. Then the graph
of ji that is, the set
G = {(~, JCO) E R2 : ~ E J},
has zero content in R2. This assertion can be proved by modifying the
argument in (d).
(g) The subset S of K~ which consists of all points (~, ,,) where both
~ u,nd 7) are rational numbers satisfying 0 < ~ < 1, 0 < 'IJ < 1 does not
have zero content. Althoui:h this set is countable, any finite union of
SEC. 24 INTEGRATION IN CARTESIAN SPACES 319
intervals which contains S must also contain the interval [0, 1] X [0, 1],
which has content equal to 1.
(h) The union of a finite number of sets with zero content has zero
content.
(i) In contrast to (f), we shall show that there are "continuous
curves" in R2 which have positive content. We shall show that there
exist continuous functions f, g defined on I to R such that the set
has positive content. To establish this, it is enough to prove that the set
S can contain the set I X I in R2. Such a curve is called a space-filling
curve or a Peano curve. We shall outline here the construction (due to
I. J. Schoenbergt) of a Peano curve, but leave the details as exercises.
Let "P be a continuous function on R to R which is even, has period 2,
and is such that
"P(t) = 0,
= 3t - 1,
1 << t << i,t
t
= 1, i<t<l.
(See Figure 24.2.) We define in and gn for n E N by
(~) "P(3t),
1
gl(t) = g,..(t) = g,..-l(t) + (2 ..) r.p(3 2n- 1t).
Since [1"Pll = 1, it is readily seen that the sequences (fn) and (gn) con~
verge uniformly on I to functions! and g, which are therefore continuous.
Figure 24.2.
t ISAAC J. SCHOENBERG (1903- ) was born in Roumania and educated there and in
Germany. Long at the University of Pennsylvania, he has worked in number thl:ory.
real and complex analysis, and the calculus of variations.
320 CH. VI INTEGRATION
To see that every point (x*, y*) with 0 < x* < 1, 0 < y* < 1, belongs
to the graph S of this curve, write x and y in their binary expansions:
y* = O./3t132IJa ...,
where an, (Jm are either 0 or 1. Let t* be the real number whose ternary
(base 3) expansion is
t* = O. (20:1) (2/31) (20:2) (2/32) ...
We leave it to the reader to show that f(t*) = x* and g(t*) = y*.
Figure 24.3.
(24.6) JJ !n i , or JJ!nfeX,y,Z)dXdydZ.
(24.8)
PROOF. This result follows directly from the observation that the
Riemann sums for a partition P of If satisfy the relation
(24.g) In f > O.
(24.10)
of P which contain points of D will also have total content less than E.
If M is a bound for f, then IS (P; f) I < ME, whence we obtain formula
(24.10).
Q.E.D.
(24.11)
In = L
g (j - h) = In!-In = In
h j.
Q.E.D
IT: ~<
.... -
c
- 25 ~
y.;;
E I_~ Ic>
~ ~
I(.J
If. ~~
l~.r
~\
,'-" ( .l <>-'
""I ~~
I~ E;r,;
I~
,-~J' 1111)
l/l 0<::' '2_], r v
Figure 24.4.
Similarly, if S" (P; f) and SIt (Q; f) denote the remaining portion of the
Riemann sums, then
IS"(P;f) - S"(Q;f)! < IS"(P;f) I + IS"(Q;f)\ < 2ME.
It therefore follows that
IS(P;!) - S(Q;!)I < dA (J) + 2M},
whence f is integrable over J.
Q.E.D.
The theorem just established yields the integrability of f over an
interval, provided the stated continuity condition is satisfied. We wish
to obtain a theorem which \vill imply the integrability of a function over
a subset more general than an interval. In order to obtain such a result,
the notion of the boundary of a subset is needed.
24.10 DEFINITION. If D is a subset of Rp, then a point x of Rp is
said to be a boundary point of D if every neighborhood of x contains
points both of D and its complement e(D). The boundary of D is the
subset of Rp consisting of all of the boundary points of D.
We generally expect the boundary of a set to be small, but this is
because we are accustomed to thinking about rectangles, circles, and
such forms. Example 24.2(g) shows that a countable subset in R2 can
have its boundary equal to I X I.
24.11 SECOND INTEGRABILITY THEOREM. Let D be a compact subset
of Rp and let! be continuous with domain D and range in Rq. If the
boundary of D has zero content, then! is integrable over D.
SEC. 24 INTEGRATION IN CARTESIAN SPACES 925
function defined by
fDCX) = 1, x E DUB,
=0, otherwise,
therefore, that a set D has zero content if and only if it has content and
A (D) = O. This justifies the simultaneous use of Definitions 24.1 and
24.12.
24.14 LEMMA. If Dl and D2 have content, then their union and inter-
section also have content and
(24.12)
In particular, if A (D l (\ D2 ) = 0, then
(24.13)
PROOF. By hypothesis, the boundaries B 1 and B2 of the sets D1 and
D2 have zero content. Since it is readily established that the boundaries
of D l (\ D 2 and D l V D 2 are contained in B l V B2, we infer from 24.2(h)
that the sets D1 (\ D2 and D 1 V D2 have content. In view of Lemma
24.13, we shall suppose that D1 and D2 are closed sets; hence Dl () D2
and D 1 V D 2 are also closed. Let iI, 12, f i and f u be the functions which
are equal to 1 on D 1, D2, D 1 (\ D2 and D1 V D2, respectively, and equal
to 0 elsewhere. Observe that each of these functions is integrable and
(24.14) r g = JDlr
JD
(J + r g.
JDs
PROOF. Define (II and g2 by
gl(X) = g(x), x E D1 g2(X) = g(x),
= 8, x ~ Dl =8,
SEC. 24 INTEGRATION IN CARTESIAN SPACES 327
r
JD
g1 = r
JDl
g1 = r
JD 1
g.
Moreover, except for x in the set D1 f\ D2, which has zero content, then
g(x) = f}l(X) + g2(X). By Lemma 24.8 and Theorem 24.5, it follows that
In g= l (g1 + g2) = l +l
g1 g2.
F(11/) = Lf(~ik*,rli*)(~k-h-l)'
1c =1
Figure 24.5.
where a and (3 are continuous functions on [c, d] with values in the interval
[a, b]. If f is continuous on A and has values in R, then f is integrable
on A and
f.
fJ(1j)
jet, 1]) d~.
a(1j)
!. ",(fj)f=
<p(a)
{fl (f0lp)lp'.
Ja
J iO(X} = det
Glp p (x) Glp p (x)
G~l G~p
does not vanish on G. It will be shown that if D is a compact subset of
G which has content, and if f is continuous on lp(D) to R, then lp(D)
has content and
(24.21)
It is easily seen from this that the boundary of q;(K) can be enclosed in
the union of a finite number of rectangles whose total content is arbitrar-
ily small. Hence q;(K) has content.
It is a little difficult to give an entirely satisfactory proof of the
remainder of this lemma, since we have not defined what is meant by
the determinant of a p X p matrix, One possible definition of the
absolute value of the determinant of a linear function is as the content
of the figure into which the unit cube
Ip = I X ... X I
is transformed. If this definition is adopted, then the case of a general
cube K is readily obtained from the result for Ip.
H the reader prefers another definition for the determinant of a
matrix, he can verify this result by noting that it holds in the case where
q; has the elementary form of multiplication of one coordinate:
({)1(6, .. 'J ~kJ .. 'J ~p) = (hJ ..., ch, ..., ~p),
addition of one coordinate with another:
q;2(6, ..., ~k, ..., tp) = (tl, .. 'J ~j + tk, ..., ~p),
or interchanging two coordinates:
q;3(~1, ..., ~j, ., ~k, , ~p) = (~1, ..., h, ..., ~j, , ~p).
24.22 LEMMA. Let (() belong to Class C' on an open set Gin Rp to Rp.
If D i8 a compact 8ubset of G whch has content zero, then q;(D) has
content zero.
PROOF. Let > 0 and enclose D in a finite number of balls {B j}
lying inside G such that the total content of the balls is less than E.
Since cp is in Class C' and D is compact, there exists a constant }'1 such
that IDq;(x) (z) I < 111 Izi for all xED and z E Rp, Therefore, if x and y
are points in the same ball Bj, then Iq;(x) - q;(y) I < Mix - yl. If the
radius of B j is r j, then the set cp (B j) is contained in a ball with radius
1IJr),. Therefore, <p(D) is contained in a finite number of balls with total
content less than Mpe.
Q.E.D.
BEC. 24 INTEGRATION IN CARTESIAN SPACES 333
subset of G which has content, then cp (D) is a compact set with content.
PROOF. Since J rp does not vanish on G, it follows from the Inversion
Theorem 21.11 that 'P is one-one on G and maps each open subset of G
into an open set. Consequently, if B is the set of boundary points of D,
then 'P(B) is the set of boundary points of I{)(D). Since B has content
zero, it follows from the preceding lemma that 'PCB) has content zero.
Hence 'P(D) has content.
Q.E.D.
(24.25)
(24.26)
(24.27)
It follows from the existence of the integral and the uniform continuity
of fa <p on D that this can be done (cf. Exercise 24. G). Moreover,
applying the Jacobian Theorem 24.25) we may assume that each K j is
so small that relation (24.25) holds.
Since any two of the sets IK j } intersect in at most a face which has
content zero, it follows from Theorem 24.15 that
j f='Lj
rp(D) j rp(K;)
f.
Because <p is one-one, there exists a unique point x,.* in K j such that
Pi = rp(x,.*), whence f(pj) = (jo<p)(x,.*). Therefore, we have
336 CH. VI INTEGRATION
(24.28) LJ))!
lies between (1 - E)P and (1 + )p times the sum
2:U o 'P)(x/)J",,(xj)A(K j ).
(24.29)
Exercises
r
JJ fg - Lk f(h)g(1Jk)A (J k) < E.
24.H. If B is the boundary of a subset D of Rp, then B contains the boundary
of D V B. Can this inclusion be proper?
24.1. Show that the boundaries of the sets D I n D z and D 1 V D 2 are contained
in B 1 V B 2 / where B i is the boundary of D j.
24.J. Is it true that the boundary of the intersection D I n D z is contained in
Bl (\ B 2?
24.K. Prove Theorem 24.16.
24.L. Show that the Mean Value Theorem 24.17 may fail if D is not connected.
24.M. Let D be a subset of Rp which has content and let f be integrable over
D with values in Rq. If D I is a compact subset of D with content, then f is in-
tegrable over D 1
24.N. A figure in Rp is the union of a finite number of non-overlapping in-
tervals in Rp. If D is a non-empty bounded subset of Rp, let D* be the collection
of all figures which contain D and let D* be the collection of all figures which are
contained in D. Define
Prove that A*(D) ~ A*(D) and that D has zero content if and only if A*(D) =
O. Also show that D has content if and only if A*(D) = A*(D) in which case the
content A CD) is equal to this common value.
24.0. In the notation of the preceding exercise, show that if DI and D2 are
disjoint subsets of Rp, then
Give examples to show that (i) equality can hold in this relation} and (ii) strict
inequality can hold. In fact, show there exist disjoint sets D1 and D 2 such that
joining x to y belong to A and that all of the partial derivatives of f of order <n
exist and are continuous on this line segment. Establish Taylor's Theorem
Tn = 1
(n - I)!
{1
}o
(1 _ t)n-lDnj(x + t(y - x) (y - x)n dt.
24.Q. Letfbe defined on a subset A of R with values in Rq. Suppose that the
line segment joining two points x, y belongs to A and thatjis in Class C' at every
point of this segment. Show that
and use this result to give another proof of the Approximation Lemma 21.4.
[Hint: if w E Rq and if F is lefined on I to R by F(t) = f(x +
t(y - x) w,
then F'(t) = Df(x + t(y - x) (y - x) w.]
24.R. Let f be a real-valued continuous function on an interval J in R2 con-
taining 8 = (0,0) as an interior point. If (x, y) is in J, let F be defined on J
to R by
Show that
the areas bounded by the coordinate axes and the curves rp, if; to obtain Young's
Inequality
~ = X,
24.X. Suppose that ({J is in Class C' on an open set G c Rp to Rp and that the
,Jacobian J", vanishes on a set E with content zero. Suppose that D is a compact
subset of G, which has content, and f is continuous on ({J(D) to It Show that
(p(D) has content and
ffD
I(x, y)dx dy ~ ff
Dp
I(r cos 8, sinO) r dr dO.
(b) Similarly, if l/; is the transformation of the (r, 8, ({J)-space into (x, y, z-)
space given by
fffD
I(x, y, z)dx dy dz ~ ff f
D.
I(r cos 8sin~, rsin 88in~,
r cos ip) r 2 sin ip dr dfJ dip.
24.Z. Show that if p = 2k is even, then the content W p of the closed unit ball
Ii x E Rp: [xl < 11 is 1I"k/k!. Show that if p = 2k - 1 is odd, then the content
Wp of the closed unit ball is
SEc.25 IMPROPER AND INFINITE INTEGRALS
Hence it follows that lim(w p) = 0; that is, the content of the unit ball in Rp
converges to zero as p - 7 CD.(Hint: use induction and the fact that
Unbounded Functions
(25.1) I, = t f.
(25.2)
l a+
b
f or by l a+
b
f(x) dx,
although it is more usual not to write the plus signs in the lower limit.
CR. VI INTEGRATION
a+1
It follows from the Fundamental Theorem 23.3 that
[1 x dx Q = 1 (1 _ Ca+l).
Jc a +1
If a satisfies -1 < a < 0, then ca +! ---4 as c ---40, andfhas an improper
integral. On the other hand, if a < -1, then c +! does not have a Q
(finite) limit as c ---40, and hence f does not have an improper integral.
The preceding discussion pertained to a function which is not defined
or not bounded at the left end point of the interval. It is obvious how
to treat analogous behavior at the right end point. Somewhat more
interesting is the case where the function is not defined or not bounded
at an interiOl~ point of the interval. Suppose that p is an interior point
of [a, b] and that f is defined at every point of [a, b] except perhaps p.
If both of the improper integrals
It is clear that if those two limits exist, then the single limit
(25.4) lim
t->O+
{!.p-t f(x) dx + (bP+t f(x) ax}
a }
also exists and has the same value. However, the existence of the limit
(25.4) does not imply the existence of (25.3). For example, if f is defined
for x E [-1, I], x ~ 0, by f(x) = l/x 3, then it is easily seen that
for all E satisfying 0 < E < 1. However, we have seen in Example 25.2 (c)
that if a = - 3, then the improper integrals
1 1
J
O- 1
~1
-Sdx,
x fa
0+
-dx
x3
do not exist.
The preceding comments show that the limit in (25.4) may exist
without the limit in (25.3) existing. We defined the improper integral
(which is sometimes called the Cauchy integral) of f to be given by
(25.3). The limit in (25.4) is also of interest and is called the Cauchy
principal value of the integral and denoted by
Infinite Integrals
(25.5)
We shall now define the" infinite integraF' of f for x > a to be the limit
of leas c increases.
25.3 DEFINITION. If f is Riemann integrable over [a, c] for each
c > a, let Ie be the partial integral given by (25.5). A real number I is
CR. VI INTEGRATION
said to be the infinite integral of f over {x: x > a} if for every E > 0,
there exists a real number M (e) such that if c > M (e) then II - Icl < e.
In this case we denote I by
(25.6) J.+OO f or i+ OO
f(x) dx.
'I
Ic =
c-1 dx = log (c) - log (a).
J. a X
Since log(c) becomes unbounded as c ~ + co, the infinite integral
of f does not exist.
(b) Let f(x) = x a for x > a > 0 and a -;e -1. Then
Ie = J.c a
XCl dx = _1_ (c a +1
a +1
- aa+1 ).
If ex > -1, then a + 1 > 0 and the infinite integral does not exist.
However, if a < - 1, then
+CO aCl+l
J. a x" dx = - a + 1.
(c) Let f(x) = e- for x
X
> O. Then
1 c
e- X dx == - (e- c - 1);
hence the infinite integral of f over (x:;1: > 0 l exists and equals 1.
It is also possible to consider the integral of a function defined on
all of R. In this case we require that f be Riemann integrable over every
interval in R and consider the limits
a
(25.7a) f~ro f(x) dx = b~~oo 1 f(x) dx,
(25.7b) i+ OO
f(x) dx = c~i~oo J.c f(x) dx.
t GEOFFREY H. HARDY (1877-1947) was professor at Cambridge and long-time
dean of British mathematics. He made frequent and deep contributions to mathe-
matical anal. ;is.
SEC. 25 IMPROPER AND INFINITE INTEGRALS
It is easily seen that if both of these limits exist for one value of a,
then they both exist for all values of a. In this case we define the infinite
integral of f over R to be the sum of these two infinite integrals:
(25.8) 1-: 00
and the equality of (25.8) and (25.9). The limit in (25.9), when it
exists, is often called the Cauchy principal value of the infinite integral
over R and is denoted by
f(x) dx.
However, the existence of the Cauchy principal value does not imply
the existence of the infinite integral (25.8). This is seen by considering
f (x) = x, whence
t, x dx ~ He' - c') ~0
for all c. Thus the Cauchy principal value of the infinite integral for
f(x) = x exists and equals 0, but the infinite integral of this function
does not exist, since neither of the infinite integrals in (25.7) exists.
We now obtain a few conditions for the existence of the infinite integral
over the set {x: x > a}. These results can also be applied to give condil-
tions for the infinite integral over R, since the latter involves considera-
tion of infinite integrals over the sets Ix: x < a} and {x: x > a}. First
we state the Cauchy Criterion.
25.5 CAUCHY CRITERION. Suppose that f is integrable over [a, c] for
all c > a. Then the infinite integral
exists if and only iffor every e > 0 there exists a K () such that if b > c ~~
K(e), then
(25.11)
CR. VI INTEGRATION
In = ia+n f.
In the important case where f(x) > 0 for all x > a, the next result
provides a useful test.
25.6 THEOREM. Suppose that f(x) > 0 for all x > a and that f is
integrable over [a, c] for all c > a. Then the infinite integral of f exists if
and only if the set {Ie: C > a} is bounded. In this case
PROOF. If a < c < b, then the hypothesis that f(x) > 0 implies that
Ie < h so Ie is a monotone increasing function of c. Therefore, the
existence of lim Ie is equivalent to the boundedness of {Ie: C > a}.
Q.E.D.
+ f,
00 1+00
1 K
g
K
exist. Since both f and g are integrable on [a, K], the statement follows.
Q.E.I>.
c > a,
+oo
Then the infinite integral
PROOF.
f.a fcp exists.
Let A be a bound for the set {IIel:c > aJ. If e > 0, let
K (E) be such that if x > K (E), then 0 < cp(x) < e/2A. If b > c > K (E),
then it follows from Bonnet's form of the Second Mean Value Theorem
23.7(c) that there exists a number ~ in [c, b] such that
f f<p = <pte) t f
In view of the estimate
f f = It - I, < 2A,
it follows that
CH. VI INTEGRATION
when b > c both exceed K(f). We can then apply the Cauchy Criterion
25.5.
Q.E.D.
\
25.10 EXAMPLES. (a) If f(x) = 1/(1 + x 2 ) and {lex) = l/x2 for
x > a > 0, then 0 < f(x) < g (x). Since we have already seen in Example
25.4 (b) that the infinite integral
+00 1
1,
1
-dx
x2
exists, it follows from the Comparison Test 25.7 that the infinite integral
[+00 _l_
dx
J1 1 +x 2
exists, whence it follows from the Comparison Test 25.7 that the
infinite integral
I = I.+ w
e-"'da;
also exists. This time, a direct evaluation of the partial integrals is not
possible, using elementary functions. However, there is an elegant artifice
that can be used to evaluate this important integral. Let Ie denote the
partial integral
We now let R e = {(x, y):O < x, 0 < y, x2 + y2 < c2 } and note that the
sector Rc is contained in the square Se and contains the square Std2.
Since f is positive, its integral over Rc lies between its integral over
Sel2 and Be. Therefore, it follows that
sup (I e )2 = sup ( f = ~,
c c JR e 4
and it follows from Theorem 25.6 that
+:O 1
(24.13)
f.
o
e-:c 2 dx = sup Ie = - 0.
c 2
(c) Let p > 0 and consider the existence of the infinite integral
+a> sin (x)
11
-----:.......:....dx.
xP
If p > 1, then the integrand is dominated by l/x p , which was seen in
Example 25.4(b) to be convergent. In this case the Comparison Test
implies that the infinite integral converges. If 0 < p < 1, this argument
fails; however, if we setf(x) = sin (x) and cp(x) = l/x p , then Dirichlet's
Test 25,9 shows that the infinite integral exists.
(d) Let f(x) = sin (x2 ) for x > 0 and consider the Fresnelt Integral
+CC
f.
o sin (x2 ) dx.
It is clear that the integral over [0, 1] exists, so we shall examine only
the integral over {X: x > 11. If we make the substitution t = rand
apply the Change of Variable Theorem 23.8, we obtain
c. _! /,c sin. .ri(t) dt.
2 .
1
(
sm (2)
x
dx - .. '
1 2 1 V t
The preceding example shows that the integral on the right converges
when c ~ + (Xl; hence it follows that the infinite integral
exists. (It should be observed that the integrand does not converge to
o as x ~ + (Xl.)
(e) Suppose that a > 1 and let rea) be defined by the integral
improper integral
[1 e-;l:x..-1 dx
Jo+
exists when 0 < a < 1. Hence we can extend the definition of the
Gamma function to be given for all a > 0 by an integral of the form of
(25.14) provided it is interpreted as a sum
+ 1+(10 e-zxc:r-l dx
~0+
(1
e-zx--1 dx
(1
j +<O
11"
sin (x)
----:.......:....dx.
X
j br sin
11"
(x)
--dx>
x
j211" + ... + jh >--+-+
11"
b{ 1 1
2 2'17" 3'17"
... 1)
+-,
k7r
1t
Suppose that for each t E J, the function <p (x, t) is monotone decreasing jor
x > a and converges to 0 as x --7 + 00 uniformly for t E J. Then the
integral
F(t) = f.+~ f(x, t)",(x, t) dx
converges uniformly on J.
PROOF. Let e > 0 and choose K(e) such that if x > K(e) and t E J,
then <p(x, t) < e/2A. If b > c > K(e), then it follows from Bonnet's
form of the Second Mean Value Theorem 23.7(c) that, for each t E J,
there exists a number ~ (t) in (e, b] such that
l b
j(x, t)fII(x, t) dx < fII(c, t)2A < E,
so the uniformity of the convergence follows from the Cauchy Criterion
25.13.
Q.E.D.
CH. VI INTEGRATION
exists I it follows from the Weierstrass M-test that the infinite integral
+ cos (tx)
CD
fa o
---d
1 + x2
x
converges uniformly for t in an interval [0, ,8] for any ,8 > O. However,
it does not converge uniformly on It E R:t > O} (See Exercise 25.K).
(c) If lex, t) = e- tx sin (x) for x > 0 and t > 'Y > 0 1 then
If(x, t) I < e- tx < e-"(X.
If we set M(x) = e-'Y x , then the Weierstrass M-test implies that the
integral
fa +m e- Ix sin (x) ax
converges uniformly for t > "Y > 0 and an elementary calculation shows
that it converges to (1 + t2)-1. (Note that if t = 0, then the integral no
longer converges.)
(d) Consider the infinite integral
+00 sin (x)
e- tx dx for t > 0,
fao x
where we interpret the integrand to be 1 for x = O. Since the integrand
is dominated by 1, it suffices to show that the integral over E < X con-
verges uniformly for t > O. The Weierstrass Al-test does not apply to
this integrand. However l if we take j(x l t) = sin (x) and q;(x, t) =
e-tx/x, then the hypotheses of Dirichletls Test are satisfied.
SEC. 25 IMPROPER AND INFINITE INTEGRALS S65
Infinite Integrals Depending on a Parameter
f F(t) dt = 1+ f 00
{ j(x, t) dt}dx'
(25.20) t{1+ 00
1+
f(x, t) dX} dt =
00
25.19 THEOREM. Suppose that I and its partial derivative ft are con-
tinuous in (x, t) for x > a and t in J ...., [a, 13]. Suppose that (25.19)
exists for all t E J and that
G (t) = i +m ft(x, t) dx
-d
dt
i+
a
co
I(x, t) dx = i+
a
co
af
- (x, t) dx.
at
PROOF. If F n is defined for t E J to be
t
=
0
~
+(1) e- tz dx f.
and that the convergence is uniform for t > to > O. If we integrate both
sides of this relation with respect to t over an interval [a,,6] where
o < a < ,6, and use Theorem 25.18, we obtain the formula
log (8/a) = t~ dt = f. +m { t e- tx dt} dx
e- az e-!J x
f.
+co -
= dx.
o x
-1 =
2
J+co xe-tx dx.
t 0
8EC.25 IMPROPER AND INFINITE INTEGRAL8 35"1
n'n =
-'
t +!
l+oo xne-
0
tx dx.
In order to evaluate the constant C, we use the fact that F(O) = 0 and
Arc tan (0) = 0 and infer that C = O. Hence, if t > 0 and u > 0, then
+a> sin (ux)
Arc tan (ult) =
l o
e- t 7;
x
dx.
(e) Now hold u > 0 fixed in the last formula and observe, as in
Example 25.16(d) that the integral converges uniformly for t > 0 so
that the limit is continuous for t > O. Letting t ~ 0+, we obtain the
important formula
PROOF. It follows from the Comparison Test 25.7 that the infinite
integrals
+
l
CO
i, n E N,
a
Since f(x) = lim (fn(x) for all x E [a, K] it follows from the Bounded
Convergence Theorem 22.14 that
(K i = lim (K f n'
Ja n Ja
Therefore, we have
an integral over {x:x > al if and only if the 8et { f. +rofn : n E N }i8
bounded. In this case
ro
f.+ro f = s~p { t fn} = li~ too fn.
f f = Ii~ f fn = s~p {f f n}
[f<8.
By Theorem 25.6 the infinite integral of i exists and, since
1a
+0> i = sup
C
I. f
a
c
= sup {sup
c n
I.ein}
a
= sup {sup
71 can
I. cfn} = sup I.+o:l in,
a
(25.23) f.+oo If. +00 f(x, t) dx} dt = f.+oo {f.+oo f(x, t) dt} dx,
under suitable hypotheses. It turns out that a simple condition can be
given which will also imply absolute convergence of the integrals. How-
ever, in order to treat iterated infinite integrals which are not necessarily
absolutely convergent, a more complicated set of conditions is required.
25.23 THEOREM. Suppose that f is a non-negative junction defined
for (x, t) satisjying x > U, t > a. Suppose that
I(x, t) dt} dx
(25.25) I.' If. +00 f(x, t) dX} dt = f. +00 {I.' f(x, t) dt} dx
jar each (j > a. Then, if one of the iterated integrals in equation (25.2:~)
exists, the other also exists and they are equal.
PBOOF. Suppose that the integral on the left side of (25.23) exists.
Since f is non-negative,
f. f.
+w { +00 lex, t) dt dx I < J. f. +w { +< I(x, t) dX} dt.
362 CH. VI INTEGRATION
1a f(x, t) dx
All of these results deal with the case that the iterated integrals are
absolutely convergent. We now present a result which treats the C3,se
of non-absolute convergence.
25.26 THEOREM. Suppose that the real-valued function f is continuous
in (x, t) for x > a and t > ex and that the infinite integrals
+00 +00
(25.27) a f(x, t) dx, a f(x, t) dt
/. /.
converge uniformly for t > a and x > a, respectively. In addition, let F
be defined by
= f {f fix, t) dx}dt.
From Theorem 25.18 and the uniform convergence of the second integral
in (25.27), we infer that
lim
~->+oo
f.A F(x, /3) dx /.A Jl /'+00 I(x, t) dt}dX.
a
=
a a
Hence there exists a number B > Ct such that if /32 > /31 > B, then
By combining (25.29) and (25.31), it is seen that if {32 > {31 > B, then
whence it follows that the limit of i +'" F (x, /3) dx exists as fJ ---t + (Xl.
J. +00 { ~ +00 e-(x+t) sin (xt) dX} dt = ~ +00 {~+OO e-{x+tj sin (xt) dt} dx.
for x > a > 0 and y > O. If we put M(x) = xe- z2 and N(y) = e- a:!l1\
then we can invert the order of integration over a < x and 0 < y.
Since we have
-----
2(1 + y2)
it follows that
It follows that
l o
+00 e-rlli
- - dy = 2e a 1
1+y2
2 1 a
+"0
e- dx.
X
2
7r
- =
2
l +m
0
dy
1 + y2
= lim
l+co
a--+O 0
e-
1 + y2
a2y2
dy = 212
If X > a > 0 and y > a > 0, we can argue as in Example (b) to show
that
+CO e-GU cos (a) f.+OO ye- Gu sin (a)
f.a 1 + y2 dy + a 1 + y2 dy
We want to take the limit as a ~ O. In the last integral this can evi-
dently be done, and we obtain
+co e- az sin
I.
(x)
---~dx.
o x
In view of the fact that e- GU cos (a) is dominated by 1 for y > 0, and the
integral
+CO 1
f.a 1 + y2 dy
exists, we can use the Dominated Convergence Theorem 25.21 to con-
clude that
. f.
hm
a-.O a
+(X) e- au cos (a) _ f.+OO dy
1+y
2 dy -
a 1+y
2
The second integral is a bit more troublesome as the same type of esti-
mate shows that
ye- Gu sin (a) y
< ,
1 + y2 - 1 + y2
and the dominant function is not integrable; hence we must do better.
Since u < e" and Isin (u)1 < u for u > 0, we infer that Ie-au sin (a) I <
l/y, whence we obtain the sharper estimate
ye- au sin (a) 1
I + y2
< --
- 1 + y2
We can now employ the Dominated Convergence Theorem to take the
limit under the integral sign, to obtain
hm
. f. +co
a~O a
ye- all sin (a)
1 + y2 dy = O.
We now want to take the limit as a ---t O. This time we cannot use the
(+cn
Dominated Convergence Theorem, since Jo x- 1
sin (x) dx is not
absolutely convergent. Although the convergence of e-ax to 1 as a _.~ 0
is monotone, the fact that sin (x) takes both signs implies that the con-
vergence of the entire integrand is not monotone. Fortunately, we have
already seen in Example 25.16(d) that the convergence of the integ.ral
is uniform for a > O. According to Theorem 25.17, the integral is con-
tinuous for a > 0 and hence we once more obtain the formula
Exercises
proper integral (b If Iexists. Shows that the improper integral (b j exists, but
Ja+ Ja+
that the canverse may not be true.
25.C. Suppose that! and g are integrable on [c, b] for all c > a. If I!(x) I < g(x)
for x E J = [a, b] and if g has an improper integral on J, then so doesj.
25.D. Discuss the convergence or the divergence of the following improper
integrals:
(a) {1 dx , (b) (1 dx ,
Jo (x +x 2) Yli Jo (x -x Y2 2)
X dx
(llog (x) dx,
1.
1
(c) (d)
o (1 - x3 ) ,
Jo vx
(e) (1 log (x) dx, (f) (1 x dx
Jo 1- x 2
Jo (1-xa) ~
25.E. Determine the values of p and q for which the following integrals
converge:
(a) /.1 xp(1 - x)q dx, (b) J(1r/2 xp[sin (x)]q dx,
0
(a) f+~
1 x(1
<b:
+ 0)'
(b) f +~ ++
1
x
x2
2 dx,
1
(c) f +"'"
1
sin(l/x) dx,
x
(d) f +~ co.j:) dx,
(e) I.+ co
o
x sin(x) dx,
1 +x 2
(I) I.
o
+~ sin (x) sin (2x)
X
dx.
25.G. For what values of p and q are the following integrals convergent? For
what values are they absolutely convergent?
25.H. If f is integrable on any interval [0, c] for c > 0, show that the infinite
+CXl {+co
f exists if and only if the infinite integral l5 f exists.
integral
fo 0
+ CO
25.1. Give an example where the infinite integral ! exists but where!
(+ro
25.J. Iff is monotone and the infinite integral Jo ! exists, then xf(x)~O
as x ~ + co.
+cn
25.K. Show that the integral
I.
0
interval [0, ,8] but that it does not converge uniformly for t
x1e-:r: dx converges uniformly for t in an
> 0.
25.L. Show that the integral
(+rn sin(tx) dx
Jo x
is uniformly convergent for t > 1, but that it is not absolutely convergent for
any of these values of t.
25.M. For what values of t do the following infinite integrals converge uni-
formly?
+0> d
(a) x ,
10_0 x" + ~2
SEC. 25 IMPROPER AND INFINITE INTEGRALS $69
(+aJ
(c)o e-Xcos(tx) dx,
(+o::>
F(t) =)0 e- x2
cos (tx) dx.
Differentiate with respect to t and integrate by parts to prove that F' (t) ::::
(-1/2)t F(t). Then find F(t) and, after a change of variable, establish the
formula
c> o.
Differentiate and change variables to show that G'(t) = - 2G(t). Then find
G(t) and, after a change of variables, establish the formula
370 cIt. VI INTEGRATION
I.
o
+
<X> cos(ty) d _ 11' -Itl
1+y2
Y - -e .
2
25.X. By considering the iterated integrals of e-(lI+I/)" sin(y) over the quadmnt
x 2 0, Y 2 0, establish the formula
+CO -e-ll"
-dx- _f.+CD sin(y)
--dy, a> O.
f.o 1 +x 2
0 a+y
Projects
25.a. This project treats the Gamma function, which was introduced in
Example 25.10(e). Recall that r is defined for x in P = {x E R:x > O} by the
integral
If x 2 1 and y ;;:: 1, this integral is proper, but if 0 < x < 1 or 0 < y < 1, the
integral is improper.
(a) Establish the convergence of the integral for x, y in P.
372 CR. VI INTEGRATION
and
fa0+
+0::> UZ-l
duo
R(x, y) =
fa0+ (1 + u)z+y
(d) By integrating the non-negative function
f(t, u) = e- t2- u2
t 2x - 1 U 2 y-l
+
over {(t, u) : t2 u2 = R2, t > 0, u ~ O} and comparing this integral with the
integral over inscribed and circumscribed squares (as in Example 25.10(b),
derive the important formula
B(x,y) = r(x)f(y).
rex + y)
(e) Establish the integration formulas
f.
o
7r/2 ( .
smx
)2n d _ y:;;:f(n +!) _ 135 (2n - 1) 11"
x-
2f(n + 1)
-
246 . (2n) 2
-,
7r/2 -
f.
o
(sinx)2n+ldx = YllT(n + 1) =
2 fen +!)
246 . (2n)
1357 (2n + 1)
25.")'. This and the next project present a few of the properties of the Laplacet
transform, which is important both for theoretical and applied mathematics. To
simplify the discussion, we shall restrict our attention to continuous functions j
defined on {t E R: t > O} to R. The Laplace transform of j is the function 1
defined at the real number 8 by the formula
(+ro
1(8) = Jo e- 8t
jet) dt,
]'(8) = J(+co
0 e- 8t
(-t)f(t) dt.
~(s) ~ ~J(D'
Similarly, if h (t) ~ ~ f G), then h converges for s > sola and
h(s) = 1 (as).
J
(f) Suppose that the Laplace transform of f exists for s > So and let f be
defined for t < 0 to be equal to O. If b > 0 and if get) = J(t - b), then 0 con-
verges for s > So and
g(s) = e- bf J(s).
Similarly, if k(t) = ebIJ(t) for any real b, then h converges for s > 80 + band
h(s) = J(s - b).
25.0. This project continues the preceding one and makes use of its results.
(a) Establish the following short table of Laplace transforms.
jet) J(s) Interval of Convergence
1 l/s s> 0,
n!jsn+l s> 0,
(s - a)-l s> a,
n!/(s - a)n+l s> a,
a
sin at all s,
S2 + a2
s
cos at all s,
S2 +a 2
a
sinh at
S2 - a2
8> a,
s
cosh at s> a,
S2 - a2
sin t
t
Arc tan (l/s) 8> 0.
374 eH. VI INTEGRATION
(b) Suppose thatf and!, are continuous for t > 0, that! converges for S > So
and that e-a1f(t) -+ 0 as t -+ + co for all s > So. Then the Laplace transform of
f' exists for s > So and
A ,..
f' (s) = sf(s) - j(O).
(Hint: integrate by parts.)
(c) Suppose that!, j' and f" are continuous for t > 0 and that! converges for
s > So. In addition, suppose that e-8If(t) and e- 81 j'(t) approach 0 as t -+ co +
for all 8 > So. Then the Laplace transform of j" exists for 8 > 80 and
"'" = s2j(8)
1"(s) "-
- sj(O) - 1'(0).
(d) When all or part of an integrand is seen to be a Laplace transform, the
integral can sometimes be evaluated by changing the order of integration. Use
this method to evaluate the integral
+<:0 sin 8
1 o
-
s
ds = - '
11"
2
(e) It is desired to solve the differential equation
y'(t) + 2y(t) = 3 sin t, yeO) = 1.
Assume that this equation has a solution y such that the Laplace transforms of
y and y' exists for sufficiently large s. In this case the transform of y must satisfy
the equation
sY(s) - yeO) + 2y(s) = 4/(8 - 1), s > 1,
from which it follows that
,. s+3
yes) = (8 + 2)(8 - 1)
Use partial fractions and the table in (a) to obtain yet) = (-!)~I - (t)e- 2t ,
which can be directly verified to be a solution.
(f) Find the solution of the equation
y" + y' = 0, yeO) = u, y'(O) = b,
by using the Laplace transform.
(g) Show that a linear homogeneous differential equation with constant co-
efficients can be solved by using the Laplace transform and the technique of
decomposing a rational function into partial fractions.
VII
Infinite Series
375
876 CR. VIl INFINITE SERIES
(26.2) LX
n =1
n
LX
n=5
n,
PROOF. This result follows directly from Theorem 11.14 and Defi-
nition 26.1.
Q.E.D.
It might be expected that if the sequences X = (Xl') and Y = (Yn)
generate convergent series, then the sequence X Y = (x"' Yn) also
generates a convergent series. That this is not always true may be seen
by taking X = Y = ( -l)ll/Vn) in R.
We now present a very simple necessary condition for convergence of
a series. It is far from sufficient, however.
26.3 LEMMA. If L (x,,) converges in Rp, then lim (x,,) = o.
PROOF. By definition, the convergence of L (x,.) means that lim (~~k)
exists. But, since
Q.E.D.
The next result, although limited in scope, is of great importance.
26.4 THEOREM. Let (x,,) be a sequence of non-negative real numbers.
Then L (x n ) converges if and only if the sequence S = (Sk) of partial sums
is bounded. In this case,
L: Xn = lim (Sk) = sup {sd.
. PROOF.
. Since x" > 0, the sequence of partial sums IS monotone
mcreasmg:
81
-< S2 < ... <
- - Sk <
- ...
According to the Monotone Convergence Theorem 12.1, the sequence S
converges if and only if it is bounded.
Q.E.D.
Since the following Cauchy Criterion is precisely a reformulation of
Theorem 12.10, we shall omit its proof.
318 CR. VII INFINITE SERIES
(26.5)
If lal < 1, then la +11~ 0 so the Cauchy Criterion implies that the
ll
8k = 1 +-1 +-1 + -1 =
- 8kt
1 +-1 >
+-3 + (1) 2
2 - = 1 +- .
8k
21234 4 4 2 1
8k
r
> Sk + 2r-l
r-l (1)
-
2r = Sl.
"T-l
1
+ -2 = 1 + -2r
Therefore, the subsequence (Skr) is not bounded and the harmonic series
does not converge.
(c) We now treat the p-series L (1/n P ) where 0 < p < 1 and use
the elementary inequality n P < n, for n E N. From this it follows that,
when 0 < p < 1, then
1 1
- <-, n EN.
n - nP
Since the partial sums of the harmonic series are not bounded, this in-
equality shows that the partial sums of L (l/n p ) are not bOWlded for
o < p < 1. Hence the series diverges for these values of p.
(d) Consider the p-series for p > 1. Since the partial sums are mono-
tone, it is sufficient to show that some subsequence remains bounded in
order to establish the convergence of the series. If k1 = 21 - 1 =: 1,
then 8kI = 1. If k2 = 22 - 1 = 3, we have
STet = -
11 + (1- +-2) < + -2
2p 3p
1
2p
= 1
1
+ -2P-l ,
and if k a = 23 - 1, we have
1 1 1 1) 4 1 1
8ka = 8Tet + ( 4P + 5 + 6P + 7 < 8Tet + 4 < 1 + 2P-1 + 4P-1 '
p p p
380 I CH. VII INFINITE SERIES
Rearrangements of Series
Choose a partial sum tT of L (Ym) such that ly - tTl < e and such that
each Xl, X2, , X n occurs in tT After having done this, choose m > n so
large that every Yk appearing in t also appears in Sm' Therefore,
T
m
Ix - yl < Ix - Sm\ + ISm - tTl + [t T - Y\ <E+ L IXkl + E < :3e.
n+l
Double Series
Sometimes it is necessary to consider infinite sums depending on two
integral indices. The theory of such double series is developed by reduc-
ing them to double sequences; thus all of the results in Section 14 dealing
with double sequences can be interpreted for double series. However, we
shall not draw from the results of Section 14; instead, we shall restrict
our attention to absolutely convergent double series, since those are the
type of double series that arise most often.
382 CH. VII Il\"FINITE SERIES
(26.6)
(26.7) LL
;"=1 i=1
Xij, L L:
i=lj=l
Xii
also converge to X.
PROOF. By hypothesis there exists a positive real number A which
is an upper bound for the set in (26.6). If n is fixed, we observe that
m n m
L
i=1
L L
\xinl < i=l i=l
IXiil < A,
for each m in N. It thus follows that, for each n E N, the single series
r::l (Xin) is absolutely convergent to an element y~ in Rp.
SEc.26 CONVERGENCE OF INFINITE SERlES SS3
we infer that
ro ro ro
when n > 111 (c:). This proves that the first iterated sum in (26.7) exists
and equals x. An analogous proof applies to the second iterated sum.
IQ.E.D.
There is one additional method of summing double series that we
shall consider, namely along the diagonals i + j = n.
26.13 THEOREM. Suppose that the double series L (Xij) converges
absolutely to x in Rp. If We define
tk = 2:::
i+j=k
Xii = Xl, k-l + Xz, k-z + ... + Xk-l, 1,
then the series L (tk) converges absolutely to x.
PROOF. Let A be the supremum of the set in (26.6). We observe that
If m, n > M, then it follows that ISmn - sMMI is no greater than the sum
L: (IXiil) extended over all pairs (1, j) satisfying either M < i < m or
M < j < n. Hence lSmn - sMMI < E, when m, n > M. It follows from
this that
i
Cauchy Multiplication
In the process of multiplying two power series and collecting the
terms according to the powers, there arises very naturally a new method
of generating a series from two given ones. In this connection it is nota-
tionally useful to have the terms of the series indexed by 0, 1, 2, ....
26.14 DEFINITION. If Li~O (Yi) and LjC:O (Zj) are infinite series
in Rp, their Cauchy product is the series L;=o (x/.;), where
XI.; = Yo' Zk + Yl' Zk-l + ... + Yk' zoo
Here the dot denotes the inner product in R p. In like manner we can
define the Cauchy product of a series in R and a series in Rp.
It is perhaps a bit surprising that the Cauchy product of two con-
vergent series may fail to converge. However, it is seen that the series
f (-1)"
n=O vn + 1
is convergent, but the nth term of the Cauchy product of this series with
itself is
L
f=O
Zj
:E :E Xij,
;=0 i =0
:E ).
k =0 i-t.i'=k
Xij
Exercises
26.A. Let L (a..) be a given series and let L (b n ) be one in which the terms
are the same as those in L (an), except those for which an = 0 have been
omitted. Show that L (an) converges to a number A if and only if L (b n )
converges to A.
26.B. Show that the convergence of a series is not affected by changinl~ a
finite number of its terms. (Of course, the sum may well be changed.)
26.C. Show that grouping the terms of a convergent series by introducing
parentheses containing a finite number terms does not destroy the convergence
or the value of the limit. However, grouping tern!s in a divergent series ean
produce convergence.
26.D. Show that if a convergent series of real numbers contains only a finite
number of negative terms, then it is absolutely convergent.
26.E. Show that if a series of real numbers is conditionally convergent, then
the series of positive terms is divergent and the series of negative term8 is
divergent.
26.F. By using partial fractions, show that
<0 1 1
(a) L = - if a > 0,
n =0 (a + n)(a + n + 1) a
en 1 1
(b) El n(n + l)(n + 2) = 2".
t FRANZ (C. J.) MERTENS (1~40-1927) studied at Berlin and taught at Cracow:!tnd
Vienna. He contributed primarily to geometry, number theory, and algebra.
386 CR. VII INFINITE SERIES
converges. This result is often called the Cauchy Condensation Test. (Hint:
group the terms into blocks as in Examples 26.8(b, d).)
26.1. Use the Cauchy Condensation Test to discuss the convergence of the
p-series L (l/n p ).
26.M. Use the Cauchy Condensation Test to show that the series
L 1 'L 1 ,
n log n n(log n) (log log n)
L 1
n (log n) (log log n) (log log log n)
are divergent.
26.N. Show that if C > 1, the series
L 1 , L 1
n(log n)C n(log n) (log log n)C
are convergent.
26.0. Suppose that (an) is a monotone decreasing sequence of positive num-
bers. Show that if the series L (an) converges, then lim (nan) = O. Is the con-
verse true?
26.P. If lim (an) = 0, then L (an) and L (an + 2an+l) are both convergent
or both divergent.
26.Q. Let L (a mll ) be the double series given by
amn = +1, if m - n = 1,
= -1, if m - n = -1,
= OJ otherwise.
SEC. 27 TESTS FOR CONVERGENCE 887
Show that both iterated sums exist, but are unequal, and the double sum does
not exist. However, if (8 m ,,) denote the partial sums, then lim(s",,) exists.
26.R. Show that if the double and the iterated series of L (amn) exist, then
they are all equal. Show that the existence of the double series does not imply
the existence of the iterated series; in fact the existence of the double series does
not even imply that lim(amn ) = 0 for each m.
n
26.S. Show that if p > 1 and q > 1, then the double series
(Hint: write C1 + ... + Cn = A 1B n + ... + A"B1 ; break this sum into three
parts; and use the fact that An ~ A and En ~ B.)
We shall now give some results which can be used to establish the
convergence or divergence of infinite series. In view of its importance,
we shall pay special attention to absolute convergence. Since the absolute
convergence of the series L: (x n ) in Rp is equivalent with the convergence
of the series L (Ixnl) of non-negative elements of R, it is clear that
results establishing the convergence of non-negative real series have
particular interest.
Our first test shows that if the terms of a non-negative real series are
dominated by the corresponding terms of a convergent series, then the
first series is convergent. It yields a test for absolute convergence that
the reader should formulate.
27.1 COMPARISON TEST. Let X = (x n) and Y = (Yn) be non-negative
real sequences and suppose that for some natural number K,
(27.1) X n < Yn for n > K.
Then the convergence of L (Yn) implies the convergence of L (x n).
PROOF. If m > n > sup {K, M(e)}, then
(b) If there exists a number r > 1 and a natural number K s11.Ch thait
(27.4) for n >K,
then the series L(x,,) is divergent.
PROOF. (a) If (27.3) holds, then we have Now for Ix"l < r".
o < r < 1, the series L(r n )
is convergent, as was seen in Example
26.8(a). Hence it follows from the Comparison Test that E(x,,) is
absolutely convergent.
(b) If (24.4) holds, then Ix,,1 > r n However, since r > 1, it is false
that lim (Ixnl) = o.
Q.E:.D.
In addition to establishing the convergence of L (x,,), the root test
can be used to obtain an estimate of the rapidity of convergence. This
estimate is useful in numerical computations and in some theoretical
estimates as well.
27.4 COROLLARY. If r satisfies 0 < T < 1 and if the sequence X =
(x n ) satisfies (27.3), then the partial sums Sn, n > K, approximate the S'u,m
s = L (x n ) according to the estimate
r n +1
(27.5) Is - snl < 1 -T for n > K.
PROOF. If m >n> K, we have
ISm - s,,1 = IXn+l + '" + xml < IX~ll + ... + Ixml
r n+!
< Tn+! + ... + r < m .
1- r
Now take the limit with respect to m to obtain (27.5).
Q.E.D.
It is often convenient to make use of the following variant of the
root test.
27.5 COROLLARY. Let X = (x n ) be a sequence in Rp and set
(27.6)
(27.7)
PROOF. The relation (27.7) implies that IXn+kl < r k Ixnl when n > K.
Therefore, if m > n > K, we have
ISm - snl = IXn+! + ... + xml < Ixn+d + ... + Ixml
r
< (r + r2 + ... + r m-n) Ixnl < 1 - r IXnl
Again we take the limit with respect to m to obtain (27.9).
Q.E.D.
27.8 COROLLARY. Let X = (x n ) be a sequence in Rp and set
IX +ll < r1
J;:fn
for n > K.
In this case Theorem 27.6 establishes the absolute convergence of the
series. If l' > I, and if 1'2 satisfies 1 < r2 < 1', then there is a natural
number K such that
IX n+ll >
~ 1'2 for n > K,
(b) If there is a real number a < 1 and a natural number K such that
from which it follows that the sequence (k IXk+ll) is decreasing for k > K.
On adding the relation (27.13) for k = K, ... , n and noting that the
left side telescopes, we find that
This shows that the partial sums of L ([xnJ) are bounded and establishes
the absolute convergence of L (x n ).
(b) If the relation (27.12) holds for n > K then, since a < 1,
PROOF. Let m > n > K and add the inequalities obtained from
(27.13) for k = n + 1, ... , m to obtain
Hence we have
(27.15)
(27.16)
J+0)
n+1
J(t) dt < s- Sn < 1 n
+0) f(t) dt.
n+l
jet) dt < 8m - Sn <
n
-1 <_.
1
n - nP
After using the Comparison Test 27.1, we conclude that the p-series
L (l/n p ) diverges for p < 1.
(b) Now consider the case p = 2; that is, the series L (1In 2 ). We
compare the series with the convergent series L: (n(n ~ 1) of Exam-
ple 26.8(e). Since the relation
1 1
- - - < -2
n(n + 1) n
SEC. 27 TESTS FOR CONVERGENCE
holds and the terms on th~ left form a convergent series, we cannot apply
the Comparison Theorem directly. However, we could apply this
1 1 n2 n
n(n + 1) + n 2 = n(n + 1) - n +1
Since the limit of this quotient is 1and L (n(n ~ 1)) converges, then
so does the series L: (1/n 2 ).
(c) Now consider the case p > 2. If we note that n P > n2 for p > 2,
then
1 1
-<-
n P - n2 '
a direct application of the Comparison Test assures that L (l/n p ) con-
verges for p > 2. Alternatively, we could apply the Limit Comparison
Test and note that
1 1 1
-+-=-::=-.
nP n2 nP nP-2
Now it is known (see Exercise l1.P) that the sequence (n1/n) converges
to 1. Hence we have
Inn (Cpr) = 1,
80 that the Root Test (in the form of Corollary 27.5) does not apply.
8.96 CR. VII INFINITE SERIES
and since the sequence (1 + I/n)p) converges to 1, the Ratio Test (in
the form of Corollary 27.8) does not apply.
(e) In desperation, we apply Raabe's Test to the p-series for integral
values of p. First, we attempt to use Corollary 27.11. Observe that
n (1 - (1 -
n +1
1 )P) = n (1 _ 1 +
n
P
+1
_ pep - 1)
2 (n + 1)2
+ ...) .
If we take the limit with respect to n, we obtain p. Hence this corollary
to Raabe's Test shows that the series converges for integral values of
p > 2 (and, if the Binomial Theorem is known for non-integral values
of p, this could be improved). The case p = 1 is not settled by Corollary
27.11, but it can be treated by Theorem 27.9. In fact,
1 1 1 1
---;--= >1--
n +1 n n + lin - n'
and so Raabe's Test shows that we have divergence for p = 1,
(f) Finally, we apply the Integral Test to the p-series. Let Jet) == t- P
and recall that
f. ~
1
n
t
dt = log (n) - log (1),
~
f.
1
-.!
n
tP
dt = 1 (n 1-
1- P
p -
> 1 and
diverges if p < 1.
Conditional Convergence
The tests given in Theorems 27.1-27.12 all have the character that
they guarantee that, if certain hypotheses are fulfilled, then the series
SEc.27 TESTS FOR CONVERGENCE 39?'
that convergence may take place even though absolute convergence fails.
It is desired, therefore, to have a test which yields information about
ordinary convergence. There are many such tests which apply to special
types of series. Perhaps the ones with most general applicability are
those due to Abelt and Dirichlet.
To establish these tests, we need a lemma which is sometimes called
the partial summation formula, since it corresponds to the familiar in-
tegration by parts formula. In most applications, the sequences X and
Yare both sequences in R, but the results hold when X and Yare
sequences in Rp and the inner product is used or when one of X and Y
is a real sequence and the other is in R p.
27.14 ABEL'S LEMMA. Let X = (x n) and Y = (Yn) be sequences and
let the partial sums of L (Y-r,,) be denoted by (Sk). If m > n, then
m m
(27.18) L
j=n
XJYj = (X m+1Sn - XnS'~-l) +L
j"n
(Xj - Xj+1)Sj.
(27.19)
t NIELS HENRIK ABEL (1802-1829) was the son of a. poor Norwegian minister. When
only twenty-two he proved the impossibility of solving the general quintic equation
by radicals. This self-taught genius also did outstanding work on series and elliptie
functions before his early death of tubereulosis.
998 CR. VII INFINITE SERIES
PROOF. (a) Suppose that ISjl < B for all j. Using (27.18), we have
the estimate
m m
(27.20) L
;=11.
XiYi L
< {Ixm+ll + Ixnl + j=n IXi - xj+11 }B.
If lim (x n ) = 0, the first two terms on the right side can be made arbi-
trarily small by taking m and n sufficiently large. Also if the series (27.19)
converges, then the Cauchy Criterion assures that the final term on this
side can be made less than E by taking m > n > M(E). Hence the Cauchy
Criterion implies that the series L (XnYn) is convergent.
(b) If Xl > X2 > .. " then the series in (27.19) is telescoping and
convergent.
Q.E.D.
i=l
L X Yi
I: XiYi - i=1 1 < 2Ixn+lIB,
where B is an upper bound for the partial Bums of L (y j).
PROOF. This is readily obtained from relation (27.20).
Q.E.D.
The next test strengthens the hypothesis on L (Yn), but it relaxes the
one on the real sequence (x n).
L
i=n
(Xj - Xj+l)Sj < Ix,. - xm+lIB.
By using these two estimates and Abel's Lemma, we conclude that the
series L (XnYn) is convergent in Rp.
Q.E.D.
SEC. 27 TESTS FOR CONVERGEN CE 399
If we use the same type of argument, we can establish the following
error estimate.
j==l
L: XJYi < \xlls .- Snl + 2E\x -
L: X;Yi - j==l Xn+ll
(27.21)
for the rapidity of convergence.
PROOF. This follows immediately from Dirichlet's Test 27.15(b) if
we take Yn = (-1 )n, but the error estimate given in Corollary 27.16 is
not as sharp as (27.21). We can also proceed directly and show by
m9.thematical induction that if m > n, then
ISm - snl = IZn+l - Zn+2 + ... -+- (-1)m-n- 1zml < IZn+ll.
This yields both the convergence and the estimate (27.21).
Q.E.D.
(c) Let X be any real number which is different from 27l"k, where k is
a positive or negative integer. Then, since
2 cos (kx) sin (x/2) = sin (k - !)x - sin (k + !)x,
it follows that
2 sin (x/2) [cos (x) + .'. + cos (nx)] = sin (!)x - sin (n + ~)x,
so that
cos x + ... + cos nx = sin (!)x - sin (n + !)x)
2 sin (x/2)
Therefore, we have the bound
1
leas x + ... + cos nxl < [sin (x/2) I
for the partial sums of the series L (cos nx). Dirichlet's Test shows that
even though the series L (cos nx) does not converge, the series
L cos nx
n
does converge for x r!= 2br, k E Z.
(d) Let x ~ 2k7l", k E Z. Since
2 sin (kx) sin (!)x = cos (k - !)x - cos (k + !)x,
it follows that
2 sin (t)x [sin x + ". + sin nxl = cos (t)x - cos (n + t)x.
Therefore, we have the bound
Exercises
27.A. Suppose that L: (an) is a convergent series of real numbers. Either
prove that L: (b1J converges or give a counter-example, when we define bn by
(a) an/n, (b) va../n (an 0), >
(c) an sin n, (d) v'an/n (an 0), >
(e) nl/nan, (f) a,j(l lanD. +
27.B. Establish the convergence or the divergence of the series whose nth
term is given by
!+!+.!.+-!.+ ...
12 23 32 43
is convergent, but that both the Ratio and the Root Tests fail.
27.G. If a and b are positive numbers, then
1
L: (an +- b)p
converges if p > 1 and diverges if p < 1.
27.H. If p and q are positive numbers, then
L: (-1)n (logn)p
nq
is a convergent series.
27.1. Discuss the series whose nth term is
nn
(b) - - -
(n + 1)n+1'
(d)
.
+ 1)n .
en nn+l
402 CH. VII INFINITE SERIES
(a) n.
, , (b) (n!)2
357 . . (2n + 1) (2n)! '
24 . . (2n) 24 .. (2n)
(c) (d)
35 . '(2n +
1) 57 . (2n + 3)
27.K. The series given by
(-I)P+(1.3)P
2
- + (1.3.5)P
- - + ...
24 246
r , sup
= 11m (Ixn+ll)
--r;:r .
Show that if r < 1, thep the series L (x n ) is absolutely convergent and if r > 1,
then the series is divergent.
27.N. Let X = (x n ) be a sequence of non~zero elements in Rp and let a be
given by
.
a = hm sup n 1 - (( --r;:r
Ixn+tI)) .
If a > 1 the series L (x n ) is absolutely convergent, and if a < 1 the series is not
absolutely convergent.
27.0. If p, q are positive, then the serirs
+ 1) .. . (p + 2)(p + n)
L (p
(q + 1)' . .(q + 2)(q + n)
Project
27.0:. Although infinite products do n01G occur as frequently as infinite series,
they are of importance in many investigations and applications. For simplicity,
we shall restrict attention here to infinit.e products with positive terms an' If
CR. VII INFINITE SERIES
II an or ala2aa'" an' ..
n=1
+
(c) Infinite products often have terms of the form an = 1 Un' In keeping
with our standing restriction, we suppose Un > - 1 for all n E N. If Un ~ 0,
show that a necessary and sufficient condition for the convergence of the infinite
product is the convergence of the infinite series
L Un.
n=1
L
n=1
Un
II (1 + un)
n =1
is convergent.
SEC. 28 SERIES OF FUNCTIONS 405
(e) Suppose that Un > - 1 and that the series L (Un) is convergent. Then a
necessary and sufficient condition for the convergence of the infinite product
II (1 + Un) is the convergence of the infinite series
L U:,,2.
n=1
(Hint: use Taylor's Theorem and show that there exist positive constants A
and B such that if lui < !, then Au2 < u - log (1 + u) < Bu2.)
SI(X) = flex),
82 (x) = 81 (x) + /2 (x) [= Ii (~~) + 12 (x) ],
t
or Lin
n=l
The assertion follows from the Cauchy Criteria 26.5 and 28.6 and the
convergence of :L (M n).
Q.E.D.
The next two results are very useful in establishing uniform conver-
gence, even when the convergence is not absolute. Their proofs are
obtained by modifying the proofs of ~~7.15 and 27.16 and will be left as
exerCIses.
n EN,
(28.5)
i: sin (nx)
n =1 n
However, it follows from the discussion in Example 26.21 (d) that if the
interval J = [a, b] is contained in the open interval (0, 211"), then the
partial sums Sn(x) = Lk=l (sin (kx) are uniformly bounded on J.
Since the sequence (lin) decreases to zero, Dirichlet's Test 28.8 implies
that the series (28.5) is uniformly convergent on J.
norm of the nth term on I is lin, we cannot apply the Weierstrass Test.
Dirichlet's Test can be applied if we can show that the partial sums of
L ( -l)ne-lIx ) are bounded. Alternatively, Abel's Test applies since
L (-I)nln) is convergent and the bounded sequence (e- nx ) is mono-
tone decreasing on I (but not uniformly convergent to zero).
Power Series
We shall now turn to a discussion of power series. This is an important
class of series of functions and enjoys properties that are not valid for
general series of functions.
28.11 DEFINITION. A series of real functions L Un) is said to be a
power serie8 around x = c if the function in has the form
For the sake of simplicity of our notation, we shall treat only the case
where c = O. This is no loss of generality, however, since the translation
x' = x - c reduces a power series around c to a power series around O.
Thus whenever we refer to a power series, we shall mean a series of the
form
<Xl
(28.6) L
n=O
anx n = ao + alX + ... + anx + n
Even though the functions appearing in (28.6) are defined over all of
R, it is not to be expected that the series (28.6) will converge for all ~r, in
R. For example, by using the Ratio Test 26.8, we can show that the
.
serIes
R = 0, if p= +co,
= l/p, if O<p<+oo,
= + co, if p = O.
The interval of convergence is the open interval (-R, R).
We shall now justify the term "radius of convergence."
CH. VII INFINITE SERIES
c
la Illn < -.
n - Ixl
This is equivalent to the statement that
(28.7)
Therefor"l, \anxnl > 1 for infinitely many n, so that the sequence (anX n )
does not converge to zero.
Q.E.D.
(28.9)
f(x) = L: (anX") ,
n=O
then f'(x) =L
n ... l
(nanX"-l).
Therefore, the radius of convergence of the two series is the same, so the
formally differentiated series is uniformly convergent on each compact
subset of the interval of convergence. We can then apply Theorem 28.5
to conclude that the formally differentiated series converges to the
derivative of the given series.
Q.E.D.
It is to be observed that the theorem makes no assertion about the end
points of the interval of convergence. If a series is convergent at an end
point, then the differentiated series mayor may not be convergent at
this point. For example, the series L::=0 (x n /n 2 ) converges at both end
points x = -1 and x = + 1. However, the differentiated series
Moreover, this series converges absolutely to fCk) for lxl < R and uni-
formly over any compact subset of the interval of convergence.
If we substitute x = 0 in (28.10), we obtain the important formula
(28.11) j<k) (0) = k! ak.
this reason we shall not go into these questions but content ourse]ves
with one result in this direction. Fortunately, it is one of the most useful.
then their product is given on this interval by the series L (cnx n ), where the
coelncients (en) are
n
en = L: akb
k=O
n- k for n = 0, 1, 2, ....
PROOF. We have seen in 28.13 that if Ixl < r, then the series giving
f(x) and g(x) are absolutely convergent. If we apply Theorem 26.15, we
obtain the desired conclusion.
Q.:~.D.
converges to f(x} for Ixl < r. Similar (but less stringent) conditions on
the magnitude of the derivatives can be given which yield the same
conclusion.
As an example, we present an elegant and useful result due to Serge
Bernstein concerning the one-sided expansion of a function in a power
senes.
28.19 BERNSTEIN'S THEOREM. Let f be defined and possess deriva-
tives of all orders on an interval [0, r] and suppose that f and all of its
derivatives are non-negative on the interval [0, r]. If 0 < x < r, then f (x)
is given by the expansion
co jCn)(o)
f(x) = L
n=O n!
xn .
PROOF. We shall make use of the integral form for the remainder in
Taylor's Theorem given by the relation (23.16). If 0 < x < r, then
n-l f(k) (0)
(28.14) f(x) = k~ k! x
k
+ Rn ,
where we have the formula
n 1
Rn = x - (1 (1 _ s)n-lfCn) (Sx) ds.
(n-l)l}o
Since all the terms in the sum in (28.14) are non-negative,
n1
(28.15) fer) > r- {I (1 _ s)n-lj<n) (sr) ds.
(n - 1)! }o
Since j<n+l) is non-negative, fen) is increasing on [0, r]; therefore, if x is in
this interval, then
x n- 1 fal
(28.16) Rn < (l - s)n-lf(sr) ds.
(n - I)! 0
markable theorem of the great Abel that, if convergence does take place
at one of the end points, then the series converges uniformly out to this
end point. We shall present two proofs of this result: the first one is a
direct application of Abel's Test 28.9 and is over too soon; the second
one is, perhaps, more interesting.
In order to simplify our notation, we shall suppose that the radius of
convergence of the series is equal to 1. This is no loss of generality and
can always be attained by letting x' =~ x/R, which is merely a change of
scale.
28.20 ABEL'S THEOREM. Suppose that the power series L::=o (anX n )
converges to f(x) for \x\ < 1 and that 'l:: =0 (an) converges to A. Then the
power series converges uniformly in 1= (0, 1) and
(28.17) lim j(x) = A.
x-+1-
f(x)g(x) := L cnx'\
n=O
(28.18)
<:0
- (1 - x) ~ (en - A)x".
11.-0
416 eH. VII INFINITE SERIES
a NH
< E+ (1 - x) < 2E.
1- X
This establishes formula (28.17).
Q.E.D.
One of the mOEt interesting things about this result is that it suggest a
method of attaching a limit to series which may not be convergent. Thus,
if :L:=l (b n ) is an infinite series, we can form the corresponding power
series l: (bnx n). If the bn do not increase too rapidly, this power series
converges to a function B (x) for Ixl < 1. If B (x) -4 {j as x ~ 1 -, we
say that the series l: (b n ) is Abel summable to {j. This type of summation
is similar to (but more powerful than) the Cesaro method of arithmetic
means mentioned in Section 14 and has deep and interesting conse-
quences. The content of Abel's Theorem 28.20 is similar to Theorem
14.8; it asserts that if a series is already convergent, then it is Abel sum-
mabIe to the same limit. The converse is not true, however, for the series
l::=o (-l)n is not convergent but since
1 en
-- = L: (-l)n x n,
1 +x n=O
N <:c
= 1: a,.(l
n=O
- x"') - 1: a,.xn + (t(x)
N+l
- A}.
lim (
m
1
+ t nan)
1 n=O
= O.
N E Xa NH
1: an -
n=O
A 1 - xo) (N + l)e + N + 1 1 - Xo
+ < 3e.
E
Since this can be done for each e > 0, the convergence of L (an) to A is
established.
Q.E.D.
CH. VII INFINITE SERIES
Exercises
28.A. Discuss the convergence and the uniform convergence of the series
L (fn), where in(x) is given by
(a) (x 2 + n 2)-1, (b) (nx)-2, x :P 0,
(c) sin (xln 2 ), Cd) (x n + 1)-\ x "20,
(e) xn(xn + 1)-1, X 2': 0, (f) (_l)n(n + x)-t, x 2 0.
28.B. If L (an) is an absolutely convergent series, then the series L (an sin nx)
is absolutely and uniformly convergent.
28.C. Let (c n ) be a decreasing sequence of positive numbers. If L (c n sin nx)
is uniformly convergent, then lim (nc n ) = 0.
28.D. Give the details of the proof of Dirichlet's Test 28.8.
28.E. Give the details of the proof of Abel's Test 28.9.
28.F. Discuss the cases R = 0, R = + 00 in the Cauchy-Hadamard Theorem
28.13.
28.G. Show that the radius of convergence R of the power series L (anX n) is
given by
lim(~)
la +ll n
whenever this limit exists. Give an example of a power series where this limit
does not exist.
28.H. Determine the radius of convergence of the series 1: (anX n ) , where an is
given by
(a) 1jn", (b) n<:<jn!,
(c) nnIn!, (d) (log n)-l, n > 2,
(e) (n 1) 2j (2n) !, (f) n-vI,;.
28.1. If an = 1 when n is the square of a natural number and an = other-
wise, find tile radius of convergence of L. (anX n ). If b" = 1 when n = m! for
mEN and bn = otherwise, find the radius of convergence of L (bnx n).
28.J. Prove in detail that
lim sup (jna n I1/n ) = lim sup (la n ]lln).
28.K. If < p ~ lanl ~ q for all n E N, find the radius of convergence of
L (anX n ).
28.L. Let f(x) = L (anX n ) for Ixl < R. If f(x) = f( -x) for aU Ix) < R,
show that an = for all odd n.
28.M. Prove that if i is defined for Ixl < r and if there exists a constant B
such that Ij<n) (x) I :::; B for alllxl < rand n E N, then the Taylor series expansion
00 fen) (0)
L:--x
n=O n!
n
28.P. The argument outlined in Exercise: 19.5 shows that the Lagrange form
of the remainder can be used to justify the general Binomial Expansion
(1 + x)'" = n=O
i; (m)n xn
l
co
o
f(x) dx = :C ~ Rn+1,
n=on + 1
provided that the series on the right side is convergent even though the original
series may not converge at x = R. HencEl it follows that
28.T. By using Abel's Theorem, prove that if the series 1: (an) and L: (b n )
converge and if their Cauchy product 1: (en) converges, then we have (en) = :E
L: (an)' 1: (b n ).
28.U. Suppose that an > 0 and that f(x) = 1: (anxtl ) has radius of converg-
ence 1. If L: (an) diverges, prove thatf(;c) ~ +
00 as x ~ 1-. Use this result to
A = Em 2: anXn ,
:1:-.1-
then 1: (an) converges to A.
28.V. Let 2::=O(Pn) be a divergent series of positive numbers such that the
radius of convergence of :E (PnX n ) is 1. Prove Appell'st Theorem: If
8 = lim (an/pn), then the radius of convergence of 1: (anx n) is also 1 and
. r: anx"
lim - - - =
" pnx..
:t-> 1 - .t..J
8.
(Hint: it is sufficient to treat the case 8 = O. Also use the fact that
lim [E (pnX n )]-1 = 0.)
x..... l -
28.W. Apply Appell's Theorem with p(x) = 1::".0 (x n ) to obtain Abel's
Theorem.
28.X. If (an) is a sequence of real numbers and ao = 0, let 8 n = at an + ... +
and let ern = (81 + ... + 8... )/n. Prove Frobenius't Theorem: If 8 = lim (ern)
then
8 = lim L anx n
x.....l - n=O
Projects
28.a. The theory of power series presented in the text extends to complex
power series.
(a) In view of the observations in Section 10, all of the definitions and
theorems that are meaningful and valid for series in R 2 are also valid for series
with elements in C. In particular the results pertaining to absolute convergence
extend readily.
(b) Examine the results of Section 26 pertaining to rearrangements and the
Cauchy product to see if they extend to C.
(c) Show that the Comparison, Root, and Ratio Tests of Section 27 extend
to C.
(d) Let R be the radius of convergence of a complex power series
Prove that the series converges absolutely for Izi < R and uniformly on any
compact subset of {z E C : Izi < R I.
(e) Let f and (J be functions defined for D = I z E C : Iz1 < r I with values in
C which are the limits on D of two power series. Show that if f and (J agree on
D n R, then they agree on all of D.
(f) Show that two power series in C can be multiplied together within their
common circle of convergence.
28.(3. In this project we define the exponential function in terms of power
series. In doing so, we shall define it for complex numbers as well as real.
(a) Let E be defined for Z E C by the series
00 zn
1: "
E(z) = n=on.
Show that the series is absolutely convergent for all z E C and that it is uni-
formly convergent on any bounded subset of C.
(b) Prove that E is a continuous function on C to C, that E (0) = 1, and that
E(z + w) = E(z)E(w)
for z, win C. (Hint: the Binomial Theorem for (z + w)n holds when z, w E C
and n EN.)
(c) If x and y are real numbers, define JE1 and E 2 by E 1 (x) = E(x), E 2 (y) =
E(iy); hence E(x + iy) = E 1 (x)E 2(y). Show that E 1 takes on only real values
but that E'2 has some non-real values. Define C and 8 on R to R by
C(y) = Re E 2(y),
for y E R, and show that
C(Y1 + Y2) = C(Y1)C(Y2) - 8(Y1)8(Y2),
(e) Show that C' = -8 and S' = C. Hence [C2 + 8 2}' = 2 CC' + 2 SS' = 0
+
which implies that C2 8 2 is identically equal to 1. In particular, this implies
that both C and S are bounded in absolute value by 1.
(f) Infer that the function E 2 on R to C satisfies E2(O) = 1, E2(Y1 +
Y2) =
E2(Yl)E2(YZ). Hence E 2( -y) = I/E2(Y) and !Ez(Y) I = 1 for all y in R.
ReFerences
The following list includes books that were cited in the text and some addi-
tional references that will be useful for further study.
Anderson, K. W., and D. W. Hall, Sets, Sequences, and Mappings, John Wiley
and Sons, New York, 1963.
Apostol, T. M., Mathematical Analysis, Addison-Wesley, Reading, Mass., 1957.
Boas, R. P., Jr., A Primer oj Real Functions, Carns Monograph Number 13,
Mathematical Association of America, 1960.
Bohr, H., Almost Periodic Functions, H. Cohn, translator, Chelsea, New York,
1947.
Buck, R. C., Advanced Calculus, McGraw-Hill, New York, 1956.
Dienes, P., The Taylor Series, Oxford University Press, London, 1931. (Reprinted
by Dover, New York, 1957.)
Dieudonne, J., Foundations oj Modern Analysis, Academic Press, New York,
1960.
Dunford, N., and J. T. Schwartz, Linear Operators, Part I, Interscience, New
York, 1958.
Goffman, C., Real Functions, Rinehart, New York, 1953.
Graves, 1. M., Theory oj Functions oj Real Variables, Second Edition, McGraw-
Hill, N ew York, 1956.
Hall, D. W., and G. L. Spencer II, Elementary Topology, John Wiley and Sons,
New York, 1955.
Halmos, P. R., Na~ve Set Theory, D. Van Nostrand, Princeton, 1960.
Hamilton, N. T., and J. Landin, Set Theory, Allyn-Bacon, Boston, 1961.
Hardy, G. H., A Course oj Pure Mathematics, Ninth Edition, Cambridge Univer-
sity Press, Cambridge, 1947.
Hardy, G. H., Divergent Series, Oxford University Press, London, 1949.
Hardy, G. H., J. E. Littlewood, and G. Palya, Inequalities, Second Edition
Cambridge University Press, Cambridge, 1959.
Kazarinoff, N. D., Analytic Inequalities, Holt, Rinehart and Winston, New York
1961.
Kelley, J. L., General Topology, D. Van Nostrand, New York, 1955.
Knopp, K., Theory and Application oj Infinite Series, R. C. Young, translator,
Hafner, New York, 1951.
Landau, E., Foundations of Analysis, F. Steinhardt, translator, Chelsea, New
York, 1951.
422
REFERENCES
The reader is urged not to look at these hints unless he is stymied. Most of
the exercises call for proofs, and there is no single way that is correct; even if
the reader has given a totally different argument, his may be entirely correct.
However, in order to help the reader learn the material and to develop his
technieal skill, some hints and a few solutions are offered. It will be observed
that more detail is presented for the early material.
Section 1
The opposite inclusion is proved by reversing these steps. The other equality is
handled similarly.
l.J. If x E G (n[Aj:j E J}), then x ~ n[Aj:j E J). This implies that there
exists a k E J such that x $ A k Therefore, x E e (A k ), and hence x E U! e (A i) :
j E J). This proves that e (n A J cue (A t>. The opposite inclusion is proved
by reversing these steps. The other equality is similar.
Section 2
2.A. If (a, c) and (a, c') belong to go f, then there exist b, h' in B such that
(a, b), (a, b') belong to f and (b, c), (b', c') belong to g. Since f is a function,
b = b'; since g is a function, C = c'.
2.B. No. Both (0, 1) and (0, -1) belong to C.
2.D. Letf(x) = 2x, g(x) = 3x.
2.E. If (b, a), (b, a') belong to f-1, then (a, b), (a', b) belong to j. Since j is
one-one, then a = a'. Hence j-1 is a, function.
424
HINTS FOR SELECTED EXERCISES
Section 3
Section 4
Section 5
5.A. If a. field F has n elements, then at least two of the elements ke, k = 0,
1, ..., n, must be equal. Hence there exists a natural number p such that pe = O.
5.B. Since a2 > 0 and b2 > 0, then a2 + b2 = 0 implies that a2 = b2 = O.
5.C. Apply Theorem 5.5(a).
+
5.F. If c = 1 a with a > 0, then en = (1 + a)n ~ 1 + na > 1 + a := c.
5.H. Observe that 1 < 21 = 2. If k < 2k for k > 1, then k + 1 < 2k <
22 k = 2 k+1 Therefore, n < 2n for all n EN.
5.1. Note that bn - an = (b - a) (b n --l + ... + an - l ) = (b - a)p, where
p> O.
5.K. If pet) = t, then pet) - n > 0 and hence pet) > n for all n E N.
5.L. It was seen in Exercise 5.H that n < 2n , whence 1/2n < lin for n E N.
5.Q. If x ~ yare common points of the intervals In = {an, bn], n E N, then
since Ix - yl > 0 there exists an M such that
1
2M < Ix -- y\.
426 HINTS FOR SELECTED EXERCISES
Secticm 6
6.Q. If F contains a non-empty interval [a, b], then a = b. Hence the union
of a countable collection of subintervals of F is a countable subset of F.
6.S. If S = sup {f(x, y): (x, y) E X X Yl, then since J(x, y) < S for all
x in X and y in Y, it follows that flex) :::;; 8 for all x in X; hence sup {/l(x):
x E Xl < 8. Conversely, if E > 0 there exi8ts (Xo, Yo) such that 8 - E < f(Xo, Yo).
Hence 8 - E < Jl(XO), and therefore 8 - E < sup {f1(X):X E Xl.
6.U. Sincef(x) < sup {f(z):z EX}, it follows that
f(x) + g(x) < sup {f(z):z E X} + sup {g(z):z E Xl
so that sup {I(x) + g(x):x E X} < sup {f(z):z E X} + sup {g(z):z EX}.
Similarly, if x E X, then
inf {f(z):z E X} < I(x),
whence it follows that
inf {f(z): z E Xl + g(~:) < J(x) + g(x).
If we note that sup {e + g(x):x E Xl = e + sup {g(x):x E X}, it follows f.rom
the preceding inequality (with c = inf J) that
infJ + sup 9 < sup (J + g).
The other assertions are proved in the same way (or follow from these inequalitiies).
Section 7
7.K., A set K is convex if and only if it contains the line segment joining any
two points in K. If x, Y E K 1, then It x + (1 - t)yl ~ tlxl + (1- t)Iyl < t +
(1 - t) = 1 so tx + (1 - t)y E K 1 for < t < 1. The points (1,0) belong
to K 4 , but their midpoint (0,0) does not belong to K4.
7.1. If x, y belong to nK a , then x, y E K a for all a. Hence tx + (1 - t)y E K a
for all a; whence it follows that nKa is convex. Consider the union of two dis-
joint intervals.
7.M. Every point Ixl = 1 is an extreme point of K 1 There are no extreme
points in K 2 Every point on the three bounding lines is an extreme point of K 3
Section 8
so that y E G. If z = 0, then there does not exist a positive real number r such
that every point y in R satisfying Iy[ < r belongs to F. Similarly for z = 1.
8.B. If x = (~, 1) E G, take r = 1 - Ixl. If x E H, take r = inf {Ixl, 1 - Ixl).
If z = (1,0), then for any r > 0, there is a point y in e (F) such that Iy - zl < r.
8.E. If z = 0, then for any r > 0, there is a point yin e (A) such that Iy - zl
< r. Hence A is not open. If w = 1, then for any r > 0, there is a point u in A
such that lu - wi < r. Hence ~(A) is not open so that A is not closed.
8.H. Let Fn = Ix E Rp: Ixl ~ 1 - lin) for n E N.
8.1. Let G be open in Rp and let A be the subset of G consisting of the elements
of G whose coordinates are all rational numbers. By Theorem 3.2, the set A is
countable. For each x in A there is a smallest natural number n z such that if
n> n J: then the set Fz,n = lyE Rp:ly -xl < lin) is contained in G. Show
that G equals U {Fz,n:x E A, n = n",j.
8.J. Take complements of sets in 8.1. Alternatively, if F is closed in Rp, let
Gn = {y E Rp: [y - xl> lin for all x E Fl for n E N. Show that Gn is open
and that F = n IGn : n EN).
8.R. If A- = n {F: F closed and F ~ A), then A-is closed, by Theorem
8.6(c). A- is the intersection of sets containing A, so A c A-. It follows that
A- c (A -)-. Since A-is closed and (A -)- is the intersection of all closed sets
containing A-, it follo\vs that (A~)- c A-, whence A- = (A-)-. If F is closed
and A V B c F, then A C F and B c F, A- C F and B- c F so that A-V
B- C F. Therefore, A-V B- C (A V B)-. COllversely A C A-and B c B- ,
whence it follows that A VB c A - IJ B-. Since the latter set is closed it follows
that (11 V B)- c A - IJ B-. Clearly 0 c 0-, and since 0 is closed, it follows that
0- c ~3.
8.L. True for closed sets, false in general.
8.M" Either argue by analogy with 8.K or use the fact that the complement
of closed sets is open.
S.N. Let'/J = 1, and let A = Q.
HINTS FOR SELECT.E:D EXERCISES
8.P. Argue as in Exercise 8.I, but use open balls Gx,n = {y E Rp:\y - ;1;1 <
lin l, X E A, n > n x , instead of the closed balls F "',n'
8.R. If AX B is closed in R2 and x = (h, b) does not belong to A X B, then
there exists r > 0 such that {y E R 2 : I!/ - xl < r 1 is disjoint from A X B.
Therefore, if G = {1] E R: 111 - ~ll < r L then G n A = 0. Hence A is closed
in R. Conversely, if A, B are closed and J; = (~l, ~2) E A X B, then there exists
s > 0 such that {11i E R: hi - Eil < s j is disjoint from A, B, respectively. Hence
the set lyE R2: Ix - y\ < s1does not intersect A X B.
8.S. If r is a positive rational number, and if y is a point in Rp with rational
coordinates such that the ball {x E Rp: 'IX - y\ < r I intersects A, then choose
one point in this intersection. Doing this for each T, y, we obtain C.
8.T. If x is a cluster point of A and U is a neighborhood of X containing only
a finite number of points of A, then there exists a neighborhood of x which con-
tains no points of A except, perhaps, the point x.
8.V. If G is open and y E G, then some neighborhood V of y is contained in G,
so y cannot be a boundary point. If G is not open, there exists a point x E G,
every neighborhood of which contains a point of ~(G). Hence, if G is not open,
it contains at least one boundary point.
8.X. Suppose that A, B are open sets in R2 which form a disconnection for
C1XCZ' LetA l = {xER:(x,Y)EA forsomeyERI and letB l = {~~ER:
(x, Y) E B for some y E R I. Then At and B l are non-void open sets in R (why?)
and their union contains Ct. Since Cl is connected, there exists an element
Ul EAt n B 1 belonging to CI. Now let A z = {y E R: (UlJ y) E AI, and B 2 =
{y E R: (Ul' y) E B I. Again, A z and B z are non-void open subsets of R and their
union contains Cz. Since Cz is connected, there exists an element U2 in Il z (\ B z
belonging to Cz. Since Ut E At anduz l~ A z, then (Ul, uz) E A; similarly (Ut, U2)
E B, which contradicts the supposition that A (\ B = 0.
8.Y. No line segment can lie in A and terminate at the origin.
Section 9
If there are infinitely many points in the bounded set {x n }, then there is a cluster
point ;z:. Since Xm E Fn for m > n and since Fn is closed, then x E F11 for all n EN.
9.H. If d(x, F) = 0, then x is a cluster point of the closed set F.
9.J. No. Let F = {y E Rp: Iy - xl = rl, then every point of F has the same
distanee to x.
9.K. Let G be an open set and let x E Rp. If H = {y - x:y E G}, then H is
an open set in Rp.
9.L. Apply 8.3(b) and 8.18.
9.M. Follow the argument in 9.7, except use open intervals instead of open
balls.
9.N. If the point (xo, Yo) lies on the line ax + by = c, then the point (xo + at,
yo + bt), t 0, does not lie on this line and the distance between these points is
ItI(a 2 + b2 )1/2.
9.0. Yes. Let A = Q n L
9.P. Not necessarily.
9.Q. Suppose that Q = n (Gn:n E Nl, where Gn is open in R. The comple-
ment F'n of Gn is a closed set which does not contain any non-empty open subset,
by The~orem 5.16. Hence the set of irrationals is the union of a countable family
of closed sets not one of which contains a non-empty open set but this contradicts
Corolla,ry 9.10.
9.S. If F is a closed set containing the dense set D, ~hen it follows from
Theorem 8.10 that F must contain every cluster point of D and hence F = Rp,
so that D- = Rp. If D is not dense in Rp, let x be a point in Rp which is not a
cluster point of D, and let V be an open neighborhood of x which contains no
point of D except (perhaps) x. Then H = e(V) V Ix} is a closed set containing
D such that H rf Rp.
9.T. Of course, the entire space Rp is open and dense in Rp. Alternatively, let
p = 1 and take G = R \Z, then G is open and dense in R" The only dense closed
subset of Rp is Rp.
9.D. By Exercise 8.K, (A V B)- = A- V B~ so it follows that the union of
any subset with a subset which is dense, is also dense. According to Exercise 9.R,
an open set is dense in Rp if and only if its complement does not contain a non-
void open set in Rp. Apply Baire's Theorem (or the first two steps in its proof),
to find that the closed set e(Dl n D 2 ) = e(D 1 ) U e(D2 ) cannot contain a
non-void open subset of Rp.
9.V. The set D 1 V D 2 is dense, but it may happen that D1 n D 2 is not dense.
9.W. As in the second part of Exercise 9.D, we consider the complements
e(Dn ). The result follows from Baire's Theorem.
Section 10
1O.A" Examine the geometrical position ofiz = (-y, x) in terms of z = (x, y).
1O.B" Note that cz = (x cos () - y sin (), x sin () + y cos 0), and this corre-
sponds to a counter-clockwise rotation of 8 radians about the origin.
lO.e.. The circle Iz - cj = r is mapped into the circle Iw - Cac + b)1 = lair.
We can write z = a-1w - a-1b and calculate x = Re z, y = 1m z in terms of
HINTS FOR SELECTED EXERCISES 431
Section 11
Section 12
12.A. Either multiply by -1 and use Theorem 12.1, or argue as in the proof
of 12.1.
12.B. By induction 1 < X n < 2 for n ~ 2. Since Xn+l - X n = (x n - X n-l)'
(xn Xn-l)-l, then (x n ) is monotone.
12.D. The sequence (an) is decreasing.
12.E. If (x n ) does not have an increasing subsequence, each Xn is exceeded by
at most a finite number of X m with m > n. Let k 1 = 1 and take k 2 > k 1 such that
if m ~ k2, then X m < Xkl' Choose k a > k2 such that if m ~ ka, then X m < Xkz;' etc.
12.H. The sequence (x n ) is increasing and X n < n/(n + 1) < 1.
12.1. Yes.
12.L. If f > 0 there exists K = K(f) sueh that (L - f)X n < Xn+l < (L + f)Xn
for n ~ K. By an argument similar to the one in Exercise I1.J, there exist A, B
with the stated property.
12.N. Yes.
HINTS FOR SELECTED EXERCISES
12.P. Note that (xn:!) is a subsequence of ((1 + 1/n)n) and that (Yn) is in-
creasing. Either argue as in Example 12.3(d) to show that (Yn) is bounded or
consider (Y2k1!2).
12.Q. The sequence (an) is increasing and (b n ) is decreasing. Moreover,
an < b", for all n EN.
12.V. Let yn E F be such that Ix - Ynl < d + l/n. If Y = lim (Ym,), then
Ix - yl = d.
12.X. The set K is bounded. To show that it is closed, let z be a cluster point
of K. Use Exercise 12.T to obtain z = Jim (x n + Yn) and use the Bolzano-
Weierstrass Theorem 12.4 twice.
Section 13
13.A. All.
13.C. If x E Z, the limit is 1; if x ~ Z, the limit is 0.
13.E. If x = 0, the limit is 1; if x ~ 0, the limit is 0.
13.G. If x >
and
< E < 71"/2, then tan (71"/2 - E)
nx 2: tan (71"/2 - E) for all n 2: n x , from which
> 0. Therefore
Section 14
14.A" (a) lim inf(xn ) = -1; lim SUp (x n ) = 1. (b) 0,2. (c) The sequence is
neither bounded below nor above.
14.B. See the first part of the proof of Lemma 14.5.
14.C. Let y = lim inf (x n ). Let nl be such that x n! < Y + 1; let n2 > nl be
such that X n2 < Y + !, etc.
14.D. (i) If x* = lim sup (x n ) and > 0, then there are only a finite number of
n E N such that x* + E < Xn. However, by the definition of x*, there must be an
infinite number of n E N such that x* - E < Xn. Hence 14.3(a) implies 14.3(b).
(ii) If x* satisfies the condition in 14.3(b), then for m sufficiently large we have
Vm < x* + E. Hence inf (v m 1 < x* + E and so inf 1Vrn I < x*. Since there are in-
finitely many n E N with x* - f < Xn, then x* - E < Vrn for all m. Hence x* -
E < inf 1Vm I for arbitrary E > and so x* < inf (v m I. Therefore 14.3 (b) implies
14.3(c). (iii) Since (v m) is a monotone decreasing sequence, then inf (vml =
lim (v m ), so 14.3(c) implies 14.3(d). (iv) If X' = (x nk ) is a convergent subse-
quence of X = (x n ), then since nk > k, we have Xnk < Vk and hence lim X' <
lim (Vk) - x*. Conversely, there exists a natural number n, such that Vi - 1 <
1
X1ll < V1. Inductively, choose nk+l > nk such that Vnk - k +1 < X +
nk 1 < Vn".
HINTS FOR SELECTED EXERCISES 433
(x + y)* = inf {vm(x + y):m E Nl < vm(x) + vm(y) < vp(x) + vm(y)
for p < m. Hence (x + y)* < vp(x) + y* for all pEN and so (x + y)* <
x* + y*.
(ii) Since vm(x + y) < vm(x) + vm(y) and since the sequences V m are con-
vergent, we have, using Theorem 11.14,
Section 15
Section 16
Section 17
17.B. Yes.
17.C. Obtain the function in Example 15.5(h) as the limit of a sequence of
continuous functions.
17.D. Yes.
17.E. (a) The convergence is uniform for 0 < x < 1. (b) The convergence
is uniform on any closed set not containing x = 1. (c) The convergence is
uniform for < x < 1 or for x ~ c, where C > 1.
17.1. (Yes)3. Give examples.
17.J. It follows that f is monotone inereasing. By hypothesis, f is uniformly
continuous. If E > 0, let 0 = Xo < Xl <: ... < Xk = 1 be such that I(Xi) -
f(xi-l) < E and let N(Xi) be such that if n ~ N(Xi), then If(xi) - f,,(Xi) I < E.
If n ~ sup {N(xo), . , N(Xk)!, show that II! - f,,11 < 3E.
17.0. Observe that if 81 ~ 82 are two points in the interval [0, 2".], then either
sin 81 -:;e sin 82 or cos 81 cos 82. Since the functions
sin (nO) sin (mO), sin (nO) cos (mO), cos (nO) cos (m8)
are given by linear combinations of the functions
sin (n m)8, cos (n m)9,
436 HINTS FOR SELECTED EXERCISES
Section 18
18.G. (b) If E> 0, there exists aCE) > 0 such that if c < x < c + aCE),
x E ~(f), then If(x) - bl < E. (c) If (x,,) is any sequence in ~(f) such that
C < Xn and c = lim (x ll ) , then b = lim (/(x ll ).
18.J. (a) If M> 0, there exists m> 0 such that if x ~ m and x E ;D(f),
then f(x) ~ /ff. (b) If M < 0, there exists 0 > 0 such that if 0 < Ix - cl < 0,
thenf(x) < M.
18.L. (a) Let ~(r) = sup {j(x):x > rl and set L = lim /pCr). Alterna-
..-.+'"
tively, if E > 0 there exists m(E) such that if x ~ m(E), then Isup {f(x):x > r}
-LI < E.
18.M. Apply Lemma 18.12.
18.N. Consider the function f(x) = -I/Ixl for x ":F 0 and f(O) = O.
18.P. Consider Example 15.5(h).
18.R. Not necessarily. Consider fn(x) = - x n for x E I.
18.S. Yes.
Section 19
19.D. Observe that g'(O) = 0 and that g'(x) = 2x sin (l/x) - cos (l/x) for
x ~ O.
19.E. Yes.
19.F. Let {Tn} be an enumeration of the rational numbers in I and consider
L 2-nlx - r.. l
19.H. Yes.
19.K. If a and b are not multiple roots of f, then f'(a)f'(b) < O. The general
cltSe can be reduced to this one.
19.M. The function f has n-fold roots at x = 1. Hence f' has (n - I)-fold
roots at x = 1 and a root in the interval (-1, 1), etc.
19.U. (a) Since f'(a) exists, if E > 0, then f(a h) - f(a) = hf'(a)
hE(h), where !E(h)1 < E for h sufficiently small. Hence I(a +
h) - f(a - h)
= 2hf'(a) + h[E+(h) +E_(h)J.
19.V. Use the Mean Value Theorem.
19.X. That such examples exist is an immediate consequence of the Weier-
strass Approximation Theorem.
HINTS FOR SELEc'rED EXERCISES 437
Section 20
20.F. The partial derivatives h, f" at 0 are both O. By Theorem 20.6, if f is
differentiable at 0, then Df(O) sends the vector (a, b) into O. However, this
possibility for Df(O) does not satisfy equation (20.2).
20.1. By Theorem 20.6, the directional derivative of I at c in the direction of
the unit vector w = (WI, .. , wp ) is given by
fw(c) = fh(C)WI + ... + l~p(c)wl"
By the C.-B.-S. Inequality 7.6, we have
fw(c) < {L If~ic)l2p/2.
If at least one partial derivative is non-zero, the equality can hold if and only
if w is a multiple of the vector Vc = (fil(C)", .,ff.(c).
20.K. If f(x) f(x) = 1 for xED, then applying Theorem 2O.8(b), we obtain
2f(c) . Df(c)(u) = 0 for all u.
20.L. Df(m, 1]2) (~l, ~2) = ( 2A m + B1]2)~1 + (B1]l + 2C112)t2'
20.0. Consider the function defined just before Theorem 20.7.
20.P. Consider the function f(~, 7]) = 1]2 sin (~/7]) for 11 -:;e 0 and 1(;,0) = o.
20.R. It follows from Theorem 8.20 that any two points in D can be joined
by a polygonal curve lying inside D.
20.U. In factf~,,(O, 0) = -1 andf~(O, 0) = +1.
Section 21
21.E. Consider f(x) = x 3
21.F. Show that f is not one-one near x = O.
21.G. Reconsider 21.F.
21.H. Apply the Chain Rule 20.9.
21.J. Examine the proof of 21.4.
21.K. Apply Lemma 21.7.
21.P. If t i = if' i( 1]1, 7]2), then
aif'l (
- 1]1,1]2
)
=
3~23 + 1]1
am 1]l1]2 - 9~I2~22
Section 22
22.D. If IIPII < 0 and if Q is a refinement of P, then IIQII < O.
22.F. If I: > 0, let P e = (Xo, Xl, , x,,) be a partition of J such that if P ;2 P e
and S(P;fl is any corresponding Riemann sum, then IS(P;f) - f.' f I< t.
HINTS FOR SELECTED EXERCISES
Let M ~ IIfll and let 0 = e/4Mn. If Q = (Yo, yl, ..., Ym) is a partition with
norm IIQII < 0, let Q* :::; Q U P t so that Q* :::::> P and has at most n - 1 more
poin~ than Q7 Show that S(Q*;fl - SeQ;!) reduces to at most 2(n - 1) tenns
of the form (f(E) - !(-'7) I (x; - Yk) with lXi - Ykl < o.
22.G. Observe that the displayed inequality asserts that if Si(P;!, g),j = 1,2,
are two Riemann-Stieltjes sums corresponding to the partition P, then
ISl(P;!, g) - S2(P;!, g)1 < E. In proving the sufficiency of this condition, it
may be convenient to select ~i and "7i such that !(~i) and f("7i) are close to the
supremum and the infimum, respectively, of f on the interval [Xi-l, x;].
22.H. Apply Exercise 22.G.
22.X. Let gl(O) = 0, gl(X) == ! for 0 < x < 1 and gl(l) = 1.
Section 23
23.0.. If m < j(x) < M for e <x$ d, then there exists a constant A with
m < A < M such that
23.D. Consider f(x) = -1 for x < 0 and!(x) = 1 for x > 0, then F(x) = Ixl.
23.E.. Apply the Mean Value Theorem 19.6 to obtain F(b) - F(a) as a Rie-
mann sum for the integral of f.
23.J. If m < f(x) < M for x E J, then
23.K. Since f.(x) - f.(c) = l' fn', we can apply Theorem 22.12 to obtain
23.N. Direct estimation shows that M" < M(b - a)1'''. Conversely,/(x) >
M- E on some subinterval of J.
23.0. The functions 'P, 'P- 1 are one-one and continuous. The partitions of
[e, d] are in one.-one correspondence with the partitions of [a, b} and the Riemann-
Stieltjes sums of Jog with respect to g 0 'P are in one-one correspondence with the
Riemann-Stieltjes sums of f with respect to g.
23.R. Prove that the functions G and H are continuous. The remainder of
the proof is as in 23.12.
23.S. Apply Theorem 23.5 to (23.15) with h(t) = (b - t)..-I.
23.V. The function f is uniformly continuous on J 1 X J 2
HINTS FOR SELECTED EXERCISES
Section 24
24.A. Since J is uniformly continuous, if E > 0, one can divide I into a finite
number of subintervals Ii such that if X, Y E I j, then Ifex) - J(y)1 < E. Hence
G can be enclosed in a finite number of rectangles with total content less than E.
24.F. Apply Theorem 7.11.
24.G. Since f is uniformly continuous, if the partition P is sufficiently fine
and if M ~ Ilgll, then
Lf(~k)g(l1k)A(JsJ = fMA(J) + S(Pifg).
24.H. If x is a boundary point of DUB, then every neighborhood V of X
contains a point in D V B and a point in e(D V B) c e(D). Since B is the
boundary of D, it follows that V contains a point of D. Hence x is in B. Consider
ExampIe 24.2 (g).
24.P. Apply Taylor's Theorem 23.13 to the function F on I to R defined by
F(t) = f(x + t(y - x). Then F'(t) = Df(x + l(y - x)(y - x), etc.
Section 25
Section 26
Section 27
27.A. (c) If L (an) is absolutely convergent, then L (bn) is too. If an = 0
except when sin(n) is near 1, we can obtain a counter-example. (d) Consider
an = [n(log n)2]-l.
27.D. (a) and (e) are divergent. (b) is convergent.
27.E. (b), (c), and (e) are divergent.
27.1. (a) is convergent. (b) is divergent.
27.J. (a) is convergent. (c) is divergent.
27.R. Apply Dirichlet's Test.
Section 28
28.A. (a) and (c) converge uniformly for all x. (b) converges for x ~ 0 and
uniformly for x in the complement of any neighborhood of x = O. (d) converges
for x > 1 and uniformly for x ~ a, where a > 1.
28.0. If the series is uniformly convergent, then
len sin nx + ... + C2n sin 2nxl < E,
provided n is sufficiently large. Now restrict attention to x in an interval such
that sin kx > ! for n < k < 2n.
28.H. (a) 00, (c) lie, (f) 1.
28.L. Apply the Uniqueness Theorem 28.17.
28.N. Show that if n E N, then there exists a polynomial P n such that if
x ~ 0, thenf<n) (x) = e- 1/Z2P.,(1/x).
28.T. The series A (x) = L (anX n), R(x) = :E (bnx n), and G(x) = L (cnX n )
converge to continuous functions on 1. By the Multiplication Theorem 28.8,
G(x) =, A (x)B(x) for 0 < x < 1, and by continuity G(l) = A (l)R(l).
28.U. The sequence of partial sums is increasing on the interval [0, 1].
28.V. If t > 0, then lanl < EPn for n > N. Break the sum :E (anxn) into a
sum over n = 1, ..., N and a sum over n > N.
Index
447
INDEX