Notes Baby Rudin
Notes Baby Rudin
Notes Baby Rudin
nathanl3
2 Basic Topology 21
2.1 Finite, Countable and Uncountable Sets . . . . . . . . . . . . . . . . . . . . . . . . . . . 21
2.2 Metric Spaces . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23
2.3 Compact Sets . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26
2.4 Perfect Sets . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29
2.5 Connected Sets . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29
2.6 Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30
4 Continuity 63
4.1 Limits of Functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 63
4.2 Continuous Functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 63
4.3 Continuity and Compactness . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 65
4.4 Continuity and Connectedness . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 66
4.5 Discontinuities . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 66
4.6 Monotonic functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 66
4.7 Infinite Limits and Limits at Infinity . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 67
4.8 Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 67
2
Contents
5 Differentiation 74
5.1 The Derivative of a Real Function . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 74
5.2 Mean Value Theorems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 75
5.3 The Continuity of Derivatives . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 76
5.4 L’Hopital’s Rule . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 76
5.5 Derivatives of Higher Order . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 76
5.6 Taylor’s Theorem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 77
5.7 Differentiation of Vector-Valued Functions . . . . . . . . . . . . . . . . . . . . . . . . . 77
5.8 Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 78
6 TODOs 88
3
1 The Real and Complex Number Systems
Proof. Let L be the set of lower bounds for B in S. Since B is bounded below, L is non-empty; and a
subset of S by definition. Since S has the least upper bound property, there exists some α = sup L ∈ S.
We claim that α = inf B and in particular, inf B ∈ S.
To show this, it suffices to show that α is a lower bound for B, and that if β > α in S, then β is not a
lower bound for B. If α were not a lower bound for B, there must be some x ∈ B with x < α. However
this would imply that x is an upper bound for L and thus α ≥ β, a contradiction.
If β > α and β were a lower bound for B (so β ∈ L), then since α = sup L, we would have β ≤ α, a
contradiction.
Thus, α = inf B ∈ S. In particular, since this holds for all non-empty B ⊆ S bounded below, S has the
greatest lower bound property.
1.2 Fields
On to defining the notion of fields!
Definition 2. A field F is a set equipped with two operations, + and × with the following properties:
4
1 The Real and Complex Number Systems
• F is a commutative group under addition, with additive identity 0. We denote the additive inverse of
x to be −x.
• F \ {0} is a commutative group under multiplication, with additive identity 1. We denote the
multiplicative inverse of x to be 1/x.
• x × (y + z) = x × z + y × z for all x, y, z ∈ F .
Proposition 1. If G is a group with operation ·, then for x, y, z ∈ G,
(a) x · y = x · z → y = z
(b) x · y = x → y = 1
(c) x · y = 1 → y = x −1
(d) (x −1 )−1 = x
Proof. Statement (a) follows from multiplying x −1 to both sides. (b) follows by letting z = 1 in (a); (c) by
taking z = x −1 ; and (d) follows since x −1 · x = 1 and by replacing x with x −1 in (c).
Notice that these properties can be applied both to the additive and multiplicative groups in a field F .
In addition to the above group properties, the distributive law leads to the following properties of fields:
Proposition 2. In a field F , for any x, y ∈ F , we have
(a) 0 × x = 0.
(b) If x , 0 and y , 0, then x × y , 0.
(c) (−x) × y = −(x × y) = x × (−y).
(d) (−x) × (−y) = xy.
Proof. For (a), notice that since 0 + 0 = 0, we have 0 × x = (0 + 0) × x = 0 × x + 0 × x, and the result
follows from Proposition 1(b) on the additive group.
For (b), notice that if x × y = 0 in this case, multiplying both sides of the equality by y −1x −1 would
produce 1 = 0, a contradiction.
For (c), notice that 0 = (x + (−x)) × y = x × y + (−x) × y, and the rest follows from Proposition 1(c).
The other side follows similarly.
Statement (d) follows from (c) with −y in place of y and using Proposition 1(d).
Now that we’ve sufficiently introduced ordered sets and fields, we wonder what would happen if we
combined them...
Definition 3. An ordered field F is a field which is also an ordered set, with the following properties for
all x, y, z ∈ F :
1. x + y < x + z if y < z.
2. x × y > 0 if x > 0 and y > 0
If x > 0, we say x is positive, and we say x is negative if x < 0. For example, Q is an ordered field with
the regular notions of +, × and <.
The following properties hold for ordered fields:
Proposition 3. For x, y, z ∈ F for some ordered field F , we have
5
1 The Real and Complex Number Systems
Proof. For (a), if x > 0, then 0 = x + (−x) > 0 + (−x) = −x. The other direction follows similarly.
For (b), notice that since y < z, we have z − y > 0 and thus x(z − y) > 0. Then, notice that xz =
x(z − y) + xy > xy.
For (c), notice that −x > 0 by (a) so −(xy) = (−x)y < (−x)z = −(xz). The result follows by adding
xy + xz to both sides.
For (d), if x > 0, then x 2 = xx > x × 0 > 0, otherwise −x > 0 and the argument is analogous since
(−x)2 = −(−x 2 ) = x 2 . In particular, 1 = 1 × 1 = 12 > 0.
For (e), notice that since x > 0, we must have x −1 > 0 otherwise 1 = x × x −1 < 0, contradicting (d). A
similar argument holds for y −1 , and thus x −1y −1 > 0. The result follows by multiplying the inequality
by the positive value x −1y −1 .
Proof. For (a), let A = {nx : n ∈ N}. If (a) were true, A would be bounded above, and clearly non-empty,
and thus admits a least upper bound α = sup A in R. However, α − x < α is not an upper bound for A,
so there exists some α − x < mx ≤ α in A. But then α < (m + 1)x ∈ A, a contradiction.
For (b), notice that y −x > 0 so from (a), there exists n ∈ N such that n(y −x) > 1. Apply (a) again to find
m 1 > nx and m 2 > −nx so that −m 2 < nx < m 1 . Thus, there exists m ∈ N such that m − 1 ≤ nx < m.
Adding n(y −x) > 1 to both sides of nx ≥ m − 1 yields ny > m and so dividing both sides of nx < m < ny
by n > 0 yields the result.
6
1 The Real and Complex Number Systems
For (c), notice first of all that the function f (x) = x n is monotonically increasing on the positive real
numbers. To see this, factor
b n − an = (b − a)(b n−1 + b n−2a + · · · + b 2an−3 + ban−2 + an−1 ) > n(b − a)an−1 > 0
and similarly we can show b n − an < n(b − a)b n−1 . These inequalities will also be helpful later. The fact
that n-th positive real roots are unique follows immediately from the fact that x n is strictly increasing.
x
Let A = {y > 0 : y n < x }. Then, A is bounded above by max(1, x) and contains 1+x . Since R satisfies
the least upper bound property, α = sup A exists. We claim that α n = x.
Suppose otherwise. If α n < x, then for
x − αn
0 < t < min 1, , we have (α + t)n − α n < nt(α + t)n < nt(α + 1)n < x − α n
n(α + 1)n−1
αn − x
0 < t = min 1, , we have α n − (α − t)n < ntα n−1 = α n − x
nα n−1
so (α − t)n > x > z n for each z ∈ A, so in particular α − t is a smaller upper bound than α for A,
contradicting the fact that α is the least upper bound for A.
For (d), notice that a 1/n b 1/n > 0 since both terms are positive from (c), and that (a 1/n b 1/n )n = ab since
multiplication is commutative, so from (c), the n-th positive root is unique, (ab)1/n = a 1/n b 1/n .
Definition 4. Let x ∈ R >0 . Let n 0 be the largest integer such that n 0 ≤ x. Then, given n 0, n 1, . . . , nk −1 ,
let nk be the largest integer such that
n1 nk
n0 + + · · · + k ≤ x.
10 10
Then, letting E be the set of the numbers
n1 nk
n0 + + · · · + k , k ∈ N,
10 10
we have x = sup E. We define n 0 .n 1n 2n 3 . . . to be the decimal expansion of x.
7
1 The Real and Complex Number Systems
and note that the set of complex numbers forms a field, with (0, 0) and (1, 0) playing the roles of 0 and 1
respectively.
We define i = (0, 1) and notice i 2 = −1. This notation is convenient as we can do away with the ordered
pair notation and write (a, b) = a + bi instead.
If z = a + bi for a, b ∈ R, then we define <(z) = a and =(z) = b to be the real part and imaginary part
of z respectively. z = a − bi is called the conjugate of z.
The following properties of complex numbers will be helpful:
Theorem 4. If z, w ∈ C, then
(a) z + w = z + w.
(b) zw = z · w.
(c) z + z = <(z), z − z = 2i=(z).
(d) zz ∈ R >0 when z , 0.
For (b), notice that zw = (ac − bd) + (ad + bc)i and that
Since the value zz is a non-negative real number, we can think of it as some measure of how ‘large’ z is:
the following definition formalizes this:
√
Definition 7. We define |z| = zz, called the magnitude or absolute value of z, which exists and is
unique from Theorem 3c. When x ∈ R, |x | = x when x > 0 and −x otherwise. In particular, the fact that
x ≤ |x | when x ∈ R is sometimes useful.
The following are helpful properties of the absolute value:
Theorem 5. If z = a + bi, w = c + di ∈ C, then the following are true:
(a) |z| ≥ 0, with equality iff z = 0.
(b) |z| = |z|.
(c) |zw | = |z| · |w |.
(d) |<(z)| ≤ |z|.
8
1 The Real and Complex Number Systems
(e) |z + w | ≤ |z| + |w |.
Proof. For (a), clearly |0| = 0. Otherwise, zz is a positive real number, and thus has a positive square
root, which is precisely |z| by definition.
√
For (b), notice that both sides evaluate to a 2 + b 2 .
For (c), if either z or w is 0, the result follows immediately. Otherwise, we can see
√ √ √ √
|zw | = zwzw = zzww = zz ww = |z||w |
where the third equality uses Theorem 3(d) since zz and ww are both positive from Theorem 4(d).
For (d), notice that √ √
|<(z)| = |a| = a 2 ≤ a 2 + b 2 = |z|
where the third inequality follows since f (x) = x 2 is monotonically increasing as shown in Theorem
3(c). Notice that equality holds iff b = 0, that is, z ∈ R.
For (e), notice that
The result follows by taking square roots, since both sides are non-negative real numbers.
|Ba j − Cb j | 2 =
Õ Õ
0≤ (Ba j − Cb j )(Ba j − Cb j )
= B2
Õ Õ Õ Õ
a j a j − BC a j b j − BC a j b j + CC bjbj
= B2 |a j | 2 − BCC − BCC + CC |b j | 2
Õ Õ
= B 2A − B|C | 2 − B|C | 2 + |C | 2 B
= B(AB − |C | 2 )
9
1 The Real and Complex Number Systems
Definition 8. For k ∈ N, let Rk be the set of all ordered k-tuples x = (x 1, x 2, . . . , x k ), where x i ∈ R are
called the coordinates of x. We call these tuples vectors.
We define addition as component-wise addition, and scalar multiplication by a real number as component-
wise multiplication, so that Rk is closed under these two operations. The fact that Rk satisfies these
properties, as well as the operations satisfying the associative, commutative, and distributive laws make Rk
into a vector space over R.
We define the inner product of x and y by
k
Õ
x·y= x i yi
j=1
j=1
Now, Rk along with its associated inner product and norm is called Euclidean k-space.
We can now use this definition to prove some important properties of Rk :
Theorem 7. Suppose x, y, z ∈ Rk and α ∈ R. Then, the following are true:
(a) |x| ≥ 0.
(b) |x| = 0 iff x = 0.
(c) |αx| = |α ||x|.
(d) |x · y| ≤ |x||y|.
(e) |x + y| ≤ |x| + |y|.
(f) |x − z| ≤ |x − y| + |y − z|.
Proof. The facts (a), (b), and (c) follow directly from the definitions.
For (d), this is precisely the Cauchy-Schwartz inequality restricted to R, applied to the components.
For (e), we write
|x + y| 2 = (x + y)(x + y)
=x·x+x·y+y·x+y·y
= |x| 2 + 2x · y + |y| 2
≤ |x| 2 + 2 |x · y| + |y| 2
≤ |x| 2 + 2|x||y| + |y| 2
= (|x| + |y|)2
The fact that x ≥ 0 with equality iff x = 0, and fact (f), called the Triangle Inequality, let us regard Rk
as a metric space.
R1 = R is called the (real) line, R2 is the plane. It is noteworthy that the norms in R2 and C are consistent.
10
1 The Real and Complex Number Systems
11
1 The Real and Complex Number Systems
(A1) α + β ∈ R.
Notice that α + β is non-empty since α and β are non-empty. To show that α + β , Q,
take elements r 0 < α, s 0 < β. Then, r 0 + s 0 > r + s for all r ∈ α, s ∈ β, so in particular,
r 0 + s 0 < α + β so α + β , Q, showing condition (I).
For condition (II) let p = r +s ∈ α +β for some r ∈ α, s ∈ β, and q < p. Then, q−s < p−s = r
so q − s ∈ α. Thus, q = (q − s) + s ∈ α + β.
For condition (III), let t > r in α, so that t + s > r + s in α + β.
(A2) α + β = β + α.
This follows from the definition and the commutativity of Q:
α + β = {r + s : r ∈ α, s ∈ β } = {s + r : s ∈ β, r ∈ α } = β + α
(A3) (α + β) + γ = α + (β + γ ).
This follows as above from the definition and the associativity of Q.
(A4) α + 0∗ = α.
If r ∈ α, and s ∈ 0∗ , then r + s < r ∈ α, so α + 0∗ ⊆ α. For the other side, take r ∈ α
and t > r in α. Then, r − t < 0 so r − t ∈ 0∗ and thus r = t + (r − t) ∈ α + 0∗ and thus
α ⊆ α + 0∗ , as required.
(A5) −α exists.
Let β be the set of p such that there exists r > 0 such that −p − r < α. We claim that β ∈ R
and α + β = 0∗ .
Let s < α. Then, s + 1 ∈ β since (s + 1) − 1 < α, so β is non-empty. Also, if r ∈ α, then
r < β, so β , Q, satisfying condition (I) for being a cut. For condition (II) let p ∈ β and
q < p in Q. Then, −p − r < α for some r > 0. Then, −q − (−q + p + r ) = −p − r < α, and
−q + p + r > r > 0, so q ∈ β, proving condition (II). For condition (III), let p ∈ β. Then,
there exists r > 0 so that −p − r < α. Then, p + r /2 > p is in β since −(p + r /2) − (r /2) < α
and r /2 > 0, proving condition (III). So β ∈ R.
Let p ∈ α and q ∈ β. Since q ∈ β, there exists r > 0 such that −q − r < α. Thus, from
condition (II) on α, p < −q − r and thus p + q < −r < 0 so p + q ∈ 0∗ . Since our choice of
p and q was arbitrary, α + β ⊆ 0∗ .
Let t < 0. We want to find p ∈ α and q ∈ β such that p + q = t. Let s = −t/2 so that
s > 0. Then, by the Archimedean property, there exists an integer n such that ns ∈ α but
(n + 1)s < α. Take p = ns and q = −(n + 2)s = t − p. Since −(−(n + 2)s) − s < α and s > 0,
q ∈ β, so we are done!
Step 5. Towards an ordered field
Notice that α + β ⊂ α + γ if β ⊂ γ , so the first condition for ordered fields is satisfied. It also
follows that α > 0∗ iff −α < 0∗ .
Step 6. Multiplication, but only somewhat
Let R+ = {α ∈ R : α > 0∗ } be the positive real numbers. If α, β ∈ R+ , define α β to be the set
of p ≤ rs in R+ , for some positive r ∈ α, s ∈ β with r, s > 0. We define 1∗ to be the set of all
rational numbers less than 1.
It can be shown, similarly to the proofs of (A1) to (A5), that the axioms of multiplication and
distributivity hold for multiplication on R+ :
12
1 The Real and Complex Number Systems
13
1 The Real and Complex Number Systems
0∗ if α = 0∗ or β = 0∗
αβ if α > 0∗ , β > 0∗
α β = −(α(−β))
if α > 0∗ , β < 0∗
−((−α)β) if α < 0∗ , β > 0∗
if α < 0∗ , β < 0∗
(−α)(−β)
The proofs for the multiplication axioms and distributivity follow from Step 6, with repeated
use of γ = −(−γ ). The proofs are omitted.
This proves that R is an ordered field with the least-upper-bound property!
Step 8. Relation to Q
Associate with each q ∈ Q, a set q ∗ = {r ∈ Q : r < q}. Notice that this definition is consistent
with our definitions of 0∗ and 1∗ above (the asterisk was not a coincidence). These cuts satisfy
the following relations:
a) r ∗ + s ∗ = (r + s)∗ .
b) r ∗s ∗ = (rs)∗ .
c) r ∗ < s ∗ iff r < s.
This shows that Q is isomorphic to Q∗ whose elements are the rational cuts. This relationship
is the reason we can regard Q as a subfield of R.
1.8 Exercises
1. If r is rational and non-zero, and x is irrational, prove that r + x and rx are irrational.
Proof. Suppose otherwise: then (r + x) − r and (rx)/r would be operations on rational numbers,
and thus evaluate to rational numbers. However, these both evaluate to x, which is irrational, a
contradiction.
Proof. We first prove that there is no rational number whose square is 3. Suppose that there
existed p/q ∈ Q whose square is 3, and such that p and q share no factors, cancelling factors
when appropriate. Then, multiplying this equality by q 2 , we have p 2 = 3q 2 , showing that p is
divisible by 3. But, this means 3p 02 = 3q 2 , which in turn implies q is divisible by 3, contradicting
the fact that p and q share no factors.
Now, if some rational number r had r 2 = 12, then r /2 would be a rational number such that
(r /2)2 = 3, contradicting our previous statement.
3. Prove that in a field F , the following statements follow from the axioms of multiplication:
(a) If x , 0 and xy = xz then y = z.
(b) If x , 0 and xy = x then y = 1.
(c) If x , 0 and xy = 1, then y = 1/x.
14
1 The Real and Complex Number Systems
Proof. For (a), if x , 0, then 1/x ∈ F , so we can multiply this to both sides, yielding the result. (b)
follows by letting z = 1 in (a). (c) follows by letting z = 1/x in (a). (d) follows by letting x = 1/x
and y = 1/(1/x) in (c), noticing that 1 = 1/x · x = 1/x · 1/(1/x).
4. Let E be a non-empty subset of an ordered set; suppose α is a lower bound of E and β is an upper
bound of E. Prove that α ≤ β.
Proof. Since E is non-empty, let x ∈ E. Then, since α is a lower bound for E, in particular, α ≤ x.
Similarly, x ≤ β, so α ≤ x ≤ β, as required.
5. Let A be a non-empty set of real numbers which is bounded below, and −A be the set of all
numbers −x for x ∈ A. Prove that inf A = − sup(−A).
Proof. Since A is non-empty and bounded below, and R satisfies the least upper bound property,
α = inf A exists. We claim that −α is the least upper bound to −A.
First, notice that −α ≥ −x for all −x ∈ −A, since α ≤ x for all x ∈ A. Also, if −β < −α were a
smaller upper bound for −A, then β would be a larger lower bound for A, contradicting the fact
that α is the greatest lower bound for A.
6. Fix b > 1.
(a) If m, n, p, q ∈ Z with n, q > 0 and r = m/n = p/q, prove that (b m )1/n = (b p )1/q so it makes
sense to define b r = (b m )1/n .
(b) Prove b r +s = b r b s if r, s ∈ Q.
Proof. Write r = m/n and s = p/q for m, n, p, q ∈ Z with n, q > 0. Then, r + s = (mq + np)/nq.
Now, notice that from properties of integer powers, b mq+np = b mq b np . So,
and the result follows from the uniqueness of nq-th positive real roots.
(c) If x is real, define B(x) to be the set of all numbers b t , where t is rational and t ≤ x. Prove
that
b r = sup B(r )
when r is rational. Hence, we can define b x = sup B(x) for every real x.
Proof. We first prove a helpful lemma: if q > 0 in Q, then b q > 1. Let q = m/n for m, n ∈ N
and n , 0. Since x m and x n are monotonically increasing, then if b > 0,
15
1 The Real and Complex Number Systems
In fact, the previous statement shows that b r is indeed an upper bound for B(r ): if t ≤ r and
thus b t ∈ B(r ), then b t ≤ b r . Since r ≤ r , b r ∈ B(r ), so any upper bound for B(r ) must be at
least b r . Thus b r must be the least upper bound for B(r ), as required.
Proof. First, we want to show that b x b y is an upper bound for B(x +y). Suppose that q ≤ x +y
in Q. Then, we can write q = u + v for u ≤ x, v ≤ y in Q (from the Archimedean property,
we can choose choose u ∈ Q in the non-empty interval [q − y, x]), so that b q = b u+v = b u b v .
However, since u ≤ x and v ≤ y, b u ≤ b x and b v ≤ b y , so b q = b u b v ≤ b x b y as required.
Now, suppose r < b x b y . We need to find some q ≤ x + y in Q such that b q > r , to show that
r is not an upper bound for B(x + y).
Dividing both sides by b x yields r /b x < b y = sup B(y), so there exists some v ≤ y in Q such
that b v > r /b x so r < b x b v . Dividing this new inequality by b v yields r /b v < b x = sup B(x)
so there exists u ≤ x in Q such that b u > r /b v . Thus, r < b u b v = b u+v with q = u +v ≤ x +y,
as required.
7. Fix b > 1, y > 0, and prove that there is a unique real x such that b x = y, by completing the
following outline. (This x is called the logarithm of y to the base b.)
(a) For any positive integer n, we have b n − 1 ≥ n(b − 1).
Proof. Factor b n − 1 = (b − 1)(b n−1 +b n−2 + · · · +b + 1). Since b k > 1 for every positive integer
k (inductively), the above expression is greater than or equal to (b −1)(1+1+· · ·+1) = n(b −1),
where there are n 1’s, as required.
Proof. Since f (x) = x n is monotonically increasing, b 1/n > 1 since b > 1, so substituting b 1/n
for b in (a) yields the result.
Proof. We have
b − 1 n(b 1/n − 1)
n> ≥
t −1 t −1
so cancelling the positive n from the outermost expressions, multiplying by the positive value
t − 1, and adding 1 to both sides yields the result.
(d) If w is such that b w < y, then b w +1/n < y for sufficiently large n; to see this, apply part (c)
with t = y · b −w .
Proof. Following the prompt, we notice that letting t = y · b −w > 1 in (c), if n > (b − 1)/(y ·
b −w − 1) ∈ R, then b 1/n < y · b −w , which after multiplying both sides by the positive value
b w , gives the result.
Proof. Similarly, letting t = b w /y in (c) yields b 1/n < b w /y for n > (b − 1)/(b w /y − 1) ∈ R,
after which rearranging for y yields the result.
16
1 The Real and Complex Number Systems
(f) Let A be the set of all w such that b w < y, and show that x = sup A satisfies b x = y.
Proof. Suppose otherwise. If b x < y, then from (d), there exists some n ∈ N such that
b x +1/n < y, so x + 1/n ∈ A, contradicting the fact that x is an upper bound for A.
Similarly, if b x > y, then from (e), there exists some n ∈ N such that b x −1/n > y, and so for
any w ∈ A, we have b w < y < b x −1/n and thus w < x − 1/n from the lemma in Exercise 6(c),
making x − 1/n a smaller upper bound for A, a contradiction.
Proof. Suppose that x , y in R and b x = b y . Without loss of generality, suppose x < y. But
then b x < b y from the statement proved in Exercise 6(c), contradicting our assumption, so
we are done.
8. Prove that no order can be defined in the complex field that turns it into an ordered field. Hint:
−1 is a square.
Proof. Suppose otherwise. Then, notice that since 1 = 12 and −1 = i 2 are both non-zero squares,
they are both positive. Thus, their sum, 0, would have to be positive, a contradiction.
9. Suppose z = a + bi and w = c + di. Define z < w if a < c, and also if a = c but b < d. Prove that
this turns the set of all complex numbers into an ordered set. (This type of order relation is called
a dictionary order or lexicographic order, for obvious reasons.) Does this ordered set have
the least-upper-bound property?
Proof. First, we show that exactly one of z < w, z = w, or z > w is true. Suppose that z , w, so
that a , c or b , d. If a , c, then either a < c, in which case z < w, or a > c, in which case z > w.
Otherwise, either b < d, in which case z < w, or b > d, in which case z > w.
Next, we show that this order is transitive. Suppose a + bi < c + di and c + di < e + f i. If a < c,
then since c ≤ e, a < e and thus a + bi < e + f i. If a = c, then b < d. If further c = e, then
b < d < f and thus a + bi < e + f i. If further c < e, then a = c < e so a + bi < e + f i. In all cases,
a + bi < e + f i.
This ordered set does not have the least-upper-bound property: consider the set A = iR = {ir :
r ∈ R}, which is bounded above by 1. We claim that though A is non-empty and bounded above,
there does not exist a least upper bound. Let α be an upper bound for A. Notice that α must have
a non-negative real component. If α has a strictly positive real component, say r , then r /2 will be
smaller than r but still larger than all of A. Thus, a least upper bound will never have positive real
component. If α had real component 0, then α + 1 ∈ A would be larger than it, a contradiction.
Thus, no least upper bound for A exists.
Prove that z 2 = w if v ≥ 0 and that (z)2 = w if v ≤ 0. Conclude that every complex number (with
one exception!) has two complex square roots.
17
1 The Real and Complex Number Systems
so z 2 = u + iv when v ≥ 0. Similarly,
z 2 = (a 2 − b 2 ) − 2iab = u − i |v | = u + iv
if v < 0. Thus, if v ≥ 0, then z and −z are two complex square roots of w, and similarly ±z are
square roots of w if v < 0. The exception here is of course when z = −z, namely when z = 0 and
thus w = z 2 = 0.
11. If z is a complex number, prove that there exists an r ≥ 0 and a complex number w with |w | = 1
such that z = rw. Are w and r always uniquely determined by z?
Proof. Let r = |z| ≥ 0 and w = z/|z|. Notice that |w | = |z|/||z|| = 1, and z = rw by definition. w
and r are uniquely defined for all z ∈ C except 0, in which case r = 0 and w can be any complex
number with unit absolute value.
Suppose z , 0 and z = r 1w 1 = r 2w 2 with the required conditions. Take absolute values to get
|z| = |r 1 ||w 1 | = |r 2 ||w 2 |, which implies |r 1 | = |r 2 |. Since r 1, r 2 ≥ 0, we must have r 1 = r 2 , uniquely
determining w 1 and w 2 , as required.
|z 1 + z 2 + · · · + zn | ≤ |z 1 | + |z 2 | + · · · + |zn |.
Proof. We proceed by induction on n. The base case is given by Theorem 7(e), as applied to
complex numbers taken as ordered pairs of real numbers. Then, for n > 2, assuming the statement
is true for n − 1 inductively, we have
|1 + z| 2 + |1 − z| 2
18
1 The Real and Complex Number Systems
|1 + z| 2 + |1 − z| 2 = (1 + z)(1 + z) + (1 − z)(1 − z) = 1 + z + z + zz + 1 − z − z + zz = 4.
15. Under what conditions does equality hold in the Schwartz inequality?
Proof. Notice that in the proof, equality holds iff |Ba j − Cb j | 2 = 0, and equivalently a j = Cb j /B
Í
for each j. In particular, if the a j are each the same multiple of b j , then equality will hold.
|z − x| = |z − y| = r .
Proof. Let m = 12 (x + y). Without loss of generality, suppose y = −x, translating when
necessary. Also, let t = r 2 − d 2 /4, which is well defined since 2r > d. Then, the set of
p
vectors on the hyperplane orthogonal to the line crossing x and y with distance t to m satisfy
the required equation, and there are infinitely many.
Proof. If k = 2, then (a) will have 2 solutions instead, and everything else remains the same. If
k = 1, (a) will have no solutions.
Proof. We expand:
|x + y| 2 + |x − y| 2 = (x + y) · (x + y) + (x − y) · (x − y)
=x·x+x·y+y·x+y·y+x·x−x·y−y·x+y·y
= 2|x| 2 + 2|y| 2
The interpretation of this is that in a parallelogram, the sums of the squares of the lengths of the
diagonals is equal to the sum of the squares of the lengths of the four sides.
18. If k ≥ 2 and x ∈ Rk , prove that there exists y ∈ Rk such that y , 0 but x · y = 0. Is this also true
if k = 1?
19
1 The Real and Complex Number Systems
Proof. If x = 0, then any non-zero vector will do. If x has only one non-zero component, then
choose i such that x i = 0 and let y be the vector with yi = 1 and y j = 0 if j , i. Otherwise, if
x 12 +x 22 +...+x k2 −x i2
x = (x 1, . . . , x k ), there exists x i , 0. Then, let y j = x j if j , i, and yi = xi . Then,
k k
!
x j x j − x i2 − x i yi = 0
Õ Õ
x·y= x jyj =
j=1 j=1
|x − a| = 2|x − b|
iff |x − c| = r .
1
Proof. This is an example of Circles of Apollonius. c = 3 (4b − a), r = 23 |b − a|.
20. With reference to the Appendix, suppose that property (III) were omitted from the definition of a
cut. Keep the same definitions of order and addition. Show that the resulting ordered set has the
least-upper-bound property, that addition satisfies axioms (A1) to (A4) (with a slightly different
zero-element!) but that (A5) fails.
Proof. TODO
20
2 Basic Topology
An interesting property of Z is thus that it is equivalent to one of its proper subsets. This is never true
of finite sets, so in fact a set is infinite if it is equivalent to one of its proper subsets.
21
2 Basic Topology
Definition 10. A sequence is a function f defined on the set J of positive integers. If f (n) = x n for n ∈ J ,
then we denote f by the symbol {x n } or sometimes x 1, x 2, . . . . The values x n of f are called the terms
of the sequence. If A is a set and x n ∈ A for each n ∈ J , then {x n } is a sequence in A, or a sequence of
elements in A.
Theorem 8. Every infinite subset of a countable set A is countable.
Proof. Let {x n } be a sequence of distinct elements in A. Then, for any infinite subset E of A, we can
take the subsequence {x nk } of {x n } corresponding to E, so E is countable.
The above theorem shows that countable sets are the ‘smallest infinity’ in some sense.
Definition 11. Let A and Ω be sets, and suppose that for each element α ∈ A, there is associated a subset
of Ω which we denote by E α . We call the set whose elements are the sets E α a collection or family of sets,
denoted {E α }.
The union of the sets E α is the set S such that x ∈ S iff x ∈ E α for at least one α ∈ A, denoted
Ø
S= Eα .
α ∈A
The intersection of the sts E α is the set P such that x ∈ P iff x ∈ E α for every α ∈ A, denoted
Ù
P= Eα .
α ∈A
If A ∩ B is not empty, we say that A and B intersect, otherwise they are disjoint.
The operations ∪ and ∩ are commutative and associative, but they also distribute:
Theorem 9. For sets A, B, C, we have
E = A ∩ (B ∪ C) = (A ∩ B) ∪ (A ∩ C) = F .
which enumerates S. If any two of the sets En have elements in common, these will appear more
than once in S. So the sequence is at most countable, but since E 1 ⊆ S, it is at least countable, so it is
countable.
Corollary 1. Suppose A is at most countable, and for every α ∈ A, B α is at most countable. Then,
T = α ∈A B α is at most countable.
Ð
22
2 Basic Topology
Theorem 11. Let A be a countable set, and Bn be the set of all n-tuples (a 1, . . . , an ) in A, where the ak
need not be distinct. Then Bn is countable.
Proof. B 1 = A is countable. Suppose that Bn−1 is countable for n > 1. Then, the elements of Bn are of
the form (b, a) for b ∈ Bn−1 and a ∈ A, so the set of all pairs (b, a) ∼ A and is thus countable. Then, Bn is
the countable union of countable sets, and is thus countable. The result follows by induction on n.
Proof. Associate with each r = m/n ∈ Q the pair (m, n) of integers. The set of pairs (m, n) is countable,
and thus Q is countable.
Proof. Suppose otherwise, so that A is countable, and thus admits a sequence {sn }. Then, construct a
sequence t ∈ A such that t j = 1 − s j j so t j , s j j for all j ∈ J . Then, t < {sn } since it differs from each
sequence at at least one point. Thus, t < A, a contradiction.
Notice the above theorem can be combined with the binary representation of real numbers to show
that the real numbers are uncountable.
23
2 Basic Topology
(b) A point p is a limit point of the set E if every neighbourhood of p contains a point q , p such that
q ∈ E. (That is, points in E get arbitrarily close to p.)
(c) If p ∈ E is not a limit point, it is called an isolated point.
(d) A set E is closed iff every limit point of E is in E.
(e) A point p is an interior point of E if there is a neighbourhood N of p such that N ⊆ E.
(f) E is open if every point in E is an interior point of E.
(g) The complement of E, denoted E c , is the set of all points in X but not in E.
(h) E is perfect if E is closed and every point of E is a limit point of E.
(i) E is bounded if there exists a real number M and a point q ∈ X such that d(p, q) < M for all p ∈ E.
(j) E is dense in X if every point in X is a limit point of E, or a point of E (or both). (So points in E get
arbitrarily close to any point in X ).
Theorem 13. Every neighbourhood is an open set.
Proof. Let E = Nr (p) be a neighbourhood, and q be any point in E. Then, d(p, q) < r so d(p, q) = r − h
for some positive h. Then for any s ∈ Nh (q), we have
Theorem 14. If p is a limit point of E, then every neighbourhood of p contains infinitely many points of E.
Proof. Suppose not, so that some neighbourhood Nr (p) contains finitely many points in E. Then, there
exists some closest point q , p in that neighbourhood with d(q, p) = h > 0. Then, the neighbourhood
Nh/2 (p) contains no points in E which are not p, contradicting the definition of a limit point.
Proof. If p were a limit point in a finite point set E, then some neighbourhood must have infinitely
many points in E, a contradiction.
Proof. Let x ∈ A. Since it is not in E α , x is not in any of the E α . Thus, it is in each of the E αc , so
Ð
α
x ∈ B.
Let y ∈ B. By definition, y is in E αc , so it is not in any of the E α . Thus, y < E α so y ∈ A, completing
Ð
α
the proof.
24
2 Basic Topology
Proof. Suppose E c is not closed. Then, there exists a limit point of E c , say x which is not in E c (and
thus in E). However this means that there are points in E c in neighbourhoods arbitrarily close to x: so
no neighbourhood around x will be a subset of E, so E is not open, proving the contrapositive of the
forward statement.
Now suppose E is not open, so there exists some x ∈ E for which no neighbourhood is entirely contained
within E. Thus, every neighbourhood of x contains some point in E c , so x is a limit point of E c not in E c ,
so E c is not closed, proving the contrapositive of the backwards statement, completing the proof.
Proof. For (a), let x ∈ A be arbitrary. Then, x ∈ G α for some open set G α , so Nr (x) ⊆ G α ⊆ A.
Thus, x is interior in A and A is open. For (b), notice that {F αc } is a collection of open sets, so by (a),
Ð c c
α F α = ( α F α ) is open, so B = α F α is closed.
Ñ Ñ
For (c), let x ∈ C. Then, there are neighbourhoods Nr i (x) ⊆ G i for some positive r i . Then, taking
r = minni=1 r i , we have Nr (x) ⊆ Nr i (x) ⊆ G i for each i ∈ Jn , so Nr (x) ⊆ C so x is interior to C, and thus
C is open. Statement (d) reduces to (c) with reasoning similar to (b).
Notice that in parts (c) and (d) in the previous theorem, finiteness of the collections is essential. Otherwise,
the minimum of the radii of your neighbourhoods could be 0, and no neighbourhood would exist. For
instance, take G n = (−1/n, 1/n).
Definition 15. If X is a metric space, and E ⊆ X and E 0 is the set of all limit points of E in X , then the
closure of E is the set E = E ∪ E 0.
Theorem 18. If X is a metric space with E ⊆ X , then
(a) E is closed.
(b) E = E iff E is closed.
(c) E ⊆ F for every closed set F ⊆ X with E ⊆ F .
Notice that (a) and (c) imply E is the smallest closed subset of X containing E, so the name ‘closure’ is
appropriate.
Proof. For (a), if p ∈ X and p < E, then p is neither a point of E nor a limit point of E. Hence, p has a
c
neighbourhood which does not intersect E, so E is open and E is closed.
For (b), if E is closed, then E 0 ⊆ E so E = E. If E = E, then (a) implies E is closed.
For (c), let F ⊆ X be a closed set such that E ⊆ F . Let x ∈ E. If x ∈ E, then clearly x ∈ F . If x ∈ E 0 but
not in F , then x would be a limit point of F ⊃ E not in F , contradicting the fact that F is closed.
Theorem 19. Let E be a non-empty set of real numbers which is bounded above. Let y = sup E, then
y ∈ E, hence y ∈ E if E is closed.
25
2 Basic Topology
Proof. Suppose that y were not in E, so it is neither a point in E nor a limit point of E, so there exists a
neighbourhood (y − ϵ, y + ϵ) which does not contain any elements in E. But then y − ϵ/2 would be a
smaller upper bound for E than y, contradicting the minimality of y, proving the first statement.
Then, if E is closed, y = sup E ∈ E = E.
Remark. Suppose E ⊆ Y ⊆ X where X is a metric space. We say E is an open subset of X when every
point p ∈ E has associated to it a positive real number r such that d(p, q) < r implies q ∈ E. But we know
Y is also a metric space under the same distance function, so we can extend our definitions to Y .
We say E is open relative to Y when to each p ∈ E, there exists a positive r such that q ∈ E whenever
d(p, q) < r and q ∈ Y .
Theorem 20. Suppose Y ⊆ X . Then a subset E ⊆ Y is open relative to Y iff E = Y ∩ G for some open
subset G of X .
Proof. Suppose E ⊆ Y is open relative to Y . Then, for every p ∈ E, we associate some positive rp such
that Nrp (p) ∩ Y ⊆ E. Let G = p ∈E Nrp (p) ⊆ X . Since each Nrp (p) is open, the infinite union G is open.
Ð
Then, clearly E ⊆ Y ∩ G. By our choice of Nrp (p), Nrp (p) ∩ Y ⊆ E for each p ∈ E, so G ∩ Y ⊆ E. Thus,
E = G ∩ Y.
Conversely, if G is open in X and E = G ∩ Y , then every p ∈ E has a neighbourhood Nrp (p) ⊆ G, so
Nrp (p) ∩ Y ⊆ E and thus E is open relative to Y .
Definition 17. A subset K of a metric space X is said to be compact if every open cover of K contains a
finite subcover.
Formally, if {G α } is an open cover of K, then there exist indices α 1, . . . , α n such that K ⊆ G α 1 ∪ · · · ∪ G α n .
Clearly every finite set is compact.
Theorem 21. Suppose K ⊆ Y ⊆ X . Then K is compact relative to X iff K is compact relative to Y .
Proof. Suppose K is compact relative to Y , and we have some open cover {G α } in X . Then, letting
H α = Y ∩ G α , {H α } is still an open cover of K since K ⊆ Y . In fact, each H α is open relative to Y from
Theorem 20, and a subset of Y , so {H α } is an open cover of K in Y . Since K is compact in Y , there exists
some finite subcover {H α k }kn=1 of K. Then, G ⊆ nk =1 H α i ⊆ nk=1 G α i so {G α k }kn=1 is a finite subcover
Ð Ð
of K.
Now, suppose K is compact relative to X , and we have some open cover {H α } in Y . Since each H α is
open relative to Y , we can write H α = G α ∩ Y for some open subset G α of X . Then, {G α } is an open
cover of K in X and thus a finite subcover {G α k }kn=1 exists. Then {H α k }kn=1 is a finite subcover of K in
Y , so K is compact relative to Y , completing the proof.
Proof. Let K be compact in X . Let x ∈ E c . For each y ∈ E, let Gy be some neighbourhood of y with radius
less than 12 d(x, y). Then, {Gy } forms an open cover of K. Since K is compact, there exist y1, . . . , yn such
that Gy1 , . . . , Gyn forms a finite subcover of K.
26
2 Basic Topology
Then, there exists a closest point y ∗ within y1, . . . , yn to x, with distance r from x. Then, there are no
points in E closer than r /2 to x, so E c contains a neighbourhood of x completely contained in E c so E c
is open and thus E is closed.
Proof. Let F ⊆ K ⊆ X be closed relative to X and K compact. Then, let {G α } be an open cover of F ,
and append the open set F c to make it an open cover of G ⊆ X . Then, since K is compact, there exists
n
some finite subcover {G α k }k=1 of K. Then the finite subcover without F c is also a finite subcover of F ,
so F is compact.
Proof. F ∩ K ⊆ K is closed as the finite intersection of closed sets, and thus compact by the previous
theorem.
Theorem 24. If {K α } is a collection of compact subsets of a metric space X such that the intersection of
every finite subcollection of {K α } is non-empty, then K α is non-empty.
Ñ
Proof. Let K be some element of {K α }. Suppose, for the sake of contradiction, that there are no points
in K which are in every K α . That is, for every x ∈ K, there is some K αc such that x ∈ K αc . So K ⊆ α K αc ,
Ð
so {K αc } forms an open cover of K, so since K is compact, there exists a finite subcover {K αc k }k=1
n of K.
But then K ∩ K α 1 ∩ . . . ∩ K α n = ∅, contrary to our assumption.
Corollary 6 (Cantor’s intersection theorem). If {Kn } is a sequence of non-empty compact sets such that
Kn ⊇ Kn+1 , then n=1 Kn is non-empty.
Ñ∞
Proof. Notice that Kk1 ∩ · · · ∩ Kkn = Kk where k = max(k 1, . . . , kn ), so any finite intersection is
non-empty. Thus, the result follows by applying the above theorem.
Theorem 25. If E is an infinite subset of a compact set K, then E has a limit point in K.
Proof. Suppose that E has no limit points in K. Then, every x ∈ K will have some sufficiently small
neighbourhood N x which contains at most one point in E. However, no finite subcollection of {N x }
can cover E and thus K, contradicting the fact that K is compact.
Theorem 26 (Nested intervals). If {In } is a sequence of intervals in R such that In ⊇ In+1 , then n=1 I n
Ñ∞
is not empty.
Proof. Let In = [an , bn ], and A be the set of all an which is non-empty and bounded above, say by b1 .
Then, let α = sup A. For any m, n, we have an ≤ an+m ≤ bn+m ≤ bm , so x ≤ bm for any bm . Thus,
x ∈ Im for all m, and thus n=1 In is non-empty.
Ñ∞
Theorem 27. Let k be a positive integer. If {In } is a sequence of k-cells such that In ⊇ In+1 for all n, then
n=1 I n is non-empty.
Ñ∞
Proof. Taking the intervals corresponding to each dimension, we can invoke the previous theorem to
find values x i in the intersections of each of the intervals, so x = (x 1, . . . , x k ) ∈ n=1 In .
Ñ∞
27
2 Basic Topology
Proof. The fact that (b) implies (c) is precisely Theorem 25.
To show that (c) implies (a), suppose that every infinite subset of E has a limit point in E, but E is not
closed and bounded.
• If E is not closed, there exists a limit point x of E which is not contained in E. We construct an
infinite subset of E which has only x as a limit point. Let x 1 be an arbitrary point in N 1 (x) ∩ E and
r 1 = d(x, x 1 ). Now, let x n be a point in E which is closer than min(1/n, r n−1 /2) to x. This must
exist since x is a limit point. This defines a sequence {x n }. Let X be the infinite set of these points.
Then clearly x is a limit point of X . However, we still have to show that it is the only limit point
of X . Let y , x be another limit point. Since d(x, y) > 0 and the r n are monotonically decreasing,
there exists some n such that r n ≤ d(x, y) ≤ r n+1 . taking r 0 = d(x, y) and r < min(r 0 −r n , r 0 −r n+1 ),
the neighbourhood with radius r around y has no points in X and is thus not a limit point of X .
• Similarly, if E is not bounded, let w ∈ E be arbitrary. Then we can create a sequence {x n } such
that d(w, x n ) > n and x n ∈ E. Then, if y ∈ E, then there are only finitely many x n such that
d(w, x n ) < d(w, y) + 1, so there are not infinitely many x n in N 1 (y) ∩ E, so y is not a limit point.
Thus, X has no limit points in E.
Finally, to show (a) implies (b), suppose E is closed and bounded. Since E is bounded, we can surround
it with a sufficiently large k-cell I . That is, E ∩ I = E is compact, as the intersection of a closed set and a
compact set.
Remark. Notice that the above theorem was specific to Rk . One may ask how much of the theorem extends
to general metric spaces. In general, (b) and (c) are equivalent in any metric space, but (a) does not imply
(b) and (c).
Theorem 30 ((Weierstrass)). Every bounded infinite subset of Rk has a limit point in Rk .
Proof. Let E ⊆ Rk be bounded. Then, E ⊆ I for some k-cell I . Since I is compact, E is an infinite subset
of I and thus has a limit point in I ⊆ Rk from the above theorem.
28
2 Basic Topology
Proof. Suppose P is non-empty but not uncountable. Since P has limit points, it must be infinite, and
thus countable. Enumerate the points of P as x 1, x 2, . . . .
Let V1 be any neighbourhood of x 1 . Suppose we have Vn such that Vn ∩ P is non-empty. Since every
point of P is a limit point of P, there exists some neighbourhood Vn+1 such that Vn+1 ⊆ Vn , x n < Vn+1 ,
and Vn+1 ∩ P is non-empty.
Put Kn = Vn ∩ P. Notice that Vn is closed and bounded and thus compact. The Kn are thus compact
as closed subsets of Vn . Notice Kn form a decreasing sequence of compact sets, each of which are
non-empty by construction, so n=1 must be non-empty. In particular, since K 1 ⊆ P, there must be
Ñ∞
some x k in this intersection. However, the construction forbids this.
Corollary 7. Every interval [a, b] with a < b is uncountable. In particular, the set of all real numbers is
uncountable.
Proof. Every interval is perfect and thus uncountable. The set of real numbers is larger.
Are combinations of intervals the only perfect sets? No: we construct a perfect set in R with no segment,
the Cantor set.
Let E 0 = [0, 1]. Remove the segment (1/3, 2/3) so that E 1 = [0, 1/3] ∪ [2/3, 1]. Removing the middle
thirds indefinitely yields a decreasing sequence of compact sets, whose infinite intersection is non-empty.
Call this set the Cantor set.
The Cantor set contains no segment, since every segment of the form
3k + 1 3k + 2
, m
3m 3
has a point in common with P. It can be shown that every segment (α, β) shares a segment of the above
form, so the Cantor set contains no segment.
To show that P is perfect, let x ∈ P and let S be any segment containing x. Let In be an interval chosen
such that n is so large that In ⊆ S. Then, let either endpoint is in P, so x is a limit point and P is perfect.
Proof. Suppose otherwise, so that for some x, y ∈ E, there exists some z ∈ R such that x < z < y but
z < E. Then we claim A = E ∩ (−∞, z) and B = E ∩ (z, ∞) form a separating pair for E.
Clearly A ∪ B = E. Also, clearly A and B are non-empty, since they contain x and y respectively. Now,
notice that everything in A is ≤ z, and everything in B is > z. Thus, they are disjoint. A similar argument
holds for the other pair. Thus, A and B form a separating pair for E, contradicting the fact that E is
connected.
29
2 Basic Topology
Conversely, suppose that E is not connected, and in particular admits the separation (A, B). Then, let
x ∈ A and y ∈ B, and suppose without loss of generality that x < y. Let z = sup(A ∩ [x, y]) ∈ A. Since
A and B are separated, z < B.
If z < A, then x < z < y but z < A ∪ B = E. Otherwise, if z ∈ A, then it mustn’t be in B, so there exists
z 1 ∈ (z, y) such that z 1 < B. Then x < z 1 < y with z 1 < E. In either case, the contrapositive holds,
completing the proof.
2.6 Exercises
1. Prove that the empty set is a subset of every set.
2. A complex number z is said to be algebraic if there are integers a 0, . . . , an not all zero such that
a 0z n + . . . + an z 0 = 0.
Prove that the set of all algebraic numbers is countable. Hint: For every positive integer N , there
are only finitely many equations with n + |a 0 | + . . . + |an | = N .
Proof. For any n, let Sn be the set of tuples (k, a 0, a 1, . . . , ak ) such that k + |a 0 | + . . . + |ak | = n.
Notice that this set is finite, and that each corresponding equation can only have finitely many
roots. Thus, the set of algebraic numbers An whose polynomials map to a given Sn is finite. Then,
A = i=1 Ai is countable.
Ð∞
3. Prove that there exist real numbers which are not algebraic.
Proof. If every real number were algebraic, there would be countably many of them, contradicting
the uncountability of the reals.
Proof. If the set of irrational real numbers were countable, then R = Q ∪ (R \ Q) would be
countable, contradicting the uncountability of the reals.
5. Construct a bounded set of real numbers with exactly three limit points.
Proof. Let A = {k + 1/n : n ∈ N, k ∈ {0, 1, 2}}. We claim that the limit points of A are precisely
0, 1, and 2. Clearly, all three are limit points (for any r > 0, take 1/n < r by the Archimedean
property of R; then k + 1/n is within r of k).
Let y < {0, 1, 2}. Consider k ∈ {0, 1, 2}. If y < k or y > k + 1, then taking some radius smaller than
the distance to [k, k +1] fails. Otherwise, we have y −k ∈ [1/(n +1), 1/(n −1)] for some n. Taking a
radius smaller than the distance to the nearest of the three points k +1/(n +1), k +1/n, k +1/(n −1)
fails.
6. Let E 0 be the set of all limit points of a set E. Prove that E 0 is closed. Prove that E and E have the
same limit points. (Recall that E = E ∪ E 0.) Do E and E 0 always have the same limit points?
7. Let A1, A2, . . . be subsets of a metric space.
(a) If Bn = ni=1 Ai , prove that Bn = ni=1 Ai .
Ð Ð
30
2 Basic Topology
(b) If B = i=1 Ai , prove that B ⊇ i=1 Ai . Show, by an example, that this inclusion can be
Ð∞ Ð∞
proper.
8. Is every point of every open set E ⊆ R2 a limit point of E? Answer the same question for closed
sets in R2 .
Proof. For open sets, yes. Every interior point has a neighbourhood completely contained within
E, so every smaller neighbourhood will contain (infinitely many) points in E. The same is not
true in general for closed sets. For example, consider {(0, 1/n) : n ∈ N} ∪ {(0, 0)}, in which 1 is
not a limit point.
9. Let E ◦ denote the set of all interior points of a set E, called the interior of E.
(a) Prove that E ◦ is always open.
31
2 Basic Topology
Proof. No: let A = {1/n : n ∈ N} ∪ {0} be a subset of R and let E = Ac . Then, E = R so its
interior is R. However, 0 is not an interior point of E: every neighbourhood contains at least
one point 1/n not in E.
10. Let X be an infinite set. For p, q ∈ X , define d(p, q) = 1 − δp,q . Prove that this is a metric. Which
subsets of the resulting metric space are open? Which are closed? Which are compact?
Proof. By definition, d(p, q) ≥ 0, with equality iff p = q. Now let p, q, r ∈ X . Then if p , r , then
d(p, r ) = 1, while d(p, q) + d(q, r ) is at least 1, since at least one of the terms must be non-zero. If
p = r , then either q = r = p in which case equality holds, or q , r , in which case 0 ≤ 2. Thus d is
a metric.
Notice that for any E ⊆ X , a neighbourhood of size 1/2 around any point x ∈ E consists only of
the point x (any other point has d(x, y) = 1 > 1/2). Thus every point in any set E is an interior
point, so every set is open. Similarly, considering any set F ⊆ X , F c is open, so F is closed.
Now, notice that finite sets are always compact. Infinite sets can be covered by singletons, but
thus admit no finite subcover. Thus, a set in X is compact iff it is finite.
d 1 (x, y) = (x − y)2
d 2 (x, y) = |x − y|
p
d 3 (x, y) = |x 2 − y 2 |
d 4 (x, y) = |x − 2y|
|x − y|
d 5 (x, y) = .
1 + |x − y|
12. Let K = {1/n : n ∈ N} ∪ {0} ⊆ R. Prove that K is compact without invoking Heine-Borel.
Proof. Let {G α } be any open cover of K. Then, 0 ∈ K is in some G α , and since G α is open, there
must exist some neighbourhood Nr (0) ⊆ G α . Thus, if n > 1/r , 1/n ∈ G α . Taking this, and one
open set for each of the remaining 1/n yields a finite cover for K.
32
2 Basic Topology
13. Construct a compact set of real numbers whose limits form a countable set.
Proof. Let A = {(1 + 2−m )/n : m, n ∈ N}. Notice that {1/n : n ∈ N} ∪ {0} ⊆ A0. We claim these
1
are the only limit points. Let y ∈ [0, 2] not in the above set of limit points. Then, n+1 < y < n1
for some n. Now for every k > n in N, mk1+1 < y − k1 ≤ m1k for some mk ∈ N. Notice that as k
h i
increases, mk decreases, so the interval mk1+1 , m1k only gets larger. Thus, there are no elements
in
1 1 1 1
A∩ + , + ,
n + 1 mn+1 + 1 n + 1 mn+1
a neighbourhood of y by construction. Thus y is not a limit point, and thus A has countably many
limit points.
14. Give an example of an open cover of the segment (0, 1) which has no finite subcover.
Proof. Let G n = (1/(n + 2), 1/n) for n ∈ N. This is a cover since for any r ∈ (0, 1), there exists
n such that 1/(n + 2) < r < 1/n so r ∈ G n . However, notice that 1/(n + 1) is only in G n , so
every subcover of (0, 1) would have to contain G n for every n ∈ N, forcing it to be countable, as
required.
15. Show that Cantor’s intersection theorem becomes false if the condition of compactness is weak-
ened to either solely closed or bounded.
Proof. Suppose the restriction is weakened to closed sets. Then let Kr = {x ∈ R : x ≤ r }, each of
which is closed. Also,
Ka1 ∩ . . . ∩ Kan = K min(a1 ,...,an ) , ∅
but r ∈R Kr = ∅.
Ñ
but k ∈N F k = ∅.
Ñ
16. Regard Q as a metric space with d(p, q) = |p − q|. Let E be the set of all p ∈ Q with 2 < p 2 < 3.
Show that E is closed and bounded in Q but not compact. Is E open in Q?
Proof. E is bounded below and above by 1 and 2 respectively. Also, E is closed since Q is dense in
R. To show E is not compact, we employ a similar construction as in Exercise 14. E is open in Q,
since a neighbourhood sufficiently small can be chosen around every point, completely contained
in E.
17. Let E be the set of all x ∈ [0, 1] whose decimal expansion contains only the digits 4 and 7. Is E
countable? Is E dense in [0, 1]? Is E compact? Perfect?
Proof. E is uncountable: there is a bijection between E and the real numbers between 0 and 1,
simply by replacing 4’s with 0’s and 7’s with 1’s and interpreting the resulting string as a binary
expansion. E is not dense in [0, 1] since there are no elements of E in (0.5, 0.6), as the first digit
will always be 5. E is clearly bounded.
Suppose that x contained a digit other than 4 or 7. That is, suppose x = 0.x 1x 2x 3 . . . x k . . . with
x k < {4, 7}. Then, choosing a neighbourhood of size 2 · 10−(k+1) assures that either x k or x k +1
33
2 Basic Topology
will not be either a 4 or a 7. Thus, the neighbourhood will contain no elements in E. Thus, the
contrapositive states that every limit point of E must be in E, so E is closed and thus compact.
Let x ∈ E 0. We can find elements x n in E arbitrarily close to x by switching the n-th digit between
a 4 and a 7, acquiring elements in E which are a distance 3·10−n from x. Thus, E is also perfect.
Proof. Yes: enumerate the rational numbers between 0 and 1 as qn . Then, let A be the set of real
numbers between 0 and 1 such that the n-th ternary digit of any x ∈ A is chosen to be different
from the n-th ternary digit of qn . In this way, A contains no rational numbers.
A is non-empty: this must be the case if [0, 1] is to be uncountable. A is closed: suppose x is a
limit point of A so that we can find elements of A arbitrarily close to x. In particular, we can find
elements of A whose fist n ternary digits match. If x were not in P, then there would be some n
for which x n = (qn )n , but we can find some element of P which matches x to more than n digits,
a contradiction. Finally, A is perfect since we can always generate close elements in P to any x by
choosing the other possibility for any one of the digits.
Proof. Since A and B are closed, A = A and B = B, so (A, B) and (A, B) are both disjoint sets
(they are both (A, B)).
Proof. Suppose otherwise, and without loss of generality, suppose x ∈ A ∩ B. Then, some
neighbourhood of x is completely contained in B. Also, there exists some element of A in
this neighbourhood, so there exists y ∈ A ∩ B, contradicting the disjointness of A and B.
c) Fix p ∈ X , δ > 0, and define A = {q ∈ X : d(p, q) < δ }. Define B similarly, with > instead of
<. Show A and B are separated.
d) Prove that every connected metric space with at least two points is uncountable. Hint: Use
(c).
Proof. Suppose a, b are distinct elements of a connected metric space X . Let δ = d(α, β) > 0.
Then, for every r ∈ (0, δ ), let Ar and Br be the sets of elements of X with distance less than,
or greater than respectively, r . Then, there must be an element zr in X with d(zr , a) = r ,
otherwise Ar and Br form a separation of X , as in (c). Thus, there exists an injection from
the uncountable set (0, δ ) to X so X is uncountable.
20. Are closures and interiors of connected sets always connected? (Look at subsets of R2 .)
34
2 Basic Topology
Proof. Closures of connected sets are connected. Suppose E is connected but E is not. Then, E
admits a separation (A, B). Then, (A ∩ E, B ∩ E) forms a separation of E, a contradiction. However,
this is not the same for interiors: take A = N 1 (−2) ∪ N 1 (2) ∪ L(−2, 2) ⊆ C where L(a, b) is the line
segment between a and b. Then A◦ = N 1 (−2) ∪ N 1 (2) which is disconnected.
21. Let A, B be separated subsets of Rk . Suppose a ∈ A, b ∈ B, and define p(t) = (1 − t)a + tb for
t ∈ R. Put A0 = p −1 (A) and B 0 = p −1 (B).
a) Prove that A0 and B 0 are separated subsets of R1 .
Proof. Notice firstly that both sets are non-empty as they contain 0 and 1 respectively. Now
suppose, for the sake of contradiction that t ∈ A0 ∩ B 0 , so that there are t 0 arbitrarily close
to t in A0 , and also t ∈ B 0 . This implies there are p(t 0 ) arbitrarily close to p(t) in A and also
p(t) ∈ B, so p(t) ∈ A ∩ B, contradicting the fact that A, B are separated.
Proof. Suppose otherwise, so that A0 ∪ B 0 = [0, 1]. Let β = inf B 0 ∈ B 0 . We know β < A0
since A0 ∩ B 0 = ∅. However, since A0 ∪ B 0 = [0, 1], there are points in A0 arbitrarily close
to β. This implies β ∈ A0 so β < B 0 . This implies β < A0 ∪ B 0 = [0, 1] which is impossible,
forming our desired contradiction.
Proof. If a subset E were not connected, then the above construction shows that there exists
an incomplete line in E, so it fails to be convex.
22. A metric space is called separable if it contains a countable dense subset. Show that Rk is
separable. Hint: Consider the set of points which have only rational coordinates.
Proof. Let Qk be the set of k-tuples of rational numbers. Notice that in every open ball, we can
completely enclose a k-cell. This k-cell is determined by k intervals in R, each of which contain
a rational number since Q is dense. Taking these as coordinates yields an element of Qk in the
original neighbourhood. Now Qk is countable by an earlier theorem, so Rk is separable.
23. A collection {Vα } of open subsets of X is said to be a base for X if the following is true: For every
x ∈ X and every open set G ⊆ X containing x, we have x ∈ Vα ⊆ G for some α. In other words,
every open set in X is the union of a subcollection of {Vα }.
Prove that every separable metric space has a countable base. Hint: take all neighbourhoods with
rational radius and center in some countable dense subset of X .
Proof. As the hint suggests, take {Vα } = {Nr (x) : x ∈ D, r ∈ Q}, for some countable dense subset
D of X . Then, let G be an open set. Then, G is the union of all the elements of {Vα } which
are subsets of G. This works since R satisfies the least-upper-bound property, so a sequence of
rationals can approximate any radius arbitrarily closely). Also, {Vα } is countable since there
exists a surjection from it to D × Q which is countable.
24. Let X be a metric space in which every infinite subset has a limit point. Prove that X is separable.
35
2 Basic Topology
Proof. Fix δ > 0 and pick x 1 ∈ X . Then, choose x j+1 ∈ X inductively such that d(x i , x j+1 ) ≥ δ for
i = 1, . . . , j, if possible. This process must terminate after a finite number of steps, otherwise the
set A = {x n : n ∈ N} is an infinite set without a limit point. Then, X can be covered by finitely
many neighbourhoods of size δ .
Take δ = 1/n, n ∈ N. Then, let K be the union of all the sets of centers (x i ) in the above
construction, for each δ = 1/n. We claim K is dense. Let x ∈ X and r > 0. We aim to find some
y ∈ Nr (x) ∩ Y . Let n ∈ N such that 1/n < r . If no elements of K are in N 1/n (x) ⊆ Nr (x), then the
above construction for x i could have proceeded one more step, contradicting our construction.
Thus K is a countable dense set in X , so X is separable.
25. Prove that every compact metric space K has a countable base, and that K is therefore separable.
Hint: For every positive integer n, there are finitely many neighbourhoods of radius 1/n whose
union covers K.
Proof. For n ∈ N, consider the cover {N 1/n (x) : x ∈ K }. Since K is compact, some finite subset
of these neighbourhoods must cover K. From the same conclusion as in Exercise 24, the set G
of all the centres of the neighbourhoods for n ∈ N forms a countable dense set in K, so K is
separable.
26. Let X be a metric space in which every infinite subset has a limit point. Prove that X is compact.
Proof. By Exercise 24, X is separable. By Exercise 23, X has a countable base, say {Vn }. Then
every open cover of X has a countable subcover {G n } ⊆ {Vn }. Suppose that there exists no finite
subcover of {G n } which covers X . Then, the complement Fn = (G 1 ∪ . . . ∪ G n )c is non-empty for
each n but Fn is empty. Let E be a set which contains a point from each Fn , and let x ∈ X be a
Ñ
limit point of E, which exists by presumption. But then x < G n for any G n , a contradiction.
Proof. We follow the construction of W outlined in the hint. Let x ∈ W c . Then x is not in any
Vn for which E ∩ Vn is at most countable. In fact, this means that if x ∈ Vn , then E ∩ Vn must be
uncountable. Now consider Nr (x). It is an open set, so it can be written as the union of some
subcollection Vn1 ∪ . . . ∪Vnk of {Vn }, of which each Vni ∩ E is uncountable. So Nr (x) ∩ E ⊇ Vn1 ∩ E
is an uncountable set, so x ∈ P.
Now, let y ∈ W , so y ∈ Vn for some n such that E ∩ Vn is at most countable. Then, Vn contains
some neighbourhood of y which contains at most countably many points in E, so y < P. Thus
W ⊆ P c and so P ⊆ W c , so we conclude P = W c .
The fact that P c ∩ E = W ∩ E is at most countable follows by construction. Now we claim W c
is perfect. The fact that W c is closed follows directly from the fact that it is the complement
of a countable union of open sets. It suffices to show that each point of W c is a limit point of
W c . Let x ∈ W c . If some neighbourhood of x did not contain any points in W c , then there must
be only points in W ∩ E. However, there are at most countably many of these points, and any
neighbourhood of x is uncountable. Thus the result follows.
36
2 Basic Topology
28. Prove that every closed set in a separable metric space is the union of a (possibly empty) perfect
set and a set which is most countable. (Corollary: Every countable closed set in Rk has isolated
points.) Hint: Exercise 27.
Proof. If F is at most countable, we are done. Otherwise, if F is uncountable, the proof from
Exercise 27 extends to F ⊆ X , so if P is the set of condensation points of F , then F = P ∪ (F \ P)
with F \ P at most countable, as required.
For the corollary, let E be a countable closed set in the separable metric space Rk . If it had no
isolated points, it would be perfect and thus uncountable, a contradiction.
29. Prove that every open set in R is the union of an at most countable collection of disjoint segments.
Hint: Exercise 22.
Proof. Let A be open. For every x, let I x = (a, b) be the largest segment in A containing x. That
is, let a = inf {z : (z, x) ∈ A}, b = sup{y : (x, y) ⊆ A} in the extended reals R. Then, a, b < A
otherwise a larger interval exists (since A is open). Also, every c ∈ (a, b) must be in A as otherwise
if x ≤ c, there would exist some x ≤ c < y < b for which (x, y) * A and similarly for the other
case.
Now, if (a, b) and (c, d) share at least one point, then c < b and d > a. Since a, c < A, we must
have a ≥ c and c ≥ a, so a = c. Similarly b = d so the intervals are the same. That is, if two
intervals share any point, they must be the same interval. Now, every interval I x contains at
least one rational since Q is dense, so there exists an injection from our set of open sets to the
countable set of rational numbers. Thus, our set is countable, as required.
Equivalently: If G n are dense open subsets of Rk , then n=1 G n is not empty (in fact, it is dense
Ñ∞
k
in R ).
(This is a special case of Baire’s theorem; see Exercise 3.22 for the general case).
"n−1 # n
Ù Ù
Vn ⊆ Vn−1 ∩ G n ⊆ Vn−1 ∩ G n ⊆ Gk ∩ Gn = Gk
k =1 k=1
so any element in n=1 Vn is also in k∞=1 G k , so the latter is non-empty, proving the given
Ñ∞ Ñ
equivalent statement.
37
3 Numerical Sequences and Series
Proof. For (a), suppose that pn → p. Then every neighbourhood of size ϵ > 0 must contain all pn for
some n ≥ N by definition. Now suppose every neighbourhood of p contains all but finitely many pn .
Then for any ϵ > 0, the neighbourhood of size ϵ contains all but finitely many pn , so there exists N
such that n ≥ N implies pn ∈ Nϵ (x).
For (b), if p , p 0, then d(p, p 0) = r > 0. Selecting ϵ < r /2 will force all but finitely many pn lie in
Nr /2 (p) ∩ Nr /2 (p 0) = ∅, a contradiction.
For (c), let N such that n ≥ N implies d(p, pn ) < 1. Then, d(p, pn ) ≤ max{d(p, p1 ), . . . , d(p, p N −1 ), 1}.
For (d), take pn to be some element of E in N 1/n (p), which exists since p is a limit point of E. This
converges to p, given N > 1/ϵ.
The following theorem states we can combine convergent sequences the way we expect.
Theorem 34. Suppose {sn }, {tn } are complex sequences and sn → s, tn → t.
(a) (sn + tn ) → s + t.
(b) csn → cs, c + sn → c + s.
(c) sn tn → st.
(d) 1/sn → 1/s given sn , s , 0.
Proof. For (a), let ϵ > 0 be arbitrary and choose Ns , Nt such that n ≥ Ns (n ≥ Nt ) implies |sn − s | < ϵ/2
(|tn − t | < ϵ/2). Then, if n ≥ max(Ns , Nt ),
∆
|sn + tn − (s + t)| ≤ |sn − s | + |tn − t | < ϵ
38
3 Numerical Sequences and Series
For the first part of (b), if c = 0, the result is immediate. Otherwise, let ϵ > 0 and N ∈ N such that
n ≥ N implies |s − sn | < ϵ/|c |. Then n ≥ N implies |cs − csn | < |c|c|ϵ| = ϵ, completing the proof. The
second part is similarly easy.
For (c), we use the following manipulation:
Choose Ns such that n ≥ Ns implies |sn − s | < ϵ/2|t | and Nt such that n ≥ Nt implies |tn − t | < ϵ/2S,
where S = maxn ∈N |sn |. Then, if n ≥ max(Ns , Nt ),
ϵ ϵ
|sn tn −st | = |sn (tn −t) + (sn −s)t | ≤ |sn (tn −t)| + |(sn −s)t | = |sn ||tn −t | + |sn −s ||t | ≤ S · + · |t | = ϵ
2S 2|t |
as required. If |t | = 0, then choose Ns such that n ≥ Ns implies |tn | < ϵ/S, and the result follows.
For (d), we can use (c) to conclude (1/sn )sn → 1, and since sn → s, we must have 1/sn → 1/s.
Proof. For (a), suppose {xn } converges to x. So for ϵ > 0, we can choose N ∈ N such that n ≥ N implies
|xn − x| < ϵ. Then for any i, |α i,n − α i | < ϵ and the result follows.
Suppose
√ each {α i,n } converges to α i . Let ϵ > 0. For each i, we can choose Ni ∈ N such that |α i,n − α i | <
ϵ/ k so that if n ≥ max(N 1, . . . , Nk ), then
v
u
t k r
Õ ϵ2
|xn − x| = |α i,n − α i | 2 ≤ k · = ϵ,
i=1
k
Definition 20. Given a sequence {pn }, consider an increasing sequence of positive integers {nk }. Then the
sequence {pnk } is a subsequence of {pn }. If {pnk } converges, its limit is called a subsequential limit of
{pn }.
39
3 Numerical Sequences and Series
Proof. Suppose pn → p. Then for any ϵ > 0, there exists N ∈ N such that n ≥ N implies |pn − p| < ϵ.
Then if {pnk } is a subsequence of pn , then there exists some K ∈ N such that k ≥ K implies nk ≥ N . So
for k ≥ K, we have |pnk − p| < ϵ.
If every subsequence of pn converges to p, then in particular, the subsequence with nk = k (the entire
sequence) converges, so we are done!
Proof. For (a), let K be the image of {pn }. Then if K is finite, there exists some p ∈ K which is
attained infinitely often. The subsequence of these points converges to p. Otherwise, K is an
infinite subset of a compact metric space, and thus admits a limit point in X . The result follows
from Theorem 3.2(d).
The result in (b) follows from (a) since bounded subsets of Rk lie inside a compact subset of Rk
(namely the closure of the image of the sequence).
Theorem 38. The subsequential limits of a sequence {pn } in a metric space X form a closed subset of X .
Proof. Let E ∗ be the set of all subsequential limits of {pn } and let q be a limit point of E ∗ . Choose n 1
such that pn1 , q. If this doesn’t exist, E ∗ has one point and we are done. Put δ = d(q, pn ). Then build
nk inductively, so that given n 1, . . . , ni−1 , we choose x ∈ E ∗ such that d(x, q) < 2−i δ . Since x ∈ E ∗ , we
can choose ni > ni−1 such that d(x, pni ) < 2−i δ . Then d(q, pni ) < 21−i δ so pni → q so q ∈ E ∗ .
Proof. For (a), notice that E ⊆ E so diam E ≤ diam E. If diam E > diam E, there exists a pair of elements
x, y ∈ E with |x − y| > diam E. If one of x, y is not in E, then a neighbourhood of size less than
(|x − y| − diam E)/2 around the element in E has no elements in E, a contradiction. A similar argument
can be made if both are not in E.
For (b), we know from Nested interval theorem that S = n=1 Kn is non-empty. If S had more than one
Ñ∞
point, say a and b, then diam Sn ≥ d(a, b) > 0 all n and thus diam Sn 9 0, a contradiction.
40
3 Numerical Sequences and Series
Proof. For (a), let ϵ > 0. Then there exists N such that n ≥ N implies d(pn , p) ≤ ϵ/2. Then for any
m, n ≥ N , we have d(pn , pm ) ≤ d(pn , p) + d(pm , p) < ϵ.
For (b), let En = {pn , pn+1, . . . }. Then since {pn } is Cauchy, diam En → 0. Also, En are subsets of X and
thus compact. Finally, En ⊇ En+1 by definition. By Theorem 3.10, n=1 En consists of one point. Let
Ñ∞
this point be p. Since diam En → 0, points in {pn } get arbitrarily close to p so {pn } converges to p ∈ X .
For (c), Cauchy sequences are bounded, and thus their image is contained in a compact subset of Rk .
The result follows from (b).
Definition 22. A metric space in which every Cauchy sequence converges is complete.
Theorem 3.11 says that all compact metric spaces and all Euclidean spaces are complete.
Also, every closed subset of a complete metric space is complete.
Definition 23. A sequence {sn } of real numbers is said to be
• monotonically increasing if sn ≤ sn+1 for all n,
• monotonically decreasing if sn ≥ sn+1 for all n
The class of monotonic sequences consists of the increasing and decreasing sequences.
Theorem 41. Suppose {sn } is monotonic. Then {sn } converges iff it is bounded.
Proof. Suppose {sn } is increasing and bounded. Then its image admits a least upper bound α. For every
ϵ > 0, there exists N such that s − ϵ < s N ≤ s, otherwise s − ϵ would be a smaller upper bound. For
n ≥ N , s − ϵ < s N ≤ sn ≤ s, as required.
The converge follows from Theorem 3.2(c).
Theorem 42. Let {sn } be a sequence of real numbers. Let E be the set of possibly infinite subsequential
limits of {sn }, and s ∗ = lim sup sn . Then the following are true:
1. s ∗ ∈ E.
41
3 Numerical Sequences and Series
Proof. For (a), if sn → ±∞, then every subsequence goes to ±∞ so s ∗ ∈ E. Otherwise, it is bounded and
thus E is closed by Theorem 3.7, so s ∗ = sup E ∈ E = E.
For (b), notice that is this weren’t the case, then there are sn ≥ x for n arbitrarily large. Taking these sn
as a subsequence yields a subsequence with limit ≥ x > sn , contradicting the maximality of s ∗ in E.
Finally, let a, b ∈ E ⊆ R both satisfy these above properties. Without loss of generality, a > b. Let x ∈ R
such that b < x < a. Then statement (b) holds for b so sn < x < a for n ≥ N . Then a cannot satisfy
(a).
For example, the sequence containing all rational numbers {sn } has every real number as a subsequential
limit. Also, its lim sup is +∞ and lim inf is −∞.
Also, it is noteworthy that a sequence converges iff its lim sup is equal to its lim inf.
Theorem 43. If sn ≤ tn for n ≥ N , where N is fixed, then
Proof. The cases with infinities are simple casework. Thus, suppose all values involved are finite real
numbers. Suppose, for the sake of contradiction that lim inf n→∞ sn > lim inf n→∞ tn . Then, there exists
some y ∈ Et strictly less than all the x ∈ Es , where Es and Et are the sets of subsequential limits of {sn }
and {tn } respectively.
Consider the subsequence {tnk } which converges to this y. Suppose without loss of generality that
each nk ≥ N . Then each term in {snk } is at most the corresponding term in {tnk }. In particular, some
subsequence {snki } converges to a value at most y, a contradiction, since some x ∈ Es is at most y.
The corresponding proof for lim sup follows similarly.
42
3 Numerical Sequences and Series
n k n(n − 1) · · · (n − k + 1) k nk p k
n
(1 + p) > p = p > k .
k k! 2 k!
Hence
nα 2k k! α −k
0< < n
(1 + p)n pk
which approaches zero since α − k < 0 and by (a).
Statement (e) follows by taking α = 0 in (d).
3.5 Series
Definition 25. Given {an } we associate a sequence {sn } where
n
Õ
sn = ak = a 1 + a 2 + · · · + ak ,
k =1
The notion of Cauchy sequences on the partial sums can be extended to series:
Theorem 45. an converges iff for every ϵ > 0, there exists an integer N such that
Í
n
Õ
ak < ϵ
k =m
for all n ≥ m ≥ N .
Proof. Notice that nk=m ak = sn − sm , so the condition is equivalent to the sequence {sn } being Cauchy.
Í
The result follows from the fact that Cauchy sequences and convergent sequences are equivalent in
Rk .
Theorem 47. A series of non-negative terms converges if and only if its partial sums form a bounded
sequence.
Proof. The partial sums form a monotonically increasing sequence, so the result follows as in Theorem
3.14.
43
3 Numerical Sequences and Series
Proof. For (a), let {sn } be the sequence of partial sums of an and C = c n . Also, notice that c n ≥ 0. Let
Í
ϵ > 0. Then since c n converges, there exists an integer N such that for n ≥ m ≥ N ,
Í
n
Õ
c k < ϵ.
k=m
so an converges.
Í
For (b), suppose without loss of generality that dn diverges to +∞. Then let M be a real number. Then
Í
for n ≥ N ,
Õn n
Õ
ak ≥ dk > M
k =1 k=1
so an diverges too.
Í
1 1 x n+1 1 1
sn − = − − = x n+1 · →0
1−x 1−x 1−x 1−x 1−x
The following theorem states that a rather ‘thin’ subsequence of decreasing sequences determines the
convergence of an .
Í
Theorem 50 (Cauchy). Suppose a 1 ≥ a 2 ≥ · · · ≥ 0. Then the series an converges if and only if the
Í
series
∞
2k a 2k = a 1 + 2a 2 + 4a 4 + · · ·
Õ
S :=
k =0
converges.
44
3 Numerical Sequences and Series
Proof. Let {bn } be a sequence related to an such that bn = a f (n) where f (n) is the largest power of two
at most n. That is, the sequence {bn } starts (a 1, a 2, a 2, a 4, a 4, a 4, a 4, · · · ). Since an is decreasing, {bn } is a
strictly decreasing sequence where each term is at least the corresponding an . Also, notice S is bn , so
Í
the fact that an converges if S converges follows by the comparison test.
Í
2k 2k
1 1Õ
= a 1 + a 2 + (a 3 + a 4 ) + . . . + (a 2k −1 +1 + · · · + a 2k ) ≥ a 1 + a 2 + 2a 4 + . . . + 2k−1a 2k =
Õ
bi
i=1
2 2 i=1
k
Proof. We can write 2k a 2k = 22kp = 2(1−p)k so k∞=0 (2(1−p) )k converges iff 0 ≤ 21−p < 1 from Theorem
Í
3.26, which occurs when p > 1. The result follows by the previous theorem, since the original sequence
was decreasing.
Proof. Write
2k 1 c
2k a 2k = = p
= p
k k
2 (log 2 )p (k log 2) k
1
with c = so by Theorem 3.27, an converges iff
Í
(log 2)p
∞
Õ 1
c
kp
k =1
45
3 Numerical Sequences and Series
Ín 1 1 n
Proof. Let sn = and tn = 1 + n . By the binomial theorem,
k=0 k !
n n n k −1
n 1 1 n! 1 Ö i
Õ Õ Õ
tn = = · = 1− .
k nk k! (n − k)!nk k! i=1 n
k=0 k =0 k =0
As a sidenote, the following bound on the error of the partial sums proves helpful:
Remark. Notice that
∞ ∞
Õ 1 1 Õ 1 1
0 < e − sn = < =
k! (n + 1)! (n + 1)k n!n
k=n+1 k =0
which shows that this infinite sum representation converges extremely quickly.
Theorem 54. e is irrational.
Proof. Suppose otherwise, so that e = p/q for positive integers p, q. By the above remark, 0 < q!(e −sq ) <
1
q . Then both q!sq and q!e are integers, so the above inequality implies the existence of an integer
between 0 and 1/q, a contradiction.
Proof. For (a), we know then that for large enough n, n |an | ≤ α so |an | ≤ α n . The result follows from
p
the comparison test. For (b), notice that for large enough n, |an | ≥ 1, so the series diverges by the
contrapositive of Theorem 3.23.
For (c), the series 1/n and 1/n2 work.
Í Í
a n+1 a n+1
Theorem 56 (Ratio Test). The series an converges if lim supn→∞ < 1 and diverges if ≥1
Í
an an
for all n ≥ n 0 where n 0 is some fixed integer.
46
3 Numerical Sequences and Series
The previous two tests are very useful for determining convergence. In most cases, the ratio test is
easier to use, but we show in the following theorem than the root test is stronger.
Theorem 57. For any sequence {c n } of positive numbers,
c n+1 √ √ c n+1
lim inf ≤ lim inf n c n ≤ lim sup n c n ≤ lim sup
n→∞ cn n→∞ n→∞ n→∞ cn
Proof. Let α = lim supn→∞ ccn+1 . If α = +∞, we are done. Otherwise, suppose some subsequence {c nk }
n √
had the property that limk→∞ nk c nk = β > α. Then, there must exist infinitely many terms {c n } with
c nn > α n . But this cannot occur if the ratios of consecutive terms does not exceed α infinitely often, a
contradiction. The left inequality is similar.
n=0
is called a power series. The numbers c n are called the coefficients of the series, and z is a complex
number.
Theorem 58. Given a power series as above, let
n
p 1
α = lim sup |c n |, R = .
n→∞ α
Proof. The value for α is derived from the result of the root test, so this proof is just like unwrapping a
present you wrapped for yourself.
as required.
47
3 Numerical Sequences and Series
(b) b0 ≥ b1 ≥ · · · .
(c) bn → 0.
Then an bn converges.
Í
Proof. The above theorem suggests using the Cauchy criterion. Let ϵ > 0 and suppose the An are
bounded by M. We first manipulate:
q
Õ q−1
Õ
a n bn = An (bn − bn+1 ) + Aq bq − Ap−1bp
n=p n=p
" q−1 #
Õ
≤M (bn − bn+1 ) + |bq | + |bp |
n=p
= 2Mbq
Proof. This follows from the previous theorem, letting an = (−1)n+1 and bn = |c n |.
2
Proof. Let an = z n and bn = |c n |. Then the An are bounded by |1−z | , so Theorem 3.42 applies.
Proof. Let ϵ > 0 be arbitrary and N such that the Cauchy criterion holds for n ≥ m ≥ N . Then, for
these same n, m, we have
Õn Õn
ak ≤ |ak | < ϵ
k =m k=m
48
3 Numerical Sequences and Series
Notice that absolute convergence is convergence if your terms are positive. Series which converge but
not absolutely converge non-absolutely.
We see that we can manipulate absolutely convergent sums much like finite sums. Let’s explore the
types of operations we can perform on general series.
Theorem 64. If an = A and bn = B, then (an + bn ) = A + B and can = cA for any c.
Í Í Í Í
Proof. The first statement follows after noticing that the partial sums of (an + bn ) are simply the sums
Í
of the partial sums of an and bn . The second statement follows similarly.
Í Í
n
Õ
cn = ak bn−k
k=0
and call c n the product of the two series. This is motivated by considering the coefficients of the related
Í
generating series an z n and bn z n .
Í Í
The following theorem, due to Mertens, characterizes a large set of situations where this notion of
product is consistent.
Theorem 65 (Mertens). Suppose that
(a) an converges absolutely.
Í
(b) an = A.
Í
(c) bn = B.
Í
Then c n = AB.
Í
Proof. Let An , Bn and Cn be the n-th partial sums of an , bn , c n respectively. Also, let β = Bn − B. We
manipulate Cn , essentially switching the order of summations:
Cn = a 0b0 + (a 0b1 + a 1b0 ) + (a 0b2 + a 1b1 + a 2b0 ) + · · · + (a 0bn + a 1bn−1 + · · · + an b0 )
= a 0 Bn + a 1 Bn−1 + · · · + an B 0
= a 0 (B + βn ) + a 1 (B + βn−1 ) + · · · + an (B + β 0 )
= An B + a 0 βn + a 1 βn−1 + · · · + an β 0 .
Thus if we let
γn = a 0 βn + a 1 βn−1 + · · · + an β 0,
it suffices to prove that γn → 0. Let ϵ > 0 be arbitrary and choose N such that |βn | < ϵ for n ≥ N . Also,
let α = |an |. Then
Í
|γn | = |a 0 βn + a 1 βn−1 + · · · + an β 0 |
≤ |a 0 βn + a 1 βn−1 + · · · + a N βn−N | + |a N +1 βn−N −1 + · · · + an β 0 |
≤ |a 0 βn + a 1 βn−1 + · · · + a N βn−N | + ϵα
so if we fix N and let n → ∞, then we have
lim sup |γn | ≤ ϵα
n→∞
49
3 Numerical Sequences and Series
We now wonder whether this definition of product makes sense in the sense that if c n converges,
Í
whether it will have the sum AB. Abel proved that this is true.
Theorem 66. If an , bn , c n converge to A, B, C and c n = a 0bn + · · · + an b0 , then C = AB.
Í Í Í
Proof. Note: This is an incomplete proof, as it assumes that power series are continuous without
justification. We will fill in this gap in Theorem 8.2. This proof will be replaced later, since it is most
likely circular or wrong.
Let a(z) = an z n , b(z) = bn z n , and c(z) = c n z n . Then we (will) have seen that c(z) = a(z)b(z)
Í Í Í
if c n = a 0bn + · · · + an b0 , if we consider partial sums of c(z). Then the result follows by substituting
z = 1.
3.12 Rearrangements
Definition 29. Let {kn } be a bijection from N to itself. Then letting an0 = akn , we say that an0 is a
Í
rearrangement of an .
Í
Notice that the partial sums of these have no reason to be the same, so the question arises: should they
converge? Will their sums be the same if so?
We prove Riemann’s rearrangement theorem, which says that all hell breaks loose if our sequence
doesn’t converge absolutely:
Theorem 67 (Riemann Rearrangement). Let an which converges non-absolutely, and −∞ ≤ α ≤ β ≤
Í
+∞. Then, there exists a rearrangement an with partial sums sn0 such that
Í 0
Proof. This proof is well-studied. We present an informal proof hopefully sufficient to provide intuition.
The actual proof is long-winded.
Let an+ and an− denote the subsequences of positive and negative terms of an . If either subsequence were
finite, then the other would overpower it and eventually either diverge or converge (after which an
would converge absolutely since the subsequence has all the same sign). Thus both must be infinite.
If α ≥ 0, take positive terms until our partial sums go above α. We must be able to do this, since
both subseries diverge. Similarly, if α < 0, take negative terms. Then, repeat the following process
indefinitely. Take positive terms until our partial sums go above β, then negative terms until our partial
sums return below α. This is again made possible since both subsequences diverge.
Taking the terms which are ‘turning points’ on both sides, we have sequences which converge to α and
β, so the result follows.
Now, the following theorem states that absolutely convergent series do indeed behave nicely:
Theorem 68. If an converges absolutely, then every rearrangement of an converges, and further they
Í Í
converge to the same sum.
Proof. Let an0 = akn be a rearrangement of an . Let ϵ > 0. Take N ∈ N such that if n ≥ m ≥ N ,
Í Í Í
then
n
Õ
|ak | < ϵ.
k =m
50
3 Numerical Sequences and Series
3.13 Exercises
1. Prove that convergence of {sn } implies convergence of |sn |. Is the converse true?
Proof. If sn → 0, then the result follows by definition. Otherwise, suppose sn → s and s > 0. We
can find N such that n ≥ N implies s/2 < sn < 3s/2, so that ||sn | − |s || = |sn − s | < ϵ. The proof
when s < 0 is similar, except we take ϵ = −s/2.
The converse is not true: consider the sequence sn = (−1)n for which |sn | = 1 converges trivially,
but for which no N exists for ϵ < 2.
√
2. Calculate limn→∞ ( n2 + n − n).
Proof. We manipulate:
√ n2 + n − n2 n 1
n2 + n − n = √ =√ =p .
n2 + n + n n2 + n + n 1 + 1/n + 1
p 1 + 1/n − 1 1
| 1 + 1/n − 1| = p ≤ →0
1 + 1/n + 1 2n
so √ 1 1 1
n2 + n − n = p → = .
1 + 1/n + 1 1+1 2
√ √
3. If s 1 = 2 and sn+1 = 2 + sn , prove that {sn } converges and that sn < 2.
p
p √
Proof. Notice that the function f (x) = 2 + x is monotonic in the sense that if a < b, then
p √ √
f (a) < f (b). Also, s 2 = 2 + 4 2 ≥ 2 = s 1 . Also, if sn−1 ≤ sn , then
√ √
q q
sn = 2 + sn−1 ≤ 2 + sn = sn+1 .
4. Find the upper and lower limits of the sequence {sn } defined by
s 2m−1 1
s 1 = 0; s 2m = ; s 2m+1 = + s 2m .
2 2
51
3 Numerical Sequences and Series
Proof. We claim that s ∗ = 0 and s ∗ = 1. First, the fact that 0 ≤ sn ≤ 1 with s 2m ≤ 1/2 can be
shown inductively. The base case follows immediately. Otherwise, dividing by 2 preserves the
conditions, and adding 1/2 to s 2m gives a number at most 1. It suffices now to find two sequences
converging to 0 and 1.
k
the sequence {s 2k } is identically 0. The sequence s 2k +1 −1 = 0, 12 , 34 , . . . , 2 2k−1 , . . . converges to
1.
Proof. If both α and β are +∞, the result proves itself. Likewise if α and β are both −∞, so it
suffices to consider the finite cases.
Let α = lim supn→∞ an and β = lim supn→∞ bn . Suppose that lim supn→∞ (an + bn ) = γ > α + β.
Then, some sequence {sn } with sn = an + bn converges to γ . For each sn , either an > α or bn > β.
Thus, either an > α infinitely often, or bn > β infinitely often. In either of these cases, either
lim supn→∞ an > α, or lim supn→∞ bn > β, a contradiction.
Proof. Manipulate:
√ √
n+1− n (n + 1) − n 1
0 ≤ an = = √ √ ≤ 3/2
n n( n + 1 + n) n
1
(d) an = 1+z n for complex z.
Proof. Notice that if |z| < 1, then an → 1 , 0, so the sum does not converge. If |z| > 1, notice
that the ratios
an 1 + z n−1 1
= n
→ <1
an−1 1+z z
so an converges by the ratio test. If |z| = 1, the sum diverges.
Í
52
3 Numerical Sequences and Series
Í √an
7. Prove that the convergence of an implies the convergence of , if an ≥ 0.
Í
n
for which the first sum converges by Theorem 3.42, and the second sum converges as a scalar
multiple of a convergent series.
n 3z n
lim sup = |z| < 1
n→∞ (n − 1)3z n−1
so the radius of convergence is 1.
Í 2n n.
(b) n! z
53
3 Numerical Sequences and Series
10. Suppose that the coefficients of the power series an z n are integers, infinitely many of which
Í
are non-zero. Prove that the radius of convergence is at most 1.
where m ≥ N and M is a bound on the partial sums of bn which exists since bn converges.
Í Í
So the partial sums of an monotonically increasing and bounded above, so an converges.
Í Í
54
3 Numerical Sequences and Series
∞
Õ
rn = am .
m=n
Now for any ϵ > 0, suppose ϵ < 1/2 without loss of generality. Then, for any n ∈ N, we can
find m ≥ n such that rrmn < 1 − ϵ since r n → 0, so nk =m ar kk = 1 − rrmn > ϵ. Then ar kk is not
Í Í
Cauchy and thus diverges.
√ √
r n + r n+1
Proof. Since r n+1 < r n , we have √
rn
< 2. Reorganizing yields
an √ √
√ < 2( r n − r n+1 )
rn
55
3 Numerical Sequences and Series
13. Let an and bn converge absolutely. Then, letting c n = nk =1 ak bn−k , prove that c n converges
Í Í Í Í
absolutely.
and that letting dn be the positive terms on the right side, dn converges by Merten’s theorem.
Í
Thus, c n converges absolutely by the comparison test.
Í
1
14. If {sn } is a complex sequence, define its arithmetic means σn = n+1 (s 0 + . . . + sn ).
(a) If sn → s, prove that σn → s.
Proof. Let ϵ > 0 be arbitrary and N 0 ∈ N such that |sn − s | < ϵ/2 for n ≥ N 0 . Notice then that
N0 n
∆ Õ |sk − s | Õ |sk − s | S ϵ
|σn − s | ≤ + < +
n+1 n+1 n+1 2
k =0 k=N 0
ÍN0
where S = k=0
|sk − s | is finite. Then, letting N = max(N 0, 2S/ϵ), we notice that
S ϵ
|σn − s | ≤ + ≤ϵ
n+1 2
if n ≥ N , so σn → s.
(b) Construct a sequence {sn } which does not converge, but σn does.
Proof. Consider sn = (−1)n . Then clearly sn does not converge as the partial sums alternate
between 0 and 1, but (
0 if n even
σn = 1
n+1 if n odd
does.
(c) Can it happen that sn > 0 for all n and that lim supn→∞ sn = +∞, but σn → 0?
√ √
Proof. Yes, let s 0 = 1, then sn = 3 n > 0 if 3 n is an integer, otherwise let sn = 1/n2 > 0. Then,
n n
!
1 Õ 1 π2
1 + 2n2/3 + 1/i 2 < 2n−1/3 + n−1 → 0
Õ
σn = sn <
n+1 n+1 i=0
6
k=0
but for any M ∈ R, there exists some integer k > M, and thus s (k−1)2 > M, so lim supn→∞ sn =
+∞, as required.
Assume that nan → 0 and {σn } converges. Prove that {sn } converges. (Notice this provides a
converse of (a) given nan → 0.
56
3 Numerical Sequences and Series
(e) Prove (d) with the weaker hypothesis that nan is bounded by M < ∞ and σn → σ . Show that
sn → σ , by completing the following outline:
(i) If m < n, then
n
m+1 1 Õ
s n − σn = (σn − σm ) + (sn − si ).
n −m n − m i=m+1
Proof. We manipulate:
n m n m
1 Õ 1 Õ n −m 1 Õ 1 Õ
sn = sk − sk + · sn − sk + sk
n −m n −m n −m n −m n −m
k =0 k =0 k =0 k =0
" n m
# " n m
#
1 Õ Õ 1 Õ Õ
= sk − sk + (sn − sk ) − (sn − sk )
n −m n −m
k =0 k=0 k=0 k=0
n
1 1 Õ
= [(n + 1)σn − (m + 1)σm ] + (sn − sk )
n −m n −m
k=m+1
n
n+1 m+1 1 Õ
= σn − σm + (sn − sk )
n −m n −m n −m
k =m+1
so
n
n+1 m+1 1
Õ
s n − σn = − 1 σn − σn + (sn − sk )
n −m n −m n −m
k=m+1
n
m+1 1 Õ
= (σn − σm ) + (sn − si ),
n −m n − m i=m+1
as required.
57
3 Numerical Sequences and Series
(iii) Fix ϵ > 0 and associate to every n the integer m such that
n −ϵ
m≤ < m + 1.
1+ϵ
Then (m + 1)/(n − m) ≤ 1/ϵ and |sn − si | < Mϵ so lim supn→∞ |sn − σ | ≤ Mϵ so the
result follows.
(n − m − 1)M
|sn − si | ≤ < Mϵ
m+2
and thus
m+1∆ 1
|sn − σn | ≤ |σn − σm | + |sn − si | < Mϵ
n −m n −m
for sufficiently large m, n such that the first term vanishes (since σn → σ ). Thus
15. Extend Definition 3.21 to points in Rk . Show that Theorems 3.22, 3.23, 3.25(a), 3.33, 3.34, 3.42,
3.45, 3.47, 3.55 are true in this more general setting.
Proof. Each proof is presented in a sufficiently general fashion such that replacing |x − y| with
the respective metric in Rk suffices to prove each theorem for infinite sums in Rk .
√
16. Fix α > 0. Choose x 1 > α and let
1 α
x n+1 = xn + .
2 xn
√
(a) Prove that {x n } decreases monotonically and that x n → α.
√ √
Proof. Notice that x n > α for each n from the AM-GM inequality, and thus α/x n < α.
Thus, x n+1 < x n and the sequence is monotonically decreasing and bounded below. Thus, it
√
converges to some L ≥ α. Then, notice that
1 α 1
L = lim x n+1 = lim xn + = (L + α/L)
n→∞ n→∞ 2 xn 2
√ √
which has solutions ± α, so limn→∞ x n = α, as required.
58
3 Numerical Sequences and Series
√
(b) Put ϵ = x n − α and show that
ϵn2 ϵ2
ϵn+1 = < √n
2x n 2 α
√
so that if β = 2 α,
2n
ϵ1
ϵn+1 <β .
β
Proof. We have
√
ϵn2 (x n − α)2 1 √ α 1 α √
= = xn − 2 α + = xn + − α = ϵn+1 .
2x n 2x n 2 xn 2 xn
√
and the second part of the inequality follows from the fact proven in part (a) that x n > α
for all n. The second fact follows inductively by the above statement.
(c) This presents a great algorithm for computing square roots. Show that if α = 3 and x 1 = 2,
that ϵ1 /β < 0.1 so ϵ5 < 4 · 10−16 and ϵ6 < 4 · 10−32 .
√ √
Proof. We have β = 2 3 and ϵ1 = 2 − 3 so
√
ϵ1 2 − 3 4−3 1
= √ = √ √ < < 0.1
β 2 3 2 3(2 + 3) 10.5
√ 4 5
since 3 < 1.5. Then ϵ5 < β · 10−2 < 4 · 10−16 and similarly ϵ6 < β · 10−2 < 4 · 10−32 as
required.
√
17. Fix α > 1 and take x 1 > α and
α + xn α − x n2
x n+1 = = xn + .
1 + xn 1 + xn
(a) Prove that x 1 > x 3 > x 5 > · · · .
(b) Prove that x 2 < x 4 < x 6 < · · · .
59
3 Numerical Sequences and Series
√
Proof. The subsequence {x 2k+1 } is decreasing and bounded below by α, and is thus conver-
√
gent. It must converge to a value L ≥ α such that
α + xn α + L
L = lim x n+2 = lim = ,
n→∞ n→∞ 1 + xn 1+L
√ √
whose solutions are ± α. Thus, L = α.
(d) Compare the rapidity of convergence of this process with the previous one.
Proof. TODO: It converges slower, but I’m not able to show this...
where p is a fixed positive integer. Describe the behaviour of the resulting sequences {x n }.
√
Proof. Notice that according to the definition, we must have x n+1 ≥ p α by the AM-GM inequality.
√
Then let x 1 > p α without loss of generality, then by a similar argument as above, x n+1 < x n so
the sequence converges since it is monotonically decreasing and bounded below. It must converge
√ √
to its unique fixed point at least p α, namely x n → p α.
Proof. First, notice that for any real number r ∈ [0, 1], we can associate a ternary representation
{an } ∈ {0, 1, 2}N such that r = n=0 an 3−n , so this question reduces to showing that r is in the
Í∞
Cantor set iff it contains no 1s in its ternary representation.
Notice that by construction, the middle thirds of each segment that we remove in the i-th iteration
have a 1 as the i-th trit. So {0, 2}N ⊇ C. The converse follows again by noticing that any
r ∈ {0, 2}N is in each closed partial Cn , and thus in its infinite intersection C.
20. Suppose {pn } is a Cauchy sequence in a metric space X and some subsequence {pni } converges
to a point p ∈ X . Prove that the full sequence {pn } converges to p.
Proof. Let ϵ > 0 be arbitrary and let I ∈ N such that d(pni , p) < ϵ/2 for all i ≥ I . Also, let
N ∈ N such that for all n ≥ m ≥ N , we have d(pn , pm ) < ϵ/2. Let N 1 be some ni for i ≥ I such
that ni ≥ N . Then for all n ≥ N 1 , we have d(pn , p) ≤ d(pn , p N1 ) + d(pn I , p) ≤ ϵ so pn → p as
required.
21. Prove the following analogue of Theorem 3.10(b): If {En } is a sequence of closed non-empty
and bounded sets in a complete metric space X , if En ⊇ En+1 , and if diam En → 0, then n=1 En
Ñ∞
contains a single point.
Proof. Let pn ∈ En . Then {pn , pn+1, . . . } ⊆ En so diam{pn , pn+1, . . . } → 0, ie. {pn } is Cauchy.
Since X is complete, pn → p for some p ∈ X . But now for any n, the sequence {pn , pn+1, . . . }
converges to p ∈ En since En is closed. Thus, p ∈ n=1 En . Now p must be the unique point in
Ñ∞
E otherwise the diameters of E would not fall below some positive real value.
Ñ∞
n=1 n n
60
3 Numerical Sequences and Series
22. Suppose X is a non-empty complete metric space, and {G n } is a sequence of dense open subsets
of X . Prove Baire’s theorem, namely that n=1 G n is non-empty, and in fact dense in X .
Ñ∞
23. Suppose {pn }, {qn } are Cauchy sequences in a metric space X . Show that the sequence {d(pn , qn )}
converges.
Proof. Let ϵ > 0 be arbitrary. Let Np ∈ N such that if n ≥ m ≥ Np , then d(pn , pm ) < ϵ/2. Define
Nq similarly. Then, for any n ≥ m ≥ max(Np , Nq ), we have
and similarly d(pm , qm ) ≤ d(pn , qn ) +ϵ, so |d(pn , qn ) −d(pm , qm )| < ϵ, so the sequence {d(pn , qn )}
is Cauchy in R and thus convergent since R is complete.
Proof. Clearly d(pn , pn ) → 0 so {pn } ∼ {pn }. Also, since d(pn , qn ) = d(qn , pn ), the relation
is symmetric. Finally, if {pn } ∼ {qn } and {qn } ∼ {r n }, then 0 ≤ d(pn , r n ) ≤ d(pn , qn ) +
d(qn , r n ) → 0 converges to 0 by the comparison test. Thus the relation is transitive. These
properties together make ∼ an equivalence relation on the Cauchy sequences in X .
(b) Let X ∗ be the set of equivalence classes under the above relation. If P, Q ∈ X ∗ and {pn } ∈ P,
{qn } ∈ Q, define
∆(P, Q) = lim d(pn , qn ),
n→∞
which exists from Exercise 23. Show that the number ∆(P, Q) is unchanged if we replace {pn }
or {qn } by equivalent sequences, so that ∆ is a distance function in X ∗ .
Proof. Suppose D = ∆(P, Q) = limn→∞ d(pn , qn ) for Cauchy sequences {pn } ∈ P, {qn } ∈ Q.
Suppose that {r n } ∈ P. Then, notice that
∆ ∆
d(pn , qn ) − d(pn , r n ) ≤ d(r n , qn ) ≤ d(pn , r n ) + d(pn , qn )
Proof. It is always true that convergent sequences are Cauchy. It suffices to show that
Cauchy sequences in X ∗ converge in X ∗ . Suppose that {Pn } is Cauchy with respect to
the distance metric ∆. Then, for each n, there exists a Cauchy sequence in Q kn such that
limk→∞ ∆(Pn , Q kn ) = 0. Then, for each n, we extract a subsequence Q n0 of Q kn such that
∆(Pn , Q n0 ) < 1/n for all n. Then,
∆
∆(Q n0 , Qm
0
) ≤ ∆(Pn , Q n0 ) + ∆(Pn , Pm ) + ∆(Pm , Qm
0
) < 1/n + 1/m + ϵ < 3ϵ
61
3 Numerical Sequences and Series
for sufficiently large m, n, where ϵ > 0 can be made arbitrarily small. Thus {Q n0 } is Cauchy.
Also, if {Q n0 } ∈ P, then
so Pn → P is convergent, as required.
(d) For each p ∈ X , there is a Cauchy sequence all of whose terms are p. Let Pp be the element of
X ∗ which contains this sequence. Prove that
∆(Pp , Pq ) = d(p, q)
Proof. This is trivial if we take the representative sequence to be the constant sequence of p
and q: the limit is that of a constant sequence.
(e) Prove that ϕ(X ) is dense in X ∗ , and that ϕ(X ) = X ∗ if X is complete. By (d), we may identify
X and ϕ(X ) and thus regard X as embedded in the complete metric space X ∗ . We call X ∗ the
completion of X .
Proof. Take P ∈ X ∗ and {pn } ∈ P with pn ∈ X . Then Ppn → P since {pn } is Cauchy, so ϕ(X )
is dense in X ∗ . Similarly, if X were complete, then every element P ∈ X ∗ admits a constant
sequence, so X ∗ ⊆ ϕ(X ) completing the proof.
25. Let X be a metric space whose points are the rational numbers, with the metric d(x, y) = |x − y|.
What is the completion of this space? (Compare Exercise 24).
Proof. The completion is R! Clearly Q∗ ⊆ R by definition, and every element r ∈ R has some
Cauchy sequence in Q which converges to it, namely the one generated by its decimal expansion.
62
4 Continuity
iff there is a point q ∈ Y such that for every ϵ > 0, there exists δ > 0 such that dY (f (x), q) < ϵ for all
points x ∈ E such that 0 < d X (x, p) < δ .
Note that p ∈ X , but that p need not be in E. Also, even if p ∈ E, we do not necessarily have f (p) =
limx →p f (x). This property is interesting and we will formalize it later.
Theorem 69. Let X, Y , E, f , p as in Definition 4.1. Then, f (x) → q when x → p iff f (pn ) → q for every
sequence {pn } in E \ {p} which converges to p.
Proof. Suppose that f (x) → q as in Definition 4.1. Then, let ϵ > 0 be arbitrary, and let δ such that
Definition 4.1 holds. Then, if {pn } is some sequence converging to p, then eventually d X (pn , p) < δ , so
dY (f (pn ), q) < ϵ eventually. Thus f (pn ) → q.
Now suppose that f (x) 9 q as in Definition 4.1. Then, there exists some ϵ > 0 such that for every δ > 0,
we can find some x ∈ E such that 0 < d X (x, p) < δ but dY (f (x), q) ≥ ϵ. Let δ = 1/n for every n, giving
us some sequence x n such that dY (f (x n ), q) ≥ ϵ. Thus f (x n ) 9 q, proving the contrapositive.
63
4 Continuity
for all points x ∈ E for which d X (x, p) < δ . If f is continuous at every point in E, then f is continuous
on E.
Notice that every function f is continuous at isolated points of p, since we can always choose δ > 0
sufficiently small such that E ∩ Nδ (x) = {p} so dY (f (x), f (p)) = 0 < ϵ.
Theorem 71. Let everything as Definition 4.5. If p is a limit point of E, then f is continuous at p iff
f (x) → f (p) as x → p.
Theorem 72. Suppose X, Y , Z are metric spaces, f : (E ⊆ X ) → Y and д : f (E) → Z , and h = д ◦ f .
Then, if f is continuous at p ∈ E and д is continuous at f (p), then h is continuous at p.
Proof. Let ϵ > 0. Since д is continuous at f (p), there exists some δ > 0 such that d Z (д(f (x)), д(f (p))) < ϵ
for all f (x) ∈ f (E) such that 0 < dY (f (x), f (p)) < δ . However, since f is continuous, we can find δ 0 > 0
such that dY (f (x), f (p)) < δ whenever 0 < d X (x, p) < δ 0. In other words, we’ve found δ 0 > 0 such that
d Z (h(x), h(p)) < ϵ whenever 0 < d X (x, p) < δ 0, so h is continuous at p as required.
Theorem 73. A mapping f of a metric space X into a metric space Y is continuous on X iff f −1 (V ) is
open in X for every open set V in Y .
Proof. Suppose f is continuous on X . Then, if y ∈ V for some open V , then there exists some neigh-
bourhood Nϵ (y) around y. By the definition of continuity, this admits Nδ (f −1 (y)) in f −1 (V ) for positive
δ , so f −1 (V ) is open in X , so the forward direction is done since V was chosen arbitrarily.
Now for the contrapositive, suppose f −1 (V ) is open for every open subset V of Y . Then, for any x ∈ X ,
for any ϵ > 0, the set V = Nϵ (f (x)) is an open set in Y , so f −1 (V ) is open by assumption. Therefore,
f −1 (f (x)) = x is an interior point of f −1 (V ), so in particular, this gives us the δ > 0 we need for
continuity.
Corollary 9. A mapping f of a metric space X into a metric space Y is continuous iff f −1 (C) is closed in
X for every closed set C in Y .
Theorem 74. If f , д are continuous complex functions on X , then f + д, f д and f /д are continuous on X .
Proof. These follow directly from the corresponding theorems on sequences, and the equivalence of
the sequential definition of continuity.
Theorem 75. Let f 1, f 2, . . . , fk be real functions on X and f be the mapping of X into Rk defined by
f(x) = (f 1 (x), f 2 (x), . . . , fk (x)).
then f is continuous iff each of the functions f 1, . . . , fk is continuous.
Furthermore, if f, g : X → Rk are continuous, then f + g and f · g are continuous.
Now, we show that a very large class of functions is continuous. Firstly, the component function
f ((x 1, . . . , x k )) = x i is continuous for any i ∈ {1, 2, . . . , k }. This shows that polynomials in the
components are continuous, and thus rational functions on (x i ) are continuous. Finally, it can be seen
that f (x) = |x| is continuous.
64
4 Continuity
Proof. Let {G α } be an open cover of f (X ). Then, since f is continuous, each f −1 (G α ) is open, forming
an open cover for X . Since X is compact,
n
Ø
X ⊆ f −1 (G i )
i=1
Ðn
for some finite subset {G i } of {G α }. Then, since f (f −1 (E)) ⊆ E for all E ⊆ X , we have f (X ) = i=1 G i ,
so f (X ) is compact.
Proof. By Theorem 4.14, f(X ) is compact. Thus it is closed and bounded. In particular it is bounded.
Theorem 78 (Extreme Value Theorem). Suppose f is a continuous real function on a compact metric
space X , and
M = sup f (p), m = inf f (p),
p ∈X p ∈X
Proof. From Theorem 4.14, f (X ) is compact and thus closed and bounded. Hence f (X ) contains M and
m, proving the result.
Theorem 79. Suppose f is a continuous bijection from a compact metric space X onto a metric space Y .
Then, f −1 is a continuous mapping of Y onto X .
Proof. To use Theorem 4.8, it suffices to show that f (V ) is open for every open set V in X . But V c is
closed and thus compact in X , and thus f (V c ) is compact and thus closed in Y . Since f is bijective,
f (V ) is the complement is f (V c ), thus f (V ) is open.
Definition 34. A mapping f : X → Y is uniformly continuous on X if for every ϵ > 0, there exists
δ > 0 such that
dY (f (p), f (q)) < ϵ, ∀p, q ∈ X : d X (p, q) < δ .
Theorem 80. If f is a continuous mapping from a compact metric space X into a metric space Y , then f
is uniformly continuous on X .
Proof. We prove the contrapositive: suppose f is not uniformly continuous on X . Then, for some
ϵ > 0, we can find pn , qn ∈ X such that d X (pn , qn ) < 1/n but dY (f (pn ), f (qn )) > ϵ. Then, since X is
compact, we can find a subsequence (pnk ) which converges, then take a subsequence (qnkm ) of (qnk )
which converges, again since X is compact. We relabel these subsequences (r n ) and (sn ). By the Squeeze
Theorem, r n , sn → x 0 for some x 0 ∈ X . However, dY (f (r n ), f (sn )) > ϵ, so f cannot be continuous at x 0
by the sequential definition of continuity, completing the proof.
65
4 Continuity
Proof. For (a), if E is unbounded, then f (x) = x is continuous but unbounded. Otherwise, E is not
closed, so there exists a limit point x 0 of E which is not in E. Then f (x) = 1/(x − x 0 ) is continuous but
unbounded (consider the sequence converging to x 0 ).
For (b), if f (x) is the function constructed in part (a), then д(x) = −1/| f (x)| is bounded above by 0 and
continuous, but never attains its maximum of 0.
For (c), notice E is thus not closed. Then f (x) from part (a) is not uniformly continuous.
Proof. We prove the contrapositive: suppose f (E) = A ∪ B and A, B are non-empty separated sets. Then,
letting A0 = E ∩ f −1 (A) and B 0 = E ∩ f −1 (B), we see that E = A0 ∪ B 0.
Now
A0 ∩ B 0 ⊆ E ∩ f −1 (A) ∩ f −1 (B) = ∅
since A ∩ B = ∅, and repeating the argument for A0 and B 0 shows that (A0, B 0) is a separation of E,
proving the contrapositive.
Theorem 83 (Intermediate Value Theorem). Let f be a continuous real function on the interval [a, b]. If
f (a) < c < f (b), then f (x) = c for some x ∈ (a, b).
Proof. If not, then f ([a, b]) ∩ (f (a), c) and f ([a, b]) ∩ (c, f (b)) form a disconnection of f ([a, b]), contra-
dicting Theorem 4.22.
4.5 Discontinuities
Definition 35. If f : X → Y and x ∈ X is such that f is not continuous at x, we say f is discontinuous
at x, or that f has a discontinuity at x.
Let f : (a, b) → Y . Let x ∈ [a, b). We define f (x+) = q if f (tn ) → q for any sequence {tn } in (x, b) with
tn → x. We define f (x−) symmetrically. Clearly limt →x f (t) exists iff f (x−) = f (x+) = limt →x f (t).
Definition 36. Let f : (a, b) → Y . If f is discontinuous at x, but f (x−) and f (x+) both exist, f is said to
have a discontinuity of the first kind, or a simple discontinuity at x. Otherwise, the discontinuity
is said to be of the second kind.
If f has a simple disconinuity at x, either f (x+) , f (x−), or f (x+) = f (x−) , f (x).
66
4 Continuity
Theorem 84. Let f be monotonically increasing on (a, b). Then f (x+) and f (x−) exist at every point
x ∈ (a, b).
Proof. Every sequence converging to x from the left has a monotonically increasing subsequence. Then
f (x n ) is monotonically increasing and bounded above, say by f (x + δ ). Thus f (x n ) converges by the
Monotone Convergence Theorem. The argument for f (x+) is analogous.
Proof. We associate each discontinuity x in (a, b) with a rational number r such that f (x−) < r < f (x+),
forming an injection from the set of discontinuities to Q, proving the result.
4.8 Exercises
1. Suppose f : R → R with limh→0 [f (x + h) − f (x − h)] = 0 for all x ∈ R. Is f continuous?
Proof. No, consider the function f which is 1 at 0 and 0 everywhere else. Then f satisfies the
condition everywhere but is discontinuous at x = 0.
f (E) ⊆ f (E)
Proof. Let E be arbitrary and x ∈ E. If x ∈ E, then clearly f (x) ∈ f (E) ⊆ f (E). Otherwise, x is a
limit point of E, so some sequence (x n ) in E converges to x, so since f is continuous, the sequence
f (x n ) → f (x), and thus f (x) ∈ f (E), as required.
Let f (x) = 1 for all x < 1 and f (x) = 1/x if x > 1. Then f is continuous on R, but 0 ∈ f (R) but
is not in f (R), so the inclusion can be proper.
67
4 Continuity
Proof. Let V ⊆ f (X ) be open. Since f is continuous, f −1 (V ) is open, and since E is dense, we can
find p ∈ E ∩ f −1 (V ), which corresponds to p ∈ E such that f (p) ∈ V , so in particular, f (p) is an
element of f (E) in V . Since V was arbitrary, f (E) is dense in f (X ).
Notice that E is a dense subset of Z = Z (д − f ). Thus, E = X ⊆ Z ⊆ X , so in particular, X = Z
since Z is closed from the previous Exercise, completing the proof.
5. Let f : (E ⊆ R) → R be continuous, and E be closed. Prove that there exist continuous functions
д : R → R such that д = f on E. These are called continuous extensions of f from E to R.
Show that the result becomes false if E is not closed, and extend the result to vector-valued
functions.
Proof. We know that E c is open and thus a countable union of disjoint open segments of R. For
each of these segments (a, b), assign д(at + b(1 − t)) = t f (a) + (1 − t)f (b), and д(x) = f (x) on E.
Then д = f on E, and д is continuous at the endpoints a and b, as required.
Now, if E = {±1/n : n ∈ N+ }. Then let f (1/n) = 1 + 1/n and f (−1/n) = −1 + 1/n. Then the
sequences f (1/n) → 1 and f (−1/n) → −1 while both ±1/n → 0. Thus any д which agrees with
f on E must be discontinuous at 0, and thus no such continuous extension of f exists.
6. If f is defined on E, the graph of f is the set of points (x, f (x)) for x ∈ E. Suppose E is compact.
Prove that f is continuous on E iff its graph is compact.
Proof. First, suppose f is continuous. Let {G α } be an open cover of (E, f (E)). Then since f
is continuous, E and f (E) are both compact, so we can choose elements {G n } such that the
restrictions of the G n to X cover E, and similarly Gm to cover f (E). These two sets combined give
a finite cover for the graph of f .
Now suppose the graph of f is compact. Then [ TODO ]
Proof. Suppose not, and let x n be a sequence in E such that f (x n ) ≥ n for all n ∈ N. Then, (x n )
admits a convergent subsequence (x nk ), which converges to x 0 , which is not necessarily in E.
Then any sufficiently small neighbourhood of x 0 contains elements f (x nk ) which are arbitrarily
far apart, contradicting the fact that f is uniformly continuous, proving the result.
68
4 Continuity
9. Show that the requirement in the definition of uniform continuity can be rephrased as follows,
in terms of diameters of sets: To every ϵ > 0, there exists δ > 0 such that diam f (E) < ϵ for all
E ⊆ X with diam E < δ .
Proof. Suppose the new definition holds. Then, let ϵ > 0 be arbitrary and δ as in the definition.
Then, for any two x, y ∈ E such that d X (x, y) < δ , the set E = {x, y} has diameter strictly less
than δ , and thus diam f ({x, y}) < ϵ, corresponding to the fact that dY (f (x), f (y)) < ϵ.
Now, suppose the old definition holds, and ϵ > 0 be arbitrary. Let δ > 0 as in the old definition
with ϵ/2 > 0. Furthermore, let E be a set such that diam E < δ . Now, for any pair x, y ∈ E,
we have d X (x, y) ≤ diam E < δ , so dY (f (x), f (y)) < ϵ. Thus, diam f (E) = sup{dY (f (x), f (y)) :
x, y ∈ E} ≤ ϵ/2 < ϵ, as required.
10. Complete the details of the following alternative proof of Theorem 4.19.
Proof. Plot twist: I accidentally wrote this proof as I went along proving while copying over the
notes.
Proof. Let En = {x n , x n+1, . . . }. Let ϵ > 0. We want to show that diam f (En ) < ϵ for sufficiently
large N . Choose δ > 0 such that the definition from Exercise 4.9 holds. Then choose N ∈ N such
that diam En < δ for all n ≥ N , which exists some {x n } is Cauchy. Then, for any n ≥ N , we have
diam En < δ and thus diam f (En ) < ϵ as required.
13. Let E be a dense subset of a metric space X , and let f : E → R be uniformly continuous. Prove
that f has a continuous extension from E to X . Could the range space R be replaced by Rk ? By
any compact metric space? By any complete metric space? By any metric space?
Proof. We define д : X → Y such that д = f on E, and д(x) = limn→∞ f (x n ) for any sequence
x n → x. We must show that this definition is well defined: specifically that the right hand side
does not depend on our choice of x n .
Let {x n } and {yn } be two sequences which converge to x ∈ X , and let ϵ > 0 be arbitrary. We
wish to show that limn→∞ dY (f (x n ), f (yn )) < ϵ. Now since f is uniformly continuous, there
exists some δ > 0 such that if d X (x n , yn ) < δ , then dY (f (x n ), f (yn )) < ϵ. Since x n , yn → x, we
∆
can choose N ∈ N sufficiently large so that d X (x n , x), d X (yn , x) < δ /2, such that d X (x n , yn ) < δ ,
and thus dY (f (x n ), f (yn )) < ϵ. The result follows.
14. Let I be the closed unit interval, and f : I → I be continuous. Prove that f (x) = x for at least one
x ∈ I.
69
4 Continuity
Proof. If f (0) = 0 or f (1) = 1, we are done. Otherwise, f (0) > 0 and f (1) < 1, so the function
д(x) = f (x) − x is positive at 0 and negative at 1. By the Intermediate Value Theorem, д(x) =
f (x) − x = 0 so f (x) = x for some x ∈ (0, 1) ⊆ I , as required.
16. Let [x] denote the floor of x, and (x) = x − [x]. What discontinuities do [x] and (x) have?
17. Let f : (a, b) → R. Prove that the set of points at which f has a simple discontinuity is at most
countable.
Proof. Following the hint, let E < be the set of all simple discontinuities x of f such that f (x−) <
f (x+). Then, associate to each x ∈ E < a triple of rational numbers (p, q, r ) such that f (x−) < p <
f (x+), a < q < t < x implies f (t) < p, and x < t < r < b implies f (t) > p. We claim that every
triple represents a unique simple discontinuity of f :
Suppose that x and x 0 are simple discontinuities and x < x 0 WLOG, both represented by the
triples (p, q, r ). Then in the interval (x, x 0) lie in both the intervals (x, r ) and (q, x 0), and thus f (t)
must be both less than, and greater than, p at the same time, a contradiction. Thus, we must have
x = x 0, as required.
We can define E > and associate triples a similar way. For E = , we associate to each discontinuity
(p, q, r ) where p is between f (x) and f (x−) = f (x+), q such that f (t) < p in (q, x) and (x, r ).
Again, these triples map uniquely from the discontinuities in E = . Since each of these sets must be
at most countable, their union is also at most countable.
18. Every rational x can be written as x = m/n with n > 0 and (m, n) = 1. When x = 0, take n = 1.
Then, consider f (x) = 1/n if x is rational, and 0 otherwise. Prove that f is continuous at every
irrational point, and that f has a simple discontinuity at every rational point.
19. Suppose f : R → R has the intermediate value property, and that f −1 ({r }) is closed for all r ∈ Q.
Prove that f is continuous.
Proof. First, we show that there cannot be any sequence {x n } converging to any x such that
f (x n ) > r > f (x) for some r ∈ Q and all n ∈ N, otherwise there exist tn between x n and x for
each n such that f (tn ) = r , and thus r > f (x) = r since f −1 ({r }) is closed, a contradiction. Thus,
if r < f (x) < s for r, s ∈ Q and {x n } is a sequence converging to x, there must only be finitely
many terms x nk not in (r, s), and thus f (x n ) → f (x), so f is continuous.
70
4 Continuity
20. If E is a non-empty subset of a metric space X , define the distance from x ∈ X to E by ρ E (x) =
inf z ∈E d(x, z).
a) Prove that ρ E (x) = 0 iff x ∈ E.
Proof. If x ∈ E, then either x ∈ E and the result is trivial, or x is a limit point of E and thus
there is a sequence {x n } which gets arbitrarily close to x. Conversely, if ρ E (x) = 0, then
there exists some sequence {x n } in E such that d(x, x n ) < 1/n for every n, so x n → x and
thus x ∈ E.
Proof. Suppose without loss of generality that ρ E (x) ≥ ρ E (y). Then, let {zn } ∈ E N such that
d(y, zn ) − ρ E (y) < 1/n for all n. Then,
∆
ρ E (x) ≤ d(x, zn ) ≤ d(x, y) + d(y, zn ) < d(x, y) + ρ E (y) + 1/n
for all n. Then, taking the limit as n → ∞, we have |ρ E (x) − ρ E (y)| = ρ E (x) − ρ E (y) ≤ d(x, y)
as required. Then ρ E is 1-Lipschitz and thus uniformly continuous.
21. Suppose K, F are disjoint sets in X , where K is compact and F is closed. Prove that there exists
δ > 0 such that d(p, q) > δ if p ∈ K and q ∈ F . Show that the conclusion may fail for two disjoint
closed sets if neither is compact.
Proof. For every x ∈ K, let G x be the open ball around x with radius ρ F (x)/2, so that each G x is
disjoint from F . Then {G x : x ∈ K } forms an open cover of K, so since K is compact, there exists a
finite set x 1, . . . , x n such that G x 1 ∪· · ·∪G x n ⊇ K. Then taking δ = min(ρ F (x 1 ), . . . , ρ F (x n ))/2 > 0,
the result follows.
To show that compactness is required for at least one of the sets, let X = {(x, y) : y ≤ −1/x, x ≥ 1}
and Y = {(x, y) : y ≥ 1/x, x ≥ 1} in R2 . Clearly both sets are closed and disjoint, but the points
(n, −1/n) ∈ X and (n, 1/n) ∈ Y get arbitrarily close as n → ∞.
22. Let A, B be disjoint non-empty closed sets in a metric space X , and define
ρ A (p)
f (p) = .
ρ A (p) + ρ B (p)
Show that f is a continuous function on X whose range lies in [0, 1], and that f (p) = 0 precisely
on A and f (p) = 1 precisely on B. This establishes a converse of Exercise 3: Every closed set
A ⊆ X is Z (f ) for some continuous real f on X . Setting
show that V ,W are open and disjoint, and that A ⊆ V , B ⊆ W , and thus that pairs of disjoint
closed sets in a metric space can be covered by pairs of disjoint open sets. This property of metric
spaces is called normality.
Proof. The fact that f is continuous stems from the fact that ρ E is uniformly continuous and thus
continuous, for any E, and the rules about combining continuous functions. It is also clear that
the range of f lies in [0, 1], and that f (p) = 0 precisely on A and f (p) = 1 precisely on B.
71
4 Continuity
Now define V and W as in the statement. Clearly V and W are disjoint, A ⊆ V and B ⊆ W . The
fact that V and W are open comes from the fact that f is continuous, and that the definitions can
be rewritten V = f −1 ((−1/2, 1/2)) and W = f −1 ((1/2, 3/2)) respectively as the inverse image of
an open set. Thus, metric spaces are normal.
for x, y ∈ (a, b) and λ ∈ (0, 1). Prove that every convex function is continuous. Prove that every
increasing convex function of a convex function is convex.
If f is convex in (a, b) and a < s < t < u < b, show that
f (t) − f (s) f (u) − f (s) f (u) − f (t)
≤ ≤ .
t −s u −s u −t
Proof. We begin by proving the inequality. Notice that we can write t = λs + (1 − λ)u for some
λ ∈ (0, 1). Then,
f (t) − f (s) f (λs + (1 − λ)s) − f (s) conv. λ f (s) + (1 − λ)f (u) − f (s) f (u) − f (s)
= ≤ = .
t −s λs + (1 − λ)u − s (1 − λ)(u − s) u −s
A similar argument can be used to show the second inequality. [ TODO ]
zn−1 + bn−1
(an , zn , bn ) = zn−1, , bn−1 ,
2
otherwise, take an−1 + zn−1
(an , zn , bn ) = an−1, , zn−1 .
2
Then, zn → λx + (1 − λ)y. Also, it can be shown inductively that if zn = λn x + (1 − λn )y, then
f (zn ) ≤ λn f (x) + (1 − λn )f (y), so the result follows since f is continuous.
b) Let α be irrational. Let C 1 = Z and C 2 = αZ. Show that C 1, C 2 are closed subsets of R whose
sum C 1 + C 2 is not closed, by showing that C 1 + C 2 is a countable dense subset of R.
72
4 Continuity
Proof. C 1 and C 2 are countable sets of isolated points and are thus closed sets in R. Also,
C 1 +C 2 is countable, as every element can be associated to a pair of integers (x, y) 7→ x + αy.
Notice that S = {x n := nα − bnαc : n ∈ Z} is an infinite subset of the compact set [0, 1].
Thus, Bolzano-Weierstrass shows the existence of a convergent sequence in S, and thus we
can find pairs (x i , x j ) arbitrarily close together, and thus |x i − x j | = |x i−j | < 1/n for any n.
Then, given any segment (a, b) with length at least 1/n, at least one multiple of x i−j must
exist inside the segment, so S = (C 1 + C 2 )/Z is dense in (0, 1) and thus the result.
26. Suppose X, Y , Z are metric spaces such that Y is compact. Let f : X → Y , and д : Y → Z be a
continuous bijection, and let h = д ◦ f .
Prove that f is uniformly continuous if h is uniformly continuous. Prove also that f is continuous
if h is continuous. Show, that the compactness of Y cannot be omitted from the hypotheses, even
when both X and Z are compact.
Proof. From Theorem 4.17, д−1 is a continuous function, and thus д(Y ) is compact. Then, д−1 is
continuous on a compact domain, and thus uniformly continuous. Then, f = д−1 ◦ h is uniformly
continuous by Exercise 4.12. Similarly, д−1 is continuous so f = д−1 ◦ h is continuous by Theorem
4.7.
Let X = [0, 2π ], Y = [0, 2π ), and Z = {x ∈ R2 : |x | = 1}, and notice that X and Z are compact.
Also, let f : X → Y with f (x) = π − x for x ∈ [0, π ] and f (x) = 3π − x on (π, 2π ). Clearly f is
discontinuous at π . Also, let д(t) = {cos t, sin t } be a continuous bijection from Y to Z . Then,
h = д ◦ f is a clockwise path from (0, 1) to (0, −1) along the unit circle, which is uniformly
continuous. However, f is discontinuous, as required.
73
5 Differentiation
f (t) − f (x)
ϕ(t) =
t −x
and
f (t) − f (x)
f 0(x) = lim ϕ(t) = lim.
t →x t →x t −x
given the limit exists. This f 0, whose domain is exactly the set of points x where the above limit exists, is
called the derivative of f .
If f 0 is defined at x, we say f is differentiable at x. If f is differentiable at every point of a set E ⊆ [a, b],
we say f is differentiable on E. Extending the above limit definition with left-hand and right-hand limits,
we get the left-hand and right-hand derivatives, respectively. Notice thus that f 0 is not defined on the
endpoints a nor b.
Theorem 86. Let f : [a, b] → R be differentiable at x ∈ (a, b). Then f is continuous at x.
Proof. As t → x, we have
f (t) − f (x)
f (t) − f (x) = · (t − x) → f 0(x) · 0 = 0.
t −x
Notice that the converse is not true: consider f (x) = |x | on [−1, 1].
Theorem 87. Suppose f , д are defined on [a, b] and differentiable at x ∈ [a, b]. Then f + д, f д, f /д are
differentiable at x, and
(a) (f + д)0(x) = f 0(x) + д 0(x);
(b) (f д)0(x) = f 0(x)д(x) + f (x)д 0(x);
f 0 (x )д(x )−f (x )д 0 (x )
(c) (f /д)0(x) = д 2 (x )
.
Proof. (a) follows from properties of limits. (b) follows from writing
74
5 Differentiation
It can be shown inductively that polynomials are differentiable, and that (x n )0 = nx n−1 . Also, 5.3(c)
shows that rational functions are differentiable, except at points where the denominator is zero.
Theorem 88. Suppose f : [a, b] → R is continuous on [a, b] and differentiable at x ∈ [a, b]. Also, suppose
д is defined on an interval I containing the range of f and differentiable at f (x). Then, (д ◦ f )0(x) =
д 0(f (x))f 0(x).
Proof. We have
h(t) − h(x) д(f (t)) − д(f (x)) д(f (t)) − д(f (x)) f (t) − f (x)
= = · → д 0(f (x))f 0(x)
t −x t −x f (t) − f (x) t −x
Proof. Define f and x as above, and suppose f 0(x) exists. If f 0(x) = c > 0, then there must exist some
y > x sufficiently close (say, at most δ away) from x such that ϕ(y) > 0 but this implies f (y) > f (x), a
contradiction. The argument against f 0(x) < 0 is symmetric.
Theorem 90 (Cauchy’s MVT). If f , д are continuous real functions on [a, b] which are differentiable on
(a, b), then there is a point x ∈ (a, b) at which
Proof. Let h(x) = [f (b) − f (a)]д(x) − [д(b) − д(a)]f (x). It suffices to show that h 0(x) = 0 for some
x ∈ (a, b). Notice that h(a) = h(b). If h is constant on [a, b], then h 0(x) = 0 everywhere. Otherwise, h
admits a local extremum, and thus has derivative 0 at that extremum, completing the proof.
Proof. For any pair a ≤ x < y ≤ b, there exists some z ∈ (x, y) such that f 0(z)(y − x) = f (y) − f (x), so
each result follows trivially.
75
5 Differentiation
Proof. Let h(t) = f (t) − λt, and notice that h 0(a) = f 0(a) − λ < 0 and similarly h 0(b) > 0. Then there
exist a < s < t < b such that h(s) < h(a) and h(t) < h(b). Then h is continuous on the compact interval
[s, t] and thus attains a minimum at u ∈ (s, t). Then h 0(u) = 0 so f 0(u) = λ as required.
Corollary 12. If f is differentiable on [a, b], then f 0 has no simple discontinuities on [a, b]. (Discontinuities
of the second kind as still possible)
Proof. If A < r < q < +∞, then we find c 2 such that f (x)/д(x) < q for all x ∈ (a, c 2 ). Repeating the
argument to find c 3 such that p < f (x)/д(x) for x ∈ (a, c 3 ) completes the argument.
Notice that since f 0(x)/д 0(x) → A, we can find c sufficiently close to a such that f 0(x)/д 0(x) < r for
x ∈ (a, c). Furthermore,
f (x) − f (y) f 0(t)
= 0 <r
д(x) − д(y) д (t)
for some t ∈ (x, y) ⊆ (a, c) if a < x < y < c. If f (x), д(x) → 0, then the previous expression becomes
f 0(t)/д 0(t) < r for all t ∈ (a, c). Otherwise, if д(x) → +∞, then we can find a < c 1 < y such that
д(x) > д(y) and д(x) > 0 whenever x ∈ (a, c 1 ). Multiplying the previous expression by [д(x)−д(y)]/д(x),
we obtain
f (x) д(y) f (y)
< r −r + , x ∈ (a, c 1 ).
д(x) д(x) д(x)
If we let x → a, then we can find c 2 such that f (x)/д(x) ≤ r < q whenever x ∈ (a, c 2 ), as required. This
completes the proof.
each of which is the derivative of the preceding one. f (n) is called the n-th derivative, or the derivative
of order n, of f .
76
5 Differentiation
f (n) (x)
f (β) = P(β) + (β − α)n .
n!
Notice that n = 1 is the Mean Value Theorem, and in general that f can be approximated by a polynomial
of degree n − 1, and that we can estimate the error given we can bound | f (n) (x)|.
Proof. Let M ∈ R such that f (β) = P(β) + M(β − α)n , and take д(t) = f (t) − P(t) − M(t − α)n . Then,
since P(t) is a polynomial of degree n − 1 < n,
But now д(α) = д 0(α) = · · · = д(n−1) (α) from the definition of д and P, and also д(β) = 0 by our definition
of M. Thus, by the Mean Value Theorem, д 0(x 1 ) = 0 for some x 1 ∈ (α, β). Similarly, д(i) (x i ) = 0 for some
x i ∈ (α, x i−1 ). Thus, д(n) (x n ) = 0 for some x n ∈ (α, x n−1 ) ⊆ (α, β), so in particular,
f (n) (x n )
M= ,
n!
as required.
77
5 Differentiation
Proof. Let z = f(b) − f(a) and ϕ(t) = z · f(t). Then ϕ is continuous on [a, b] and differentiable on (a, b).
Then, the Mean Value Theorem tells us that
5.8 Exercises
1. Let f : R → R and | f (x) − f (y)| ≤ (x − y)2 for all x, y ∈ R. Prove that f is constant.
Proof. Let x ∈ R be arbitrary. Then, we can find a sequence of rational numbers pn /qn converging
to x such that (pn − 1)/qn ≤ x < pn /qn for each n. Without loss of generality (multiplying pn and qn
by some positive integer otherwise), suppose that qn is a strictly increasing sequence. However,
pn pn pn − 1 1
∆
| f (x) − f (0)| ≤ f (x) − f + f −f +···+ f − f (0)
qn qn qn qn
≤ 1/qn2 + 1/qn2 + . . . + 1/qn2
| {z }
pn times
1 pn 1 1 2 1 2
= + ≤ x+ ≤ x+ → 0.
qn qn qn qn qn n n
2. Suppose f 0(x) > 0 in (a, b). Prove that f is strictly increasing in (a, b), and let д be its inverse
function. Prove that д is differentiable, and that
1
д 0(f (x)) = , x ∈ (a, b).
f 0 (x)
Proof. Take x < y in (a, b). Then, by the mean value theorem, there exists some z ∈ (x, y) such that
f (y) − f (x)
= f 0(z) > 0,
y −x
so f (y) > f (x), so f is strictly increasing. Thus, f is continuous and strictly increasing, so it has an
inverse д. Then д(f (x)) = x so applying the chain rule yields
3. Suppose д : R → R has bounded derivative, say |д 0 | ≤ M. Fix ϵ > 0 and define f (x) = x + ϵд(x).
Prove that f is one-to-one for sufficiently small ϵ.
Proof. If M = 0, this is trivial. Otherwise, choose ϵ < 1/M so that f 0(x) = 1 +ϵд 0(x) ≥ 1 +ϵ(−M) > 0,
and thus f is strictly increasing and thus one-to-one.
78
5 Differentiation
4. If
C1 Cn−1 Cn
C0 + +...+ + = 0,
2 n n+1
prove that the polynomial f (x) = ni=0 Ci x i has at least one real root between 0 and 1.
Í
Proof. Let
n
Õ Ci i+1
д(x) = x
i=0
i +1
and notice that д 0(x) = f (x). Then, the identity gives д(0) = д(1) = 0, so the result follows by the
mean value theorem.
5. Suppose f is defined and differentiable for every x > 0, and f 0(x) → 0 as x → +∞. Define
д(x) = f (x + 1) − f (x). Prove that д(x) → 0 as x → +∞.
Proof. Let x n such that | f 0(x)| ≤ 1/n for all x ≥ x n , since f 0(x) → 0. Then, by the mean value
theorem, we can find z ∈ (y, y + 1) ⊆ (x n , +∞) such that
f (y + 1) − f (y)
1/n ≥ | f 0(z)| = = |д(y)|
y + 1 −y
6. Suppose f is continuous for x ≥ 0 and differentiable for x > 0, f (0) = 0 and f 0 is monotonically
increasing. Put д(x) = f (x)/x for x > 0 and prove that д is monotonically increasing.
[ TODO ]
7. Suppose f 0(x), д 0(x) exist, д 0(x) , 0, and f (x) = д(x) = 0. Prove that
f (t) f 0(x)
lim = 0 .
t →x д(t) д (x)
[ TODO ]
8. Suppose f 0 is continuous on [a, b] and ϵ > 0. Prove that there exists δ > 0 such that
f (t) − f (x)
− f 0(x) < ϵ
t −x
whenever 0 < |t − x | < ϵ and x, t ∈ [a, b]. Does this hold for vector-valued functions too?
Proof. Notice that f 0 is continuous on a compact domain and thus uniformly continous. Then, we
can select δ > 0 such that | f 0(y) − f 0(x)| < ϵ whenever |x − y| < δ . Let x, t ∈ [a, b] such that
|t − x | < δ . Then, for some z between x and t, we have
f (t) − f (x)
− f 0(x) = | f 0(z) − f 0(z)| < ϵ
t −x
since z is between t and x and thus at most as close to x as t is, ie. δ away. This proves the result. If
f is vector-valued, applying the above result to each component and recombining iyields the result
for vector-valued functions.
79
5 Differentiation
10. Suppose f , д are complex differentiable functions on (0, 1), and that f (x) → 0, д(x) → 0 f 0(x) → A
and д 0(x) → B as x → 0, where A, B ∈ C, and B , 0. Prove that
f (x) A
lim = .
x →0 д(x) B
f (x) f (x) x x 1 1
= −A · +A· → 0· +A·
д(x) x д(x) д(x) B B
and thus we are done.
Proof. f is (twice) differentiable at x and thus continuous. Thus the numerator tends to 0 as h → 0.
Applying L’Hopital’s yields
f (x + h) + f (x − h) − 2f (x) f 0(x + h) + f 0(x − h) − 2f 0(x)
lim = lim
h→0 h2 h→0 2h
f 0(x + h) − f 0(x) f 0(x) − f 0(x − h)
= lim −
h→0 2h 2h
= f (x)
00
so we are done.
12. If f (x) = |x | 3 , compute f 0(x), f 00(x) for all real x and show that f (3) (0) does not exist.
Proof. If x > 0, then f (x) = x 3 in any sufficiently small neighbourhood of x, so f 0(x) = 3x 2, f 00(x) =
6x. If x < 0, then f (x) = −x 3 in any sufficiently small neighbourhood of x, so f 0(x) = −3x 2, f 00(x) =
−6x. At x = 0, we have
|h| 3 − 0
f 0(0) = lim = 0,
h→0 h
and
f 0(h) − f 0(0) ±3h 2
f 00(0) = lim ∼ lim = 0.
h→0 h h→0 h
Now, f (3) (0) does not exist, since the left-hand derivative of f 00 at 0 is −6 while the right-hand
derivative is 6.
13. Suppose a, c ∈ R c > 0 and f is defined on [−1, 1] such that f (x) = x a sin(|x | −c ) whenever x , 0 and
0 otherwise. Prove that:
80
5 Differentiation
Proof. This follows immediately from the fact that x a is continuous (specifically at x = 0) iff
a > 0.
Proof. We have
ha sin(|h| −c )
f 0(0) = lim
h→0 h
which is 0 by the Squeeze theorem if a > 1. Otherwise, the limit does not exist.
| f 0(x)| = |ax a−1 sin(x −c ) + x a cos(x −c ) · −cx −c−1 | ≤ a|x | a−1 + c |x | a−c−1
Proof. From the above formulation of f 0, the derivative is continuous at 0 iff a − c − 1 > 0, so
a > 1 + c as required.
Proof. We have
f 00(0) = lim [aha−2 sin(h −c ) − cha−c−2 cos(h −c )]
h→0
which exists iff a − c − 2 > 0, so a > 2 + c.
Proof. Bash the product rule and find the largest term.
14. Let f be differentiable on (a, b). Prove that f is convex iff f 0 is monotonically increasing. Assume
next that f 00(x) exists for every x ∈ (a, b), and prove that f is convex iff f 00(x) ≥ 0 for all x ∈ (a, b).
Proof. If f 0 is not monotonically increasing, there exist points x, y ∈ (a, b) such that x < y and
f 0(x) > f 0(y). Then, for some δ > 0 sufficiently small, f (x + δ ) is above the secant formed between
(x, f (x)) and (y, f (y)). Conversely, if f 0 is monotonically increasing, but not convex, then for some
x, y ∈ (a, b), there exists z = λx + (1 − λ)y such that f (z) > λ f (x) + (1 − λ)f (y). Then applying the
mean value theorem to the intervals (x, z) and (z, y) yields w 1 < w 2 such that f 0(w 1 ) > f 0(w 2 ), a
contradiction, proving the first result.
We know that if f 00(x) ≥ 0, then f 0 is strictly increasing and thus f is convex. Conversely, if
f 00(x) < 0, then for δ > 0 sufficiently small, we have
1
f (x) ≥ (f (x − δ ) + f (x + δ ))
2
81
5 Differentiation
15. Suppose a ∈ R and f : (a, +∞) is twice-differentiable, and M 0, M 1, M 2 are the least upper bounds of
| f (x)|, | f 0(x)|, | f 00(x)| respectively on (a, +∞). Prove that M 12 ≤ 4M 0 M 2 .
Proof. If h > 0, setting (α, β, n) = (x, x +2h, 2) in Taylor’s theorem shows that for some ξ ∈ (x, x +2h),
(2h)2 00
f (x + 2h) = f (x) + 2h f 0(x) + f (ξ )
2!
and thus
1
f 0(x) = [f (x + 2h) − f (x)] − h f 00(ξ )
2h
so
1
| f 0(x)| ≤ M 1 ≤M 0 + hM 2 .
h
Now the right side attains its minimum by AM-GM at
1 1
r
p
M 0 + hM 2 ≥ 2 M 0 · hM 2 = 2 M 0 M 2,
h h
√
so M 1 ≤ 2 M 0 M 2 and the result follows.
16. Suppose f is twice-differentiable on (0, ∞), f 00 is bounded on (0, ∞) and f (x) → 0 as x → ∞. Prove
that f 0(x) → 0 as x → ∞.
Proof. Let M 2 be the least upper bound of f 00 and ϵ > 0 be arbitrary. Then, since f (x) → 0 as x → ∞,
we can choose x 0 such that | f (x)| ≤ ϵ 2 /(4M 2 ) for all x ≥ x 0 . Then taking a = x 0 in Exercise 15, we
have p
f 0(x) ≤ sup | f 0(x)| ≤ 2 M 0 M 2 < 2 ϵ 2 /4 = ϵ,
p
x ≥x 0
for all x ≥ x 0 as required.
17. Suppose f is real and thrice-differentiable on [−1, 1], such that f (−1) = 0, f (0) = 0, f (1) = 1 and
f 0(0) = 0. Prove that f (3) (x) ≥ 3 for some x ∈ (−1, 1).
18. Suppose f is a real function on [a, b], n is a positive integer, and f (n−1) exists for every t ∈ [a, b]. Let
α, β, P be as in Taylor’s theorem. Define
f (t) − f (β)
Q(t) =
t −β
for t ∈ [a, b] and t , β, differentiate n − 1 times at t = α and derive the following version of Taylor’s
theorem:
Q (n−1) (α)
f (β) = P(β) + (β − α)n .
(n − 1)!
82
5 Differentiation
so
f (k) (α) Q (k−1) (α) Q (k) (α)
(β − α)k = (β − α)k − (β − α)k+1 .
k! (k − 1)! k!
Then,
n−1 (k)
f (α)
(β − α)k
Õ
P(α) = f (α) +
k!
k=1
n−1 (k −1)
Q Q (k ) (α)
Õ (α) k k+1
= f (α) + (β − α) − (β − α)
(k − 1)! k!
k=1
Q (n−1) (α)
= f (α) + Q(α)(β − α) − (β − α)n
(n − 1)!
Q (n−1) (α)
= f (β) − (β − α)n ,
(n − 1)!
and thus the result.
19. Suppose f is defined in (−1, 1) and f 0(0) exists. Suppose −1 < α n < βn < 1, with α n → 0, βn → 0 as
n → ∞. Define the difference quotients
f (βn ) − f (α n )
Dn = .
βn − α n
Proof. Suppose without loss of generality that f (0) = 0, by subtracting any necessary constants.
f (β )−f (α )
Then, let γn ∈ (α n , βn ) by the mean value theorem such that f 0(γn ) = βnn −α n n . Then, γn → 0,
so applying L’Hopital’s rule,
f (γn ) − f (0)
lim D n = lim f 0(γn ) = lim = f 0(0),
n→∞ n→∞ n→∞ γn − 0
as required.
Proof. Let ϵ > 0 be arbitrary and λn = βn /(βn − α n ). Also, let M such that |λn | ≤ M for all n,
so that
f (βn ) − f (α n )
Dn =
βn − α n
f (βn ) − f (0) + f (0) − f (α n )
=
βn − α n
βn f (βn ) − f (0) αn f (α n ) − f (0)
= · − ·
βn − α n βn − 0 βn − α n αn − 0
f (βn ) − f (0) f (α n ) − f (0)
= λn + (1 − λn )
βn − 0 αn − 0
83
5 Differentiation
Give an example in which f is differentiable in (−1, 1) (but f 0 is not continous at 0), and in which
α n , βn tend to 0 in a way that lim D n exists but is different from f 0(0).
[ TODO ]
20. Formulate and prove an inequality which follows from Taylor’s theorem and which remains valid
for vector-valued functions.
Proof. If f : [a, b] → Rm is n-times differentiable and f(n−1) is continuous on [a, b], and α < a < b < β
and
n−1 (k)
f (α)
(t − α)k ,
Õ
P(t) =
k!
k=0
(β − α)n (n)
|f(β) − P(β)| ≤ |f (x)|
n!
and notice that if n = 1, this is the vector-valued version of Theorem 5.19.
[ TODO ]
21. Let E be a closed subset of R. We saw in Exercise 4.22 that there is a real continuous function f on R
whose zero set is E. Is it possible, for each closed set E to find such an f which is differentiable on R,
or one which is n times differentiable, or even one which has derivatives of all orders on R?
tn
fn (t) = , д(x) = f (ρ E (x)),
t n + (1 − t)n
for x ∈ R. Then, д(x) = 0 iff x ∈ E, and since fn has the property that its first n − 1 derivatives are 0
at t = 0 and 1, д is n-times differentiable on E and thus on R.
For the infiitely differentiable case, it is possible to define a similar function. This depends largely on
the fact that R is separable.
84
5 Differentiation
(a) If f is differentiable and f 0(t) , 1 for every real t, prove that f has at most one fixed point.
Proof. If there were more than one, then applying mean value theorem with the fixed points
as endpoints would give you some point between them with derivative 1, contrary to the
assumption.
f (t) = t + (1 + e t )−1
(c) However, if there is a constant A < 1 such that | f 0(t)| ≤ A for all t ∈ R, then prove that a fixed
point x of f exists, and that x = lim x n where x 1 is an arbitrary real number, and x n+1 = f (x n ).
Proof. Yes, let C = | f (x 1 ) − x 1 |, so that for arbitrary n, we have some tn between x n and x n+1 by
the mean value theorem such that
| f (x n+1 − f (x n )|
= f 0(tn ) ≤ A,
|x n+1 − x n |
so inductively,
|x n+1 − x n | ≤ An−1C,
so for m and n sufficiently large
m−1 ∞
∆
Ak−1 = CAk−1 /(1 − A),
Õ Õ
|xm − x n | ≤ |x k +1 − x k | ≤ C
k=n k =n
which can be made arbitrarily small. Thus, {x n } is Cauchy in R and thus convergent since R is
complete.
Then, we have
x := lim x n = lim f (x n−1 ) = f (x)
n→∞ n→∞
(d) Show that the process described in (c) can be visualized by the zig-zag path
(x 1, x 2 ) → (x 2, x 2 ) → (x 2, x 3 ) → (x 3, x 3 ) → · · · .
85
5 Differentiation
Proof. We can find inductively that x n < x n−1 < . . . < x 1 < α. It cannot converge, as then the
limit would tend to a fixed point of f , but it is strictly smaller than any of the three. Thus it must
diverge to −∞.
Proof. We can find that x n approaches β monotonically, but is bounded by β, and thus converges.
The limit point must be a fixed point of f , and only β fits, completing the proof.
25. Suppose f is twice-differentiable on [a, b], f (a) < 0, f (b) > 0, f 0(x) ≥ δ > 0, and 0 ≤ f 00(x) ≤ M
for all x ∈ [a, b]. Let ξ be the unique point in (a, b) at which f (ξ ) = 0.
Complete the details in the following outline of Newton’s method for computing ξ .
(a) Choose x 1 ∈ (ξ , b) and define {x n } by x n+1 = x n − f (x n )/f 0(x n ). Interpret this geometrically, in
terms of a tangent to the graph of f .
Proof. x n+1 is the x-coordinate of the point at which the tangent of f at (x n , f (x n )) intersects
the x-axis.
Proof. We claim inductively that ξ ≤ x n+1 < x n . If x n+1 < ξ , then choosing the point t ∈ (ξ , x n )
such that
f (x n ) − f (ξ )
f 0(t) = > f 0(x n ),
xn − ξ
contradicting the fact that f 0 is increasing. Also, since ξ ≤ x n inductively, f (x n ) > 0 so
f (x n )/f 0(x n ) > 0, proving the first result.
86
5 Differentiation
Then, the sequence converges by the monotone convergence theorem. If the sequence converged
to y > ξ , then f 0(y) > 0 would imply a contradiction. Thus, we must have x n → ξ .
f 00(tn )
x n+1 − ξ = (x n − ξ )2,
2f 0(x n )
for some t ∈ (ξ , x n ).
(d) If A = M/2δ , deduce that
n
0 ≤ x n+1 − ξ ≤ 1/A[A(x 1 − ξ )]2 .
(e) Show that Newton’s method amounts to finding a fixed point of the function д(x) = x −f (x)/f 0(x).
How does д 0(x) behave for x near ξ ?
(f) Put f (x) = x 1/3 on R and try Newton’s method. What happens?
26. Suppose f is differentiable on [a, b], f (a) = 0, and there is a real number A such that | f 0(x)| ≤ A| f (x)|
on [a, b]. Prove that f (x) = 0 for all x ∈ [a, b].
27. Let ϕ be a real function defined on a rectangle R on the plane, given by a ≤ x ≤ b, α ≤ y ≤ β. A
solution of the initial-value problem
is, by definition, a differentiable function f on [a, b] such that f (a) = c, for c ∈ [α, β], and f 0(x) =
ϕ(x, f (x)) for x ∈ [a, b].
Prove that such a problem has at most one solution if there is a constant A such that
where f , д1, . . . , дk are continuous real functions on [a, b] and derive a uniqueness theorem for
solutions of the equation
87
6 TODOs
88